Wei Liu

lefutonku

AI & ML interests

None yet

Organizations

None yet

upvoted 6 papers 7 days ago

SkVM: Compiling Skills for Efficient Execution Everywhere

Paper • 2604.03088 • Published 30 days ago • 10

GigaWorld-Policy: An Efficient Action-Centered World-Action Model

Paper • 2603.17240 • Published Mar 18 • 26

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 9 days ago • 116

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published 9 days ago • 68

ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Paper • 2604.24300 • Published 9 days ago • 64

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published 12 days ago • 223

liked a Space 8 days ago

FineVision: Open Data is All You Need

A new open-source dataset for training VLMs

liked 3 datasets 10 days ago

leggedrobotics/wildos

Updated Feb 24 • 1.17k • 796 • 2

YiboZhang2001/TexVerse

Updated Sep 3, 2025 • 55.9k • 27

allenai/objaverse

Updated Mar 31, 2023 • 540k • 445

upvoted 5 papers 10 days ago

Vista4D: Video Reshooting with 4D Point Clouds

Paper • 2604.21915 • Published 13 days ago • 12

A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications

Paper • 2503.07137 • Published Mar 10, 2025 • 2

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 167

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 85

EasyVideoR1: Easier RL for Video Understanding

Paper • 2604.16893 • Published 18 days ago • 40

upvoted a paper 11 days ago

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Paper • 2604.19747 • Published 15 days ago • 38

upvoted a paper 12 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 14 days ago • 239

upvoted 3 papers 13 days ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

Paper • 2604.13074 • Published Mar 20 • 45

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 72

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

Paper • 2502.11880 • Published Feb 17, 2025 • 18