VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding Paper • 2603.22285 • Published about 17 hours ago • 40
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD Paper • 2603.20155 • Published 4 days ago • 7
view article Article **LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric** 7 days ago • 14
WorldAgents: Can Foundation Image Models be Agents for 3D World Models? Paper • 2603.19708 • Published 4 days ago • 10
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text • 27B • Updated 3 days ago • 33.8k • 78
LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation Paper • 2603.20192 • Published 4 days ago • 22
FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow Paper • 2603.19598 • Published 4 days ago • 28
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer Paper • 2603.19227 • Published 5 days ago • 40
LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition Paper • 2603.17965 • Published 6 days ago • 5
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens Paper • 2603.19232 • Published 5 days ago • 31
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing Paper • 2603.17942 • Published 6 days ago • 6
Running Featured 45 Nemotron 3 Nano WebGPU ⚛ 45 A compact reasoning-capable model running in your browser.
BenchPreS: A Benchmark for Context-Aware Personalized Preference Selectivity of Persistent-Memory LLMs Paper • 2603.16557 • Published 7 days ago • 21
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published 6 days ago • 46
AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents Paper • 2603.16496 • Published 7 days ago • 13
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 7 days ago • 241