-
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning
Paper • 2604.24300 • Published • 65 -
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis
Paper • 2604.24198 • Published • 21 -
KernelBench-X: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
Paper • 2605.04956 • Published • 5
amjad
heroali
·
AI & ML interests
None yet
Recent Activity
liked a model about 23 hours ago
TencentARC/Track4World liked a model 1 day ago
microsoft/Phi-Ground-Any liked a model 1 day ago
HiDream-ai/HiDream-O1-ImageOrganizations
None yet