Closing the Data Loop: Using OpenDataArena to Engineer Superior Training Datasets Paper • 2601.09733 • Published Dec 30, 2025 • 8
ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch Paper • 2601.13606 • Published 14 days ago • 10
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility Paper • 2601.17027 • Published 17 days ago • 41
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods Paper • 2601.21821 • Published 5 days ago • 54
MMFineReason Collection High-quality STEM reasoning dataset for Multimodal LLM post-training. • 14 items • Updated about 2 hours ago • 19
OpenDataArena/MMFineReason-SFT-123K-Qwen3-VL-235B-Thinking Viewer • Updated about 2 hours ago • 123k • 222 • 46
OpenDataArena/MMFineReason-1.8M-Qwen3-VL-235B-Thinking Viewer • Updated 4 days ago • 1.81M • 1.04k • 82