pinned
Running on L40S
Agents
589
MinerU Document Extraction Tools
📚
Easy converting PDF and Office docs into Markdown and JSON
OpenDataLab provides high-quality open datasets and tools for large models. China Large model corpus Data Alliance open source data service designated platform
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale
Easy converting PDF and Office docs into Markdown and JSON
demo of MinerU-Diffusion
Convert table images into HTML tags with TRivia-3B
Evaluate formula recognition accuracy
Demo for DocLayout-YOLO
Recognize math equations from images