Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 282
Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family 7 days ago • 64
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs Paper • 2512.14698 • Published Dec 16, 2025 • 21
Perceptual Taxonomy: Evaluating and Guiding Hierarchical Scene Reasoning in Vision-Language Models Paper • 2511.19526 • Published Nov 24, 2025 • 2
Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 Mar 22, 2024 • 126
Article Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads 21 days ago • 6
Article Diversity Vs Density: A data strategy comparison for fine-tuning VLMs 21 days ago • 5
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition Paper • 2509.19768 • Published Sep 24, 2025 • 7
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition Paper • 2512.13884 • Published Dec 15, 2025 • 15
fiNERweb Collection A multilingual dataset for NER covering 91 languages and 25 scripts • 3 items • Updated Dec 16, 2025 • 1