InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 213
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing Paper • 2601.21957 • Published 12 days ago • 19
Sleeping 📚 AI Semantic Book Recommender 📖 Find book recommendations based on description, category, and tone
Sleeping 📚 AI Semantic Book Recommender 📖 Find book recommendations based on description, category, and tone
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models Paper • 2109.10282 • Published Sep 21, 2021 • 12