Shamik
AI & ML interests
CV, NLP, Speech, Python
Text Generation Models
Vision Models
Audio Models
MCP Servers
Benchmark
- 🏆 OCRBenchv2 Leaderboard (Running, Agents, 45 likes): Display OCRBench leaderboard for text recognition models
- 🥇 Vidore Leaderboard (Running, Agents, 208 likes): Browse and compare visual document retrieval model scores
- 🏆 Open ASR Leaderboard (Running on CPU Upgrade, Agents, Featured, 1.39k likes): Explore speech model performance benchmarks
Multi modal Document Parser
Interesting Spaces
- 🔥 Attention Visualization (Running, Featured, 197 likes): Vision Transformer Attention Visualization
- 🎙 Open NotebookLM (Runtime error, Agents, 143 likes): Generate a podcast to discuss the topic of your choice!
- 🍍 Multimodal OCR (Running on Zero, MCP, 415 likes): Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
- 💻 Multimodal OCR2 (Sleeping, MCP, Featured, 143 likes): FireRed / Nanonets / Monkey / Thyme / Typhoon / SmolDocling