Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 16 days ago • 78
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 46
🐶 IDEFICS 🐶 Collection • Collection assembling all the models and spaces related to IDEFICS • 6 items • Updated Apr 15, 2024 • 9
Nomic Embed: Training a Reproducible Long Context Text Embedder Paper • 2402.01613 • Published Feb 2, 2024 • 18
RoFormer: Enhanced Transformer with Rotary Position Embedding Paper • 2104.09864 • Published Apr 20, 2021 • 18
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation Paper • 2605.27366 • Published 8 days ago • 25
Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini Paper • 2605.27295 • Published 8 days ago • 22
Position: AI Security Policy Should Target Systems, Not Models Paper • 2605.09504 • Published 24 days ago • 1
ActiveLab: Active Learning with Re-Labeling by Multiple Annotators Paper • 2301.11856 • Published Jan 27, 2023 • 1
Who judges the judges? Governance from metrics: a runtime framework for continuous LLM compliance monitoring Paper • 2605.24737 • Published 11 days ago • 1
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 104
distil-large-v3.5 Collection • This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. • 5 items • Updated Mar 25, 2025 • 10
Modeling Sparse and Bursty Vulnerability Sightings: Forecasting Under Data Constraints Paper • 2604.16038 • Published Apr 17 • 4
Natural language guidance of high-fidelity text-to-speech with synthetic annotations Paper • 2402.01912 • Published Feb 2, 2024 • 14
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents Paper • 1901.08149 • Published Jan 23, 2019 • 4