Open-Models - a th3nolo Collection

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.5M • • 4.99k

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published Dec 17, 2025 • 25

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 51

tencent/HY-Motion-1.0

Text-to-3D • Updated Dec 31, 2025 • 230 • 424

Lightricks/LTX-2

Image-to-Video • 19B • Updated Mar 2 • 464k • • 1.76k

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

Paper • 2601.14251 • Published Jan 20 • 31

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 75

tencent/Youtu-VL-4B-Instruct

Image-Text-to-Text • 5B • Updated Feb 10 • 619 • 158

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Paper • 2601.21406 • Published Jan 29 • 6

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 51

DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published Jan 28 • 73

zai-org/GLM-OCR

Image-Text-to-Text • 1B • Updated May 19 • 3.53M • • 1.93k

unsloth/Qwen3-Coder-Next-FP8-Dynamic

Text Generation • 80B • Updated Feb 3 • 19k • 44

Qwen/Qwen3-Coder-Next

Text Generation • 80B • Updated Feb 3 • 1.08M • • 1.53k

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 62

Lightricks/LTX-2.3

Image-to-Video • Updated 8 days ago • 2.12M • 1.6k

mistralai/Leanstral-2603

Updated 2 days ago • 152 • 168

Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published Mar 11 • 155

Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF

Image-Text-to-Text • 4B • Updated 13 days ago • 21.2k • 142

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 101

YTan2000/Qwen3.5-27B-TQ3_1S

Image-Text-to-Text • 27B • Updated Apr 23 • 169 • 38

bartowski/arcee-ai_Trinity-Large-Thinking-GGUF

Text Generation • 399B • Updated Apr 1 • 630 • 12

zed-industries/zeta-2

Text Generation • 8B • Updated Mar 23 • 403 • 186

mudler/Qwen3.5-35B-A3B-APEX-GGUF

Text Generation • 35B • Updated Apr 27 • 12.1k • 92

Jackrong/Qwopus3.5-27B-v3

Image-Text-to-Text • 27B • Updated Apr 16 • 457 • • 248

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Paper • 2604.00528 • Published Apr 1 • 12

0xSero/Gemma-4-21B

Text Generation • 21B • Updated May 30 • 845 • 101

datalab-to/chandra-ocr-2

Image-Text-to-Text • 5B • Updated 21 days ago • 2.09M • 446

lightonai/LightOnOCR-2-1B

Image-Text-to-Text • 1B • Updated 9 days ago • 148k • 718

selimaktas/MiniMax-M2.75-460B-A20B

Text Generation • 453B • Updated Apr 21 • 7 • 26

Qwen/Qwen3.6-27B

Image-Text-to-Text • 28B • Updated Apr 24 • 5.33M • • 1.98k

XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated 8 days ago • 64.2k • • 744

openai/privacy-filter

Token Classification • 1B • Updated Apr 22 • 418k • • 1.7k

concavity-ai/superlinear-exp-v0.1

Text Generation • 32B • Updated Feb 6 • 21 • 22

openbmb/InfLLM-V2-Long-Sparse-Base

8B • Updated Dec 1, 2025 • 61 • 7

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated Nov 18, 2025 • 284k • • 992

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

Paper • 2603.28458 • Published Mar 30 • 44

oongaboongahacker/Gemini-Nano

Updated Jun 25, 2024 • 43

Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings

Paper • 2605.22391 • Published May 21 • 42

Kaikaku/epicure-cooc

Feature Extraction • Updated May 27 • 441 • 37

unsloth/gemma-4-12b-it-GGUF

Image-Text-to-Text • 12B • Updated 8 days ago • 660k • 736

nex-agi/Nex-N2-Pro

Text Generation • 397B • Updated Jun 11 • 2.4k • 368

MiniMaxAI/MiniMax-M3

Image-Text-to-Text • 427B • Updated 6 days ago • 227k • • 1.33k

DiffusionGemma vs Gemma-4 — Post-OCR Correction

📰

21

Diffusion vs autoregressive LLM on historical OCR cleanup

CohereLabs/North-Mini-Code-1.0

Text Generation • 30B • Updated Jun 15 • 24.8k • 526

SupraLabs/Supra-1.5-50M-Instruct-exp

Text Generation • 51.8M • Updated Jun 12 • 2.64k • 48

zai-org/GLM-5.2

Text Generation • 753B • Updated 15 days ago • 535k • • 4.06k

SupraLabs/Supra-A2A-Nano-Exp

Any-to-Any • 29.7M • Updated 26 days ago • 35

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 14 days ago • 1.99M • 2.02k

docling-project/DocLayNet

Updated Jan 25, 2023 • 755 • 144

Mercity/mamba-790m-resoning

Updated Oct 29, 2025

mistralai/Leanstral-1.5-119B-A6B

Updated 2 days ago • 434 • 204