Falcon-H1-Tiny Collection A series of extremely small, yet powerful language models redefining capabilities at small scale • 19 items • Updated 21 days ago • 37
Distilled LLms Collection This is a collection of various distilled llms by Cannae ai! • 7 items • Updated about 6 hours ago • 1
Context Cascade Compression: Exploring the Upper Limits of Text Compression Paper • 2511.15244 • Published Nov 19, 2025 • 3
GraphMind Collection More in https://arxiv.org/pdf/2507.17168, Graph Reasoning Model series • 4 items • Updated Aug 22, 2025 • 1
GPT Reddit Comment Detection Collection Collection of datasets and models used for detecting LLM bots on reddit. • 6 items • Updated 19 days ago • 1
Heretic - Abliterated, Uncensored, Unrestricted POWER. Collection Models that have be abliterated using the HERETIC method. Done properly, this completely removed almost all censorship with no damage to the model. • 100 items • Updated about 19 hours ago • 41
Clara Medical Collection NVIDIA Clara Open Models for medical imaging AI: segment, generate, and reason across CT, MRI, and X-ray. Built on MONAI by NVIDIA. • 6 items • Updated 2 days ago • 14
Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 56 items • Updated 2 days ago • 124
view article Article Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement Dec 3, 2025 • 14
Qwen 3 / 2.5 Reasoning/Thinking REG + MOEs. Collection Qwen 3 / 2.5 Reasoning/Thinking models in both regular and MOE configuration built by me. Source code links also below too. • 67 items • Updated 19 days ago • 12
BioNeMo - Optimize Collection NVIDIA BioNeMo Models for Optimization • 4 items • Updated 2 days ago • 7
MXFP4 Hybrid GGUF Collection MXFP4 hybrid GGUF models getting... well.. Getting some interesting results. THIS IS NOW SURPASSED VASTLY BY MAGIC QUANT COLLECTION! • 11 items • Updated Dec 1, 2025 • 6
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 26 days ago • 133
OpenMed NER: Open-Source, Domain-Adapted State-of-the-Art Transformers for Biomedical NER Across 12 Public Datasets Paper • 2508.01630 • Published Aug 3, 2025 • 15