AI & ML interests
A new generation of foundation models from first principles.
mlabonne authored 2 papers 3 months ago
Post · 10332
New family of 1B models just dropped!
> LiquidAI/LFM2.5-1.2B-Base: 10T → 28T tokens
> LiquidAI/LFM2.5-1.2B-Instruct: new large-scale multi-stage RL
> LiquidAI/LFM2.5-1.2B-JP: our most polite model
> LiquidAI/LFM2.5-VL-1.6B: multi-image multilingual
> LiquidAI/LFM2.5-Audio-1.5B: 8× faster, no quality loss
Super proud of this release 🤗
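Below is a minimal sketch of chatting with the Instruct checkpoint via transformers, assuming it ships a standard chat template; the sampling settings are illustrative, not official recommendations.

```python
# Minimal sketch: chat with LFM2.5-1.2B-Instruct through transformers.
# Assumes the checkpoint provides a chat template; sampling values are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "LiquidAI/LFM2.5-1.2B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "Summarize what a liquid foundation model is in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128, do_sample=True, temperature=0.3)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```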
ykhrustalev authored a paper 4 months ago
adityatadimeti authored a paper 5 months ago
fernandofernandes authored 3 papers 5 months ago
Spectrum: Targeted Training on Signal to Noise Ratio
Paper • 2406.06623 • Published • 16
Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation
Paper • 2406.14971 • Published
Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit
Paper • 2506.06607 • Published • 3
zetianli authored a paper 5 months ago
fernandofernandes authored a paper 5 months ago
kohsei authored a paper 5 months ago
sam-paech authored 3 papers 6 months ago
EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models
Paper • 2312.06281 • Published • 2
Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
Paper • 2508.07485 • Published • 10
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models
Paper • 2510.15061 • Published • 3
GAD-cell authored a paper 7 months ago
Post · 8435
LiquidAI/LFM2-8B-A1B just dropped!
8.3B params with only 1.5B active/token 🚀
> Quality ≈ 3–4B dense, yet faster than Qwen3-1.7B
> MoE designed to run on phones/laptops (llama.cpp / vLLM; see the sketch below)
> Pre-trained on 12T tokens → strong math/code/instruction following
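A minimal sketch of serving the model with vLLM's offline API, assuming vLLM supports this MoE architecture; the prompt and sampling settings are illustrative.

```python
# Minimal sketch: run LFM2-8B-A1B with vLLM's offline inference API.
# Assumes vLLM support for this MoE architecture; settings are illustrative.
from vllm import LLM, SamplingParams

llm = LLM(model="LiquidAI/LFM2-8B-A1B")
params = SamplingParams(temperature=0.3, max_tokens=128)

outputs = llm.generate(["Explain mixture-of-experts routing in two sentences."], params)
print(outputs[0].outputs[0].text)
```

For local use on phones/laptops, the same checkpoint can instead be run through llama.cpp once converted to GGUF.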
s-jse authored 2 papers 7 months ago
Post · 3887
⚛️ New drop of tiny task-specific models!
Want to do data extraction, translation, RAG, tool use, or math on a Raspberry Pi? We got you covered! ✅
These tiny models were fine-tuned to perform narrow tasks extremely well, making them competitive with much larger models.
You can deploy them today on-device or even on GPUs for big data operations!
LiquidAI/liquid-nanos-68b98d898414dd94d4d5f99a
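A minimal sketch of running one of these nanos on-device with a transformers pipeline. The model ID below is a hypothetical placeholder, not a real repo; substitute a checkpoint from the liquid-nanos collection linked above.

```python
# Minimal sketch: run a task-specific nano locally via a transformers pipeline.
# "LiquidAI/<nano-model>" is a hypothetical placeholder; pick a real checkpoint
# (e.g. an extraction or translation model) from the liquid-nanos collection.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="LiquidAI/<nano-model>",  # placeholder, not a real repo ID
    device_map="auto",
)
result = generator(
    "Extract the invoice number from: 'Invoice #4821, due March 3.'",
    max_new_tokens=32,
)
print(result[0]["generated_text"])
```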