Together

company

Verified

https://together.ai

togethercompute

togethercomputer

Inference Provider

2,535,602 monthly requests

AI & ML interests

Foundation Models, Decentralized Computing, Open Source AI.

Recent Activity

JamesSand authored a paper about 1 month ago

No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions

Zhongzhu submitted a paper about 1 month ago

Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation

JamesSand submitted a paper about 1 month ago

No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions

View all activity

Papers

Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

View all Papers

Articles

Fine-tune Any LLM from the Hugging Face Hub with Together AI

KaiserWhoLearns

authored 2 papers 3 months ago

What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models

Paper • 2506.06485 • Published Jun 6, 2025 • 5

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

KaiserWhoLearns

submitted a paper to Daily Papers 3 months ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

submitted a paper to Daily Papers 3 months ago

Introspective Diffusion Language Models

Paper • 2604.11035 • Published Apr 13 • 25

KaiserWhoLearns

authored a paper 5 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

KaiserWhoLearns

submitted a paper to Daily Papers 5 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

submitted a paper to Daily Papers 5 months ago

Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

Paper • 2602.21196 • Published Feb 24 • 7

KaiserWhoLearns

authored a paper 6 months ago

FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights

Paper • 2602.02905 • Published Feb 2 • 5

posted an update 12 months ago

Post

414

🚀 Full-Quality Wan2.2 Video Generation on a single 24GB GPU — Powered by DFloat11

We just released the DFloat11 compressed Wan2.2 models. Now you can run full-quality Wan2.2 video generation on a single 24GB GPU, thanks to DFloat11 compression and CPU offloading.

🔗 Image-to-Video: DFloat11/Wan2.2-I2V-A14B-DF11
🔗 Text-to-Video: DFloat11/Wan2.2-T2V-A14B-DF11

authored a paper over 1 year ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published Apr 15, 2025 • 31

authored 2 papers over 1 year ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 61

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 59

authored a paper almost 2 years ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 42

KaiserWhoLearns

authored a paper almost 2 years ago

Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models

Paper • 2408.06663 • Published Aug 13, 2024 • 16

authored a paper about 2 years ago

Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Paper • 2110.03313 • Published Oct 7, 2021 • 1

authored a paper about 2 years ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 59

KaiserWhoLearns

authored a paper about 2 years ago

The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks

Paper • 2310.17514 • Published Oct 26, 2023 • 1

authored 3 papers about 2 years ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 39

RuCoLA: Russian Corpus of Linguistic Acceptability

Paper • 2210.12814 • Published Oct 23, 2022 • 1

Petals: Collaborative Inference and Fine-tuning of Large Models

Paper • 2209.01188 • Published Sep 2, 2022 • 2