Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space • Paper • 2510.04476 • Published Oct 6, 2025 • 16 upvotes
Transformer Calculator 📊 • Space • Calculate memory, parameters, and FLOPs for transformer models • 36 likes
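As a rough illustration of the quantities such a calculator reports, here is a minimal Python sketch. It is not the Space's actual code: the function name is made up, and the 12·d_model² per-layer parameter count and 2-FLOPs-per-parameter-per-token rule are standard back-of-the-envelope approximations, not exact for any particular architecture.

```python
# A minimal sketch (not the Space's code) of transformer size/compute estimates.

def transformer_estimates(n_layers, d_model, n_heads, vocab_size,
                          seq_len, bytes_per_param=2):
    """Rough parameter, FLOPs, and memory estimates for a decoder-only model."""
    d_head = d_model // n_heads
    # Per layer: Q,K,V,O projections (4*d^2) + MLP with 4x expansion (8*d^2).
    params = n_layers * 12 * d_model ** 2 + vocab_size * d_model  # + embeddings
    # Forward pass: ~2 FLOPs per parameter per token (one multiply-accumulate).
    flops_per_token = 2 * params
    # KV cache: K and V tensors per layer, each seq_len x n_heads x d_head.
    kv_cache_bytes = 2 * n_layers * seq_len * n_heads * d_head * bytes_per_param
    weight_bytes = params * bytes_per_param
    return params, flops_per_token, kv_cache_bytes, weight_bytes

# Example: a LLaMA-7B-like config at 4k context in fp16.
p, f, kv, w = transformer_estimates(32, 4096, 32, 32000, 4096)
print(f"params ~{p/1e9:.1f}B, fwd FLOPs/token ~{f/1e9:.1f}G, "
      f"KV cache ~{kv/2**30:.1f} GiB, weights ~{w/2**30:.1f} GiB")
```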
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU Clusters • Paper • 2408.04093 • Published Aug 7, 2024 • 5 upvotes
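The key observation behind Tree Attention is that exact softmax attention over a long sequence decomposes into per-chunk partial results that combine associatively, so decoding can be tree-reduced across GPUs in logarithmic depth. Below is a minimal single-process NumPy sketch of that decomposition; names are illustrative rather than the paper's API, and the 1/sqrt(d) score scaling is omitted for brevity.

```python
# A single-process sketch of the associative decomposition Tree Attention
# exploits: per-chunk partial softmax results merge exactly via a
# logsumexp-style combine, enabling a tree reduction across devices.
import numpy as np

def partial_attn(q, k_chunk, v_chunk):
    """Partial result for one KV chunk: (max score, exp-sum, weighted values)."""
    s = k_chunk @ q                      # scores for this chunk, shape (n,)
    m = s.max()
    w = np.exp(s - m)
    return m, w.sum(), w @ v_chunk       # unnormalized partial output

def merge(a, b):
    """Associative combine of two partials (one tree-reduction step)."""
    (ma, la, oa), (mb, lb, ob) = a, b
    m = max(ma, mb)
    ca, cb = np.exp(ma - m), np.exp(mb - m)
    return m, ca * la + cb * lb, ca * oa + cb * ob

rng = np.random.default_rng(0)
d, n, chunks = 64, 1024, 8
q = rng.standard_normal(d)
k = rng.standard_normal((n, d))
v = rng.standard_normal((n, d))

parts = [partial_attn(q, kc, vc)
         for kc, vc in zip(np.split(k, chunks), np.split(v, chunks))]
while len(parts) > 1:                    # pairwise tree reduction
    parts = [merge(parts[i], parts[i + 1]) for i in range(0, len(parts), 2)]
m, l, o = parts[0]
out = o / l

ref = np.exp(k @ q - (k @ q).max())      # reference: full softmax attention
ref = (ref / ref.sum()) @ v
assert np.allclose(out, ref)
```

Because `merge` is associative, the reduction order is free, which is what lets the paper map it onto the cluster's network topology.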
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression • Paper • 2407.12077 • Published Jul 16, 2024 • 57 upvotes
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence • Paper • 2404.05892 • Published Apr 8, 2024 • 40 upvotes
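Eagle (RWKV-5) replaces the scalar recurrent state of earlier RWKV versions with a matrix-valued state per head, and Finch (RWKV-6) makes the per-channel decay data-dependent (the "dynamic recurrence"). The sketch below is a heavy simplification of that core recurrence: it omits RWKV's bonus term, token shift, and gating, and the shapes and names are illustrative assumptions rather than the released implementation.

```python
# A simplified sketch of the matrix-valued state recurrence in Eagle/Finch.
# In Finch the per-channel decay w[t] is produced from the input at each step.
import numpy as np

def wkv_head(r, k, v, w):
    """r, k: (T, d_k); v: (T, d_v); w: (T, d_k) per-step decay in (0, 1)."""
    d_k, d_v = k.shape[1], v.shape[1]
    S = np.zeros((d_k, d_v))             # matrix-valued recurrent state
    out = np.empty((len(k), d_v))
    for t in range(len(k)):
        out[t] = r[t] @ S                # read the state with receptance
        S = w[t][:, None] * S + np.outer(k[t], v[t])  # decay, then write
    return out

rng = np.random.default_rng(0)
T, d_k, d_v = 16, 8, 8
y = wkv_head(rng.standard_normal((T, d_k)),
             rng.standard_normal((T, d_k)),
             rng.standard_normal((T, d_v)),
             rng.uniform(0.9, 0.999, (T, d_k)))
print(y.shape)  # (16, 8)
```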