Leo's picture

Leo PRO

leideng

·

https://leideng.github.io/

AI & ML interests

Efficient AI, Sparse Attention

Recent Activity

liked a model about 7 hours ago

google/umt5-xxl

liked a model 1 day ago

updated a collection 2 days ago

View all activity

Organizations

None yet

liked a model about 7 hours ago

google/umt5-xxl

Updated Jul 3, 2023 • 94k • 61

liked a model 1 day ago

facebook/cwm

33B • Updated Oct 15, 2025 • 23.4k • 269

updated a collection 2 days ago

SFT

7 items • Updated 2 days ago

authored 8 papers 2 days ago

Extending Context Window of Large Language Models via Semantic Compression

Paper • 2312.09571 • Published Dec 15, 2023 • 16

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14, 2024 • 34

High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models

Paper • 2309.15889 • Published Sep 27, 2023

HyLRA: Hybrid Layer Reuse Attention for Efficient Long-Context Inference

Paper • 2602.00777 • Published Jan 31

A Mathematical Theory of Top-$k$ Sparse Attention via Total Variation Distance

Paper • 2512.07647 • Published Dec 8, 2025

Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding

Paper • 2406.12331 • Published Jun 18, 2024

A Mean Field Ansatz for Zero-Shot Weight Transfer

Paper • 2408.08681 • Published Aug 16, 2024

Bayesian Test-Time Adaptation for Vision-Language Models

Paper • 2503.09248 • Published Mar 17, 2025

updated a collection 3 days ago

SFT

7 items • Updated 2 days ago

liked a dataset 3 days ago

GAIR/lima

Viewer • Updated Jun 8, 2023 • 1.33k • 5.08k • 464

updated 3 collections 3 days ago

SFT

7 items • Updated 2 days ago

Pretrain

2 items • Updated 3 days ago

RL

5 items • Updated 3 days ago