EvaByte

non-profit

submitted a paper to Daily Papers 3 months ago

Scratchpad Patching: Decoupling Compute from Patch Size in Byte-Level Language Models

Paper • 2605.09630 • Published May 10 • 1 upvote

submitted a paper to Daily Papers 6 months ago

Proxy Compression for Language Modeling

Paper • 2602.04289 • Published Feb 4 • 3 upvotes

updated 3 models over 1 year ago

EvaByte/EvaByte

6B params • Updated Feb 28, 2025 • 224 downloads • 36 likes

EvaByte/EvaByte-Phase1

6B params • Updated Feb 28, 2025 • 2 downloads • 8 likes

EvaByte/EvaByte-SFT

6B params • Updated Feb 28, 2025 • 475 downloads • 41 likes

published 3 models over 1 year ago

EvaByte/EvaByte-SFT

6B params • Updated Feb 28, 2025 • 475 downloads • 41 likes

EvaByte/EvaByte-Phase1

6B params • Updated Feb 28, 2025 • 2 downloads • 8 likes

EvaByte/EvaByte

6B params • Updated Feb 28, 2025 • 224 downloads • 36 likes

authored a paper over 2 years ago

Self-Infilling Code Generation

Paper • 2311.17972 • Published Nov 29, 2023

authored a paper about 3 years ago

CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling

Paper • 2210.07661 • Published Oct 14, 2022