Jialiang Cheng
Julius-L
·
AI & ML interests
None yet
Recent Activity
authored
a paper
about 23 hours ago
SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models
authored
a paper
about 23 hours ago
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
liked
a dataset
6 months ago
Salesforce/wikitext