Zmushko Philip

fzmushko

AI & ML interests

None yet

Recent Activity

submitted a paper about 2 hours ago

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining

upvoted a paper about 2 months ago

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

upvoted a paper 3 months ago

Reasoning Shift: How Context Silently Shortens LLM Reasoning

View all activity

Organizations

None yet

fzmushko 's models

None public yet