Zmushko Philip
fzmushko
AI & ML interests
None yet
Recent Activity
submitted a paper about 2 hours ago
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining upvoted a paper about 2 months ago
MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning upvoted a paper 3 months ago
Reasoning Shift: How Context Silently Shortens LLM ReasoningOrganizations
None yet