
zy

lu-vae

AI & ML interests

NLP text generation

Recent Activity

upvoted a paper about 12 hours ago
Mixture-of-Depths Attention
upvoted a paper about 12 hours ago
Attention Residuals
liked a dataset 1 day ago
stepfun-ai/Step-3.5-Flash-SFT

Organizations

Chinese-Vicuna, StepFun, mask-mask-mask-mask

authored 4 papers about 1 month ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24, 2025 • 32

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 38

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 192
authored a paper about 1 year ago

Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think

Paper • 2503.00948 • Published Mar 2, 2025 • 3
authored 2 papers over 1 year ago

On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion

Paper • 2406.15480 • Published Jun 17, 2024 • 2

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

Paper • 2406.15479 • Published Jun 17, 2024 • 2