Xiangchendong
Xiang-cd
AI & ML interests
pre-train models
Recent Activity
authored a paper about 1 month ago
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning upvoted a paper about 1 month ago
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning upvoted a paper about 1 month ago
Geometry-Aware Rotary Position Embedding for Consistent Video World ModelOrganizations
None yet