Revert to last working version (verbose logs + VAE + simple MSE loss — before v3/v4 broke training) ce37111 verified krystv commited on 5 days ago
Revert to v4 epoch-based loop + fix streaming repeat + fix scheduler + remove data std norm 2ddc44d verified krystv commited on 5 days ago
v4.1: Fix training loop — step-based (not epochs), auto-cycle dataset, fix streaming exhaust, fix scheduler warning 1c4aa64 verified krystv commited on 5 days ago
v4: Add Min-SNR-γ + velocity direction loss + CCA + multi-scale loss + gate bias (DeepSeek/FasterDiT/DiCo/DiMR research) e53aa97 verified krystv commited on 5 days ago
v3: Add large anime/art datasets, cosine-with-restarts schedule, 2x LR, smart warmup, streaming support, resume training d01fc8b verified krystv commited on 6 days ago
Add verbose training logs: ETA, loss trend, speed, VRAM, grad norm, epoch summaries 0c2542e verified krystv commited on 6 days ago
Optimize notebook: 40% faster CfC blocks, simplified spatial mix a4f6778 verified krystv commited on 6 days ago
v2: Add VAE latent training, fix datasets, streaming support c2b4760 verified krystv commited on 6 days ago