ground-zero / notebooks

Commit History

step 2: add app_minimal.py — zero-shot baseline for rebuild.
34edd5f

Broulaye Doumbia commited on

fix read me
da3a060

Broulaye Doumbia commited on

restructuring
76aac1b

Broulaye Doumbia commited on

Add torchcodec install for datasets 4.x audio decoding
bb78cbf

jefffffff9 Claude Opus 4.7 commited on

Prepare training stack for RunPod: env-aware notebook + bootstrap script
9049ef3

jefffffff9 Claude Sonnet 4.6 commited on

Fix jiwer crash on post-normalisation empty refs; register SLR106/105 datasets
6682858

jefffffff9 Claude Sonnet 4.6 commited on

Fix SyntaxError in Cell 17: unterminated f-string literal
58f431a

jefffffff9 Claude Sonnet 4.6 commited on

Fix compute_metrics crash on empty eval references in Fula training
3632a23

jefffffff9 Claude Sonnet 4.6 commited on

Add Adlam/Pular Fula integration: transliterator + 3 new datasets + normalisation pipeline
ced078c

jefffffff9 Claude Sonnet 4.6 commited on

Fix language mixing: per-language prompts + Mali Bambara / Guinea Pular context
40cf84d

jefffffff9 Claude Sonnet 4.6 commited on

Phase 3: Voice-to-Voice S2S pipeline — F5-TTS, LLM brain, CER metric
8952fff

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 16 ValueError: load model fp32 so AMP gradient scaler works
cd017e2

jefffffff9 Claude Sonnet 4.6 commited on

Fix Wikipedia and HF harvest buttons
a5737ac

jefffffff9 Claude Sonnet 4.6 commited on

Fix eval_loss nan + add generation_config to Seq2SeqTrainer
9f07d46

jefffffff9 Claude Sonnet 4.6 commited on

Fix GenerationConfig deprecation warning in cells 13 and 16
1f2993e

jefffffff9 Claude Sonnet 4.6 commited on

Auto-cap MAX_STEPS based on dataset size after Cell 12 builds train_ds
73a3f3a

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 15: detect eval_strategy vs evaluation_strategy at runtime
1c462f3

jefffffff9 Claude Sonnet 4.6 commited on

Proactive notebook audit: fix 5 issues before they cause errors
aef60bc

jefffffff9 Claude Sonnet 4.6 commited on

Simplify Cell 2: no restart needed since PEFT is removed
8e41441

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 16 TypeError: replace PEFT LoRA with freeze/unfreeze fine-tuning
6dc0dfc

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 16 TypeError: target encoder-only LoRA to avoid PEFT/transformers 5.x conflict
c37ce59

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 14: cast input_features to fp16 in collator
3ed28bd

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 16: remove tokenizer arg from Seq2SeqTrainer (renamed in transformers 5.x)
829fd3d

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 14: replace evaluate.load('wer') with jiwer directly
c1bce9f

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 13: drop bitsandbytes, load whisper-small in fp16
8cf3564

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 2: restart kernel after torchvision reinstall using sentinel file
c3a3cb7

jefffffff9 Claude Sonnet 4.6 commited on

Cell 12: never raise, let Cell 12b handle no-data case gracefully
a481b98

jefffffff9 Claude Sonnet 4.6 commited on

Cell 4: remove Common Voice (moved to Mozilla Data Collective, off HF)
c8517b6

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 12: add diagnostics + synthetic fallback cell (12b)
262db3e

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 11: bypass WhisperProcessor entirely to avoid torchvision crash
65e1d64

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 2: reinstall torchvision to match Kaggle CUDA version at runtime
257bae8

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cells 2+11: unpin torch, use direct WhisperProcessor import
0d6357e

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 10: correct r3 assertion to 'visit now!' (URL strips link not preceding word)
5a82441

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 10: add ! and ? to allowlist, fix smoke test assertions
295b7c6

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 8: remove FLEURS fallback + strip emoji (charmap safe)
f1012fa

jefffffff9 Claude Sonnet 4.6 commited on

Fix Kaggle push: title slug + Windows UTF-8 encoding
842ded5

jefffffff9 Claude Sonnet 4.6 commited on

Add Kaggle auto-sync: kernel-metadata.json + post-commit hook
4bd45bb

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 8: correct WaxalNLP subset names + drop fleurs fallback
bfa9d46

jefffffff9 Claude Sonnet 4.6 commited on

Fix Cell 6: replace get_last_checkpoint import with inline scanner
39604b3

jefffffff9 Claude Sonnet 4.6 commited on

Add Kaggle Master Trainer — Deep Sleep fine-tuning notebook
427d4a2

jefffffff9 Claude Sonnet 4.6 commited on

Phase 2: Waxal TTS — Bambara voice output + Fula training notebook
dd38e25

jefffffff9 Claude Sonnet 4.6 commited on

Initial commit: Sahel-Agri Voice AI
76db545

jefffffff9 commited on