NaiveNeuron/whisper-medium-sk

@@ -1,6 +1,14 @@
 ---
 language:
 - sk
 tags:
 - speech
 - asr
@@ -9,11 +17,6 @@ tags:
 - parliament
 - legal
 - politics
-base_model: openai/whisper-medium
-datasets:
-- erikbozik/slovak-plenary-asr-corpus
-metrics:
-- wer
 model-index:
 - name: whisper-medium-sk
   results:
@@ -24,9 +27,9 @@ model-index:
       name: Common Voice 21 (Slovak test set)
       type: common_voice
     metrics:
-    - name: WER
-      type: wer
       value: 18
   - task:
       type: automatic-speech-recognition
       name: Automatic Speech Recognition
@@ -34,15 +37,15 @@ model-index:
       name: FLEURS (Slovak test set)
       type: fleurs
     metrics:
-    - name: WER
-      type: wer
       value: 7.6
-license: mit
 ---
 # Whisper Medium — Fine-tuned on SloPalSpeech
-This model is a fine-tuned version of [`openai/whisper-medium`](https://huggingface.co/openai/whisper-medium).
 It is adapted for **Slovak ASR** using [SloPalSpeech](https://huggingface.co/datasets/erikbozik/slovak-plenary-asr-corpus): **2,806 hours** of aligned, ≤30 s speech–text pairs from official plenary sessions of the **Slovak National Council**.
 - **Language:** Slovak
@@ -73,7 +76,7 @@ It is adapted for **Slovak ASR** using [SloPalSpeech](https://huggingface.co/dat
 - Multilingual performance is not guaranteed (full-parameter finetuning emphasized Slovak).
 ## 📝 Citation & Paper
-For more details, please see our paper on [arXiv](https://arxiv.org/abs/2509.19270). If you use this model in your work, please cite it as:
 ```bibtex
 @misc{božík2025slopalspeech2800hourslovakspeech,
       title={SloPalSpeech: A 2,800-Hour Slovak Speech Corpus from Parliamentary Data},

 ---
+base_model: openai/whisper-medium
+datasets:
+- erikbozik/slovak-plenary-asr-corpus
 language:
 - sk
+license: mit
+metrics:
+- wer
+library_name: transformers
+pipeline_tag: automatic-speech-recognition
 tags:
 - speech
 - asr
 - parliament
 - legal
 - politics
 model-index:
 - name: whisper-medium-sk
   results:
       name: Common Voice 21 (Slovak test set)
       type: common_voice
     metrics:
+    - type: wer
       value: 18
+      name: WER
   - task:
       type: automatic-speech-recognition
       name: Automatic Speech Recognition
       name: FLEURS (Slovak test set)
       type: fleurs
     metrics:
+    - type: wer
       value: 7.6
+      name: WER
 ---
 # Whisper Medium — Fine-tuned on SloPalSpeech
+This model is a fine-tuned version of [`openai/whisper-medium`](https://huggingface.co/openai/whisper-medium), presented in the paper [SloPal: A 60-Million-Word Slovak Parliamentary Corpus with Aligned Speech and Fine-Tuned ASR Models](https://huggingface.co/papers/2509.19270).
 It is adapted for **Slovak ASR** using [SloPalSpeech](https://huggingface.co/datasets/erikbozik/slovak-plenary-asr-corpus): **2,806 hours** of aligned, ≤30 s speech–text pairs from official plenary sessions of the **Slovak National Council**.
 - **Language:** Slovak
 - Multilingual performance is not guaranteed (full-parameter finetuning emphasized Slovak).
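
The `wer` metric declared in the metadata above is word error rate. As a rough illustration only, and not the evaluation code behind the reported numbers, the sketch below computes WER as word-level edit distance divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for this example; the card does not say which text normalization was applied before scoring, and reading the reported values (18 and 7.6) as percentages is likewise an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration: tokens are split on whitespace
    and compared exactly, with no casing or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution (deň -> den) and one deletion (poslanci) over 4 words.
    score = word_error_rate("dobrý deň vážení poslanci", "dobrý den vážení")
    print(f"WER: {score:.2%}")
```

Multiplying the returned fraction by 100 gives a value on the same scale as the numbers in `model-index`, if those are indeed percentages. Published WER figures usually depend on the normalization step, so scores from this sketch are not directly comparable to the ones reported here.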
 ## 📝 Citation & Paper
+For more details, please see our paper on [arXiv](https://arxiv.org/abs/2509.19270) or the [Hugging Face paper page](https://huggingface.co/papers/2509.19270). If you use this model in your work, please cite it as:
 ```bibtex
 @misc{božík2025slopalspeech2800hourslovakspeech,
       title={SloPalSpeech: A 2,800-Hour Slovak Speech Corpus from Parliamentary Data},