4 3 10

Andrew Rouditchenko

h9LtLSb

http://people.csail.mit.edu/roudi/

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

kyutai/DailyTalkContiguous

upvoted an article about 1 month ago

Decoding Strategies in Large Language Models

authored a paper 9 months ago

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

View all activity

Organizations

liked a dataset about 1 month ago

kyutai/DailyTalkContiguous

Preview • Updated Mar 24, 2025 • 11.7k • 19

upvoted an article about 1 month ago

Article

Decoding Strategies in Large Language Models

Oct 29, 2024

•

107

authored a paper 9 months ago

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

Paper • 2502.01547 • Published Feb 3, 2025

commented a paper 9 months ago

Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?

Paper • 2505.09439 • Published May 14, 2025 • 10 •

upvoted a paper 10 months ago

Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation

Paper • 2406.10082 • Published Jun 14, 2024 • 1

liked a model 10 months ago

ibm-granite/granite-speech-3.3-8b

Automatic Speech Recognition • 9B • Updated Aug 19, 2025 • 116k • 154

liked a model 11 months ago

saurabhati/DASS_small_AudioSet_47.2

Audio Classification • 29.9M • Updated Mar 31, 2025 • 7 • 1

New activity in microsoft/Phi-4-multimodal-instruct 12 months ago

Does the model support beam search for ASR?

👍 1

#31 opened 12 months ago by

h9LtLSb

upvoted an article 12 months ago

Article

A Complete Guide to Audio Datasets

Dec 15, 2022

•

liked a Space almost 3 years ago

CER

🤗

updated a model about 3 years ago

h9LtLSb/whisper-small-uk

Automatic Speech Recognition • Updated Feb 3, 2023

liked a Space about 3 years ago

WER

🤗

updated 4 models about 3 years ago

New activity in google/fleurs about 3 years ago

Text Normalization

#9 opened about 3 years ago by

h9LtLSb

liked a dataset over 3 years ago

google/xtreme_s

Updated Sep 10, 2024 • 8.57k • 66

liked a model over 3 years ago

voidful/wav2vec2-xlsr-multilingual-56

Automatic Speech Recognition • 0.3B • Updated Mar 18, 2023 • 7.64k • 33

liked a Space over 3 years ago

The 🤗 Speech Bench

📈

Find the best speech‑recognition model for your language

Andrew Rouditchenko

AI & ML interests

Recent Activity

Organizations

h9LtLSb's activity

Decoding Strategies in Large Language Models

Does the model support beam search for ASR?

A Complete Guide to Audio Datasets

CER

WER

Text Normalization

The 🤗 Speech Bench