Yuriy Perezhohin
yuriyvnv
23 followers · 24 following
https://scholar.google.com/citations?user=I5uzFtwAAAAJ&hl=en
yuriyvnv
yuriyperezhohin
AI & ML interests
Automatic Speech Recognition, Embeddings, Code Generation, Synthetic Data Generation and Filtering
Recent Activity
posted an update about 16 hours ago
The WAVe paper is officially out in the Information Sciences journal. You saw the PT and NL model releases earlier this year. This is the peer-reviewed paper behind them, with the full method, ablations, and downstream ASR evaluation.

Quick recap: WAVe is a 1B multimodal embedding model that filters synthetic speech at the word level, not the sentence level. On Portuguese ASR it cuts training steps by 34%, improves cross-domain generalization by 50%, and matches WER with 30% less synthetic data.

Resources
- Paper: https://www.sciencedirect.com/science/article/pii/S0020025526005220
- PT model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT
- NL model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL
- Collection: https://huggingface.co/collections/yuriyvnv/multi-modal-embeddings-for-synthetic-transcript-filtering
- Code: https://github.com/yuriyvnv/WAVe

If you train ASR on synthetic or back-translated data, I would like to see WAVe benchmarked on other languages.

@reach-vb @ylacombe @hf-audio @BramVanroy
#speech #asr #multimodal #syntheticdata #lowresource
updated a collection about 16 hours ago: Multi-Modal Embeddings for Synthetic Transcript Filtering
yuriyvnv's datasets (6), sorted by recently updated
yuriyvnv/synthetic_transcript_pt · Updated 2 days ago · 253k · 724
yuriyvnv/synthetic_asr_et_sl · Updated Feb 17 · 80.7k · 103
yuriyvnv/synthetic_transcript_nl · Updated Nov 24, 2025 · 34.9k · 744
yuriyvnv/capes_synthetic_audio_filtered · Updated Jul 20, 2025 · 72.6k · 100
yuriyvnv/triage_synthetic_classification · Updated Jun 27, 2025 · 100 · 21
yuriyvnv/triage_transcriptions · Updated Jun 25, 2025 · 87 · 39