Mayank022/Audio-Language-Model

Audio-Text-to-Text

audio-language-model

Mayank022 committed on Feb 26

Commit d8fc8a0 · verified · 1 Parent(s): e9c54ba

Update README.md

Files changed (1)

README.md +2 -1

@@ -25,7 +25,8 @@ Vocal LLM is a joint audio-language model that bridges a frozen [Whisper](https:
 ## Architecture
-<img src="Joint_embedding_model_Sarvam_with_Whisper.svg" alt="Vocal LLM Architecture" width="100%">
+![Joint_embedding_model_Sarvam_with_Whisper](https://cdn-uploads.huggingface.co/production/uploads/666c3d6489e21df7d4a02805/hpryyOCYGnA3a5LD6BeZI.png)
 Vocal LLM consists of three components: