Alan Gunning
AI & ML interests
Recent Activity
Organizations
- SleepingAgents2
Pdf To Clean Txt
🏢2Takes a pdf, cleans and outputs a txt
- Running4
Historical OCR
⚙4advanced OCR application for historical document analysis
- SleepingAgents3
FadedTextRestoration
🔥3Restore faded text from images
- RunningAgents14
Scanned Document Denoise Reconstruct
⚡14Clean and restore noisy scanned documents
- Runtime errorAgentsFeatured2.77k
XTTS
🐸2.77kGenerate speech from text using a reference voice
-
coqui/XTTS-v2
Text-to-Speech • Updated • 9.11M • 3.56k -
myshell-ai/OpenVoice
Text-to-Speech • Updated • 488 - RunningAgentsFeatured1.13k
OpenVoice
🤗1.13kGenerate speech in a cloned voice from a short audio clip
-
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 5.34M • • 5.74k -
nvidia/canary-1b
Automatic Speech Recognition • Updated • 2.57k • 457 - RunningAgents63
Insanelyfastwhisper
💻63Convert audio to subtitles
-
j-macnamara/wav2vec2-large-xls-r-2b-Irish-gaIE
Automatic Speech Recognition • 2B • Updated • 9 • 1
-
speechbrain/sepformer-wham-enhancement
Audio-to-Audio • Updated • 149 • 33 -
speechbrain/sepformer-whamr-enhancement
Audio-to-Audio • Updated • 238 • 14 -
speechbrain/sepformer-dns4-16k-enhancement
Audio-to-Audio • Updated • 299 • 27 -
speechbrain/sepformer-wham16k-enhancement
Audio-to-Audio • Updated • 4.55k • 32
- Runtime errorAgentsFeatured490
YOLO World
🔥490Detect objects in images or videos
- Runtime errorAgentsFeatured279
CoTracker
🎨279Track points in a video
- PausedAgentsFeatured315
PaliGemma Demo
🤲315Annotate and describe images with text prompts
- Running on ZeroAgentsFeatured844
Florence 2
📉844Generate captions, detections, and segmentations for any image
-
partypress/partypress-monolingual-ireland
Text Classification • Updated • 3 -
pyannote/speaker-diarization-3.1
Automatic Speech Recognition • Updated • 9.91M • 2.01k - Running on ZeroAgents94
VIDEO TRANSLATION TRANSCRIPTION
🔥94Generate translated subtitles and embed them into your video
- Runtime errorAgentsFeatured329
Video Dubbing
🚀329Dub videos in another language with cloned voice
- SleepingAgents2
Pdf To Clean Txt
🏢2Takes a pdf, cleans and outputs a txt
- Running4
Historical OCR
⚙4advanced OCR application for historical document analysis
- SleepingAgents3
FadedTextRestoration
🔥3Restore faded text from images
- RunningAgents14
Scanned Document Denoise Reconstruct
⚡14Clean and restore noisy scanned documents
- Runtime errorAgentsFeatured2.77k
XTTS
🐸2.77kGenerate speech from text using a reference voice
-
coqui/XTTS-v2
Text-to-Speech • Updated • 9.11M • 3.56k -
myshell-ai/OpenVoice
Text-to-Speech • Updated • 488 - RunningAgentsFeatured1.13k
OpenVoice
🤗1.13kGenerate speech in a cloned voice from a short audio clip
- Runtime errorAgentsFeatured490
YOLO World
🔥490Detect objects in images or videos
- Runtime errorAgentsFeatured279
CoTracker
🎨279Track points in a video
- PausedAgentsFeatured315
PaliGemma Demo
🤲315Annotate and describe images with text prompts
- Running on ZeroAgentsFeatured844
Florence 2
📉844Generate captions, detections, and segmentations for any image
-
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 5.34M • • 5.74k -
nvidia/canary-1b
Automatic Speech Recognition • Updated • 2.57k • 457 - RunningAgents63
Insanelyfastwhisper
💻63Convert audio to subtitles
-
j-macnamara/wav2vec2-large-xls-r-2b-Irish-gaIE
Automatic Speech Recognition • 2B • Updated • 9 • 1
-
partypress/partypress-monolingual-ireland
Text Classification • Updated • 3 -
pyannote/speaker-diarization-3.1
Automatic Speech Recognition • Updated • 9.91M • 2.01k - Running on ZeroAgents94
VIDEO TRANSLATION TRANSCRIPTION
🔥94Generate translated subtitles and embed them into your video
- Runtime errorAgentsFeatured329
Video Dubbing
🚀329Dub videos in another language with cloned voice
-
speechbrain/sepformer-wham-enhancement
Audio-to-Audio • Updated • 149 • 33 -
speechbrain/sepformer-whamr-enhancement
Audio-to-Audio • Updated • 238 • 14 -
speechbrain/sepformer-dns4-16k-enhancement
Audio-to-Audio • Updated • 299 • 27 -
speechbrain/sepformer-wham16k-enhancement
Audio-to-Audio • Updated • 4.55k • 32