Ingrid Tveten
ingridtv
·
AI & ML interests
Medical image analysis and machine learning
Recent Activity
updated a collection about 1 month ago
Multimodal/VLM updated a collection about 1 month ago
Multimodal/VLM updated a collection 4 months ago
GenAI/LLMOrganizations
None yet
Medical images, encoding
Multimodal/VLM
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 395k • 1.6k -
microsoft/Phi-4-mini-instruct
Text Generation • Updated • 1.53M • • 737 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 157 -
Emerging Properties in Unified Multimodal Pretraining
Paper • 2505.14683 • Published • 134
Medical LM, Specific
GenAI/LLM
-
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 491k • • 1.5k -
Qwen/CodeQwen1.5-7B-Chat
Text Generation • 7B • Updated • 30k • 353 -
lmstudio-community/gemma-3-12b-it-GGUF
Image-Text-to-Text • 12B • Updated • 35.2k • 44 -
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text • 12B • Updated • 4.56k • 270
Document understanding
Medical LM, Specific
Medical images, encoding
GenAI/LLM
-
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 491k • • 1.5k -
Qwen/CodeQwen1.5-7B-Chat
Text Generation • 7B • Updated • 30k • 353 -
lmstudio-community/gemma-3-12b-it-GGUF
Image-Text-to-Text • 12B • Updated • 35.2k • 44 -
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text • 12B • Updated • 4.56k • 270
Multimodal/VLM
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 395k • 1.6k -
microsoft/Phi-4-mini-instruct
Text Generation • Updated • 1.53M • • 737 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 157 -
Emerging Properties in Unified Multimodal Pretraining
Paper • 2505.14683 • Published • 134