Federico Cocchi

fede97

https://federico1-creator.github.io/Federico_Cocchi/

federico1-creator

AI & ML interests

Multimodal LLM - Computer Vision

fede97's collections (5)

Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025]

aimagelab/ReflectiVA

Image-Text-to-Text • 8B • Updated Apr 5, 2025 • 50 • 2
aimagelab/ReflectiVA-Data

Preview • Updated Apr 5, 2025 • 71

ELSA EU Project

Datasets and models created within the ELSA (European Lighthouse on Secure and Safe AI) project for the multimedia use case.

aimagelab/CoDE

Image Feature Extraction • 5.52M • Updated Dec 12, 2024 • 259 • 2
elsaEU/ELSA_D3

Viewer • Updated Mar 27, 2025 • 2.31M • 1.25k • 12
elsaEU/ELSA_D3_external_test

Viewer • Updated Mar 25, 2024 • 194k • 319 • 3
elsaEU/ELSA1M_track1

Viewer • Updated Aug 27, 2023 • 12.2k • 189 • 3

LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1

aimagelab/LLaVA_MORE-llama_3_1-8B-finetuning

Image-Text-to-Text • 8B • Updated Aug 2, 2025 • 352 • 11
aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-finetuning

Image-Text-to-Text • 8B • Updated Apr 24, 2025 • 4 • 2

The first generative model trained from scratch for the Latin language

LatinGPT

Space • Sleeping
itserr/scratch_2-nodes_tokenizer_latbert-original_packing_fcocchi

Text Generation • 0.1B • Updated Jun 6, 2024 • 3
itserr/latin_dataset_1.0

Viewer • Updated Jan 9, 2024 • 11.5M • 7

https://arxiv.org/abs/2311.16254

aimagelab/safeclip_vit-l_14_336

Text-to-Image • 0.4B • Updated Jul 11, 2024 • 3
aimagelab/safeclip_vit-l_14

Text-to-Image • 0.4B • Updated Jul 15, 2024 • 26 • 3
aimagelab/safeclip_vit-h_14

Text-to-Image • 1.0B • Updated Jul 11, 2024 • 10
aimagelab/safeclip_sd_20

Text-to-Image • 1.0B • Updated Jul 11, 2024 • 7
