Ali Janati (Na0s)
AI & ML interests: NLP, Speech Recognition, Computer Vision, Time Series Forecasting.
Depth-pruned and fine-tuned Llama-3.1-8B
- Na0s/Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT-3.0 (Text Generation, 7B)
- Na0s/Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT-2.0 (Text Generation, 7B)
- Na0s/Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT-1.0 (Text Generation, 7B)
- Na0s/Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT (Text Generation, 7B)
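A minimal sketch of the recipe behind this collection: drop four decoder layers from Llama-3.1-8B (32 layers down to 28, roughly 8B to 7B parameters), then attach a LoRA adapter for recovery fine-tuning. The layer indices and LoRA hyperparameters below are illustrative assumptions, not the settings used for the checkpoints above.

```python
# Sketch: depth-prune 4 decoder layers from Llama-3.1-8B, then add LoRA.
import torch
from torch import nn
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B", torch_dtype=torch.bfloat16
)

# Drop 4 contiguous layers near the top of the stack; which layers matter
# least is model-specific, so these indices are an assumption.
drop = {24, 25, 26, 27}
kept = [l for i, l in enumerate(model.model.layers) if i not in drop]
for i, layer in enumerate(kept):
    layer.self_attn.layer_idx = i  # keep KV-cache indexing consistent
model.model.layers = nn.ModuleList(kept)
model.config.num_hidden_layers = len(kept)

# LoRA so that recovery fine-tuning only trains small adapter matrices.
lora = LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM",
                  target_modules=["q_proj", "v_proj"])
model = get_peft_model(model, lora)
model.print_trainable_parameters()
```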
Pruned MoEs (Mixtral-8x7B-Instruct-v0.1)
Experts pruned from Mixtral-8x7B-Instruct-v0.1, following the paper "A Provably Effective Method for Pruning Experts in Fine-tuned Sparse MoEs" (a pruning sketch follows the list below).
- mistralai/Mixtral-8x7B-Instruct-v0.1 (47B)
- Na0s/Mixtral-8x7B-Instruct-v0.1-LoRA-on-Gates (Text Generation, 47B)
- Na0s/Mixtral-8x7B-Instruct-v0.1-exhaustive-LoRA (Text Generation, 47B)
- Na0s/Mixtral-8x7B-v0.1-instruct-pruned-random-1-experts (Text Generation, 41B)
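The sketch below removes one expert per MoE layer from Mixtral (8 experts down to 7, roughly 47B to 41B parameters, matching the "pruned-random-1-experts" checkpoint). The expert choice here is random; the paper scores experts before pruning, so treat this as an illustration of the surgery, not the selection criterion.

```python
# Sketch: prune one random expert per MoE layer of Mixtral-8x7B-Instruct.
# Loading the full model requires substantial memory (~94 GB in bf16).
import random
import torch
from torch import nn
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mixtral-8x7B-Instruct-v0.1", torch_dtype=torch.bfloat16
)

for layer in model.model.layers:
    moe = layer.block_sparse_moe
    drop = random.randrange(len(moe.experts))                # expert to prune
    keep = [i for i in range(len(moe.experts)) if i != drop]
    moe.experts = nn.ModuleList(moe.experts[i] for i in keep)
    # Shrink the router so it no longer scores the removed expert.
    gate = nn.Linear(moe.gate.in_features, len(keep), bias=False)
    gate.weight.data = moe.gate.weight.data[keep].clone()
    moe.gate = gate
    moe.num_experts = len(keep)

model.config.num_local_experts = model.config.num_local_experts - 1
```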
Medical Chatbot
- Na0s/Llama-3.2-3B-Instruct-Medical-Chatbot-LoRA-FT (Text Generation, 3B)
- Na0s/Llama-3.2-3B-Medical-Chatbot-LoRA-FT (Text Generation, 3B)
- meta-llama/Llama-3.2-3B-Instruct (Text Generation, 3B)
- meta-llama/Llama-3.2-3B (Text Generation, 3B)
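A minimal usage sketch, assuming the fine-tuned repo hosts a PEFT adapter over meta-llama/Llama-3.2-3B-Instruct; if the weights are already merged, loading the repo directly with AutoModelForCausalLM is enough. The example prompt is illustrative.

```python
# Sketch: chat with the medical LoRA fine-tune via transformers + peft.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Llama-3.2-3B-Instruct"
adapter_id = "Na0s/Llama-3.2-3B-Instruct-Medical-Chatbot-LoRA-FT"

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(model, adapter_id)  # attach the adapter

messages = [{"role": "user", "content": "What are common symptoms of anemia?"}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True,
                                       return_tensors="pt")
out = model.generate(inputs, max_new_tokens=200, do_sample=False)
print(tokenizer.decode(out[0, inputs.shape[-1]:], skip_special_tokens=True))
```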
Differential transformers
Fine-tuning Llama-3.2-3B-Instruct on medical Q&A using differential attention (in progress). Paper: https://arxiv.org/pdf/2410.05258
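The core idea of the Differential Transformer paper is to compute two softmax attention maps from split query/key groups and subtract them, which cancels common-mode attention noise. The single-head sketch below shows only that core; it omits multi-head structure, causal masking, GroupNorm, and the paper's per-layer λ reparameterization, so it is illustrative rather than faithful.

```python
# Sketch: differential attention core (arXiv:2410.05258), single head.
import math
import torch
from torch import nn

class DiffAttention(nn.Module):
    def __init__(self, d_model: int, d_head: int, lambda_init: float = 0.8):
        super().__init__()
        # Two query/key groups share one value projection.
        self.q_proj = nn.Linear(d_model, 2 * d_head, bias=False)
        self.k_proj = nn.Linear(d_model, 2 * d_head, bias=False)
        self.v_proj = nn.Linear(d_model, d_head, bias=False)
        self.lam = nn.Parameter(torch.tensor(lambda_init))  # learnable lambda
        self.d_head = d_head

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model)
        q1, q2 = self.q_proj(x).chunk(2, dim=-1)
        k1, k2 = self.k_proj(x).chunk(2, dim=-1)
        v = self.v_proj(x)
        scale = 1.0 / math.sqrt(self.d_head)
        a1 = torch.softmax(q1 @ k1.transpose(-2, -1) * scale, dim=-1)
        a2 = torch.softmax(q2 @ k2.transpose(-2, -1) * scale, dim=-1)
        return (a1 - self.lam * a2) @ v  # differential attention map times V
```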
Medical Whisper
Fine-tuned Whisper Large v3 on doctor/patient consultations.
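No repo id is listed for this checkpoint, so the base openai/whisper-large-v3 stands in below; substitute the fine-tuned model once published. A minimal transcription sketch using the transformers ASR pipeline:

```python
# Sketch: transcribe a consultation recording with Whisper Large v3.
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-large-v3",  # swap in the fine-tuned checkpoint
    chunk_length_s=30,                # long consultations are chunked
)
print(asr("consultation.wav")["text"])  # path is a placeholder
```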