Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
36.9
TFLOPS
32
6
28
ManniX
PRO
ManniX-ITA
Follow
francisong's profile picture
noxeternae's profile picture
tegridydev's profile picture
66 followers
·
18 following
https://github.com/mann1x
mann1x
AI & ML interests
None yet
Recent Activity
new
activity
about 16 hours ago
ManniX-ITA/Qwen3.6-27B-Omnimerge-v4-MTP-GGUF:
My experience
new
activity
1 day ago
ManniX-ITA/gemma-4-A4B-98e-v7-coder-it-GGUF:
Gets stuck in loops using llama.cpp
posted
an
update
3 days ago
--- 🚀 Gemma-4-A4B 98e v7-coder cohort — loop-fixed re-release. Two 20.8B MoE coders (4B-active), fresh-map prunes of Gemma 4 26B-A4B, 30/128 experts dropped per layer. The headline isn't a benchmark: the agentic loop is gone at the weights, not papered over by the sampler. 🔧 How: at prune time we force-keep the 46 agentic_eog experts a loop-protection signal flags as load-bearing for clean multi-turn termination (+ shared-FFN α=1.2). Result: 0 loops across 48 seeds on every published tier. 📊 Q6_K · llama.cpp · greedy · same host (from summary.json): ⚖️ v7-coder (fkbroad code3/lcb2) — balanced coder: LCB-med-55 98.18, HumanEval 98.17, HE+ 92.07, AIME 80.0, MATH-500 95.0, GSM8K 91, IFEval 92, MultiPL-E 89.7, ARC 92.2. ⚡ v7-coderx (code4/lcb3) — code-maximal: all-hard LCB-77 85.71 (cohort-best; 128e 79.22, v7-coder 84.42), HE+ 93.29, GSM8K 93, MATH-500 95.0, AIME 76.67. Whole budget on code. 🎯 Both land near GPQA ~51 — graduate science is the budget axis, neither is a science model. Pick v7-coder for the broad LCB-medium + HumanEval lead; v7-coderx for the all-hard slice and HE+. 🧪 The harness we used to prove the fix is now an omk tool: agentic-loop-harness replays a frozen agentic conversation across a sampler×seed matrix and reports a fail-rate per chat-template, so you can isolate a loop to one variable. Model-agnostic — any OpenAI-compatible server. The version we shared with Google: https://huggingface.co/google/gemma-4-12B-it/discussions/41#6a3926720abc934d03fd85c0 📦 Each ships bf16 · GGUF (+ CD-* + imatrix + mmproj vision) · NVFP4A16 (~13 GB) · Ollama. 🔗 https://huggingface.co/ManniX-ITA/gemma-4-A4B-98e-v7-coder-it (+ -it-GGUF, -NVFP4A16) · https://ollama.com/mannix/gemma4-98e-v7-coder 🔗 https://huggingface.co/ManniX-ITA/gemma-4-A4B-98e-v7-coderx-it (+ -it-GGUF, -NVFP4A16) · https://ollama.com/mannix/gemma4-98e-v7-coderx 🔧 https://github.com/mann1x/omnimergekit/tree/main/tools/agentic-loop-harness
View all activity
Organizations
None yet
ManniX-ITA
's models
53
Sort: Recently updated
ManniX-ITA/gemma-4-A4B-98e-v7-coder-NVFP4A16
11B
•
Updated
3 days ago
•
117
ManniX-ITA/gemma-4-A4B-98e-v7-coder-it-GGUF
20B
•
Updated
3 days ago
•
50.8k
•
4
ManniX-ITA/gemma-4-A4B-98e-v7-coder-it
20B
•
Updated
3 days ago
•
475
ManniX-ITA/gemma-4-A4B-98e-v7-coderx-NVFP4A16
11B
•
Updated
3 days ago
•
128
ManniX-ITA/gemma-4-A4B-98e-v7-coderx-it-GGUF
20B
•
Updated
3 days ago
•
33.4k
•
1
ManniX-ITA/gemma-4-A4B-98e-v7-coderx-it
20B
•
Updated
3 days ago
•
134
•
1
ManniX-ITA/gemma-4-A4B-98e-v6-coder-it-GGUF
20B
•
Updated
17 days ago
•
17k
•
3
ManniX-ITA/gemma-4-A4B-98e-v6-coder-it
20B
•
Updated
17 days ago
•
133
•
1
ManniX-ITA/Qwen3.5-4B-MicroCoder-GGUF
4B
•
Updated
May 25
•
93
•
1
ManniX-ITA/Qwen3.5-4B-MicroCoder
Image-Text-to-Text
•
5B
•
Updated
May 25
•
4
ManniX-ITA/gemma-4-A4B-98e-v5-coder-it
20B
•
Updated
May 24
•
15
•
3
ManniX-ITA/Qwen3.6-27B-Omnimerge-v4-MTP-GGUF
27B
•
Updated
May 22
•
3.69k
•
9
ManniX-ITA/Qwen3.6-27B-Omnimerge-v4
Image-Text-to-Text
•
28B
•
Updated
May 22
•
78
•
14
ManniX-ITA/Qwen3.6-27B-Omnimerge-v4-GGUF
Image-Text-to-Text
•
27B
•
Updated
May 22
•
7.22k
•
33
ManniX-ITA/gemma-4-31b-he1-it
Text Generation
•
31B
•
Updated
May 21
•
19
•
1
ManniX-ITA/gemma-4-A4B-98e-v5-coder-it-GGUF
20B
•
Updated
May 20
•
6.56k
•
3
ManniX-ITA/gemma-4-31b-he1-it-GGUF
31B
•
Updated
May 20
•
1.35k
•
2
ManniX-ITA/gemma-4-31b-he1-it-NVFP4A16
Text Generation
•
17B
•
Updated
May 19
•
4
ManniX-ITA/Gemma-4-31B-it-NVFP4A16
Text Generation
•
17B
•
Updated
May 18
•
9
ManniX-ITA/gemma-4-A4B-98e-v5-coder-NVFP4A16
Text Generation
•
11B
•
Updated
May 18
•
14
ManniX-ITA/gemma-4-A4B-98e-v4-it
20B
•
Updated
May 17
•
4
ManniX-ITA/gemma-4-A4B-98e-v4-NVFP4A16
11B
•
Updated
May 14
•
3
ManniX-ITA/Gemma-4-26B-A4B-it-NVFP4A16
14B
•
Updated
May 14
•
31
ManniX-ITA/gemma-4-A4B-98e-v3-it
20B
•
Updated
May 11
•
5
•
3
ManniX-ITA/Qwen3.6-27B-Omnimerge-v4-MLX-VL-4bit
Image-Text-to-Text
•
5B
•
Updated
May 11
•
290
•
1
ManniX-ITA/Qwen3.6-27B-Omnimerge-v4-MLX-4bit
Text Generation
•
27B
•
Updated
May 11
•
144
ManniX-ITA/gemma-4-A4B-98e-v4-it-GGUF
20B
•
Updated
May 10
•
330
ManniX-ITA/Qwen3.5-4B-M8
5B
•
Updated
May 1
•
2
•
1
ManniX-ITA/Qwen3.5-4B-M8-GGUF
4B
•
Updated
May 1
•
49
ManniX-ITA/Qwen3.5-4B-M4-v2-ex-LRP-turbo
Text Generation
•
5B
•
Updated
May 1
•
7
Previous
1
2
Next