bartowski 
posted an update 12 days ago
You may have noticed that my upload of MiMo-V2.5 didn't have the author in the model name:

bartowski/MiMo-V2.5-GGUF

Going forward, I plan to upload models from major first-party developers without the author name attached, for cleanliness. I feel it results in a nicer and more expected user experience.

I will continue to upload fine tunes with that author + "_" appended for clarity. I personally feel it's nice to know at a glance whose tune it is, but it's also for the reason I first started doing it: to avoid a tune being confused for a new version of the official release.
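As a rough illustration of the naming scheme, here is a minimal sketch. The helper `gguf_repo_name`, its parameters, and the fine-tune example names are hypothetical, and it assumes the author + "_" goes in front of the model name; it is not a script I actually use.

```python
def gguf_repo_name(author: str, model: str, first_party: bool,
                   uploader: str = "bartowski") -> str:
    """Build a GGUF repo id following the naming scheme described above.

    Hypothetical helper for illustration only.
    """
    # First-party releases: model name only, e.g. "MiMo-V2.5-GGUF".
    # Fine tunes: the tuner's name + "_" is placed before the model name
    # (assumed to be a prefix) so it is not mistaken for an official release.
    name = model if first_party else f"{author}_{model}"
    return f"{uploader}/{name}-GGUF"


# First-party model: author is dropped from the repo name.
print(gguf_repo_name("some-lab", "MiMo-V2.5", first_party=True))
# Fine tune (made-up names): author is kept.
print(gguf_repo_name("example-tuner", "Example-Tune-8B", first_party=False))
```

Deciding who counts as a "major first-party developer" stays a manual judgment call, which is why the sketch takes it as a plain boolean.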

I hope this change makes sense. It seemed most reasonable to me, and a poll I ran (forever ago, I move slow sometimes) suggested others would find it reasonable as well. Feel free to let me know if you disagree; it may not change my mind, but I do value knowing what others think.

Thanks for downloading :)

Thank you, but I am using a phone.

Hi Bartowski, thank you for all your GGUF work.

I know I’m just a small hobbyist, but I wanted to ask if you might consider looking at Zyphra/ZAYA1-8B. It’s a small MoE model reportedly trained on AMD hardware, and I’m very interested in testing it on edge devices.

I’m building local AI devices for my family and currently run your Gemma-4-E2B GGUFs on a Jetson Orin Nano Super Kit and a Raspberry Pi 5 16GB. The Jetson is fast enough, but the 8GB RAM limit makes small MoE models very interesting for me.

I saw a couple of existing ZAYA1 GGUF quantizations, but they don’t seem to work well yet. I wanted to try my luck and ask if this model is something you might be willing to look at someday.

Thank you very much for everything you do.

No support merged in llama.cpp yet, but I will keep an eye on this draft PR :)

https://github.com/ggml-org/llama.cpp/pull/23112