APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique (https://github.com/mudler/apex-quant) • 36 items • Updated 8 days ago • 110
REAM Collection Compressed MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 12 items • Updated Apr 20 • 6
Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 143