Commit History
Delete granite-4.0-tiny-preview-iq4_xs_T3UD.gguf 18eac05 verified
Delete SmartQuant-Falcon-H1-0.5B-Instruct.gguf 04cfb42 verified
Delete llama-server-6343-cuda aad2b42 verified
Rename llama-quantize to llama-quantize-sq c7dfad7 verified
Upload granite-4.0-tiny-preview-iq4_xs_T3UD.gguf with huggingface_hub 2375290 verified
Upload llama-server-6343-cuda with huggingface_hub b02ea94 verified
Upload Tiny-Moe.Q6_K_T3.gguf with huggingface_hub 4885b4c verified
Upload SmartQuant-Falcon-H1-0.5B-Instruct.gguf with huggingface_hub 2eda48e verified
Rename Llama-3.3-70B-Instruct-SmartQuant.gguf to SmartQuant-Llama-3.3-70B-Instruct.gguf 4755934 verified
Rename granite-3.3-8b-instruct-SmartQuant.gguf to SmartQuant-granite-3.3-8b-instruct.gguf 3344ebb verified
add quantization tool 8ec229d
TobDeBer committed on
add granite-3.3-8b-instruct-SmartQuant.gguf aaed805
TobDeBer committed on
add first SmartQuant model ef482e3
TobDeBer committed on
Update README.md 60b1740 verified
Update README.md f0b7865 verified
Update README.md c6e6867 verified
track Llama-3.3-70B-Instruct-SmartQuant.gguf 9f6fb97
Tobias Bergmann committed on