Commit History

Document 128K context and KV cache quantization defaults
5717969
verified

AlexandreScriptsMT committed on

Enable q4 KV cache and 128K context defaults
64daee6
verified

AlexandreScriptsMT committed on

Use absolute llama-server path in entrypoint
6ca36c7
verified

AlexandreScriptsMT committed on

Simplify Dockerfile for llama.cpp base image
6280c2f
verified

AlexandreScriptsMT committed on

Add README for Gemma 4 CPU Basic API Space
12010f8
verified

AlexandreScriptsMT committed on

Add runtime entrypoint for Gemma 4 API
43aaf2b
verified

AlexandreScriptsMT committed on

Add Dockerfile for llama.cpp server Space
fde826b
verified

AlexandreScriptsMT committed on