Enable q4 KV cache and 128K context defaults 64daee6 verified AlexandreScriptsMT commited on 15 days ago
Use absolute llama-server path in entrypoint 6ca36c7 verified AlexandreScriptsMT commited on 15 days ago