Instructions to use tensorblock/llama-3-sqlcoder-8b-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use tensorblock/llama-3-sqlcoder-8b-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="tensorblock/llama-3-sqlcoder-8b-GGUF", filename="llama-3-sqlcoder-8b-Q2_K.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use tensorblock/llama-3-sqlcoder-8b-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K # Run inference directly in the terminal: llama-cli -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K # Run inference directly in the terminal: ./llama-cli -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K # Run inference directly in the terminal: ./build/bin/llama-cli -hf tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
Use Docker
docker model run hf.co/tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
- LM Studio
- Jan
- vLLM
How to use tensorblock/llama-3-sqlcoder-8b-GGUF with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "tensorblock/llama-3-sqlcoder-8b-GGUF" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "tensorblock/llama-3-sqlcoder-8b-GGUF", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
- Ollama
How to use tensorblock/llama-3-sqlcoder-8b-GGUF with Ollama:
ollama run hf.co/tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
- Unsloth Studio new
How to use tensorblock/llama-3-sqlcoder-8b-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/llama-3-sqlcoder-8b-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for tensorblock/llama-3-sqlcoder-8b-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for tensorblock/llama-3-sqlcoder-8b-GGUF to start chatting
- Docker Model Runner
How to use tensorblock/llama-3-sqlcoder-8b-GGUF with Docker Model Runner:
docker model run hf.co/tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
- Lemonade
How to use tensorblock/llama-3-sqlcoder-8b-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull tensorblock/llama-3-sqlcoder-8b-GGUF:Q2_K
Run and chat with the model
lemonade run user.llama-3-sqlcoder-8b-GGUF-Q2_K
List all available models
lemonade list
Keep Q2_K/Q3_K_M gguf only
Browse files- llama-3-sqlcoder-8b-Q3_K_L.gguf +0 -3
- llama-3-sqlcoder-8b-Q3_K_S.gguf +0 -3
- llama-3-sqlcoder-8b-Q4_0.gguf +0 -3
- llama-3-sqlcoder-8b-Q4_K_M.gguf +0 -3
- llama-3-sqlcoder-8b-Q4_K_S.gguf +0 -3
- llama-3-sqlcoder-8b-Q5_0.gguf +0 -3
- llama-3-sqlcoder-8b-Q5_K_M.gguf +0 -3
- llama-3-sqlcoder-8b-Q5_K_S.gguf +0 -3
- llama-3-sqlcoder-8b-Q6_K.gguf +0 -3
- llama-3-sqlcoder-8b-Q8_0.gguf +0 -3
llama-3-sqlcoder-8b-Q3_K_L.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:174d7cf3770d105a5a3c8e3605ddbf4de158e700392f63d4270e4a35d5fee3f8
|
| 3 |
-
size 4321956448
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q3_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:49d6942e090c01f8270b3ff024fcf32c031f34033b2746441b957ff4fdf8ae34
|
| 3 |
-
size 3664499296
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q4_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:e9e23b617a98cd1aa387a9d36c90cde368ac0b69852efec301b7aefc3b00246e
|
| 3 |
-
size 4661211744
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q4_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:13f58b3f0d1df594b3aba28760f4a509785c92e7c7644cfa5e30ecc4ae91e7f2
|
| 3 |
-
size 4920734304
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q4_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:66cfd356b183283d0a675477affafba4c56a714214312e2284aeee8b959cca69
|
| 3 |
-
size 4692669024
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q5_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:89e923f6d2b4aab4562ebb214334b77943d6c1ee8b6e63b24d3316abf51d676e
|
| 3 |
-
size 5599294048
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q5_K_M.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:f24016ce0f902119b45eba5881adc280a3be2c731ef43099b397a6b48a0a385d
|
| 3 |
-
size 5732987488
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q5_K_S.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:747fe72f620ca4b0fa377119b2edb2e6f51e06a690e1879385249564f0f8d836
|
| 3 |
-
size 5599294048
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q6_K.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:ea04eb472b0d36b65c47e5837c27ddc374fb4de43853c17d9bb450eaa7794490
|
| 3 |
-
size 6596006496
|
|
|
|
|
|
|
|
|
|
|
|
llama-3-sqlcoder-8b-Q8_0.gguf
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:9c9bd736b425a930a2d4bea2cf90df958fbac368584e49d0ad8a7f088fb3333b
|
| 3 |
-
size 8540770912
|
|
|
|
|
|
|
|
|
|
|
|