3morixd commited on
Commit
141d16d
·
verified ·
1 Parent(s): caf05e1

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +38 -20
README.md CHANGED
@@ -14,38 +14,56 @@ pipeline_tag: text-generation
14
 
15
  # MiniCPM5-1B-mobile
16
 
17
- ✅ **WORKS** — Verified June 2026.
18
 
19
- ## Verification Results
20
 
21
- | Prompt | Response | Correct? |
22
- |--------|----------|----------|
23
- | The capital of France is | "the city of Paris, which is located in the Île-de-Cracome, a" | ✅ |
 
 
 
 
24
 
 
25
 
26
- ## Note
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
 
28
- Use raw completion (no chat format). Best for text continuation.
 
 
 
 
 
29
 
30
  ## Model Details
31
 
32
  | Attribute | Value |
33
  |-----------|-------|
34
- | **Base Model** | openbmb/MiniCPM3-4B |
35
  | **File Size** | 656 MB |
36
  | **Format** | GGUF |
37
- | **Chat Format** | Raw completion (no chat template) |
38
- | **CPU Speed** | 18.1 tokens/sec |
39
  | **License** | apache-2.0 |
40
 
41
- ## Usage
42
-
43
- ```python
44
- from llama_cpp import Llama
45
-
46
- llm = Llama(model_path="model.gguf", n_ctx=512, n_threads=4, verbose=False)
47
- response = llm("The capital of France is", max_tokens=30, echo=False)
48
- print(response["choices"][0]["text"])
49
- ```
50
 
51
- 🚀 [dispatchAI](https://huggingface.co/dispatchAI)
 
14
 
15
  # MiniCPM5-1B-mobile
16
 
17
+ ✅ **Verified on real phone hardware** — Snapdragon 865, June 2026.
18
 
19
+ ## Phone Benchmark (Samsung S20 FE, Snapdragon 865)
20
 
21
+ | Metric | Value |
22
+ |--------|-------|
23
+ | **Phone Speed** | **27.9 tokens/sec** |
24
+ | **CPU Speed** | 18.1 tokens/sec |
25
+ | **File Size** | 656 MB |
26
+ | **Chat Format** | None |
27
+ | **Test Output** | "Paris" ✅ (correct) |
28
 
29
+ ## Usage
30
 
31
+ ### Python (llama-cpp-python)
32
+ ```python
33
+ from llama_cpp import Llama
34
+
35
+ llm = Llama(model_path="model.gguf", chat_format="None", n_ctx=512, n_threads=4, verbose=False)
36
+ response = llm.create_chat_completion(
37
+ messages=[{"role": "user", "content": "What is the capital of France?"}],
38
+ max_tokens=50,
39
+ )
40
+ print(response["choices"][0]["message"]["content"])
41
+ ```
42
+
43
+ ### dispatchAI SDK
44
+ ```python
45
+ from dispatchai import load_model
46
+ model = load_model("MiniCPM5-1B-mobile", backend="gguf")
47
+ print(model.chat("What is the capital of France?"))
48
+ ```
49
 
50
+ ### On Android (via ADB)
51
+ ```bash
52
+ hf download dispatchAI/MiniCPM5-1B-mobile model.gguf
53
+ MSYS_NO_PATHCONV=1 adb push model.gguf /data/local/tmp/
54
+ MSYS_NO_PATHCONV=1 adb shell "cd /data/local/tmp && LD_LIBRARY_PATH=/data/local/tmp ./llama-cli -m model.gguf -p 'Hello' -n 30 -t 4 -st"
55
+ ```
56
 
57
  ## Model Details
58
 
59
  | Attribute | Value |
60
  |-----------|-------|
61
+ | **Base Model** | openbmb/MiniCPM-V-4 |
62
  | **File Size** | 656 MB |
63
  | **Format** | GGUF |
64
+ | **Chat Format** | None |
 
65
  | **License** | apache-2.0 |
66
 
67
+ ## About dispatchAI
 
 
 
 
 
 
 
 
68
 
69
+ [dispatchAI](https://huggingface.co/dispatchAI) — Small. Mobile. Free. UAE-built.