Seems broken on Mac Studio M1

#2
by tarruda - opened

I cloned and built https://github.com/stepfun-ai/Step-3.5-Flash according to instructions:

git clone git@github.com:stepfun-ai/Step-3.5-Flash.git
(
cd Step-3.5-Flash
cmake -S . -B build-macos \      
  -DCMAKE_BUILD_TYPE=Release \
  -DGGML_METAL=ON \
  -DGGML_ACCELERATE=ON \
  -DLLAMA_BUILD_EXAMPLES=ON \
  -DLLAMA_BUILD_COMMON=ON \
  -DGGML_LTO=ON && \
cmake --build build-macos -j8
)

Then I ran with the concatenated GGUF from the parts in this repo:

./Step-3.5-Flash/llama.cpp/build-macos/bin/llama-cli -m step3p5_flash_Q4_K_S.gguf -c 16384 -b 2048 -ub 2048 -fa on --temp 1.0 -p "What's your name?"

The result is that it keeps repeating the token = forever:

image

Weight sha256sum:

sha256sum step3p5_flash_Q4_K_S.gguf
cd3442c4e4c17322e6a432c0db1ec6383029f4f28c94e25c6f0b773b967b82db  step3p5_flash_Q4_K_S.gguf

merge might be corrupted i had similar error but its got fixed after re download

You're right. After comparing the sha256 of two parts I got mismatches. Will re-download, thanks.

tarruda changed discussion status to closed

Sign up or log in to comment