assemsabry
/

flash

Model card Files Files and versions

flash / README.md

assemsabry's picture

Add README

0c8fb25 verified 2 days ago

|

history blame contribute delete

719 Bytes

	---
	tags:
	- gguf
	- llama.cpp
	- unsloth

	---

	# flash : GGUF

	This model was finetuned and converted to GGUF format using [Unsloth](https://github.com/unslothai/unsloth).

	Example usage:
	- For text only LLMs: `llama-cli -hf assemsabry/flash --jinja`
	- For multimodal models: `llama-mtmd-cli -hf assemsabry/flash --jinja`

	## Available Model files:
	- `Llama-3.1-Minitron-4B-Width-Base.F16.gguf`

	## Note
	The model's BOS token behavior was adjusted for GGUF compatibility.
	This was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth)
	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)