CompactAI
/

TinyLlama-1.1B-Chat-v1.0-python-aggressive

Text Generation

Model card Files Files and versions

TinyLlama-1.1B-Chat-v1.0-python-aggressive / README.md

CompactAI's picture

Upload folder using huggingface_hub

e6d06db verified 6 days ago

|

history blame contribute delete

1.7 kB

	---
	license: apache-2.0
	tags:
	- pruned
	- python
	- optimized
	- wanda
	base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
	pipeline_tag: text-generation
	---

	# TinyLlama-1.1B-Chat-v1.0-python-aggressive

	> 🎯 PYTHON-optimized \| 📦 Aggressive pruning \| ⚡ 1% weights pruned

	This model is a aggressively pruned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0).

	## Performance Comparison

	\| Category \| Original \| Pruned \| Change \|
	\|----------\|----------\|--------\|--------\|
	\| Python \| 0.0% \| 0.0% ⭐ \| → \|
	\| Html \| 0.0% \| 0.0% \| → \|
	\| Trivia \| 35.7% \| 35.7% \| → \|
	\| Math \| 0.0% \| 0.0% \| → \|
	\| Reasoning \| 0.0% \| 0.0% \| → \|
	\| Medical \| 50.0% \| 50.0% \| → \|
	\| Linux \| 80.0% \| 80.0% \| → \|
	\| Writing \| 16.7% \| 16.7% \| → \|

	Average: 22.8% → 22.8% (+0.0%)



	![Comparison Graph](comparison_graph.png)

	## Quick Start

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer

	model = AutoModelForCausalLM.from_pretrained("CompactAI/TinyLlama-1.1B-Chat-v1.0-python-aggressive")
	tokenizer = AutoTokenizer.from_pretrained("CompactAI/TinyLlama-1.1B-Chat-v1.0-python-aggressive")

	inputs = tokenizer("Your prompt here", return_tensors="pt")
	outputs = model.generate(**inputs, max_new_tokens=100)
	print(tokenizer.decode(outputs[0], skip_special_tokens=True))
	```

	## Technical Details

	\| Property \| Value \|
	\|----------\|-------\|
	\| Base Model \| [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) \|
	\| Specialization \| Python \|
	\| Prune Mode \| Aggressive \|
	\| Weight Reduction \| 1% weights pruned \|

	## License

	This model inherits the license from the base model.