CompactAI
/

EXAONE-4.0-1.2B-python-aggressive

Text Generation

activation-pruning

Model card Files Files and versions

EXAONE-4.0-1.2B-python-aggressive / README.md

CompactAI's picture

Upload folder using huggingface_hub

82efaf1 verified about 9 hours ago

|

history blame contribute delete

2.58 kB

	---
	license: apache-2.0
	tags:
	- pruned
	- python
	- optimized
	- wanda
	- activation-pruning
	base_model: LGAI-EXAONE/EXAONE-4.0-1.2B
	pipeline_tag: text-generation
	---

	# EXAONE-4.0-1.2B-python-aggressive

	> 🎯 PYTHON-optimized \| 📦 Aggressive pruning \| ⚡ 7% weights pruned

	This model is a aggressively pruned version of [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B), specialized for PYTHON tasks using activation-aware weight pruning (Wanda-style).

	## ✨ Key Features

	- Specialization: Optimized for Python tasks
	- Pruning Method: Wanda-style (\|W\| × \|activation\|) importance scoring
	- Size Reduction: 7% weights pruned
	- Use Case: Maximum compression for edge deployment

	## 📊 Performance Comparison

	\| Category \| Original \| Pruned \| Change \|
	\|----------\|----------\|--------\|--------\|
	\| Python \| 20.0% \| 20.0% ⭐ \| → \|
	\| Html \| 6.7% \| 6.7% \| → \|
	\| Trivia \| 26.7% \| 53.3% \| ↑ 26.7% \|
	\| Math \| 60.0% \| 53.3% \| ↓ 6.7% \|
	\| Reasoning \| 60.0% \| 73.3% \| ↑ 13.3% \|
	\| Medical \| 73.3% \| 80.0% \| ↑ 6.7% \|
	\| Linux \| 93.3% \| 93.3% \| → \|
	\| Writing \| 60.0% \| 53.3% \| ↓ 6.7% \|

	Average: 50.0% → 54.2% (+4.2%)

	Python Retention: 100.0% of original performance

	![Comparison Graph](comparison_graph.png)

	## 🚀 Quick Start

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer

	model = AutoModelForCausalLM.from_pretrained("CompactAI/EXAONE-4.0-1.2B-python-aggressive")
	tokenizer = AutoTokenizer.from_pretrained("CompactAI/EXAONE-4.0-1.2B-python-aggressive")

	# Example usage
	inputs = tokenizer("Your prompt here", return_tensors="pt")
	outputs = model.generate(**inputs, max_new_tokens=100)
	print(tokenizer.decode(outputs[0], skip_special_tokens=True))
	```

	## 📋 Technical Details

	\| Property \| Value \|
	\|----------\|-------\|
	\| Base Model \| [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B) \|
	\| Specialization \| Python \|
	\| Prune Mode \| Aggressive \|
	\| Pruning Method \| Activation-based weight pruning (Wanda) \|
	\| Weight Reduction \| 7% weights pruned \|

	## 🔗 Related Models

	This model is part of the EXAONE-4.0-1.2B pruned model collection. Variants:
	- Safe - Conservative pruning (~10-20%), high accuracy retention
	- Aggressive - Maximum compression (~40-50%), best for edge deployment

	## 📜 License

	This model inherits the license from the base model [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B).

	---
	Generated by ZANNPS [Zeto Automatic Neural Network Pruning System]