--- license: apache-2.0 tags: - pruned - html - optimized - wanda base_model: LiquidAI/LFM2.5-1.2B-Instruct pipeline_tag: text-generation --- # LFM2.5-1.2B-Instruct-html-aggressive > **HTML-optimized** | **Aggressive** pruning | **35% weights pruned** This model is a **aggressively pruned** version of [LiquidAI/LFM2.5-1.2B-Instruct](https://huggingface.co/LiquidAI/LFM2.5-1.2B-Instruct). > **Pruning Alert:** The benchmarks show virtually NO quality drop! This isn't a bug -- it is a feature. The Wanda pruning algorithm is so effective at identifying unimportant weights that it can remove a large percentage of parameters without affecting performance. Think of it like pruning dead leaves from a tree -- the tree does not miss them because they were not doing anything anyway! ## Performance Comparison | Category | Original | Pruned | Change | |----------|----------|--------|--------| | Python | 0.0% | 0.0% | → | | **Html** | 10.0% | 10.0% ⭐ | → | | Trivia | 60.0% | 60.0% | → | | Math | 45.0% | 50.0% | ↑ 5.0% | | Reasoning | 20.0% | 20.0% | → | | Medical | 35.0% | 35.0% | → | | Linux | 0.0% | 0.0% | → | | Writing | 40.0% | 40.0% | → | **Average**: 26.2% -> 26.9% (+0.6%) **Html Retention**: 100.0% ![Comparison Graph](comparison_graph.png) ## Quick Start ```python from transformers import AutoModelForCausalLM, AutoTokenizer model = AutoModelForCausalLM.from_pretrained("CompactAI/LFM2.5-1.2B-Instruct-html-aggressive") tokenizer = AutoTokenizer.from_pretrained("CompactAI/LFM2.5-1.2B-Instruct-html-aggressive") inputs = tokenizer("Your prompt here", return_tensors="pt") outputs = model.generate(**inputs, max_new_tokens=100) print(tokenizer.decode(outputs[0], skip_special_tokens=True)) ``` ## Technical Details | Property | Value | |----------|-------| | Base Model | [LiquidAI/LFM2.5-1.2B-Instruct](https://huggingface.co/LiquidAI/LFM2.5-1.2B-Instruct) | | Specialization | Html | | Prune Mode | Aggressive | | Weight Reduction | 35% weights pruned | ## License This model inherits the license from the base model.