Add model card and metadata

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +31 -1
README.md CHANGED
@@ -1 +1,31 @@
1
- arxiv.org/abs/2603.27027
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - speculative-decoding
7
+ ---
8
+
9
+ # TAPS: Task-Aware Proposal Distributions for Speculative Sampling
10
+
11
+ This repository contains a draft model introduced in the paper [TAPS: Task Aware Proposal Distributions for Speculative Sampling](https://huggingface.co/papers/2603.27027).
12
+
13
+ ## Overview
14
+
15
+ TAPS (Task-Aware Proposal Distributions) investigates how the training distribution of draft models affects the efficiency of speculative decoding. In speculative decoding, a lightweight draft model proposes future tokens that a larger target model (like Meta-Llama-3-8B-Instruct) verifies in parallel.
16
+
17
+ This model is a lightweight LLaMA-style drafter (approximately 0.8B parameters with 1 layer) designed to accelerate autoregressive generation. The research demonstrates that task-specific training (e.g., on MathInstruct or ShareGPT) yields significant specialization, and that specialized drafters are best combined at inference time using strategies like confidence-based routing or merged-tree verification.
18
+
19
+ ## Resources
20
+ - **Paper:** [TAPS: Task Aware Proposal Distributions for Speculative Sampling](https://huggingface.co/papers/2603.27027)
21
+ - **GitHub Repository:** [https://github.com/Moe-Zbeeb/TAPS](https://github.com/Moe-Zbeeb/TAPS)
22
+
23
+ ## Citation
24
+ ```bibtex
25
+ @article{zbib2026taps,
26
+ title={TAPS: Task Aware Proposal Distributions for Speculative Sampling},
27
+ author={Zbib, Mohamad and Bazzi, Mohamad and Mohanna, Ammar and Ghanem, Bernard and Hammoud, Hasan Abed Al Kader},
28
+ year={2026},
29
+ note={Technical report}
30
+ }
31
+ ```