Add model card.
Browse files
README.md
CHANGED
|
@@ -1,4 +1,4 @@
|
|
| 1 |
-
**Companion artifact for [_GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization_](https://github.com/codezakh/gpu-surrogates).
|
| 2 |
|
| 3 |
LoRA adapter for [`openai/gpt-oss-20b`](https://huggingface.co/openai/gpt-oss-20b)
|
| 4 |
fine-tuned with the correctness reward to forecast kernel speedups.
|
|
@@ -18,3 +18,14 @@ model = AutoPeftModelForCausalLM.from_pretrained("codezakh/gpu-forecasters-gpt-o
|
|
| 18 |
|
| 19 |
See `runbook/02_train_surrogate.py` in the paper repo
|
| 20 |
([codezakh/gpu-surrogates](https://github.com/codezakh/gpu-surrogates)).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
**Companion artifact for [_GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization_](https://arxiv.org/abs/2605.31464)** ([PDF](https://arxiv.org/pdf/2605.31464), [code](https://github.com/codezakh/gpu-surrogates)).
|
| 2 |
|
| 3 |
LoRA adapter for [`openai/gpt-oss-20b`](https://huggingface.co/openai/gpt-oss-20b)
|
| 4 |
fine-tuned with the correctness reward to forecast kernel speedups.
|
|
|
|
| 18 |
|
| 19 |
See `runbook/02_train_surrogate.py` in the paper repo
|
| 20 |
([codezakh/gpu-surrogates](https://github.com/codezakh/gpu-surrogates)).
|
| 21 |
+
|
| 22 |
+
## Citation
|
| 23 |
+
|
| 24 |
+
```bibtex
|
| 25 |
+
@article{khan2026gpuforecasters,
|
| 26 |
+
title={GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization},
|
| 27 |
+
author={Khan, Zaid and Chen, Justin Chih-Yao and Cho, Jaemin and Stengel-Eskin, Elias and Bansal, Mohit},
|
| 28 |
+
journal={arXiv preprint arXiv:2605.31464},
|
| 29 |
+
year={2026}
|
| 30 |
+
}
|
| 31 |
+
```
|