codezakh/gpu-forecasters-gpt-oss-20b-correctness


codezakh committed 2 days ago

Commit b41f33a (verified) · 1 parent: b62c268

Add model card.

Files changed (1)

README.md +12 -1


@@ -1,4 +1,4 @@
-**Companion artifact for [_GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization_](https://github.com/codezakh/gpu-surrogates).**
+**Companion artifact for [_GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization_](https://arxiv.org/abs/2605.31464)** ([PDF](https://arxiv.org/pdf/2605.31464), [code](https://github.com/codezakh/gpu-surrogates)).
 LoRA adapter for [`openai/gpt-oss-20b`](https://huggingface.co/openai/gpt-oss-20b)
 fine-tuned with the correctness reward to forecast kernel speedups.
@@ -18,3 +18,14 @@ model = AutoPeftModelForCausalLM.from_pretrained("codezakh/gpu-forecasters-gpt-o
 See `runbook/02_train_surrogate.py` in the paper repo
 ([codezakh/gpu-surrogates](https://github.com/codezakh/gpu-surrogates)).
+## Citation
+```bibtex
+@article{khan2026gpuforecasters,
+  title={GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization},
+  author={Khan, Zaid and Chen, Justin Chih-Yao and Cho, Jaemin and Stengel-Eskin, Elias and Bansal, Mohit},
+  journal={arXiv preprint arXiv:2605.31464},
+  year={2026}
+}
+```