| | --- |
| | license: apache-2.0 |
| | --- |
| | |
| | # Tiger Model Card |
| |
|
| | ## Model details |
| |
|
| | Tactic-guided reasoner (Tiger) is a language model that solves *reasoning in the wild* task proposed in paper [Can LLMs Reason in the Wild with Programs](https://arxiv.org/abs/2406.13764). |
| | It is trained by fine-tuning the LLaMA3-8B model on the [ReWild](https://huggingface.co/datasets/yuan-yang/ReWild) dataset. |
| |
|
| | **Model type:** |
| | This repo contains the LoRA delta weights for `Tiger-Routing-8B` |
| |
|
| | We also provide the delta weights of other versions: |
| | - [Tiger-Routing-8B](https://huggingface.co/yuan-yang/Tiger-Routing-8B/) |
| | - [Tiger-PJ-8B](https://huggingface.co/yuan-yang/Tiger-PJ-8B) |
| | - [Tiger-IPJ-8B](https://huggingface.co/yuan-yang/Tiger-IPJ-8B) |
| |
|
| | **License:** |
| | Apache License 2.0 |
| |
|
| | ## Using the model |
| |
|
| | Check out how to use the model on our project page: https://github.com/gblackout/Reason-in-the-Wild/ |
| |
|
| |
|
| | **Primary intended uses:** |
| | Tiger is intended to be used for research. |
| |
|
| |
|
| | ## Citation |
| |
|
| | ``` |
| | @article{yang2024can, |
| | title={Can LLMs Reason in the Wild with Programs?}, |
| | author={Yang, Yuan and Xiong, Siheng and Payani, Ali and Shareghi, Ehsan and Fekri, Faramarz}, |
| | journal={arXiv preprint arXiv:2406.13764}, |
| | year={2024} |
| | } |
| | ``` |