Spaces:
Running
Running
| title: README | |
| emoji: 🚀 | |
| colorFrom: blue | |
| colorTo: gray | |
| sdk: static | |
| pinned: true | |
| # 🐙 Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations | |
| Project Page: [https://mathoctopus.github.io/](https://mathoctopus.github.io/) | |
| Paper: [https://arxiv.org/abs/2310.20246.pdf](https://arxiv.org/abs/2310.20246.pdf) | |
| Code: [https://github.com/microsoft/MathOctopus](https://github.com/microsoft/MathOctopus) | |
| ## About Mathoctopus | |
| Mathoctopus is a series of multilingual math reasoning large language models based on LLaMA. | |
| This work pioneers exploring and building powerful Multilingual Math Reasoning (xMR) LLMs. To accomplish this, we make the following works: | |
| - **MGSM8KInstruct**, the multilingual math reasoning instruction dataset, encompassing ten distinct languages, thus addressing the issue of training data scarcity in xMR tasks. | |
| - **MSVAMP**, an out-of-domain xMR test dataset, to conduct a more exhaustive and comprehensive evaluation of the model’s multilingual mathematical capabilities. | |
| - **MathOctopus**, our effective Multilingual Math Reasoning LLMs, training with different strategies, which notably outperform conventional open-source LLMs and exhibit superiority over ChatGPT in few-shot scenarios. | |