TableMind Reinforced Model Weights
These are the official reinforcement learning (RL) fine-tuned model checkpoints for the paper: "TableMind: An Autonomous Programmatic Agent for Tool-Augmented Table Reasoning".
π¦ Model Details
- Base Model: Qwen3-8B
- Tuning Framework: Verl + LLaMA Factory
This model follows the standard Hugging Face transformers format and uses the efficient safetensors backend.
Time-R1/
βββ added_tokens.json
βββ config.json
βββ generation_config.json
βββ merges.txt
βββ model-00001-of-00004.safetensors
βββ model-00002-of-00004.safetensors
βββ model-00003-of-00004.safetensors
βββ model-00004-of-00004.safetensors
βββ model.safetensors.index.json
βββ special_tokens_map.json
βββ tokenizer.json
βββ tokenizer_config.json
βββ vocab.json
- Downloads last month
- 80
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support