tatsu-lab/linguistic-calibration-lc-sft-wdiff
Text Generation
• 7B • Updated • 3
tatsu-lab/linguistic-calibration-factuality-sft-wdiff
Text Generation
• 7B • Updated • 3
tatsu-lab/linguistic-calibration-claude-distill-wdiff
Text Generation
• 7B • Updated • 1
tatsu-lab/linguistic-calibration-extract-answers
Text Generation
• 3B • Updated • 2
tatsu-lab/linguistic-calibration-lc-rl-wdiff
Text Generation
• 7B • Updated tatsu-lab/linguistic-calibration-factuality-rl-wdiff
Text Generation
• 7B • Updated • 3
tatsu-lab/linguistic-calibration-reward-model-forecastprobs-wdiff
7B • Updated • 2
tatsu-lab/linguistic-calibration-reward-model-factuality-wdiff
7B • Updated • 3
tatsu-lab/alpaca-farm-ppo-human-wdiff
Text Generation
• Updated • 21
• 1
tatsu-lab/alpaca-farm-expiter-human-wdiff
Text Generation
• Updated • 6
tatsu-lab/alpaca-farm-ppo-sim-gpt4-20k-wdiff
Text Generation
• Updated • 18
tatsu-lab/alpaca-farm-ppo-sim-wdiff
Text Generation
• Updated • 1
tatsu-lab/alpaca-farm-reward-model-human-wdiff
Updated • 16
• 1
tatsu-lab/alpaca-farm-feedme-sim-wdiff
Text Generation
• Updated • 2
tatsu-lab/alpaca-farm-feedme-human-wdiff
Text Generation
• Updated • 1
tatsu-lab/alpaca-farm-reward-condition-sim-wdiff
Text Generation
• Updated • 3
tatsu-lab/alpaca-farm-reward-model-sim-wdiff
Updated
tatsu-lab/alpaca-farm-expiter-sim-wdiff
Text Generation
• Updated • 1
tatsu-lab/alpaca-farm-sft10k-wdiff
Text Generation
• Updated • 17
tatsu-lab/alpaca-7b-wdiff
Text Generation
• Updated • 145
• 58