tatsu-lab/linguistic-calibration-lc-sft-wdiff • Text Generation • 7B • 2
tatsu-lab/linguistic-calibration-factuality-sft-wdiff • Text Generation • 7B • 2
tatsu-lab/linguistic-calibration-claude-distill-wdiff • Text Generation • 7B • 6
tatsu-lab/linguistic-calibration-extract-answers • Text Generation • 3B • 5
tatsu-lab/linguistic-calibration-lc-rl-wdiff • Text Generation • 7B • 5
tatsu-lab/linguistic-calibration-factuality-rl-wdiff • Text Generation • 7B • 2
tatsu-lab/linguistic-calibration-reward-model-forecastprobs-wdiff • 7B • 4
tatsu-lab/linguistic-calibration-reward-model-factuality-wdiff • 7B • 5
tatsu-lab/alpaca-farm-ppo-human-wdiff • Text Generation • 41 • 1
tatsu-lab/alpaca-farm-expiter-human-wdiff • Text Generation • 6
tatsu-lab/alpaca-farm-ppo-sim-gpt4-20k-wdiff • Text Generation • 22
tatsu-lab/alpaca-farm-ppo-sim-wdiff • Text Generation • 10
tatsu-lab/alpaca-farm-reward-model-human-wdiff • 9 • 1
tatsu-lab/alpaca-farm-feedme-sim-wdiff • Text Generation • 4
tatsu-lab/alpaca-farm-feedme-human-wdiff • Text Generation • 5
tatsu-lab/alpaca-farm-reward-condition-sim-wdiff • Text Generation • 4
tatsu-lab/alpaca-farm-reward-model-sim-wdiff
tatsu-lab/alpaca-farm-expiter-sim-wdiff • Text Generation • 5
tatsu-lab/alpaca-farm-sft10k-wdiff • Text Generation • 6
tatsu-lab/alpaca-7b-wdiff • Text Generation • 927 • 57