Post-Training Preference-Optimization Reasoning 2025-01-23
Think Less, Achieve More: Cut Reasoning Costs by 50% Without Sacrificing Accuracy
We introduce Sky-T1-32B-Flash, our reasoning model that cuts generation length by up to 50% while maintaining accuracy.