NovaSky
Blog Posts
Post-Training Reasoning
2025-02-21
S*: Test-Time Scaling for Code Generation
Post-Training Reinforcement Learning Distillation Reasoning
2025-02-13
Unlocking the Potential of Reinforcement Learning in Improving Reasoning Models
Post-Training Preference-Optimization Reasoning
2025-01-23
Think Less, Achieve More: Cut Reasoning Costs by 50% Without Sacrificing Accuracy
Post-Training Distillation
2025-01-10
Sky-T1: Train your own O1 preview model within $450