What Makes SFT Data Useful
SFT data must show the model what good answers look like. It should include clear prompts, reliable answers, and enough domain variety to generalize.
- Instruction-response pairs
- Reasoning and explanation traces
- Domain-specific answer formats