DOCAWS Machine Learning Blog·6d ago
Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI
This post explains how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) to enhance the tool-calling accuracy of small language models. It details how to leverage Amazon SageMaker AI training jobs to focus on training code and evaluate model quality.
28