Talks AWS re:Invent 2025 - Customize & scale foundation models using Amazon SageMaker AI (AIM363) VIDEO
AWS re:Invent 2025 - Customize & scale foundation models using Amazon SageMaker AI (AIM363) Customizing and Scaling Foundation Models with Amazon SageMaker AI
Challenges with Foundation Models in Production
Demos of foundation models look impressive, but production requirements are much more demanding
Production needs:
Accurate models that can be defended
Cost-effective economics that work at scale
Teams often face a false choice between accuracy and cost when using foundation models
The Power of Model Customization
Foundation models are powerful generalists, but lack domain-specific knowledge
Customizing models with your data and business rules can significantly improve accuracy and reduce costs
Customization encodes your domain expertise directly into the model's weights
Overcoming the Customization Tax
Customizing models has traditionally been complex and time-consuming
SageMaker AI solves this with three key principles:
Serverless infrastructure for training and deployment
Integrated workflow from experimentation to production
Production-ready customization techniques
Customization Techniques in SageMaker AI
Supervised Fine-Tuning (SFT) : Teaches the model your domain knowledge through labeled examples
Direct Preference Optimization (DPO) : Teaches the model your judgment and style by showing it preferred responses
Reinforcement Learning with AI Feedback (RLaif) : Trains the model to achieve complex, subjective outcomes through feedback
Customization Workflows in SageMaker Studio
Three customization experiences in SageMaker Studio:
Visual UI-based workflows
AI agent-guided experiences
Code-based SDK for programmatic control
All workflows use the same underlying service and provide a unified audit trail
New SageMaker AI Capabilities
Serverless Model Evaluation : Easily evaluate customized models against benchmarks or custom criteria
Serverless MLflow : Automatically track metrics, hyperparameters, and lineage for all model customization experiments
Compute Options
Serverless Training : Rapid iteration and experimentation without infrastructure management
SageMaker Hyperod : Persistent cluster for long-running, frontier-scale model training with fault tolerance
Key Takeaways
Choice : Flexibility to use open-source or proprietary models, and choose from multiple customization techniques
Efficiency : Serverless infrastructure and integrated workflows dramatically reduce the "customization tax"
Safety : Built-in governance, lineage, and security for enterprise-grade model customization
Business Impact
Customization allows you to turn your proprietary data into a competitive advantage
Improved accuracy and reduced costs enable new AI-powered use cases in production
Examples:
Transaction categorization with 92% accuracy instead of 8% miscategorization
Conversational AI that sounds more human-like and aligned with your brand
Your Digital Journey deserves a great story. Build one with us.