TalksAWS re:Invent 2025 - Build a well-architected foundation for scaling generative AI and agentic apps
AWS re:Invent 2025 - Build a well-architected foundation for scaling generative AI and agentic apps
Building a Well-Architected Foundation for Scaling Generative AI and Agentic Applications
Overview
This presentation from AWS re:Invent 2025 discusses the challenges and complexities of taking generative AI and agentic applications into production at scale. The speakers outline a comprehensive foundation for building these applications, covering key components like gateways, orchestration, observability, and security.
Challenges of Generative AI and Agentic Applications
Complexity Beyond Simple Chatbots
Generative AI and agentic applications involve complex orchestration, agents, and safeguards - not just simple language models.
Scaling these applications to production introduces new challenges around performance, scalability, security, and context management.
Key Challenges
Performance and Latency: Agents need to reason, plan, and act with low latency over extended sessions.
Scalability: Deploying thousands of agents requires purpose-built orchestration and infrastructure.
Security and Isolation: Agents need secure access to sensitive data and tools, with proper controls in place.
These cookies are used to collect information about how you interact with this website and allow us to remember you. We use this information to improve and customize your browsing experience, as well as for analytics.
If you decline, your information won’t be tracked when you visit this website. A single cookie will be used in your browser to remember your preference.