AWS re:Invent 2025 - Observability for Reliable Agentic AI with Strands SDK & OpenTelemetry (NTA406)

Observability for Reliable Agentic AI with Strands SDK & OpenTelemetry

Challenges with Deploying AI Agents in Production

Agents can become a "black box" with limited visibility into their behavior and decision-making

Issues with model evolution, contradictory prompts, and integration with external tools

Difficulty identifying why agents are making specific decisions, especially in multi-agent architectures

Leveraging AWS Agent Core for Reliable Agent Deployment

Agent Core provides a modular, component-based approach to building and hosting AI agents

Reduces time-to-market by handling infrastructure, hosting, and integration with native/third-party services

Enables the use of different agent frameworks (e.g., Strands) and observability tools (e.g., OpenTelemetry)

Implementing a Multi-Agent Financial Advisor

Developed a financial advisor agent using the Strands SDK and running on AWS Agent Core

Comprised of several specialized agents (rate checker, loan calculator, eligibility checker) orchestrated by a central agent

Agents integrated with external knowledge bases to provide personalized financial recommendations

Observability Challenges and Improvements

Latency Issues:

Initial agent response times were slow (13 seconds)
Identified high "temperature" parameter causing the model to be overly creative
Reduced temperature and max tokens to make the model more constrained
Upgraded to newer language models (Haiku 4.5) to improve performance

Multilingual Support:

Encountered issues with Spanish language prompts due to parameter misconfiguration
Leveraged AWS Bedrock Prompt Management to store and version language-specific prompts
Deployed multiple agent endpoints (e.g., Spanish, English) to handle different user preferences

Knowledge Base Integration:

Discovered that the knowledge base data was outdated, leading to inaccurate recommendations
Investigated the observability traces to identify the knowledge base retrieval process
Configured the observability sampling rate to 100% to ensure all relevant data was captured

Optimizing Multi-Agent Architectures

The initial architecture used a "swarm" of interconnected agents, leading to a complex full-mesh communication pattern

Explored simplifying the architecture to a "workflow orchestrator" or "principal agent with subordinates" model

This can reduce the number of agent-to-agent hops and improve overall response times

Key Observability Metrics and Features

Latency, error rates, token usage, and other runtime-specific metrics available in AWS Agent Core Observability

Ability to trace individual user sessions and understand the flow of agent interactions

Option to send logs, metrics, and traces to external observability tools using OpenTelemetry

Business Impact and Real-World Applications

Improved observability and visibility into AI agent behavior can help organizations deploy reliable, production-ready agents

Ability to quickly identify and resolve issues like latency, prompt conflicts, and outdated knowledge bases

Optimized multi-agent architectures can enhance performance and cost-efficiency for complex AI-powered applications

Demonstrated use case in the financial services industry, but applicable to any domain leveraging agentic AI

Resources and Next Steps

AWS Agent Core Toolkit for simplified agent deployment and management

Code samples for integrating other agent frameworks and observability tools with Agent Core

AWS re:Invent workshop on "Deep Dive on AWS Agent Core"

AWS re:Invent 2025 - Observability for Reliable Agentic AI with Strands SDK & OpenTelemetry (NTA406)

Observability for Reliable Agentic AI with Strands SDK & OpenTelemetry

Challenges with Deploying AI Agents in Production

Leveraging AWS Agent Core for Reliable Agent Deployment

Implementing a Multi-Agent Financial Advisor

Observability Challenges and Improvements

Optimizing Multi-Agent Architectures

Key Observability Metrics and Features

Business Impact and Real-World Applications

Resources and Next Steps

Your Digital Journey deserves a great story.

Build one with us.

Headquarters

Delivery Centre

AWS re:Invent 2025 - Observability for Reliable Agentic AI with Strands SDK & OpenTelemetry (NTA406)

Observability for Reliable Agentic AI with Strands SDK & OpenTelemetry

Challenges with Deploying AI Agents in Production

Leveraging AWS Agent Core for Reliable Agent Deployment

Implementing a Multi-Agent Financial Advisor

Observability Challenges and Improvements

Optimizing Multi-Agent Architectures

Key Observability Metrics and Features

Business Impact and Real-World Applications

Resources and Next Steps

Your Digital Journey deserves a great story.

Build one with us.

This website stores cookies on your computer.