AWS re:Invent 2025 - Create hyper-personalized voice interactions with Amazon Nova Sonic (AIM374)

Summary of AWS re:Invent 2025 - Create hyper-personalized voice interactions with Amazon Nova Sonic (AIM374)

Introduction to Amazon Nova Sonic

Amazon Nova Sonic is a speech-to-speech foundation model for real-time, human-like conversational AI

It was launched as the second generation of the model, with significant improvements over the previous version

Key features include:

Accurate speech understanding across different accents, speaking styles, and background noise
Bi-directional streaming API for low-latency conversations
Ability to detect user sentiment and tonality, and adapt responses accordingly
Knowledge grounding and task completion capabilities

New Capabilities in Amazon Nova Sonic 2

Support for 7 languages (English, Spanish, French, Italian, German, Hindi, Portuguese) with masculine and feminine voice options

Language switching - ability to switch between languages within the same conversation

Asynchronous task completion - allows users to switch topics while a task is being processed in the background

Cross-modal input/output - handles both text and speech input/output seamlessly

Configurable turn-taking sensitivity - allows adjusting the duration of user pauses before the AI responds

Technical Advancements

Significant improvements in speech recognition accuracy, especially for noisy conditions and alphanumeric inputs

State-of-the-art speech reasoning capabilities, outperforming other models on benchmarks

Conversation quality rated higher than competing models by human evaluators

Architectural Approach

Limitations of traditional cascaded voice AI systems (separate speech-to-text, reasoning, and text-to-speech models)

Motivation behind building a unified, foundation model-based approach with Nova Sonic

Benefits of the unified architecture:

Improved context carryover and personalization
Reduced latency and more natural conversations
Ability to adapt to speech nuances like tone and emotion

Real-World Use Cases

Self-service voice-first customer service automation

Voice-enabled personal assistants

Education and language learning applications

Customers building on Nova Sonic include Crescendo, iFrame, Rejume, Cisco, Ring, and Amazon Connect

Developer Tools and Integrations

Partnerships with frameworks like LiveKit and PipeCat to simplify integration and session management

Integration with Amazon Connect for customer service call automation

Telephony integrations with AudioCodes, Twilio, and others for outbound calling and IVR use cases

Demonstration and Customer Testimonial

Demo of an AI receptionist use case built by Cisco, showcasing Nova Sonic's capabilities

Key features highlighted:

Multimodal input/output (voice, text, phone number, address)
Seamless topic switching and asynchronous task completion
Accurate name pronunciation and personalized responses

Conclusion

Nova Sonic represents a significant advancement in conversational AI, enabled by a unified foundation model architecture

Key technical improvements in speech recognition, reasoning, and conversation quality

Broad range of real-world use cases and seamless integration options for developers

Demonstration of a practical AI receptionist use case highlighting the practical benefits

AWS re:Invent 2025 - Create hyper-personalized voice interactions with Amazon Nova Sonic (AIM374)

Summary of AWS re:Invent 2025 - Create hyper-personalized voice interactions with Amazon Nova Sonic (AIM374)

Introduction to Amazon Nova Sonic

New Capabilities in Amazon Nova Sonic 2

Technical Advancements

Architectural Approach

Real-World Use Cases

Developer Tools and Integrations

Demonstration and Customer Testimonial

Conclusion

Your Digital Journey deserves a great story.

Build one with us.

Headquarters

Delivery Centre

AWS re:Invent 2025 - Create hyper-personalized voice interactions with Amazon Nova Sonic (AIM374)

Summary of AWS re:Invent 2025 - Create hyper-personalized voice interactions with Amazon Nova Sonic (AIM374)

Introduction to Amazon Nova Sonic

New Capabilities in Amazon Nova Sonic 2

Technical Advancements

Architectural Approach

Real-World Use Cases

Developer Tools and Integrations

Demonstration and Customer Testimonial

Conclusion

Your Digital Journey deserves a great story.

Build one with us.

This website stores cookies on your computer.