AWS re:Invent 2025 - Delighting Slack users safely and quickly with Amazon Nova and Bedrock (AIM384)
Scaling Slack's Generative AI Capabilities with Amazon Bedrock and Experimentation
Developing a Scalable and Secure Infrastructure
Slack's key priorities for their Slack AI features:
Trust : Ensure customer data is not used to train models and provide opt-out options
Security : Operate within FedRAMP Moderate compliance and maintain data security
Reliability : Ensure high availability and contextual relevance of AI responses
Challenges with initial SageMaker-based architecture:
Peaky traffic patterns with different latency requirements
GPU availability constraints leading to over-provisioning
Limited flexibility to experiment with new language models
Migration to Amazon Bedrock:
Leveraged Bedrock's FedRAMP Moderate compliance and data isolation guarantees
Performed gradual migration with shadow traffic testing and staged cutover
Utilized Bedrock's on-demand pricing and cross-region inference to improve cost efficiency
Implemented backup models, emergency stops, and other operational improvements
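The backup-model and emergency-stop pattern described above can be sketched as a small wrapper. This is a minimal illustration, not Slack's actual code: the `invoke` callable and `emergency_stop` kill switch are assumed names, and in production `invoke` would wrap a Bedrock runtime client call.

```python
from typing import Callable, Optional, Sequence

def invoke_with_fallback(
    prompt: str,
    model_ids: Sequence[str],
    invoke: Callable[[str, str], str],
    emergency_stop: Callable[[], bool] = lambda: False,
) -> str:
    """Try models in priority order; refuse all calls if the kill switch is on."""
    if emergency_stop():
        # Operational "emergency stop": disable the feature rather than degrade it.
        raise RuntimeError("emergency stop engaged; no model invoked")
    last_error: Optional[Exception] = None
    for model_id in model_ids:
        try:
            # e.g. a wrapper around the Bedrock runtime Converse API
            return invoke(model_id, prompt)
        except Exception as err:  # throttling, timeout, model outage, ...
            last_error = err
    raise RuntimeError("primary and all backup models failed") from last_error
```

Keeping the model list and kill switch as injected parameters is what makes staged cutovers and shadow-traffic tests easy: the same call site can be pointed at old and new model orderings without code changes.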
Key benefits of the Bedrock migration:
Increased flexibility to experiment with 15+ language models in production
Improved reliability through model fallbacks and emergency response capabilities
Over 90% cost savings, equating to over $20 million annually
Developing an Experimentation Framework for Quality Assurance
Challenges in evaluating generative AI quality:
Subjective nature of outputs makes traditional metrics insufficient
Need to measure both objective and subjective quality dimensions
Slack's quality evaluation framework:
Objective quality: Rendering, formatting, parsing accuracy
Subjective quality: Factual accuracy, answer relevancy, attribution accuracy
Safety: Measuring for toxicity, bias, and security vulnerabilities
Experimentation workflow:
Offline testing on "golden" and "validation" datasets
Online A/B testing with automated quality and user satisfaction metrics
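The offline step of the workflow above can be reduced to a simple promotion gate over per-example quality scores on the golden dataset. A sketch under assumed names, with a hypothetical one-point regression tolerance:

```python
def mean(scores: list) -> float:
    return sum(scores) / len(scores)

def regression_gate(
    baseline_scores: list,
    candidate_scores: list,
    max_drop: float = 0.01,
) -> bool:
    """Promote the candidate to online A/B testing only if its average quality
    on the golden dataset drops no more than max_drop below the baseline."""
    return mean(candidate_scores) >= mean(baseline_scores) - max_drop
```

Only candidates that clear this gate would proceed to online A/B tests, where automated quality and user-satisfaction metrics make the final call.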
Example use case: Search query understanding optimization
Replaced a high-cost, high-latency LLM with the Amazon Nova Lite model
Achieved 46% latency reduction and 70% cost savings without quality regression
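As a concrete illustration of this swap, here is a hypothetical request builder for the Bedrock Converse API targeting the public Nova Lite model ID. The prompt text and inference settings are assumptions for illustration, not Slack's production prompt:

```python
NOVA_LITE_MODEL_ID = "amazon.nova-lite-v1:0"  # public Bedrock model ID for Nova Lite

def build_query_understanding_request(raw_query: str) -> dict:
    """Build the kwargs for bedrock_runtime.converse(**request)."""
    return {
        "modelId": NOVA_LITE_MODEL_ID,
        "messages": [{
            "role": "user",
            # Illustrative prompt; the real query-understanding prompt is not public.
            "content": [{"text": f"Rewrite this search query for retrieval: {raw_query}"}],
        }],
        # A low token cap and temperature 0 suit a short, deterministic rewrite task.
        "inferenceConfig": {"maxTokens": 128, "temperature": 0.0},
    }

# In production this would be sent with something like:
#   boto3.client("bedrock-runtime").converse(**build_query_understanding_request(q))
```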
Overall impact:
90% reduction in Slack AI cost per monthly active user
5x increase in Slack AI scale while improving user satisfaction by 15-30%
Integrating Generative AI Across Slack's Product
Spectrum of generative AI complexity at Slack:
Low complexity: Classification, structured data conversion
Medium complexity: Summarization, basic content generation
High complexity: Agentic workflows, advanced content generation
Importance of selecting the right language model for each use case
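Model selection across that complexity spectrum can be expressed as a small routing table. The task categories below follow the talk's spectrum, while the Nova model IDs are assumed placeholders rather than Slack's actual routing:

```python
# Hypothetical routing from task complexity to a Bedrock model ID.
TASK_TO_MODEL = {
    # low complexity: fastest, cheapest tier
    "classification": "amazon.nova-micro-v1:0",
    "structured_conversion": "amazon.nova-micro-v1:0",
    # medium complexity
    "summarization": "amazon.nova-lite-v1:0",
    "basic_generation": "amazon.nova-lite-v1:0",
    # high complexity: most capable (and most expensive) tier
    "agentic_workflow": "amazon.nova-pro-v1:0",
    "advanced_generation": "amazon.nova-pro-v1:0",
}

def pick_model(task: str) -> str:
    model_id = TASK_TO_MODEL.get(task)
    if model_id is None:
        raise ValueError(f"no model configured for task {task!r}")
    return model_id
```

Centralizing the mapping like this is what lets a team re-point one task at a cheaper model (as in the query-understanding example) without touching call sites.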
Example: Search query understanding optimization
Leveraged Slack's infrastructure and experimentation framework