# Supercharge app intelligence using gen AI with Amazon DocumentDB (DAT320)
## Introduction and Overview

- Presented at AWS re:Invent 2024
- Focused on supercharging app intelligence using gen AI with Amazon DocumentDB
- Covers vector search fundamentals, access patterns, vector search on DocumentDB, and best practices
## Vector Search Fundamentals

- Vector search enables more intuitive, smart, and context-relevant searching
- Data is tokenized into elements (words, sentences, paragraphs, documents) and passed through an embedding model to create vector embeddings
- Vector embeddings represent the data in a multi-dimensional space where similar items sit closer together
- Semantically similar results are found through mathematical distance calculations (e.g., cosine or Euclidean distance)
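The distance calculation at the heart of this can be sketched in a few lines. This is a toy illustration with made-up 3-dimensional vectors; real embedding models output hundreds or thousands of dimensions:

```python
from math import sqrt

def cosine_similarity(a, b):
    """Cosine similarity: 1.0 means same direction, 0.0 means orthogonal."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm

# Toy "embeddings" for three items; values are illustrative only.
cat = [0.9, 0.1, 0.2]
kitten = [0.85, 0.15, 0.25]
car = [0.1, 0.9, 0.4]

# Semantically similar items score higher.
print(cosine_similarity(cat, kitten) > cosine_similarity(cat, car))  # True
```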
## Amazon DocumentDB for Vector Search

- DocumentDB supports two indexing methods for vector search: IVFFlat (Inverted File with Flat compression) and HNSW (Hierarchical Navigable Small World)
- IVFFlat:
  - Splits documents into lists organized around centroids
  - Faster to build, but the data must be loaded before the index is created
  - Performs better with static data
- HNSW:
  - Organizes vectors into a layered graph structure
  - Slower to build, but the index can be created first and data added afterward
  - Better for dynamic data with updates and deletes
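As a rough sketch, both index types are created with a `createIndexes` command carrying a `vectorOptions` block. The collection name, field name, dimensions, and parameter values below are illustrative assumptions; you would pass these specs to `db.command(...)` via pymongo against a live DocumentDB cluster:

```python
def hnsw_index_spec(collection, field="embedding", dimensions=1536):
    """createIndexes command for an HNSW vector index (graph-based)."""
    return {
        "createIndexes": collection,
        "indexes": [{
            "name": f"{field}_hnsw",
            "key": {field: "vector"},
            "vectorOptions": {
                "type": "hnsw",
                "dimensions": dimensions,
                "similarity": "cosine",
                "m": 16,               # max connections per graph node
                "efConstruction": 64,  # candidate list size at build time
            },
        }],
    }

def ivfflat_index_spec(collection, field="embedding", dimensions=1536, lists=100):
    """createIndexes command for an IVFFlat vector index (load data first)."""
    return {
        "createIndexes": collection,
        "indexes": [{
            "name": f"{field}_ivfflat",
            "key": {field: "vector"},
            "vectorOptions": {
                "type": "ivfflat",
                "dimensions": dimensions,
                "similarity": "cosine",
                "lists": lists,  # number of centroid lists to cluster into
            },
        }],
    }
```

The key operational difference follows from the build order noted above: create the IVFFlat index only after loading data so the centroids are meaningful, while the HNSW index can be created on an empty collection.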
## Best Practices

- Vector embeddings consume storage, so choose the smallest number of dimensions that meets your quality needs
- For HNSW indexes:
  - Start with the balanced default values for m and efConstruction
  - Adjust m and efSearch based on performance and recall needs
- For IVFFlat indexes:
  - Set the number of lists based on the number of documents
  - Adjust the number of probes to balance performance and recall
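The query-time knobs above can be sketched as an aggregation pipeline built around a `$search` / `vectorSearch` stage. Field names and parameter values here are illustrative assumptions; an IVFFlat query would carry `probes` where the HNSW query carries `efSearch`:

```python
def vector_search_pipeline(query_vector, k=5, ef_search=40):
    """Aggregation pipeline for a vector similarity query on an HNSW index.

    Raising efSearch improves recall at the cost of latency;
    lowering it does the opposite.
    """
    return [{
        "$search": {
            "vectorSearch": {
                "vector": query_vector,
                "path": "embedding",        # indexed vector field
                "similarity": "cosine",
                "k": k,                      # number of nearest neighbors
                "efSearch": ef_search,
            }
        }
    }]
```

You would run this with `collection.aggregate(vector_search_pipeline(vec))` against a collection that has a matching vector index.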
## Demonstration and Resources

- Demonstrated a Python notebook implementing a DocumentDB chatbot using a Retrieval-Augmented Generation (RAG) architecture
- Highlighted the many variables to consider when building generative AI solutions, such as choice of language model, chunking strategy, and index parameters
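The RAG flow demonstrated in the notebook can be reduced to a few steps: embed the question, retrieve similar chunks from the vector store, and ground the model's prompt in that context. The sketch below uses injectable callables (`embed`, `retrieve`, `generate`) as hypothetical stand-ins for an embedding model, a DocumentDB vector search, and an LLM call:

```python
def rag_answer(question, embed, retrieve, generate, k=3):
    """Minimal RAG loop with pluggable components (all hypothetical stand-ins).

    embed(question)      -> query vector
    retrieve(vector, k)  -> list of relevant text chunks (e.g., vector search)
    generate(prompt)     -> LLM completion grounded in the retrieved context
    """
    query_vector = embed(question)
    chunks = retrieve(query_vector, k)
    context = "\n\n".join(chunks)
    prompt = (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
    return generate(prompt)
```

Each callable is a tuning point, which mirrors the session's closing note: the embedding model, chunking strategy, index parameters, and language model all affect answer quality independently.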