Next-generation Amazon SageMaker: The center for data, analytics & AI(ANT206-NEW)

Summary of the Video Transcription

Why Build the Next Generation of Amazon SageMaker?

  • Advent of generative AI technology is changing the nature of work and how organizations leverage data and AI.
  • Organizations are struggling to become data-driven due to complex tool sets, data sprawl, and fragmented governance experiences.
  • Customers want unified experiences across services with no assembly required.

Key Components of the Next Generation of Amazon SageMaker

  1. SageMaker Unified Studio:

    • A unified development environment for data processing, SQL analytics, machine learning, and generative app development.
    • Integrated with Amazon Q for increased productivity.
    • Provides a unified data and AI development experience.
  2. SageMaker Catalog:

    • An enterprise-wide catalog for data and ML assets.
    • Enables efficient permission management and key governance capabilities.
  3. SageMaker Lakehouse:

    • Breaks down data silos across the data estate.
    • Provides a unified Lakehouse catalog across data sources and supports open-source Iceberg-compatible APIs.
    • Enables a zero-ETL future.

SageMaker Unified Studio

  • Offers best-in-class tools for data processing, SQL analytics, machine learning, and generative app development.
  • Provides a unified development experience with Git integration and shared access to project data.
  • Supports users of varying skill levels, from data engineers to generative app developers.
  • Integrates with Amazon Q for increased productivity across various tasks.

Key Concepts of SageMaker Unified Studio

  1. Domains: Modeling of the organization, accounts, and workloads to provide a single interface.
  2. Projects: Containers for code, data, ML assets, and compute resources necessary for an end-to-end application.
  3. Project Roles: Controlling access and permissions within a project.
  4. Environments: Enabling additional capabilities in a consistent manner.
  5. Blueprints: Defining and deploying resources within projects in a governed manner.

Olympics.com Use Case

  • Olympics is a highly complex and bursty event, requiring significant scaling of data and AI infrastructure.
  • Onboarding data workers in advance and provisioning resources quickly during peak times are crucial challenges.
  • Governance and controlled access to data across various organizations are important requirements.

Demo with Sports Vision

  • Showcases how different personas (ETL engineers, data engineers, and generative app developers) can collaborate using SageMaker Unified Studio.
  • Demonstrates the seamless integration of data processing, analytics, machine learning, and generative app development within a unified experience.

Your Digital Journey deserves a great story.

Build one with us.

Cookies Icon

These cookies are used to collect information about how you interact with this website and allow us to remember you. We use this information to improve and customize your browsing experience, as well as for analytics.

If you decline, your information won’t be tracked when you visit this website. A single cookie will be used in your browser to remember your preference.

Talk to us