[NEW LAUNCH] Amazon SageMaker Lakehouse: Accelerate analytics & AI (ANT354-NEW)

Here's a detailed summary of the session transcript:

Amazon SageMaker Lakehouse - Unified Data Management

Introduction

  • Niranjan Rintala and Mahesh Mishra introduce the new Amazon SageMaker Lakehouse, a unified data management solution that brings together the best of data lakes and data warehouses.
  • The presentation covers the customer problems the Lakehouse aims to solve, the AWS vision behind it, its features and capabilities, and several use cases and demos.

The Need for Unified Data Management

  • Data and generative AI are driving innovation, enabling new customer experiences, employee productivity, and product ideation.
  • The foundation for leveraging generative AI, however, is the data itself and the systems around it - operational databases, data lakes, data warehouses, and integration tools.
  • Customers often struggle with data silos, inconsistent access controls, and longer time-to-value when working with data lakes and data warehouses separately.

The Promise of a Lake House

  • The Lakehouse concept aims to combine the best of both worlds - the flexibility and openness of data lakes with the performance and transactional capabilities of data warehouses.
  • There are different approaches to building a lakehouse, each with its own trade-offs.
  • The AWS vision for SageMaker Lakehouse is a unified experience built on three key tenets:
    1. Unified data across data lakes and data warehouses
    2. Accessible data through open interfaces such as the Apache Iceberg APIs
    3. Integrated security controls for consistent access management

Key Components of SageMaker Lakehouse

  1. Storage Flexibility: Support for Amazon S3 data lakes, Amazon S3 Tables, and Amazon Redshift Managed Storage.
  2. Unified Catalog: The AWS Glue Data Catalog, extended with flexible hierarchies to manage data across these storage types.
  3. Integrated Permissions: Fine-grained access control at the table, column, and cell level, using industry-standard models such as tag-based and role-based access control (see the permissions sketch after this list).
  4. Iceberg-compatible APIs: Allow any Iceberg-compatible query engine to access data across data lakes and data warehouses (see the Spark configuration sketch after this list).
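
Item 3 above, sketched as code: one plausible way to express a column-level grant is AWS Lake Formation's grant_permissions API. The session does not show this call; the role ARN, database, table, and column names below are hypothetical placeholders.

```python
import boto3

# Hedged sketch: grant column-level SELECT on a catalog table to an IAM role
# via AWS Lake Formation. All identifiers below are illustrative placeholders.
lf = boto3.client("lakeformation", region_name="us-east-1")

lf.grant_permissions(
    Principal={
        "DataLakePrincipalIdentifier": "arn:aws:iam::111122223333:role/AnalystRole"
    },
    Resource={
        "TableWithColumns": {
            "DatabaseName": "sales_db",
            "Name": "orders",
            # Only non-sensitive columns are exposed to this role.
            "ColumnNames": ["order_id", "order_date", "amount"],
        }
    },
    Permissions=["SELECT"],
)
```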
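Item 4 above, sketched as code: an Iceberg-compatible engine such as Apache Spark can be pointed at the Lakehouse catalog through an Iceberg REST catalog configuration. This is a minimal sketch under stated assumptions; the endpoint URI, catalog name (lakehouse), authentication settings, and table identifiers are illustrative, not values confirmed in the session.

```python
from pyspark.sql import SparkSession

# Minimal sketch: configure Spark's Iceberg integration to talk to the
# Lakehouse catalog over the Iceberg REST interface. The Iceberg runtime and
# AWS bundle jars are assumed to be on the classpath; the endpoint URI and
# SigV4 settings below are assumptions, not values from the session.
spark = (
    SparkSession.builder
    .appName("lakehouse-iceberg-read")
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.lakehouse", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.lakehouse.type", "rest")
    # Hypothetical Glue Iceberg REST endpoint; replace with the one for your account/region.
    .config("spark.sql.catalog.lakehouse.uri", "https://glue.us-east-1.amazonaws.com/iceberg")
    # SigV4 request signing so catalog calls are authorized by IAM.
    .config("spark.sql.catalog.lakehouse.rest.sigv4-enabled", "true")
    .config("spark.sql.catalog.lakehouse.rest.signing-name", "glue")
    .getOrCreate()
)

# Query a table through the catalog, regardless of whether it is backed by
# S3, S3 Tables, or Redshift Managed Storage.
spark.sql("SELECT * FROM lakehouse.sales_db.orders LIMIT 10").show()
```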

Demo: Bringing a Data Warehouse into the Lakehouse

  • Demonstration of registering an Amazon Redshift data warehouse into the Lakehouse, making its data accessible through Iceberg-compatible APIs.
  • Ability to query that data from multiple Redshift clusters without explicit data sharing, as sketched below.
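
A hedged sketch of what querying the registered warehouse from a second Redshift cluster or workgroup could look like, using the Redshift Data API from Python. The workgroup, database, and three-part table name are hypothetical placeholders rather than identifiers from the demo.

```python
import time

import boto3

# Sketch: run a query from a *consumer* Redshift Serverless workgroup against
# a table that lives in another warehouse but is registered in the Lakehouse
# catalog, so no explicit datashare is set up here. Names are assumptions.
client = boto3.client("redshift-data", region_name="us-east-1")

resp = client.execute_statement(
    WorkgroupName="analytics-serverless",  # consumer workgroup (placeholder)
    Database="dev",
    Sql='SELECT COUNT(*) FROM "lakehouse_catalog"."sales_db"."orders";',
)

# Poll until the statement finishes, then print the result set.
while True:
    desc = client.describe_statement(Id=resp["Id"])
    if desc["Status"] in ("FINISHED", "FAILED", "ABORTED"):
        break
    time.sleep(1)

if desc["Status"] == "FINISHED":
    print(client.get_statement_result(Id=resp["Id"])["Records"])
```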

Demo: Writing Data to the Lakehouse Using Spark

  • Demonstration of using Spark to write data directly to the Lakehouse, with Redshift Managed Storage as the underlying storage option.
  • The data becomes available to other query engines such as Amazon Athena and Amazon Redshift through the Iceberg-compatible APIs, as sketched below.
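
A minimal sketch of the write path described above, reusing the Spark session from the earlier configuration sketch: Spark creates and appends to an Iceberg table through the Lakehouse catalog so that Athena and Redshift can read it via the same catalog. The namespace and table names are illustrative assumptions, and which storage backs the namespace (Redshift Managed Storage vs. S3) is decided when the catalog is set up, not by this code.

```python
from pyspark.sql import Row

# Assumes the `spark` session configured in the earlier Iceberg REST sketch.
df = spark.createDataFrame(
    [Row(order_id=1, amount=42.50), Row(order_id=2, amount=13.75)]
)

# Create the Iceberg table once (if it doesn't already exist) in the
# Lakehouse catalog, then append the new rows.
spark.sql("""
    CREATE TABLE IF NOT EXISTS lakehouse.sales_db.orders_stream (
        order_id BIGINT,
        amount   DOUBLE
    ) USING iceberg
""")

df.writeTo("lakehouse.sales_db.orders_stream").append()

# Other Iceberg-compatible engines (Athena, Redshift, EMR, ...) can now query
# sales_db.orders_stream through the same catalog without copying the data.
```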

Key Takeaways

  • SageMaker Lakehouse provides a unified data management experience, letting customers combine the best of data lakes and data warehouses.
  • Customers can bring existing data in data lakes and data warehouses into the Lakehouse without changing their data architecture.
  • The Iceberg-compatible APIs enable seamless access to the data from a variety of AWS and third-party query engines.
  • Integrated security controls and storage flexibility help customers manage their data effectively and efficiently.
