AWS's poorly named but powerful Lakehouse for SageMaker


Is it a data lake or a data warehouse? Well, Lakehouse looks to marry the two, creating a single interface to access both. You can query Parquet files in S3 or more structured data in Redshift.

It also boasts that it can replicate data not just from AWS-native sources like DynamoDB but also from Facebook/Instagram ads and a lot more.

You can query it using Athena, as you might Parquet files in S3, but also via Redshift or a Jupyter notebook.
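To make the Athena route concrete, here is a minimal sketch of what querying from a notebook might look like with boto3. The Athena calls (`start_query_execution`, `get_query_execution`, `get_query_results`) are standard boto3 methods, but the helper name `run_athena_query`, the database, the table, and the S3 output location are all hypothetical placeholders, and I'm assuming the Lakehouse data is already visible to Athena as a regular database.

```python
import time


def run_athena_query(client, sql, database, output_location, poll_seconds=1.0):
    """Submit a query to Athena, wait for it to finish, and return the rows.

    `client` is expected to be a boto3 Athena client (or anything with the
    same three methods). Only the first page of results is returned.
    """
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Athena runs queries asynchronously, so poll until a terminal state.
    while True:
        status = client.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        reason = status["QueryExecution"]["Status"].get("StateChangeReason", "")
        raise RuntimeError(f"Athena query {query_id} ended as {state}: {reason}")

    result = client.get_query_results(QueryExecutionId=query_id)
    # Each row is a list of cells; for SELECT queries the first row is the header.
    return [
        [cell.get("VarCharValue") for cell in row["Data"]]
        for row in result["ResultSet"]["Rows"]
    ]


# Example usage (hypothetical names, requires boto3 and AWS credentials):
#
#   import boto3
#   athena = boto3.client("athena")
#   rows = run_athena_query(
#       athena,
#       "SELECT * FROM ad_spend LIMIT 10",      # hypothetical table
#       database="my_lakehouse_db",             # hypothetical database
#       output_location="s3://my-bucket/athena-results/",  # hypothetical bucket
#   )
```

Passing the client in, rather than creating it inside the function, keeps the helper easy to try against a stub before pointing it at a real account.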

This makes me think it's similar to AWS Kendra but specifically tailored for SageMaker. It wouldn’t be the first time AWS launched two or more completely redundant services.

I am curious who in my audience has used SageMaker. What did you think about it?