Building an Open, Petabyte-Scale Data Platform With Databricks
Come hear how digital native companies are leveraging a new data architecture — the data lakehouse — which delivers data warehouse performance at data lake economics, all powered by open source technologies. The data lakehouse architecture combines the best of data warehouses and data lakes into a single, unified architecture that can serve all data use cases, including BI, streaming analytics, data science and machine learning.
At this event, we’ll also explore Delta Lake, an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions and scalable metadata handling, and unifies streaming and batch data processing. We’ll talk about how Delta Lake makes the lakehouse vision possible.
We’ll also cover best practices for using powerful open source technologies so you can build on and extend your data platform investments. You’ll learn about the security and cost advantages of cloud-based data lakes. And finally, you’ll learn how data teams at digital native companies are having a huge impact by lowering costs, speeding up time to market and powering new innovations that disrupt industries.
Lastly, you’ll be able to interact with data engineers, data scientists and ML engineers and learn from one another. Databricks engineers and open source committers for Apache Spark™, Delta Lake and MLflow will be present to discuss emerging trends and ways for you to get involved in the open source community.
Register today so you can:
- Hear about the open lakehouse architecture and the advantages it offers over data warehouses and lakes
- Find out how to extend and simplify your data platform by adopting lakehouse architecture concepts
- See how you can add reliability, performance and governance to your open data lake
- Hear how digital natives build highly scalable and reliable data pipelines for analytics and machine learning
- Network with and hear from your data engineering and machine learning peers at other digital native companies
Agenda (PT):
- 10:00–10:20 AM Enabling an Open, Petabyte-Scale Data Architecture With Databricks
- 10:20–10:40 AM Lakehouse Architecture in Practice at Scribd
- 10:40–11:20 AM Customer Panel — Lessons Learned in Building Data Platforms
- 11:20 AM–12:00 PM AMA With the Databricks Technical Team
Speaker and Presenter Information

Tyler Croy
Director of Platform Engineering
Scribd

Hien Luu
Sr. Engineering Manager
DoorDash

Chris Locklin
Engineering Manager
Grammarly

Sherwin Wu
Engineering Manager
Relevant Government Agencies
Other Federal Agencies, Federal Government, State & Local Government
Event Type
Virtual
This event has no exhibitor/sponsor opportunities
When
Wed, Sep 29, 2021, 1:00pm ET
Cost
Complimentary: $0.00
Organizer
Databricks