< session />

Unified Data Platform with Lakehouse Architecture and Apache Iceberg

Tue, 22 April, 11:30 AM - 12:00 PM GMT+5:30

This session explores the evolution of data storage, from basic CSV files to the advanced capabilities of Apache Iceberg. Discover how Iceberg bridges the gap between data lakes and data warehouses. We'll delve into Iceberg's key features, including seamless schema evolution, flexible partitioning strategies, and advanced filtering techniques for optimized query performance. We will examine Iceberg's robust catalog layout, a critical element for efficient metadata management and scalable data lake operations.

Learn how Goldman Sachs' Lakehouse team is leveraging Iceberg and its REST Catalog specification to develop a next-generation data platform for better operational efficiency and cost-effectiveness. We will also cover how Iceberg gives freedom to switch query engines without requiring data rewrites and  decoupling from vendor locking.

< speaker_info />

About the speaker

Sumit Rastogi

Tech Fellow and Vice President, Goldman Sachs

Sumit Rastogi is a Tech Fellow and Vice President at Goldman Sachs, with 22 years of experience in engineering within the banking industry. His career began with a decade as a banking consultant, providing him with a broad perspective on the technological challenges and opportunities in the financial world. Since joining Goldman Sachs, Sumit has focused on building and scaling data infrastructure. A recognized innovator, Sumit has filed two patents for his work in data and reporting platforms. Sumit is passionate about leveraging technology to drive efficiency and innovation in the financial services sector.