Welcome!
This blog publishes articles about the Iceberg Data Lakehouse (using your data lake as your data warehouse with Apache Iceberg) and the Agentic Lakehouse (lakehouses optimized for working with AI agents). Expect posts that answer questions such as: What is Apache Iceberg? What is a data lakehouse? Where can I find more resources?
This blog is not affiliated with the Apache Software Foundation or the Apache Iceberg project, whose official page is iceberg.apache.org.
Join the Data Lakehouse Hub Slack Community: Join Now!
Subscribe to our calendar of Data Lakehouse events: Subscribe!
Recent Posts
The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg
Published: at 09:00 AM
Learn how to automate compaction, snapshot expiration, and layout optimization in Apache Iceberg using metadata-driven triggers and orchestration tools for a self-healing lakehouse.
Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery
Published: at 09:00 AM
Learn how to scale Apache Iceberg table optimizations across large datasets using parallelism, checkpointing, and failure recovery to ensure reliability and performance.
Unlocking the Power of Agentic AI with Apache Iceberg and Dremio
Published: at 09:00 AM
Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg
Published: at 09:00 AM
Partition evolution in Apache Iceberg is a powerful feature, but if not managed carefully, it can introduce fragmentation and impact compaction performance. Learn how to handle it effectively.