Building Pangolin - My Holiday Break, an AI IDE, and a Lakehouse Catalog for the Curious
A developer shares the story of building Pangolin, an open-source lakehouse catalog, using an AI coding agent during a holiday break.
A developer shares the story of building Pangolin, an open-source lakehouse catalog, using an AI coding agent during a holiday break.
A technical guide on designing and implementing a modern data lakehouse architecture using the Apache Iceberg table format in 2025.
A look at 10 upcoming features and enhancements for the Apache Iceberg data lakehouse table format, expected in 2025.
A guide to setting up and using Dremio's Auto-Ingest feature for automated, event-driven data loading into Apache Iceberg tables from cloud storage.
A tutorial on using SQL with Apache Iceberg tables in the Dremio data lakehouse platform, covering setup and core operations.
Explores how Dremio and Apache Iceberg create AI-ready data by ensuring accessibility, scalability, and governance for machine learning workloads.
A hands-on tutorial for setting up a local data lakehouse with Apache Iceberg, Dremio, and Nessie using Docker in under 10 minutes.
Quarterly roundup of data lakehouse trends, table formats, and major industry news from Apache Iceberg to Delta Lake.
Explains how to implement access control and security for Apache Iceberg tables at the file, engine, and catalog levels.
A comprehensive directory of Apache Iceberg resources, including tutorials, guides, and educational materials for data engineers and developers.
Explores how combining data lakehouse, virtualization, and mesh architectures with Dremio solves modern data scaling and silo challenges.
A hands-on tutorial for building a Data Lakehouse on your laptop using Apache Iceberg, Spark, Nessie, Minio, and Dremio.
Explains why data professionals should adopt Dremio and Apache Iceberg for flexible, high-performance data lakehouse architecture.
Explores five key trends shaping the data lakehouse architecture, including storage evolution, table formats, and catalog competition.
Explains how Apache Iceberg uses delete files for efficient row-level data deletions without rewriting entire datasets.
Explains the role and structure of Apache Iceberg manifest files, key metadata components for tracking data files and optimizing queries in data lakehouses.
Explains the role and structure of the Apache Iceberg Manifest List file in managing table snapshots and optimizing data lakehouse queries.
Explains the critical role and structure of the metadata.json file in Apache Iceberg, the open-source table format for data lakehouses.
An introduction to data lakehouses, explaining what they are, why they're used, and how to migrate to this modern data architecture.
Explores Polaris, an open-source catalog service for managing Apache Iceberg tables in data lakehouses, covering its architecture, entities, and security.