Apache Iceberg Reliability
Explains how Apache Iceberg's design ensures data reliability, atomic operations, and serializable isolation for large-scale data lakehouses.
A list of upcoming tech talks and events by Alex Merced, focusing on Apache Iceberg, data lakehouses, and data engineering topics.
Explains the data lakehouse architecture, its layers (storage, table format, catalog, processing), and its advantages over traditional data warehouses.
An introduction to Apache Iceberg, a table format for data lakehouses, explaining its architecture and providing learning resources.
Explores the evolution of Apache Iceberg catalogs, focusing on the current REST Catalog and future proposals for server-side optimizations.
A hands-on tutorial on building a data lakehouse pipeline using Spark, Dremio, and Superset to move and analyze data.
An overview of five impactful open-source data projects, including Apache Iceberg and Arrow, that are revolutionizing data management and analytics.
Explains why Dremio is a top platform for Apache Iceberg lakehouses, highlighting features like dataset promotion and data reflections.
Explores Apache Iceberg's catalog system, its role in data lakehouse architecture, and key considerations for choosing the right catalog.
Explains the role, types, and selection criteria for catalogs in Apache Iceberg, a key component for managing data lakehouse tables.
Explores 10 reasons to adopt Apache Iceberg and Dremio for building a modern, flexible, and cost-effective data lakehouse architecture.
Explains the data lakehouse architecture and the roles of Apache Iceberg, Nessie, and Dremio in modern data management.
Table of Contents: Context; Introduction; Short Version for Quick Readers; My Journey with Table Formats and Lakehouses; Ecosystem Over Features; Key Takeaw
Explores the Data Lakehouse architecture and the roles of Apache Iceberg and Dremio in modern, integrated data management.
A comprehensive directory of resources for learning about and building Open Lakehouses using Apache Iceberg, Nessie, and Dremio.
Explores how Dremio's platform simplifies building and managing Apache Iceberg-based data lakehouses with governance, performance, and self-service.
Explores Apache Iceberg and Project Nessie, key open-source technologies powering the flexible and vendor-neutral Open Lakehouse data architecture.
A guide to building a cost-effective, high-performance, and self-service data lakehouse architecture, addressing common pitfalls and outlining key principles.
A tutorial on building a local data lakehouse using Docker Compose with Apache Spark, MinIO, Dremio, and Nessie.
Explains Project Nessie, an open-source data catalog for Apache Iceberg tables, and its importance for data engineers and architects building data lakehouses.