Iceberg articles

7/6/2026 • EN

Block vs. Object Storage: A Deep Dive Into the Foundation of Modern Data, and How the Lakehouse Made the Slow Option Fast

A deep dive comparing block vs. object storage, explaining how lakehouses made slower object storage fast for analytics.

Block Storage Iceberg Lakehouse Object Storage Parquet

Alex Merced

7/6/2026 • EN

Conversational AI on Managed Iceberg: Exposing Amazon S3 Tables through MCP

Explores an architecture pattern combining Amazon S3 Tables and MCP for governed conversational AI access to Iceberg data.

Amazon S3 Conversational AI Data Governance Iceberg mcp

Alex Merced

7/6/2026 • EN

Trustworthy Concurrency in the Agentic Lakehouse: Reconciling Academic Proofs with High-Frequency Production Writes

Explores concurrency challenges in agentic lakehouses with Iceberg, balancing academic proofs and high-frequency production writes.

Agentic Lakehouse concurrency Iceberg Optimistic Concurrency Control Production Writes

Alex Merced

6/22/2026 • EN

Rust vs C++ in Native Iceberg Scan Operators

Analysis of Rust vs C++ for building native Iceberg scan operators, focusing on production performance, safety, and interoperability.

c Iceberg Native Scan Operators query execution rust

Alex Merced

6/22/2026 • EN

REST Catalog V2 LoadTable and Client Capability

Analysis of REST Catalog V2 LoadTable and client capability negotiation for lakehouse platforms.

Client Capability Iceberg Lakehouse Loadtable REST Catalog

Alex Merced

6/22/2026 • EN

Iceberg v4 Performance: Root Manifests and Calls

Analysis of Apache Iceberg v4 performance focusing on metadata round trips, root manifests, and object storage latency for platform engineers.

Iceberg Metadata Round Trips Object Storage Production Performance Query Latency

Alex Merced

6/8/2026 • EN

Agentic Lakehouse Concurrency and Isolation

Explores concurrency and isolation challenges when AI agents write to Apache Iceberg lakehouses, covering OCC mechanics, failure modes, and architectural patterns.

Agent Concurrency Iceberg Isolation Patterns Lakehouse Optimistic Concurrency Control

Alex Merced

6/8/2026 • EN

Zero-Copy Mirroring for Modern Lakehouse Migration

Analysis of zero-copy mirroring for safer lakehouse migration, focusing on architecture, governance, and multi-engine data platforms.

Data Architecture Iceberg Lakehouse Migration Metadata Governance Zero Copy Mirroring

Alex Merced

6/8/2026 • EN

Bidirectional Iceberg Writes with Horizon Catalog

Explains Snowflake's bidirectional Iceberg writes via Horizon Catalog, powered by Apache Polaris, enabling external engines to write to Snowflake-managed Iceberg tables.

Apache Polaris Horizon Catalog Iceberg rest api Snowflake

Alex Merced

6/8/2026 • EN

Securing Agent Identities in the Lakehouse

This article discusses authentication and authorization patterns for securing AI agent identities in the Iceberg lakehouse, including OAuth 2.0 token exchange and credential vending.

Credential Vending Iceberg Lakehouse Security Oauth20 Token Exchange

Alex Merced

6/8/2026 • EN

Iceberg Remote Signing for Regulated Datasets

Explains Iceberg remote signing for regulated datasets, enhancing security by issuing per-file, one-time-use pre-signed URLs instead of storage credentials.

Access Delegation Iceberg Regulated Datasets Remote Signing REST Catalog

Alex Merced

6/8/2026 • EN

Anatomy of an Agentic Lakehouse

Explains the four-layer architecture of an agentic lakehouse for reliable AI agent data access.

agentic ai Apache Polaris Iceberg Lakehouse Architecture Object Storage

Alex Merced

5/28/2026 • EN

Building the Brain of the Agentic Lakehouse: Designing an Open Catalog Architecture

Explains designing an open catalog architecture for AI agents in an agentic lakehouse, covering Apache Polaris and Dremio's Open Catalog.

Agentic Lakehouse Apache Polaris Dremio Iceberg Open Catalog

Alex Merced

5/28/2026 • EN

Designing an Immutable Data Lakehouse: Best Practices for Iceberg Snapshot Expiration

Best practices for managing Apache Iceberg snapshot expiration in data lakehouses to optimize query performance and metadata size.

Data Lakehouse Iceberg Metadata Management Query Performance Snapshot Expiration

Alex Merced

5/24/2026 • EN

When Paimon Beats Iceberg for Mutable Streams

Compares Apache Paimon and Iceberg for handling mutable streams, focusing on Paimon's LSM-tree architecture for high-frequency updates.

Iceberg Lsm Tree Mutable Streams Paimon streaming

Alex Merced

5/24/2026 • EN

Lance and Iceberg for Multimodal AI Data

Explores using Lance and Iceberg formats for multimodal AI data, addressing scan-heavy analytics vs. random-access retrieval for ML training.

Data Architecture Iceberg Lance Multimodal AI Vector Database

Alex Merced

5/24/2026 • EN

Using DuckDB and Polars to Query Iceberg Tables

Explores using DuckDB and Polars to query and write to Iceberg tables, covering new features, workflows, and practical patterns.

Duckdb Iceberg Lakehouse Polars REST Catalog

Alex Merced

4/29/2026 • EN

Performance and Apache Iceberg's Metadata

Explains how Apache Iceberg uses metadata for data skipping, enabling fast query performance by eliminating 90-99% of files before scanning.

Data Skipping Iceberg metadata performance Query Optimization

Alex Merced

4/13/2026 • EN

What is Apache Polaris? Unifying the Iceberg Ecosystem

Apache Polaris is an open-source catalog service that unifies the Iceberg ecosystem by implementing the Iceberg REST API for vendor-neutral lakehouse metadata management.

Apache Iceberg Apache Polaris Catalog Iceberg open source

Alex Merced