Alex Merced

Alex Merced: developer and technical writer sharing in-depth insights on data engineering, Apache Iceberg, data lakehouse architectures, Python tooling, and modern analytics platforms, with a strong focus on practical, hands-on learning.

https://tuts.alexmercedcoder.dev

12/31/2025

data engineering apache iceberg data lakehouse python analytics

Articles from this Blog

333 articles from this blog

1/15/2026 • EN

A Practical Guide to AI-Assisted Coding Tools

A comprehensive guide to the taxonomy, tools, and best practices of AI-assisted coding in modern software development.

developer tools ai-assisted coding workflow automation

1/10/2026 • EN

What Are Recursive Language Models?

Explains Recursive Language Models (RLMs), which are LLMs that call themselves to break complex tasks into structured, reusable steps.

ai programming Function Calling Reasoning Models

12/29/2025 • EN

2025 Year in Review Apache Iceberg, Polaris, Parquet, and Arrow

A 2025 year-in-review of key Apache data projects: Iceberg, Polaris, Parquet, and Arrow, detailing their major updates and future roadmap.

Apache Arrow Apache Iceberg Data Lakehouse

12/5/2025 • EN

dremioframe & iceberg - Pythonic interfaces for Dremio and Apache Iceberg

Introduces DremioFrame and IceFrame, two new Python libraries for simplifying work with Dremio and Apache Iceberg tables.

Python Polars Apache Iceberg

11/29/2025 • EN

Introducing dremioframe - A Pythonic DataFrame Interface for Dremio

Introduces dremioframe, a Python DataFrame library for querying Dremio with a pandas-like API, generating SQL under the hood.

Python sql data analysis

11/12/2025 • EN

Comprehensive Hands-on Walk Through of Dremio Cloud Next Gen (Hands-on with Free Trial)

A hands-on tutorial exploring Dremio Cloud Next Gen's new free trial, covering its lakehouse platform, AI features, and SQL capabilities.

sql Cloud Platform Apache Iceberg

10/23/2025 • EN

2025-2026 Guide to Learning about Apache Iceberg, Data Lakehouse & Agentic AI

A comprehensive guide to learning Apache Iceberg, data lakehouse architecture, and Agentic AI with curated tutorials, tools, and resources.

agentic ai Data Engineering Apache Iceberg

10/21/2025 • EN

An Exploration of the Commercial Iceberg Catalog Ecosystem

Explores the commercial Apache Iceberg catalog ecosystem, focusing on REST Catalog standards, optimization strategies, and architectural trade-offs.

optimization Metadata Management Data Lakehouse

10/17/2025 • EN

Building a Universal Lakehouse Catalog - Beyond Iceberg Tables

Explores two paths for building a universal lakehouse catalog that extends beyond Apache Iceberg tables to manage diverse data formats and sources.

Data Lakehouse Table Format REST Catalog

10/16/2025 • EN

Intro to Apache Iceberg with Apache Polaris and Apache Spark

A technical guide on using Apache Iceberg with Apache Spark and Polaris for building and managing a data lakehouse, covering setup, operations, and optimization.

Apache Spark Data Engineering Apache Iceberg

10/14/2025 • EN

The State of Apache Iceberg v4 - October 2025 Edition

Overview of key proposals in Apache Iceberg v4, focusing on performance, metadata efficiency, and portability for modern data workloads.

Data Engineering Apache Iceberg Data Lakehouse

9/24/2025 • EN

The Ultimate Guide to Open Table Formats - Iceberg, Delta Lake, Hudi, Paimon, and DuckLake

A comprehensive guide comparing five major open table formats (Iceberg, Delta Lake, Hudi, Paimon, DuckLake) for modern data lakehouses, covering their internals and use cases.

Apache Iceberg Apache Hudi Delta Lake

9/23/2025 • EN

The 2025 & 2026 Ultimate Guide to the Data Lakehouse and the Data Lakehouse Ecosystem

A comprehensive guide to the data lakehouse architecture, its core components (Iceberg, Delta, Hudi, Paimon), and the surrounding ecosystem for modern data platforms.

Data Architecture Apache Iceberg Data Lakehouse

9/16/2025 • EN

The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

A guide to building an autonomous, self-healing optimization pipeline for Apache Iceberg tables to maintain performance and cost efficiency.

Metadata Management Apache Iceberg Data Lakehouse

9/9/2025 • EN

Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery

Strategies for scaling and optimizing Apache Iceberg data compaction jobs, including parallelism, checkpointing, and failure recovery.

parallelism Checkpointing Apache Iceberg

9/2/2025 • EN

Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Explores challenges and best practices for managing partition evolution and compaction in Apache Iceberg to maintain query performance.

Metadata Management Apache Iceberg Data Lakehouse

8/26/2025 • EN

Using Iceberg Metadata Tables to Determine When Compaction Is Needed

Explains how to use Apache Iceberg's metadata tables to dynamically trigger data compaction based on file size, manifest health, and snapshot patterns.

Apache Iceberg Data Lakehouse Table Optimization