How to Design Reliable Data Pipelines
A guide to designing reliable, fault-tolerant data pipelines with architectural principles like idempotency, observability, and DAG-based workflows.
Alex Merced — Developer and technical writer sharing in-depth insights on data engineering, Apache Iceberg, data lakehouse architectures, Python tooling, and modern analytics platforms, with a strong focus on practical, hands-on learning.
418 articles from this blog
Explains why AI-driven data analytics fails without a semantic layer to define business metrics and ensure accurate, secure queries.
Explains database denormalization: when to flatten data for faster analytics queries and when to avoid it.
A practical, tool-agnostic checklist of essential best practices for designing, building, and maintaining reliable data engineering pipelines.
Explains the three levels of data modeling (conceptual, logical, physical) and their importance in database design.
A comprehensive guide exploring the taxonomy, tools, and best practices for using AI-assisted coding tools in modern software development.
Explains Recursive Language Models (RLMs), which are LLMs that call themselves to break complex tasks into structured, reusable steps.
A 2025 year-in-review of key Apache data projects: Iceberg, Polaris, Parquet, and Arrow, detailing their major updates and future roadmap.
Introduces DremioFrame and IceFrame, two new Python libraries for simplifying work with Dremio and Apache Iceberg tables.
Introduces dremioframe, a Python DataFrame library for querying Dremio with a pandas-like API, generating SQL under the hood.
A hands-on tutorial exploring Dremio Cloud Next Gen's new free trial, covering its lakehouse platform, AI features, and SQL capabilities.
A comprehensive guide to learning Apache Iceberg, data lakehouse architecture, and Agentic AI with curated tutorials, tools, and resources.
Explores the commercial Apache Iceberg catalog ecosystem, focusing on REST Catalog standards, optimization strategies, and architectural trade-offs.
Explores two paths for building a universal lakehouse catalog that extends beyond Apache Iceberg tables to manage diverse data formats and sources.
A technical guide on using Apache Iceberg with Apache Spark and Polaris for building and managing a data lakehouse, covering setup, operations, and optimization.
Overview of key proposals in Apache Iceberg v4, focusing on performance, metadata efficiency, and portability for modern data workloads.
A comprehensive guide comparing five major open table formats (Iceberg, Delta Lake, Hudi, Paimon, DuckLake) for modern data lakehouses, covering their internals and use cases.
A comprehensive guide to the data lakehouse architecture, its core components (Iceberg, Delta, Hudi, Paimon), and the surrounding ecosystem for modern data platforms.
A guide to building an autonomous, self-healing optimization pipeline for Apache Iceberg tables to maintain performance and cost efficiency.
Strategies for scaling and optimizing Apache Iceberg data compaction jobs, including parallelism, checkpointing, and failure recovery.