Using Helm with Kubernetes - A Guide to Helm Charts and Their Implementation
A guide to using Helm, the package manager for Kubernetes, covering Helm charts, installation, deployment, and best practices.
Alex Merced — Developer and technical writer sharing in-depth insights on data engineering, Apache Iceberg, data lakehouse architectures, Python tooling, and modern analytics platforms, with a strong focus on practical, hands-on learning.
501 articles from this blog
A guide to using Helm, the package manager for Kubernetes, covering Helm charts, installation, deployment, and best practices.
A guide to building AI applications using the LangChain framework, covering core concepts, installation, and practical examples.
A comprehensive 2025 guide to Apache Iceberg, covering its architecture, ecosystem, and practical use for data lakehouse management.
Explores solutions like Apache XTable and Delta Lake Uniform for enabling interoperability between different data lakehouse table formats.
Argues that RAG system failures stem from data engineering issues like fragmented data and governance, not from model or vector database choices.
A developer shares the story of building Pangolin, an open-source lakehouse catalog, using an AI coding agent during a holiday break.
A technical guide on designing and implementing a modern data lakehouse architecture using the Apache Iceberg table format in 2025.
A look at 10 upcoming features and enhancements for the Apache Iceberg data lakehouse table format, expected in 2025.
A guide to setting up and using Dremio's Auto-Ingest feature for automated, event-driven data loading into Apache Iceberg tables from cloud storage.
A tutorial on using SQL with Apache Iceberg tables in the Dremio data lakehouse platform, covering setup and core operations.
A guide to understanding and using the cargo.toml file, the central configuration file for managing Rust projects and dependencies with Cargo.
Explores how Dremio and Apache Iceberg create AI-ready data by ensuring accessibility, scalability, and governance for machine learning workloads.
Explores using Python's pattern matching and comprehensions for efficient data cleaning, transformation, and analysis.
A hands-on tutorial for setting up a local data lakehouse with Apache Iceberg, Dremio, and Nessie using Docker in under 10 minutes.
Explores the differences between event and entity data modeling, when to use each approach, and practical design considerations for structuring data effectively.
Explains how Parquet handles schema evolution, including adding/removing columns and changing data types, for data engineers.
Explains encoding techniques in Parquet files, including dictionary, RLE, bit-packing, and delta encoding, to optimize storage and performance.
Explores compression algorithms in Parquet files, comparing Snappy, Gzip, Brotli, Zstandard, and LZO for storage and performance.
Explores how metadata in Parquet files improves data efficiency and query performance, covering file, row group, and column-level metadata.
An introduction to Apache Parquet, a columnar storage file format for efficient data processing and analytics.