All About Parquet Part 04 - Schema Evolution in Parquet
Explains how Parquet handles schema evolution, including adding/removing columns and changing data types, for data engineers.
Explains how Parquet handles schema evolution, including adding/removing columns and changing data types, for data engineers.
Explains the hierarchical structure of Parquet files, detailing how pages, row groups, and columns optimize storage and query performance.
A guide on using the FFmpeg command-line tool to convert PNG images to the smaller, web-friendly JPG format.
An in-depth exploration of the undocumented JDK Flight Recorder (JFR) file format, detailing its structure and uses.
An exploration of the undocumented JDK Flight Recorder (JFR) file format, detailing its structure and potential uses for analysis.
A technical guide on parsing and unzipping files in Go, exploring the ZIP file format structure and implementing a basic parser.
An analysis of the CSV data format, covering its advantages, drawbacks, and common parsing pitfalls in data processing.
A technical guide explaining the JPEG compression algorithm and demonstrating how to write a basic JPEG decoder from scratch using Python.
A macOS app for converting HEIC images to JPEG or PNG, addressing compatibility issues with Apple's newer image format.
Explains how to create a self-reproducing zip file that contains itself, drawing parallels to self-replicating programs.