Canada's 15.8M Addresses
A technical guide on downloading and analyzing Canada's National Address Register (15.8M addresses) using Python, DuckDB, and QGIS to create settlement centroids.
A technical guide on downloading and analyzing Canada's National Address Register (15.8M addresses) using Python, DuckDB, and QGIS to create settlement centroids.
A monthly roundup of tech links covering DuckDB, Kafka, data visualization, and stream processing, with highlights and commentary.
A guide to building a data pipeline using DuckDB, covering data ingestion, transformation, and analytics with real-world environmental data.
A guide to exporting SQL queries from DuckDB's new UI notebooks using a custom Bash script for backup or sharing.
A hands-on review of the new DuckDB UI, exploring its features for data analysis and comparing it to previous workflows with Rill Data.
A technical walkthrough of loading and exploring UK Environment Agency flood data using DuckDB and Rill for a streaming pipeline project.
How to use DuckDB's COLUMNS expression to automatically rename fields when using SELECT * across joined tables, avoiding ambiguous column names.
jOOQ 3.20 released with new SQL dialect support for ClickHouse and Databricks, expanded DuckDB features, and various new modules and improvements.
A guide to performing data operations using PySpark, Pandas, DuckDB, Polars, and DataFusion within a pre-configured Docker environment.
A benchmark analysis of DuckDB's performance on a massive 1.1 billion row NYC taxi dataset, comparing it to other database technologies.
A technical guide to solving the One Billion Row Challenge (1BRC) using SQL and DuckDB, including data loading and aggregation.
A technical guide on extracting and analyzing OpenStreetMap (OSM) features using PBF files, DuckDB, and QGIS for data filtering.
A technical guide to downloading, processing, and visualizing global geospatial datasets from the public domain Natural Earth project using tools like AWS S3, DuckDB, and QGIS.
A technical guide on accessing, downloading, and analyzing Maxar's freely available satellite imagery from their Open Data Programme using tools like GDAL, Python, and DuckDB.
A technical review of the Clickgis extension, which adds WKB and GeoJSON geospatial support to the ClickHouse database.
A tutorial on using Uber's H3 geospatial indexing system for clustering and visualization within DuckDB and QGIS.
A technical walkthrough of scraping and visualizing global airline passenger route data using Python, DuckDB, and QGIS.
A technical overview of DuckDB's new Spatial extension, detailing its geospatial file format support, core dependencies, and setup process.
A quick guide on using DuckDB's SQL commands to efficiently convert CSV files to the Parquet format, including schema modifications.
How to handle mismatched Parquet file schemas when querying multiple files in DuckDB using the UNION_BY_NAME option.