Microsoft's 2026 Global ML Building Footprints
Analysis of Microsoft's 2026 Global ML Building Footprints dataset, including technical setup and data exploration using DuckDB and QGIS.
Analysis of Microsoft's 2026 Global ML Building Footprints dataset, including technical setup and data exploration using DuckDB and QGIS.
Analyzing All The Places' open-source location data project, detailing the technical setup and process for downloading and examining millions of brand locations.
A technical analysis and comparison of various administrative boundary datasets, including OpenStreetMap, using Python, DuckDB, and QGIS.
Analyzing Business Insider's dataset on US data center locations, ownership, and resource consumption using Python, DuckDB, and QGIS.
A technical walkthrough of converting the US Wind Turbine Database to Parquet format and analyzing it using tools like GDAL, DuckDB, and QGIS.
A guide on importing Java Flight Recorder (JFR) profiling data into DuckDB for analysis using SQL queries.
Exploring the GM-SEUS dataset of US solar farms using GIS tools like QGIS and DuckDB for spatial data analysis.
A technical exploration of the ICMM's global mining dataset, detailing the setup, tools, and process for data analysis using Python, DuckDB, and QGIS.
An analysis of Statistics Canada's Open Database of Buildings (ODB) dataset, covering data processing, tools used, and technical setup.
A monthly roundup of 78 curated links on data engineering, architecture, AI, and tech trends, with top picks highlighted.
A tutorial on using Positron's Connections Pane to connect to and query DuckDB databases efficiently, especially for handling large datasets.
A monthly roundup of tech links covering data lakehouses (DuckLake, Iceberg), Kafka, event streaming, and stream processing developments.
A guide on using the new ArcGIS Pro add-in to download and work with Overture Maps Foundation's global geospatial datasets via Parquet files and DuckDB.
An analysis of DuckLake, a new open table format and catalog specification for data engineering, comparing it to existing solutions like Iceberg and Delta Lake.
A technical guide on downloading and analyzing Canada's National Address Register (15.8M addresses) using Python, DuckDB, and QGIS to create settlement centroids.
A monthly roundup of tech links covering DuckDB, Kafka, data visualization, and stream processing, with highlights and commentary.
A guide to building a data pipeline using DuckDB, covering data ingestion, transformation, and analytics with real-world environmental data.
A guide to exporting SQL queries from DuckDB's new UI notebooks using a custom Bash script for backup or sharing.
A hands-on review of the new DuckDB UI, exploring its features for data analysis and comparing it to previous workflows with Rill Data.
A technical walkthrough of loading and exploring UK Environment Agency flood data using DuckDB and Rill for a streaming pipeline project.