DuckDB tricks - renaming fields in a SELECT * across tables
How to use DuckDB's COLUMNS expression to automatically rename fields when using SELECT * across joined tables, avoiding ambiguous column names.
jOOQ 3.20 released with new SQL dialect support for ClickHouse and Databricks, expanded DuckDB features, and various new modules and improvements.
A guide to performing data operations using PySpark, Pandas, DuckDB, Polars, and DataFusion within a pre-configured Docker environment.
A benchmark analysis of DuckDB's performance on a massive 1.1 billion row NYC taxi dataset, comparing it to other database technologies.
A technical guide to solving the One Billion Row Challenge (1BRC) using SQL and DuckDB, including data loading and aggregation.
A technical guide on extracting and analyzing OpenStreetMap (OSM) features using PBF files, DuckDB, and QGIS for data filtering.
A technical guide to downloading, processing, and visualizing global geospatial datasets from the public-domain Natural Earth project using tools like AWS S3, DuckDB, and QGIS.
A technical guide on accessing, downloading, and analyzing Maxar's freely available satellite imagery from their Open Data Programme using tools like GDAL, Python, and DuckDB.
A technical review of the Clickgis extension, which adds WKB and GeoJSON geospatial support to the ClickHouse database.
A tutorial on using Uber's H3 geospatial indexing system for clustering and visualization within DuckDB and QGIS.
A technical walkthrough of scraping and visualizing global airline passenger route data using Python, DuckDB, and QGIS.
A technical overview of DuckDB's new Spatial extension, detailing its geospatial file format support, core dependencies, and setup process.
A quick guide on using DuckDB's SQL commands to efficiently convert CSV files to the Parquet format, including schema modifications.
How to handle mismatched Parquet file schemas when querying multiple files in DuckDB using the UNION_BY_NAME option.
Exploring DuckDB's geospatial extensions and features for optimizing a large-scale fiber-optic network data pipeline.
A technical walkthrough of using dbt and DuckDB to clean and analyze session feedback data from a tech conference.
A hands-on exploration of using dbt (data build tool) with DuckDB for local data engineering, based on a tutorial project.
Analyzing conference session ratings using DuckDB and Jupyter Notebooks to demonstrate data wrangling and SQL on raw CSV data.