Data analysis articles

10/26/2023 • EN

Yet Another How-to on Labelling Bar Graphs in ggplot2

A tutorial on customizing bar chart labels in ggplot2, focusing on placing category labels above bars and styling visualizations.

Bar Chart data analysis data visualization Ggplot2 R

Cedric Scherer

10/22/2023 • EN

RObservations #48: Exploring All Possible Hands in 5 card Poker

A technical blog post demonstrating how to use R programming to computationally enumerate all possible 5-card poker hands from a standard deck.

combinatorics data analysis programming R statistics

Benjamin Smith

10/15/2023 • EN

Better data analysis with logic programming

Explores using logic programming (Prolog) for data analysis, demonstrating its application on a diamond pricing dataset to build robust models.

data analysis Logic Programming Prolog R Statistical Modeling

Emir U

10/5/2023 • EN

A Review of Esri's Spatial Data Science MOOC

A review of Esri's Spatial Data Science MOOC, covering the history of GIS, ArcGIS Pro's features, and the author's training experience.

Arcgis Pro data analysis Geographic Information Systems Spatial Data Science 깃

Mark Litwintschik

8/3/2023 • EN

Numbers or Brackets for numeric questions?

Analysis of using numerical inputs vs. brackets for survey questions like age and income, focusing on UX and data analysis trade-offs.

data analysis Numerical Data Survey Design User Input ux

Lea Verou

4/22/2023 • EN

R for Everyone: Analytical Superpowers in under 10 Minutes!

A quick-start guide to using the R programming language for data analysis, covering installation, data exploration, and basic plotting with the iris dataset.

data analysis Iris Dataset R Programming Rstudio Statistical Computing

Holger K. von Jouanne-Diedrich

3/28/2023 • EN

The 30DayChartChallenge is Ready to Kick Off

An introduction to the #30DayChartChallenge, a community data visualization event with daily prompts for April, including its origins and format.

Charting Community Challenge data analysis Data Science data visualization

Cedric Scherer

3/24/2023 • EN

AI-assisted computer interfaces of the future

Explores a future AI-assisted computer interface model inspired by sci-fi, where AI highlights data anomalies for human specialist review.

ai computer vision data analysis Human AI Interaction user interface

Hugo

12/7/2022 • EN

Read and Visualize your Twitter Archive

A technical tutorial on using R to read, analyze, and visualize your downloaded Twitter archive data, including tweets, likes, and ad history.

data analysis R Tidyverse Twitter Archive Visualization

Garrick Aden-Buie

11/16/2022 • EN

Rolling summaries with {slider} in R

A tutorial demonstrating how to use the R `slider` package for rolling window analysis, using NFL quarterback performance data as an example.

data analysis R Slider Tidyverse Time Series

Tom Mock

11/5/2022 • EN

Bus pruning

Analysis of Auckland bus cancellations using R and GTFS data to visualize which trips are being removed from the timetable.

data analysis Gtf Public Transport R

Thomas Lumley

10/14/2022 • EN

Current 22 - Session Analysis with DuckDB and Jupyter Notebook

Analyzing conference session ratings using DuckDB and Jupyter Notebooks to demonstrate data wrangling and SQL on raw CSV data.

data analysis docker Duckdb Jupyter Notebook sql

Robin Moffatt

10/3/2022 • EN

State of CSS 2022 now open!

The State of CSS 2022 survey is now open, gathering developer feedback on new CSS features, pain points, and usage patterns.

css data analysis Frontend Survey Web Development

Lea Verou

9/26/2022 • EN

Pandas Groupby Warning

A warning about a subtle pandas groupby issue that can lead to incorrect data aggregation sums if missing values are not handled properly.

data analysis Groupby Pandas Python Warning

Chris Moffitt

9/12/2022 • EN

Futurist prediction methods and accuracy

An analysis of futurist prediction methods, comparing accurate forecasters with those who have poor track records.

data analysis forecasting methodology prediction models superforecasting

Dan Luu

8/31/2022 • EN

Inside the Sausage Factory: How we Built the Program for Current 2022

A behind-the-scenes look at how the program committee used data and tools to select talks for the Current 2022 and Kafka Summit tech conferences.

Airtable conferences data analysis program committee Sessionize

Robin Moffatt

6/21/2022 • EN

KQL lessons learnt from #365daysofKQL

A security engineer shares key lessons and query patterns learned from a year-long #365daysofKQL challenge, focusing on threat hunting and log analysis.

cybersecurity data analysis Kql query language Threat Hunting

Matt Zorich

6/13/2022 • EN

Using Document Properties to Track Your Excel Reports

A guide to embedding source notebook metadata in Excel reports using Python's pandas and xlsxwriter to simplify tracking and refreshing analyses.

data analysis Excel Pandas Python Xlsxwriter

Chris Moffitt

6/13/2022 • EN

Beautiful tables in R with gtExtras

Introducing gtExtras, an R package for creating beautiful, functional tables with opinionated themes and inline graphics.

data analysis data visualization Gt Gtextras R

Tom Mock

5/20/2022 • EN

Marginalia: A guide to figuring out what the heck marginal effects, marginal slopes, average marginal effects, marginal effects at the mean, and all these other marginal things are

A guide explaining marginal effects in regression analysis, including definitions and differences between types like average marginal effects, using R packages.

data analysis Marginal Effects R Regression statistics

Andrew Heiss