Thomas Lumley

Thomas Lumley writes thoughtful, in-depth articles on statistics, data analysis, and statistical modeling. His blog explores topics like survey methods, regression, simulations, and inference with a rigorous yet reflective approach.

https://notstatschat.rbind.io


1/25/2026

statistics data analysis statistical modeling applied mathematics research methods

Articles from this Blog

215 articles from this blog

11/15/2013 • EN

Moving the goalposts?

A critique of a proposal to lower the p-value threshold for statistical significance from 0.05 to 0.005, arguing it addresses symptoms, not root causes.

statistics Hypothesis Testing Bayesian Inference

10/20/2013 • EN

Barren proxies

Explores the concept of 'barren proxies' in causal inference, arguing that measurement reliability is more critical than the proxy's barrenness.

measurement statistics Causal Inference

10/6/2013 • EN

Rock, paper, scissors, Wilcoxon test

Explores non-transitivity in games like rock-paper-scissors, its history, and connections to statistics, evolution, and voting systems.

mathematics statistics Probability

9/13/2013 • EN

An absolutely minimal way to increase invited speaker diversity

A simple two-stage list method to increase diversity among invited speakers at tech and academic conferences.

conferences technology Diversity

8/8/2013 • EN

In defense of theory

Argues for the importance of statistical theory in data science, using examples from medical research to show where abstract theory solved practical problems.

statistics bootstrap Data Science

8/4/2013 • EN

Some failure modes of statistics research talks

A critique of common pitfalls and unproductive patterns in statistics research presentations, aimed at improving academic discourse.

data analysis research methodology statistics

7/15/2013 • EN

Graphs and counterfactuals

Explores the equivalence between causal graphs and counterfactual reasoning in statistics, simplifying the connection between two major causal inference frameworks.

statistics Causal Inference Causal Graphs

7/8/2013 • EN

Big data linear models

Explains how to parallelize QR decomposition for linear models on big data using R's biglm package and incremental merging.

Big Data R Parallel Computing

7/8/2013 • EN

Sparse linear systems and calibration of weights

Explores using sparse matrix techniques in R to efficiently calibrate survey weights for large-scale population data.

sparse matrices R Programming Statistical Computing

7/6/2013 • EN

Problems with faithfulness and the causal Markov property (II)

Explores limitations of causal graph assumptions in statistical modeling, discussing when variables like poverty or diet may violate the faithfulness condition.

Statistical Modeling Causal Inference Causal Graphs

7/2/2013 • EN

Problems with faithfulness and the causal Markov property (I)

Examines statistical challenges with the causal Markov and faithfulness properties, focusing on measurement error's impact on causal inference.

statistics Causal Inference Measurement Error

6/28/2013 • EN

Two simple notes on error in regression models

Explores the concept of 'error' in regression models, clarifying when it represents measurement error versus model prediction error.

data analysis statistics Regression

6/27/2013 • EN

When is Bayesian introductory statistics better?

Compares Bayesian vs frequentist statistics for introductory courses, highlighting pedagogical pros and cons of each approach.

pedagogy Bayesian Statistics Frequentist Statistics

6/8/2013 • EN

My Setup

A statistics professor details his hardware and software setup, including Mac laptops, R, LaTeX, and plans to learn JavaScript.

statistics LaTeX Emacs

6/7/2013 • EN

Talks in the near future

A summary of upcoming technical talks on statistical computing, rare DNA variant analysis, and handling large datasets with R and SQL.

sql data analysis statistics
