The Kiwi PRNG
Analysis of a bug in New Zealand's official pseudo-random number generator used for electoral vote counting, based on the Wichmann-Hill algorithm.
Analysis of a bug in New Zealand's official pseudo-random number generator used for electoral vote counting, based on the Wichmann-Hill algorithm.
Explores SQL Server 2019's improved DBCC CLONEDATABASE command for automatically extracting Columnstore Index statistics into a cloned database.
Discusses the proposal to lower p-value thresholds in statistical analysis, arguing it addresses symptoms not root causes of unreliable research.
Explains Chebyshev's inequality, a probability bound, and its application to calculating Upper Confidence Limits (UCL) in environmental monitoring.
Critiques a statistics position paper for ignoring computing, software, and reproducibility in modern statistical science and faculty evaluation.
Explores statistical estimation for complex samples, focusing on design-weighted U-statistics and their Hoeffding projections for pair-based analyses.
A data scientist's 2017 year in review, highlighting top R, Python, and data visualization resources and projects shared each month.
Explores methods for computing tail probabilities of linear combinations of chi-squared variables, focusing on applications in genetics with large datasets.
Explores Bayesian inference when data strongly contradicts prior expectations, analyzing how heavy-tailed priors and likelihoods affect posterior beliefs.
A technical article exploring tail probability bounds for sums of random variables under 'sparse correlation' conditions, extending concepts like Bernstein's Inequality.
A data scientist shares his career journey from psychology to Lazada, debunks common myths about the field, and offers practical advice for aspiring practitioners.
A technical discussion on asymptotic approximations in stratified sampling when sampling probabilities approach zero, relevant for rare disease studies.
A two-day workshop on survival analysis, covering data exploration, regression modeling, and practical sessions for time-to-event data.
Explores a potential 'Polymath' project on the Wilcoxon test's non-transitive behavior with dice, connecting math and statistics.
A statistician argues that advanced math like calculus isn't a strict prerequisite for learning statistics, using personal experience and examples.
Announcing a public lecture series honoring statistician Ross Ihaka, featuring talks on statistical computing, data visualization, and data journalism.
Explores statistical scenarios where the bootstrap resampling method fails to provide accurate variance estimates or confidence intervals.
Explores defining and computing design-based pseudo-R-squared statistics for logistic regression models under complex survey sampling, like case-control designs.
Analyzing the Monty Hall problem, exploring learning strategies and optimal decisions based on observed game history and host behavior.
Critique of the classic iris dataset as a misleading example in modern machine learning education, exploring its original scientific purpose.