Michael Toth
Michael Toth is a consulting data scientist and writer known for clear, practical guides to data visualization in R. His work focuses on ggplot, effective graph design, and helping data scientists communicate insights with impact.
Michael Toth is a consulting data scientist and writer known for clear, practical guides to data visualization in R. His work focuses on ggplot, effective graph design, and helping data scientists communicate insights with impact.
Hilary Parker is a data scientist focused on the intersection of data science and product, with experience at Stitch Fix, Etsy, and the Biden 2020 campaign. She co-hosts the popular Not So Standard Deviations podcast and writes about reproducibility, statistics, and real-world data work.
Noam Ross is a research software engineer and data scientist writing about open-source software, reproducible research, and the ethics of technology. His blog explores tooling choices, CI/CD workflows, and the social impact of software in scientific work.
Oscar Baruffa is a Senior Analytics Manager and data professional helping organisations improve data maturity and build systems from scratch. With a background spanning engineering, strategy, and analytics, he also supports professionals transitioning into data-driven careers.
Edwin Thoen’s blog explores practical insights in data science, machine learning, and R programming. He writes about reproducible workflows, model management, data analysis, and strategies to avoid overengineering and improve project outcomes.
Tom Mock’s blog focuses on R programming, data visualization, and reproducible research. He shares practical tutorials for RMarkdown, Quarto, tidyverse, and creative data presentation techniques.
Randy Zwitch is a software engineer specializing in Python and data engineering. His blog features detailed tutorials on building and optimizing Python tools like PyArrow with GPU/CUDA support, Docker workflows, and high-performance data processing.
Zoe Locke writes about the R programming ecosystem, covering package development, testing, data manipulation, and tips for R users at all levels. Her posts include practical advice, tutorials, and insights for R developers and data scientists.
Mara Averick is a data enthusiast and R programmer who shares insights, tutorials, and tips on data analysis, visualization, and reproducible workflows. She creates and explores tools in the R ecosystem, including packages like {datapasta}, and enjoys making data more accessible and visually engaging.
Cédric is an independent data visualization specialist and ecologist who helps organizations communicate insights through engaging visualizations and reproducible data products. He combines analytical expertise, programming, and design to create interactive graphics, reports, and web applications.
Jakub Nowosad is a computational geographer and Associate Professor at Adam Mickiewicz University, also serving as a Visiting Scientist at the University of Münster. He develops open-source tools and spatial methods for reproducible, scalable environmental and ecological analysis, and co-authors Geocomputation with R and Geocomputation with Python.
Garrick Aden-Buie is a Software Engineer for Shiny at Posit (formerly RStudio), building tools with R, Shiny, and R Markdown to support data science workflows. He is also an educator and former data scientist with experience in health analytics and smart home research.
Andrew Heiss is a researcher and educator focused on data visualization, causal inference, and applied statistics using R and Bayesian methods. He writes extensively about reproducible research, GIS, and analytical workflows, and teaches data science and social science methods.
Yihui Xie is a statistician and software developer best known for his work on R Markdown, knitr, and tools for reproducible research. He blogs about R, web technologies, publishing workflows, and practical programming techniques.
Musings of a Young Kenyan is a personal tech blog sharing hands-on projects, programming tutorials, and reflections on data science and software development. It features practical guides, automation experiments, and thoughtful takes on technology and problem-solving.
Thomas Lumley writes thoughtful, in-depth articles on statistics, data analysis, and statistical modeling. His blog explores topics like survey methods, regression, simulations, and inference with a rigorous yet reflective approach.
John Blischak is a software developer and data scientist who writes about R programming, bioinformatics, and open-source tools for data analysis and visualization.