Michael Toth
Michael Toth is a consulting data scientist and writer known for clear, practical guides to data visualization in R. His work focuses on ggplot, effective graph design, and helping data scientists communicate insights with impact.
Hilary Parker is a data scientist focused on the intersection of data science and product, with experience at Stitch Fix, Etsy, and the Biden 2020 campaign. She co-hosts the popular Not So Standard Deviations podcast and writes about reproducibility, statistics, and real-world data work.
Noam Ross is a research software engineer and data scientist writing about open-source software, reproducible research, and the ethics of technology. His blog explores tooling choices, CI/CD workflows, and the social impact of software in scientific work.
Oscar Baruffa is a Senior Analytics Manager and data professional helping organisations improve data maturity and build systems from scratch. With a background spanning engineering, strategy, and analytics, he also supports professionals transitioning into data-driven careers.
Allie Coleman writes about real-world software engineering through practical stories and experiments, with a strong focus on Ruby on Rails, performance, observability, and modern tools like ChatGPT. Her posts blend hands-on problem solving with thoughtful takes on product and engineering practices.
Edwin Thoen’s blog offers practical insights into data science, machine learning, and R programming. He writes about reproducible workflows, model management, data analysis, and strategies to avoid overengineering and improve project outcomes.
Tom Mock’s blog focuses on R programming, data visualization, and reproducible research. He shares practical tutorials for RMarkdown, Quarto, tidyverse, and creative data presentation techniques.
Benjamin Smith’s blog RObservations focuses on R programming, data analysis, and statistical computing. He shares tutorials, package development insights, and practical examples for data visualization, social network analysis, and reproducible research.
Randy Zwitch is a software engineer specializing in Python and data engineering. His blog features detailed tutorials on building and optimizing Python tools like PyArrow with GPU/CUDA support, Docker workflows, and high-performance data processing.
Zoe Locke writes about the R programming ecosystem, covering package development, testing, data manipulation, and tips for R users at all levels. Her posts include practical advice, tutorials, and insights for R developers and data scientists.
Colin Fay is a data scientist and engineer at ThinkR, focused on building production-grade Shiny apps and innovative webR solutions. He is an international speaker, open-source contributor, and shares insights on R, NodeJS, and data engineering through his blog and talks.
Mara Averick is a data enthusiast and R programmer who shares insights, tutorials, and tips on data analysis, visualization, and reproducible workflows. She creates and explores tools in the R ecosystem, including packages like {datapasta}, and enjoys making data more accessible and visually engaging.
Tal Galili is an R developer and data scientist passionate about the R community. He is the creator and maintainer of popular R packages such as installr and heatmaply, and frequently shares tutorials, insights, and updates about R, data visualization, and statistical computing.
Kieran Healy is a Professor of Sociology at Duke University, specializing in social networks, data visualization, and sociological theory. He is the author of several books, including The Ordinal Society and Data Visualization.
Cédric is an independent data visualization specialist and ecologist who helps organizations communicate insights through engaging visualizations and reproducible data products. He combines analytical expertise, programming, and design to create interactive graphics, reports, and web applications.
Jakub Nowosad is a computational geographer and Associate Professor at Adam Mickiewicz University, also serving as a Visiting Scientist at the University of Münster. He develops open-source tools and spatial methods for reproducible, scalable environmental and ecological analysis, and co-authors Geocomputation with R and Geocomputation with Python.
Garrick Aden-Buie is a Software Engineer for Shiny at Posit (formerly RStudio), building tools with R, Shiny, and R Markdown to support data science workflows. He is also an educator and former data scientist with experience in health analytics and smart home research.
Christophe Dervieux is a software engineer at Posit (formerly RStudio) working on the R Markdown ecosystem, including rmarkdown, knitr, blogdown, and bookdown. He shares tips and experiences with R, reproducible reporting, and open-source contributions.
Andrew Heiss is a researcher and educator focused on data visualization, causal inference, and applied statistics using R and Bayesian methods. He writes extensively about reproducible research, GIS, and analytical workflows, and teaches data science and social science methods.
Yihui Xie is a statistician and software developer best known for his work on R Markdown, knitr, and tools for reproducible research. He blogs about R, web technologies, publishing workflows, and practical programming techniques.