2025 highlights: AI research and code
A 2025 AI research review covering tabular machine learning, the societal impacts of AI scale, and open-source data-science tools.
A 2025 AI research review covering tabular machine learning, the societal impacts of AI scale, and open-source data-science tools.
The author announces their new role as Probabl's CSO to accelerate development of the scikit-learn machine learning library and its ecosystem.
A tutorial on using the {fs} package in R for easier file path manipulation, extension management, and directory information retrieval.
A guide for R users to learn basics of Python, HTML, CSS, JS, and C++ to enhance their data science and web development projects.
Explains the key difference between AI models and algorithms, using linear regression and OLS as examples.
The article discusses the spin-off of scikit-learn's open-source development from Inria to a new mission-driven enterprise, Probabl, focusing on sustainable funding and growth.
A guide to useful RStudio shortcuts, settings, and tips to improve productivity and code readability for R programming.
Announcement of an upcoming O'Reilly book titled 'Python Polars: The Definitive Guide', a comprehensive guide to the Polars DataFrame library.
Archive of a cohort-based online course teaching developers and researchers how to use the command line for automation and data science tasks.
A retrospective on forming a research team in 2022 to apply machine learning to challenges in health and social sciences, including data management and validation.
Author announces closing his data science training company after seven years and shares his new role as a Senior Machine Learning Engineer.
Scikit-learn foundation seeks a community and partnerships developer to grow the open-source ecosystem and foster industry sponsorships.
A data scientist's 2020 review, focusing on machine learning projects for healthcare, including mining COVID-19 EHR data and brain signal analysis.
A researcher reviews their 2019 scientific work, focusing on computational statistics for brain imaging and data science.
An analysis and English translation of Jacek Kaczmarski's poem 'The Statues', exploring the legacy of tyranny.
The article critiques the overuse and devaluation of the titles 'Engineer' and 'Scientist' in modern IT, focusing on data science and engineering roles.
An exploration of predictive analytics, its historical roots in human nature, and its modern implementation through data science and AI technologies.
A researcher's 2018 highlights: using machine learning for cognitive brain mapping, analyzing non-curated data, and contributing to scikit-learn development.
Explains four levels of customer targeting, from no segmentation to advanced recommendation systems, and their business applications.
A summary of a Python Frederick meetup featuring Christine Lee's presentation on data science tools and features available in Python.