Why your AI might be biased (and what you can do about it)
Explains the causes of bias in AI systems, focusing on training data and proxy variables, and offers practical steps for developers to mitigate it.
Explains the causes of bias in AI systems, focusing on training data and proxy variables, and offers practical steps for developers to mitigate it.
Explores the importance of data quality and validation in data engineering, covering key dimensions and tools for reliable pipelines.
Notes on dataset engineering from Chip Huyen's 'AI Engineering', covering data curation, quality, coverage, quantity, and acquisition for AI models.
An interview with Salma Bakouk, CEO of Sifflet, discussing data stack observability, data quality, lineage, and building a modern data team.
Explores the importance of high-quality human-annotated data for training AI models, covering task design, rater selection, and the wisdom of the crowd.
Interview with Chad Sanderson on data platform leadership, experimentation culture, data quality, and the rise of data contracts.
An introduction to Great Expectations, an open-source Python tool for data quality testing, documentation, and profiling.
Adding a PDF course completion report for students in a SaaS application built with Python and Django.
Explores six unexpected challenges that arise after deploying machine learning models in production, from data schema changes to organizational issues.
An enterprise architect discusses the challenges of data validation speed, automation, and the essential role of human intuition in ensuring data quality.