The History of Speech Recognition to the Year 2030
A forecast of speech recognition technology's evolution from 2010 to 2030, analyzing past progress and predicting future trends.
A forecast of speech recognition technology's evolution from 2010 to 2030, analyzing past progress and predicting future trends.
Profile of Amazon applied scientist Eugene Yan, focusing on his career in data science and his influential technical writing about machine learning.
Explores methods like semi-supervised and active learning to create training labels when labeled datasets are unavailable, with industry examples.
Explains how to bootstrap training labels for a semantic search system using initial lexical search and user click data instead of costly human annotation.
A talk on system design for recommendation and search systems, covering architecture and production considerations.
An in-depth technical explanation of diffusion models, a class of generative AI models that create data by reversing a noise-adding process.
A comprehensive deep learning course covering fundamentals, neural networks, computer vision, and generative models using PyTorch.
A comprehensive deep learning course overview with PyTorch tutorials, covering fundamentals, neural networks, and advanced topics like CNNs and GANs.
A data scientist shares practical strategies and mindsets for influencing technical teams and driving change without formal authority.
Analyzes the legal implications of GitHub Copilot potentially being a derivative work of GPL-licensed code used in its training.
Explores system design patterns for industrial-scale recommendation and search engines, focusing on offline/online components and retrieval/ranking stages.
Explores machine learning patterns like bandits, sequential, and graph-based models for personalizing recommendations and search results.
A guide to implementing few-shot learning using the GPT-Neo language model and Hugging Face's inference API for NLP tasks.
An interview with data scientist Eugene Yan discussing his career path from psychology to Amazon, favorite ML projects, and advice for aspiring data scientists.
A high-level guide to tools and methods for understanding AI/ML models and their predictions, known as Explainable AI (XAI).
Explores the distinction between using regression models for causal inference versus predictive inference, and the role of generalizability in prediction.
Explores the strategic 'metagame' of applying machine learning in industry, focusing on problem selection and business impact over pure technical knowledge.
Explores how mutual information and KL divergence can be used to derive information-theoretic generalization bounds for Stochastic Gradient Descent (SGD).
A guest post sharing personal stories of imposter syndrome in tech and academia, with lessons on recognizing and managing self-doubt.
A developer builds a Chrome extension using TensorFlow.js to toggle dark/light mode on Netlify by clapping hands.