Deep Learning articles

6/14/2018 • EN

The Hitchhiker's Guide to Hyperparameter Tuning

A practical guide to implementing a hyperparameter tuning script for machine learning models, based on real-world experience from Taboola's engineering team.

Deep Learning Hyperparameter Tuning Machine Learning Neural Networks Scikit Learn

Yoel Zeldes

4/8/2018 • EN

Policy Gradient Algorithms

A comprehensive overview of policy gradient algorithms in reinforcement learning, covering key concepts, notations, and various methods.

algorithms Deep Learning Machine Learning Policy Gradient Reinforcement Learning

Lilian Weng

3/22/2018 • EN

Gated Multimodal Units for Information Fusion

Explains the Gated Multimodal Unit (GMU), a deep learning architecture for intelligently fusing data from different sources like images and text.

Attention Mechanism Deep Learning Multimodal Fusion Neural Networks TensorFlow

Yoel Zeldes

2/19/2018 • EN

A (Long) Peek into Reinforcement Learning

An introductory guide to Reinforcement Learning (RL), covering key concepts, algorithms like SARSA and Q-learning, and its role in AI breakthroughs.

artificial intelligence Deep Learning Machine Learning Q Learning Reinforcement Learning

Lilian Weng

12/31/2017 • EN

Object Detection for Dummies Part 3: R-CNN Family

Explores the R-CNN family of models for object detection, covering R-CNN, Fast R-CNN, Faster R-CNN, and Mask R-CNN with technical details.

CNN computer vision Deep Learning Object Detection R-CNN

Lilian Weng

12/17/2017 • EN

Training Sequence Models with Attention

Practical tips for training sequence-to-sequence models with attention, focusing on debugging and ensuring the model learns to condition on input.

Attention Mechanism Deep Learning Language Model Neural Networks Sequence To Sequence

Awni Hannun

12/15/2017 • EN

Object Detection for Dummies Part 2: CNN, DPM and Overfeat

Explores classic CNN architectures for image classification, including AlexNet, VGG, and ResNet, as foundational models for object detection.

CNN computer vision Convolutional Neural Networks Deep Learning Object Detection

Lilian Weng

12/4/2017 • EN

The Last 5 Years In Deep Learning

A retrospective on the transformative impact of deep learning over the past five years, covering its rise, key applications, and future potential.

AI computer vision Deep Learning Machine Learning Neural Networks

Adit Deshpande

11/15/2017 • EN

After PyData Warsaw 2017

A recap of PyData Warsaw 2017, covering key talks, new package announcements, and analytics on the conference's international attendees.

Data Science Deep Learning Machine Learning PyData Python

Piotr Migdał

10/11/2017 • EN

Speech Recognition Is Not Solved

Argues that speech recognition hasn't reached human-level performance, highlighting persistent challenges with accents, noise, and semantic errors.

Accent Recognition ASR Deep Learning speech recognition Word Error Rate

Awni Hannun

9/28/2017 • EN

Anatomize Deep Learning with Information Theory

Explores applying information theory, specifically the Information Bottleneck method, to analyze training phases and learning bounds in deep neural networks.

Deep Learning Information Bottleneck Information Theory Neural Networks Training Dynamics

Lilian Weng

8/20/2017 • EN

From GAN to WGAN

Explains the math behind GANs, their training challenges, and introduces WGAN as a solution for improved stability.

Deep Learning GAN Generative Adversarial Networks Machine Learning WGAN

Lilian Weng

8/17/2017 • EN

PyTorch or TensorFlow?

A comparison of PyTorch and TensorFlow deep learning frameworks, focusing on programmability, flexibility, and ease of use for different project scales.

Deep Learning Machine Learning Frameworks Neural Networks PyTorch TensorFlow

Awni Hannun

8/1/2017 • EN

How to Explain the Prediction of a Machine Learning Model?

Explores the importance of interpreting ML model predictions, especially in regulated fields, and reviews methods like linear regression and interpretable models.

Deep Learning Ethics Explainable AI Machine Learning Model Interpretability

Lilian Weng

7/25/2017 • EN

How I Used Deep Learning To Train A Chatbot To Talk Like Me (Sorta)

A developer explores using deep learning and sequence-to-sequence models to train a chatbot on personal social media data to mimic their conversational style.

Chatbot Deep Learning Machine Learning Natural Language Processing Neural Networks

Adit Deshpande