2023  3

March  1

Prompt Engineering

March 15, 2023 · 21 min · Lilian Weng

January  2

The Transformer Family Version 2.0

January 27, 2023 · 45 min · Lilian Weng

Large Transformer Model Inference Optimization

January 10, 2023 · 31 min · Lilian Weng

2022  4

September  1

Some Math behind Neural Tangent Kernel

September 8, 2022 · 17 min · Lilian Weng

June  1

Generalized Visual Language Models

June 9, 2022 · 25 min · Lilian Weng

April  1

Learning with not Enough Data Part 3: Data Generation

April 15, 2022 · 28 min · Lilian Weng

February  1

Learning with not Enough Data Part 2: Active Learning

February 20, 2022 · 22 min · Lilian Weng

2021  6

December  1

Learning with not Enough Data Part 1: Semi-Supervised Learning

December 5, 2021 · 26 min · Lilian Weng

September  1

How to Train Really Large Models on Many GPUs?

September 24, 2021 · 21 min · Lilian Weng

July  1

What are Diffusion Models?

July 11, 2021 · 26 min · Lilian Weng

May  1

Contrastive Representation Learning

May 31, 2021 · 39 min · Lilian Weng

March  1

Reducing Toxicity in Language Models

March 21, 2021 · 23 min · Lilian Weng

January  1

Controllable Neural Text Generation

January 2, 2021 · 42 min · Lilian Weng

2020  5

October  1

How to Build an Open-Domain Question Answering System?

October 29, 2020 · 33 min · Lilian Weng

August  1

Neural Architecture Search

August 6, 2020 · 32 min · Lilian Weng

June  1

Exploration Strategies in Deep Reinforcement Learning

June 7, 2020 · 36 min · Lilian Weng

April  1

The Transformer Family

April 7, 2020 · 25 min · Lilian Weng

January  1

Curriculum for Reinforcement Learning

January 29, 2020 · 24 min · Lilian Weng

2019  6

November  1

Self-Supervised Representation Learning

November 10, 2019 · 38 min · Lilian Weng

September  1

Evolution Strategies

September 5, 2019 · 22 min · Lilian Weng

June  1

Meta Reinforcement Learning

June 23, 2019 · 22 min · Lilian Weng

May  1

Domain Randomization for Sim2Real Transfer

May 5, 2019 · 15 min · Lilian Weng

March  1

Are Deep Neural Networks Dramatically Overfitted?

March 14, 2019 · 22 min · Lilian Weng

January  1

Generalized Language Models

January 31, 2019 · 36 min · Lilian Weng

2018  9

December  1

Object Detection Part 4: Fast Detection Models

December 27, 2018 · 19 min · Lilian Weng

November  1

Meta-Learning: Learning to Learn Fast

November 30, 2018 · 30 min · Lilian Weng

October  1

Flow-based Deep Generative Models

October 13, 2018 · 21 min · Lilian Weng

August  1

From Autoencoder to Beta-VAE

August 12, 2018 · 21 min · Lilian Weng

June  1

Attention? Attention!

June 24, 2018 · 21 min · Lilian Weng

May  1

Implementing Deep Reinforcement Learning Models with Tensorflow + OpenAI Gym

May 5, 2018 · 13 min · Lilian Weng

April  1

Policy Gradient Algorithms

April 8, 2018 · 52 min · Lilian Weng

February  1

A (Long) Peek into Reinforcement Learning

February 19, 2018 · 31 min · Lilian Weng

January  1

The Multi-Armed Bandit Problem and Its Solutions

January 23, 2018 · 10 min · Lilian Weng

2017  10

December  2

Object Detection for Dummies Part 3: R-CNN Family

December 31, 2017 · 13 min · Lilian Weng

Object Detection for Dummies Part 2: CNN, DPM and Overfeat

December 15, 2017 · 7 min · Lilian Weng

October  2

Object Detection for Dummies Part 1: Gradient Vector, HOG, and SS

October 29, 2017 · 15 min · Lilian Weng

Learning Word Embedding

October 15, 2017 · 18 min · Lilian Weng

September  1

Anatomize Deep Learning with Information Theory

September 28, 2017 · 9 min · Lilian Weng

August  2

From GAN to WGAN

August 20, 2017 · 21 min · Lilian Weng

How to Explain the Prediction of a Machine Learning Model?

August 1, 2017 · 18 min · Lilian Weng

July  2

Predict Stock Prices Using RNN: Part 2

July 22, 2017 · 9 min · Lilian Weng

Predict Stock Prices Using RNN: Part 1

July 8, 2017 · 12 min · Lilian Weng

June  1

An Overview of Deep Learning for Curious People

June 21, 2017 · 12 min · Lilian Weng