Reinforcement learning (RL) practitioners have produced a number of excellent tutorials. Most, however, describe RL in terms of mathematical equations and abstract diagrams. We like to think of the field from a different perspective. RL itself is inspired by how animals learn, so why not translate the underlying RL machinery back into the natural phenomena they’re designed to mimic? Humans learn best through stories. This is a story about the Actor Advantage Critic (A2C) model. Actor-Critic models are a popular form of Policy Gradient model, which is itself a vanilla RL algorithm. If you understand the A2C, you understand deep RL. After you’ve gained an intuition for the A2C, check out: Our of the A2C (for learning) or our industrial-strength based on model simple code implementation PyTorch version OpenAI’s TensorFlow Baselines , , and for a deep dive in RL Barto & Sutton’s Introduction to RL David Silver’s canonical course Yuxi Li’s overview Denny Britz’ GitHub repo for intuitive and practical coverage of deep learning in general, implemented in PyTorch fast.ai’s awesome course ’s , implemented in TensorFlow. Arthur Juliani tutorials on RL Illustrations by @embermarke

Twitter

Too Long; Didn't Read

In your car, at home, or at work — Bosch technology shapes many areas of life.

Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

Too Long; Didn't Read

People Mentioned

Company Mentioned

Rudy Gilman

Share Your Thoughts

About Author

TOPICS

Languages

THIS ARTICLE WAS FEATURED IN...

Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

Too Long; Didn't Read

People Mentioned

Company Mentioned

Rudy Gilman

Share Your Thoughts

About Author

TOPICS

Languages

THIS ARTICLE WAS FEATURED IN...

RELATED STORIES