Reinforcement Learning

AI Reinforcement Learning

Is Reinforcement Learning Still a Viable Path to AGI?

This video is from Machine Learning with Phil. While I have come out in support of John Carmack as a dark horse candidate to implement the world’s first AGI, we can’t discount one of the OGs of reinforcement learning: Richard Sutton. In a recent paper he outlined the path to AGI, and in the paper […]

Read More
AI Reinforcement Learning

AI Learns to Walk (deep reinforcement learning)

This video is from AI Warehouse. In this video an AI named Albert learns how to walk to escape 5 rooms I created. The AI was trained using Deep Reinforcement Learning, a method of Machine Learning which involves rewarding the agent for doing something correctly, and punishing it for doing anything incorrectly. Albert’s actions are […]

Read More
AI Reinforcement Learning

Multi-Armed Bandits: A Cartoon Introduction – DCBA #1

Academic Gamer provides this introduction to Multi-Armed Bandits, an exciting field of AI research that aims to address the exploration/exploitation dilemma. We discuss the problem formulation, popular strategies for addressing it (epsilon-first, epsilon-greedy, and UCB1), and several of the variations on the MAB problem currently pursued in research.

Read More
AI Reinforcement Learning

Multi-Armed Bandit : Data Science Concepts

This video from ritvikmath explores the Multi-Armed Bandit problem and Making decisions with limited information!

Read More
Livestream Red Hat Reinforcement Learning

Open Source Textbooks plus the Multi-Armed Bandit Reinforcement learning problem

Here’s a livestream remix of my AWS re:Invent talk, plus Andy shows up! No tags,

Read More
AI Reinforcement Learning

An AI Learns to Play Starcraft 2 with Reinforcement Learning

Interesting video tinkering with reinforcement learning via Stable Baselines 3 and Starcraft 2. Code and model: https://github.com/Sentdex/SC2RL

Read More
Reinforcement Learning

Can AI Learn to Cooperate?

Machine Learning with Phil covers Multi Agent Deep Deterministic Policy Gradients (MADDPG) in this video. Multi agent deep deterministic policy gradients is one of the first successful algorithms for multi agent artificial intelligence. Cooperation and competition among AI agents is going to be critical as applications of deep learning expand in our daily lives. In […]

Read More