
Reinforcement Learning
Deep Q-Network Solves Cart and Pole – Reinforcement Learning Code Project
- Frank
- March 28, 2022
- agent environment
- AI
- AlphaGo
- artificial intelligence
- artificial neural network
- Bellman equation
- CNN
- Deep Learning
- Deep Q-network
- DQN
- Education
- experience replay
- Machine Learning
- markov decision process
- MDP
- Neural Network
- OpenAI Five
- OpenAI Gym
- policy gradients
- policy network
- Python
- PyTorch
- Q-learning
- Q-value
- Reinforcement Learning
- replay memory
- SGD
- stochastic gradient descent
- Supervised Learning
- TensorFlow
- Tutorial
- Unsupervised Learning
In this episode, learn how to use a deep Q-network to solve the Cart and Pole environment.
Read More
Reinforcement Learning
Can AI Learn to Cooperate?
- Frank
- April 9, 2021
- actor critic methods
- cooperative reinforcement learning
- ddpg
- deep deterministic policy gradients
- maddpg
- maddpg algorithm
- maddpg openai gym
- maddpg pytorch
- maddpg tutorial
- multi agent actor critic
- multi agent actor critic algorithm
- multi agent actor critic explained
- multi agent actor critic tutorial
- multi agent actor deep deterministic policy gradients
- multi agent deep deterministic policy gradients
- multi agent reinforcement learning
- policy gradients
Machine Learning with Phil covers Multi Agent Deep Deterministic Policy Gradients (MADDPG) in this video. Multi agent deep deterministic policy gradients is one of the first successful algorithms for multi agent artificial intelligence. Cooperation and competition among AI agents is going to be critical as applications of deep learning expand in our daily lives. In […]
Read More