policy gradients

Reinforcement Learning

Can AI Learn to Cooperate?

In this video, Machine Learning with Phil covers Multi-Agent Deep Deterministic Policy Gradients (MADDPG), one of the first successful algorithms for multi-agent artificial intelligence. Cooperation and competition among AI agents will be critical as applications of deep learning expand in our daily lives. In […]
