Research

Auto Added by WPeMatico

GamePad: A learning environment for theorem proving

Artificial Intelligence, Research

GamePad: A learning environment for theorem proving Read More »

Learning policy representations in multiagent systems

Artificial Intelligence, Research

Learning policy representations in multiagent systems Read More »

AI and compute

Artificial Intelligence, Research

We’re releasing an analysis showing that since 2012, the amount of compute used in the largest AI training runs has been increasing exponentially with a 3.4-month doubling time (by comparison, Moore’s Law had a 2-year doubling period)[^footnote-correction]. Since 2012, this metric has grown by more than 300,000x (a 2-year doubling period would yield only a

AI and compute Read More »

Variance reduction for policy gradient with action-dependent factorized baselines

Artificial Intelligence, Research

Variance reduction for policy gradient with action-dependent factorized baselines Read More »

Retro Contest

Artificial Intelligence, Research

We’re launching a transfer learning contest that measures a reinforcement learning algorithm’s ability to generalize from previous experience.

Retro Contest Read More »

Gotta Learn Fast: A new benchmark for generalization in RL

Artificial Intelligence, Research

Gotta Learn Fast: A new benchmark for generalization in RL Read More »

Evolved Policy Gradients

Artificial Intelligence, Research

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, like learning to navigate to an object on a

Evolved Policy Gradients Read More »

Reptile: A scalable meta-learning algorithm

Artificial Intelligence, Research

We’ve developed a simple meta-learning algorithm called Reptile which works by repeatedly sampling a task, performing stochastic gradient descent on it, and updating the initial parameters towards the final parameters learned on that task. Reptile is the application of the Shortest Descent algorithm to the meta-learning setting, and is mathematically similar to first-order MAML (which

Reptile: A scalable meta-learning algorithm Read More »

On first-order meta-learning algorithms

Artificial Intelligence, Research

On first-order meta-learning algorithms Read More »

Improving GANs using optimal transport

Artificial Intelligence, Research

Improving GANs using optimal transport Read More »