Research

Auto Added by WPeMatico

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications Read More »

How AI training scales

We’ve discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks. Since complex tasks tend to have noisier gradients, increasingly large batch sizes are likely to become useful in the future, removing one potential limit to further growth of AI systems. More

How AI training scales Read More »

Computational limitations in robust classification and win-win results

Artificial Intelligence, Research

Computational limitations in robust classification and win-win results Read More »

Better language models and their implications

Artificial Intelligence, Research

We’ve trained a large-scale unsupervised language model which generates coherent paragraphs of text, achieves state-of-the-art performance on many language modeling benchmarks, and performs rudimentary reading comprehension, machine translation, question answering, and summarization—all without task-specific training.

Better language models and their implications Read More »

Spinning Up in Deep RL

Artificial Intelligence, Research

We’re releasing Spinning Up in Deep RL, an educational resource designed to let anyone learn to become a skilled practitioner in deep reinforcement learning. Spinning Up consists of crystal-clear examples of RL code, educational exercises, documentation, and tutorials.

Spinning Up in Deep RL Read More »

Quantifying generalization in reinforcement learning

Artificial Intelligence, Research

We’re releasing CoinRun, a training environment which provides a metric for an agent’s ability to transfer its experience to novel situations and has already helped clarify a longstanding puzzle in reinforcement learning. CoinRun strikes a desirable balance in complexity: the environment is simpler than traditional platformer games like Sonic the Hedgehog but still poses a worthy generalization challenge

Quantifying generalization in reinforcement learning Read More »

Reinforcement learning with prediction-based rewards

Artificial Intelligence, Research

We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average human performance on Montezuma’s Revenge.

Reinforcement learning with prediction-based rewards Read More »

Plan online, learn offline: Efficient learning and exploration via model-based control

Artificial Intelligence, Research

Plan online, learn offline: Efficient learning and exploration via model-based control Read More »

Learning concepts with energy functions

Artificial Intelligence, Research

We’ve developed an energy-based model that can quickly learn to identify and generate instances of concepts, such as near, above, between, closest, and furthest, expressed as sets of 2d points. Our model learns these concepts after only five demonstrations. We also show cross-domain transfer: we use concepts learned in a 2d particle environment to solve tasks on

Learning concepts with energy functions Read More »

The International 2018: Results

Artificial Intelligence, Research

OpenAI Five lost two games against top Dota 2 players at The International in Vancouver this week, maintaining a good chance of winning for the first 20–35 minutes of both games.

The International 2018: Results Read More »