Research

Auto Added by WPeMatico

Equivalence between policy gradients and soft Q-learning

Artificial Intelligence, Research

Equivalence between policy gradients and soft Q-learning Read More »

One-shot imitation learning

Artificial Intelligence, Research

One-shot imitation learning Read More »

Evolution strategies as a scalable alternative to reinforcement learning

Artificial Intelligence, Research

We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.

Evolution strategies as a scalable alternative to reinforcement learning Read More »

Spam detection in the physical world

Artificial Intelligence, Research

We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.

Spam detection in the physical world Read More »

Unsupervised sentiment neuron

Artificial Intelligence, Research

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

Unsupervised sentiment neuron Read More »