Research

Multi-Goal Reinforcement Learning: Challenging robotics environments and request for research

Ingredients for robotics research

We’re releasing eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay, all developed for our research over the past year. We’ve used these environments to train models which work on physical robots. We’re also releasing a set of requests for robotics research.

Some considerations on learning to explore via meta-reinforcement learning

Artificial Intelligence, Research

Hello GPT-4o

Artificial Intelligence, Research

We’re announcing GPT-4o (GPT-4 Omni), our new flagship model, which can reason across audio, vision, and text in real time.

Understanding the source of what we see and hear online

Artificial Intelligence, Research

Today we’re introducing new technology to help researchers identify content created by our tools and joining the Coalition for Content Provenance and Authenticity Steering Committee to promote industry standards.

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Artificial Intelligence, Research

Today’s LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model’s original instructions with their own malicious prompts.

Video generation models as world simulators

Artificial Intelligence, Research

We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of

Building an early warning system for LLM-aided biological threat creation

Artificial Intelligence, Research

We’re developing a blueprint for evaluating the risk that a large language model (LLM) could aid someone in creating a biological threat. In an evaluation involving both biology experts and students, we found that GPT-4 provides at most a mild uplift in biological threat creation accuracy. While this uplift is not large enough to be conclusive,

Democratic inputs to AI

Artificial Intelligence, Research

Our nonprofit organization, OpenAI, Inc., is launching a program to award ten $100,000 grants to fund experiments in setting up a democratic process for deciding what rules AI systems should follow, within the bounds defined by the law.

Improving mathematical reasoning with process supervision

Artificial Intelligence, Research

We’ve trained a model to achieve a new state of the art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply rewarding the correct final answer (“outcome supervision”). In addition to boosting performance relative to outcome supervision, process supervision also has an important alignment benefit: it directly trains the model to
