Safety

Auto Added by WPeMatico

Announcing the OpenAI Safety Fellowship

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

A pilot program to support independent safety and alignment research and develop the next generation of talent

Announcing the OpenAI Safety Fellowship Read More »

Introducing the OpenAI Safety Bug Bounty program

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

OpenAI launches a Safety Bug Bounty program to identify AI abuse and safety risks, including agentic vulnerabilities, prompt injection, and data exfiltration.

Introducing the OpenAI Safety Bug Bounty program Read More »

Introducing the OpenAI Safety Bug Bounty program

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

OpenAI launches a Safety Bug Bounty program to identify AI abuse and safety risks, including agentic vulnerabilities, prompt injection, and data exfiltration.

Introducing the OpenAI Safety Bug Bounty program Read More »

Helping developers build safer AI experiences for teens

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

OpenAI releases prompt-based teen safety policies for developers using gpt-oss-safeguard, helping moderate age-specific risks in AI systems.

Helping developers build safer AI experiences for teens Read More »

Creating with Sora Safely

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

To address the novel safety challenges posed by a state-of-the-art video model as well as a new social creation platform, we’ve built Sora 2 and the Sora app with safety at the foundation. Our approach is anchored in concrete protections.

Creating with Sora Safely Read More »

Creating with Sora Safely

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

Creating with Sora Safely Read More »

How we monitor internal coding agents for misalignment

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

How OpenAI uses chain-of-thought monitoring to study misalignment in internal coding agents—analyzing real-world deployments to detect risks and strengthen AI safety safeguards.

How we monitor internal coding agents for misalignment Read More »

OpenAI Japan announces Japan Teen Safety Blueprint to put teen safety first

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

OpenAI Japan announces the Japan Teen Safety Blueprint, introducing stronger age protections, parental controls, and well-being safeguards for teens using generative AI.

OpenAI Japan announces Japan Teen Safety Blueprint to put teen safety first Read More »

An update on our mental health-related work

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

OpenAI shares updates on its mental health safety work, including parental controls, trusted contacts, improved distress detection, and recent litigation developments.

An update on our mental health-related work Read More »

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT

ai, AI (Artificial Intelligence), Artificial Intelligence, Safety

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration.

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT Read More »