gpt-oss-safeguard technical report
- ID
- 361
- Status
- new
- Published
- 29 Oct 2025, 8:00 AM
- Fetched
- 27 Jun 2026, 7:47 PM
- Provider
- OpenAI News
- Category
- ai-labs
- Original URL
- https://openai.com/index/gpt-oss-safeguard-technical-report
- Source URL
- https://openai.com/news/rss.xml
Excerpt
gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models and trained to reason from a provided policy in order to label content under that policy. In this report, we describe gpt-oss-safeguard’s capabilities and provide our baseline safety evaluations on the gpt-oss-safeguard models, using the underlying gpt-oss models as a baseline. For more information about the development and architecture of the underlying gpt-oss models, see the original gpt-oss model model card.
Summary
No summary yet. It will appear after the daemon summarizes this item.