
OpenAI is training models to ‘confess’ when they lie – what it means for future AI


Image: antonioiacobelli/RooM via Getty Images



ZDNET’s key takeaways

  • OpenAI trained GPT-5 Thinking to confess to misbehavior.
  • It’s an early study, but it could lead to more trustworthy LLMs.
  • Models will often hallucinate or cheat due to mixed objectives.

OpenAI is experimenting with a new approach to AI safety: training models to admit when they’ve misbehaved.

In a study published Wednesday, researchers tasked a version of GPT-5 Thinking, the company’s…
