Stories by Marian-Andrei Rizoiu

AI safety measures found to be shallow as models still struggle to grasp harmful intent

The main problem is that the model can generate harmful content, but isn’t truly aware of what is harmful, or why it should refuse to generate it.
Published 08 Oct, 2025 04:25pm

TECHNOLOGY: PROMPTED TO LIE

AI models are trained to refuse harmful requests, but new research reveals their safety measures are alarmingly shallow.
Published 05 Oct, 2025 07:10am