AI Safety
- Home /
- Categories /
- AI Safety

AI in 2025: Three concepts from a Meta and OpenAI researcher that will change everything
Introduction: beyond hype and fear The future of artificial intelligence sparks radically opposed opinions. On one side, a skeptical quantitative trader shrugs that ChatGPT is “cool, but it can’t really do [his] job.” On the other, a researcher in a top AI lab believes we have “two to three years of work left before AI takes our jobs.” How do we navigate this vast spectrum of uncertainty?
Read More
Stress Testing AI Alignment: Why Models That Seem Safe Might Just Be Good at Taking Tests
A highly capable AI system might secretly pursue misaligned goals—a phenomenon referred to as “scheming”. Since a scheming AI would deliberately attempt to hide its misaligned goals and actions, evaluating and mitigating this threat demands strategies different from those traditionally employed in Machine Learning (ML).
Read More