News

To fix the way we test and measure models, AI is learning tricks from social science. It’s not easy being one of Silicon Valley’s favorite benchmarks. SWE-Bench (pronounced “swee bench”) launched in ...
For caretech providers, using data goes beyond compliance. Expert Tanya Johnston explains how data can be used to optimize scheduling, improve client outcomes, and drive better business decisions.
The new benchmark, called Elephant, makes it easier to spot when AI models are being overly sycophantic—but there’s no current fix. Back in April, OpenAI announced it was rolling back an update to its ...
As you may have seen, OpenAI has just released two new AI models – gpt‑oss‑20b and gpt‑oss-120b – which are the first open‑weight models from the firm since GPT‑2. These two models – one is more ...