Most AI benchmarks measure intelligence and instruction-following rather than psychological safety. Humane Bench evaluates ...
MIT Technology Review’s senior reporter for features and investigations, Eileen Guo, and FT tech correspondent Melissa ...