Up until today, Mrinank Sharma was the head of the safeguards research team at Anthropic, the company behind popular AI chatbot Claude. He stepped down on Monday and publicly published the letter he ...
Anthropic has entrusted Amanda Askell to endow its chatbot, Claude, with a sense of right and wrong.
Researchers at the company are trying to understand their A.I. system’s mind—examining its neurons, running it through psychology experiments, and putting it on the therapy couch.