Anthropic built its brand on warning that advanced AI could cause catastrophic harm. Now the company behind Claude is facing a sharper question: can an AI lab claim to make the technology safer while racing to become more powerful?
The company’s answer is that it has to compete if it wants influence over how advanced AI is built and governed. Anthropic describes its mission as being “to ensure the world safely makes the transition through transformative AI.”
Several former employees said Anthropic’s leaders and staff often see the company as a responsible actor in a race that will happen with or without them. CEO Dario Amodei put the argument in competitive terms in a company-posted conversation, saying, “You have to find a way to actually be competitive.”
That approach has made Anthropic a central player in the AI boom. The company is now among the leading developers of frontier models, courts government customers and has been valued at almost $1 trillion.
But critics say the safety-first pitch leaves a problem unresolved. If Anthropic believes the most powerful AI should be built by the people most alert to its risks, it still asks the public to trust a small group of private executives with decisions that could affect governments, workers and militaries.
The tension has already surfaced inside and outside the company. Anthropic’s partnership with Palantir to offer AI services to U.S. intelligence and defense agencies drew internal debate, according to former employees. The company later released a Claude safeguard aimed at blocking some frontier AI development uses, then walked it back after industry criticism and said it had not found the right balance.



