DeepSeek's hidden warning for AI safety

Time | February 24, 2025

THE RELEASE OF DEEPSEEK R1 STUNNED WALL STREET and Silicon Valley in January, spooking investors and impressing tech leaders.

- BY BILLY PERRIGO

But amid all the talk, many overlooked a critical detail about the way the new Chinese artificial intelligence model functions, a nuance that has researchers worried about humanity's ability to control sophisticated new AI systems.

It's all down to an innovation in how DeepSeek R1 was trained, one that led to surprising behaviors in an early version of the model, which researchers described in the technical documentation accompanying its release.

During testing, researchers noticed that the model would spontaneously switch between English and Chinese while it was solving problems. When they forced it to stick to one language, thus making it easier for users to follow along, they found that the system's ability to solve the same problems would diminish.

That finding rang alarm bells for some AI-safety researchers. Currently, the most capable AI systems "think" in human-legible languages, writing out their reasoning before coming to a conclusion. That has been a boon for safety teams, whose most effective guardrails involve monitoring models' so-called chains of thought for signs of dangerous behaviors. But DeepSeek's results raised the possibility of a decoupling on the horizon: one where new AI capabilities might be gained from freeing models of the constraints of human language altogether.

To be sure, DeepSeek's language switching is not by itself cause for alarm. Instead, what worries researchers is the innovation that caused it. The DeepSeek paper describes a novel training method whereby the model was rewarded purely for getting correct answers, regardless of how comprehensible its thinking process was to humans. The worry is that this incentive-based approach could eventually lead AI systems to develop completely inscrutable ways of reasoning, maybe even creating their own nonhuman languages, if doing so proves to be more effective.
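To make the incentive concrete, here is a minimal sketch in Python of what "rewarded purely for getting correct answers" could look like next to a reward that also values readable reasoning. It is an illustration only, not DeepSeek's training code: the function names, the ASCII-based legibility proxy, and the `weight` parameter are all hypothetical.

```python
def outcome_only_reward(reasoning: str, answer: str, reference: str) -> float:
    """Score only the final answer; the reasoning text is never inspected."""
    return 1.0 if answer.strip() == reference.strip() else 0.0


def ascii_fraction(text: str) -> float:
    """Hypothetical, very crude legibility proxy: the share of
    whitespace-separated tokens made up of ASCII characters only."""
    tokens = text.split()
    if not tokens:
        return 0.0
    return sum(token.isascii() for token in tokens) / len(tokens)


def legibility_aware_reward(reasoning: str, answer: str, reference: str,
                            weight: float = 0.1) -> float:
    """Outcome reward plus a small bonus for single-language reasoning.
    The weight is an assumed value chosen for illustration."""
    return outcome_only_reward(reasoning, answer, reference) + weight * ascii_fraction(reasoning)


# Two reasoning traces that reach the same correct answer.
english_trace = "First add 2 and 3 to get 5"
mixed_trace = "First 加 2 和 3 得到 5"

same_score = (outcome_only_reward(english_trace, "5", "5")
              == outcome_only_reward(mixed_trace, "5", "5"))
prefers_english = (legibility_aware_reward(english_trace, "5", "5")
                   > legibility_aware_reward(mixed_trace, "5", "5"))
```

Under the first function the two traces score identically, so nothing in the training signal favors the one a human can follow. Only the second function, with its added legibility term, distinguishes them.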

