AI 'GOLD RUSH' FOR CHATBOT TRAINING DATA COULD RUN OUT OF HUMAN-WRITTEN TEXT
Techlife News|June 08, 2024
Artificial intelligence systems like ChatGPT could soon run out of what keeps making them smarter — the tens of trillions of words people have written and shared online.
AI 'GOLD RUSH' FOR CHATBOT TRAINING DATA COULD RUN OUT OF HUMAN-WRITTEN TEXT

A new study released Thursday by research group Epoch AI projects that tech companies will exhaust the supply of publicly available training data for AI language models by roughly the turn of the decade -- sometime between 2026 and 2032.

Comparing it to a “literal gold rush” that depletes finite natural resources, Tamay Besiroglu, an author of the study, said the AI field might face challenges in maintaining its current pace of progress once it drains the reserves of human generated writing.

In the short term, tech companies like ChatGPTmaker OpenAI and Google are racing to secure and sometimes pay for high-quality data sources to train their AI large language models – for instance, by signing deals to tap into the steady flow of sentences coming out of Reddit forums and news media outlets.

In the longer term, there won’t be enough new blogs, news articles and social media commentary to sustain the current trajectory of AI development, putting pressure on companies to tap into sensitive data now considered private — such as emails or text messages — or relying on less-reliable “synthetic data” spit out by the chatbots themselves.

“There is a serious bottleneck here,” Besiroglu said. “If you start hitting those constraints about how much data you have, then you can’t really scale up your models efficiently anymore. And scaling up models has been probably the most important way of expanding their capabilities and improving the quality of their output.”

The researchers first made their projections two years ago — shortly before ChatGPT’s debut — in a working paper that forecast a more imminent 2026 cutoff of high-quality text data. Much has changed since then, including new techniques that enabled AI researchers to make better use of the data they already have and sometimes “overtrain” on the same sources multiple times.

This story is from the June 08, 2024 edition of Techlife News.

Start your 7-day Magzter GOLD free trial to access thousands of curated premium stories, and 9,000+ magazines and newspapers.

This story is from the June 08, 2024 edition of Techlife News.

Start your 7-day Magzter GOLD free trial to access thousands of curated premium stories, and 9,000+ magazines and newspapers.

MORE STORIES FROM TECHLIFE NEWSView All
AUSTRALIA PLANS TO TAX DIGITAL PLATFORMS THAT DON'T PAY FOR NEWS
Techlife News

AUSTRALIA PLANS TO TAX DIGITAL PLATFORMS THAT DON'T PAY FOR NEWS

The Australian government said it will tax large digital platforms and search engines unless they agree to share revenue with Australian news media organizations.

time-read
2 mins  |
December 21, 2024
JAPAN'S NISSAN RESHUFFLES MANAGEMENT TO FIX ITS MONEY-LOSING BUSINESS
Techlife News

JAPAN'S NISSAN RESHUFFLES MANAGEMENT TO FIX ITS MONEY-LOSING BUSINESS

Embattled Japanese automaker Nissan has tapped Jeremie Papin, who was overseeing its U.S. operations, as its chief financial officer in a major management reshuffle billed as key to a turnaround.

time-read
2 mins  |
December 21, 2024
EPA AWARDS $135 MILLION TO CALIFORNIA TO PHASE OUT BIG DIESEL TRUCKS
Techlife News

EPA AWARDS $135 MILLION TO CALIFORNIA TO PHASE OUT BIG DIESEL TRUCKS

The Environmental Protection Agency is awarding $135 million in grants to fund 13 projects in California to help the state wean off fossil fuels and phase out big rigs that run on diesel.

time-read
1 min  |
December 21, 2024
NEARLY HALF OF US TEENS ARE ONLINE 'CONSTANTLY,' PEW REPORT FINDS
Techlife News

NEARLY HALF OF US TEENS ARE ONLINE 'CONSTANTLY,' PEW REPORT FINDS

Nearly half of American teenagers say they are online “constantly” despite concerns about the effects of social media and smartphones on their mental health, according to a new report published by the Pew Research Center.

time-read
1 min  |
December 21, 2024
OPENAI'S LEGAL BATTLE WITH ELON MUSK REVEALS INTERNAL TURMOIL OVER AVOIDING AI 'DICTATORSHIP'
Techlife News

OPENAI'S LEGAL BATTLE WITH ELON MUSK REVEALS INTERNAL TURMOIL OVER AVOIDING AI 'DICTATORSHIP'

A 7-year-old rivalry between tech leaders Elon Musk and Sam Altman over who should run OpenAI and prevent an artificial intelligence “dictatorship” is now heading to a federal judge as Musk seeks to halt the ChatGPT maker’s ongoing shift into a for-profit company.

time-read
3 mins  |
December 21, 2024
TECH TIP: HOW TO PROTECT YOUR COMMUNICATIONS THROUGH ENCRYPTION
Techlife News

TECH TIP: HOW TO PROTECT YOUR COMMUNICATIONS THROUGH ENCRYPTION

After a sprawling hacking campaign exposed the communications of an unknown number of Americans, U.S. cybersecurity officials are advising people to use encryption in their communications.

time-read
3 mins  |
December 21, 2024
TRUMP HOSTS APPLE CEO AT MAR-A-LAGO AS BIG TECH LEADERS CONTINUE OUTREACH TO PRESIDENT-ELECT
Techlife News

TRUMP HOSTS APPLE CEO AT MAR-A-LAGO AS BIG TECH LEADERS CONTINUE OUTREACH TO PRESIDENT-ELECT

Donald Trump hosted Apple CEO Tim Cook for a Friday evening dinner at the president-elect's Mar-a-Lago resort, according to a person familiar with the matter who was not authorized to comment publicly.

time-read
2 mins  |
December 21, 2024
MUSK SAYS US IS DEMANDING HE PAY PENALTY OVER DISCLOSURES OF HIS TWITTER STOCK PURCHASES
Techlife News

MUSK SAYS US IS DEMANDING HE PAY PENALTY OVER DISCLOSURES OF HIS TWITTER STOCK PURCHASES

Elon Musk says the Securities and Exchange Commission wants him to pay a penalty or face charges involving what he disclosed or failed to disclose - about his purchases of Twitter stock before he bought the social media platform in 2022.

time-read
2 mins  |
December 21, 2024
ELON MUSK WANTS TO TURN SPACEX'S STARBASE SITE INTO A TEXAS CITY
Techlife News

ELON MUSK WANTS TO TURN SPACEX'S STARBASE SITE INTO A TEXAS CITY

SpaceX is launching a new mission: making its Starbase site a new Texas city.

time-read
1 min  |
December 21, 2024
OPENAI RELEASES AI VIDEO GENERATOR SORA BUT LIMITS HOW IT DEPICTS PEOPLE
Techlife News

OPENAI RELEASES AI VIDEO GENERATOR SORA BUT LIMITS HOW IT DEPICTS PEOPLE

OpenAl has publicly released its new artificial intelligence video generator Sora but the company won't let most users depict people as it monitors for patterns of misuse.

time-read
1 min  |
December 21, 2024