
But Whisper has a major flaw: It is prone to making up chunks of text or even entire sentences, according to interviews with more than a dozen software engineers, developers and academic researchers. Those experts said some of the invented text — known in the industry as hallucinations — can include racial commentary, violent rhetoric and even imagined medical treatments.
Experts said that such fabrications are problematic because Whisper is being used in a slew of industries worldwide to translate and transcribe interviews, generate text in popular consumer technologies and create subtitles for videos.
More concerning, they said, is a rush by medical centers to utilize Whisper-based tools to transcribe patients’ consultations with doctors, despite OpenAI’ s warnings that the tool should not be used in “high-risk domains.”
The full extent of the problem is difficult to discern, but researchers and engineers said they frequently have come across Whisper’s hallucinations in their work. A University of Michigan researcher conducting a study of public meetings, for example, said he found hallucinations in eight out of every 10 audio transcriptions he inspected, before he started trying to improve the model.
A machine learning engineer said he initially discovered hallucinations in about half of the over 100 hours of Whisper transcriptions he analyzed. A third developer said he found hallucinations in nearly every one of the 26,000 transcripts he created with Whisper.
The problems persist even in well-recorded, short audio samples. A recent study by computer scientists uncovered 187 hallucinations in more than 13,000 clear audio snippets they examined.
That trend would lead to tens of thousands of faulty transcriptions over millions of recordings, researchers said.
この記事は Techlife News の November 02, 2024 版に掲載されています。
7 日間の Magzter GOLD 無料トライアルを開始して、何千もの厳選されたプレミアム ストーリー、9,000 以上の雑誌や新聞にアクセスしてください。
すでに購読者です ? サインイン
この記事は Techlife News の November 02, 2024 版に掲載されています。
7 日間の Magzter GOLD 無料トライアルを開始して、何千もの厳選されたプレミアム ストーリー、9,000 以上の雑誌や新聞にアクセスしてください。
すでに購読者です? サインイン

SOCIAL MEDIA BLOCKS FOR MINORS IN THE U.S.: A NATIONAL DEBATE ON KIDS AND TECH
Across the United States, lawmakers and families alike are wrestling with a critical question: should minors be barred from social media to protect their well-being?

IOS 18.4 BETA 2: HERE'S EVERYTHING NEW
Apple has rolled out the second developer beta of iOS 18.4 this Monday, March 3, offering a fresh batch of enhancements and refinements to its mobile operating system, now available for testing among registered developers.

PRIVATE LUNAR LANDER BLUE GHOST TOUCHES DOWN: A NEW MOON MILESTONE
Earlier this week, the moon welcomed a new visitor as Firefly Aerospace's Blue Ghost lunar lander nailed a soft touchdown, marking a historic win for private space exploration.

UBER TEAMS UP WITH WAYMO TO START SELLING DRIVERLESS RIDES IN AUSTIN, TEXAS
Uber shifted gears in Austin, Texas, earlier this week, launching a landmark service that lets riders hail driverless cars through its app, thanks to a partnership with Waymo, the autonomous vehicle arm of Google’s parent company, Alphabet.

ALWAYS "ON" EMPLOYEES AND BURNOUT: A DEEP DIVE INTO TODAY'S WORKPLACE CRISIS
The modern workplace has morphed into a relentless machine, with employees tethered to their devices, perpetually reachable, and increasingly drained—a phenomenon dubbed the “always on” culture.

TSMC'S BIG U.S. BET: $100 BILLION FUELS CHIPMAKING SURGE
Taiwan Semiconductor Manufacturing Company (TSMC), the world’s top chipmaker, unveiled a massive $100 billion investment to expand its U.S. operations this Monday, March 3, standing alongside President Donald Trump at the White House.

NASA'S TWO STUCK ASTRONAUTS ARE FINALLY CLOSING IN ON THEIR RETURN TO EARTH AFTER 9 MONTHS IN SPACE
NASA’s two stuck astronauts are just a few weeks away from finally returning to Earth after nine months in space.

MICROSOFT SHUTTING DOWN SKYPE IN MAY: A 22-YEAR LEGACY ENDS
Microsoft dropped a seismic announcement confirming it will shutter Skype, the pioneering internet calling service, on May 5, 2025— ending a 22-year run that reshaped how the world connects. The company is steering users toward Microsoft Teams’ free consumer version, part of a broader strategy to streamline its communications offerings under a single banner.

SEVERANCE SEASON 2 ON APPLE TV+: MIND-BENDING DRAMA HITS NEW HEIGHTS
Apple TV+’s Severance has taken the streaming world by storm this week, solidifying its place as the platform’s most-watched series ever, surpassing even Ted Lasso’s three-season run, Apple proudly announced last month.

OPENAI'S GPT-4.5 ARRIVES: BIGGER, BOLDER, AND READY TO CHAT
OpenAI turned heads last week with the launch of GPT-4.5, its largest and most ambitious chatbot model yet, unveiled as a research preview for ChatGPT Pro users on February 27, 2025.