Factlen ExplainerAnimal AIScientific BreakthroughJun 14, 2026, 9:08 PM· 4 min read· #5 of 5 in ai

AI Breakthrough Reveals Sperm Whales Communicate Using a Complex Phonetic Alphabet

Machine learning algorithms have decoded the structural foundations of sperm whale communication, revealing a linguistic system with vowels, dialects, and striking parallels to human speech.

By Factlen Editorial Team

Marine Biologists & AI Researchers 45%Animal Rights & Legal Advocates 30%Linguists & Cognitive Scientists 25%
Marine Biologists & AI Researchers
Focus on the technological breakthrough of using machine learning to decode non-human acoustic data.
Animal Rights & Legal Advocates
Argue that proof of complex language and culture should trigger a reevaluation of the legal rights and protections afforded to cetaceans.
Linguists & Cognitive Scientists
Fascinated by the structural parallels between whale codas and human phonetics, such as the use of vowels and dialects.

What's not represented

  • · Indigenous coastal communities
  • · Commercial fishing industry

Why this matters

This breakthrough fundamentally shifts our understanding of animal intelligence, proving that humans are not the only species with complex, rule-based language. It paves the way for interspecies communication and could radically alter global environmental conservation laws.

Key points

  • AI analysis reveals sperm whale clicks contain complex phonetic structures similar to human vowels.
  • Researchers identified 156 distinct 'codas' that form a foundational whale alphabet.
  • Different whale social groups utilize distinct dialects and conversational tempos.
  • The breakthrough was achieved by adapting large language models to process marine acoustic data.
  • Legal scholars are exploring how these findings could impact environmental and animal rights laws.
  • Project CETI aims to fully translate 20 specific whale expressions within the next five years.
156
Distinct codas identified
20M
Years of generational knowledge
90M
Years since shared human ancestor
20
Phrases targeted for translation

For decades, the clicks and groans of the ocean's largest predators were dismissed by many as rudimentary signals—simple broadcasts of location or basic instinct. But a groundbreaking application of artificial intelligence has upended that assumption, revealing that sperm whales communicate using a complex phonetic alphabet that closely mirrors the structure of human language.[1]

The discovery, spearheaded by the Cetacean Translation Initiative (Project CETI), marks a watershed moment in both marine biology and machine learning. By deploying advanced natural language processing algorithms against massive datasets of underwater recordings, researchers have decoded the structural foundations of sperm whale "codas"—the rapid sequences of clicks the animals use to converse in the deep.[1][5]

What the AI uncovered was not a simple Morse code of fixed meanings, but a highly dimensional linguistic system. The algorithms detected that whales vary the rhythm, tempo, and duration of their clicks to create the equivalent of human vowels and consonants. These subtle acoustic shifts allow the whales to convey a vast array of information, utilizing rising and falling tones in a manner strikingly similar to tonal human languages like Mandarin.[1][2]

Researchers have identified 156 distinct codas that function similarly to human vowels and consonants.
Researchers have identified 156 distinct codas that function similarly to human vowels and consonants.

"We are starting to see that these signals are organized in ways we didn't fully appreciate before," noted David Gruber, founder and president of Project CETI. The organization, which operates a primary research base off the coast of the Caribbean island of Dominica, has spent years dropping specialized hydrophones into the water to capture the continuous chatter of resident whale pods.[1][5]

The sheer volume of acoustic data collected by CETI would take human researchers lifetimes to manually annotate. Instead, the team utilized deep learning models—similar in architecture to the large language models that power modern generative AI—to find hidden patterns in the noise. However, because these models were originally designed for human speech, the team had to heavily modify their algorithms to process the unique acoustic properties of the marine environment.[4][6]

The results have been astonishing. Researchers have now identified at least 156 distinct codas and their basic components, forming a foundational "sperm whale phonetic alphabet." Even more remarkably, the AI analysis revealed that different social groups utilize distinct dialects, and that individual whales possess unique conversational tempos—much like humans who speak at varying speeds.[2][5]

The power of this communication network was vividly demonstrated in a recent recording released by the project. CETI scientists captured the audio and video of a sperm whale giving birth, an event that involved highly synchronized coordination among at least a dozen female whales. The AI analysis of the event highlighted a complex, communal conversation that guided and supported the mother, offering an unprecedented window into the rich cultural lives of cetaceans.[1][3]

Project CETI researchers deploy hydrophones off the coast of Dominica to capture thousands of hours of whale communication.
Project CETI researchers deploy hydrophones off the coast of Dominica to capture thousands of hours of whale communication.
The power of this communication network was vividly demonstrated in a recent recording released by the project.

The structural parallels between whale codas and human speech are particularly striking given the vast evolutionary distance between the two species. Humans and sperm whales last shared a common ancestor more than 90 million years ago. The fact that both species independently evolved such complex, rule-based communication systems suggests that the need for sophisticated social coordination drives the development of language across the animal kingdom.[1][6]

As the scientific understanding of whale communication deepens, the ethical and legal implications are beginning to ripple through the academic world. A new collaboration between Project CETI and New York University's More Than Human Life (MOTH) program is actively exploring how AI-assisted translations of animal language could reshape environmental law.[3]

Legal scholars and animal rights advocates argue that empirical proof of complex animal language could fundamentally alter how society views non-human sentience. If scientists can definitively prove that whales share generational knowledge, debate communal decisions, and express distinct cultural identities, the legal frameworks governing ocean conservation and animal welfare may face unprecedented pressure to evolve.[3][6]

While the AI has successfully mapped the structure of the whales' language, translating the actual meaning of their conversations remains a formidable challenge. Researchers caution that we are still in the infancy of interspecies translation. As Gruber explained, the current state of understanding is akin to a two-year-old learning to speak; the goal is to reach the comprehension level of a five-year-old in the coming years.[1]

Despite diverging 90 million years ago, humans and sperm whales independently evolved highly structured communication systems.
Despite diverging 90 million years ago, humans and sperm whales independently evolved highly structured communication systems.

To that end, Project CETI has set an ambitious target: to fully comprehend 20 distinct vocalized expressions—such as commands related to diving, foraging, or resting—within the next five years. Achieving this will require not only more data and refined AI models, but also a deeper contextual understanding of whale behavior to map specific sounds to specific actions.[1][4][6]

The implications of this research extend far beyond the ocean. By proving that machine learning can decode the communication systems of alien intelligences right here on Earth, CETI is laying the groundwork for a new era of biological discovery. It is a profound reminder that humanity is not alone in its capacity for complex thought, culture, and conversation.[1][2][6]

How we got here

  1. 1950s

    Scientists first confirm that sperm whales actively vocalize underwater.

  2. 2020

    Project CETI is founded to apply advanced machine learning to marine communication.

  3. 2024

    Researchers identify the foundational components of a sperm whale phonetic alphabet.

  4. October 2025

    NYU's MOTH program publishes research on the legal impacts of AI-assisted animal translation.

  5. April 2026

    CETI releases breakthrough findings on whale vowels and footage of a synchronized birth event.

Viewpoints in depth

The Biological & AI Perspective

Focuses on the technological triumph of decoding non-human data.

For marine biologists and computer scientists, the breakthrough represents a triumph of interdisciplinary collaboration. By treating whale codas not as random noise but as a structured dataset, researchers were able to apply the same natural language processing tools that power modern AI chatbots. The challenge lay in adapting these human-centric models to the unique acoustic environment of the ocean, proving that machine learning can uncover hidden biological patterns that human observers would miss.

The Linguistic Perspective

Examines the structural similarities between whale codas and human speech.

Linguists are fascinated by the revelation that sperm whales use rhythm, tempo, and tone to convey meaning, much like human vowels and consonants. The discovery that whales utilize rising and falling tones—similar to Mandarin—and possess distinct regional dialects suggests that the fundamental rules of language are not exclusively human. Instead, complex communication appears to be a convergent evolutionary trait driven by the need for sophisticated social coordination.

The Legal & Ethical Perspective

Explores how proof of animal language could reshape conservation laws.

Legal scholars and animal rights advocates view this research as a potential turning point in environmental law. If science can definitively prove that whales possess generational culture, debate decisions, and communicate complex thoughts, the legal definition of sentience may need to be expanded. Advocates argue that such findings could provide the empirical foundation needed to grant cetaceans enhanced legal rights, fundamentally altering how humanity interacts with the natural world.

What we don't know

  • The specific meanings of the 156 identified codas.
  • Whether sperm whales discuss abstract concepts or only immediate environmental factors.
  • How the AI models will scale to decode the languages of other cetacean species.

Key terms

Coda
A rapid sequence of clicks used by sperm whales to communicate with one another.
Phonology
The branch of linguistics that deals with systems of sounds within a language.
Hydrophone
An underwater microphone designed to record or listen to underwater sound.
Natural Language Processing (NLP)
A branch of artificial intelligence focused on enabling computers to understand, interpret, and generate language.
Cetacean
A marine mammal of the order Cetacea, which includes whales, dolphins, and porpoises.

Frequently asked

Can the AI actually translate what the whales are saying?

Not yet. While AI has successfully decoded the complex phonetic structure of the language, mapping those specific sounds to actual meanings remains an ongoing challenge.

How do sperm whales make these clicking sounds?

They generate clicks using specialized organs located above their jaw, pushing air through a pair of 'lips' inside their head to produce powerful acoustic signals.

Why is Project CETI based in Dominica?

The deep waters off the coast of the Eastern Caribbean island are home to a resident population of sperm whales, providing researchers with consistent access for acoustic monitoring.

What are the legal implications of this research?

Legal scholars argue that proving whales possess complex language and generational culture could strengthen the case for granting them enhanced legal protections and rights.

Sources

Source coverage

6 outlets

3 viewpoints surfaced

Marine Biologists & AI Researchers 45%Animal Rights & Legal Advocates 30%Linguists & Cognitive Scientists 25%
  1. [1]The GuardianLinguists & Cognitive Scientists

    Analysis shows whales' coda vocalizations are 'highly complex' and remarkably similar to our own

    Read on The Guardian
  2. [2]Psychology TodayLinguists & Cognitive Scientists

    Hidden Parallels in Whale Language

    Read on Psychology Today
  3. [3]New York UniversityAnimal Rights & Legal Advocates

    What if We Understood What Animals Are Saying?

    Read on New York University
  4. [4]BioneersMarine Biologists & AI Researchers

    Project CETI's mission to crack the code of sperm whale communication

    Read on Bioneers
  5. [5]WikipediaMarine Biologists & AI Researchers

    Project CETI

    Read on Wikipedia
  6. [6]Factlen Editorial TeamMarine Biologists & AI Researchers

    Synthesis by Factlen editorial team

    Read on Factlen Editorial Team
Stay informed

Every angle. Every day.

Get ai stories with full source coverage and perspective breakdowns delivered to your inbox.