Factlen Deep DiveHerculaneum ScrollsEvidence PackJun 19, 2026, 7:38 PM· 5 min read· #5 of 5 in science

How AI is Finally Reading the Lost Library of Herculaneum at Scale

Advanced machine learning models and high-resolution CT scans are now automating the digital unwrapping of the Herculaneum scrolls, recovering lost classical literature without touching the fragile artifacts.

By Factlen Editorial Team

Papyrologists and Classicists 35%Computer Scientists 35%Archaeological Preservationists 30%
Papyrologists and Classicists
Focused on the historical and literary value of the recovered texts.
Computer Scientists
Focused on the algorithmic challenges of segmentation and ink detection.
Archaeological Preservationists
Focused on protecting the physical integrity of the artifacts.

What's not represented

  • · The Italian Ministry of Culture (managing the physical site and access to the artifacts)
  • · Modern residents of Ercolano (living above the unexcavated portions of the ancient villa)

Why this matters

For centuries, the greatest barrier to understanding the ancient world was the physical destruction of its texts. By successfully automating the digital recovery of the Herculaneum scrolls, AI is not just solving an archaeological puzzle—it is unlocking a time capsule that could restore thousands of lost works of classical philosophy, history, and literature to humanity.

Key points

  • Advanced 3D CT scans and machine learning are allowing researchers to read the carbonized Herculaneum scrolls without physically opening them.
  • By early 2026, the Vesuvius Challenge successfully automated the digital unwrapping of nearly 70% of a single scroll.
  • AI models detect the presence of ink by recognizing microscopic textural changes on the papyrus surface, rather than relying on density.
  • A recently deciphered scroll revealed the exact burial location of the philosopher Plato, proving the historical value of the texts.
  • The project is now focused on scaling the technology to read the remaining 600 unopened scrolls in the collection.
800+
Carbonized scrolls recovered from the villa
2 µm
Resolution of the 3D CT scans
70%
Portion of Scroll PHerc. 172 digitally unwrapped by 2026
$1.8M
Total prize money awarded by the Vesuvius Challenge

In AD 79, the eruption of Mount Vesuvius unleashed a torrent of superheated gas and ash that buried the Roman towns of Pompeii and Herculaneum. While Pompeii is famous for its frozen human figures, Herculaneum holds a different kind of treasure: the Villa of the Papyri. Beneath 60 feet of volcanic mud, archaeologists in the 18th century discovered the only intact library to survive from classical antiquity.[1][8]

The library contained over 800 papyrus scrolls, but the intense heat of the eruption had flash-fried them into brittle, carbonized lumps resembling briquettes of charcoal. For more than 250 years, these scrolls represented one of archaeology's most agonizing paradoxes. The texts were right there, yet utterly inaccessible. Early attempts to physically unroll them proved disastrous, causing the fragile layers to shatter into dust and destroying whatever text they held.[3][5]

Today, that paradox is finally being resolved. Through a convergence of high-energy particle physics and advanced machine learning, an international coalition of researchers and technologists is reading the scrolls without ever opening them. The initiative, known as the Vesuvius Challenge, has transformed a centuries-old archaeological puzzle into a modern computer vision problem, and by mid-2026, the project has reached a critical inflection point: automation.[1][2]

The breakthrough relies on a technique called "virtual unwrapping," pioneered by computer scientist Brent Seales. The process begins at a particle accelerator, where the carbonized scrolls are bombarded with X-rays to create 3D computed tomography (CT) scans at a staggering resolution of 2 micrometers—roughly one-fiftieth the width of a human hair. These scans map the internal geometry of the scroll in billions of three-dimensional pixels, known as voxels.[3][7]

The four-step digital pipeline used to extract text from the fused layers of the scrolls.
The four-step digital pipeline used to extract text from the fused layers of the scrolls.

However, seeing the shape of the papyrus was only half the battle. The Romans wrote with a carbon-based ink made of soot and water. Because the ink and the charred papyrus have nearly identical densities, the letters do not show up on standard X-rays. To the human eye, the digital surface of the unwrapped scroll appears completely blank.[3][8]

This is where artificial intelligence entered the equation. In 2023, the Vesuvius Challenge offered massive cash prizes to anyone who could train a machine learning model to detect the ink. Researchers discovered that while the ink didn't change the density of the papyrus, it did subtly alter its physical texture. The ink sat on top of the papyrus fibers, creating microscopic ridges. AI models were trained to recognize this specific topographical signature, suddenly illuminating hidden Greek letters across the digital void.[1][7]

By early 2026, the technology had matured from a proof-of-concept into a scalable pipeline. The initial models required painstaking manual effort to trace the crumpled, fused layers of papyrus within the 3D scans—a process known as segmentation. It could take months to map a few square centimeters. But recent algorithmic leaps have largely automated this process, allowing computers to untangle the virtual geometry with minimal human intervention.[2][8]

By early 2026, the technology had matured from a proof-of-concept into a scalable pipeline.

The results of this automation are staggering. In early 2026, the Vesuvius Challenge announced that nearly 70% of a single scroll, designated PHerc. 172, had been digitally unwrapped. This marked a monumental shift: the technology was no longer just recovering isolated paragraphs, but entire continuous texts. The bottleneck had officially moved from detecting the ink to simply processing the massive terabytes of data.[1][8]

Segmentation algorithms trace the crumpled layers of papyrus within the 3D CT scan data.
Segmentation algorithms trace the crumpled layers of papyrus within the 3D CT scan data.

The stakes of this endeavor are difficult to overstate. The scrolls deciphered so far have predominantly contained works of Epicurean philosophy, including texts by Philodemus of Gadara, the resident philosopher of the villa. But the library's contents are already rewriting historical consensus and providing granular details about the lives of antiquity's greatest thinkers.[4][6]

In 2024, researchers using similar infrared and scanning methodologies on a partially opened Herculaneum scroll made a stunning announcement. The text, a "History of the Academy," revealed the exact burial location of the philosopher Plato—in a private garden near a shrine to the Muses within his Athenian academy. Furthermore, the text revealed that Plato had been sold into slavery much earlier in his life than historians previously believed.[4][5]

Discoveries of this magnitude highlight the tantalizing potential of the remaining unopened scrolls. Classicists estimate that up to 99% of ancient Greek and Roman literature has been lost to time. The Herculaneum library could hold missing dialogues of Aristotle, lost plays by Aeschylus and Sophocles, or early histories of the Roman Republic.[6][8]

To accelerate the recovery, the Vesuvius Challenge launched a new Kaggle competition in early 2026, challenging data scientists to build even better auto-segmentation models. The goal is to handle the most severely damaged sections of the scrolls, where the papyrus layers are tightly compressed, torn, or fused together by the volcanic heat.[2][8]

The automation of the unwrapping pipeline has led to an exponential increase in recovered text.
The automation of the unwrapping pipeline has led to an exponential increase in recovered text.

A primary concern within the scientific community is the risk of AI "hallucination"—the possibility that the machine learning models might invent letters that aren't there. To mitigate this, the Vesuvius Challenge employs strict evidentiary protocols. Multiple independent AI models, built by different teams using different architectures, must produce the exact same Greek characters from the raw CT data. If the models agree, the ink is considered verified.[1][3]

Furthermore, the models are constrained by the physical topology of the scroll. The AI cannot simply guess a word based on linguistic context, as a large language model might. It is strictly a computer vision task, identifying the physical presence of ink based solely on the geometric data of the voxels.[2][7]

The success of the Vesuvius Challenge is establishing a new paradigm for archaeology. Non-destructive analysis is becoming the gold standard, proving that artifacts do not need to be physically manipulated to yield their secrets. This approach preserves the physical integrity of the scrolls for future generations, who will undoubtedly possess even more advanced scanning technologies.[3][8]

As 2026 progresses, the focus is shifting toward "unwrapping at scale." With generalist AI models now capable of reading across multiple scrolls without needing to be retrained for each specific author's handwriting, the pace of discovery is accelerating exponentially. The ancient world is speaking again, and for the first time in two millennia, we finally have the tools to listen.[1][8]

The Villa of the Papyri in Herculaneum was buried under 60 feet of volcanic mud during the AD 79 eruption of Mount Vesuvius.
The Villa of the Papyri in Herculaneum was buried under 60 feet of volcanic mud during the AD 79 eruption of Mount Vesuvius.

How we got here

  1. AD 79

    Mount Vesuvius erupts, burying the Villa of the Papyri in Herculaneum and carbonizing its library.

  2. 1750

    Farmworkers digging a well discover the buried villa and the blackened scrolls.

  3. 2015

    Computer scientist Brent Seales pioneers virtual unwrapping techniques on the En-Gedi scroll.

  4. March 2023

    The Vesuvius Challenge launches, offering prize money to anyone who can use machine learning to read the scrolls.

  5. April 2024

    Researchers announce that a deciphered scroll reveals the exact burial location of the philosopher Plato.

  6. Early 2026

    The project reaches a major milestone, successfully automating the digital unwrapping of nearly 70% of a single scroll.

Viewpoints in depth

Papyrologists and Classicists

Focused on the historical and literary value of the recovered texts.

For scholars of antiquity, the technology is merely a means to an end. Their primary concern is the rigorous, peer-reviewed translation of the Greek and Latin texts. They caution against rushing the interpretation of AI-generated character maps, noting that ancient texts often lack spaces or punctuation and require deep contextual knowledge to translate accurately. This camp is highly motivated by the prospect of recovering lost works that could fundamentally alter our understanding of classical philosophy and history.

Computer Scientists

Focused on the algorithmic challenges of segmentation and ink detection.

The machine learning community views the scrolls as one of the ultimate computer vision challenges. Their focus is on building generalist models that can handle the extreme noise, compression, and topological distortion of the 3D scans without hallucinating data. For this camp, the breakthrough isn't just reading a single scroll, but developing an automated, scalable pipeline that can be applied to any tightly wrapped, damaged manuscript globally.

Archaeological Preservationists

Focused on protecting the physical integrity of the artifacts.

Preservationists champion the Vesuvius Challenge because it represents the gold standard of non-destructive archaeology. After centuries of well-intentioned but disastrous attempts to physically peel the scrolls apart—which destroyed countless layers—this camp advocates for leaving the remaining physical scrolls entirely untouched. They argue that as scanning technology improves, even the most fused and damaged scrolls will eventually be readable digitally.

What we don't know

  • Whether the remaining unread scrolls contain lost masterpieces of classical literature or primarily more Epicurean philosophy.
  • How many more scrolls remain buried in the unexcavated lower levels of the Villa of the Papyri.
  • Whether the auto-segmentation models can successfully untangle the most severely crushed and fused sections of the papyri.

Key terms

Virtual Unwrapping
The process of using 3D CT scans and software to digitally flatten and read a rolled or folded object without physically touching it.
Segmentation
In computer vision, the task of tracing and separating the individual, tangled layers of papyrus within a 3D scan.
Voxel
A 3D pixel; the smallest distinguishable box of volume in a CT scan, used to map the internal structure of the scrolls.
Epicureanism
An ancient Greek philosophical system emphasizing that the goal of life is happiness achieved through the absence of pain and fear.

Frequently asked

Why can't archaeologists just unroll the scrolls?

The scrolls were carbonized by the extreme heat of Mount Vesuvius. They are incredibly brittle and will shatter into dust if physically manipulated.

How does the AI detect the ink?

The AI is trained to detect microscopic differences in the physical texture and thickness of the papyrus caused by the application of the carbon-based ink, rather than relying on density.

What kind of texts are inside the scrolls?

Most of the scrolls opened so far contain Epicurean philosophy written in Greek, but scholars hope the remaining library contains lost works of history, poetry, and drama.

Could the AI be making up the letters?

To prevent hallucination, researchers require multiple independent AI models to produce the exact same Greek characters from the raw geometric data before verifying the text.

Sources

Source coverage

8 outlets

3 viewpoints surfaced

Papyrologists and Classicists 35%Computer Scientists 35%Archaeological Preservationists 30%
  1. [1]Vesuvius ChallengeComputer Scientists

    Resurrect an ancient library from the ashes of a volcano

    Read on Vesuvius Challenge
  2. [2]KaggleComputer Scientists

    Vesuvius Challenge - Ink Detection

    Read on Kaggle
  3. [3]Understanding AIComputer Scientists

    How AI is reading the Herculaneum scrolls

    Read on Understanding AI
  4. [4]Smithsonian MagazinePapyrologists and Classicists

    Newly Deciphered Papyrus Scroll Reveals Location of Plato's Grave

    Read on Smithsonian Magazine
  5. [5]National Endowment for the HumanitiesPapyrologists and Classicists

    Newly Deciphered Papyrus Scroll Reveals Location of Plato's Grave

    Read on National Endowment for the Humanities
  6. [6]University of PisaPapyrologists and Classicists

    The GreekSchools Project: New discoveries from Herculaneum papyri

    Read on University of Pisa
  7. [7]NatureComputer Scientists

    AI reads text from ancient Herculaneum scroll for the first time

    Read on Nature
  8. [8]Factlen Editorial TeamArchaeological Preservationists

    Synthesis by Factlen editorial team

    Read on Factlen Editorial Team
Stay informed

Every angle. Every day.

Get science stories with full source coverage and perspective breakdowns delivered to your inbox.