High Schooler's AI Model Uncovers 1.5 Million Hidden Cosmic Objects in NASA Archives
An 18-year-old student's machine-learning algorithm has mapped 1.5 million previously invisible celestial phenomena using archived telescope data. The breakthrough catalog is now powering real-time alerts at major global observatories, highlighting how artificial intelligence is democratizing space exploration.
By Factlen Editorial Team
- Independent Researchers
- Argue that open-source data and accessible AI tools are democratizing science, allowing individuals outside traditional institutions to make major discoveries.
- Institutional Astronomers
- Value the massive influx of new catalog data to establish baselines for next-generation observatories, emphasizing the importance of expert mentorship.
- AI Technologists
- Focus on the algorithmic achievement of combining wavelet transforms with deep learning to solve the scale problem of processing 200 terabytes of noisy temporal data.
What's not represented
- · Traditional observational astronomers
- · Educators advocating for earlier STEM AI integration
Why this matters
This achievement proves that the next era of major scientific discovery won't just come from institutional supercomputers, but from open-source data and accessible AI tools. It demonstrates how artificial intelligence is lowering the barrier to entry for world-class research, allowing independent minds to solve problems that traditional agencies lack the bandwidth to address.
Key points
- An 18-year-old student used AI to discover 1.5 million new cosmic objects in archived NASA data.
- The machine-learning model, VARNet, processed nearly 200 billion rows of infrared measurements from the NEOWISE telescope.
- The breakthrough earned a $250,000 science prize and a single-author publication in a major astronomical journal.
- The massive new catalog is now being used by global observatories to calibrate real-time alerts for supernovae and black holes.
- The achievement highlights how accessible AI tools are democratizing advanced scientific discovery.
In early 2026, the Vera C. Rubin Observatory in the high deserts of Chile switched on its highly anticipated real-time alert system, sending astronomers worldwide an astonishing 800,000 notifications in a single night. These alerts flagged exploding stars, active black holes, and passing asteroids, marking a new era of high-speed astronomy. Yet, the foundation for this massive observational shift wasn't built entirely by a veteran astrophysics team working inside a government laboratory. Instead, a crucial piece of the underlying baseline data that makes these alerts possible was mapped by an 18-year-old high school student from California, proving that the tools of modern discovery are shifting.[1]
Matteo Paz, a senior from Pasadena High School, successfully identified 1.5 million previously unknown cosmic objects by applying a custom artificial intelligence model to archived space data. Mining nearly 200 terabytes of dense, noisy information from NASA's retired NEOWISE telescope, Paz achieved a scale of discovery that fundamentally expands the known catalog of the universe. His work bypassed the traditional decades-long slog of manual astronomical observation, utilizing advanced machine learning to spot patterns that human eyes and standard software had completely missed.[2][7]
The breakthrough has sent significant ripples through the global scientific community, validating the power of student-led research. It earned Paz the $250,000 first-place prize at the prestigious Regeneron Science Talent Search, resulted in a rare single-author publication in The Astronomical Journal, and recently drew formal public praise from NASA leadership. For a high schooler to not only participate in data analysis but to single-handedly architect a pipeline that uncovers over a million new celestial bodies is an unprecedented milestone that has forced major institutions to rethink how they engage with open data.[1][3][6]

The project originally began in the summer of 2022 when Paz joined the Planet Finder Academy at the California Institute of Technology. Under the direct mentorship of Davy Kirkpatrick, a senior scientist at Caltech's Infrared Processing and Analysis Center (IPAC), Paz was initially tasked with a relatively modest goal: looking for variable stars in a small, manageable slice of the night sky. The idea was to manually identify a few interesting objects to show the broader astronomical community the untapped potential hiding within the massive archives of retired space missions.[2][6]
Kirkpatrick desperately wanted to extract more value from the NEOWISE mission. Originally launched in 2009 to hunt for near-Earth asteroids and comets, the infrared telescope had scanned the entire sky for over a decade, amassing nearly 200 billion rows of precise thermal measurements. While tracking those local space rocks, the telescope inadvertently captured the faint, fluctuating heat signatures of distant quasars, supernovae, and eclipsing binary stars. Astronomers knew these variable objects were hidden in the data, but the sheer volume of the archive made extracting them seem like an impossible task.[4][6]
Kirkpatrick desperately wanted to extract more value from the NEOWISE mission.
The core problem was one of overwhelming scale. The dataset was simply far too massive for manual human analysis, and traditional algorithmic software struggled to filter out the background noise to find the faint, irregular signals of these distant objects. Drawing on a remarkably strong background in theoretical math and coding, Paz decided to abandon the manual search entirely. Instead, he set out to build an automated machine-learning pipeline capable of processing the entire 200-billion-row archive without human intervention, turning a summer project into a massive computational challenge.[3][5]
The resulting artificial intelligence model, dubbed VARNet, represented a highly sophisticated approach to data mining. It combined wavelet decomposition to handle the dataset's inherent noise with a novel Fourier transform technique designed to extract key features related to light variability. Deep learning algorithms then processed these extracted features, allowing the system to achieve sub-millisecond processing speeds per light curve on standard graphics processing units. This meant the model could evaluate the entire history of a star's brightness in a fraction of a second, scaling up to handle the entire sky.[3][5]

The model's incredible computational efficiency allowed Paz to analyze the entire NEOWISE archive in a matter of weeks. VARNet ultimately flagged 1.9 million potential objects of interest exhibiting variable behavior. When this list was meticulously cross-referenced with existing astronomical databases, roughly 1.5 million of these sources showed absolutely no prior record. They were immediately distinguished as fresh targets for astrophysical research, instantly providing the scientific community with a massive new catalog of potential binary systems, pulsating stars, and active galactic nuclei.[3]
Released publicly to the scientific community in late 2025, this massive catalog is already driving active observations at major facilities around the globe. It provides an immediate, highly detailed roadmap for studying rare transients and cataclysmic variables—objects that change too subtly or unpredictably for standard sky surveys to catch. By establishing a baseline of what is normally fluctuating in the night sky, astronomers can more easily spot the truly anomalous events that signal major cosmic phenomena, saving thousands of hours of telescope time.[1]
The timing of the catalog's release is absolutely critical for the next generation of astronomy. When the Vera C. Rubin Observatory begins its massive ten-year Legacy Survey of Space and Time later this year, it is expected to issue up to seven million alerts every single night. To make sense of that unprecedented data deluge, astronomers desperately need the kind of baseline variable data that Paz's AI model has provided. Without it, the new observatories would be overwhelmed by false positives, flagging known variable stars as new discoveries.[1]

Beyond the immediate benefits to astrophysics, the underlying machine-learning algorithm has significant cross-disciplinary potential. Because VARNet is fundamentally designed to analyze noisy, time-based signals and detect subtle anomalies, researchers note it could eventually be adapted for a wide variety of terrestrial applications. Experts suggest the architecture could be utilized for environmental monitoring, high-frequency finance, or neuroscience, where tracking subtle temporal changes over massive datasets often reveals critical underlying patterns that traditional statistical methods miss. The ability to isolate a faint signal from an ocean of noise is a universal challenge in modern data science, making Paz's astronomical tool highly relevant to Earth-bound industries.[1][5]
Ultimately, the achievement serves as a powerful reminder of how technology is fundamentally reshaping the landscape of academic research. As open-source data proliferates and artificial intelligence capabilities expand, the barrier to entry for world-class discovery is rapidly lowering. It proves that revolutionary scientific leaps no longer require multi-billion-dollar institutional supercomputers or decades of tenure. Sometimes, they just require open data, accessible algorithms, and a fresh perspective from an independent researcher willing to ask new questions of old information.[1][5]
How we got here
2009–2021
NASA's NEOWISE telescope scans the sky, collecting nearly 200 billion rows of infrared measurements.
Summer 2022
High school student Matteo Paz joins Caltech's Planet Finder Academy and begins analyzing the NEOWISE archive.
Late 2024
Paz publishes his peer-reviewed findings on the VARNet AI model in The Astronomical Journal.
March 2025
The breakthrough wins the $250,000 first-place prize at the Regeneron Science Talent Search.
Late 2025
The complete catalog of 1.5 million newly discovered cosmic objects is released publicly.
Early 2026
Major facilities like the Vera C. Rubin Observatory begin using the catalog to power real-time cosmic alert systems.
Viewpoints in depth
The Open-Science Advocates
Emphasize that the future of discovery relies on public data and accessible AI.
Proponents of open science view this breakthrough as the ultimate validation of making institutional data public. They argue that when agencies like NASA release massive archival datasets, they invite a global pool of talent to solve problems that internal teams lack the bandwidth to address. By combining open data with increasingly accessible machine-learning frameworks, independent researchers and students can bypass traditional academic bottlenecks and accelerate the pace of discovery.
Institutional Astronomers
Focus on the necessity of expert mentorship and the foundational value of the new catalog.
While celebrating the algorithmic achievement, institutional scientists emphasize that raw AI power requires domain expertise to yield valid results. They point out that the project succeeded because it was anchored by mentorship from Caltech's IPAC, ensuring the model's outputs were scientifically rigorous. For this camp, the true value lies in the resulting catalog, which provides a critical baseline for next-generation facilities like the Vera C. Rubin Observatory to filter out known variables and focus on truly unprecedented cosmic events.
What we don't know
- Exactly how many of the 1.5 million flagged objects are entirely new classes of celestial phenomena versus known types of variable stars.
- How effectively the VARNet algorithm can be adapted for non-astronomical fields like finance or environmental monitoring.
Key terms
- Variable Object
- A celestial body, such as a star or quasar, whose brightness fluctuates over time due to internal changes or eclipses.
- NEOWISE
- A retired NASA space telescope that scanned the entire sky in infrared light, primarily to hunt for near-Earth asteroids.
- Light Curve
- A graph showing the brightness of an astronomical object over a period of time.
- Fourier Transform
- A mathematical tool used to break down complex signals into simpler components, helping the AI identify repeating patterns in the noisy telescope data.
Frequently asked
What exactly did the AI model discover?
The model identified 1.5 million previously uncatalogued variable objects—such as quasars, supernovae, and eclipsing binary stars—by detecting subtle fluctuations in their infrared light.
Where did the data come from?
The data came from NASA's retired NEOWISE telescope, which collected nearly 200 billion rows of infrared measurements over more than a decade.
How fast did the AI process the data?
Using deep learning and specialized mathematical transforms, the model achieved sub-millisecond processing speeds per light curve on standard GPUs.
Why wasn't this found earlier?
The dataset was simply too massive for manual analysis, and traditional software struggled to filter out the noise to find the faint, irregular signals of these distant objects.
Sources
[1]Futura-SciencesIndependent Researchers
An unexpected breakthrough: a high school student's AI uncovers 1.5 million previously invisible cosmic phenomena
Read on Futura-Sciences →[2]Smithsonian MagazineInstitutional Astronomers
High School Student Discovers 1.5 Million Potential New Astronomical Objects by Developing an A.I. Algorithm
Read on Smithsonian Magazine →[3]R&D WorldAI Technologists
Matteo Paz's AI model identifies 1.5 million celestial objects
Read on R&D World →[4]Universe MagazineAI Technologists
High school student found 1.5 million space objects using AI
Read on Universe Magazine →[5]BetterMind LabsIndependent Researchers
Machine Learning in Astronomy: How Matteo Did It at Age 17
Read on BetterMind Labs →[6]University of ArizonaInstitutional Astronomers
Steward PhD Alum Mentors Teen Who Wins $250K for Using AI to Discover 1.5 Million Hidden Objects in Space
Read on University of Arizona →[7]GrokipediaAI Technologists
Matteo Paz
Read on Grokipedia →[8]The Astronomical JournalInstitutional Astronomers
VARNet: A Deep Learning Approach to Identifying Variable Sources in NEOWISE Data
Read on The Astronomical Journal →
Every angle. Every day.
Get ai stories with full source coverage and perspective breakdowns delivered to your inbox.








