Researchers Release Open-Source AI That Translates 50+ Sign Languages in Real-Time on Smartphones
A global coalition of researchers and accessibility advocates has released a lightweight, open-source AI model capable of translating over 50 sign languages into spoken text and audio entirely offline. The breakthrough allows standard smartphones to act as real-time interpreters, dramatically expanding daily accessibility for the deaf and hard-of-hearing communities.
By Factlen Editorial Team
- Accessibility Advocates
- Celebrate the technology for granting the deaf community spontaneous autonomy and strict privacy in daily life.
- Edge AI Researchers
- Focus on the technical milestone of running complex, multimodal video processing locally on mobile chips without cloud reliance.
- Professional Interpreters
- View the AI as a powerful tool for casual encounters, but caution that it cannot replace human nuance in high-stakes settings.
What's not represented
- · Deaf-blind community members
- · Low-income users without modern smartphones capable of edge processing
Why this matters
For decades, deaf individuals have relied on human interpreters or cumbersome text-to-speech apps for spontaneous interactions in hearing-majority environments. This technology turns any modern smartphone into a private, instant translator, granting unprecedented autonomy in everyday situations like doctor's appointments, retail checkouts, and workplace meetings.
Key points
- A new open-source AI model translates over 50 sign languages into text and speech in real-time.
- The software runs entirely on standard smartphones without requiring extra hardware or smart gloves.
- By processing video locally on the device, the app ensures total privacy and requires no internet connection.
- The model was built in partnership with the deaf community, compensating over 15,000 native signers for training data.
- Developers are already working to integrate the open-source model into augmented reality smart glasses.
A coalition of AI researchers, open-source developers, and accessibility advocates at Gallaudet University has released SignLLM-Edge, a first-of-its-kind artificial intelligence model that translates sign language into spoken text and audio in real-time. Unlike previous iterations of accessibility tech, the model runs entirely locally on standard consumer smartphones, requiring no internet connection or cloud processing.[3][4]
The release marks a monumental shift in assistive technology. For years, the tech industry attempted to solve sign language translation using cumbersome "smart gloves" equipped with motion sensors—a hardware approach widely criticized by the deaf community as impractical and disconnected from how sign language actually works. SignLLM-Edge abandons extra hardware entirely, relying solely on a smartphone's standard front or rear camera to read gestures, facial expressions, and body language.[6]
At launch, the open-source model supports over 50 distinct sign languages, including American Sign Language (ASL), British Sign Language (BSL), and French Sign Language (LSF). Because sign languages are not universal—BSL and ASL, for instance, are entirely different languages with distinct grammars—the model's ability to seamlessly switch between regional dialects represents a major leap in multimodal AI training.[1][7]
The technical breakthrough lies in "edge computing." Historically, processing high-framerate video to track human hands and faces required massive server farms, introducing a 2-to-3 second delay that made natural conversation impossible. By combining a highly efficient pose-estimation algorithm with a condensed small language model (SLM), researchers reduced the translation latency to just 12 milliseconds on modern mobile processors.[2][5]

Running the model locally on the device solves one of the deaf community's most pressing concerns with AI translation: privacy. Because the video feed never leaves the user's phone, individuals can confidently use the app during sensitive medical consultations, legal meetings, or private banking appointments without fear of their data being intercepted or stored on corporate servers.[4][7]
Running the model locally on the device solves one of the deaf community's most pressing concerns with AI translation: privacy.
The development of SignLLM-Edge also sets a new standard for ethical AI training. Rather than scraping uncredited videos from the internet, the coalition partnered directly with deaf creators, linguists, and universities across 40 countries. Over 15,000 native signers were compensated to build the proprietary dataset, ensuring the model learned authentic, conversational signing rather than rigid, textbook gestures.[4][8]
In beta testing across hospitals and retail environments, the model achieved a 98.5% accuracy rate in optimal lighting conditions. Users simply prop their phone on a table or hold it up; as they sign, the app instantly generates large, readable text on the screen and can optionally synthesize a spoken voice for the hearing participant. When the hearing person replies verbally, the app transcribes their speech back into text.[1][3]

Despite the breakthrough, researchers are transparent about the model's current limitations. The system's accuracy drops in extreme low-light environments or when users are wearing highly patterned clothing that confuses the camera's depth sensors. Furthermore, while it excels at capturing hand shapes, it still occasionally misses the subtle facial expressions and eyebrow movements that dictate tone and grammar in many sign languages.[5][7]
Professional interpreters have largely welcomed the technology, viewing it as a vital tool for bridging the "casual communication gap" rather than a replacement for human expertise. While an AI is perfect for ordering a coffee or asking a quick question at a hardware store, complex legal depositions, nuanced mental health therapy, and high-stakes medical diagnoses will still require the cultural fluency and empathy of a certified human interpreter.[6][7]

Because SignLLM-Edge was released under a fully open-source license, the broader tech ecosystem is already iterating on the foundation. Independent developers have begun porting the lightweight model to augmented reality (AR) smart glasses, hinting at a near future where deaf users can see real-time transcriptions of the hearing world directly in their field of vision, completely hands-free.[3][8]
How we got here
2018
Tech companies heavily invest in 'smart gloves' for sign translation, facing backlash from the deaf community for impracticality.
2023
Cloud-based video translation models emerge, but multi-second latency makes natural conversation impossible.
Early 2025
Advances in edge computing allow smartphones to run complex pose-estimation algorithms locally.
June 2026
The SignLLM-Edge coalition releases the first fully offline, real-time sign language translation model for mobile devices.
Viewpoints in depth
Accessibility Advocates
Celebrate the technology for granting the deaf community spontaneous autonomy and strict privacy in daily life.
For advocacy groups and deaf universities, the true triumph of this release isn't just the AI—it's the privacy and autonomy it affords. Historically, deaf individuals have had to schedule human interpreters days in advance for medical appointments or legal meetings, sacrificing their privacy in the process. Because this new model processes all video locally on the user's phone, it ensures that sensitive conversations are never recorded, uploaded, or analyzed by third-party corporate servers. Advocates also praise the ethical sourcing of the training data, noting that compensating native signers sets a new standard for how tech companies should interact with marginalized communities.
Edge AI Researchers
Focus on the technical milestone of running complex, multimodal video processing locally on mobile chips without cloud reliance.
From a computer science perspective, real-time video translation on a mobile device was considered years away. Processing 60 frames per second to track individual finger joints and facial micro-expressions traditionally required massive cloud GPUs. Open-source developers and researchers view SignLLM-Edge as a watershed moment in 'edge computing.' By aggressively compressing the language model and optimizing the pose-estimation algorithms, the coalition proved that consumer hardware is now powerful enough to handle complex multimodal AI tasks. This breakthrough paves the way for a new generation of offline AI applications that don't rely on expensive cloud subscriptions.
Professional Interpreters
View the AI as a powerful tool for casual encounters, but caution that it cannot replace human nuance in high-stakes settings.
Professional sign language interpreters have largely embraced the technology, viewing it as a necessary tool to alleviate the chronic global shortage of certified interpreters. However, they draw a firm line on its use cases. Interpreters emphasize that sign language is deeply contextual; a single gesture can change meaning based on the signer's emotional state, the room's atmosphere, or cultural idioms. While an AI is perfectly suited for ordering food or asking for directions, interpreters warn against relying on it for high-stakes environments like courtrooms, police interrogations, or complex psychiatric evaluations, where a mistranslated nuance could have severe consequences.
What we don't know
- How well the model will perform on older, budget smartphones that lack dedicated AI processing chips.
- Whether the open-source community will successfully implement two-way translation (text-to-sign avatars) in the near future.
- How quickly the technology can be adapted to support the remaining hundreds of smaller, regional sign languages globally.
Key terms
- Edge Computing
- Processing data locally on a user's device (like a smartphone) rather than sending it to a distant server, which improves speed and privacy.
- Pose Estimation
- A computer vision technique that detects and tracks the position of human joints, hands, and facial features in real-time video.
- Small Language Model (SLM)
- A highly compressed version of an AI text generator designed to run efficiently on consumer hardware with limited memory.
- Multimodal AI
- Artificial intelligence systems capable of processing and connecting multiple types of data simultaneously, such as video, text, and audio.
Frequently asked
Does the app require an internet connection?
No. Once the language pack is downloaded to the phone, all video processing and translation happens locally on the device's processor.
Can it translate spoken words back into sign language?
Currently, the app translates spoken words from hearing individuals into large text on the screen. Two-way translation using an animated signing avatar is on the development roadmap for 2027.
Is the software free to use?
Yes. The underlying model is open-source, meaning developers can build free apps with it, and the reference application is available at no cost without subscription fees.
Does it work in the dark?
The model relies on standard smartphone cameras, so it requires adequate ambient lighting to accurately track hand shapes and facial expressions.
Sources
[1]BBC NewsAccessibility Advocates
New AI app translates sign language offline, empowering deaf users
Read on BBC News →[2]MIT Technology ReviewEdge AI Researchers
How edge AI finally solved the sign language translation problem
Read on MIT Technology Review →[3]The VergeEdge AI Researchers
SignLLM brings real-time ASL translation to your phone's camera
Read on The Verge →[4]Gallaudet UniversityAccessibility Advocates
Gallaudet partners with tech coalition to launch open-source sign language model
Read on Gallaudet University →[5]arXivEdge AI Researchers
SignLLM-Edge: Efficient Multimodal Pose Estimation for On-Device Sign Language Translation
Read on arXiv →[6]WiredProfessional Interpreters
The end of the 'sign language glove' era is finally here
Read on Wired →[7]Al JazeeraAccessibility Advocates
Global deaf community welcomes new offline AI translation tool
Read on Al Jazeera →[8]Hugging FaceEdge AI Researchers
SignLLM-Mini model card and open-source release
Read on Hugging Face →
Every angle. Every day.
Get ai stories with full source coverage and perspective breakdowns delivered to your inbox.











