Created by

Add on-device
 computer vision 
to your
React Native app

On-device AI running in a React Native app

Features & Use cases

/1

OCR

Extract text from any image, receipt, or document – right on the device. No image ever leaves the phone, which makes it a natural fit for sensitive documents: contracts, medical records, financial statements.

With React Native ExecuTorch, you can ship a fully functional document scanner in your app without routing user data through an external OCR API.

/2

Computer vision

Run object detection, segmentation, and image classification. React Native ExecuTorch supports models including YOLO, RF-DETR, MobileNet, and Segment Anything (SAM), giving you a full computer vision toolkit.

From background blur in video calls to product recognition in retail apps – real-time object detection is the feature end users will notice.

/3

Voice capabilities

Add studio-quality speech synthesis and accurate transcription to your app. React Native ExecuTorch ships with Whisper for speech-to-text and Kokoro for text-to-speech, so users can listen to content or dictate input even when they're offline.

Build an audiobook reader, a voice note app, or an accessibility tool that works even on a plane.

/4

Image & text embedding

Generate vector embeddings from images and text – enabling semantic search, similarity matching, and local RAG without a cloud vector database.

A great fit for note-taking apps, photo managers, or knowledge tools – with the entire embedding and retrieval pipeline kept local.

/5

LLMs & VLMs

Run large language models and vision-language models. React Native ExecuTorch supports Llama, Phi, and other popular open models.

Build AI-powered chat, smart autocomplete, or image-aware assistants – for apps where privacy isn't a feature checkbox, but the entire product.

/6

OCR + LLM Pipeline

Chain OCR and an LLM together on-device: snap a photo of a sign, menu, or document in any language – React Native ExecuTorch extracts the text and feeds it to a local language model.

For travel apps, document assistants, or translation tools – like Google Lens, but private and offline.

Developers are already shipping with it

See what's possible

An AI personal assistant built entirely with React Native ExecuTorch

Private Mind is a production-grade AI assistant app that runs entirely on your phone. It uses React Native ExecuTorch under the hood for LLM inference, speech recognition, and on-device embeddings.

Get it on Google PlayDownload on the App Store

FAQ

On-device AI runs inference entirely on the user's phone. It means no server round-trips, no API costs, and no user data ever leaving the device. Your app works offline, responds faster (no network latency), and keeps all sensitive data private by default.

React Native ExecuTorch supports a range of open-source models across multiple categories: YOLO and RF-DETR for object detection, Segment Anything (SAM) for image segmentation, Whisper for speech-to-text, Kokoro for text-to-speech, Llama and Phi for large language model inference, and CLIP-style models for image and text embeddings. See the full list of available models on our Hugging Face.

Yes – all inference happens locally on the device. Once the model is bundled with or downloaded to the app, it runs completely offline. No internet connection is needed for any inference task, including LLM chat, speech recognition, image processing, or embedding generation.

Edge AI is a broader category that covers any inference running outside a central cloud server – including on-device phones, IoT devices, and edge servers. On-device AI specifically refers to inference running on the end user's device (smartphone, tablet, laptop). React Native ExecuTorch is built for iOS and Android – the model runs on the phone itself, not on a nearby server.

Yes. React Native ExecuTorch is open source and maintained by Software Mansion. It's available on GitHub under an open-source license. Commercial use is permitted – see the repository for the full license terms.

We built the library so most teams can ship on-device features themselves with the APIs it provides. For teams with tighter timelines, more complex pipelines, or custom model requirements, we also work directly with companies to design, train, and integrate on-device AI into production React Native apps. If that sounds useful, get in touch.