OCR & Speech-to-Text on iPhone — Offline AI Tools

Q: What languages does speech-to-text support?

Filemorph supports speech-to-text in 40+ languages using Apple's on-device Speech framework.

Text & speech

3 operationsRead text and listen

OCR — image to text

Pro

Extract printed and handwritten text from photos using Vision.

Speech-to-text

Pro

Transcribe audio in 40+ languages using Apple's Speech framework.

Language detect

Pro

Detect the language of any block of text.

Vision & detection

5 operationsFind things in photos

Image classify

Pro

Identify objects, scenes and concepts in any photo.

Face detect

Pro

Find faces with bounding boxes and landmark points.

Smart crop

Pro

Auto-crop to the salient subject of the image.

Document detect

Pro

Find document edges in a photo for scanning.

Barcode & QR read

Pro

Read every common barcode and QR code type.

Generate

1 operationQR codes from any URL or text

QR code generate

Pro

Generate QR codes from URLs, contacts, Wi-Fi credentials and more.

Core ML enhancement

3 operationsImage restoration & upscale

AI super-resolution

Pro

2× and 4× upscale with DRCT or RealESRGAN models.

AI image restore

Pro

Restore old or damaged photos with InstructIR.

Colorize B&W

Pro

Bring black-and-white photos to life with DDColor.

Tier-based model loading

3 tiersDevice-aware AI

Tier 1 — iPhone

≤ 500 MB

Models like HVI-CIDNet (8 MB), Adaptive 3DLUT (5 MB), FBCNN (70 MB), InstructIR (64 MB).

Tier 2 — iPad Pro / Mac

≤ 2 GB

DRCT, DDColor, GFPGAN, BiSeNet for advanced edits.

Tier 3 — Pro / Pro Max

Multi-GB

Depth Anything, RealESRGAN x4, NAFNet, CodeFormer, LaMa.

FAQ

AI tools — common questions.

Does Filemorph OCR work offline? +

Yes. OCR uses Apple's Vision framework, which runs entirely on-device. No internet connection is needed and your photos are never uploaded.

What languages does speech-to-text support? +

Filemorph supports speech-to-text in 40+ languages including English, Spanish, French, German, Japanese, Korean, Chinese, Russian and Arabic, using Apple's on-device Speech framework.

Do AI upscale and colorize need downloads? +

Yes, the Core ML models are downloaded once on first use, then run locally on the Neural Engine for every subsequent conversion. Models are tier-based: small for iPhone, larger for iPad Pro and Mac.

Are my photos sent anywhere? +

No. After the initial model download, every AI operation runs on your device. Nothing about your content leaves your phone.

On-device AI.
Private by default.

13 AI tools running on your Neural Engine.

Download on theApp Store