OCR for 20+ Languages
Extract text from images in Spanish, French, Chinese, Japanese, Arabic, and many other languages.
Buail comhaid anseo, nó cliceáil chun brabhsáil
JPG, PNG, WebP, BMP gan teorainn méid comhaid
Ní fhágann do chomhaid do ghléas riamh. Tarlaíonn gach próiseáil go háitiúil i do bhrabhsálaí.
Multi-language tips
Pick the language matching the image content. Wrong language setting causes Tesseract to "hallucinate" English-like text from non-English content.
Each language downloads a ~10 MB training data file the first time. Once cached, subsequent uses are fast.
For Asian languages (Chinese, Japanese, Korean), use high-resolution images. CJK characters have many strokes and need pixel density to OCR well.
For mixed-language documents, run OCR with each language separately and combine the best parts — there's no single-pass multilingual mode.
Conas a Oibríonn sé
Cén fáth ár gceann a úsáid?
Also check out…
Extract Text from Screenshots
Pull text out of screenshots — error messages, cha
Convert Scanned PDFs and Photos to Text
Extract text from scanned documents, contracts, re
Digitize Business Cards
Extract names, emails, phone numbers from photogra
Make Images Accessible
Extract text from images that lack proper alt text
