Languages/Scripts supported in different versions of Tesseract
Languages
| LangCode | Language | 3.02 | 3.04 | 4.00 | 4.0.0 | 4.0.0 | 4.0.0 |
|---|---|---|---|---|---|---|---|
| Nov. 2016 | tessdata | tessdata_best | tessdata_fast | ||||
| afr | Afrikaans | x | x | x | x | x | x |
| amh | Amharic | x | x | x | x | x | |
| ara | Arabic | x | x | x | x | x | x |
| asm | Assamese | x | x | x | x | x | |
| aze | Azerbaijani | x | x | x | x | x | |
| aze_cyrl | Azerbaijani - Cyrilic | x | x | x | x | x | x |
| bel | Belarusian | x | x | x | x | x | x |
| ben | Bengali | x | x | x | x | x | x |
| bod | Tibetan | x | x | x | x | x | |
| bos | Bosnian | x | x | x | x | x | |
| bre | Breton | x | x | x | x | ||
| bul | Bulgarian | x | x | x | x | x | x |
| cat | Catalan; Valencian | x | x | x | x | x | x |
| ceb | Cebuano | x | x | x | x | x | |
| ces | Czech | x | x | x | x | x | x |
| chi_sim | Chinese - Simplified | x | x | x | x | x | x |
| chi_tra | Chinese - Traditional | x | x | x | x | x | x |
| chr | Cherokee | x | x | x | x | x | x |
| cos | Corsican | x | x | x | |||
| cym | Welsh | x | x | x | x | x | |
| dan | Danish | x | x | x | x | x | x |
| dan_frak | Danish - Fraktur (contrib) | x | x | ||||
| deu | German | x | x | x | x | x | x |
| deu_frak | German - Fraktur (contrib) | x | x | ||||
| deu_latf | German (Fraktur Latin) | x | x | x | x | ||
| dzo | Dzongkha | x | x | x | x | x | |
| ell | Greek, Modern (1453-) | x | x | x | x | x | x |
| eng | English | x | x | x | x | x | x |
| enm | English, Middle (1100-1500) | x | x | x | x | x | x |
| epo | Esperanto | x | x | x | x | x | x |
| equ | Math / equation detection module | x | x | x | x | x | |
| est | Estonian | x | x | x | x | x | x |
| eus | Basque | x | x | x | x | x | x |
| fao | Faroese | x | x | x | |||
| fas | Persian | x | x | x | x | x | |
| fil | Filipino (old - Tagalog) | x | x | x | |||
| fin | Finnish | x | x | x | x | x | x |
| fra | French | x | x | x | x | x | x |
| frk | German - Fraktur (now deu_latf) | x | x | x | x | x | x |
| frm | French, Middle (ca.1400-1600) | x | x | x | x | x | x |
| fry | Western Frisian | x | x | x | |||
| gla | Scottish Gaelic | x | x | x | |||
| gle | Irish | x | x | x | x | x | |
| glg | Galician | x | x | x | x | x | x |
| grc | Greek, Ancient (to 1453) (contrib) | x | x | x | x | x | x |
| guj | Gujarati | x | x | x | x | x | |
| hat | Haitian; Haitian Creole | x | x | x | x | x | |
| heb | Hebrew | x | x | x | x | x | x |
| hin | Hindi | x | x | x | x | x | x |
| hrv | Croatian | x | x | x | x | x | x |
| hun | Hungarian | x | x | x | x | x | x |
| hye | Armenian | x | x | x | |||
| iku | Inuktitut | x | x | x | x | x | |
| ind | Indonesian | x | x | x | x | x | x |
| isl | Icelandic | x | x | x | x | x | x |
| ita | Italian | x | x | x | x | x | x |
| ita_old | Italian - Old | x | x | x | x | x | x |
| jav | Javanese | x | x | x | x | x | |
| jpn | Japanese | x | x | x | x | x | x |
| kan | Kannada | x | x | x | x | x | x |
| kat | Georgian | x | x | x | x | x | |
| kat_old | Georgian - Old | x | x | x | x | x | |
| kaz | Kazakh | x | x | x | x | x | |
| khm | Central Khmer | x | x | x | x | x | |
| kir | Kirghiz; Kyrgyz | x | x | x | x | x | |
| kmr | Kurmanji (Kurdish - Latin Script) | x | x | x | x | ||
| kor | Korean | x | x | x | x | x | x |
| kor_vert | Korean (vertical) | x | x | x | x | ||
| kur | Kurdish (Arabic Script) | x | |||||
| lao | Lao | x | x | x | x | x | |
| lat | Latin | x | x | x | x | x | |
| lav | Latvian | x | x | x | x | x | x |
| lit | Lithuanian | x | x | x | x | x | x |
| ltz | Luxembourgish | x | x | x | x | ||
| mal | Malayalam | x | x | x | x | x | x |
| mar | Marathi | x | x | x | x | x | |
| mkd | Macedonian | x | x | x | x | x | x |
| mlt | Maltese | x | x | x | x | x | x |
| mon | Mongolian | x | x | x | x | ||
| mri | Maori | x | x | x | x | ||
| msa | Malay | x | x | x | x | x | x |
| mya | Burmese | x | x | x | x | x | |
| nep | Nepali | x | x | x | x | x | |
| nld | Dutch; Flemish | x | x | x | x | x | x |
| nor | Norwegian | x | x | x | x | x | |
| oci | Occitan (post 1500) | x | x | x | x | x | |
| ori | Oriya | x | x | x | x | x | |
| osd | Orientation and script detection module | x | x | x | x | x | x |
| pan | Panjabi; Punjabi | x | x | x | x | x | |
| pol | Polish | x | x | x | x | x | x |
| por | Portuguese | x | x | x | x | x | x |
| pus | Pushto; Pashto | x | x | x | x | x | |
| que | Quechua | x | x | x | x | ||
| ron | Romanian; Moldavian; Moldovan | x | x | x | x | x | x |
| rus | Russian | x | x | x | x | x | x |
| san | Sanskrit | x | x | x | x | x | |
| sin | Sinhala; Sinhalese | x | x | x | x | x | |
| slk | Slovak | x | x | x | x | x | x |
| slk_frak | Slovak - Fraktur (contrib) | x | x | ||||
| slv | Slovenian | x | x | x | x | x | x |
| snd | Sindhi | x | x | x | x | ||
| spa | Spanish; Castilian | x | x | x | x | x | x |
| spa_old | Spanish; Castilian - Old | x | x | x | x | x | x |
| sqi | Albanian | x | x | x | x | x | x |
| srp | Serbian | x | x | x | x | x | x |
| srp_latn | Serbian - Latin | x | x | x | x | x | |
| sun | Sundanese | x | x | x | x | ||
| swa | Swahili | x | x | x | x | x | x |
| swe | Swedish | x | x | x | x | x | x |
| syr | Syriac | x | x | x | x | x | |
| tam | Tamil | x | x | x | x | x | x |
| tat | Tatar | x | x | x | x | ||
| tel | Telugu | x | x | x | x | x | x |
| tgk | Tajik | x | x | x | x | x | |
| tgl | Tagalog (new - Filipino) | x | x | x | |||
| tha | Thai | x | x | x | x | x | x |
| tir | Tigrinya | x | x | x | x | x | |
| ton | Tonga | x | x | x | x | ||
| tur | Turkish | x | x | x | x | x | x |
| uig | Uighur; Uyghur | x | x | x | x | x | |
| ukr | Ukrainian | x | x | x | x | x | x |
| urd | Urdu | x | x | x | x | x | |
| uzb | Uzbek | x | x | x | x | x | |
| uzb_cyrl | Uzbek - Cyrilic | x | x | x | x | x | |
| vie | Vietnamese | x | x | x | x | x | x |
| yid | Yiddish | x | x | x | x | x | |
| yor | Yoruba | x | x | x | x |
Scripts
| Script | 3.02 | 3.04 | 4.00 | 4.0.0 | 4.0.0 | 4.0.0 | |
|---|---|---|---|---|---|---|---|
| Nov 2016 | tessdata | tessdata_best | tessdata_fast | ||||
| arab | Arabic | x | x | x | |||
| armn | Armenian | x | x | x | |||
| beng | Bengali | x | x | x | |||
| cans | Canadian_Aboriginal | x | x | x | |||
| cher | Cherokee | x | x | x | |||
| cyrl | Cyrillic | x | x | x | |||
| deva | Devanagari | x | x | x | |||
| ethi | Ethiopic | x | x | x | |||
| frak | Fraktur | x | x | x | |||
| geor | Georgian | x | x | x | |||
| grek | Greek | x | x | x | |||
| gujr | Gujarati | x | x | x | |||
| guru | Gurmukhi | x | x | x | |||
| hans | HanS (Han simplified) | x | x | x | |||
| hans-vert | HanS_vert (Han simplified vertical) | x | x | x | |||
| hant | HanT (Han traditional) | x | x | x | |||
| hant-vert | HanT_vert (Han traditional vertical) | x | x | x | |||
| hang | Hangul | x | x | x | |||
| hang-vert | Hangul_vert (Hangul vertical) | x | x | x | |||
| hebr | Hebrew | x | x | x | |||
| jpan | Japanese | x | x | x | |||
| jpan-vert | Japanese_vert (Japanese vertical) | x | x | x | |||
| knda | Kannada | x | x | x | |||
| khmr | Khmer | x | x | x | |||
| laoo | Lao | x | x | x | |||
| latn | Latin | x | x | x | |||
| mlym | Malayalam | x | x | x | |||
| mymr | Myanmar | x | x | x | |||
| orya | Oriya(Odia) | x | x | x | |||
| sinh | Sinhala | x | x | x | |||
| syrc | Syriac | x | x | x | |||
| taml | Tamil | x | x | x | |||
| telu | Telugu | x | x | x | |||
| thaa | Thaana | x | x | x | |||
| thai | Thai | x | x | x | |||
| tibt | Tibetan | x | x | x | |||
| viet | Vietnamese | x | x | x |
For detalls about the languages that each Script.traindata file supports, see the files that end with langs.txt (e.g. Latin.langs.txt) here.