OCR Preferences

The OCR Preferences section contains all of the OCR (Optical Character Recognition) settings for Xodo PDF Studio.

To open the OCR Preferences dialog:

    1. Go to File Tab > Preferences
    2. Select OCR from the panel on the left of the preferences dialog to view or modify these preferences.

Settings

Recognition Languages - Options to set the default OCR language and ability to download new languages.

  • Primary Language: sets the default OCR language to be used.
  • Enable Secondary Language: (Experimental) Enable this option when working with documents that contain multiple languages to recognize text for a secondary language as well as the default selected language.
  • Download OCR Languages: opens the language download manager.

Image Processing - Settings used when passing the scanned image to the OCR engine to recognize text.

  • DPI Resolution: Sets the resolution of the image to be sent to the OCR engine.

Note: From our testing, a resolution of 300 DPI produces good OCR results for most images. When dealing with scans containing noise, you may try using a lower DPI setting to get rid of the noise and obtain better OCR results.

Available OCR Languages

The following language dictionary files are available for download directly from within Xodo PDF Studio OCR functions.

  • Afrikaans
  • Albanian – shqip
  • Arabic – العربية
  • Azerbaijani – azərbaycan
  • Basque – euskara
  • Belarusian – беларуская
  • Bengali – বাংলা
  • Bulgarian – български
  • Catalan – català
  • Cherokee
  • Chinese (Simplified) – 中文(体中文)
  • Chinese (Traditional) – 中文(繁體)
  • Croatian – hrvatski
  • Czech – čeština “da”>Danish – dansk
  • Danish – Dansk
  • Danish (Fraktur) – Dansk (Fraktur)
  • Dutch - Netherlandish
  • English
  • Estonian – eesti
  • Finnish - Suomalainen
  • French - Français
  • Galician – galego
  • German - Deutsche
  • Greek – Ελληνικά
  • Hebrew – עברית
  • Hindi – हिन्दी
  • Hungarian – magyar
  • Icelandic – íslenska
  • Indonesian – Bahasa Indonesia
  • Italian - Italiano
  • Italian (old) – italino vecchio
  • Japanese – 日本語
  • Kannada – ಕನ್ನಡ
  • Korean – 한국어
  • Latvian – latviešu
  • Lithuanian – lietuvių
  • Macedonian – македонски
  • Malay – Bahasa Melayu
  • Malayalam – മലയാളം
  • Maltese – Malti
  • Math / Equations
  • Norwegian - Norsk
  • Polish - Polskie
  • Portuguese - Português
  • Romanian – română
  • Russian – русский
  • Serbian – српски
  • Slovakian – slovenčina
  • Slovakian (Fraktur) – slovenčina (Fraktur)
  • Slovenian – slovenščina
  • Spanish - Español
  • Spanish (Old) – español (Antiguo)
  • Swahili – Kiswahili
  • Swedish - Svensk
  • Tagalog
  • Tamil – தமிழ்
  • Telugu – తెలుగు
  • Thai – ไทย
  • Turkish – Türkçe
  • Ukrainian – українська
  • Vietnamese – Tiếng Việt

Using the appropriate language file will improve the accuracy of OCR results. See Tips on Improving OCR Results for additional information