OCR PDF

Modified on Tue, 18 Jun, 2024 at 8:30 AM

OCR PDF

Xodo PDF Studio is capable of OCRing documents using any of the available OCR languages to add text to documents. OCR allows you to add text to scanned documents or images so that the document can be searched or marked up as you would any other text document. Xodo PDF Studio can also run OCR with two languages at once. For more information on using OCR with two languages see OCR Preferences.

What is OCR?

Optical character recognition (OCR) is the mechanical or electronic conversion of images of typed or printed text into machine-encoded searchable text data.

 

From Existing Document

Text can be added to an existing document using OCR

  1. Launch Xodo PDF Studio and open the PDF document that you wish to add searchable text to
  2. Go to the Document Tab > OCR from the toolbar
  3. From the Language drop down select the language you wish to use
    • Note: The first time using OCR you will need to download the language packs. To do so click on “Download OCR Languages“, then select the languages you wish to use and click on “Download”.
  4. Select the Page Range and Resolution that you wish to use
    • Note: A resolution of 300 DPI produces good OCR results for most images. When dealing with scans containing noise, you may try using a lower DPI setting to get rid of the noise and obtain better OCR results.
  5. Choose additional options
    • Discard Invisible Text - removes any previous OCR text that has been added to the page.
    • Auto Deskew Images - When checked, if the document’s text/images are slanting too far in one direction or is misaligned, Xodo PDF Studio will attempt to auto-rotate the document so that the alignment is corrected.
  6. Click on “OK” to begin the OCR process
  7. You will see a progress dialog showing you the current page being processed. Once complete click on “OK” to close the dialog
  8. Your document is now ready to be searched, edited, or marked up with highlights, underlined, crossed-out or used with caret annotations.

When Scanning a Document

OCR can add text to a document at the same time it is being scanned with Xodo PDF Studio

  1. Start the Scanning Dialog as normal
  2. In the scanning dialog you will see an option to OCR the document after scanning
  3. From the Language drop down select the language you wish to use
    • Note: The first time using OCR you will need to download the language packs. To do so click on “Download OCR Languages“, then select the languages you wish to use and click on “Download”.
  4. After setting all of your scanning and OCR settings click on “Scan” to begin scanning the document
  5. Once the scanning completes the OCR process will begin and you will see a progress dialog showing you the current page being processed. Once complete click on “OK” to close the dialog
  6. Your document is now ready to be searched, edited, or marked up with highlights, underlined, crossed-out or used with caret annotations.

Available OCR Languages

The following language dictionary files are available for download directly from within Xodo PDF Studio OCR functions. Using the appropriate language file will improve the accuracy of OCR results. See Tips on Improving OCR Results for additional information

  • Afrikaans
  • Albanian – shqip
  • Arabic – العربية
  • Azerbaijani – azərbaycan
  • Basque – euskara
  • Belarusian – беларуская
  • Bengali – বাংলা
  • Bulgarian – български
  • Catalan – català
  • Cherokee
  • Chinese (Simplified) – 中文(体中文)
  • Chinese (Traditional) – 中文(繁體)
  • Croatian – hrvatski
  • Czech – čeština “da”>Danish – dansk
  • Danish – Dansk
  • Danish (Fraktur) – Dansk (Fraktur)
  • Dutch - Netherlandish
  • English
  • Estonian – eesti
  • Finnish - Suomalainen
  • French - Français
  • Galician – galego
  • German - Deutsche
  • Greek – Ελληνικά
  • Hebrew – עברית
  • Hindi – हिन्दी
  • Hungarian – magyar
  • Icelandic – íslenska
  • Indonesian – Bahasa Indonesia
  • Italian - Italiano
  • Italian (old) – italino vecchio
  • Japanese – 日本語
  • Kannada – ಕನ್ನಡ
  • Korean – 한국어
  • Latvian – latviešu
  • Lithuanian – lietuvių
  • Macedonian – македонски
  • Malay – Bahasa Melayu
  • Malayalam – മലയാളം
  • Maltese – Malti
  • Math / Equations
  • Norwegian - Norsk
  • Polish - Polskie
  • Portuguese - Português
  • Romanian – română
  • Russian – русский
  • Serbian – српски
  • Slovakian – slovenčina
  • Slovakian (Fraktur) – slovenčina (Fraktur)
  • Slovenian – slovenščina
  • Spanish - Español
  • Spanish (Old) – español (Antiguo)
  • Swahili – Kiswahili
  • Swedish - Svensk
  • Tagalog
  • Tamil – தமிழ்
  • Telugu – తెలుగు
  • Thai – ไทย
  • Turkish – Türkçe
  • Ukrainian – українська
  • Vietnamese – Tiếng Việt

 

 

 

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article