How To Ocr A Pdf
<h3 class="heading-h6"><a name="THLToolboxhomegtScanningampOCRgtHowtoOCRaPDF" class="anchorpoint"></a><a href="/tools/wiki/home.html">THL Toolbox</a> > <a href="/tools/wiki/Scanning%20%26%20OCR.html">Scanning & OCR</a> > How to OCR a PDF</h3><p class="paragraph">
</p><h3 class="heading-h1"><a name="HowtoOCRaPDFUsingAdobeAcrobatProfessional" class="anchorpoint"></a>How to OCR a PDF Using Adobe Acrobat Professional</h3><p class="paragraph"><strong class="bold">Contributor(s):</strong> Scholars' Lab staff, Adriana Barcenas, Steven Weinberger, Zach Rowinski</p><p class="paragraph">This is the process for running OCR on a PDF so that it is searchable, using Acrobat Professional:</p><ol><li>For most PDFs, you want to run Optimize after you scan them. First rename the file; then pull down the Document menu and select Optimize.</li>
<li>Then, to run OCR: open the PDF file you want to run OCR on.</li>
<li>Pull down the File menu, choose "Save as," and add "-ocr.pdf" to the file name</li>
<li>Pull down the Document menu, point to "OCR Text Recognition," and then point to "Recognize Text Using OCR…" and "start"</li>
<li>The OCR process will start. It will take some time, depending on the number of pages in the PDF.</li>
<li>When it finishes, save the file. Be sure to check by doing a search on "the" or another word in the file and make sure it returns results.</li></ol><p class="paragraph">To OCR roman text with diacritic characters, investigate using Abbyy's FineReader (<img src="/" alt="external link: " title="external link"/><span class="nobr"><a href="http://www.abbyy.com/" target="rwikiexternal">http://www.abbyy.com/</a></span>). No THL staff have used this and we have no experience with it. For more information, see <a href="/tools/wiki/zach-abbyy-ocr-diacritics-assessment.html">Zach Rowinski's assesssment</a>.
</p><h3 class="heading-h6"><a name="ProvidedforunrestrictedusebythespanclassnobrimgsrcsakairwikitoolimagesicklearrowgifaltexternallinktitleexternallinkahrefhttpwwwthliborgtargetrwikiexternalTibetanandHimalayanLibraryaspan" class="anchorpoint"></a><em class="italic">Provided for unrestricted use by the <span class="nobr"><img src="/" alt="external link: " title="external link"/><a href="http://www.thlib.org" target="rwikiexternal">Tibetan and Himalayan Library</a></span></em></h3>