readme

来自「一个google的OCR源码」· 代码 · 共 44 行

TXT

44 行

How to run UNLV tests.The scripts in this directory make it possible to duplicate the testspublished in the Fourth Annual Test of OCR Accuracy.See http://www.isri.unlv.edu/downloads/AT-1995.pdfbut first you have to get the tools and data from UNLV:Step 1: to download the images gotohttp://www.isri.unlv.edu/ISRI/OCRtkand get 3b.tgz, Bb.tgz, Mb.tgz and Nb.tgz.Step 2: extract the files. It doesn't really matter wherein your filesystem you put them, but they must go under a commonroot so you have directories 3, B, M and N in, for example,/users/me/ISRI-OCRtk.Step 3: Reorg the filesThe lack of tif extensions on the images is inconvenient, so thereis a script to reorganize the data to match the rest of the testscripts.cd to /users/me/ISRI-OCRtk or wherever 3, B, M and N ended up and run/blah/blah/tesseract-ocr/testing/reorgdata.sh 3BThis makes directories doe3.3B, bus.3B, mag.3B and news.3B.You can now get rid of 3, B, M, and N unless you want to get some of theother scanning resolutions out of them.Step 4: Download the ISRI toolkit from:http://www.isri.unlv.edu/downloads/ftk-1.0.tgzStep 5: If they work for you, use the binaries directly from the bindirectory and put them in tesseract-ocr/testing/unlvotherwise build the tools for yourself and put them there.Step 6: cd back to your main tesseract-ocr dir and Build tesseract.Step 7: run testing/runalltests.sh with the root data dir and testname:testing/runalltests.sh /users/me/ISRI-OCRtk tess2.0and go to the gym, have lunch etc.Step 8: There should be a filetesting/reports/tess2.0.summary that contains the final summarized accuracyreport and comparison with the 1995 results.

readme - 源码说明

本页面展示了「一个google的OCR源码」中的 readme 源码文件，采用编程语言编写，共 44 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。

虫虫下载站收录了大量与google相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。

⌨️ 快捷键说明

复制代码Ctrl + C

搜索代码Ctrl + F

全屏模式F11

增大字号Ctrl + =

减小字号Ctrl + -

显示快捷键?