Use Python to automate Tesseract
- Install tesseract for English (-eng)
  - On Ubuntu Linux: sudo apt install tesseract-ocr-eng
- Write a Python script that does the following:
  - For each book image
    - Run tesseract (subprocess: capture stdout)
    - Write the results to a text file
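
The script outlined above could look roughly like the sketch below. It is one possible layout, not a reference solution. It assumes the book images are PNG files in a single directory, and that tesseract is on the PATH. The function names, the `IMAGE_PATTERN` constant, and the optional `run` parameter (which only exists so the tesseract call can be swapped out when testing) are illustrative choices, not part of the exercise.

```python
import subprocess
from pathlib import Path

# Assumption: the book page images are PNG files in one directory.
IMAGE_PATTERN = "*.png"


def ocr_image(image_path, run=subprocess.run):
    """Run tesseract on one image and return the recognized text."""
    # Passing "stdout" as the output base makes tesseract print the text
    # instead of writing its own file; "-l eng" selects the English data.
    result = run(
        ["tesseract", str(image_path), "stdout", "-l", "eng"],
        capture_output=True,  # capture stdout (and stderr)
        text=True,            # decode the captured bytes to str
        check=True,           # raise CalledProcessError if tesseract fails
    )
    return result.stdout


def ocr_directory(image_dir, output_dir, run=subprocess.run):
    """OCR every matching image and write one .txt file per image."""
    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    written = []
    # sorted() keeps the pages in a predictable order.
    for image_path in sorted(Path(image_dir).glob(IMAGE_PATTERN)):
        text = ocr_image(image_path, run=run)
        text_path = output_dir / (image_path.stem + ".txt")
        text_path.write_text(text, encoding="utf-8")
        written.append(text_path)
    return written
```

For example, calling `ocr_directory("book_images", "book_text")` (both directory names are hypothetical) would write `book_text/page1.txt` for an input `book_images/page1.png`. With `check=True`, a failing tesseract run stops the loop with an exception rather than writing an empty file.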