A quick bash script that wraps around tesseract and allows tesseract to work on pdfs of scans
LICENSE | ||
README.md | ||
scantopdf |
scantopdf
A quick bash script that wraps around tesseract and allows tesseract to work on pdfs of scans
Installation
To install this script, just paste these commands in the terminal
git clone https://git.karma-riuk.com/karma/scantopdf /tmp/scantopdf
sudo cp /tmp/scantopdf/scantopdf /usr/local/bin/scantopdf
sudo chmod 755 /usr/local/bin/scantopdf
Please verify that the path /usr/local/bin
is in the global variable $PATH
.
To verify this, issue the following command:
echo "$PATH" | grep "/usr/local/bin"
if there is a line of output, you are all good! If there isn't, then issue the following command:
export PATH="/usr/local/bin:$PATH"
echo "export PATH=\"/usr/local/bin:\$PATH\"" | sudo tee -a /etc/profile
and everything should be okay.
Ensure successful installation
To make sure the installation has been done correctly, the following command
which scantopdf
should have the following output
/usr/local/bin/scantopdf
if it doesn't, try to restart the installation from scratch.
Usage
To use this script, follow this steps
- Open a terminal
- Go the location of the file you want to "convert" with the following command
wherecd <path>
<path>
is the location of the folder where lies the file to convert. - Use the command
scantopdf
and give it, as an argument, the file you want to convert. If the file name contains spaces, surround it with quotes"
Optionally you can give it thescantopdf "I Promessi Sposi.pdf"
-v
flag (before the name of the file), to make the script verbose, so instead of doing its job without saying anything, it will print each step that it's doing.scantopdf -v "La Divina Commedia.pdf"