Software experiments for the OuNuPo bookscanner. Part of Special Issue 5.
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
jvdhorst d86c273ddb updated readme 7 years ago
src Forgot to add my actual script 7 years ago
.gitignore Moved output list.txt,plain.txt from src/ to output/. src/. 7 years ago
Makefile Added my wordtagger script. Use make wordtagger to execute it 7 years ago
README updated readme 7 years ago

README

*Tools for working with scanned pages*

Makefile:
* `make dirs`: create the working folder structure
* `make tessaract` **dependency: tesseract**
* `make myscript`
* `make visualization`: **dependency: mplayer**  creates visualization of images/ dir, by cating the images content into mplayer. See more option ins [shiftop](https://git.bleu255.com/shiftop/file/shiftop.html)
* `make wordtagger`: **dependency: tesseract** Uses scanned pages as an input, tags each word for their wordtype (noun, verb etc) and saves it in a text file.