Tag Archives: ocr

Reducir tamaño de documento escaneado con fondo sucio/oscuro, ganando claridad y nitidez.

Posted on May 12, 2023 by manoftherambla

Si el PDF original era bastante pesado finalmente se consigue ahorrar la mitad del espacio más o menos.

Posted in Uncategorized | Tagged abbyfinereader, adobe, ocr, pdf, scan | Leave a comment

Convertir PDF escaneado a EPUB. (Ver. 2.0)

Posted on February 11, 2020 by manoftherambla

Posted in Uncategorized | Tagged abbyfinereader, calibre, epub, ocr, pdf | Leave a comment

Capture full webpage and convert to PDF with OCR.

Posted on August 11, 2015 by manoftherambla

1. Capture webpage using Awesome Screenshot add-on in firefox. 2. You’ll get a very big image with 72 pp. Adobe Professional can’t perform the OCR in such a big image. You need to resize the image to a lower size, … Continue reading →

Posted in Uncategorized | Tagged capture, ocr, pdf, web | Leave a comment

Lios. Very interesting OCR software for linux

Posted on April 12, 2013 by manoftherambla

I haven’t proved it yet, but it seems great. http://www.tiflolinux.org/node/455 http://code.google.com/p/linux-intelligent-ocr-solution/

Posted in Uncategorized | Tagged cuneiform, ocr, scan, tesseract | 1 Comment

OCR from terminal

Posted on June 29, 2012 by manoftherambla

http://www.webupd8.org/2010/02/how-to-extract-all-text-from-pdfs.html

Posted in Uncategorized | Tagged ghostscript, ocr, tesseract | Leave a comment

Yagf

Posted on June 22, 2012 by manoftherambla

Yagf is another graphical front-end cuneiform OCR tool. It can use tesseract and cuneiform. To install it you must do it from http://www.getdeb.net and previously have added the getdeb repositories. In precise pangolin: http://www.ubuntuupdates.org/ppa/getdeb_apps wget -q -O – http://archive.getdeb.net/getdeb-archive.key | … Continue reading →

Posted in Uncategorized | Tagged cuneiform, getdeb, ocr, tesseract, yagf | Leave a comment

gimagereader, lios

Posted on June 17, 2012 by manoftherambla

gimagereader and lios are other good ocr software for linux. http://code.google.com/p/linux-intelligent-ocr-solution/

Posted in Uncategorized | Tagged gimagereader, lios, ocr | Leave a comment

Ocrfeeder in spanish

Posted on January 7, 2012 by manoftherambla

http://hatteras.wordpress.com/2011/10/28/escanear-con-ocr-reconocimiento-optico-de-caracteres-ocrfeeder/ Por defecto el programa usa en el OCR el idioma ingles (es decir el paquete tesseract-ocr-eng) , aunque tengas todo el sistema en español y hayas instalado el paquete tesseract-ocr-spa; para “obligar” a usar el español, en Argumentos del … Continue reading →