Extract text from PDF(I have link to PDF) in ruby -
i have link like
http://www.downloads.com/help.pdf
i want download this, , parse text content.
how go this? plan tag-ize(if there word that) extracted text
you can either use pdf-reader gem (the example/text.rb example simple , worked me): https://github.com/yob/pdf-reader
or command-line utility pdftotext.
Comments
Post a Comment