Mailing List Archive: 49091 messages
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

[REBOL] Re: Converting Word .DOC and .PDF to text files

From: g:santilli:tiscalinet:it at: 26-Sep-2002 19:36

Hi jose, On Thursday, September 26, 2002, 6:21:10 PM, you wrote: j> I want to get the text of any arbitrary PDF file. Is j> there a spec I can look at ? On the Adobe web site you'll find the full specifications for the PDF format. I can send it to you, if you don't want to search for it. However, as I said, parsing a PDF file is harder than creating one, because you'll have to deal with all possibilities (compression, encryption, linearized format...); of course, this does not mean it is impossible. Regards, Gabriele. -- Gabriele Santilli <[g--santilli--tiscalinet--it]> -- REBOL Programmer Amigan -- AGI L'Aquila -- REB: http://web.tiscali.it/rebol/index.r