New LGPL library to extract data from pdf

26
mark writes: “The Java Pdf Extraction Decoding Access Library is an LGPL library to extract content from Adobes pdf files as well as drawing page previews. It allows the contents to be extracted as XML and to preserve the font information as XML metadata. Full source code is available and it is being actively developed.

Try it now at http://www.jpedal.org