70 projects tagged "PDF"
CUPS is a standards-based printing system for Mac OS X and other Unix-like operating systems. It provides the System V and Berkeley command line interfaces, and uses the Internet Printing Protocol ("IPP") as the basis for managing print jobs and queues. The Line Printer Daemon (LPD) Server Message Block (SMB), and AppSocket (a.k.a. JetDirect) protocols are also supported with reduced functionality. CUPS adds network printer browsing and PostScript Printer Description ("PPD") based printing options to support real world printing.
PDFTextStream is a PDF text and metadata extraction library available for Java, Python, and .NET. It supports all versions of the PDF document specification, (including v1.7, used by Acrobat 8 and 9), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of 40-bit and 128-bit encrypted documents, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
PDFreactor is a formatting processor to convert HTML and XML to PDF. It uses Cascading Style Sheets (CSS) to define page layout and styles. It allows you to dynamically generate PDF documents such as invoices, delivery notes, shipping documents, or print versions of Web content on-the-fly. Vector graphics (SVG), barcodes, MathML, XSLT, and CMYK colors are supported. All common J2EE application servers are supported. Complete .NET, PHP, Perl, Python, and Ruby APIs are included. Direct integration into automatic build processes using Apache Ant is also possible.
jPDFText is a Java library to extract text from PDF documents. PDF documents can be processed to extract the textual content for archiving, storage, searching, or indexing. jPDFText is built on top of Qoppa's proprietary PDF technology, so there is no need for any third party software or drivers. Main Features: loading PDF documents from files, network drives, URLs, or input streams; extracting text; and extracting words as a vector of Strings. It is written entirely in Java, which allows your application to remain platform independent. There is no need to install or configure additional drivers or software when deploying.