BentoPDF 1.0.0: Open-Source PDF Tool with Privacy Focus Released
The new open-source tool BentoPDF appears in version 1.0 with extensive PDF functions, Docker integration, and a focus on local data processing.
(Image: heise medien)
With BentoPDF 1.0.0, a new open-source tool for PDF processing is now available. The developers place particular emphasis on local data processing without cloud connection. The first major release brings numerous functions for professional workflows.
Among the most important updates is the Posterize function, which splits large PDFs into several smaller documents for poster printing. The Linearize function optimizes PDFs for fast web viewing through progressive loading optimization. In addition, there are bulk operations: multiple PDFs can be compressed or split into individual pages simultaneously.
The tool automatically removes blank pages from documents and offers an Interleave-Merge mode, where multiple PDFs are merged interleaved—practical, for example, when scanning front and back sides. In addition, files can be embedded directly into PDFs as attachments.
Improved OCR and Performance Optimizations
Furthermore, the developers have fundamentally revised the OCR functions since the beta phase. Whitelist character sets have been newly added here for more precise text recognition. The project has optimized the performance of split, merge, and compression operations and reduced memory consumption in bulk operations.
Videos by heise
Various Docker configurations are available for deployment, suitable for both development environments and production. The revised Docker Compose configuration is intended to simplify the setup of BentoPDF. Users of Unraid will find a pre-built template for integration into their infrastructure.
The project is available on GitHub, where interested users can also find a complete list of supported features. The complete technical details and installation instructions can be found in the project documentation.
(fo)