GitHub

Stirling-PDF

What is the project about?

Stirling-PDF is a locally hosted, web-based PDF manipulation tool that runs within a Docker container. It allows users to perform a wide variety of operations on PDF files.

What problem does it solve?

It provides a comprehensive, locally-hosted solution for all PDF-related needs, eliminating the reliance on external web services for PDF manipulation, thus enhancing privacy and security. It solves the problem of needing multiple different tools or websites to perform different operations on PDFs.

What are the features of the project?

  • Page Operations: Merging, splitting, rotating, reorganizing, removing pages, multi-page layout, scaling, cropping, auto-splitting, extracting pages, overlaying PDFs.
  • Conversion: PDF to/from images, other file types (using LibreOffice), HTML, XML, CSV, URL, Markdown.
  • Security: Password protection, permissions management, watermarking, signing, sanitization, auto-redaction.
  • Other: Signature creation, splitting by size, repairing, blank page detection/removal, text comparison, image addition/extraction/removal, compression, annotation removal, page numbering, auto-renaming, OCR, PDF/A conversion, metadata editing, flattening, and viewing/exporting PDF information.
  • Parallel file processing.
  • Dark mode.
  • Custom download options.
  • "Pipelines" for automated task sequences.
  • API for external script integration.
  • Optional login/authentication.
  • Database backup/import.
  • Enterprise features (SSO, etc.).
  • Supports 39 Languages.

What are the technologies used in the project?

  • Docker (for containerization)
  • LibreOffice (for file conversions)
  • qpdf (for PDF compression)
  • Tesseract OCR (for optical character recognition)
  • PDF.js (view and modify PDFs)
  • Joxit and Liberation fonts
  • Java

What are the benefits of the project?

  • Privacy and Security: Files are processed locally, not sent to external servers.
  • Comprehensive Functionality: A wide range of PDF operations are supported in one tool.
  • Extensibility: API and "Pipelines" allow for automation and integration.
  • Customization: Dark mode, custom download options, and more.
  • Enterprise-Ready: Features like SSO are available.
  • Open Source: Community contributions and transparency.

What are the use cases of the project?

  • Individuals needing to manage and manipulate PDF documents.
  • Businesses requiring secure, in-house PDF processing.
  • Developers integrating PDF functionality into their applications via the API.
  • Organizations needing to automate PDF workflows.
  • Anyone who needs to frequently perform operations on PDFs and wants a single, reliable, local tool.
Stirling-PDF screenshot