Files
docker/paperless
2024-07-17 16:35:18 +00:00
..
2022-02-15 15:05:34 +00:00
2024-06-19 08:03:37 +00:00
2024-07-17 16:35:18 +00:00

Paperless ngx

Paperless

Paperless ngx ist ein von der Community unterstütztes Open-Source-Dokumentenverwaltungssystem, das Ihre physischen Dokumente in ein durchsuchbares Online-Archiv umwandelt, sodass Sie weniger Papier benötigen.

Paperless ngx

HowTo


/etc/fstab Mount zu Scanner Verzeichniss

192.168.0.102:/mnt/pool1/scanner /opt/paperless/consume nfs auto 0 0

gegebenfalls noch die NFS Tools installieren
apt install nfs-common

Script für das entfernen von leeren Seiten

Added /usr/src/paperless/scripts/pre-consume.sh

#!/bin/sh
set -x

# Remove blank pages /scripts/remove-blank-pages.sh

Added /usr/src/paperless/scripts/remove-blank-pages.sh

#!/bin/bash
# set -x -e -o pipefail
set -e -o pipefail
export LC_ALL=C

# IN="$1" IN="$DOCUMENT_WORKING_PATH"

# Check for PDF format TYPE=$(file -b "$IN")

if [ "${TYPE%%,*}" != "PDF document" ]; then

&2 echo "Skipping $IN - non PDF [$TYPE]." exit 0 fi

# PDF file - proceed

# PAGES=$(pdfinfo "$IN" | grep ^Pages: | tr -dc '0-9') PAGES=$(pdfinfo "$IN" | awk '/Pages:/ {print $2}')

&2 echo Total pages $PAGES

# Threshold for HP scanners THRESHOLD=1 # Threshold for Lexmar MC2425 # THRESHOLD=0.8

non_blank() { for i in $(seq 1 $PAGES) ; do PERCENT=$(gs -o - -dFirstPage=${i} -dLastPage=${i} -sDEVICE=ink_cov "${IN}" | grep CMYK | nawk 'BEGIN { sum=0; } {sum += $1 + $2 + $3 + $4;} END { printf "%.5f\n", sum } ') >&2 echo -n "Color-sum in page $i is $PERCENT: " if awk "BEGIN { exit !($PERCENT > $THRESHOLD) }"; then echo $i >&2 echo "Page added to document" else >&2 echo "Page removed from document" fi done }

NON_BLANK=$(non_blank)

if [ -n "$NON_BLANK" ]; then NON_BLANK=$(echo $NON_BLANK | tr ' ' ",") qpdf "$IN" --warning-exit-0 --replace-input --pages . $NON_BLANK -- fi