Fujitsu ScanSnap: Difference between revisions
Appearance
| Line 9: | Line 9: | ||
<pre> | <pre> | ||
tmpfs /tmp tmpfs defaults,noatime,nosuid 0 0 | tmpfs /tmp tmpfs defaults,noatime,nosuid 0 0 | ||
</pre> | |||
* create folders | |||
<pre> | |||
</pre> | </pre> | ||
Revision as of 10:44, 4 July 2020
Prepare Raspberry OS
apt update apt full-upgrade apt install aptitude sane-utils img2pdf
- set /tmp to ram in /etc/fstab:
tmpfs /tmp tmpfs defaults,noatime,nosuid 0 0
- create folders
Test Scanner Connection
lsusb sane-find-scanner scanimage -L
Scan Button Daemon
- Install via apt:
apt install scanbd
- change /etc/scanbd/dll.conf to fix kernel message:
#canon_pp fujitsu plustek_pp
- Test:
systemctl -t service tail -F /var/log/syslog
- /etc/scanbd/scanbd.conf
action scan {
filter = "^scan.*"
numerical-trigger {
from-value = 1
to-value = 0
}
desc = "Scan to file"
script = "/srv/scanbd/scan.script"
include(scanner.d/fujitsu.conf)
- /srv/scanbd/scan.script
#!/bin/bash
TMP_DIR=$(mktemp -d)
OUT_DIR=/srv/scanfolder_combined
TIMESTAMP=$(date +%Y-%m-%d_%H-%M-%S)
cd $TMP_DIR
scanimage --batch="$TMP_DIR/scan_%03d.tiff" --format=tiff \
--resolution 300 --mode color --source "ADF Duplex" \
--brightness 25 --contrast 15 \
--page-width 210 --page-height 297 -x 210 -y 297 \
--device-name='fujitsu:ScanSnap S1500:74986'
# --mode gray | color
echo "convert to pdf..."
img2pdf --pagesize A4 *.tiff --output ${OUT_DIR}/${TIMESTAMP}.pdf
rm *.tiff
Testing manual scans
- disable scanbd to test manual scanning
systemctl stop scanbd
- simple scan
scanimage --batch="/srv/scan_%03d.pnm" --format=pnm --resolution 300 --mode Color --source "ADF Duplex"
OCRmyPDF via pip3
- install
sudo apt install ghostscript libxml2 tesseract-ocr tesseract-ocr-eng tesseract-ocr-deu pngquant unpaper leptonica-progs libleptonica-dev automake libtool zlib1g-dev libjpeg-dev python3 python3-pip libxml2-dev libxslt1-dev libffi-dev git git clone https://github.com/agl/jbig2enc cd jbig2enc ./autogen.sh ./configure && make sudo make install cd .. git clone https://github.com/qpdf/qpdf cd qpdf ./configure && make sudo make install sudo ldconfig cd .. sudo pip3 install --upgrade pip sudo pip3 install pybind11 sudo pip3 install ocrmypdf
- processing pdf files that appear in folder
- /srv/daemon_ocr.sh:
#!/bin/bash
IN_DIR=/srv/scanfolder_combined
OUT_DIR=/srv/scanfolder_ocred
cd $IN_DIR
while true
do
FILE_TO_PROCESS=$(ls -1 -t *.pdf | head -1)
if [[ -z $FILE_TO_PROCESS ]]; then
echo "no files found"
sleep 15
continue
fi
echo "-----> processing : "$FILE_TO_PROCESS
ocrmypdf --output-type 'pdfa' \
--rotate-pages --deskew --clean-final --optimize 3 \
--language 'deu+eng' \
$FILE_TO_PROCESS $OUT_DIR/$FILE_TO_PROCESS
if [[ -f "$OUT_DIR/$FILE_TO_PROCESS" ]]; then
echo "ocr file was successfully created, deleting input file"
rm $FILE_TO_PROCESS
fi
sleep 5
done
upload to nextcloud
- /srv/daemon_upload.sh
#!/bin/bash
IN_DIR=/srv/scanfolder_ocred
OUT_DIR=/srv/scanfolder_uploaded
cd $IN_DIR
while true
do
FILE_TO_PROCESS=$(ls -1 -t *.pdf | head -1)
if [[ -z $FILE_TO_PROCESS ]]; then
echo "no files found"
sleep 15
continue
fi
echo "-----> processing : "$FILE_TO_PROCESS
curl -X PUT "https://nextcloud.domain.com/remote.php/webdav/00_Document Archive/00_New Scans/" -T $FILE_TO_PROCESS -u user:pass
if [[ $? == "0" ]]; then
echo "upload successful, moving file to out folder"
mv $FILE_TO_PROCESS $OUT_DIR/$FILE_TO_PROCESS
fi
sleep 5
done
- autostart daemons via /etc/rc.local
sleep 30 ./srv/daemon_ocr.sh & ./srv/daemon_upload.sh &
Links
- https://ocrmypdf.readthedocs.io/en/latest/cookbook.html
- https://askubuntu.com/questions/246647/convert-a-directory-of-jpeg-files-to-a-single-pdf-document
- https://superuser.com/questions/104656/convert-a-pdf-to-greyscale-on-the-command-line-in-floss
- https://unix.stackexchange.com/questions/93959/how-to-convert-a-color-pdf-to-black-white
- https://superuser.com/questions/508472/how-to-recognize-black-and-white-images
Windows: Deactivate ScanSnap folder
- regsvr32 /u "C:\Program Files (x86)\PFU\ScanSnap\SSFolder\SSFolder.dll"