Dear James, OCRmyPDF's test suite is currently failing under a freshly-installed Debian Sid chroot. I've attached the output to this e-mail. Since the test suite worked on yesterday's version of Debian Sid, I think that this must be due to a bug introduced in a new version of one the dependencies. That means it's my job to figure out what the problem is, and it is unlikely to be a bug in OCRmyPDF for you to fix. I'm e-mailing you just in case the problem is obvious to you from reading the output. Thanks. -- Sean Whitton
============================= test session starts ==============================
platform linux -- Python 3.4.4, pytest-2.8.7, py-1.4.31, pluggy-0.3.1
rootdir: /build/ocrmypdf-4.0.1, inifile: pytest.ini
collected 44 items
tests/test_hocrtransform.py .
tests/test_main.py ...................FF..................
tests/test_pageinfo.py ....
=================================== FAILURES ===================================
____________________________ test_autorotate[hocr] _____________________________
spoof_tesseract_cache = {'BIBINPUTS': '/home/swhitton/doc:/home/swhitton/doc/papers:', 'BROWSER': 'iceweasel', 'BUILDRESULTGID': '1000', 'BUILDRESULTUID': '1000', ...}
renderer = 'hocr'
@pytest.mark.parametrize('renderer', [
'hocr',
'tesseract',
])
def test_autorotate(spoof_tesseract_cache, renderer):
import ocrmypdf.ghostscript as ghostscript
import logging
gslog = logging.getLogger()
# cardinal.pdf contains four copies of an image rotated in each cardinal
# direction - these ones are "burned in" not tagged with /Rotate
out = check_ocrmypdf('cardinal.pdf', 'test_autorotate_%s.pdf' % renderer,
'-r', '-v', '1', env=spoof_tesseract_cache)
for n in range(1, 4+1):
correlation = check_monochrome_correlation(
reference_pdf=_infile('cardinal.pdf'),
reference_pageno=1,
test_pdf=out,
test_pageno=n)
> assert correlation > 0.80
E assert 0.01808749884366989 > 0.8
tests/test_main.py:310: AssertionError
----------------------------- Captured stdout call -----------------------------
/build/ocrmypdf-4.0.1/.pybuild/pythonX.Y_3.4/build/tests/output/main/cardinal.pdf.ref0001.png
/build/ocrmypdf-4.0.1/.pybuild/pythonX.Y_3.4/build/tests/output/main/cardinal.pdf.ref0001.png
__________________________ test_autorotate[tesseract] __________________________
spoof_tesseract_cache = {'BIBINPUTS': '/home/swhitton/doc:/home/swhitton/doc/papers:', 'BROWSER': 'iceweasel', 'BUILDRESULTGID': '1000', 'BUILDRESULTUID': '1000', ...}
renderer = 'tesseract'
@pytest.mark.parametrize('renderer', [
'hocr',
'tesseract',
])
def test_autorotate(spoof_tesseract_cache, renderer):
import ocrmypdf.ghostscript as ghostscript
import logging
gslog = logging.getLogger()
# cardinal.pdf contains four copies of an image rotated in each cardinal
# direction - these ones are "burned in" not tagged with /Rotate
out = check_ocrmypdf('cardinal.pdf', 'test_autorotate_%s.pdf' % renderer,
'-r', '-v', '1', env=spoof_tesseract_cache)
for n in range(1, 4+1):
correlation = check_monochrome_correlation(
reference_pdf=_infile('cardinal.pdf'),
reference_pageno=1,
test_pdf=out,
test_pageno=n)
> assert correlation > 0.80
E assert 0.01808749884366989 > 0.8
tests/test_main.py:310: AssertionError
----------------------------- Captured stdout call -----------------------------
/build/ocrmypdf-4.0.1/.pybuild/pythonX.Y_3.4/build/tests/output/main/cardinal.pdf.ref0001.png
/build/ocrmypdf-4.0.1/.pybuild/pythonX.Y_3.4/build/tests/output/main/cardinal.pdf.ref0001.png
==================== 2 failed, 42 passed in 667.14 seconds =====================
Attachment:
signature.asc
Description: PGP signature