Menu
Xerox scanners found to sometimes alter numbers

Xerox scanners found to sometimes alter numbers

A certain combination of source document and scanner settings can produce reproductions different from the original

Xerox scanners have been found to randomly alter numbers on documents when reproducing them if a certain combination of image quality and compression setting is used.

The problem first came to light last week when David Kriesel, a computer scientist pursuing a PhD at the University of Bonn, posted results of several scans on his website.

On Tuesday, Xerox acknowledged the problem and advised customers to use a higher quality scanner setting if it wanted to avoid the problem.

Kriesel said he first noticed the problem when he used a Xerox WorkCentre to scan to PDF some building construction documents. The documents were of a building floorplan and each room was marked with a small box that contained a room name and the area in square meters: 14.13m2, 21.11m2 and 17.42m2.

On first glance, the PDF reproductions of the plans appeared to be identical to the originals -- as anyone would expect them to be -- but closer examination revealed that wasn't actually the case.

The areas of the three rooms in the reproduced version were incorrect.

Kriesel set out to investigate the problem. When scanned in TIFF mode, a pixel-for-pixel reproduction, the copy was identical to the original. But when image compression was used, things started getting weird.

A Xerox WorkCentre 7535 reproduced an image where every room was labeled 14.13m2 in area. The same thing happened on a Xerox WorkCentre 7556 on one scan. A second scan on the same machine had two rooms at 17.42m2 and one at 21.11m2 and a third scan produced two rooms labeled 14.13m2 and one at 17.42m2.

Kriesel had switched off optical character recognition so it wasn't related to that, he wrote.

"There seems to be a correlation between font size, scan dpi used. I was able to reliably reproduce the error for 200 DPI PDF scans without OCR, of sheets with Arial 7pt and 8pt numbers," he said on his blog.

Once publicized, he says he began receiving emails from other Xerox users who were able to replicate the problem and also offer a few clues. He narrowed the problem down to the way the scanner's JBIG2 image compression works -- subsequently confirmed by Xerox as being at the root of the problem.

In order to reduce file space, the compression system looks for areas of an image that are similar and, when it finds them, makes one compressed version and reuses it across all the similar areas. Because the numbers in the document were printed in a small, fine font, the scanner apparently mistook them for identical and reused data resulting in figures for room area getting reproduced.

"The problem stems from a combination of compression level and resolution setting," Xerox said in a statement on Tuesday. "The devices mentioned are shipped from the factory with a compression level and resolution that produces scanned files which are optimized for viewing or printing while maintaining a reasonable file size. We do not normally see a character substitution issue with the factory default settings however, the defect may be seen at lower quality and resolution settings."

"For data integrity purposes, we recommend the use of the factory defaults with a quality level set to 'higher,'" the company advised.

Xerox said the machines had warned users for years that character substitution could occur at lower quality and higher compression settings.

A message on a web interface to the copier warns: "The normal quality option produces small file sizes by using advanced compression techniques. Image quality is generally acceptable, however, text quality degradation and character substitution errors may occur with some originals."

Martyn Williams covers mobile telecoms, Silicon Valley and general technology breaking news for The IDG News Service. Follow Martyn on Twitter at @martyn_williams. Martyn's e-mail address is martyn_williams@idg.com


Follow Us

Join the newsletter!

Or

Sign up to gain exclusive access to email subscriptions, event invitations, competitions, giveaways, and much more.

Membership is free, and your security and privacy remain protected. View our privacy policy before signing up.

Error: Please check your email address.

Tags softwareXeroxhardware systems

Featured

Slideshows

Meet the leading customer-centric Microsoft channel partners

Meet the leading customer-centric Microsoft channel partners

Microsoft honoured leading partners across the channel following a year of customer innovation and market growth in New Zealand. The 2018 Microsoft Partner Awards recognised excellence within the context of the end-user, spanning a host of emerging and established providers.

Meet the leading customer-centric Microsoft channel partners
Reseller News launches new-look Awards at 2018 Judges’ Lunch

Reseller News launches new-look Awards at 2018 Judges’ Lunch

Introducing the Reseller News Innovation Awards, launched to the channel at the 2018 Judges’ Lunch in Auckland. With more than 70 judges now part of the voting panel, the new-look awards will reflect the changing dynamics of the channel, recognising excellence across customer value and innovation - spanning start-ups, partners, distributors and vendors.

Reseller News launches new-look Awards at 2018 Judges’ Lunch
Show Comments