jgreely - 01 Lego-based build for paperbacks
Builder: | jgreely |
Forum Post: | Lego-based paperback build |
Material: | Lego |
Platen Angle: | 180° |
Camera Setup: | single |
Camera Models: | Minolta Dimage A2 (8MP) |
Lego-based single-camera build for paperbacks
post by jgreely » 04 Apr 2011, 02:40
I have a large number of Japanese paperbacks that I want to OCR, roughly 4x6 and 5x7 inches. I had everything I needed except a platen and a design, and after browsing these forums for a while, I came up with both. Because my camera has manual focus and exposure controls (and remote-control software!), I wanted a design that kept the platen at a constant distance from the camera. There's still some tinkering to do, but I think it's going to work nicely.
Here (3.6MB JPEG) is the raw output from the first test shot, done with just room lighting.
What was needed
Materials:
* Minolta Dimage A2 8MP camera, Dimage Capture remote-control software, and AC-11 power adapter.
* Kaiser copystand.
* Speedotron Black Line 1205CX power pack,
* plus two each of 202VF color-corrected flash head with 7" reflector, 10-degree grid insert,
* and 8-foot light stand.
* Wein wireless flash trigger and slave.
* Abbyy Finereader Pro 9.0.
* 32x32-post Lego baseplate and a whole bunch of Legos in various sizes and shapes.
* 7.75x9.25-inch (roughly 25x30-post) sheet of 1/4-inch Acrylite GP for the platen.
* a bunch of pennies.
* several sheets of matte black construction paper.
* two sheets of cardstock.
* several sheets of packing foam.
* tape.
Side view; the platen is simply sandwiched between layers of Legos. Note the extra Legos on the side to position the assembly for a 4x6 book:
Rendered model from Lego Digital Designer:
Back view, with a spacer column to position 4x6 books consistently:
Back view of rendered model from Lego Digital Designer:
Using it
In use; construction paper is taped to the vertical riser to eliminate glare, and another (black) sheet placed behind the page being scanned to significantly reduce bleed-through.
The pennies add enough weight to the assembly so I can press the book up into the platen:
Camera support (note the camera doesn't need to be in Macro mode):
Cardstock-and-foam cradle to press the hand-held book up into the platen consistently.
Cleaned-up detail from the test scan linked above:
Software
Without doing any cleanup on the test scan, Finereader was able to OCR it with only two single-character errors, plus about a dozen garbage characters added to short lines where it interpreted the noise in the low-contrast JPEG as ASCII.
The camera can shoot RAW, the flash rig will provide crisp, even lighting, and doing a little bit of pre-processing before OCR will significantly improve the recognition rate and get rid of the noise. I know from experience that a 300dpi scan is adequate for Japanese in Finereader, and this setup will get ~600dpi for the 4x6 books and ~450 for the 5x7 ones, so it should work out nicely.
Testing the flash light rig
I tested the flash rig before I built the scanner, so I know it will fill the platen with crisp, even, glare-free light, but it's powerful enough to light a large room, and at its minimum setting (1/4 power, single channel, for an effective 37 watt-seconds per head), the reflectors have to be nearly six feet away from the platen to deliver only f/8 at ISO 64, with an exposure time of 1/500th seconds.
I'll be doing some real scans tonight. The target books are the murder mystery To Die in the Rain (4x6, page 5 was the test scan), Japanese Train Lunches (4x6, full-color photos), and My Youth with Ultraman (5x7, with B&W photos). I'll post some samples once they're done.
Samples
post by jgreely » 04 Apr 2011, 04:36
First color sample with the flash rig hooked up, shot RAW and developed with default settings in Photoshop, with only a rotate and an auto-contrast applied before saving to JPEG:
Platen
To improve on the pictures, here's what the platen assembly looks like in Lego Digital Designer (not exactly like mine…):
The angled pieces protect the edge of the facing page from rubbing against the sharp bottom corner of the riser. The yellow spacers are just the right size to eliminate almost all wobble in the platen. The long plates help hold everything together. The 32x32-post baseplate is not shown.
Parts:
* 216 2x4 bricks
* 28 2x2 bricks
* 2 1x2 bricks
* 13 1x6 bricks
* 14 2x2 inverted sloped bricks
* 2 2x4 plates (black)
* 16 2x8 plates (black)
* 6 1x2x1 panels (yellow spacer)
* 1 32x32 baseplate
Scan from a 4x6 novel, batch-processed with Adobe Camera Raw. Finereader was able to OCR this 36-page short story with less than one single-character error per page (and about a quarter of those were always getting one particular character wrong). Calculated DPI is 542, which is plenty, even if some pages need deskewing.
Using remote control software
I'm not using the remote-control software yet, and I might not. Since the exposure and focus are constant, and the camera support is rock-solid, I might as well just shoot with the cable release, and then carry the CompactFlash card over to the computer when I finish a book.
I averaged about six pages/minute for this first batch. To speed that up, I need a finger cot or glove to turn pages more reliably, and some way to keep the backing sheet from slipping out when I pull the book out of the scanner. And practice.
Comments
Postby jgreely » 04 Apr 2011, 02:40
Added some rendered models to clarify the pictures. Built in Lego Digital Designer, edited in Bricksmith to add a phony baseplate and platen, and then rendered with LDView.
Re: Lego-based single-camera paperback build
post by jck57 » 04 Apr 2011, 03:01
Outstanding work! Just shows what you can do with imagination and common cents!
jck57