fast document scanner

Book Scanner

Sponsored Links

The Royal Library of Alexandria, founded by Ptolemy I in 286 BC in Egypt, was one of the most famous libraries of ancient times.  The librarians of Alexandria collected hundreds and thousands of books through centuries of conquest.  In AD 641, invaders fed the bound books and papyrus scrolls into the public bath, where a furnace was lit and the books were said to have burnt for six months.

It is the fondest dream of the information age to rebuild this Alexandrian fantasy, containing an archive of all knowledge.  Amazon.com, with currently over 250,000 titles, strived to create this knowledge archive in a unique way.  Taking into consideration the passion inspired by the physical form of books, they chose not to make an e-book project.  Using the book scanner, they first scanned books into their archive.  The 'Search inside a book' feature allows users to search keywords both in book titles as well as deep inside the book contents.  The search returns pictures of pages, and not text. You can find the page, read it on your screen and browse a fixed number of pages.  You cannot copy download or read the entire book. A user cannot view more than a fixed number of pages per month.  Thus the point is just to help the user find a book and not to provide him with a information source.  Once he finds it, he can conveniently purchase the complete physical copy of the book from Amazon, thus increasing the sale of books!

Amazon’s handicap in not being able to create a completely readable digital archive lies in the fact that it deals in commercially viable titles with copyrights spread across countless owners.  There are other ventures like Project Gutenberg that is dedicated to digitizing only public domain titles that are free from copyright.  Having fed over 30,000 titles into the book scanner averaging 50 titles a week, this project is run by the non profit Literary Archive Foundation.

book scanner

Google Books is another service that combines both Amazon’s approach of limited viewing to copyright bound books and Gutenberg’s approach to public domain books.  Out of the 10 million books that Google has fed into the book scanner, 1 million are full preview based on publisher agreements.  Most of the scanned works are no longer commercially available or are in public domain.

There are plenty of ways to scan a book.  The simplest and least expensive method is destructive scanning, where you cut of the binding of the book converting it to a sheaf of loose papers. The papers are then loaded into an automatic page feeder which feeds it into a book scanner.  This is however not a desirable solution when it comes to old and uncommon books or an expensive collectors’ item.

A non-destructive way of scanning avoids the need to remove the binding of the book. Workers in India, China and Philippines earn around 40 cents an hour to manually turn pages and scan the book using an overhead book scanner.  There are expensive robots, which can do the same at a rate of 1200 pages an hour. 

There are also high end book scanners that employ static charges in air and vacuum to flip pages. A camera, placed over a V shaped cradle, scans the pages automatically.  Images can then be moved from the camera to different editing software where the image can be further processed into archival friendly formats like JPEG or TIFF or web friendly outputs like PDF or JPEG. 

Thus, it seems that it won’t be long before we attain the Alexandrian dream of treasury of all knowledge.