Source: http://figoblog.org/node/1916, 23/03/2008 (automatically
translated text)
In this article, someone from Google explains the information and document
management problems they encountered in setting up Google Book Search. It
contains reflections on OCR, document analysis, metadata extraction, image
processing, the display of documents or sections of documents, free software,
and R&D.
In Wired, you can see a photo report on the digitization carried out by the
Internet Archive as part of the OCA project. Note how much of the work is
done by hand...
Also worth consulting: Framework for good digital collections (NISO document,
version 3, December 2007) and the probably already cited Preservation in the
age of large-scale digitization (CLIR report by Oya Rieger of Cornell
University).
References:
- Lorcan Dempsey
- Disruptive Library Technology Jester