|
|
|
|
<!DOCTYPE html>
|
|
|
|
|
<!DOCTYPE html>
|
|
|
|
|
<html>
|
|
|
|
|
<head>
|
|
|
|
|
<meta charset="utf-8">
|
|
|
|
|
<title>Tasks of the Contingent Librarian</title>
|
|
|
|
|
<link rel="stylesheet" type="text/css" href="tasks.css">
|
|
|
|
|
<script src="tasks.js"></script>
|
|
|
|
|
</head>
|
|
|
|
|
<body>
|
|
|
|
|
|
|
|
|
|
<div class="card"><DOCUMENT_FRAGMENT><div class="mw-parser-output"><h1><span class="mw-headline" id="scanning">scanning</span><span class="mw-editsection"><span class="mw-editsection-bracket">[</span><a href="/mw-mediadesign/index.php?title=User:Simon/Scanning&action=edit&section=1" title="Edit section: scanning">edit</a><span class="mw-editsection-bracket">]</span></span></h1>
|
|
|
|
|
<p>see also <a href="Human_reading.html" title="User:Simon/Human reading">human reading</a>, <a href="Machine_reading.html" title="User:Simon/Machine reading">machine reading</a>
|
|
|
|
|
</p><p>Scanning is reading for particular details (such as names and numbers) by running one’s eyes over every word in a line. Sometimes I find myself using my index finger to guide my eyes when scanning a printed text. With a computer and full-text search capabilities, <i>control-f</i> helps find instances of a particular word or phrase.
|
|
|
|
|
</p><p>Scanning is also a way to process printed matter so that it may be electronically archived, modified and distributed. A bookscanner is the tool of choice for many archivists. It has two cameras, one to capture the odd pages, and one for the even pages. Most bookscanners consist of a system of pulleys which allow the book to be raised to two perpendicular sheets of glass, laying the pages flat and ensuring the focus is correct. It’s quite a workout, and is usually reserved for books which are difficult to find in digital format. Essentially the bookscanner takes two photographs, one each for the even and odd sides of a spread. So the sequence goes; flip, click click, flip, click click, and so on, and so on. Next, these images must go through a variety of processes to produce a digital book; rotating, cropping to the size of the page, merging into a single PDF. Ultimately, the most useful digital books include a digital text layer generated by OCR (Optical Character Recognition) software, making the text searchable and copy/pasteable.
|
|
|
|
|
</p><p>Image: An archivist bookscanner
|
|
|
|
|
</p>
|
|
|
|
|
<!--
|
|
|
|
|
NewPP limit report
|
|
|
|
|
Cached time: 20200620082449
|
|
|
|
|
Cache expiry: 86400
|
|
|
|
|
Dynamic content: false
|
|
|
|
|
CPU time usage: 0.004 seconds
|
|
|
|
|
Real time usage: 0.004 seconds
|
|
|
|
|
Preprocessor visited node count: 2/1000000
|
|
|
|
|
Preprocessor generated node count: 8/1000000
|
|
|
|
|
Post‐expand include size: 0/2097152 bytes
|
|
|
|
|
Template argument size: 0/2097152 bytes
|
|
|
|
|
Highest expansion depth: 2/40
|
|
|
|
|
Expensive parser function count: 0/100
|
|
|
|
|
Unstrip recursion depth: 0/20
|
|
|
|
|
Unstrip post‐expand size: 0/5000000 bytes
|
|
|
|
|
-->
|
|
|
|
|
<!--
|
|
|
|
|
Transclusion expansion time report (%,ms,calls,template)
|
|
|
|
|
100.00% 0.000 1 -total
|
|
|
|
|
-->
|
|
|
|
|
|
|
|
|
|
<!-- Saved in parser cache with key wdka_mw_mediadesign-mw_:pcache:idhash:31471-0!canonical and timestamp 20200620082449 and revision id 173978
|
|
|
|
|
-->
|
|
|
|
|
</div></DOCUMENT_FRAGMENT></div>
|
|
|
|
|
|
|
|
|
|
</body>
|
|
|
|
|
</html>
|