<divclass="cardback"><DOCUMENT_FRAGMENT><divclass="mw-parser-output"><divclass="thumb tright"><divclass="thumbinner"style="width:152px;"><aclass="image"href="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/a/a3/Hidden_characters.jpg/960px-Hidden_characters.jpg"><imgalt=""class="thumbimage"decoding="async"src="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/a/a3/Hidden_characters.jpg/320px-Hidden_characters.jpg"></a><divclass="thumbcaption"><divclass="magnify"><aclass="internal"href="File:Hidden_characters.jpg.html"title="Enlarge"></a></div>Hidden characters (e.g. tabs, spaces, carriage and ‘soft’ returns)</div></div></div>
<divclass="cardback"><DOCUMENT_FRAGMENT><divclass="mw-parser-output"><divclass="thumb tright"><divclass="thumbinner"style="width:152px;"><aclass="image"href="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/a/a3/Hidden_characters.jpg/960px-Hidden_characters.jpg"><imgalt=""class="thumbimage"decoding="async"src="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/a/a3/Hidden_characters.jpg/320px-Hidden_characters.jpg"></a><divclass="thumbcaption"><divclass="magnify"><aclass="internal"href="File:Hidden_characters.jpg.html"title="Enlarge"></a></div>Hidden characters (e.g. tabs, spaces, carriage and ‘soft’ returns)</div></div></div>
<h2><spanclass="mw-headline"id="Extracting_text_from_a_PDF">Extracting text from a PDF</span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/Trim4/Extracting_text_from_PDF&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<h2><spanclass="mw-headline"id="Extracting_text_from_a_PDF">Extracting text from a PDF</span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/Trim4/Extracting_text_from_PDF&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<p>In Al Sweigart's <i>Automate the Boring Stuff with Python</i>, there's a nice section on a Python library called PyPDF2 that allows you to work with the contents of PDFs. To begin with, I thought I'd try extracting text from a PDF of William S. Burrough's <i>The Electronic Revolution</i>. I chose this PDF as the only version I've found of it online is a 40pp document published by ubuclassics (which I suppose is the publishing house for ubuweb.com). There was no identifier other than this (no ISBN etc.), and it was impossible locating any other version online. What's more, the PDF had very small text, which was uncomfortable to read when I ran the <ahref="Michaels_booklet_script_for_PDF_imposition.html"title="User:Simon/Trim4/Michaels booklet script for PDF imposition">booklet.sh</a> script on it.
<p>In Al Sweigart's <i>Automate the Boring Stuff with Python</i>, there's a nice section on a Python library called PyPDF2 that allows you to work with the contents of PDFs. To begin with, I thought I'd try extracting text from a PDF of William S. Burrough's <i>The Electronic Revolution</i>. I chose this PDF as the only version I've found of it online is a 40pp document published by ubuclassics (which I suppose is the publishing house for ubuweb.com). There was no identifier other than this (no ISBN etc.), and it was impossible locating any other version online. What's more, the PDF had very small text, which was uncomfortable to read when I ran the booklet.sh script on it.
</p><p>I thought it would be worthwhile laying out this book again for print reading purposes, and the first step is to get the text from the PDF. Pandoc is usually my go to for extracting text, but it doesn't work with PDFs, so I tried <aclass="external text"href="https://pythonhosted.org/PyPDF2/index.html"rel="nofollow">PyPDF2</a>.
</p><p>I thought it would be worthwhile laying out this book again for print reading purposes, and the first step is to get the text from the PDF. Pandoc is usually my go to for extracting text, but it doesn't work with PDFs, so I tried <aclass="external text"href="https://pythonhosted.org/PyPDF2/index.html"rel="nofollow">PyPDF2</a>.
<p><i>Text Laundrette</i> is a workshop in which we use a home-made, DIY book scanner, and open-source software to scan, process, and add digital features to printed texts brought by the participants to the workshop. These are included in the “bootleg library”, a shadow library accessible over a local network. The workshop was organised by Simon Browne and Pedro Sá Couto, for the 2020 <ahref="Py.rate.chnic_sessions.html"title="Py.rate.chnic sessions">py.rate.chnic sessions</a> and first held at WdKA in the Publication Station, February 2020.
<p><i>Text Laundrette</i> is a workshop in which we use a home-made, DIY book scanner, and open-source software to scan, process, and add digital features to printed texts brought by the participants to the workshop. These are included in the “bootleg library”, a shadow library accessible over a local network. The workshop was organised by Simon Browne and Pedro Sá Couto, for the 2020 py.rate.chnic sessions and first held at WdKA in the Publication Station, February 2020.
</p><p><aclass="external text"href="https://pad.xpub.nl/p/sh_encoding_decoding%7C" rel="nofollow">Annotations on Stuart Hall's <i>Encoding, Decoding</i></a>
</p><p><aclass="external text"href="https://pad.xpub.nl/p/sh_encoding_decoding" rel="nofollow">Annotations on Stuart Hall's <i>Encoding, Decoding</i></a>
<h2><spanid="Reflections_on_classification_in_Hope_A._Olson,_Mapping_beyond_Dewey's_Boundaries"></span><spanclass="mw-headline"id="Reflections_on_classification_in_Hope_A._Olson.2C_Mapping_beyond_Dewey.27s_Boundaries">Reflections on classification in Hope A. Olson, <i>Mapping beyond Dewey's Boundaries</i></span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/spatial_classification&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<h2><spanid="Reflections_on_classification_in_Hope_A._Olson,_Mapping_beyond_Dewey's_Boundaries"></span><spanclass="mw-headline"id="Reflections_on_classification_in_Hope_A._Olson.2C_Mapping_beyond_Dewey.27s_Boundaries">Reflections on classification in Hope A. Olson, <i>Mapping beyond Dewey's Boundaries</i></span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/spatial_classification&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<p>Hope A Olson's text "Mapping Beyond Dewey's Boundaries" on spatial representations in classification systems, explored through a project that attempted to cross-reference two classification systems - <aclass="external text"href="https://www.amazon.com/Womens-Thesaurus-Index-Language-Ellen/dp/0061811718"rel="nofollow"><i>A Woman's Thesaurus</i></a> and Dewey Decimal Classification by <aclass="external text"href="http://capekconsulting.com/about-mary-ellen-capek/"rel="nofollow">Mary Ellen S Capek</a>. Stating that "classifications are locational systems" suggests that spatial representations can be used with various effect; describing, exposing, and when used as metaphors, shifting the discourse.
</p><p>Hope A Olson's text "Mapping Beyond Dewey's Boundaries" on spatial representations in classification systems, explored through a project that attempted to cross-reference two classification systems - <aclass="external text"href="https://www.amazon.com/Womens-Thesaurus-Index-Language-Ellen/dp/0061811718"rel="nofollow"><i>A Woman's Thesaurus</i></a> and Dewey Decimal Classification by <aclass="external text"href="http://capekconsulting.com/about-mary-ellen-capek/"rel="nofollow">Mary Ellen S Capek</a>. Stating that "classifications are locational systems" suggests that spatial representations can be used with various effect; describing, exposing, and when used as metaphors, shifting the discourse.
</p><p><b>1. Spatial representation of classification systems reveals the ideological conditions that form them.</b>
</p><p><b>1. Spatial representation of classification systems reveals the ideological conditions that form them.</b>
</p><p>Olson refers to spatial representations of classification systems in the form of diagrams.
</p><p>Olson refers to spatial representations of classification systems in the form of diagrams.
</p><p>The first diagram is one that shows distribution of subjects, with the idea of a mainstream core that diffuses towards the margins. The second is a Venn diagram that illustrates how "mainstream" or "core" descriptors actually eventuate in very small "cores" due to limitations by Boolean "ands'. Venn diagrams operate on the basis of duality - something is or isn't part of a set.<br>
</p><p>The first diagram is one that shows distribution of subjects, with the idea of a mainstream core that diffuses towards the margins. The second is a Venn diagram that illustrates how "mainstream" or "core" descriptors actually eventuate in very small "cores" due to limitations by Boolean "ands'. Venn diagrams operate on the basis of duality - something is or isn't part of a set.<br>
<h2><spanclass="mw-headline"id="first_trials_with_the_bookscanner">first trials with the bookscanner</span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/Trim4/Using_the_bookscanner&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<h2><spanclass="mw-headline"id="first_trials_with_the_bookscanner">first trials with the bookscanner</span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/Trim4/Using_the_bookscanner&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<p>I tried using the Bookscanner, built as part of <ahref="OuNuPo.html"title="OuNuPo">Special Issue 5 OuNuPo</a>. The Bookscanner needed a few adjustments before it was ready to use. The documentation is rather limited, but the software is set up so that it is quite easy to work out how to use it.
<p>I tried using the Bookscanner, built as part of Special Issue 5 OuNuPo. The Bookscanner needed a few adjustments before it was ready to use. The documentation is rather limited, but the software is set up so that it is quite easy to work out how to use it.
</p><p>The scanner takes photos of even and odd pages, from cameras mounted above the glass. First, you have to mount a drive in which the scanned images will be stored. Then, you can adjust the zoom, and shutter speed. I found it impossible to take an image of only the page, so I will need to crop out everything around it in the future.
</p><p>The scanner takes photos of even and odd pages, from cameras mounted above the glass. First, you have to mount a drive in which the scanned images will be stored. Then, you can adjust the zoom, and shutter speed. I found it impossible to take an image of only the page, so I will need to crop out everything around it in the future.
</p><p>I scanned a book, and made jpegs of each page. The pages are oriented from the camera's perspective, like so:
</p><p>I scanned a book, and made jpegs of each page. The pages are oriented from the camera's perspective, like so:
</p><p>Next, I ran a script that does OCR on jpegs that Pedro, Tancre and Bo made for their workshop <ahref="Blurry_Boundaries.html"title="Blurry Boundaries"> Blurry Boundaries</a> as part of Special Issue 9: The Library Is Open. This created two PDFs, with OCR. The next step will to be to work out how to rotate the images 90 degrees to the correct orientation, (clockwise for the odd pages, anti-clockwise for the even pages), and crop the images.
</p><p>Next, I ran a script that does OCR on jpegs that Pedro, Tancre and Bo made for their workshop Blurry Boundaries as part of Special Issue 9: The Library Is Open. This created two PDFs, with OCR. The next step will to be to work out how to rotate the images 90 degrees to the correct orientation, (clockwise for the odd pages, anti-clockwise for the even pages), and crop the images.
<divclass="cardback"><DOCUMENT_FRAGMENT><divclass="mw-parser-output"><divclass="thumb tright"><divclass="thumbinner"style="width:152px;"><aclass="image"href="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/5/59/Rosetta_stone.jpeg/960px-Rosetta_stone.jpeg"><imgalt=""class="thumbimage"decoding="async"src="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/5/59/Rosetta_stone.jpeg/320px-Rosetta_stone.jpeg"></a><divclass="thumbcaption"><divclass="magnify"><aclass="internal"href="File:Rosetta_stone.jpeg.html"title="Enlarge"></a></div><i>The Rosetta Stone</i>, a tablet discovered in 1799 inscribed with three versions of a decree written in Ancient Egyptian and Ancient Greek</div></div></div>
<divclass="cardback"><DOCUMENT_FRAGMENT><divclass="mw-parser-output"><divclass="thumb tright"><divclass="thumbinner"style="width:152px;"><aclass="image"href="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/5/59/Rosetta_stone.jpeg/960px-Rosetta_stone.jpeg"><imgalt=""class="thumbimage"decoding="async"src="https://pzwiki.wdka.nl/mw-mediadesign/images/thumb/5/59/Rosetta_stone.jpeg/320px-Rosetta_stone.jpeg"></a><divclass="thumbcaption"><divclass="magnify"><aclass="internal"href="File:Rosetta_stone.jpeg.html"title="Enlarge"></a></div><i>The Rosetta Stone</i>, a tablet discovered in 1799 inscribed with three versions of a decree written in Ancient Egyptian and Ancient Greek</div></div></div>
<h2><spanclass="mw-headline"id="traces_of_book_use_in_from_the_books">traces of book use in <i>from the books</i></span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/Annotation_typologies&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<h2><spanclass="mw-headline"id="traces_of_book_use_in_from_the_books">traces of book use in <i>from the books</i></span><spanclass="mw-editsection"><spanclass="mw-editsection-bracket">[</span><ahref="/mw-mediadesign/index.php?title=User:Simon/Annotation_typologies&action=edit&section=T-1"title="Edit section: ">edit</a><spanclass="mw-editsection-bracket">]</span></span></h2>
<p>Typologies of traces of use identified from a previous project called <a href="From_the_Books:_SLV_RBRR_000-099.html"title="User:Simon/From the Books: SLV RBRR 000-099">From the Books</a>, which explored books from the 000-099 section of the Redmond Barry Reading Room in the State Library of Victoria.
<p>Typologies of traces of use identified from a previous project called <em>From the Books: SLV RBRR 000-099</em>, which explored books from the 000-099 section of the Redmond Barry Reading Room in the State Library of Victoria.