From 50755f23fb36ca6bbda918c219e515fe7553e49a Mon Sep 17 00:00:00 2001 From: simoon Date: Sun, 5 Jul 2020 22:59:54 +0200 Subject: [PATCH] edits to rvrs --- tasks/Cleaning_up_text_rvrs.html | 2 +- tasks/Digitising_Printing_rvrs.html | 2 +- tasks/Human_reading_rvrs.html | 2 +- tasks/Indexing_rvrs.html | 3 +-- tasks/Scanning_rvrs.html | 4 ++-- tasks/Technologising_the_word_rvrs.html | 2 +- tasks/Trusting_rvrs.html | 12 +++++++----- 7 files changed, 14 insertions(+), 13 deletions(-) diff --git a/tasks/Cleaning_up_text_rvrs.html b/tasks/Cleaning_up_text_rvrs.html index b757454..46de71a 100644 --- a/tasks/Cleaning_up_text_rvrs.html +++ b/tasks/Cleaning_up_text_rvrs.html @@ -11,7 +11,7 @@
Hidden characters (e.g. tabs, spaces, carriage and ‘soft’ returns)

Extracting text from a PDF[edit]

-

In Al Sweigart's Automate the Boring Stuff with Python, there's a nice section on a Python library called PyPDF2 that allows you to work with the contents of PDFs. To begin with, I thought I'd try extracting text from a PDF of William S. Burrough's The Electronic Revolution. I chose this PDF as the only version I've found of it online is a 40pp document published by ubuclassics (which I suppose is the publishing house for ubuweb.com). There was no identifier other than this (no ISBN etc.), and it was impossible locating any other version online. What's more, the PDF had very small text, which was uncomfortable to read when I ran the booklet.sh script on it. +

In Al Sweigart's Automate the Boring Stuff with Python, there's a nice section on a Python library called PyPDF2 that allows you to work with the contents of PDFs. To begin with, I thought I'd try extracting text from a PDF of William S. Burrough's The Electronic Revolution. I chose this PDF as the only version I've found of it online is a 40pp document published by ubuclassics (which I suppose is the publishing house for ubuweb.com). There was no identifier other than this (no ISBN etc.), and it was impossible locating any other version online. What's more, the PDF had very small text, which was uncomfortable to read when I ran the booklet.sh script on it.

I thought it would be worthwhile laying out this book again for print reading purposes, and the first step is to get the text from the PDF. Pandoc is usually my go to for extracting text, but it doesn't work with PDFs, so I tried PyPDF2.

28.09.19[edit]

diff --git a/tasks/Digitising_Printing_rvrs.html b/tasks/Digitising_Printing_rvrs.html index 93097c3..f6a2f32 100644 --- a/tasks/Digitising_Printing_rvrs.html +++ b/tasks/Digitising_Printing_rvrs.html @@ -29,7 +29,7 @@

Text Laundrette[edit]

-

Text Laundrette is a workshop in which we use a home-made, DIY book scanner, and open-source software to scan, process, and add digital features to printed texts brought by the participants to the workshop. These are included in the “bootleg library”, a shadow library accessible over a local network. The workshop was organised by Simon Browne and Pedro Sá Couto, for the 2020 py.rate.chnic sessions and first held at WdKA in the Publication Station, February 2020. +

Text Laundrette is a workshop in which we use a home-made, DIY book scanner, and open-source software to scan, process, and add digital features to printed texts brought by the participants to the workshop. These are included in the “bootleg library”, a shadow library accessible over a local network. The workshop was organised by Simon Browne and Pedro Sá Couto, for the 2020 py.rate.chnic sessions and first held at WdKA in the Publication Station, February 2020.

Description[edit]

The bookscanner
diff --git a/tasks/Human_reading_rvrs.html b/tasks/Human_reading_rvrs.html index ad6b8b2..655d665 100644 --- a/tasks/Human_reading_rvrs.html +++ b/tasks/Human_reading_rvrs.html @@ -82,7 +82,7 @@ At Leeszaal:

  • Transcription of the voice performance (we can be inspired by film transcriptions) - perhaps we could annotate this as well:

https://static1.squarespace.com/static/583ae0a12994ca4dbbf813f6/t/58572e856a49634cd5602264/1531923111860 -

Annotations on Stuart Hall's Encoding, Decoding +

Annotations on Stuart Hall's Encoding, Decoding

+bootleg library sessions pad: Documentation of Session
+

+