BASH script
A BASH (Bourne Again SHell) script is a series of commands stored in a text file. It can contain any command one would type on the command line, along with control structures such as loops and conditions. We used BASH scripts to automatically upload batches of files, including their metadata, to the wiki.
# Skip files that already exist on the wiki
if page.exists:
    url = page.imageinfo['descriptionurl']
    print_colormsg(
        f'Already exists in {url} Will NOT be uploaded',
        level='warning')
# Otherwise render the metadata template for this file
else:
    img_smw_prop_val = smw_propval_template.render(
        title=args.title,
        date=args.date,
        part=n + 1,
        partof=len(lsimgs),
        creator=', '.join(args.creator[1:]),
        organization=', '.join(args.org[1:]),
        format=', '.join(args.format[1:]),
        event=', '.join(args.event[1:]),
        topic=', '.join(args.topic[1:]),
        language=', '.join(args.language[1:])
    )
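The excerpt above numbers each file within its batch (`part` and `partof`). As a minimal sketch of that bookkeeping, assuming nothing about the actual upload script, the following shows how a list of file names could be turned into one metadata record per image. The function name `plan_batch`, the record keys and the set of accepted file extensions are hypothetical and chosen only for illustration.

```python
from pathlib import PurePath

# Assumed set of image formats; the real batches may have used others.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".tif", ".tiff"}


def plan_batch(filenames, title, date):
    """Return one metadata record per image, in sorted file name order."""
    images = sorted(
        name for name in filenames
        if PurePath(name).suffix.lower() in IMAGE_SUFFIXES
    )
    return [
        {
            "file": name,
            "title": title,
            "date": date,
            "part": n + 1,          # 1-based position within the batch
            "partof": len(images),  # total number of images in the batch
        }
        for n, name in enumerate(images)
    ]
```

Calling `plan_batch(["b.png", "a.jpg", "notes.txt"], "Example", "2020-01-01")` would yield two records, for `a.jpg` (part 1 of 2) and `b.png` (part 2 of 2), and ignore the text file.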
Dark Web
The Dark Web is a portion of the Internet that is not indexed by search engines. Accessing it requires "specific software, configurations, or authorization". The anonymity it offers is of great importance when dealing with censorship and privacy concerns; it is also a haven for criminal activities. The Tor Browser is probably the most famous tool used to access the Dark Web. For us, the Dark Web provides protective layers that keep the location of the servers and the identity of all participants secure in case of attempted persecution of any kind.
Tor Browser
Using the Tor Browser helps prevent others from tracking your browsing behavior and user details such as your location. It is provided by the Tor Project, a non-profit organization. It works by routing your browser traffic through a series of randomly selected relays (nodes) in the Tor network, which makes the origin of the traffic unrecognizable. It is probably the most famous tool used to access the Dark Web.
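The idea of routing through several relays can be illustrated with a toy model. The sketch below is not how Tor is implemented: real Tor encrypts every layer with a key shared with one relay, whereas here plain JSON nesting stands in for encryption so that the layering itself is visible. The relay names and the functions `wrap`, `peel` and `simulate` are hypothetical.

```python
import json


def wrap(message, destination, relays):
    """Nest the message in one layer per relay (JSON stands in for encryption)."""
    # Innermost layer: read by the last relay, which forwards to the destination.
    packet = json.dumps({"next": destination, "payload": message})
    # Add the outer layers, working backwards towards the first relay.
    for relay in reversed(relays[1:]):
        packet = json.dumps({"next": relay, "payload": packet})
    return packet


def peel(packet):
    """Remove one layer: a relay learns only the next hop and an opaque payload."""
    layer = json.loads(packet)
    return layer["next"], layer["payload"]


def simulate(message, destination, relays):
    """Pass the packet along the circuit and record what each relay learns."""
    packet = wrap(message, destination, relays)
    hops = []
    current = relays[0]
    for _ in relays:
        next_hop, packet = peel(packet)
        hops.append((current, next_hop))
        current = next_hop
    return hops, packet
```

In this model each relay sees only the hop that follows it, so no single relay knows both where the message came from and where it is finally going.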
Pandoc
Pandoc is a free command-line tool used to convert documents from one markup format into another. We used Pandoc to convert documents written in Markdown into HTML and later into a print layout (?).
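As a rough sketch of the Markdown to HTML step, the following builds a Pandoc command line and runs it only if Pandoc is installed. The file names and the helper names `pandoc_command` and `convert` are assumptions for illustration; the options actually used in the project are not documented here.

```python
import shutil
import subprocess


def pandoc_command(source, target, to_format="html"):
    """Build the pandoc call that converts a Markdown file to `to_format`."""
    return [
        "pandoc", source,
        "--from", "markdown",
        "--to", to_format,
        "--standalone",      # emit a complete document rather than a fragment
        "--output", target,
    ]


def convert(source, target):
    """Run the conversion if pandoc is available; return True if it ran."""
    if shutil.which("pandoc") is None:
        return False
    subprocess.run(pandoc_command(source, target), check=True)
    return True
```

For example, `convert("chapter.md", "chapter.html")` would write an HTML version of a hypothetical `chapter.md` next to it.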
Tesseract
Tesseract is a free "optical character recognition" (OCR) engine and one of the most widely used of its kind. It recognizes text within images and outputs either plain text or a PDF containing the image with selectable, searchable text laid over it. Tesseract is essential for getting a comprehensive overview of a big batch of scanned documents. The quality of the results depends heavily on the quality of the scans, so the output can need large-scale revision.
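To give an idea of how a batch of scans might be processed, the sketch below builds one Tesseract command per image, producing either a searchable PDF or a plain text file. The helper name `tesseract_command`, the file names and the default language `eng` are assumptions; the commands are only constructed here, not executed.

```python
from pathlib import PurePath


def tesseract_command(image, language="eng", searchable_pdf=True):
    """Build the tesseract call for one scanned image."""
    # Tesseract appends the extension itself, so pass the name without suffix.
    output_base = str(PurePath(image).with_suffix(""))
    output_kind = "pdf" if searchable_pdf else "txt"
    return ["tesseract", image, output_base, "-l", language, output_kind]


def batch_commands(images, language="eng"):
    """One command per image, in sorted order."""
    return [tesseract_command(image, language) for image in sorted(images)]
```

Each resulting list can be passed to `subprocess.run(...)` on a machine where Tesseract and the chosen language data are installed.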
MediaWiki
MediaWiki, the software behind Wikipedia and many other wiki websites, is a free and open-source "collaboration and documentation" platform. Wiki markup is the language used to write content on a MediaWiki website. MediaWiki served us as a platform on which many users could handle large amounts of image files and metadata. It offers helpful tools for this, such as edit forms, semantic queries and an API.
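The API mentioned above can be queried over HTTP. As a hedged sketch, the following builds an Action API request for a file's `imageinfo` and reads the description page URL out of a response. The wiki address and file name in the usage note are made up, and the helper names are hypothetical; the response handling assumes the JSON layout of `formatversion=2`.

```python
from urllib.parse import urlencode


def imageinfo_query_url(api_url, filename):
    """Build an Action API URL that asks for a file's imageinfo."""
    params = {
        "action": "query",
        "titles": f"File:{filename}",
        "prop": "imageinfo",
        "iiprop": "url",
        "format": "json",
        "formatversion": "2",
    }
    return f"{api_url}?{urlencode(params)}"


def description_url(response):
    """Return the file's description page URL, or None if there is no imageinfo."""
    page = response["query"]["pages"][0]
    info = page.get("imageinfo")
    return info[0]["descriptionurl"] if info else None
```

For instance, `imageinfo_query_url("https://wiki.example.org/api.php", "Scan 01.jpg")` produces a URL that can be fetched with any HTTP client; passing the decoded JSON to `description_url` then tells whether the file is already on the wiki.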
Code of Conduct
We created a code of conduct to make sure we were on the same page about the fundamentals of our regular interactions as a group. It served as a reference on how to approach each other and helped prevent avoidable unconstructive conflict.
- Listen, don’t interrupt
- Listen actively - Show you’ve understood
- If you don’t agree, you can still accept
- When negotiating, be convivial (be nice!)
- Ask what is needed before you give
- If personal issues arise, try to address and resolve them; if that is not possible, park them for the sake of the project
- When you don’t understand, ask for help (be humble)
- Be aware of the space you take (Check temperature - silence is OKAY!)
- Before making a decision, check with others (might be an unhappy average)
- Speak for yourself and your own experience (see links: nonviolent communication)
- Use "AND" instead of "BUT" and "HOWEVER"
- These rules can be modified by consensus