special-issue-11-wiki2html/README.md

# Wiki to HTML pages script
![](https://pzwiki.wdka.nl/mw-mediadesign/images/8/82/Workflow-wiki2html.svg)

## Dependencies
* python3
* [pip](https://pip.pypa.io/en/stable/installing/), the Python package installer
    * Install:
        * `curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py`
        * `python3 get-pip.py`

* [mwclient](https://mwclient.readthedocs.io/en/latest/index.html) Python library
    * Install:
        * `pip3 install mwclient`
* [jinja2](https://jinja.palletsprojects.com/en/2.11.x/) Python library
    * Install:
        * `pip3 install jinja2`
* [pandoc](https://pandoc.org/)
    * Install:
        * Debian/Ubuntu: `sudo apt install pandoc`
        * Mac: `brew install pandoc`
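
To check quickly whether the dependencies listed above are available, something like the following sketch could be used. The function name `missing_dependencies` is hypothetical and not part of this repository; it only looks the names up and installs nothing.

```python
import importlib.util
import shutil


def missing_dependencies(modules=("mwclient", "jinja2"), programs=("pandoc",)):
    """Return the names of Python modules and programs that cannot be found."""
    # find_spec returns None when a top-level module is not importable
    missing = [m for m in modules if importlib.util.find_spec(m) is None]
    # shutil.which returns None when a program is not on the PATH
    missing += [p for p in programs if shutil.which(p) is None]
    return missing


missing = missing_dependencies()
print("missing:", ", ".join(missing) if missing else "nothing")
```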


## login.txt
`login.txt` is a local, individual file, ignored by git, where you place your itch wiki username and password, on separate lines.

It is used to let mwclient access the wiki, since the wiki is closed for reading and writing.
```
myusername
mypassword
```
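
A minimal sketch of how a script might read this file, assuming the two-line layout shown above. The helper name `read_login` is hypothetical, and the scripts in this repository may read the file differently.

```python
from pathlib import Path


def read_login(path="login.txt"):
    """Return (username, password) from a two-line credentials file."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    if len(lines) < 2:
        raise ValueError(f"{path} must contain a username line and a password line")
    # first line: username, second line: password (kept exactly as written)
    return lines[0].strip(), lines[1]


# Possible use with mwclient. WIKI_HOST and WIKI_PATH are placeholders,
# not values taken from this repository:
#
#   import mwclient
#   site = mwclient.Site(WIKI_HOST, path=WIKI_PATH)
#   site.login(*read_login())
```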


## Run

`cd special-issue-11-wiki2html/`

Run all scripts together with `./run.sh`

Or run one script at a time:

`python3 download_imgs.py`
* Downloads all images from the wiki to the `images/` directory
* Stores each image's metadata in `images.json`

`python3 query2html.py`
* Performs a query with the ask API:
    * help: `python3 query2html.py --help`
    * dry run: `python3 query2html.py --dry` only prints the request, without executing it
    * build a custom query with the arguments `--conditions  --printouts  --sort  --order`
    * default query: `[[File:+]][[Title::+]][[Part::+]][[Date::+]]|?Title|?Date|?Part|?Partof|sort=Date,Title,Part|order=asc,asc,asc`
    * custom query: `python3 query2html.py -c '[[Date::>=1970/01/01]][[Date::<=1979/12/31]]' -p '?Title|?Date|?Part|?Partof' -s 'Date,Title,Part' -o 'asc,asc,asc'`
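
The four arguments map onto the four parts of the ask query string. A minimal sketch of how they could be joined, assuming the same layout as the default query above; `build_ask_query` is a hypothetical name, not necessarily what `query2html.py` does internally.

```python
def build_ask_query(conditions, printouts, sort, order):
    """Join the four query parts into one ask query string."""
    return f"{conditions}|{printouts}|sort={sort}|order={order}"


# Rebuild the default query from its parts
default_query = build_ask_query(
    conditions="[[File:+]][[Title::+]][[Part::+]][[Date::+]]",
    printouts="?Title|?Date|?Part|?Partof",
    sort="Date,Title,Part",
    order="asc,asc,asc",
)
print(default_query)

# With a logged-in mwclient site, the string could then be passed to
# site.ask(default_query); that step is left out so the sketch runs offline.
```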

* Results that share the same Title are stored
    * in a single HTML file
    * sorted by Part
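
The grouping described above can be sketched as follows. The shape of each result (a dict with `Title` and `Part` keys) is an assumption made for illustration, as is the name `group_by_title`; the actual response handled by `unpack_response()` may look different.

```python
from collections import defaultdict


def group_by_title(results):
    """Group results by Title and sort each group by Part."""
    groups = defaultdict(list)
    for result in results:
        groups[result["Title"]].append(result)
    # one entry per Title, which would become one HTML file
    return {
        title: sorted(items, key=lambda item: item["Part"])
        for title, items in groups.items()
    }


# Hypothetical sample data, not taken from the wiki
sample = [
    {"Title": "A", "Part": 2},
    {"Title": "B", "Part": 1},
    {"Title": "A", "Part": 1},
]
pages = group_by_title(sample)
```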


## TODO
* remove HTML files at each new query
* revise `def unpack_response()` so that it returns the values of all properties printed out
* revise templates so that they include the values of all properties printed out and do not break on missing values