We Are One - A Global Film Festival's HTML page parser to create an ICS file containing details for each film (date/hour/summary/etc).
html | ||
.gitignore | ||
parser.py | ||
README.md | ||
requirements.txt |
We Are One : A Global Film Festival parser
What does it do ?
This short scripts parses HTML pages from the official WAOFF schedule webpage (http://www.weareoneglobalfestival.com/schedule) in order to create an iCalendar-formatted file, that you can directly import in your calendar app.
Usage :
python3 parser.py
Note : the webpage is JS-based, which means you can't just wget
it to extract the HTML page. You'll need to browse to the schedule page you want, go to the console, and type :
console.log(document.getElementsByTagName('html')[0].innerHTML);
, then copy-paste it to and html file locally, that you'll browse with the script.