We Are One - A Global Film Festival's HTML page parser to create an ICS file containing details for each film (date/hour/summary/etc).
Go to file
2020-05-30 16:46:19 +02:00
.gitignore First commit of a python parser 2020-05-30 16:46:19 +02:00
index.html First commit of a python parser 2020-05-30 16:46:19 +02:00
parser.py First commit of a python parser 2020-05-30 16:46:19 +02:00
README.md First commit of a python parser 2020-05-30 16:46:19 +02:00
requirements.txt First commit of a python parser 2020-05-30 16:46:19 +02:00
waoff-calendar.ics First commit of a python parser 2020-05-30 16:46:19 +02:00

We Are One : A Global Film Festival parser

What does it do ?

This short scripts parses an HTML page from the official WAOFF schedule webpage (http://www.weareoneglobalfestival.com/schedule) and parses it to create an iCalendar-formatted file, that you can directly import in your calendar app.

Note : the webpage is JS-based, which means you can't just wget it to extract the HTML page. You'll need to browse to the schedule page you want, go to the console, and type :

console.log(document.getElementsByTagName('html')[0].innerHTML);

, then copy-paste it to and html file locally, that you'll browse with the script.

TODO

  • fix the start time problem in the ICS file
  • handle timezone (the time on the website is EST)
  • interactive choice of HTML page and name of calendar