We Are One - A Global Film Festival's HTML page parser to create an ICS file containing details for each film (date/hour/summary/etc).
.gitignore | ||
index.html | ||
parser.py | ||
README.md | ||
requirements.txt | ||
waoff-calendar.ics |
We Are One : A Global Film Festival parser
What does it do ?
This short scripts parses an HTML page from the official WAOFF schedule webpage (http://www.weareoneglobalfestival.com/schedule) and parses it to create an iCalendar-formatted file, that you can directly import in your calendar app.
Note : the webpage is JS-based, which means you can't just wget
it to extract the HTML page. You'll need to browse to the schedule page you want, go to the console, and type :
console.log(document.getElementsByTagName('html')[0].innerHTML);
, then copy-paste it to and html file locally, that you'll browse with the script.
TODO
- fix the start time problem in the ICS file
- handle timezone (the time on the website is EST)
- interactive choice of HTML page and name of calendar