Scripts and static assets related to parsing timetable pdf of bphc
- Adjust variables like path/url to pdf, start & end page numbers, area for tabula, columns to parse in
pdf2json.py
- Ensure you have a Java runtime and set the PATH for it
pip install -r requirements.txt
python3 pdf2json.py
Lookout for the following while parsing the output json:
- null values in midsem_date , compre_date in courses
- empty lists for days, hours in sections