Repository from project to get data from technology section on The Guardian API and store as csv files locally.
Before start, you have to register your account to get the api-key here: https://bonobo.capi.gutools.co.uk/register/developer
You need python 3.6 and pip installed on your machine to run this application.
And, you have to set the configurations in config.ini
file:
[DEFAULT]
SECTION=technology
API_KEY=<your-api-key>
URI=https://content.guardianapis.com/search
PAGE_SIZE=200
OUTPUT_DIR=
pip install -r requirements.txt
To run the ETL process, you have to pass a start date and an end date, with arguments --from-date
and --to-date
python run.py --from-date "2018-08-15" --to-date "2018-08-17"
To run all test cases you need to run the following command:
python -m unittest discover ./app/tests
- Python 3.6 - Main language of project
- PIP - Dependency manager
- Pandas - Used pandas for test case