Comparative Analysis of Predictive Models on Human Wellbeing

This project is centered around exploring the application of machine learning techniques to predict human wellbeing, comparing traditional econometric methods, such as Ordinary Least Squares (OLS), with modern algorithms like Least Absolute Shrinkage and Selection Operator (LASSO), Random Forests (RF), and Gradient Boosting (GB). Inspired by the work of Oparina et al. (2023), the aim is to assess the effectiveness of various models in predicting human wellbeing.

Data

The dataset for this project is sourced from the German Socio-Economic Panel (SOEP), covering the years 2010 to 2018. This timeframe aligns with the original paper, and flexibility is maintained to consider other years as long as data remains available.

Data Management and Variables of Interest

Focus will be placed on the "restricted set" mentioned in the paper, including variables such as Age, Area of Residence, BMI, Disability Status, Education, Labour-force status, Log HH income, Ethnicity/Migration Background, Health, Housing Status, Marital Status, Month of Interview, Number of children in HH, Number of people in HH, Religion, Sex, and Working Hours. Categorical data will be transformed into sets of dummy variables for analysis.

Analysis

a) Generate descriptive statistics for the variables of interest. b) Utilize the four algorithms to regress life satisfaction on the variables of interest. c) Compute performance metrics as R². d) Compare performance metrics across the different models.

Figures/Final Analysis

Produce figures akin to those presented in Oparina et al. (2023), encompassing model performance, performance improvement through the use of machine learning, variable importance, and wellbeing patterns concerning age and income.

Additional

a) Consider expanding the dataset by including other years for a more comprehensive analysis. b) Explore and apply additional machine learning algorithms beyond the ones mentioned in the original paper. c) Compare the performance of these new algorithms with those previously examined. As a considerable amount of time was spent on cleaning the data and selecting relevant variables, the additional part was disconsidered for this project and only the replication of the paper was maintained (the original code for this project was not available).

References

Oparina, E., Kaiser, C., Gentile, N., Tkatchenko, A., Clark, A. E., De Neve, J. E., & D'Ambrosio, C. (2023). Machine Learning in the Prediction of Human Wellbeing. Working Paper see.

Usage

To get started, create and activate the environment with

$ conda/mamba env create
$ conda activate wellbeing

To build the project, type

$ pytask

The dataset is privately owned.

Credits

This project was created with cookiecutter and the econ-project-templates.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.github		.github
inst		inst
paper		paper
src/wellbeing_and_machine_learning		src/wellbeing_and_machine_learning
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.yamllint.yml		.yamllint.yml
CHANGES.md		CHANGES.md
CITATION		CITATION
MANIFEST.in		MANIFEST.in
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Comparative Analysis of Predictive Models on Human Wellbeing

Data

Data Management and Variables of Interest

Analysis

Figures/Final Analysis

Additional

References

Usage

Credits

This project was created with cookiecutter and the econ-project-templates.

About

Releases

Packages

Contributors 2

Languages

willbackes/wellbeing_machine_learning

Folders and files

Latest commit

History

Repository files navigation

Comparative Analysis of Predictive Models on Human Wellbeing

Data

Data Management and Variables of Interest

Analysis

Figures/Final Analysis

Additional

References

Usage

Credits

This project was created with cookiecutter and the econ-project-templates.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages