
This repository contains my data analysis projects using SQL, Python, R, Tableau, etc.


sara1594/Data_Analyst_Portfolio


Data Analyst Portfolio

About

Hi, I am Sachie, a junior analyst. The following projects are part of my data journey, and I’d like to share my skills in data analytics. Looking back, I have always been interacting with data. After earning the Google Data Analytics Professional Certificate, I got even deeper into the world of data and its sophisticated tools. Every day I make progress on finding meaningful insights from a messy process, and that brings me joy. My mission is to be one of the people who help make the world a better place through the power of data analytics.

Table of Contents

My Projects

Code : Student_list_4.sql, alcohol_consumption.Rmd

Background : I found an interesting dataset, "Student Alcohol Consumption", which is about secondary school students and their alcohol consumption. Drinking is a very common habit for many people. It brings us a lot of fun, but sometimes causes serious problems in our lives. Thus, I thought it would be wonderful if my case study of this dataset could offer insights that support a correct understanding of alcohol, especially for young people pursuing their academic goals.

The goal of this project is to answer this question:

  • Does alcohol consumption relate to students’ school performance, and if it does, what can they do to reach their goals?

Technologies and Libraries :
R, SQL, Microsoft Excel

Processes :

  1. Collect data from Kaggle
  2. Data screening with Microsoft Excel
  3. Data cleaning and manipulation with SQL
  4. Data analysis and Data visualization with R
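As a rough illustration of the analysis step, the sketch below groups students by alcohol consumption level and compares their mean final grades. It is written in Python rather than the SQL and R used in the project files, and it is not the code from Student_list_4.sql or alcohol_consumption.Rmd. The column names `Walc` (weekend consumption level) and `G3` (final grade) are assumed to match the Kaggle dataset, and the sample rows are made up for demonstration only.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Made-up sample rows for illustration only; NOT taken from the real dataset.
# Assumed columns: Walc = weekend alcohol consumption level, G3 = final grade.
SAMPLE_CSV = """Walc,G3
1,14
1,12
2,11
3,10
3,12
5,8
"""


def mean_grade_by_level(csv_text, level_col="Walc", grade_col="G3"):
    """Return {consumption level: mean final grade}, sorted by level."""
    grades = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        grades[int(row[level_col])].append(float(row[grade_col]))
    return {level: mean(values) for level, values in sorted(grades.items())}


result = mean_grade_by_level(SAMPLE_CSV)
for level, avg in result.items():
    print(f"Walc={level}: mean G3 = {avg:.1f}")
```

A group-wise mean like this only hints at a relationship; it does not show that drinking causes lower grades.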

Code : Job_Comparison.sql, jobs_from_bls.gov.ipynb

Background : Interestingly, the U.S. and Japan had about the same national annual median wage in 2020, but that doesn't mean their work environments are the same. This time, I was curious to compare the two countries, especially the wages and educational backgrounds of different careers. New findings from this analysis might be useful for people who are looking for a career in either country.

The goal of this project is to answer these questions:

  • Do educational backgrounds affect earnings? If yes, how?
  • Do wages for the same professions differ between the two countries? If yes, how?

Technologies and Libraries :
Python, SQL, Tableau, Microsoft Excel

Processes :

  1. Collect data with web scraping (https://www.bls.gov/ooh/) with Python
  2. Data screening with Microsoft Excel
  3. Data cleaning and manipulation with SQL
  4. Data analysis and Data visualization with Tableau
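The collection step can be sketched roughly as follows. This is not the code in jobs_from_bls.gov.ipynb: it uses only Python's standard library, and it parses a hard-coded HTML snippet instead of fetching a live page. The table markup, the labels, and the values in `SAMPLE_HTML` are hypothetical placeholders, and the real pages under https://www.bls.gov/ooh/ may be structured differently.

```python
from html.parser import HTMLParser

# Hypothetical markup and placeholder values for illustration only;
# the actual bls.gov pages may use a different structure.
SAMPLE_HTML = """
<table>
  <tr><th>Median Pay</th><td>$50,000 per year</td></tr>
  <tr><th>Typical Entry-Level Education</th><td>Bachelor's degree</td></tr>
</table>
"""


class FactTableParser(HTMLParser):
    """Collect the text of each table row as a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._cells = None      # cells of the row currently being read
        self._in_cell = False   # True while inside a <th> or <td>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._cells = []
        elif tag in ("th", "td") and self._cells is not None:
            self._in_cell = True
            self._cells.append("")

    def handle_data(self, data):
        if self._in_cell:
            self._cells[-1] += data

    def handle_endtag(self, tag):
        if tag in ("th", "td"):
            self._in_cell = False
        elif tag == "tr" and self._cells is not None:
            self.rows.append([cell.strip() for cell in self._cells])
            self._cells = None


def parse_facts(html):
    """Return {label: value} for every two-cell row in the HTML."""
    parser = FactTableParser()
    parser.feed(html)
    return {row[0]: row[1] for row in parser.rows if len(row) == 2}


def parse_annual_wage(text):
    """Convert a string such as '$50,000 per year' to an integer."""
    return int(text.split()[0].replace("$", "").replace(",", ""))


facts = parse_facts(SAMPLE_HTML)
wage = parse_annual_wage(facts["Median Pay"])
print(facts["Typical Entry-Level Education"], wage)
```

Converting the wage string to a number at collection time keeps the later SQL cleaning step simpler, since the column can be loaded as an integer.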

Background : Every life matters, but bias and indifference cause disparities in how human lives are treated. The racial disparity in media coverage is known as "White Syndrome". Although some reasoning can be offered to explain this matter, the facts from the data will make us think twice: the chance of saving people's lives should be equal.

The goal of this project is to answer these questions:

  • Who is actually missing?
  • How does 'White Syndrome' affect media coverage of missing persons?

Technologies and Libraries :
Tableau, Microsoft Excel

Processes :

  1. Extract data from a reliable source (NCIC website)
  2. Clean and organize data with Microsoft Excel
  3. Visualize data with Tableau

My Certificates

Google Data Analytics Professional Certificate


My Study Projects

Project 1. How Can a Wellness Technology Company Play It Smart? (From Google Data Analytics Course)

Code : fit_data_analysis.Rmd
Scenario :
You are a junior data analyst working on the marketing analyst team at Bellabeat, a high-tech manufacturer of health-focused products for women. Bellabeat is a successful small company, but they have the potential to become a larger player in the global smart device market. Urška Sršen, cofounder and Chief Creative Officer of Bellabeat, believes that analyzing smart device fitness data could help unlock new growth opportunities for the company. You have been asked to focus on one of Bellabeat’s products and analyze smart device data to gain insight into how consumers are using their smart devices. The insights you discover will then help guide marketing strategy for the company. You will present your analysis to the Bellabeat executive team along with your high-level recommendations for Bellabeat’s marketing strategy. (referred from Google Data Analytics Course)

The goal of this project is to answer these questions:

  • What are the desirable features for Bellabeat considering the current wearable device trend?
  • What feature can we add for better self-management?

Technologies and Libraries :
R, Microsoft Excel

Processes :

  1. Data import and storing
  2. Data cleaning, manipulation, and validation with Microsoft Excel and R
  3. Data analysis and visualization with R

Project 2. US Population with Tableau

Tableau Public : US States Population
Code : Job_Comparison.sql
Background :
Hex-tile maps are sometimes an effective way to comprehend data. Tableau enables us to create those simplified maps, so I wanted to try it myself. This time, I used a diamond shape instead of a hex shape. There are many useful resources on the Internet for creating a hex-tile map, but I mainly followed "How to use hex-tile maps to eliminate the Alaska effect". It is not a very complicated process, so this method will stay in my data visualization toolbox.

Technologies and Libraries :
SQL, Tableau, Microsoft Excel

Processes :

  1. Data import and storing
  2. Data cleaning, manipulation, and validation with Microsoft Excel and SQL
  3. Data analysis and visualization with Tableau
