Bike Sharing Demand Prediction

This project aims to model the demand for shared bikes based on various independent variables. The model will help management understand how demand fluctuates with different features, enabling them to adjust business strategies to meet demand and customer expectations. It will also provide insights into demand dynamics in new markets.

General Information

What is the background of the project?
- A bike-sharing system is a service in which bikes are made available for shared use to individuals on a short term basis for a price or free. Many bike share systems allow people to borrow a bike from a "dock" which is usually computer-controlled wherein the user enters the payment information, and the system unlocks it. This bike can then be returned to another dock belonging to the same system.
What is the business probem that your project is trying to solve?
- You are required to model the demand for shared bikes with the available independent variables. It will be used by the management to understand how exactly the demands vary with different features. They can accordingly manipulate the business strategy to meet the demand levels and meet the customer's expectations. Further, the model will be a good way for management to understand the demand dynamics of a new market.
What is the dataset that is being used?
- The dataset 'day.csv' have the following fields:
  - instant: record index
  - dteday : date
  - season : season (1:spring, 2:summer, 3:fall, 4:winter)
  - yr : year (0: 2018, 1:2019)
  - mnth : month ( 1 to 12)
  - holiday : weather day is a holiday or not (extracted from http://dchr.dc.gov/page/holiday-schedule)
  - weekday : day of the week
  - workingday : if day is neither weekend nor holiday is 1, otherwise is 0.
  - weathersit :
    1. Clear, Few clouds, Partly cloudy, Partly cloudy
    2. Mist + Cloudy, Mist + Broken clouds, Mist + Few clouds, Mist
    3. Light Snow, Light Rain + Thunderstorm + Scattered clouds, Light Rain + Scattered clouds
    4. Heavy Rain + Ice Pallets + Thunderstorm + Mist, Snow + Fog
  - temp : temperature in Celsius
  - atemp: feeling temperature in Celsius
  - hum: humidity
  - windspeed: wind speed
  - casual: count of casual users
  - registered: count of registered users
  - cnt: count of total rental bikes including both casual and registered
Project Steps

Data Loading and Exploration:
- Load the bike-sharing dataset (day.csv).
- Explore data types, check for missing values, and perform descriptive statistics.
- Visualize data using various plots (bar plots, pair plots, heatmaps, box plots) to understand relationships between variables, identify outliers and multicollinearity.
- Reverse label encoding of categorical variables for EDA.
Data Preprocessing:
- Drop irrelevant columns (instant, casual, registered, atemp).
- Handle outliers.
- Address multicollinearity by removing highly correlated features ('yr', 'mnth', 'hum', 'season') using VIF and correlation analysis.
- Further feature engineering if necessary.
Model Building:
- Split data into training and testing sets.
- Train a linear regression model.
- Make predictions on the test set.
Model Evaluation:
- Evaluate model performance using R-squared and visualization of actual vs. predicted values.

Conclusions

The project successfully builds a linear regression model to predict bike-sharing demand. The model performance is evaluated using R-squared metric and a visual comparison of predictions against actual values. Data exploration and preprocessing steps, including handling outliers and multicollinearity, were critical in achieving reasonable model accuracy of ~ 71 %.

Technologies Used

pandas==1.5.3
numpy==1.23.5
matplotlib==3.7.1
seaborn==0.12.2
scikit-learn==1.2.2
statsmodels==0.13.5

Acknowledgements

I would like to thank Upgrad for giving me this opportunity to work on this project.

Contact

Created by Anirudha Kumar Sahu - feel free to contact me!

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
Regression+Subjective+Questions.docx		Regression+Subjective+Questions.docx
Regression+Subjective+Questions.pdf		Regression+Subjective+Questions.pdf
bike_sharing_multiple_linear_regression_22_2_25.ipynb		bike_sharing_multiple_linear_regression_22_2_25.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bike Sharing Demand Prediction

Table of Contents

General Information

Conclusions

Technologies Used

Acknowledgements

Contact

About

Releases

Packages

Languages

anirudhasahu92/bike_sharing_assignment

Folders and files

Latest commit

History

Repository files navigation

Bike Sharing Demand Prediction

Table of Contents

General Information

Conclusions

Technologies Used

Acknowledgements

Contact

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages