Skip to content

This project explores the Google Play Store app ecosystem using a Kaggle dataset. It will analyze app ratings, categories, genres, user reviews, and sentiment to gain insights.

Notifications You must be signed in to change notification settings

pantakanch/Google-Play-Store-Apps-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

Google Play Store Apps Analysis

Overview:

The "Google Play Store Apps SQL Data Analysis" project dives into the vast ecosystem of the Google Play Store, leveraging a Kaggle dataset to glean insights into app ratings, categories, genres, user reviews, and sentiments.

Dataset on Kaggle: https://www.kaggle.com/datasets/lava18/google-play-store-apps

Key Questions:

  1. Average App Rating:

    • What is the average app rating?
  2. App Ratings Distribution:

    • How do app ratings distribute?
  3. Category-wise Installs:

    • Which app categories have the most installs?
  4. Highest Rated Genres:

    • Which app genres have the highest average ratings?
  5. Free and Paid Apps Breakdown:

    • How many free and paid apps are there in each category?
  6. User Reviews Sentiment Overview:

    • What is the overall sentiment of user reviews?
  7. App-specific Sentiment Analysis:

    • How does sentiment vary for each app?
  8. Most Positively Reviewed App:

    • Which app has the most positive reviews?
  9. Top 10 Positively Reviewed Games:

    • What are the top 10 positively reviewed apps in the Games category?

SQL Queries:

  • Average App Rating:
SELECT 
	ROUND(AVG(Rating), 2) AS average_rating
FROM googleplaystore;

image

  • App Ratings Distribution:
SELECT 
    rating_range, 
    COUNT(*) AS count_rating
FROM (
    SELECT
        CASE
            WHEN Rating BETWEEN 1 AND 1.9 THEN 1
            WHEN Rating BETWEEN 2 AND 2.9 THEN 2
            WHEN Rating BETWEEN 3 AND 3.9 THEN 3
            WHEN Rating BETWEEN 4 AND 4.9 THEN 4
            WHEN Rating = 5 THEN 5
            ELSE "no_rating"
        END AS rating_range
    FROM googleplaystore
)
GROUP BY rating_range
ORDER BY rating_range;

image

  • Category-wise Installs:
SELECT 
	Category,
	SUM(CAST(REPLACE(Installs, '+', '') * 1000 AS INTEGER)) AS total_installs
FROM googleplaystore
GROUP BY Category
ORDER BY total_installs DESC;

image

  • Highest Rated Genres:
SELECT 
	Genres, 
	ROUND(AVG(Rating), 2) AS average_rating
FROM googleplaystore
GROUP BY Genres
ORDER BY average_rating DESC;

image

  • Free and Paid Apps Breakdown:
SELECT 
	Category, 
	Type, 
	COUNT(*) AS count_type
FROM googleplaystore
GROUP BY Category, Type;

image

  • User Reviews Sentiment Overview:
SELECT 
	Sentiment, 
	COUNT(*) AS sentiment_count
FROM googleplaystore_user_reviews
GROUP BY Sentiment
ORDER BY Sentiment;

image

  • App-specific Sentiment Analysis:
SELECT 
	App, 
	Sentiment,
	count(*) AS sentiment_count
FROM googleplaystore_user_reviews
GROUP BY App, Sentiment;

image

  • Most Positively Reviewed App:
SELECT 
	App, 
	Sentiment,
	count(*) AS sentiment_count
FROM googleplaystore_user_reviews
WHERE Sentiment = 'Positive'
GROUP BY App
ORDER BY sentiment_count DESC;

image

  • Top 10 Positively Reviewed Games:
SELECT 
	googleplaystore_user_reviews.App AS google_app,
	AVG(Sentiment_Polarity) AS avg_sentiment
FROM googleplaystore_user_reviews
	JOIN googleplaystore 
	ON googleplaystore_user_reviews.App = googleplaystore.App
WHERE googleplaystore.Category = 'GAME'
GROUP BY google_app
ORDER BY avg_sentiment DESC
LIMIT 10;

image

About

This project explores the Google Play Store app ecosystem using a Kaggle dataset. It will analyze app ratings, categories, genres, user reviews, and sentiment to gain insights.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published