Scala-Language-on-GitHub

This data science project analyzes GitHub repositories, specifically all Scala repositories, to track the most influential developers in the language's history. It was conducted in Python, using libraries such as pandas, numpy and seaborn for data cleaning, transformation and visualization.

The project consisted of importing a dataset of all Scala pull requests in .csv format and cleaning it to select the relevant categories. From the dataset, I filtered large projects with individual commits, then identified the users behind them by filtering on their user names. Finally, I selected the most recent pull requests and determined which of those users made the largest total contribution to Scala. The results indicated that two users, xeno-by and soc, were responsible for over 50% of the largest Scala contributions and are still involved in creating projects.
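The last step described above (take the most recent pull requests, then rank users by their share of them) can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the column names `pid`, `user` and `date`, the helper `contribution_shares`, and the toy rows are all assumptions made for the example, and the real dataset would be loaded with `pd.read_csv` instead of built in memory.

```python
import pandas as pd

# Toy stand-in for the pull-request CSV. The column names and every row
# here are hypothetical; the real dataset would come from pd.read_csv(...).
pulls = pd.DataFrame(
    {
        "pid": [1, 2, 3, 4, 5, 6],
        "user": ["dev_a", "dev_b", "dev_a", "dev_c", "dev_b", "dev_a"],
        "date": pd.to_datetime(
            [
                "2017-01-05",
                "2017-02-10",
                "2017-03-15",
                "2017-04-20",
                "2017-05-25",
                "2017-06-30",
            ]
        ),
    }
)


def contribution_shares(pulls: pd.DataFrame, last_n: int) -> pd.Series:
    """Return each user's share of the `last_n` most recent pull requests."""
    # Keep only the most recent pull requests.
    recent = pulls.sort_values("date", ascending=False).head(last_n)
    # Count pull requests per user and normalise so the shares sum to 1.
    counts = recent["user"].value_counts()
    return counts / counts.sum()


shares = contribution_shares(pulls, last_n=5)
print(shares)
```

Summing the shares of the top users then gives the kind of figure quoted in the results, for example `shares.loc[["dev_a", "dev_b"]].sum()` on this toy data.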

DaniCoimbra/Scala-Language-on-GitHub
