Skip to content

Latest commit

 

History

History

microtask-2

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

Microtask 2

Same as microtask-1, but now using pandas.

NOTE: I have two files in each microtask folder, microtask-#-online.ipynb which uses elastic projects and microtask-#.ipynb which uses FOSSASIA projects as data source. If you are checking the code in MyBinder, I suggest you to open microtask-#-online.ipynb file as you can run the code. If you are opening in GitHub web interface, I suggest you to open microtask-#.ipynb so that you can see the analysis of 5 repositories.

Solution

Link to the Notebook

CSV files: csv_files/badgeyay.csv

mt2

Explanation

The first 5 steps are same as the previous microtask. We will be using pandas in this microtask. The contents of the repository commits/issues/prs are stored in the contents dataframe which will be directly. Using pandas, things are pretty simple as I have the power to operate on columns directly.

  1. The contributions are divided into quarters by checking the created_at date to fall in between the start and end of that quarter and then categorize the quarter. The oldcontributors are updated using the set function to avoid duplicates. The newcontributors are filtered by using the oldcontributors set by checking whether they are already contributed to the org. The activities are are filtered using the has of the commit/issue/pull_request.

  2. Later, I made a dataframe to store all these data and converted it to csv and printed it as table using pandas library.