Same as microtask-1, but now using pandas.
NOTE: Each microtask folder contains two files: microtask-#-online.ipynb, which uses elastic projects, and microtask-#.ipynb, which uses FOSSASIA projects as the data source. If you are checking the code in MyBinder, I suggest opening microtask-#-online.ipynb, since you can run the code there. If you are viewing it in the GitHub web interface, I suggest opening microtask-#.ipynb so that you can see the analysis of 5 repositories.
CSV files: csv_files/badgeyay.csv
The first 5 steps are the same as in the previous microtask. We will be using pandas in this microtask. The contents of the repository commits/issues/prs are stored in the contents dataframe, which will be used directly. With pandas, things are pretty simple because I can operate on columns directly.
- The contributions are divided into quarters by checking whether the created_at date falls between the start and end of a quarter, and each contribution is then categorized under that quarter. The oldcontributors are updated using the set function to avoid duplicates. The newcontributors are found by checking against the oldcontributors set whether an author has already contributed to the org. The activities are filtered using the hash of the commit/issue/pull_request.
- Later, I made a dataframe to store all this data, converted it to CSV, and printed it as a table using the pandas library.
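
The steps above can be sketched roughly as follows. This is only a minimal illustration, not the notebook's actual code: the sample rows, the column names (hash, author, type, created_at), and the names old_contributors, new_contributors and summary are assumptions made for the example. It also uses the pandas quarterly period helper instead of comparing each date against explicit quarter start and end dates.

```python
import pandas as pd

# Hypothetical sample data; the real contents dataframe comes from the
# earlier steps, and its column names may differ from the ones assumed here.
contents = pd.DataFrame(
    {
        "hash": ["a1", "a1", "b2", "c3", "d4", "e5"],
        "author": ["alice", "alice", "bob", "alice", "carol", "bob"],
        "type": ["commit", "commit", "issue", "pull_request", "commit", "issue"],
        "created_at": pd.to_datetime(
            ["2018-01-15", "2018-01-15", "2018-02-20",
             "2018-04-03", "2018-05-11", "2018-08-30"]
        ),
    }
)

# Categorize each row by the quarter its created_at date falls in, e.g. "2018Q1".
contents["quarter"] = contents["created_at"].dt.to_period("Q").astype(str)

# Filter activities by hash so each commit/issue/pull request is counted once.
contents = contents.drop_duplicates(subset="hash")

old_contributors = set()  # everyone seen in earlier quarters
rows = []
for quarter, group in contents.groupby("quarter", sort=True):
    authors = set(group["author"])
    # New contributors are authors not seen in any previous quarter.
    new_contributors = authors - old_contributors
    rows.append(
        {
            "quarter": quarter,
            "commits": int((group["type"] == "commit").sum()),
            "issues": int((group["type"] == "issue").sum()),
            "pull_requests": int((group["type"] == "pull_request").sum()),
            "new_contributors": len(new_contributors),
        }
    )
    old_contributors |= authors

# Collect the per-quarter numbers in a dataframe, convert to CSV, print a table.
summary = pd.DataFrame(rows)
csv_text = summary.to_csv(index=False)  # pass a file path instead to write a file
print(summary.to_string(index=False))
```

Keeping old_contributors as a set means a repeat author is never counted twice, and the set difference gives the new contributors for each quarter in one step.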