A copy of the data set I used for this project can be obtained here: https://www.dropbox.com/s/wfrhyy7veba8gw0/indexByMatch_batchUdacity_205_CLEANED_NORM.zip?dl=0
To rebuild my dataset from scratch, you will need an API key, and matchIDS.
In order for this to work, you need to first request an API-Key from Bungie.net You can do so here: http://destinyapiguide.azurewebsites.net/getKey
MatchIds were given to me by https://destinytrialsreport.com/
With these two things, you will call PostGameCarnageReports (PGCRs) from Destiny
Libraries used in the totality of this process:
-
Python 3.5
-
Pandas (to make use of the DataFrame)
-
requests (to call Bungie's API)
-
deepcopy (from copy)
-
MatchClass (a custom class to hold the data from each match we call)
-
Numpy
-
Tensorflow
-
Keras (I used a Tensorflow backend).
-
SciKit-Learn
-
matplotlib.pyplot
-
Seaborn
-
display from IPython.display (Allows the use of display() for DataFrames)
-
PdfPages from matplotlib.backends.backend_pdf
Project pipeline:
- Cluster analysis to determine play-style clusters given individualized match data from Destiny (at the moment, the contenders are Expectation Maximation (GMMs) and K-means for the first round of testing).
- Since matches are between teams, each team can be looked at a sequence of players. Therefore, using the play-style as a feature of each person, one should be able to learn sequences of different playstyles constitute a good team. (Current thoughts are use an LSTM).
- Given an entire match of two teams, predict which team will win.
Therefore, the scale starts at the individual, progresses to the team level, and then to the match level.
This repo is part 3 of that pipeline. Parts 1 and 2 are being worked on currently.