How to use: Run the "Wikipedia similarity graph" notebook.
Other files:
- wikipedia_preprocessing.py: dataset management and preprocessing functions.
- "Wikipedia Portals": used to visualise results during processing
- "Page2vec proof of concept": early draft.