Skip to content

Latest commit

 

History

History
8 lines (6 loc) · 278 Bytes

README.md

File metadata and controls

8 lines (6 loc) · 278 Bytes

wikipediaNLP

How to use: Run the "Wikipedia similarity graph" notebook.

Other files:

  • wikipedia_preprocessing.py: dataset management and preprocessing functions.
  • "Wikipedia Portals": used to visualise results during processing
  • "Page2vec proof of concept": early draft.