Full python (>=3.5) text mining utility for extract and pre-process text from documents, incluiding language detection and tf (l1 and l2 normalization) vector space model representation.
- pip3 install -r requirements.txt
- python3 -m spacy download [model]
Replace [model] with the pretrained model choosed for your language (English, Spanish or both). Visit here for more information.