A tool to generate summaries.
- Use a vritual environment using either virtualenv or anaconda
- a. virtualenv:
sudo pip3 install -U virtualenv virtualenv --system-site-packages -p python3 ./venv source ./venv/bin/activate # sh, bash, ksh, or zsh
- b. anaconda: Download anaconda or miniconda from their website
conda create -n tfg tensorflow-gpu tensorflow=2 conda activate tfg
- Download the following packages and tools (we used
pip install
):
rouge
nltk
tensorflow
tensorflow_hub
tqdm
sklearn
scipy
numpy
pickle
To fully run this project you will need to individually obtain the data from Cornell NEWSROOM. After downloading and unzipping the data, make the directory ~/AJ-summarization/input_data and place the train, dev, and test files in that folder.
- Make sure to be in virtual environment
- cd to AJ-summarization
python3 [summarization strategy file]
(listed below)- cd evaluation_src
- python3 evaluation.py
List of summarization strategies:
summarization-baseline.py
: Takes the first sentence of each article.
summarization.py
: Uses our modified TextRank to generate the summary.
first-bias.py
: First bias technique
weighted.py
: Statistical weighting strategy
In the future, we plan to add a bash file that can run the desired strategies and evaluation in one command.
Accepting No Pull Requests For Now.