Installation¶
Clone repository¶
$ git clone https://github.com/nisargjhaveri/clstk
Python dependencies¶
The dependencies are listed in requirements.txt.
To install all the dependencies, run pip as followed.
$ pip install --upgrade -r requirements.txt
Also install nltk packages called stopwords and punkt.
$ python -m nltk.downloader stopwords punkt -d $NLTK_DATA
Setup CLUTO (optional)¶
http://glaros.dtc.umn.edu/gkhome/cluto/cluto/download
This is required if you want to use “linBilmes” summarizer.
Set an environment variable CLUTO_BIN_PATH with the path of directory containing vcluster binary file.
Setup ROUGE 1.5.5 (optional)¶
https://github.com/nisargjhaveri/ROUGE-1.5.5-unicode
This is required only if you plan to evaluate the summaries using ROUGE score.
Obtain and setup ROUGE 1.5.5 according to the instructions there.
Set an environment variable ROUGE_HOME with the path to ROUGE root directory, the one containing ROUGE-1.5.5.pl file.
Setup dependencies for TQE (optional)¶
https://github.com/nisargjhaveri/tqe
Install dependencies for tqe module according to the details provided in the link above.
Setup NeuralTextSimplification (optional)¶
https://github.com/senisioi/NeuralTextSimplification
Setup system from above URL and set NTS_OPENNMT_PATH, NTS_MODEL_PATH and NTS_GPUS variables accordingly.