Python (v 3.7.4, 32-bit) implementation of NMF based Ensembles for Extractive Single Document Summarization.
Author: Alka Khurana
Acknowledgement: Vasudha Bhatnagar
- Clone the repository.
- Put all the documents to be summarized in 'Documents' folder.
- In all .py file change the current directory path to the path in your system where the repository is cloned.
- Run 'Preprocessing.py'
- Run 'Ensembles.py' for generating summaries coressponding to different combining methods and different ensemble size.
- Run 'EnsemblesDUCNMFBest.py' for generating the base model summaries and ensemble summaries.
- Run 'EnsemblesDUCRandom_Method1.py' and 'EnsemblesDUCRandom_Method2.py' for generating summaries by varying the number of latent topics into which the document is decomposed.
- 'NMF.py' and 'Tokenization.py' are helper files.
- System (algorithmic) summaries are stored in 'Summaries' folder.
- Evaluate the performance of summaries using ROUGE toolkit.