E-Summ : Investigating Entropy for Extractive Document Summarization

This repository contains Python (v 3.7) scripts for implementation of E-Summ for extractive single document summarization.

E-Summ : E-Summ follows an information theoretic approach for unsupervised, extractive single document summarization. The proposed E-Summ algorithm is domain-, collection-independent and is agnostic to the language of the document. Moreover, the method is explainable and fast enough to meet realtime requirements for on-the-fly summarization of web documents in languages other than English.

Author: Alka Khurana Acknowledgement: Vasudha Bhatnagar

Citation:

@article{khurana2021investigating,
  title={Investigating entropy for extractive document summarization},
  author={Khurana, Alka and Bhatnagar, Vasudha},
  journal={Expert Systems with Applications},
  pages={115820},
  year={2021},
  publisher={Elsevier}
}

Pipeline:

Clone the complete directory.
Put the documents in the Documents folder for which summary is to be generated.
In all the .py files, change the current directory to working directory of your system.
Run Preprocessing.py for pre-processing the input documents.
Run Find_Topics.py to find the number of latent topics in the document.
Run ESumm_DUC.py for generating the summary E-Summ summary of the any DUC data-set document.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
ESumm_DUC.py		ESumm_DUC.py
Find_Topics.py		Find_Topics.py
Preprocessing.py		Preprocessing.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

E-Summ : Investigating Entropy for Extractive Document Summarization

Citation:

Pipeline:

About

Releases

Packages

Languages

alkakhurana/E-Summ

Folders and files

Latest commit

History

Repository files navigation

E-Summ : Investigating Entropy for Extractive Document Summarization

Citation:

Pipeline:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages