Generation Of Extractive Summary Based On Document Semantics

Nirmala S

doi:10.15520/ajcsit.v7i1.53

Generation Of Extractive Summary Based On Document Semantics

Authors

Nirmala S Christ university, Bengaluru

DOI:

https://doi.org/10.15520/ajcsit.v7i1.53

Abstract

In the recent years, significant research contribution and progress observed in developing methods for machines to understand concepts within documents. For machines a document represents language based information which consist of meaningful units known as data patterns or document units. These document units are the languageâ€™s verbs, adverbs, nouns, prepositions, etc. that contributes towards building the document. The current research activities in this field, is not just limited to picking some keywords to understand the document concepts but aims to gain a precise understanding of the concepts through correlationÂ Â of words and extracting sentences to obtain summaries. This would help in retrieving meaningful information and reducing the effort of going through the whole document to get its main insight.In our application, we use the Latent Semantic Analysis (LSA) algorithm for text summarization. The dataset is trained using the algorithm and a matrix is generated. This matrix gives us the correlation of words within documents. LSA uses the SVD to capture all correlations latent within a document by modelling relationships among words and sentences within the text.

Article Metrics Graph

Chart Graph | Range Graph

Downloads

Published

2017-03-23

Issue

Asian Journal of Computer Science and Information Technology

Section

Articles

License

COPYRIGHT AGREEMENT AND AUTHORSHIP RESPONSIBILITY

Â All paper submissions must carry the following duly signed by all the authors:

â€œI certify that I have participated sufficiently in the conception and design of this work and the analysis of the data (wherever applicable), as well as the writing of the manuscript, to take public responsibility for it. I believe the manuscript represents valid work. I have reviewed the final version of the manuscript and approve it for publication. Neither has the manuscript nor one with substantially similar content under my authorship been published nor is being considered for publication elsewhere, except as described in an attachment. Furthermore I attest that I shall produce the data upon which the manuscript is based for examination by the editors or their assignees, if requested.â€