Bi lingual translation and multi document text summarization using clustering approach| International Journal of Innovative Science and Research Technology

Bi-Lingual Translation and Multi-Document Text Summarization using Clustering Approach

Authors : Dolly Arun Bonde, Gurvinderpalkaur.P. Dhindsa, Damini Rangnath Landge, Prerna Didwania.

Volume/Issue : Volume 3 - 2018, Issue 2 - February

Google Scholar : https://goo.gl/DF9R4u

Thomson Reuters ResearcherID : https://goo.gl/3bkzwv

Abstract : Agriculture is the back-bone of economy in India. In order to enhance development in this field knowing only basics of agriculture is not enough. Researcher or framer should be aware of agriculture practices throughout the world. As English is global language most of the information is available in English, so when one browses agriculture information, it becomes difficult for some farmers to get the correct meaning of the information and quite difficult to go through each and every information. This result in language barrier and information overload that either leads to wastage of significant time browsing all information or else useful information is missed out. Hence text summarization in native language in agriculture field is very essential for user to get concise information about new technology. The proposed methodology comprises of machine translation, data pre-processing and automatic text summarization. The machine translation phase translates documents which are either in Hindi or Marathi language to the English document. After that data pre-processing takes place. Data pre-processing step involves noise removal, tokenization, stop word removal and stemming. On the pre-processed data automatic text summarization is performed using clustering approach. Then according to the user choice the summary will be translated to Hindi or Marathi language.

Keywords : Data mining, text summarization, extractive summarization, K-means clustering, Machine translation.

Agriculture is the back-bone of economy in India. In order to enhance development in this field knowing only basics of agriculture is not enough. Researcher or framer should be aware of agriculture practices throughout the world. As English is global language most of the information is available in English, so when one browses agriculture information, it becomes difficult for some farmers to get the correct meaning of the information and quite difficult to go through each and every information. This result in language barrier and information overload that either leads to wastage of significant time browsing all information or else useful information is missed out. Hence text summarization in native language in agriculture field is very essential for user to get concise information about new technology. The proposed methodology comprises of machine translation, data pre-processing and automatic text summarization. The machine translation phase translates documents which are either in Hindi or Marathi language to the English document. After that data pre-processing takes place. Data pre-processing step involves noise removal, tokenization, stop word removal and stemming. On the pre-processed data automatic text summarization is performed using clustering approach. Then according to the user choice the summary will be translated to Hindi or Marathi language.

Keywords : Data mining, text summarization, extractive summarization, K-means clustering, Machine translation.

Paper Submission Last Date
31 - August - 2026

SUBMIT YOUR PAPER CALL FOR PAPERS

Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.