Text summarisation is compressed or condensed version of any text document. Due to increasing use of digitisation, massive amount of information is available on internet. Text summarisation is an emerging alternative for users to find relative information in automated shortened versions. In this paper we propose single document summarisation technique using shallow features of sentence to generate summary. The weight of sentences is calculated by applying score of different words and sentence-based statistical features. Here, most salient sentences are selected based on weight of sentences and are put together to generate summary. This is modeled using fuzzy inference system. This approach utilises fuzzy inference and fuzzy measures to find most significant sentences. The result of our proposed method is compared with other methods using recall oriented understudy for Gisting evaluation (ROUGE-N) measures on document understanding conferences (DUC) 2002 dataset and results show that our proposed method outperforms a few baseline methods.
Huge volume of data from domain specific applications such as medical, financial, library, telephone,shopping records and individual are regularly generated. Sharing of these data is proved to be beneficialfor data mining application. On one hand such datais an important asset to business decision making byanalyzing it. On the other hand data privacy concerns may prevent data owners from sharing informationfor data analysis. In order to share data while preserving privacy, data owner must come up with a solutionwhich achieves the dual goal of privacy preservation as well as an accuracy of data mining task –clustering and classification. An efficient and effective approach has been proposed that aims to protectprivacy of sensitive information and obtaining dataclustering with minimum information loss
Sentiment analysis identifies users in the textual reviews available in social networking sites, tweets, blog posts, forums, status updates to share their emotions or reviews and these reviews are to be used by market researchers to do know the product reviews and current trends in the market. The sentiment analysis is performed by two methods. Machine learning approaches and lexicon methods which are also known as the knowledge base approach. These. In this article, the authors evaluate the performance of some machine learning techniques: Maximum Entropy, Naïve Bayes and Support Vector Machines on two benchmark datasets: the positive-negative dataset and a Movie Review dataset by measuring parameters like accuracy, precision, recall and F-score. In this article, the authors present the performance of various sentiment analysis and classification methods by classifying the reviews in binary classes as positive, negative opinion about reviews on different domains of dataset. It is also justified that sentiment analysis using the Support Vector Machine outperforms other machine learning techniques.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.