Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/3650
Title: PERFORMANCE ANALYSIS OF ABSTRACT-BASED CLASSIFICATION OF MEDICAL JOURNALS USING MACHINE LEARNING TECHNIQUES
Authors: Deepika, A
Radha, N
Keywords: Text classification
Cancer
Naïve Bayes
Support vector machine
LSTM
Latent Dirichlet algorithm
Non-negative matrix factorization
Topic modelling
Issue Date: 14-Sep-2021
Publisher: Springer Link
Abstract: Researchers face many challenges in finding the opt web-based resources by giving the queries based on keyword search. Due to advent of Internet, there are huge biological literatures that are deposited in the medical database repository in recent years. Nowadays, as many web-based medical researchers evolved in the field of medicine, there is need for an intelligent and efficient extraction technique required to filter appropriate and opt literature from the growing body of biomedical literature repository. In this research work, new combination of model is proposed in order to find the new insights in applying the combination of algorithm on biological data set. The information in the biomedical field is the basic information for healthy living. National Center for Biotechnology Information (NCBI)’s PubMed is the major source of peer-reviewed biomedical documents for researchers and health practitioners in the field of health-related management. In this paper, abstracts available in PubMed database is used for experimentation. In recent years, deep learning-based neural approach models provide an efficient way to create an end-to-end model that can accurately measure classification labels. This research work is a systematic analysis of performance of the supervised learning models such as Naïve Bayes (NB), support vector machine (SVM) and long short-term memory (LSTM) by implementing on textual medical data. The novelty in this work is the process of incorporating certain topic modelling techniques after the pre-processing phase to automatically label the documents. Topic modelling is a useful technique in increasing the efficiency and improves the ability of researchers to interpret biological information. So, the classification algorithms thus proposed are implemented in combination with popular topic modelling algorithms such as latent Dirichlet algorithm (LDA) and non-negative matrix factorization (NMF). The final performance of the combination of algorithms is also analysed and is found that SVM with NMF outperforms the other models.
URI: https://link.springer.com/chapter/10.1007/978-981-16-3728-5_47
Appears in Collections:Book Chapter



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.