Enhanced Cluster Merging and Deep Learning Techniques for Entity Name Identification from Biomedical Corpus

Authors

  • Nilanjana Das, Vidyasagar University
  • Rakesh Dutta, Hijli College
  • Uttam Kumar Mondal, Vidyasagar University
  • Mukta Majumder, University of North Bengal
  • Jyotsna Kumar Mandal, Kalyani University

DOI:

https://doi.org/10.7494/csci.2025.26.1.5600

Abstract

Identifying names is the primary task in mining biomedical information. The complex and ambiguous naming styles of biomedical entities are the major obstacles here. As a result, state-of-the-art accuracy for biomedical name identification is noticeably lower than in the general domain. This study applies machine learning and deep learning techniques to recognize names in a biomedical corpus. In supervised classification, a classifier is built by deriving the required statistics from a training corpus. Accordingly, the performance of the system depends primarily on the quantity and quality of the training corpus. However, manually preparing a large training dataset with feature-rich samples is laborious and time-consuming. Therefore, various techniques have been adopted in the literature to make effective use of raw corpora. We incorporate a novel Cluster Merging technique to boost the machine learning classifier and an Attention Mechanism with BERT embeddings to boost the deep learning classifier. The results indicate that the proposed techniques are effective and show a significant improvement over existing methods.

Published

2025-04-01

Issue

Vol. 26 No. 1 (2025)

Section

Articles

How to Cite

Das, N., Dutta, R., Mondal, U. K., Majumder, M., & Mandal, J. K. (2025). Enhanced Cluster Merging and Deep Learning Techniques for Entity Name Identification from Biomedical Corpus. Computer Science, 26(1). https://doi.org/10.7494/csci.2025.26.1.5600