Transformer-based Automatic Mapping of Clinical Notes to Specific Clinical Concepts

Jay Ganesh, Ajay Bansal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A significant proportion of medical errors exist in crucial medical information, and most stem from misinterpreting non-standardized clinical notes. This research compares four transformer-based models namely: BERT (Bidirectional Encoder Representations from Transformers) Base Uncased, Emilyalsentzer Bio-ClinicalBERT, RoBERTa (Robustly Optimized BERT Pre-Training Approach), and DeBERTa (Decoding-enhanced BERT with disentangled attention) to determine which among the four is the best backbone model for mapping free text in clinical notes to specific clinical concepts. Besides, the impact of context-specific embeddings on BERT was also studied to determine the need for a clinical BERT in Clinical Skills exam scoring. This research proposes the use of DeBERTa as a backbone model in patient note scoring for the United States Medical Licensing Examination (USMLE) Clinical Skills exam after comparing it with three other transformer models. Disentangled attention and enhanced mask decoder integrated into DeBERTa were credited for its high performance. Besides, the effect of meta pseudo labeling was also investigated in this research, which in turn, further enhanced DeBERTa's performance.

Original languageEnglish (US)
Title of host publicationProceedings - 2023 IEEE 47th Annual Computers, Software, and Applications Conference, COMPSAC 2023
EditorsHossain Shahriar, Yuuichi Teranishi, Alfredo Cuzzocrea, Moushumi Sharmin, Dave Towey, AKM Jahangir Alam Majumder, Hiroki Kashiwazaki, Ji-Jiang Yang, Michiharu Takemoto, Nazmus Sakib, Ryohei Banno, Sheikh Iqbal Ahamed
PublisherIEEE Computer Society
Pages558-563
Number of pages6
ISBN (Electronic)9798350326970
DOIs
StatePublished - 2023
Event47th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2023 - Hybrid, Torino, Italy
Duration: Jun 26 2023Jun 30 2023

Publication series

NameProceedings - International Computer Software and Applications Conference
Volume2023-June
ISSN (Print)0730-3157

Conference

Conference47th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2023
Country/TerritoryItaly
CityHybrid, Torino
Period6/26/236/30/23

Keywords

  • BERT
  • BERT Base Uncased
  • DeBERTa
  • RoBERTa
  • clinical BERT
  • meta pseudo labeling

ASJC Scopus subject areas

  • Software
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Transformer-based Automatic Mapping of Clinical Notes to Specific Clinical Concepts'. Together they form a unique fingerprint.

Cite this