Generating Vocabulary Sets for Implicit Language Learning using Masked Language Modeling

Vatricia Edgar, Ajay Bansal

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

A well-balanced language curriculum must include both explicit vocabulary learning and implicit vocabulary learning. However, most language learning applications focus on explicit instruction. Students require support with implicit vocabulary learning because they need enough context to guess and acquire new words. Traditional techniques aim to teach students enough vocabulary to comprehend the text, thus enabling them to acquire new words. Despite the wide variety of support for vocabulary learning offered by learning applications today, few offer guidance on how to select an optimal vocabulary study set. This paper proposes a novel method of student modeling with masked language modeling to detect words that are required for comprehension of a text. It explores the efficacy of using deep learning via a pre-trained masked language model to model human reading comprehension and presents a vocabulary study set generation pipeline (VSGP). Promising results show that masked language modeling can be used to model human comprehension and the pipeline produces reasonably sized vocabulary study sets that can be integrated into language learning systems.

Original language: English (US)
Title of host publication: Proceedings of the 55th Annual Hawaii International Conference on System Sciences, HICSS 2022
Editors: Tung X. Bui
Publisher: IEEE Computer Society
Pages: 758-767
Number of pages: 10
ISBN (Electronic): 9780998133157
State: Published - 2022
Event: 55th Annual Hawaii International Conference on System Sciences, HICSS 2022 - Virtual, Online, United States
Duration: Jan 3 2022 to Jan 7 2022

Publication series

Name: Proceedings of the Annual Hawaii International Conference on System Sciences
Volume: 2022-January
ISSN (Print): 1530-1605

Conference

Conference: 55th Annual Hawaii International Conference on System Sciences, HICSS 2022
Country/Territory: United States
City: Virtual, Online
Period: 1/3/22 to 1/7/22

ASJC Scopus subject areas

  • General Engineering
