TY - GEN
T1 - Generating Vocabulary Sets for Implicit Language Learning using Masked Language Modeling
AU - Edgar, Vatricia
AU - Bansal, Ajay
N1 - Publisher Copyright: © 2022 IEEE Computer Society. All rights reserved.
PY - 2022
Y1 - 2022
N2 - A well-balanced language curriculum must include both explicit vocabulary learning and implicit vocabulary learning. However, most language learning applications focus on explicit instruction. Students require support with implicit vocabulary learning because they need enough context to guess and acquire new words. Traditional techniques aim to teach students enough vocabulary to comprehend the text, thus enabling them to acquire new words. Despite the wide variety of support for vocabulary learning offered by learning applications today, few offer guidance on how to select an optimal vocabulary study set. This paper proposes a novel method of student modeling with masked language modeling to detect words that are required for comprehension of a text. It explores the efficacy of using deep learning via a pre-trained masked language model to model human reading comprehension and presents a vocabulary study set generation pipeline (VSGP). Promising results show that masked language modeling can be used to model human comprehension and the pipeline produces reasonably sized vocabulary study sets that can be integrated into language learning systems.
UR - http://www.scopus.com/inward/record.url?scp=85152237924&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85152237924&partnerID=8YFLogxK
U2 - 10.24251/hicss.2022.095
DO - 10.24251/hicss.2022.095
M3 - Conference contribution
AN - SCOPUS:85152237924
T3 - Proceedings of the Annual Hawaii International Conference on System Sciences
SP - 758
EP - 767
BT - Proceedings of the 55th Annual Hawaii International Conference on System Sciences, HICSS 2022
A2 - Bui, Tung X.
PB - IEEE Computer Society
T2 - 55th Annual Hawaii International Conference on System Sciences, HICSS 2022
Y2 - 3 January 2022 through 7 January 2022
ER -