YMCQ: Reasoning-Enhanced MCQ Generation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Automated multiple-choice question (MCQ) generation is valuable for scalable assessment and enhanced learning experiences. However, existing MCQ generation methods face challenges in ensuring plausible distractors and maintaining answer consistency. This paper introduces a method for MCQ generation that integrates reasoning-based explanations for both correct answers and distractors, leveraging open-source language models finetuned on publicly available datasets. Our approach addresses these issues with 3 major improvements. First, over 300k questions from public datasets were augmented with synthetically generated reasoning explanations. Second, we fine-tune a Large Language Model (LLM) with reasoning-based explanations to condition the generation while accounting for correct reasoning and possible misconceptions. Third, we introduce a multi-step filtering pipeline to ensure the validity of the question and the diversity of the generated distractors. This work argues for the effectiveness of reasoning-enhanced finetuning in improving MCQ generation quality while maintaining accessibility and cost efficiency. We release all resources, including synthetically augmented questions, training code, and the best model as open-source.

Original languageEnglish (US)
Title of host publicationArtificial Intelligence in Education - 26th International Conference, AIED 2025, Proceedings
EditorsAlexandra I. Cristea, Erin Walker, Yu Lu, Olga C. Santos, Seiji Isotani
PublisherSpringer Science and Business Media Deutschland GmbH
Pages308-315
Number of pages8
ISBN (Print)9783031984648
DOIs
StatePublished - 2025
Event26th International Conference on Artificial Intelligence in Education, AIED 2025 - Palermo, Italy
Duration: Jul 22 2025Jul 26 2025

Publication series

NameLecture Notes in Computer Science
Volume15882 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference26th International Conference on Artificial Intelligence in Education, AIED 2025
Country/TerritoryItaly
CityPalermo
Period7/22/257/26/25

Keywords

  • Large Language Models
  • MCQ generation
  • Multi-step filtering
  • Synthetic augmentation

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'YMCQ: Reasoning-Enhanced MCQ Generation'. Together they form a unique fingerprint.

Cite this