Leveraging Large Language Models to Generate Multiple-Choice Questions for Ophthalmology Education

Published: November 1, 2025

Publication: JAMA Ophthalmology

Summary

This study evaluated whether OpenAI's GPT-4 could reliably generate high-quality, novel multiple-choice questions (MCQs) for ophthalmology education comparable to those produced by experienced human experts. The findings demonstrated that LLM-generated MCQs matched human experts with identical median scores in appropriateness, clarity, and relevance. Notably, 95% of the AI-generated questions were novel, showing minimal similarity to existing databases while maintaining professional readability standards. These results suggest that LLMs offer an efficient way to expand examination resources and support ophthalmology residency training.

Publication Details

PMID: 41100119