Events The 1st International Online Conference on Education Sciences

Event submissions

Published

This submission belongs to the session S5. STEM Education of the event The 1st International Online Conference on Education Sciences

Published date

10 Jun, 2026

Academic Editor

Patricia Arriaga

Citation

Hacı Hasan Yolcu, Performance Evaluation of Generative AI Models in Chemistry Lesson Design Using Role-Based Prompting: Insights from Micro- and Nanoplastics, in Proceedings of The 1st International Online Conference on Education Sciences, 15 June–17 June 2026, MDPI: Basel, Switzerland

Facebook

Twitter

Performance Evaluation of Generative AI Models in Chemistry Lesson Design Using Role-Based Prompting: Insights from Micro- and Nanoplastics

Hacı Hasan Yolcu ¹

1. Dede Korkut Education Faculty, Kafkas University, Kars, Turkey, Turkey (Türkiye)

Abstract

Recent advances in generative artificial intelligence (GenAI) offer new possibilities for supporting teachers in instructional design; however, comparative empirical evidence on the performance of different models in discipline-specific lesson planning remains limited. This study evaluates five widely accessible GenAI models—ChatGPT (GPT-5.2), Claude (Sonnet 4.5), DeepSeek, Google Gemini 1.5, and Microsoft Copilot—in generating secondary-level chemistry lesson plans using role-assigned prompting (RAP). The instructional context was framed around the socio-scientific issue (SSI) of microplastic and nanoplastic impacts.

All models received the same role-assigned prompt (RAP) to act as experienced chemistry teachers and design curriculum-aligned lesson plans for 11th-grade students, which were then evaluated by five veteran chemistry teachers using an analytic rubric. The author-developed rubric, based on relevant theory, evaluated eight criteria, including learning outcome alignment, 5E model adherence, teacher–student role clarity, inquiry support, chemical accuracy, SSI integration, assessment quality, and language appropriateness.

Significant differences in model performance were observed. Claude produced the most comprehensive and pedagogically robust plans, with strong alignment to learning outcomes, effective SSI integration, and thorough assessments. ChatGPT offered structurally coherent plans aligned with the 5E model, but content depth was moderate. DeepSeek generated organized and practical plans, yet showed inconsistencies in 5E alignment and learning outcome coherence. Gemini and Microsoft Copilot performed weaker, with limited alignment to learning outcomes and more superficial chemistry content.

Overall, while all models generated broadly implementable lesson plans, their pedagogical quality varied significantly. The findings highlight the importance of model selection and prompt design in leveraging GenAI for chemistry education and suggest that RAP can be an effective strategy for enhancing instructional outputs.

Keywords

generative AI models

chemistry education

lesson planning

role-assigned prompting

socio-scientific issues,

Oral Presentation

Dr. Yolcu.pdf

Poster

Dr. Yolcu.pptx

Emotional Experiences and Support Stability among Teaching Assistants in Inclusive Education

Effects of Parent Training Facilitated by Elementary School Teachers for Parents of Children with Developmental Disabilities: Changes in Parenting Attitudes, Behavioral Problems, and Quality of Life