Author(s):
Zhao, Tongyu ; Gao, Jiaying ; Feng, Yu ; Zu, Yatong ; Tavares, Adriano ; Gomes, Tiago Manuel Ribeiro ; Pinto, Sandro ; Xu, Hao
Date: 2025
Persistent ID: https://hdl.handle.net/1822/95483
Origin: RepositóriUM - Universidade do Minho
Subject(s): Learning confusion; Discussion forum; Text classification; Confusion characterization
Description
Understanding and identifying the nature of learner confusion is important for online learning platforms. In this study, we address this problem by analyzing forum posts from large-scale online courses, where posts expressing confusion are often overlooked because of the large volume of comments and frequent interactions. Existing methods can detect confusion, but they typically rely on linguistic features of posts and community factors (e.g., votes, views) and ignore personalized contexts, such as the specific causes and types of confusion. To fill this gap, we create the first deep learning dataset focused on confusion types and develop a BERT-based network that models personalized features and identifies confusion types. Because the distribution of confusion types is highly imbalanced, we further design a novel loss function that adaptively optimizes the training weights for each type. Extensive experiments confirm the effectiveness of our method.
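The description does not specify the form of the loss function, so the following is only an illustrative sketch of the general idea of weighting classes to counter imbalance. It is not the authors' method. It assumes a standard technique instead: cross-entropy with fixed inverse-frequency class weights. The function names (`inverse_frequency_weights`, `weighted_cross_entropy`) and the toy labels are hypothetical.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels, num_classes):
    """Assumed scheme: weight = total / (num_classes * count), so rare types weigh more."""
    counts = Counter(labels)
    total = len(labels)
    # A class that never occurs gets weight 0.0 to avoid division by zero.
    return [
        total / (num_classes * counts[c]) if counts[c] else 0.0
        for c in range(num_classes)
    ]


def softmax(logits):
    """Numerically stable softmax over one list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    norm = sum(exps)
    return [e / norm for e in exps]


def weighted_cross_entropy(batch_logits, batch_labels, class_weights):
    """Weighted mean of per-example negative log-likelihoods."""
    weighted_sum = 0.0
    weight_total = 0.0
    for logits, label in zip(batch_logits, batch_labels):
        prob = softmax(logits)[label]
        weight = class_weights[label]
        weighted_sum += -weight * math.log(prob)
        weight_total += weight
    return weighted_sum / weight_total if weight_total else 0.0


# Hypothetical toy data: three posts of confusion type 0, one of type 1.
toy_labels = [0, 0, 0, 1]
toy_weights = inverse_frequency_weights(toy_labels, num_classes=2)
toy_logits = [[2.0, 0.5], [1.5, 0.2], [0.3, 0.9], [0.1, 1.2]]
toy_loss = weighted_cross_entropy(toy_logits, toy_labels, toy_weights)
```

In this sketch the weights are computed once from label counts and stay fixed, whereas the description states that the proposed loss adapts the per-type weights during training. The fixed-weight version serves only as a baseline for comparison.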