快速检索:      
引用本文:
【打印本页】   【下载PDF全文】   查看/发表评论  【EndNote】   【RefMan】   【BibTex】
←前一篇|后一篇→ 过刊浏览    高级检索
本文已被:浏览 14次   下载 4 本文二维码信息
码上扫一扫!
分享到: 微信 更多
基于局部全局特征提取的Res-Swin学生课堂行为图像分类
夏峥昱, 郭畅
上海师范大学 信息与机电工程学院, 上海 201418
摘要:
本文提出一种Res-Swin模型,通过残差网络(ResNet)模块提取课堂行为图片的局部信息特征,提升模型对图像的计算速度.通过Swin-Transformer进一步分析局部特征提取后的图像全局信息,通过窗口移动分析相邻patch的空间特征关系,再通过合并窗口下采样增大感受野,达到提高图像分类准确度的目的.基于公开数据源的课堂行为图像库,将Res-Swin模型与其他基线进行了比较,实验结果验证了该模型在处理课堂行为图像分类问题时兼顾了准确度与计算速度.
关键词:  课堂行为分类  局部与全局特征  计算速度  准确度
DOI:10.20192/j.cnki.JSHNU(NS).2025.02.016
分类号:TP301
基金项目:国家自然科学基金(62201350)
Res-Swin model for student classroom behaviour image classification based on local and global feature extraction
XIA Zhengyu, GUO Chang
College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 201418, China
Abstract:
A Res-Swin model was proposed in this paper to extracte the local information features of the classroom behaviour images through the residual network (ResNet) module, which improved the computational speed of the model on the images. The image vectors with local information extracted were further analyzed by Swin-Transformer. To analyze their global features and spatial features of adjacent patches, the patches were analyzed by window shifting. Furtherly, the receptive field was increased by merging the window to downsample, which could improve the accuracy of image classification. Based on the classroom behaviour image of public data sources, the Res-Swin model was compared with other baselines. The experimental results verified that the proposed Res-Swin model could reach a balance between the accuracy and computational speed in dealing with classroom behaviour image classification problem.
Key words:  class behaviour classification  local and global feature  computational speed  accuracy