论文标题
使用自然语言处理对18年的PERC程序的主题分析
Thematic Analysis of 18 Years of PERC Proceedings using Natural Language Processing
论文作者
论文摘要
We have used an unsupervised machine learning method called Latent Dirichlet Allocation (LDA) to thematically analyze all papers published in the Physics Education Research Conference Proceedings between 2001 and 2018. By looking at co-occurrences of words across the data corpus, this technique has allowed us to identify ten distinct themes or "topics" that have seen varying levels of prevalence in Physics Education Research (PER) over time and to rate the distribution of these topics within each 纸。我们的分析表明,尽管所有确定的主题随着时间的流逝都持续了兴趣,但对某些主题的兴趣增加了几率,首先是对学生理解的定性,理论建设研究的最初兴趣,这使2010年代后期的重点是解决问题的关注。自2010年以来,该领域已经向更社会文化的教学观点转变,特别关注实践,学生身份和机构变革的社区。基于这些结果,我们建议无监督的文本分析技术(如LDA)可能有望提供对教育研究文献的定量,独立和可复制分析的希望。
We have used an unsupervised machine learning method called Latent Dirichlet Allocation (LDA) to thematically analyze all papers published in the Physics Education Research Conference Proceedings between 2001 and 2018. By looking at co-occurrences of words across the data corpus, this technique has allowed us to identify ten distinct themes or "topics" that have seen varying levels of prevalence in Physics Education Research (PER) over time and to rate the distribution of these topics within each paper. Our analysis suggests that although all identified topics have seen sustained interest over time, PER has also seen several waves of increased interest in certain topics, beginning with initial interest in qualitative, theory-building studies of student understanding, which has given way to a focus on problem solving in the late 2010s. Since 2010 the field has seen a shift towards more sociocultural views of teaching and learning with a particular focus on communities of practice, student identities, and institutional change. Based on these results, we suggest that unsupervised text analysis techniques like LDA may hold promise for providing quantitative, independent, and replicable analyses of educational research literature.