论文标题
建立单个编码样式的隐式矢量表示
Building Implicit Vector Representations of Individual Coding Style
论文作者
论文摘要
为了促进团队合作,我们提出了一种新的方法,通过捕获个人贡献样式或编码方式来构建个人开发人员的向量表示。这样的表示可以在下一代软件开发团队协作工具中找到使用,例如,通过启用工具来跟踪团队中的知识转移。我们方法的关键思想是避免使用编码样式的明确定义的指标,而是通过训练模型来构建对作者身份识别的模型,并从训练有素的模型中提取单个开发人员的表示。通过经验评估我们方法的输出,我们发现隐式构建的个别表示反映了团队结构的某些属性:报告彼此学习的开发人员彼此之间更加亲密。
With the goal of facilitating team collaboration, we propose a new approach to building vector representations of individual developers by capturing their individual contribution style, or coding style. Such representations can find use in the next generation of software development team collaboration tools, for example by enabling the tools to track knowledge transfer in teams. The key idea of our approach is to avoid using explicitly defined metrics of coding style and instead build the representations through training a model for authorship recognition and extracting the representations of individual developers from the trained model. By empirically evaluating the output of our approach, we find that implicitly built individual representations reflect some properties of team structure: developers who report learning from each other are represented closer to each other.