论文标题
人为评估的AI:专业评估者需要什么?
AI for human assessment: What do professional assessors need?
论文作者
论文摘要
最近的组织已开始采用基于AI的决策支持工具来优化人力资源开发实践,同时面临在高度上下文和敏感领域中使用AIS的各种挑战。我们介绍了我们的案例研究,旨在帮助专业评估者在人类评估中做出决定,他们在其中对被评估者进行访谈并评估其对某些工作角色的适用性。我们与两个工业评估师的研讨会阐明了他们面临的麻烦(即,对被评估者行为的稳定和非主体观察)和AI系统的派生要求(即,以可解释的方式从面试视频中提取非语言提示)。作为响应,我们使用了多模式的行为特征,例如面部关键点,身体和头部姿势和凝视,采用了无监督的异常检测算法。该算法根据行为特征从视频中提取异常场景,并告知哪些功能有助于异常值。我们首先评估了评估人员如何看待提取的提示,并发现该算法可用于提出有关评估者的关注的场景,这要归功于其可解释性。然后,我们开发了一个结合算法的接口原型,并让六个评估者将其用于实际评估。他们的评论揭示了引入无监督的异常检测的有效性,以增强评估的信心和客观性以及这种基于AI的系统在人类评估中的潜在使用方案。我们的方法以分离人类合作中的观察和解释为基础,它将促进在高度背景领域(例如人类评估)中的人类决策,同时保持其对系统的信任。
Recent organizations have started to adopt AI-based decision support tools to optimize human resource development practices, while facing various challenges of using AIs in highly contextual and sensitive domains. We present our case study that aims to help professional assessors make decisions in human assessment, in which they conduct interviews with assessees and evaluate their suitability for certain job roles. Our workshop with two industrial assessors elucidated troubles they face (i.e., maintaining stable and non-subjective observation of assessees' behaviors) and derived requirements of AI systems (i.e., extracting their nonverbal cues from interview videos in an interpretable manner). In response, we employed an unsupervised anomaly detection algorithm using multimodal behavioral features such as facial keypoints, body and head pose, and gaze. The algorithm extracts outlier scenes from the video based on behavioral features as well as informing which feature contributes to the outlierness. We first evaluated how the assessors would perceive the extracted cues and discovered that the algorithm is useful in suggesting scenes to which assessors would pay attention, thanks to its interpretability. Then, we developed an interface prototype incorporating the algorithm and had six assessors use it for their actual assessment. Their comments revealed the effectiveness of introducing unsupervised anomaly detection to enhance their feeling of confidence and objectivity of the assessment along with potential use scenarios of such AI-based systems in human assessment. Our approach, which builds on top of the idea of separating observation and interpretation in human-AI collaboration, will facilitate human decision making in highly contextual domains, such as human assessment, while keeping their trust in the system.