Paper title
Towards Optimizing OCR for Accessibility
Paper authors
Paper abstract
Visual cues such as structure, emphasis, and icons play an important role in efficient information foraging by sighted individuals and make for a pleasurable reading experience. Blind, low-vision, and other print-disabled individuals miss out on these cues because current OCR and text-to-speech software ignores them, resulting in a tedious reading experience. We identify four semantic goals for an enjoyable listening experience, and identify syntactic visual cues that help make progress towards these goals. Empirically, we find that preserving even one or two visual cues in aural form significantly enhances the experience of listening to print content.