论文标题

剪辑如何理解纹理?

How well does CLIP understand texture?

论文作者

Wu, Chenyun, Maji, Subhransu

论文摘要

我们研究了夹理学对自然语言描述的自然图像中纹理的理解。为此,我们分析了剪辑的能力:(1)对各种纹理和材料分类数据集执行零拍学习; (2)表示纹理的组成属性,例如详细描述纹理(DTDD)数据集上的红点或黄色条纹; (3)用颜色和身体部位的质地描述的照片中的鸟类对鸟类的细粒度分类。

We investigate how well CLIP understands texture in natural images described by natural language. To this end, we analyze CLIP's ability to: (1) perform zero-shot learning on various texture and material classification datasets; (2) represent compositional properties of texture such as red dots or yellow stripes on the Describable Texture in Detail(DTDD) dataset; and (3) aid fine-grained categorization of birds in photographs described by color and texture of their body parts.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源