泰米尔元音识别具有增强的类似MNIST的数据集

论文标题

泰米尔元音识别具有增强的类似MNIST的数据集

Tamil Vowel Recognition With Augmented MNIST-like Data Set

论文作者

Annamalai, Muthiah

论文摘要

我们报告了泰米尔语元音的MNIST [4]兼容数据集[1]，以使泰米尔OCR/手写应用程序的分类DNN或其他此类ML/AI深度学习[2]模型。我们报告了60,000个灰度，28x28像素数据集的能力，以建立92％的精度（训练）和82％的交叉验证4层CNN，具有100,000多个参数，在张力流中具有100,000多个参数。对于同一网络，我们还报告了手写元音的前1个分类精度为70％，TOP-2分类精度为92％。

We report generation of a MNIST [4] compatible data set [1] for Tamil vowels to enable building a classification DNN or other such ML/AI deep learning [2] models for Tamil OCR/Handwriting applications. We report the capability of the 60,000 grayscale, 28x28 pixel dataset to build a 92% accuracy (training) and 82% cross-validation 4-layer CNN, with 100,000+ parameters, in TensorFlow. We also report a top-1 classification accuracy of 70% and top-2 classification accuracy of 92% on handwritten vowels showing, for the same network.

下载PDF全文

下载文献需遵守相关版权规定

论文标题