UKARA 1.0挑战赛1：印度尼西亚巴哈萨的自动短暂回答得分

论文标题

UKARA 1.0挑战赛1：印度尼西亚巴哈萨的自动短暂回答得分

UKARA 1.0 Challenge Track 1: Automatic Short-Answer Scoring in Bahasa Indonesia

论文作者

Septiandri, Ali Akbar, Winatmoko, Yosef Ardhito

论文摘要

我们描述了我们针对自动论文评分的UKARA 1.0挑战的第三名解决方案。该任务包括两个数据集上的二进制分类问题|来自两个不同问题的答案。我们最终对两个数据集使用了两个不同的模型。对于任务A，我们在使用带有潜在语义分析（LSA）的Unigram提取的特征上应用了随机森林算法。另一方面，对于任务B，我们仅在TF-IDF功能上使用逻辑回归。我们的模型导致F1得分为0.812。

We describe our third-place solution to the UKARA 1.0 challenge on automated essay scoring. The task consists of a binary classification problem on two datasets | answers from two different questions. We ended up using two different models for the two datasets. For task A, we applied a random forest algorithm on features extracted using unigram with latent semantic analysis (LSA). On the other hand, for task B, we only used logistic regression on TF-IDF features. Our model results in F1 score of 0.812.

下载PDF全文

下载文献需遵守相关版权规定

论文标题