论文标题
在存在噪声的情况下分析盎格鲁因算法的鲁棒性
Analyzing Robustness of Angluin's L* Algorithm in Presence of Noise
论文作者
论文摘要
Angluin的L*算法使用会员资格和等价查询了解了常规语言的最低(完整)确定性有限自动机(DFA)。它的概率近似正确(PAC)版本用足够大的随机会员查询替换了等效查询,以使答案具有很高的信心。因此,它可以应用于任何类型的(也是非规范)设备,可以将其视为合成自动机的算法,该算法根据观测值抽象该设备的行为。 在这里,我们对Angluin的PAC学习算法的表现感兴趣,这些设备是通过引入一些噪声从DFA获得的设备。更确切地说,我们研究盎格鲁因算法是否会降低噪声并产生与原始设备更接近原始设备的DFA。 我们提出了几种介绍噪声的方法:(1)嘈杂的设备将单词的分类W.R.T.倒置。 (2)嘈杂的设备以较小的概率修饰单词的字母在询问其分类W.R.T. DFA和(3)嘈杂的设备结合了单词W.R.T.的分类。 DFA及其分类W.R.T.柜台自动机。 我们的实验是在数百个DFA上进行的。 直言不讳地表明,我们的主要贡献表明:(1)每当随机过程产生嘈杂的设备时,Angluin的算法的表现都很好,(2)但结构化的噪声却很差,并且(3)几乎肯定的随机性收益率具有非概念性的概述的语言。
Angluin's L* algorithm learns the minimal (complete) deterministic finite automaton (DFA) of a regular language using membership and equivalence queries. Its probabilistic approximatively correct (PAC) version substitutes an equivalence query by a large enough set of random membership queries to get a high level confidence to the answer. Thus it can be applied to any kind of (also non-regular) device and may be viewed as an algorithm for synthesizing an automaton abstracting the behavior of the device based on observations. Here we are interested on how Angluin's PAC learning algorithm behaves for devices which are obtained from a DFA by introducing some noise. More precisely we study whether Angluin's algorithm reduces the noise and produces a DFA closer to the original one than the noisy device. We propose several ways to introduce the noise: (1) the noisy device inverts the classification of words w.r.t. the DFA with a small probability, (2) the noisy device modifies with a small probability the letters of the word before asking its classification w.r.t. the DFA, and (3) the noisy device combines the classification of a word w.r.t. the DFA and its classification w.r.t. a counter automaton. Our experiments were performed on several hundred DFAs. Our main contributions, bluntly stated, consist in showing that: (1) Angluin's algorithm behaves well whenever the noisy device is produced by a random process, (2) but poorly with a structured noise, and, that (3) almost surely randomness yields systems with non-recursively enumerable languages.