Language identification in the limit


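Language identification in the limit is a formal model for inductive inference. It was introduced by E. Mark Gold in his paper of the same title [1]. In this model, a learner is presented with an enumeration of the strings of some language from a given class and, after each string, outputs a guess in the form of an index for a language in the class. The learner identifies the language in the limit if its guesses converge to a correct index, that is, if it makes only finitely many incorrect guesses before settling on a correct one.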


Learnability

This model is the first known attempt to formally capture the notion of learnability. Another learnability model is the probably approximately correct (PAC) learning model.
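
Gold's model can be simulated on a finite prefix of an enumeration. The following is a minimal sketch, not taken from Gold's paper: the harness `run_learner`, the toy class $L_k = \{a^j : 1 \le j \le k\}$ and the learner `longest_string_learner` are all hypothetical names chosen for illustration. The learner guesses the index of the language after every string it sees.

```python
from typing import Callable, List, Sequence


def run_learner(learner: Callable[[Sequence[str]], int], text: Sequence[str]) -> List[int]:
    """Feed the text one string at a time and record the guess made after each string."""
    seen: List[str] = []
    guesses: List[int] = []
    for s in text:
        seen.append(s)
        guesses.append(learner(seen))
    return guesses


def longest_string_learner(seen: Sequence[str]) -> int:
    """Hypothetical learner for the toy class L_k = {"a"*j : 1 <= j <= k}.

    It guesses the index k of the smallest language containing every string seen so far.
    """
    return max(len(s) for s in seen)


# Finite prefix of one possible enumeration of L_3 (repetitions are allowed).
text = ["a", "aaa", "aa", "aaa", "a", "aaa"]
guesses = run_learner(longest_string_learner, text)
print(guesses)  # [1, 3, 3, 3, 3, 3]
```

On this text the learner makes one incorrect guess and then stays on index 3. A finite simulation can only illustrate convergence; identification in the limit is a statement about every enumeration of every language in the class.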

Learnability characterization

Dana Angluin characterized learnability in the limit in her paper [2].

A tell-tale of a language L in a class is a finite subset T of L such that no language in the class both contains T and is a proper subset of L. If the learner is required to be effective, then an indexed class of recursive languages is learnable in the limit if and only if there is an effective procedure that uniformly enumerates a tell-tale for each language in the class (Condition 1). If an ideal learner (i.e., an arbitrary function) is allowed, then an indexed class of languages is learnable in the limit if and only if each language in the class has a tell-tale (Condition 2).
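
The role of tell-tales can be illustrated with a small sketch. It is a simplification rather than Angluin's actual procedure: the class is finite and each tell-tale is given explicitly as a finite set, whereas Condition 1 only requires that tell-tales can be enumerated. The class `CLASS` and the function names are hypothetical. The learner outputs the least index i whose tell-tale is contained in the sample and whose language contains the sample.

```python
from typing import Dict, FrozenSet, Optional, Sequence, Tuple

# Hypothetical indexed class: index -> (language, tell-tale).
# The largest language is deliberately listed first.
CLASS: Dict[int, Tuple[FrozenSet[str], FrozenSet[str]]] = {
    1: (frozenset({"a", "b", "c"}), frozenset({"c"})),
    2: (frozenset({"a"}), frozenset({"a"})),
    3: (frozenset({"a", "b"}), frozenset({"b"})),
}


def is_tell_tale(i: int) -> bool:
    """Check that T_i is a subset of L_i and no language L' satisfies T_i <= L' < L_i."""
    lang, tell_tale = CLASS[i]
    return tell_tale <= lang and not any(
        tell_tale <= other < lang for other, _ in CLASS.values()
    )


def tell_tale_learner(seen: Sequence[str]) -> Optional[int]:
    """Guess the least i with T_i <= sample <= L_i, or None if no index qualifies."""
    sample = frozenset(seen)
    for i in sorted(CLASS):
        lang, tell_tale = CLASS[i]
        if tell_tale <= sample <= lang:
            return i
    return None


text = ["a", "b", "c"]  # an enumeration of L_1
print([tell_tale_learner(text[:n]) for n in range(1, len(text) + 1)])  # [2, 3, 1]
print(tell_tale_learner(["a", "a", "a"]))  # 2
```

A learner that simply returned the first language consistent with the sample would guess index 1 forever on the text a, a, a, ... for L_2. Waiting for the tell-tale to appear before guessing a larger language is what prevents this overgeneralization.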

Language classes learnable in the limit

Finite cardinality languages
Pattern languages
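
For the class of finite languages listed above, a standard learner simply conjectures the set of strings seen so far. The sketch below is illustrative only: it returns the conjectured language itself rather than an index, and the target language and text are made up.

```python
from typing import FrozenSet, Sequence


def finite_language_learner(seen: Sequence[str]) -> FrozenSet[str]:
    """Conjecture exactly the set of strings seen so far."""
    return frozenset(seen)


target = {"ab", "ba", "abba"}                    # hypothetical finite target language
text = ["ab", "ba", "ab", "abba", "ba", "ab"]    # prefix of an enumeration of the target

guesses = [finite_language_learner(text[:n]) for n in range(1, len(text) + 1)]
# First point at which the conjecture equals the target language.
converged_at = next(n for n, g in enumerate(guesses, start=1) if g == target)
print(converged_at)  # 4
```

Once every string of the finite target has appeared, the conjecture is correct and never changes again, because later strings are repetitions.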

Language classes not learnable in the limit

Regular languages (when the learner is given only positive examples)

Sufficient conditions for learnability

Condition 1 in Angluin's paper is not always easy to verify. Therefore, various sufficient conditions for the learnability of a language class have been proposed.

Finite thickness

A class of languages has finite thickness if, for every string s, only finitely many languages in the class are consistent with s (that is, contain s). This is exactly Condition 3 in Angluin's paper. Angluin showed that if a class of recursive languages has finite thickness, then it is learnable in the limit.
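
As an illustration of finite thickness, consider a hypothetical class over positive integers (standing in for strings) in which L_k is the set of positive multiples of k. A number n belongs to L_k exactly when k divides n, so only finitely many languages contain n. The sketch below, which is an example and not Angluin's general construction, lists those languages and uses a learner that picks the smallest candidate language consistent with the sample, which here is the one indexed by the greatest common divisor.

```python
from functools import reduce
from math import gcd
from typing import List, Sequence


def languages_containing(n: int) -> List[int]:
    """Indices k such that n is in L_k = {k, 2k, 3k, ...}; these are the divisors of n."""
    return [k for k in range(1, n + 1) if n % k == 0]


def gcd_learner(seen: Sequence[int]) -> int:
    """Guess the index of the smallest language L_k that contains the whole sample."""
    return reduce(gcd, seen)


print(languages_containing(12))  # [1, 2, 3, 4, 6, 12]

text = [24, 36, 60, 18, 6]  # prefix of an enumeration of L_6
print([gcd_learner(text[:n]) for n in range(1, len(text) + 1)])  # [24, 12, 12, 6, 6]
```

Because each sample rules out all but finitely many languages, the learner only ever has to choose among a finite set of candidates, and its guess can change only finitely often.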

A class with finite thickness certainly satisfies the MEF-condition and the MFF-condition; in other words, finite thickness implies M-finite thickness.

Finite elasticity

A class of languages is said to have finite elasticity if for every infinite sequence of strings $s_0, s_1, \ldots$ and every infinite sequence of languages in the class $L_1, L_2, \ldots$ , there exists a finite number n such that $s_n \not\in L_n$ implies $L_n$ is inconsistent with $\{s_0, \ldots, s_{n-1}\}$ , that is, $\{s_0, \ldots, s_{n-1}\} \not\subseteq L_n$ . [3]

It has been shown that a class of recursively enumerable languages is learnable in the limit if it has finite elasticity.
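
The negation of finite elasticity (infinite elasticity) asks for sequences in which, for every n, the language L_n contains s_0, ..., s_{n-1} but not s_n. The following sketch checks this condition on a finite prefix; it cannot prove infinite elasticity by itself. The function name and the example class of initial segments L_n = {0, ..., n-1}, with integers standing in for strings, are assumptions made for illustration.

```python
from typing import List, Sequence, Set


def is_elasticity_witness_prefix(strings: Sequence[int], languages: Sequence[Set[int]]) -> bool:
    """Check, for n = 1..N, that {s_0, ..., s_{n-1}} is a subset of L_n and s_n is not in L_n.

    strings   = [s_0, ..., s_N]
    languages = [L_1, ..., L_N]
    """
    for n, lang in enumerate(languages, start=1):
        if not set(strings[:n]) <= lang or strings[n] in lang:
            return False
    return True


N = 5
strings: List[int] = list(range(N + 1))                      # s_n = n
languages: List[Set[int]] = [set(range(n)) for n in range(1, N + 1)]  # L_n = {0, ..., n-1}

print(is_elasticity_witness_prefix(strings, languages))   # True
print(is_elasticity_witness_prefix([0, 1], [{0, 1}]))      # False
```

The same pattern works for every N, so the class of initial segments has infinite elasticity. That class is nevertheless learnable in the limit (guess the index one greater than the largest number seen), which illustrates that finite elasticity is a sufficient condition and not a necessary one.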

Mind change bound

Other concepts

Infinite cross property

A language L has infinite cross property within a class of languages $\mathcal{L}$ if there is an infinite sequence $L_1, L_2, \ldots$ of distinct languages in $\mathcal{L}$ and a sequence of finite sets $T_1, T_2, \ldots$ such that:

$T_1 \subset T_2 \subset \cdots$ ,
$T_i \subseteq L_i$ ,
$T_{i+1} \not\subseteq L_i$ , and
$\lim_{i\to\infty} T_i = L$ (that is, $\bigcup_i T_i = L$).

Note that L is not necessarily a member of the class of languages.

It is not hard to see that if there is a language with infinite cross property within a class of languages, then that class of languages has infinite elasticity.
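
One way to see this implication is to build the sequence of strings directly from the sets $T_i$. The sketch below assumes the conditions are read as $T_i \subseteq L_i$ and $T_{i+1} \not\subseteq L_i$, and that $T_1$ is nonempty. It picks s_0 from T_1 and each s_n from T_{n+1} minus L_n, so that s_0, ..., s_{n-1} all lie in T_n, hence in L_n, while s_n does not. The function name and the example class are hypothetical, and only a finite prefix is constructed.

```python
from typing import List, Sequence, Set


def elasticity_from_cross(tell_sets: Sequence[Set[int]], languages: Sequence[Set[int]]) -> List[int]:
    """Build s_0, ..., s_N from T_1, ..., T_{N+1} and L_1, ..., L_N.

    tell_sets[n] holds T_{n+1} and languages[n - 1] holds L_n (0-based lists).
    """
    strings = [min(tell_sets[0])]  # s_0: any element of T_1 (assumed nonempty)
    for n in range(1, len(languages) + 1):
        # T_{n+1} is not a subset of L_n, so this difference is nonempty.
        strings.append(min(tell_sets[n] - languages[n - 1]))
    return strings


N = 4
languages = [set(range(i + 1)) for i in range(1, N + 1)]   # L_i = {0, ..., i}
tell_sets = [set(range(i + 1)) for i in range(1, N + 2)]   # T_i = {0, ..., i}

strings = elasticity_from_cross(tell_sets, languages)
print(strings)  # [0, 2, 3, 4, 5]

# Verify the infinite-elasticity pattern on this prefix.
ok = all(
    set(strings[:n]) <= languages[n - 1] and strings[n] not in languages[n - 1]
    for n in range(1, N + 1)
)
print(ok)  # True
```

In this example the limit language L is the set of all natural numbers, which is not itself a member of the class.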

Relations between concepts

Finite thickness implies finite elasticity; the converse is not true.
Finite elasticity together with conservative learnability implies the existence of a mind change bound. [4]
Finite elasticity together with M-finite thickness implies the existence of a mind change bound. However, M-finite thickness alone does not imply the existence of a mind change bound, nor does the existence of a mind change bound imply M-finite thickness. [5]
Existence of a mind change bound implies learnability; the converse is not true.
If we allow for noncomputable learners, then finite elasticity implies the existence of a mind change bound; the converse is not true.
If there is no accumulation order for a class of languages, then there is a language (not necessarily in the class) that has infinite cross property within the class, which in turn implies infinite elasticity of the class.

Open questions

If a countable class of recursive languages has a mind change bound for noncomputable learners, does the class also have a mind change bound for computable learners, or is the class unlearnable by a computable learner?