ID3 algorithm
From Wikipedia, the free encyclopedia
ID3 (Iterative Dichotomiser 3) is an algorithm used to generate a decision tree.
The algorithm is based on Occam's razor: it prefers smaller decision trees (simpler theories) over larger ones. However, it does not always produce the smallest tree, and is therefore a heuristic. Occam's razor is formalized using the concept of information entropy:
The ID3 algorithm can be summarized as follows:
- Take all unused attributes and count their entropy concerning test samples
- Choose attribute for which entropy is minimum
- Make node containing that attribute
An explanation of the implementation of ID3 can be found at C4.5 algorithm, which is an extended version of ID3.
Contents |
[edit] Algorithm
The actual algorithm is as follows:
ID3 (Examples, Target_Attribute, Attributes)
- Create a root node for the tree
- If all examples are positive, Return the single-node tree Root, with label = +.
- If all examples are negative, Return the single-node tree Root, with label = -.
- If number of predicting attributes is empty, then Return the single node tree Root, with label = most common value of the target attribute in the examples.
- Otherwise Begin
- A = The Attribute that best classifies examples.
- Decision Tree attribute for Root = A.
- For each possible value, vi, of A,
- Add a new tree branch below Root, corresponding to the test A = vi.
- Let Examples(vi), be the subset of examples that have the value vi for A
- If Examples(vi) is empty
- Then below this new branch add a leaf node with label = most common target value in the examples
- Else below this new branch add the subtree ID3 (Examples(vi), Target_Attribute, Attributes – {A})
- End
- Return Root
[edit] External links
- Seminars - http://www2.cs.uregina.ca/
- Description and examples - http://www.cise.ufl.edu/
- Description and examples - http://www.cis.temple.edu/
- An implementation of ID3 in Python
- Implementation of ID3 algorithm in C# - http://www.codeproject.com/cs/algorithms/id3.asp
- Decision tree Perl Module - An implementation of ID3 in Perlatomic resolution.
[edit] See also
[edit] References
- Mitchell, Tom M. Machine Learning. McGraw-Hill, 1997.