AIML

Artificial Intelligence Markup Language
Filename extension .aiml
Developed by Dr. Richard S. Wallace
Initial release July 16, 2001 [1]
Latest release
1.0.1 (rev 008)
(October 30, 2011 [2])
Type of format Artificial intelligence
Extended from XML
Open format? Yes
Website http://www.alicebot.org/aiml.html

AIML, or Artificial Intelligence Markup Language, is an XML dialect for creating natural language software agents.

History

The XML dialect called AIML was developed by Richard Wallace and a worldwide free software community between 1995 and 2002. AIML formed the basis for what was initially a highly extended Eliza called "A.L.I.C.E." ("Artificial Linguistic Internet Computer Entity"), which won the annual Loebner Prize Competition in Artificial Intelligence[3] three times, and was also the Chatterbox Challenge[4] Champion in 2004.

Because the A.L.I.C.E. AIML set was released under the GNU GPL, and because most AIML interpreters are offered under a free or open source license, many "Alicebot clones" have been created based upon the original implementation of the program and its AIML knowledge base. Free AIML sets[5] in several languages have been developed and made available by the user community. There are AIML interpreters available in Java, Ruby, Python, C++, C#, Pascal, and other languages (see below). A semi-formal specification[2] and a W3C XML Schema for AIML[6] are available.

Since early 2013, The A.L.I.C.E foundation have been working on a draft specification for AIML 2.0. [7]

Elements of AIML

AIML contains several elements. The most important of these are described in further detail below.

Categories

Categories in AIML are the fundamental unit of knowledge. A category consists of at least two further elements: the pattern and template elements. Here is a simple category:

<category>
  <pattern>WHAT IS YOUR NAME</pattern>
  <template>My name is John.</template>
</category>

When this category is loaded, an AIML bot will respond to the input "What is your name" with the response "My name is John."

Patterns

A pattern is a string of characters intended to match one or more user inputs. A literal pattern like

 WHAT IS YOUR NAME

will match only one input, ignoring case: "what is your name". But patterns may also contain wildcards, which match one or more words. A pattern like

 WHAT IS YOUR *

will match an infinite number of inputs, including "what is your name", "what is your shoe size", "what is your purpose in life", etc.

The AIML pattern syntax is a very simple pattern language, substantially less complex as regular expressions and as such less than level 3 in the Chomsky hierarchy. To compensate for the simple pattern matching capabilities, AIML interpreters can provide preprocessing functions to expand abbreviations, remove misspellings, etc.

The AIML syntax itself is at least as complex as finite state machines and as such at least of level 3 in the Chomsky hierarchy. This is because a state correlates to one topic. To implement that behavior, the topic should have a "*" Pattern to make sure, that the state is not left accidentally. A state transit is implemented with the <think><set name="topic">state2</set></think> Tag. This way, the bot will be able to "remember" the topic talked about or even user privileges, which are gained during the chat.

Templates

A template specifies the response to a matched pattern. A template may be as simple as some literal text, like

  My name is John.

A template may use variables, such as the example

 My name is <bot name="name"/>.

which will substitute the bot's name into the sentence, or

 You told me you are <get name="user-age"/> years old.

which will substitute the user's age (if known) into the sentence.

Template elements include basic text formatting, conditional response (if-then/else), and random responses.

Templates may also redirect to other patterns, using an element called srai. This can be used to implement synonymy, as in this example (where CDATA is used to avoid the need for XML escaping):

<category>
  <pattern>WHAT IS YOUR NAME</pattern>
  <template><![CDATA[My name is <bot name="name"/>.]]></template>
</category>
<category>
  <pattern>WHAT ARE YOU CALLED</pattern>
  <template>
    <srai>what is your name</srai>
  </template>
</category>

The first category simply answers an input "what is your name" with a statement of the bot's name. The second category, however, says that the input "what are you called" should be redirected to the category that matches the input "what is your name"--in other words, it is saying that the two phrases are equivalent.

Templates can contain other types of content, which may be processed by whatever user interface the bot is talking through. So, for example, a template may use HTML tags for formatting, which can be ignored by clients that don't support HTML.

Criticism

AIML is considered clever and simple, and a good start for beginners writing simple bots. However, it is difficult to maintain and author more complex bots, and has weak pattern matching ability.

References