Term (logic)

In analogy to natural language, where a noun phrase refers to an object and a whole sentence refers to a fact, in mathematical logic, a term denotes a mathematical object and a formula denotes a mathematical fact. In particular, terms appear as components of a formula.

A first-order term is recursively constructed from constant symbols, variables and function symbols. An expression formed by applying a predicate symbol to an appropriate number of terms is called an atomic formula, which evaluates to true or false in bivalent logics, given an interpretation. For example, (x+1)*(x+1) is a term built from the constant 1, the variable x, and the binary function symbols + and *; it is part of the atomic formula (x+1)*(x+1) ≥ 0 which evaluates to true for each real-numbered value of x.

Besides logic, terms play important roles in universal algebra and rewriting systems.

Elementary mathematics

In elementary mathematics,[1] and in particular in the context of polynomials, term is sometimes used for a monomial with a coefficient: to 'collect like terms' in a polynomial is the operation of making it a linear combination of distinct monomials. Terms, in this sense, are things that are added or subtracted. A series is often represented as the sum of a sequence of terms. Individual factors in an expression representing a product are multiplicative terms. For example, in 6 + 3x − 2, 6, 3x, and 2 are all terms.

Formal definition

Tree structure of terms (n⋅(n+1))/2 (left) and n⋅((n+1)/2) (right)

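Given a set V of variable symbols, a set C of constant symbols, and sets F_n of n-ary function symbols (also called operator symbols) for each natural number n ≥ 1, the set of (unsorted first-order) terms T is recursively defined to be the smallest set with the following properties:[2]

  1. every variable symbol is a term: V ⊆ T,
  2. every constant symbol is a term: C ⊆ T,
  3. from every n terms t_1, ..., t_n and every n-ary function symbol f ∈ F_n, a larger term f(t_1, ..., t_n) can be built.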

Using an intuitive, pseudo-grammatical notation, this is sometimes written as: t ::= x | c | f(t_1, ..., t_n). Usually, only the first few function symbol sets F_n are inhabited. Well-known examples are the unary function symbols sin, cos ∈ F_1 and the binary function symbols +, −, ⋅, / ∈ F_2, while ternary operations are less well known, let alone higher-arity functions. Many authors consider constant symbols as 0-ary function symbols in F_0, thus needing no special syntactic class for them.

A term denotes a mathematical object from the domain of discourse. A constant c denotes a named object from that domain, a variable x ranges over the objects in that domain, and an n-ary function f maps n-tuples of objects to objects. For example, if n ∈ V is a variable symbol, 1 ∈ C is a constant symbol, and add ∈ F_2 is a binary function symbol, then n ∈ T, 1 ∈ T, and (hence) add(n, 1) ∈ T by the first, second, and third term building rule, respectively. The latter term is usually written as n+1, using infix notation and the more common operator symbol + for convenience.

Term structure vs. representation

Originally, logicians defined a term to be a character string adhering to certain building rules.[3] However, since the concept of a tree became popular in computer science, it turned out to be more convenient to think of a term as a tree. For example, several distinct character strings, like "(n⋅(n+1))/2", "((n⋅(n+1)))/2", and "\frac{n(n+1)}{2}", denote the same term and correspond to the same tree, viz. the left tree in the above picture. Once the tree structure of a term is separated from its graphical representation on paper, it is also easy to account for parentheses (being only representation, not structure) and invisible multiplication operators (existing only in structure, not in representation).

Structural equality

Two terms are said to be structurally, literally, or syntactically equal if they correspond to the same tree. For example, the left and the right tree in the above picture are structurally unequal terms, although they might be considered "semantically equal" as they always evaluate to the same value in rational arithmetic. While structural equality can be checked without any knowledge about the meaning of the symbols, semantic equality cannot. If the function / is e.g. interpreted not as rational but as truncating integer division, then at n=2 the left and right term evaluate to 3 and 2, respectively. Structurally equal terms need to agree in their variable names.

In contrast, a term t is called a renaming, or a variant, of a term u if the latter resulted from consistently renaming all variables of the former, i.e. if u = tσ for some renaming substitution σ. In that case, u is a renaming of t, too, since a renaming substitution σ has an inverse σ^{-1}, and t = uσ^{-1}. Both terms are then also said to be equal modulo renaming. In many contexts, the particular variable names in a term don't matter, e.g. the commutativity axiom for addition can be stated as x+y=y+x or as a+b=b+a; in such cases the whole term may be replaced by a renamed term, while an arbitrary subterm usually may not, e.g. x+y=b+a is not a valid version of the commutativity axiom. [note 1] [note 2]

Ground and linear terms

The set of variables of a term t is denoted by vars(t). A term that doesn't contain any variables is called a ground term; a term that doesn't contain multiple occurrences of a variable is called a linear term. For example, 2+2 is a ground term and hence also a linear term, x⋅(n+1) is a linear term, n⋅(n+1) is a non-linear term. These properties are important e.g. in term rewriting.

Given a signature for the function symbols, the set of all terms forms the free term algebra. The set of all ground terms forms the initial term algebra.

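Abbreviating the number of constants as f_0 and the number of i-ary function symbols as f_i, the number θ_h of distinct ground terms of height up to h can be computed by the following recursion formula:

  θ_0 = f_0, since a ground term of height 0 can only be a constant,
  θ_{h+1} = \sum_{i=0}^∞ f_i ⋅ θ_h^i, since a ground term of height up to h+1 is obtained by applying an i-ary root function symbol to any i ground terms of height up to h.

The sum is finite whenever there are only finitely many constants and function symbols.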

Building formulas from terms

Given a set R_n of n-ary relation symbols for each natural number n ≥ 1, an (unsorted first-order) atomic formula is obtained by applying an n-ary relation symbol to n terms. As with function symbols, a relation symbol set R_n is usually non-empty only for small n. In mathematical logic, more complex formulas are built from atomic formulas using logical connectives and quantifiers. For example, letting ℝ denote the set of real numbers, ∀x: x ∈ ℝ ⇒ (x+1)⋅(x+1) ≥ 0 is a mathematical formula evaluating to true in the algebra of complex numbers. An atomic formula is called ground if it is built entirely from ground terms; all ground atomic formulas composable from a given set of function and predicate symbols make up the Herbrand base for these symbol sets.

Operations with terms

Tree structure of black example term \frac{a*((a+1)*(a+2))}{1*(2*3)}, with blue redex x*(y*z)

Related concepts

Sorted terms

Main article: Many-sorted logic

When the domain of discourse contains elements of basically different kinds, it is useful to split the set of all terms accordingly. To this end, a sort (sometimes also called type) is assigned to each variable and each constant symbol, and a declaration[note 3] of domain sorts and range sort to each function symbol. A sorted term f(t_1, ..., t_n) may be composed from sorted subterms t_1, ..., t_n only if the i-th subterm's sort matches the declared i-th domain sort of f. Such a term is also called well-sorted; any other term (i.e. obeying the unsorted rules only) is called ill-sorted.

For example, a vector space comes with an associated field of scalar numbers. Let W and N denote the sort of vectors and numbers, respectively, let V_W and V_N be the set of vector and number variables, respectively, and C_W and C_N the set of vector and number constants, respectively. Then e.g. \vec{0} ∈ C_W and 0 ∈ C_N, and the vector addition, the scalar multiplication, and the inner product are declared as +: W×W → W, *: W×N → W, and ⟨.,.⟩: W×W → N, respectively. Assuming variable symbols \vec{v}, \vec{w} ∈ V_W and a, b ∈ V_N, the term \langle (\vec{v}+\vec{0})*a, \vec{w}*b \rangle is well-sorted, while \vec{v}+a is not (since + doesn't accept a term of sort N as its second argument). In order to make a*\vec{v} a well-sorted term, an additional declaration *: N×W → W is required. Function symbols having several declarations are called overloaded.

See many-sorted logic for more information, including extensions of the many-sorted framework described here.

Lambda terms

Terms with bound variables
Notation example               Bound variables   Free variables   Written as lambda-term
lim_{n→∞} x/n                  n                 x                limit(λn. div(x,n))
\sum_{i=1}^n i^2               i                 n                sum(1,n,λi. power(i,2))
\int_a^b \sin(k \cdot t) dt    t                 a, b, k          integral(a,b,λt. sin(k⋅t))

Motivation

Mathematical notations as shown in the table do not fit into the scheme of a first-order term as defined above, as they all introduce their own local, or bound, variable that may not appear outside the notation's scope, e.g. t \cdot \int_a^b \sin(k \cdot t) \; dt doesn't make sense. In contrast, the other variables, referred to as free, behave like ordinary first-order term variables, e.g. k \cdot \int_a^b \sin(k \cdot t) \; dt does make sense.

All these operators can be viewed as taking a function rather than a value term as one of their arguments. For example, the lim operator is applied to a sequence, i.e. to a mapping from positive integers to e.g. real numbers. As another example, a C function to implement the second example from the table, ∑, would have a function pointer argument (see the C program below).

Lambda terms can be used to denote anonymous functions to be supplied as arguments to lim, ∑, ∫, etc.

For example, the function square from the C program below can be written anonymously as a lambda term λi. i^2. The general sum operator ∑ can then be considered as a ternary function symbol taking a lower bound value, an upper bound value and a function to be summed up. Due to its last argument, the ∑ operator is called a second-order function symbol. As another example, the lambda term λn. x/n denotes a function that maps 1, 2, 3, ... to x/1, x/2, x/3, ..., respectively, that is, it denotes the sequence (x/1, x/2, x/3, ...). The lim operator takes such a sequence and returns its limit (if defined).

The rightmost column of the table indicates how each mathematical notation example can be represented by a lambda term, also converting common infix operators into prefix form.

#include <stdio.h>

// implements the general sum operator; fct is the function to be summed up
int sum(int lwb, int upb, int fct(int)) {
    int res = 0;
    for (int i = lwb; i <= upb; ++i)
        res += fct(i);
    return res;
}

// implements the anonymous function (lambda i. i*i); however, C requires a name for it
int square(int i) { return i * i; }

int main(void) {
    int n;
    if (scanf(" %d", &n) != 1)               // reject missing or non-numeric input
        return 1;
    printf("%d\n", sum(1, n, square));       // applies the sum operator to sum up squares
    return 0;
}

Notes

  1. Strictly speaking, x+y=y+x is an atomic formula, not a term, since = is a predicate, not a function symbol. However, since atomic formulas can be viewed as trees, too, and renaming is essentially a concept on trees, atomic (and, more generally, quantifier-free) formulas can be renamed in a similar way as terms. In fact, some authors consider a quantifier-free formula as a term (of type bool rather than e.g. int, cf. #Sorted terms below).
  2. Renaming of the commutativity axiom can be viewed as alpha-conversion on the universal closure of the axiom: "x+y=y+x" actually means "∀x,y: x+y=y+x", which is synonymous to "∀a,b: a+b=b+a"; see also #Lambda terms below.
  3. called "symbol type" in the Signature (logic)#Many-sorted signatures article

References

  1. Schwartzman, Steven (1994). The words of mathematics: An etymological dictionary of mathematical terms used in English. The Mathematical Association of America. p. 219. ISBN 0-88385-511-9.
  2. C.C. Chang; H. Jerome Keisler (1977). Model Theory. Studies in Logic and the Foundations of Mathematics 73. North Holland.; here: Sect.1.3
  3. Hermes, Hans (1973). Introduction to Mathematical Logic. Springer London. ISBN 3540058192. ISSN 1431-4657.; here: Sect.II.1.3