Summation

From Wikipedia, the free encyclopedia

For evaluation of sums in closed form see evaluating sums.
Summation is also a term used to describe a process in synapse biology.

Summation is the addition of a set of numbers; the result is their sum or total. The "numbers" to be summed may be natural numbers, complex numbers, matrices, or still more complicated objects. An infinite sum is a subtle procedure known as a series. Note that the term summation has a special meaning in the context of divergent series related to extrapolation.

Contents

[edit] Notation

The summation of 1, 2, and 4 is 1 + 2 + 4 = 7. The sum is 7. Since addition is associative, it does not matter whether we interpret "1 + 2 + 4" as (1 + 2) + 4 or as 1 + (2 + 4); the result is the same, so parentheses are usually omitted in a sum. Finite addition is also commutative, so the order in which the numbers are written does not affect its sum. (For issues with infinite summation, see absolute convergence.)

If a sum has too many terms to be written out individually, the sum may be written with an ellipsis to mark out the missing terms. Thus, the sum of all the natural numbers from 1 to 100 is 1 + 2 + … + 99 + 100 = 5050.

[edit] Capital-sigma notation

Mathematical notation has a special representation for compactly representing summation of many similar terms: the summation symbol, a large upright capital Sigma. This is defined thus:

\sum_{i=m}^n x_i = x_m + x_{m+1} + x_{m+2} +\cdots+ x_{n-1} + x_n.

The subscript gives the symbol for an index variable, i. Here, i represents the index of summation; m is the lower bound of summation, and n is the upper bound of summation. Here i = m under the summation symbol means that the index i starts out equal to m. Successive values of i are found by adding 1 to the previous value of i, stopping when i = n. We could as well have used k instead of i, as in

\sum_{k=2}^6 k^2 = 2^2+3^2+4^2+5^2+6^2 = 90.

Informal writing sometimes omits the definition of the index and bounds of summation when these are clear from context, as in

\sum x_i^2

which is informally equivalent to

\sum_{i=1}^n x_i^2.

One often sees generalizations of this notation in which an arbitrary logical condition is supplied, and the sum is intended to be taken over all values satisfying the condition. For example:

\sum_{0\le k< 100} f(k)

is the sum of f(k) over all (integer) k in the specified range,

\sum_{x\in S} f(x)

is the sum of f(x) over all elements x in the set S, and

\sum_{d|n}\;\mu(d)

is the sum of μ(d) over all integers d dividing n.

(Remark: Although the name of the dummy variable does not matter (by definition), one usually uses letters from the middle of the alphabet (i through q) to denote integers, if there is a risk of confusion. For example, even if there should be no doubt about the interpretation, it could look slightly confusing to many mathematicians to see x instead of k in the above formulae involving k. See also typographical conventions in mathematical formulae.)

There are also ways to generalize the use of many sigma signs. For example,

\sum_{\ell,\ell'}

is the same as

\sum_\ell\sum_{\ell'}.

[edit] Programming language notation

Summations can also be represented in a programming language.  \sum_{i=m}^{n} x_{i} Some languages use a notation for summation similar to the mathematical one. For example, this is Python:

sum(x[m:n+1])

and this is Fortran (or Matlab):

sum(x(m:n))

In other languages you have to use loops, as in the following Visual Basic/VBScript program:

Sum = 0
 For I = M To N
     Sum = Sum + X(I)
 Next I

or the following C/C++/C#/Java code, which assumes that the variables m and n are defined as integer types no wider than int, such that m ≥ n, and that the variable x is defined as an array of values of integer type no wider than int, containing at least m − n + 1 defined elements:

int i;
int sum = 0;
for (i = m; i <= n; i++)
    sum += x[i];

In some cases a loop can be written more concisely, as in this Perl code:

$sum = 0;
$sum += $x[$_] for ($m..$n);

or these alternative Ruby expressions:

x[m..n].inject{|a,b| a+b}
x[m..n].inject(0){|a,b| a+b}

or in C++, using its standard library:

std::accumulate(&x[m], &x[n] + 1, 0)

when x is an built-in array or a std::vector.

Note that most of these examples begin by initializing the sum variable to 0, the identity element for addition. (See "special cases" below).

Also note that the traditional \sum notation allows for the upper bound to be less than the lower bound. In this case, the index variable is initialized with the upper bound instead of the lower bound, and it is decremented instead of incremented. Since addition is commutative, this might also be accomplished by swapping the upper and lower bound and incrementing in a positive direction as usual.

The exact meaning of \sum, and therefore its translation into a programming language, changes depending on the data type of the subscript and upper bound. In other words, \sum is an overloaded symbol.

In the above examples, the subscript of \sum was translated into an assignment statement to an index variable at the beginning of a for loop. But the subscript is not always an assignment statement. Sometimes the subscript sets up the iterator for a foreach loop, and sometimes the subscript is itself an array, with no index variable or iterator provided.

In the example below:

\sum_{x\in S} f(x)

x is an iterator, which implies a foreach loop, but S is a set, which is an array-like data structure that can store values of mixed type. The summation routine for a set would have to account for the fact that it is possible to store non-numerical data in a set.

The return value of \sum is a scalar in all examples given above.

[edit] Special cases

It is possible to sum fewer than 2 numbers:

  • If you sum the single term x, then the sum is x.
  • If you sum zero terms, then the sum is zero, because zero is the identity for addition. This is known as the empty sum.

These degenerate cases are usually only used when the summation notation gives a degenerate result in a special case. For example, if m = n in the definition above, then there is only one term in the sum; if m = n + 1, then there is none.

[edit] Approximation by definite integrals

Many such approximations can be obtained by the following connection between sums and integrals, which holds for any:

increasing function f:

 \int_{s=a-1}^{b} f(s)\, ds \le \sum_{i=a}^{b} f(i) \le \int_{s=a}^{b+1} f(s)\, ds.

decreasing function f:

 \int_{s=a}^{b+1} f(s)\, ds \le \sum_{i=a}^{b} f(i) \le \int_{s=a-1}^{b} f(s)\, ds.

For more general approximations, see the Euler–Maclaurin formula.

For functions that are integrable on the interval [a, b], the Riemann sum can be used as an approximation of the definite integral. For example, the following formula is the left Riemann sum with equal partitioning of the interval:

 \frac{b-a}{n}\sum_{i=0}^{n-1} f\left(a+i\frac{b-a}n\right)\approx \int_a^b f(x)\,dx.

The accuracy of such an approximation increases with the number n of subintervals, such that:

 \lim_{n\rightarrow \infty} \frac{b-a}{n}\sum_{i=0}^{n-1} f\left(a+i\frac{b-a}n\right) = \int_a^b f(x)\,dx.

[edit] Identities

The following are useful identities:

  • \sum_{n=s}^t C\sdot f(n) = C\sdot \sum_{n=s}^t f(n), where 'C' is a distributed constant. (See Scalar multiplication)
  • \sum_{n=s}^t f(n) + \sum_{n=s}^{t} g(n) = \sum_{n=s}^t \left[f(n) + g(n)\right]
  • \sum_{n=s}^t f(n) = \sum_{n=s+p}^{t+p} f(n-p)
  • \sum_{n=s}^j f(n) + \sum_{n=j+1}^t f(n) = \sum_{n=s}^t f(n)
  • \sum_{i=m}^n x = (n-m+1)x
  • \sum_{i=1}^n x = nx, definition of multiplication where n is an integer multiplier to x
  • \sum_{i=m}^n i = \frac{(n-m+1)(n+m)}{2} (see arithmetic series)
  • \sum_{i=0}^n i = \sum_{i=1}^n i = \frac{(n+1)(n)}{2} (Special case of the arithmetic series)
  • \sum_{i=1}^n i^2 = \frac{n(n+1)(2n+1)}{6} = \frac{n^3}{3} + \frac{n^2}{2} + \frac{n}{6}
  • \sum_{i=1}^n i^3 = \left(\frac{n(n+1)}{2}\right)^2 = \frac{n^4}{4} + \frac{n^3}{2} + \frac{n^2}{4} = \left[\sum_{i=1}^n i\right]^2
  • \sum_{i=1}^n i^4 = \frac{n(n+1)(2n+1)(3n^2+3n-1)}{30} = \frac{n^5}{5} + \frac{n^4}{2} + \frac{n^3}{3} - \frac{n}{30}
  • \sum_{i=0}^n i^p = \frac{(n+1)^{p+1}}{p+1} + \sum_{k=1}^p\frac{B_k}{p-k+1}{p\choose k}(n+1)^{p-k+1}
where Bk is the kth Bernoulli number.

[edit] Growth rates

The following are useful approximations (using theta notation):

  • \sum_{i=1}^n i^c \in \Theta(n^{c+1}) for real c greater than -1
  • \sum_{i=1}^n \frac{1}{i} \in \Theta(\log n)
  • \sum_{i=1}^n c^i \in \Theta(c^n) for real c greater than 1
  • \sum_{i=1}^n \log(i)^c \in \Theta(n \cdot \log(n)^{c}) for nonnegative real c
  • \sum_{i=1}^n \log(i)^c \cdot i^d \in \Theta(n^{d+1} \cdot \log(n)^{c}) for nonnegative real c, d
  • \sum_{i=1}^n \log(i)^c \cdot i^d \cdot b^i \in \Theta (n^d \cdot \log(n)^c \cdot b^n) for nonnegative real b > 1, c, d

[edit] See also

[edit] External links