Pi-calculus

From Wikipedia, the free encyclopedia

In theoretical computer science, the π-calculus is a process calculus originally developed by Robin Milner, Joachim Parrow and David Walker as a continuation of the body of work on the process calculus CCS (Calculus of Communicating Systems). The aim of the π-calculus is to be able to describe concurrent computations whose configuration may change during the computation.

Contents

[edit] Informal definition

The π-calculus is in the family of process calculi, mathematical formalisms for describing and analyzing properties of concurrent computation. In fact, the π-calculus, like the λ-calculus, is so minimal that it does not contain primitives such as numbers, booleans, data structures, variables, functions, or even the usual flow control statements (such as if... then...else, while...).

[edit] Process constructs

Central to the π-calculus is the notion of name. The simplicity of the calculus is due to the fact that names play a dual role as communication channels and variables.

The process constructs available in the calculus are the following (a precise definition is given in the following section):

  • concurrency, written P \mid Q, where P and Q are two processes or threads executed concurrently
  • communication, where
    • input prefixing c\left(x\right).P is a process waiting for a message to be sent on a communication channel named c before proceeding as P, binding the name received to the name x. Typically, this models either a process expecting a communication from the network or a label c usable only once by a goto c operation.
    • output prefixing \overline{c} \langle y \rangle.P describes that the name y is emitted on channel c before proceeding as P. Typically, this models either sending a message on the network or a goto c operation.
  • replication, written !\,P, which may be seen as a process which can always create a new copy of P. Typically, this models either a network service or a label c waiting for any number of goto c operations.
  • creation of a new name, written \left(\nu x\right)P, which may be seen as a process allocating a new constant x within P. As opposed to functional programming's let x=... in... operation, the constants of π-calculus are defined by their name only and are always communication channels.
  • the nil process, written 0, is a process whose execution is complete and has stopped.

Although the minimality of the π-calculus prevents us from writing programs in the normal sense, it is easy to extend the calculus. In particular, it is easy to define both control structures such as recursion, loops and sequential composition and datatypes such as first-order functions, truth values, list and integers. Moreover, extensions of the π-calculus have been proposed which take into account distribution or public-key cryptography. The applied π-calculus due to Abadi and Fournet puts these various extensions on a formal footing by extending the π-calculus with arbitrary datatypes.

[edit] A small example

Below is a tiny example of a process which consists of three parallel components. The channel name x is only known by the first two components.

(\nu x)(\overline{x} \langle z \rangle.0  |  x(y).   \overline{y}\langle x \rangle . x(y).0 ) | z(v) . \overline{v}\langle v \rangle. 0

The first two components are able to communicate on the channel x, and the name z becomes bound to y. The continuation of the process is therefore

(\nu x)(0|  \overline{z}\langle x \rangle . x(y). 0 ) | z(v). \overline{v}\langle v \rangle .0

Note that the remaining y is not affected because it is defined in an inner scope. The second and third parallel components can now communicate on the channel name z, and x is bound to v. The continuation of the process is now

(\nu x)(0|  x(y). 0  | \overline{x}\langle x \rangle .0)

Note that since the local name x has been output, the scope of x is extended to cover the third component as well. Finally, the channel x can be used for sending the name x.

[edit] Formal definition

[edit] Syntax

Let Χ be a set of objects called names. The processes of π-calculus are built from names by the syntax (where x and y are any names from Χ)

\begin{matrix} P ::= & x(y).P\\     | & \overline{x} \langle y \rangle.P \\     | & P\;|\;P \\     | & (\nu x)P \\     | & !P \\     | & 0 \end{matrix}

Names can be bound by the restriction and input prefix constructs. The sets of free and bound names of a process in π–calculus are defined inductively as follows.

  • The 0 process has no free names and no bound names.
  • The free names of a\langle x \rangle.P are a, x, and the free names of P. The bound names of a\langle x \rangle.P are the bound names of P.
  • The free names of a(x).P are a and the free names of P, except for x. The bound names of a(x).P are x and the bound names of P.
  • The free names of P | Q are those of P together with those of Q. The bound names of P | Q are those of P together with those of Q.
  • The free names of x).P are those of P, except for x. The bound names of x).P are x and the bound names of P.
  • The free names of !P are those of P. The bound names of !P are those of P.

[edit] Structural congruence

Central to both the reduction semantics and the labelled transition semantics is the notion of structural congruence. Two processes are structurally congruent, if they are identical up to structure. In particular, parallel composition is commutative and associative.

More precisely, structural congruence is defined as the least equivalence relation preserved by the process constructs and satisfying:

Alpha-conversion:

  • P \equiv Q if Q can be obtained from P by renaming one or more bound names in P.

Axioms for parallel composition:

  • P|Q \equiv Q|P
  • (P|Q)|R \equiv P|(Q|R)
  • P | 0 \equiv P

Axioms for restriction:

  • (\nu x)(\nu y)P \equiv (\nu y)(\nu x)P
  • (\nu x)0 \equiv 0

Axiom relating restriction and parallel:

  • (\nu x)(P | Q) \equiv (\nu x)P | Q if x is not a free name of Q.

This last axiom is known as the "scope extension" axiom. This axiom is central, since it describes how a bound name x may be extruded by an output action, causing the scope of x to be extended.

[edit] Reduction semantics

We write P \rightarrow P' if P can perform a computation step, following which it is now P'. This reduction relation \rightarrow is defined as the least relation closed under a set of reduction rules.

The main reduction rule which captures the ability of processes to communicate through channels is the following:

  • \overline{x}\langle z \rangle.P | x(y).Q \rightarrow P | Q[z/y]
where Q[z / y] denotes the process Q in which the free name z has been substituted for the free name y. Note that this substitution may involve alpha-conversion to avoid name clashes.

There are three additional rules:

  • If P \rightarrow Q then also P|R \rightarrow Q|R.
This rule says that parallel composition does not inhibit computation.
  • If P \rightarrow Q, then also (\nu x)P \rightarrow (\nu x)Q.
This rule ensures that computation can proceed underneath a restriction.
  • If P \equiv P' and P' \rightarrow Q' where Q' \equiv Q, then also P \rightarrow Q.

The latter rule states that processes that are structurally congruent have the same reductions.

[edit] The example revisited

Consider again the process

(\nu x)(\overline{x} \langle z \rangle.0  |  x(y).   \overline{y}\langle x \rangle . x(y).0 ) | z(v) . \overline{v}\langle v \rangle. 0

Applying the definition of the reduction semantics, we get the reduction

(\nu x)(\overline{x} \langle z \rangle.0  |  x(y).   \overline{y}\langle x \rangle . x(y).0 ) | z(v) . \overline{v}\langle v \rangle. 0  \rightarrow (\nu x)(0|  \overline{z}\langle x \rangle . x(y). 0 ) | z(v). \overline{v}\langle v \rangle .0

Next, we get the reduction

(\nu x)(0|  \overline{z}\langle x \rangle . x(y). 0 ) | z(v). \overline{v}\langle v \rangle .0 \rightarrow (\nu x)(0|  x(y). 0  | \overline{x}\langle x \rangle .0)

Note that since the local name x has been output, the scope of x is extended to cover the third component as well. This was captured using the scope extension axiom.

[edit] Labelled semantics

Alternatively, one may give the pi-calculus a labelled transition semantics (as has been done with the Calculus of Communicating Systems). Transitions in this semantics are of the form:

\begin{matrix}   & \alpha & \\ P & \rightarrow  & P' \\ \end{matrix}

This notation signifies that p after the action α becomes p'. α can be an input action a(x), an output action a\langle x \rangle, or a tau-action τ corresponding to an internal communication.

A standard result about the labelled semantics is that it agrees with the reduction semantics in the sense that P \rightarrow P' if and only if \begin{matrix}   & \tau & \\ P & \rightarrow  & P' \\ \end{matrix} for some action τ.

[edit] Extensions and variants

The syntax given above is a minimal one. However, the syntax may be modified in various ways.

A nondeterministic choice operator P + Q can be added to the syntax.

A test for name equality [x = y]P can be added to the syntax. This match operator can proceed as P if and only if x and y are the same name. Similarly, one may add a mismatch operator for name inequality.

The asynchronous π-calculus allows only outputs with no continuation, i.e. output atoms of the form \overline{x}\langle y \rangle. However, any process can be represented by the asynchronous π-calculus using an extra channel to simulate explicit acknowledgement from the receiving process.

The polyadic π-calculus allows communicating more than one name in a single action: \overline{x}<z_1,...z_n>.P (polyadic output) and x(z1,...zn) (polyadic input). This polyadic extension can be encoded in the monadic calculus by passing the name of a private channel through which the multiple arguments are then passed in sequence. The encoding is defined recursively by the clauses

\overline{x}\langle y_1,\cdots,y_n\rangle.P is encoded as (\nu w) \overline{x}\langle w \rangle.\overline{w}\langle y_1\rangle.\cdots.\overline{w}\langle y_n\rangle.[P]

x(y_1,\cdots,y_n).P is encoded as x(w).w(y_1).\cdots.w(y_n).[P]

All other process constructs are left unchanged by the encoding.

In the above, [P] denotes the encoding of all prefixes in the continuation P in the same way.

The full power of replication !P is not needed. Often, one only considers replicated input !x(y).P, whose structural congruence axiom is ! x(y).P \equiv x(y).P | !x(y).P.

Replicated input process such as !x(y).P can be understood as servers, waiting on channel x to be invoked by clients. Invocation of a server spawns a new copy of the process P[a / y], where a is the name passed by the client to the server, during the latter's invocation.

A higher order π-calculus can be defined where not names but processes are sent through channels. The key reduction rule for the higher order case is

\overline{x}\langle R \rangle.P | x(Y).Q \rightarrow P | Q[R/Y]

Here, Y denotes a process variable which can be instantiated by a process term. Sangiorgi established the surprising result that the ability to pass processes does not increase the expressivity of the π-calculus: passing a process P can be simulated by just passing a name that points to P instead.

[edit] Properties

[edit] Turing completeness

The π-calculus is a universal model of computation. This was first observed by Milner in his paper "Functions as Processes" (Mathematical Structures in Computer Science, Vol. 2, pp. 119-141, 1992), in which he presents two encodings of the lambda-calculus in the π-calculus. One encoding simulates the call-by-value reduction strategy, the other encoding simulates the lazy (call-by-name) strategy.

The features of the π-calculus that make these encodings possible are name-passing and replication (or, equivalently, recursively defined agents). In the absence of replication/recursion, the π-calculus ceases to be Turing-powerful. This can be seen by the fact the bisimulation equivalence becomes decidable for the recursion-free calculus and even for the finite-control π-calculus where the number of parallel components in any process is bounded by a constant (Mads Dam: On the Decidability of Process Equivalences for the pi-Calculus. Theoretical Computer Science 183, 1997, pp. 215-228.)

[edit] Bisimulations in the π-calculus

See also article: Bisimulation

As for process calculi, the π-calculus allows for a definition of bisimulation equivalence. In the π-calculus, the definition of bisimulation equivalence (aka bisimilarity) may be based on either the reduction semantics or on the labelled transition semantics.

There are (at least) three different ways of defining labelled bisimulation equivalence in the π-calculus: Early, late and open bisimilarity. This stems from the fact that the π-calculus is a value-passing process calculus.

In the remainder of this section, we let p and q denote processes and R denote binary relations over processes.

[edit] Early and late bisimilarity

Early and late bisimilarity were both discovered by Milner, Parrow and Walker in their original paper on the π-calculus.[1]

A binary relation R over processes is an early bisimulation if for every pair of elements (p, q) \in R, whenever

\begin{matrix}   & a(x) & \\ p & \rightarrow  & p' \\ \end{matrix}

then for every name y there exists some q' \in R such that

\begin{matrix}   & a(x) & \\ q &  \rightarrow & q' \\ \end{matrix}

and (p'[y/x],q'[y/x]) \in R , and for any other kind of action α:

\begin{matrix}   & \alpha & \\ p & \rightarrow  & p' \\ \end{matrix}

implies that there exists some q' \in R such that

\begin{matrix}   & \alpha & \\ q &  \rightarrow & q' \\ \end{matrix}

and (p',q') \in R Processes p and q are said to be early bisimilar, written p \sim_e q if the pair (p,q) \in R for some early bisimulation R. \sim_e

In late bisimilarity, the transition match must be independent of the name being transmitted. A binary relation R over processes is a late bisimulation if for every pair of elements (p, q) \in R, whenever

\begin{matrix}   & a(x) & \\ p & \rightarrow  & p' \\ \end{matrix}

then for some q' \in R it holds that

\begin{matrix}   & a(x) & \\ q &  \rightarrow & q' \\ \end{matrix}

and (p'[y/x],q'[y/x]) \in R for every name y, and for any other kind of action α:

\begin{matrix}   & \alpha & \\ p & \rightarrow  & p' \\ \end{matrix}

implies that there exists some q' \in R such that

\begin{matrix}   & \alpha & \\ q &  \rightarrow & q' \\ \end{matrix}

and (p',q') \in R

Processes p and q are said to be late bisimilar, written p \sim_l q if the pair (p,q) \in R for some late bisimulation R.

Both \sim_e and \sim_l suffer from the problem that they are not congruence relations in the sense that they are not preserved by all process constructs. More precisely, there exist processes p and q such that p \sim_e q but a(x)p \not \sim_e a(x).q. One may remedy this problem by considering the maximal congruence relations included in \sim_e and \sim_l, known as early congruence and late congruence, respectively.

[edit] Open bisimilarity

Fortunately, a third definition is possible, which avoids this problem, namely that of open bisimilarity, due to Sangiorgi (D. Sangiorgi. A theory of bisimulation for the π-calculus. Acta Informatica, Volume 33 , Issue 1, 1996.)

A binary relation R over processes is an open bisimulation if for every pair of elements (p, q) \in R and for every name substitution σ and every action α, whenever

\begin{matrix}   & \alpha & \\ p\sigma & \rightarrow  & p' \\ \end{matrix}

then there exists some q' \in R such that

\begin{matrix}   & \alpha & \\ q\sigma &  \rightarrow & q' \\ \end{matrix}

and (p',q') \in R

Processes p and q are said to be open bisimilar, written p \sim_o q if the pair (p,q) \in R for some open bisimulation R. \sim_o

Early, late and open bisimilarity are in fact all distinct. The containments are proper, so \sim_o \subsetneq \sim_l \subsetneq \sim_e.

In certain subcalculi such as the asynchronous pi-calculus, late, early and open bisimilarity are known to coincide. However, in this setting a more appropriate notion is that of asynchronous bisimilarity.

The reader should note that, in the literature, the term open bisimulation usually refers to a more sophisticated notion, where processes and relations are indexed by distinction relations; details are in Sangiorgi's paper cited above.

[edit] Barbed equivalence

Alternatively, one may define bisimulation equivalence directly from the reduction semantics. We write p \Downarrow a if process p immediately allows an input or an output on name a.

A binary relation R over processes is a barbed bisimulation if it is a symmetric relation which satisfies that for every pair of elements (p, q) \in R we have that

(1) p \Downarrow a if and only if q \Downarrow a for every name a

and

(2) for every reduction p \rightarrow p' there exists a reduction q \rightarrow  q'

such that (p',q') \in R.

We say that p and q are barbed bisimilar if there exists a barbed bisimulation R where (p,q) \in R.

Definying a context as a π term with a hole [] we say that two processes P and Q are barbed congruent, written P˜b if for every context C[] we have that C[Pbb[Q]. It turns out that barbed congruence coincides with the congruence induced by early bisimilarity.

[edit] Applications

The π-calculus has been used to describe many different kinds of concurrent systems. In fact, some of the most recent applications lie outside the realm of computer science.

In 1997, Martin Abadi and Andrew Gordon proposed an extension of the π-calculus, the Spi-calculus, as a formal notation for describing and reasoning about cryptographic protocols. The spi-calculus extends the π-calculus with primitives for encryption and decryption. There is now a large body of work devoted to variants of the spi-calculus, including a number of experimental verification tools. One example is the tool ProVerif [1] due to Bruno Blanchet, based on a translation of the applied π-calculus into Blanchet's logic programming framework. Another example is Cryptyc [2], due to Andrew Gordon and Alan Jeffrey, which uses Woo and Lam's method of correspondence assertions as the basis for type systems that can check for authentication properties of cryptographic protocols.

Around 2002, Howard Smith and Peter Fingar became interested in using the π-calculus as a description tool for modelling business processes. As of July 2006, there is discussion in the community as to how useful this will be. Most recently, the π-calculus has been used as the theoretical basis of Business Process Modeling Language (BPML), and of Microsoft's XLANG.[citation needed]

The π-calculus has also attracted interest in molecular biology. In 1999, Aviv Regev and Ehud Shapiro showed that one can describe a cellular signaling pathway (the so-called RTK/MAPK cascade) and in particular the molecular "lego" which implements these tasks of communication in an extension of the π-calculus.[2]

[edit] Implementations

The following programming languages are implementations either of the π-calculus or of its variants:

  • Acute
  • BPML (Business Process Modeling Language)
  • The Kell machine
  • Nomadic Pict
  • Occam-Pi
  • Pict

[edit] Notes

  1. ^ Milner, R., J. Parrow and D. Walker (1992). "A calculus of mobile processes". Information and Computation (100): 1--40.
  2. ^ Regev, Aviv, William Silverman and Ehud Y. Shapiro. "Representation and Simulation of Biochemical Processes Using the pi-Calculus Process Algebra". Pacific Symposium on Biocomputing 2001: 459–470.

[edit] References

  • Davide Sangiorgi and David Walker: The Pi-calculus: A Theory of Mobile Processes, Cambridge University Press, ISBN 0-521-78177-9

[edit] External links

In other languages