Actor model and process calculi

From Wikipedia, the free encyclopedia

In computer science, the Actor model and process calculi are two closely related approaches to the modelling of concurrent digital computation. See Actor model and process calculi history.

There are many similarities between the two approaches, but also several differences (some philosophical, some technical):

  • There is only one Actor model (although it has numerous formal systems for design, analysis, verification, modeling, etc.); there are numerous process calculi, developed for reasoning about a variety of different kinds of concurrent systems at various levels of detail (including calculi that incorporate time, stochastic transitions, or constructs specific to application areas such as security analysis).
  • The Actor model was inspired by the laws of physics and depends on them for its fundamental axioms, i.e. physical laws (see Actor model theory); the process calculi were originally inspired by algebra (Milner 1993).
  • Processes in the process calculi are anonymous, and communicate by sending messages either through named channels (synchronous or asynchronous), or via ambients (which can also be used to model channel-like communications (Cardelli and Gordon 1998)). In contrast, actors in the Actor model possess an identity, and communicate by sending messages to the mailing addresses of other actors (this style of communication can also be used to model channel-like communications — see below).

The publications on the Actor model and on process calculi have a fair number of cross-references, acknowledgments, and reciprocal citations (see Actor model and process calculi history).

Contents

[edit] How do channels work?

Indirect communication using channels (e.g. Gilles Kahn and David MacQueen [1977]) has been an important issue for communication in parallel and concurrent computation affecting both semantics and performance. Some process calculi differ from the Actor model in their use of channels as opposed to direct communication.

[edit] Issues with synchronous channels

Synchronous channels have the property that a sender putting a message in the channel must wait for a receiver to get the message out of the channel before the sender can proceed.

[edit] Simple synchronous channels

A synchronous channel can be modeled by an Actor that receives put and get communications. The following is the behavior of an Actor for a simple synchronous channel:

  • Each put communication has a message and an address to which an acknowledgment is sent when the message is gotten by a get communication from the channel in FIFO order.
  • Each get communication has an address to which the gotten message is sent.

[edit] Synchronous channels in process calculi

However, simple synchronous channels do not suffice for process calculi such as Communicating Sequential Processes (CSP) [Hoare 1978 and 1985] because use of the guarded choice (after Dijkstra) command (called the alternative comannd in CSP). In a guarded choice command multiple offers (called guards) can be made concurrently on multiple channels to put and get messages; however at most one of the guards can be chosen for each execution of the guarded choice command. Because only one guard can be chosen, a guarded choice command in general effectively requires a kind of two-phase commit protocol or perhaps even a three-phase commit protocol if time-outs are allowed in guards (as in Occam 3 [1992]).

Consider the following program written in CSP [Hoare 1978]:

[X :: Z!stop() ||
 Y :: guard: boolean; guard := true;
     *[guard  →  Z!go(); Z?guard] ||
 Z :: n: integer; n:= 0;
       *[X?stop()  →  Y!false; print!n;
         [] Y?go()  →  n := n+1; Y!true]
]

According to Clinger [1981], this program illustrates global nondeterminism, since the nondeterminism arises from incomplete specification of the timing of signals between the three processes X, Y, and Z. The repetitive guarded command in the definition of Z has two alternatives:

  1. the stop message is accepted from X, in which case Y is sent the value false and print is sent the value n
  2. a go message is accepted from Y, in which case n is incremented and Y is sent the value true.

If Z ever accepts the stop message from X, then X terminates. Accepting the stop causes Y to be sent false which when input as the value of its guard and will cause Y to terminate. When both X and Y have terminated, Z terminates because it no longer has live processes providing input.

In the above program, there are synchronous channels from X to Z, Y to Z, and Z to Y.

[edit] Analogy with the committee coordination problem

According to Knabe [1992], Chandy and Misra [1988] characterized this as analogous to the committee coordination problem:

Professors in a university are assigned to various committees. Occasionally a professor will decide to attend a meeting of any of her committees, and will wait until that is possible. Meetings may begin only if there is full attendance. The task is to ensure that if all the members of a committee are waiting, then at least one of them will attend some meeting.
The crux of this problem is that two or more committees might share a professor. When that professor becomes available, she can only choose one of the meetings, while the others continue to wait.

[edit] A simple distributed protocol

This section presents a simple distributed protocol for channels in synchronous process calculi. The protocol has some problems that are addressed in the sections below.

The behavior of a guarded choice command is as follows:

  • The command sends a message to each of its guards to prepare.
  • When it receives the first response from one of its guards that it is prepared, the it sends a message to that guard to prepare to commit and sends messages to all of the other guards to abort.
    • When it receives a message from the guard that it is prepared to commit, then it sends the guard a commit message. However, if the guard throws an exception that it cannot prepare to commit, then guarded choice command starts the whole process all over again.
  • If all of its guards respond that they cannot prepare, then the guarded command does nothing.

The behavior of a guard is as follows:

  • When a message to prepare is received, then the guard sends a prepare message to each of the channels with which it is offering to communicate. If the guard has booleans such that it cannot prepare or if any of the channles respond that they cannot prepare, then it sends abort messages to the other channels and then responds that it cannot prepare.
  • When a message to prepare to commit is received, then the guard sends a prepare to commit message to each of the channels. If any of the channels respond that they cannot prepare to commit, then it sends abort messages to the other channels and then throws an exception that it cannot prepare to commit.
  • When a message to commit is received, then the guard sends an commit message to each of the channels.
  • When a message to abort is received, then the guard sends an abort message to each of the channels.

The behavior of a channel is as follows:

  • When a prepare to put communication is received, then respond that it is prepared if there is a prepare to get communication pending unless a terminate communication has been received, in which case throw an exception that it cannot prepare to put.
  • When a prepare to get communication is received, then respond that it is prepared if there is a prepare to put communication pending unless a terminate communication has been received, in which case throw an exception that it cannot prepare to get.
  • When a prepare to commit to put communication is received, then respond that it is prepared if there is a prepare to commit to get communication pending unless a terminate communication has been received, in which case throw an exception that it cannot prepare to commit to put.
  • When a prepare to commit to get communication is received, then respond that it is prepared if there is a prepare to commit to put communication pending unless a terminate communication has been received, in which case throw an exception that it cannot prepare to commit to get.
  • When a commit put communication is received, then depending on which of the following is received:
    • When a commit get communication is received, then if not already done perform the put and get and clean up the preparations.
    • When an abort get communication is received, then cancel the preparations
  • When a commit get communication is received, then depending on which of the following is received:
    • When a commit put communication is received, then if not already done perform the get and put and clean up the preparations.
    • When an abort put communication is received, then cancel the preparations.
  • When an abort put communication is received, then cancel the preparations.
  • When an abort get communication is received, then cancel the preparations.

[edit] Starvation on getting from multiple channels

Again consider the program written in CSP (discussed in Synchronous channels in process calculi above):

[X :: Z!stop() ||
 Y :: guard: boolean; guard := true;
     *[guard  →  Z!go(); Z?guard] ||
 Z :: n: integer; n:= 0;
       *[X?stop()  →  Y!false; print!n;
         [] Y?go()  →  n := n+1; Y!true]
]

As pointed out in Knabe [1992], an issue with the above protocol (A simple distributed protocol) is that the process Z might never accept the stop message from X (a phenomenon called starvation) and consequently the above program might never print anything.

In contrast consider, a simple Actor system that consists of Actors X, Y, Z, and print where

the Actor X is created with the following behavior:
  • If the message "start" is received, then send Z the message "stop"
the Actor Y is created with the following behavior:
  • If the message "start" is received, then send Z the message "go"
  • If the message true is received, then send Z the message "go"
  • If the message false is received, then do nothing
the Actor Z is created with the following behavior that has a count n that is initially 0:
  • If the message "start" is received, then do nothing.
  • If the message "stop" is received, then send Y the message false and send print the message the count n.
  • If the message "go" is received, then send Y the message true and process the next message received with count n being n+1.

By the laws of Actor semantics, the above Actor system will always halt when the Actors X, Y, are Z are each sent a "start" message resulting in sending print a number that can be unbounded large.

The difference between the CSP program and the Actor system is that the Actor Z does not get messages using a guarded choice command from multiple channels. Instead it processes messages in arrival ordering, and by the laws for Actor systems, the stop message is guaranteed to arrive.

[edit] Livelock on getting from multiple channels

Consider the following program written in CSP [Hoare 1978]:

[Bidder1 :: b: bid;
       *[Bids1?b  →  process1!b;
         [] Bids2?b  →  process1!b;] ||
 Bidder2 :: b: bid;
       *[Bids1?b  →  process2!b;
         [] Bids2?b  →  process2!b;] 
]

As pointed out in Knabe [1992], an issue with the above protocol (A simple distributed protocol) is that the process Bidder2 might never accept a bid from Bid1 or Bid2 (a phenomenon called livelock) and consequently process2 might never be sent anything. In each attempt to accept a message, Bidder2 is thwarted because the bid that was offered by Bids1 or Bids2 is snatched away by Bidder1 because it turns out that Bidder1 has much faster access than Bidder2 to Bids1 and Bids2. Consequently Bidder1 can accept a bid, process it and accept another bid before Bidder2 can commit to accepting a bid.

[edit] Efficiency

As pointed out in Knabe [1992], an issue with the above protocol (A simple distributed protocol) is the large number of communications that must be sent in order to perform the handshaking in order to send a message through a synchronous channel. Indeed as shown in the previous section (Livelock), the number of communications can be unbounded.

[edit] Summary of Issues

The subsections above have articulated the following three issues concerned with the use of synchronous channels for process calculi:

  1. Starvation. The use of sychronous channels can cause starvation when a process attempts to get messages from multiple channels in a guarded choice command.
  2. Livelock. The use of synchronous channels can cause a process to be caught in livelock when it attempts to get messages from multiple channels in a guarded choice command.
  3. Efficiency. The use of synchronous channels can require a large number of communications in order to get messages from multiple channels in a guarded choice command.

It is notable that in all of the above, issues arise from the use of a guarded choice command to get messages from multiple channels.

[edit] Asynchronous channels

Asynchronous channels have that property that a sender putting a message in the channel need not wait for a receiver to get the message out of the channel.

[edit] Simple asynchronouus channels

An asynchronous channel can be modeled by an Actor that receives put and get communications. The following is the behavior of an Actor for a simple asynchronous channel:

  • Each put communication has a message and an address to which an acknowledgment is sent immediately (without waiting for the message to be gotten by a get communication).
  • Each get communication has an address to which the gotten message is sent.

[edit] Asynchronous channels in process calculi

The Join-calculus programming language (published in 1996) implemented local and distributed concurrent computations. It incorporated asynchronous channels as well as a kind of synchronous channel that is used for procedure calls. Agha's Aπ Actor calculus (Agha and Thati 2004) is based on a typed version of the asynchronous π-calculus.

[edit] Migration

Migration is the ability of computational agencies to change locations. E.g., in his dissertation, Aki Yonezawa modeled a post office that customer Actors could enter, change locations within while operating, and exit. An Actor that can migrate can be modeled by having a location Actor that changes when the Actor migrates. However the faithfullness of this modeling is controversial and the subject of research.

Note that there is a potential confusion in that the original literature on process calculi used the term mobility to mean the ability to change the topology of communication whereas later mobility is sometimes use to mean migration. E.g., process calculi such as the higher-order π-calculus and API-Calculus used mobility to mean change in topology whereas the mobile ambients use mobility to mean migration. In this respect the ambient calculus differs from the π-calculus in that communication is through ambients as opposed to being though channels in the π-calculus.

[edit] Algebras

The use of algebraic techniques was pioneered in the process calculi. Subsequently several different process calculi intended to provide algebraic reasoning about Actor systems have been developed in (Gaspari and Zavattaro 1997), (Gaspari and Zavattaro 1999), (Agha and Thati 2004)

[edit] Denotational Semantics

Will Clinger (building on the work of Irene Greif [1975], Gordon Plotkin [1976], Henry Baker [1978], Michael Smyth [1978], and Francez, Hoare, Lehmann, and de Roever [1979]) published the first satisfactory mathematical denotational theory of the Actor model using domain theory in his dissertation in 1981. His semantics contrasted the unbounded nondeterminism of the Actor model with the bounded nondeterminism of CSP [Hoare 1978] and Concurrent Processes [Milne and Milner 1979] (see denotational semantics). Roscoe [2005] has developed a denotatinal semantics with unbounded nondeterminism for a subsequent version of Communicating Sequential Processes Hoare [1985]. More recently Hewitt [2006b] developed a denotational semantics for Actors based on timed diagrams.

Ugo Montanari and Carolyn Talcott [1998] have contributed to attempting to reconcile Actors with process calculi.

[edit] References

  • Carl Hewitt, Peter Bishop and Richard Steiger. A Universal Modular Actor Formalism for Artificial Intelligence IJCAI 1973.
  • Robin Milner. Processes: A Mathematical Model of Computing Agents in Logic Colloquium 1973.
  • Irene Greif and Carl Hewitt. Actor Semantics of PLANNER-73 Conference Record of ACM Symposium on Principles of Programming Languages. January 1975.
  • Irene Greif. Semantics of Communicating Parallel Professes MIT EECS Doctoral Dissertation. August 1975.
  • Gordon Plotkin. A powerdomain construction SIAM Journal of Computing September 1976.
  • Carl Hewitt and Henry Baker Actors and Continuous Functionals Proceeding of IFIP Working Conference on Formal Description of Programming Concepts. August 1-5, 1977.
  • Gilles Kahn and David MacQueen. Coroutines and networks of parallel processes IFIP. 1977
  • Aki Yonezawa Specification and Verification Techniques for Parallel Programs Based on Message Passing Semantics MIT EECS Doctoral Dissertation. December 1977.
  • Michael Smyth. Power domains Journal of Computer and System Sciences. 1978.
  • George Milne and Robin Milner. Concurrent processes and their syntax JACM. April, 1979.
  • CAR Hoare. Communicating Sequential Processes CACM. August, 1978.
  • Nissim Francez, C.A.R. Hoare, Daniel Lehmann, and Willem de Roever. Semantics of nondetermiism, concurrency, and communication Journal of Computer and System Sciences. December 1979.
  • Mathew Hennessy and Robin Milner. On Observing Nondeterminism and Concurrency LNCS 85. 1980.
  • Will Clinger. Foundations of Actor Semantics MIT Mathematics Doctoral Dissertation. June 1981.
  • Mathew Hennessy. A Term Model for Synchronous Processes Computer Science Dept. Edinburgh University. CSR-77-81. 1981.
  • J.A. Bergstra and J.W. Klop. Process algebra for synchronous communication Information and Control. 1984.
  • Luca Cardelli. An implementation model of rendezvous communication Seminar on Concurrency. Lecture Notes in Computer Science 197. Springer-Verlag. 1985
  • Robert van Glabbeek. Bounded nondeterminism and the approximation induction principle in process algebra Symposium on Theoretical Aspects of Computer Sciences on STACS 1987.
  • K. Mani Chandy and Jayadev Misra. Parallel Program Design: A Foundation Addison-Wesley 1988.
  • Robin Milner, Joachim Parrow and David Walker. A calculus of mobile processes Computer Science Dept. Edinburgh. Reports ECS-LFCS-89-85 and ECS-LFCS-89-86. June 1989. Revised Sept. 1990 and Oct. 1990 respectively.
  • Robin Milner. The Polyadic pi-Calculus: A Tutorial Edinburgh University. LFCS report ECS-LFCS-91-180. 1991.
  • Kohei Honda and Mario Tokoro. An Object Calculus for Asynchronous Communication ECOOP 91.
  • José Meseguer. Conditional rewriting logic as a unified model of concurrency in Selected papers of the Second Workshop on Concurrency and compositionality. 1992.
  • Frederick Knabe. A Distributed Protocol for Channel-Based Communication with Choice PARLE 1992.
  • Geoff Barrett. Occam 3 reference manual INMOS. 1992.
  • Benjamin Pierce, Didier Rémy and David Turner. A typed higher-order programming language based on the pi-calculus Workshop on type Theory and its application to computer Systems. Kyoto University. July 1993.
  • - Robin Milner: Elements of interaction: Turing award lecture, CACM .January 1993. (DOI).
  • R. Amadio and S. Prasad. Locations and failures Foundations of Software Technology and Theoretical Computer Science Conference. 1994.
  • Cédric Fournet and Georges Gonthier. The reflexive chemical abstract machine and the join-calculus POPL 1996.
  • Cédric Fournet, Georges Gonthier, Jean-Jacques Lévy, Luc Maranget, and Didier Rémy. A Calculus of Mobile Agents CONCUR 1996.
  • Tatsurou Sekiguchi and Akinori Yonezawa. A Calculus with Code Mobility FMOODS 1997.
  • - Mauro Gaspari and Gianluigi Zavattaro: An Algebra of Actors, Technical Report UBLCS-97-4, University of Bologna, May 1997
  • - Luca Cardelli and Andrew Gordon. Mobile Ambients Foundations of Software Science and Computational Structures, Maurice Nivat (Ed.), Lecture Notes in Computer Science, Vol. 1378, Springer, 1998.
  • Ugo Montanari and Carolyn Talcott. Can Actors and Pi-Agents Live Together? Electronic Notes in Theoretical Computer Science. 1998.
  • Robin Milner. Communicating and Mobile Systems: the Pi-Calculus Cambridge University Press. 1999.
  • - M. Gaspari and G. Zavattaro: An Algebra of Actors Formal Methods for Open Object Based Systems, 1999.
  • Davide Sangiorgi and David Walker. The Pi-Calculus : A Theory of Mobile Processes Cambridge University Press. 2001.
  • P. Thati, R. Ziaei, and G. Agha. A theory of may testing for asynchronous calculi with locality and no name matching Algebraic Methodology and Software Technology. Springer Verlag. September 2002. LNCS 2422.
  • - Gul Agha and Prasanna Thati. An Algebraic Theory of Actors and Its Application to a Simple Object-Based Language, From OO to FM (Dahl Festschrift) LNCS 2635. Springer-Verlag. 2004.
  • J.C.M. Baeten, T. Basten, and M.A. Reniers. Algebra of Communicating Processes Cambridge University Press. 2005.
  • He Jifeng and C.A.R. Hoare. Linking Theories of Concurrency United Nations University International Institute for Software Technology UNU-IIST Report No. 328. July, 2005.
  • Luca Aceto and Andrew D. Gordon (editors). Algebraic Process Calculi: The First Twenty Five Years and Beyond Process Algebra. Bertinoro, Forl`ı, Italy, August 1–5, 2005.
  • - A. W. Roscoe: The Theory and Practice of Concurrency, Prentice Hall, ISBN 0-13-674409-5. Revised 2005.
  • Carl Hewitt (2006b) What is Commitment? Physical, Organizational, and Social COIN@AAMAS. 2006.