Functional programming

From Wikipedia, the free encyclopedia

Functional programming is a programming paradigm that conceives computation as the evaluation of mathematical functions and avoids state and mutable data. Functional programming emphasizes the application of functions, in contrast with imperative programming, which emphasizes changes in state and the execution of sequential commands.[1]

A broader conception of functional programming simply defines a set of common concerns and themes rather than a list of distinctions from other paradigms. Often considered important are higher-order and first-class functions, closures, and recursion. Other common features of functional programming languages are continuations, Hindley-Milner type inference systems, non-strict evaluation (including, but not limited to, "laziness"), and monads.

Functional programming languages, especially "purely functional" ones, have largely been emphasized in academia rather than in commercial software development. However, notable functional programming languages used in industry and commercial applications include Erlang (concurrent applications),[2] R (statistics), Mathematica (symbolic math),[3] J and K (financial analysis), and domain-specific programming languages like XSLT.[4][5] Important influences on functional programming have been the lambda calculus, the APL programming language, the Lisp programming language, and more recently the ML programming language and the Haskell programming language.

Contents

[edit] History

Lambda calculus, invented by Alonzo Church in the 1930s, provides a theoretical framework for describing functions and their evaluation. Though it is a mathematical abstraction rather than a programming language, lambda calculus forms the basis of almost all functional programming languages today.

Combinatory logic is an equivalent theoretical foundation, developed by Moses Schönfinkel and Haskell Curry. It was originally developed to achieve a larger goal: a clearer approach to the foundations of mathematics.[6] Combinatory logic, which is commonly perceived as more abstract than lambda calculus, preceded lambda-calculus in invention.

The first computer-based functional programming language was Information Processing Language (IPL), developed by Newell, Shaw, and Simon at RAND Corporation for the JOHNNIAC computer in the mid-1950s. An early functional-flavored programming language was LISP (now mixed-case "Lisp"), developed by John McCarthy while at MIT for the IBM 700/7000 series scientific computers in the late 1950s.[7] LISP introduced many of the features now found in modern functional programming languages, though LISP is technically a multi-paradigm language. Scheme and Dylan were later attempts to simplify and improve LISP.

Kenneth E. Iverson developed the APL programming language in the early 1960s, described in his 1962 book "A Programming Language." APL was the primary influence on John Backus's FP programming language. In the early 1990s, Iverson and Roger Hui created a successor to APL, the J programming language. In the mid 1990s, Arthur Whitney, who had previously worked with Iverson, created the K programming language, which is used commercially in financial industries.

John Backus presented the FP programming language in his 1977 Turing Award lecture Can Programming Be Liberated From the von Neumann Style? A Functional Style and its Algebra of Programs. He defines functional programs as being built up in a hierarchical way by means of "combining forms" that allow an "algebra of programs"; in modern language, this means that functional programs follow the principle of compositionality. Backus's paper popularized research into functional programming, though it emphasized function-level programming rather than the lambda-calculus style which has come to be associated with functional programming.

In the 1970s the ML programming language was created by Robin Milner at the University of Edinburgh, and David Turner developed the language Miranda at the University of Kent. ML eventually developed into several dialects, the most common of which are now Objective Caml and Standard ML. The Haskell programming language was released in the late 1980s in an attempt to gather together many ideas in functional programming research.

[edit] Concepts

A number of concepts and paradigms are specific to functional programming, and generally foreign to imperative programming (including object oriented programming). However, programming languages are often hybrids of several programming paradigms so programmers using "mostly imperative" languages may have utilized some of these concepts.[8]

[edit] Higher-order functions

Functional programming uses the notion of higher-order functions. Functions are higher-order when they can take other functions as arguments, and/or return functions as results. (The derivative and antiderivative in calculus are mathematical examples of functions that map a function to another function.)

Higher-order functions are closely related to first-class functions, in that higher-order functions and first-class functions both allow functions as arguments and results of other functions. The distinction between the two is subtle: "higher-order" describes a mathematical concept of functions that operate on other functions, while "first-class" is a computer science term that describes programming language entities that have no restriction on their use (thus first-class functions can appear anywhere in the program that other first-class entities like numbers can, including as arguments to other functions and as their return values).

Higher-order functions enable currying, a technique in which a function is applied to its arguments one at a time, with each application returning a new (higher-order) function that accepts the next argument.

[edit] Pure functions

Purely functional programs have no side effects. Since pure functions do not modify state, no data may be changed by parallel function calls. Pure functions are therefore thread-safe, which allow interpreters and compilers to use call-by-future evaluation. The lack of side-effects allows some languages, such as Haskell, to use call-by-need evaluation. Haskell is therefore a pure functional language. However, not all functional languages are pure. The lisp family of languages are not pure because they allow side-effects.

Pure functional programming languages typically enforce referential transparency, which is the notion that 'equals can be substituted for equals': if two expressions have "equal" values (for some notion of equality), then one can be substituted for the other in any larger expression without affecting the result of the computation. For example, in

y = f(x) * f(x);    

a compiler can factor out f(x) if it is pure, transforming the program to

z = f(x);   
y = z * z;       

and eliminating the second evaluation of the (possibly costly) call to f(x). This optimization is called common subexpression elimination.

However, if a function has side effects, the function call cannot be eliminated. Consider the following program fragment:

y = random() * random();    

The second call to random cannot be eliminated, because its return value may be different from that of the first call. Similarly,

y = printf("x") * printf("x");      

cannot be optimized away; even if printf returns the same value both times, failing to make the second call would result in different program output.

Most compilers for imperative programming languages do not perform common subexpression elimination for function calls, because they do not track whether a function is pure. It is possible for an advanced compiler to infer whether a local function has effects and optimize accordingly; however, most pre-compiled libraries do not expose this information, preventing calls to external functions from being optimized away. Some compilers, such as gcc, add extra keywords for a programmer to explicitly mark pure functions so that this optimization can be performed.

[edit] Recursion

Iteration (looping) in functional languages is usually accomplished via recursion. Recursive functions invoke themselves, allowing an operation to be performed over and over. Recursion may require maintaining a stack, but tail recursion can be recognized and optimized by a compiler into the same code used to implement iteration in imperative languages. The Scheme programming language standard requires implementations to recognize and optimize tail recursion.

Common patterns of recursion can be factored out using higher order functions, catamorphisms and anamorphisms (or "folds" and "unfolds") being the most obvious examples. Such higher order functions play a role analogous to built-in control structures such as loops in imperative languages.

[edit] Functional programming in non-functional languages

It is possible to employ a functional style of programming in languages that are not traditionally considered functional languages.[9] Some non-functional languages have borrowed features such as lambda functions, higher-order functions, and list comprehensions from functional programming languages. This makes it easier to adopt a functional style when using these languages. Functional constructs such as higher-order functions and lazy lists can be obtained in C++ via libraries.[10] In C one can use function pointers to get some of the effects of higher-order functions, for example one can implement the common function map using function pointers. Widespread declarative domain specific languages like SQL and Lex/Yacc, while not Turing-complete, use some elements of functional programming, especially in eschewing mutable values.[11]

[edit] Comparison of functional and imperative programming

Functional programming is very different from imperative programming. The most significant differences stem from the fact that functional programming avoids side effects, which are used in imperative programming to implement state and I/O. Pure functional programming disallows side effects completely. Disallowing side effects provides for referential transparency, which makes it easier to verify, optimize, and parallelize programs, and easier to write automated tools to perform those tasks.

Higher order functions are rarely used in older imperative programming. Where a traditional imperative program might use a loop to traverse a list, a functional style would often use a higher-order function, map, that takes as argument a function and a list, applies the function to each element of the list, and returns a list of the results.

[edit] Simulating state

There are tasks—for example, maintaining a bank account balance—that often seem most naturally implemented with state. Pure functional programming performs these tasks, and I/O tasks such as accepting user input and printing to the screen, in a different way.

The pure functional programming language Haskell implements them using monads, derived from category theory in mathematics. Monads are extremely powerful and offer an intuitive way to model state (and other side effects such as IO) in an imperative manner without losing purity. While existing monads are easy to use, many find it difficult to understand how to define new monads (which is sometimes needed for certain types of libraries).[12]

Alternative methods such as Hoare logic have been developed to track side effects in programs. Some modern research languages use effect systems to make explicit the presence of side effects.

[edit] Efficiency issues

Functional programming languages have automatic memory management with garbage collection, in contrast to older imperative languages like C and Pascal which use explicit memory management. Functional programming languages have been perceived as less efficient in their use of CPU and memory than those languages. However, many modern imperative languages such as Java, Perl, Python, and Ruby also perform automatic memory management.

Functional programming languages have become more efficient over the years. For programs which perform intensive numerical computations, functional languages such as OCaml and Clean are similar in speed to C.[13] For programs that handle large matrices and multidimensional databases, array functional languages (such as J and K) were designed with speed optimization in mind. Despite purely functional languages having a reputation for being slower, any imperative algorithm is expressible in these languages with a logarithmic slowdown in the worst case. Moreover, the immutability of data can in many cases lead to greater execution efficiency owing to the compiler making assumptions that are unsafe in an imperative language.

[edit] Coding styles

Imperative programs tend to emphasize the series of steps taken by a program in carrying out an action, while functional programs tend to emphasize the composition and arrangement of functions, often without specifying explicit steps. A simple example of two solutions to the same programming goal (using the same multi-paradigm language Python) illustrates this.

# imperative style
target = []               # create empty list
for item in source_list:  # iterate over each thing in source
    trans1 = G(item)      # transform the item with the G() function
    trans2 = F(trans1)    # second transform with the F() function
    target.append(trans2) # add transformed item to target

A functional version has a different feel to it:

# functional style
def compose2(F, G):       # FP-oriented languages often have standard compose()
    def C(item):          # here we create utility-function for composition
        return F(G(item))
    return C
target = map(compose2(F,G), source_list)
# More compact examples of the functional paradigm in Python:
target =  map(lambda x: F(G(x)), source_list) # uses a lambda form with map
# or:
target = [F(G(item)) for item in source_list] # a list comprehension

In contrast to the imperative style that describes the steps involved in building target, the functional style describes the mathematical relationship between source_list and target.

[edit] See also

[edit] Notes

  1. ^ Hudak, Paul (September 1989). "Conception, evolution, and application of functional programming languages". ACM Computing Surveys 21 (3): 359-411.
  2. ^ Who uses Erlang for product development?. Frequently asked questions about Erlang. Retrieved on 2006-06-27. "The largest user of Erlang is (surprise!) Ericsson. Ericsson use it to write software used in telecommunications systems. Many (dozens) projects have used it, a particularly large one is the extremely scalable AXD301 ATM switch." Other commercial users listed as part of the FAQ include: Nortel, Deutsche Flugsicherung (the German national air traffic control organisation), and T-Mobile.
  3. ^ Department of Applied Math, University of Colorado. Functional vs. Procedural Programming Language. Retrieved on 2006-08-28.
  4. ^ Dimitre Novatchev. The Functional Programming Language XSLT - A proof through examples. TopXML. Retrieved on May 27, 2006.
  5. ^ David Mertz. XML Programming Paradigms (part four): Functional Programming approached to XML processing. IBM developerWorks. Retrieved on May 27, 2006.
  6. ^ Curry, Haskell Brooks, Robert Feys and Craig, William (1958). Combinatory Logic. Volume I. Amsterdam: North-Holland Publishing Company.
  7. ^ McCarthy, John (June 1978). "History of Lisp". In ACM SIGPLAN History of Programming Languages Conference: 173—196. " The implementation of LISP began in Fall 1958."
  8. ^ Dick Pountain. Functional Programming Comes of Age. BYTE.com (August 1994). Retrieved on August 31, 2006.
  9. ^ Hartel, Pieter, Henk Muller and Hugh Glaser (March 2004). "The Functional C experience". The Journal of Functional Programming 14 (2): 129–135.; David Mertz. Functional programming in Python, Part 3. IBM developerWorks. Retrieved on 2006-09-17.(Part 1, Part 2)
  10. ^ McNamara, B.. FC++: Functional Programming in C++. Retrieved on 2006-05-28.
  11. ^ Donald D. Chamberlin and Raymond F. Boyce (1974). "SEQUEL: A structured English query language". Proceedings of the 1974 ACM SIGFIDET: 249-264.. In this paper, one of the first formal presentations of the concepts of SQL (and before the name was later abbreviated), Chamberlin and Boyce emphasize that SQL was developed "Without resorting to the concepts of bound variables and quantifiers".
  12. ^ Newbern, J.. All About Monads: A comprehensive guide to the theory and practice of monadic programming in Haskell. Retrieved on 2006-05-27., "The sheer number of different monad tutorials on the internet is a good indication of the difficulty many people have understanding the concept. This is due to the abstract nature of monads and to the fact that they are used in several different capacities, which can confuse the picture of exactly what a monad is and what it is good for."
  13. ^ See here for a good comparison between different languages on implementing a ray tracing algorithm.

[edit] References

  • Cousineau, Guy and Michel Mauny. The Functional Approach to Programming. Cambridge, UK: Cambridge University Press, 1998.
  • Felleisen, Matthias, Robert Findler, Matthew Flatt, and Shriram Krishnamurthi. How to Design Programs HTDP. MIT Press. 2001. on-line
  • Graham, Paul. ANSI Common LISP. Englewood Cliffs, New Jersey: Prentice Hall, 1996.
  • Curry, Haskell Brooks and Feys, Robert and Craig, William. Combinatory Logic. Volume I. North-Holland Publishing Company, Amsterdam, 1958.
  • Curry, Haskell Brooks and Hindley, J. Roger and Seldin, Jonathan P. Combinatory Logic. Volume II. North-Holland Publishing Company, Amsterdam * London, 1972.
  • MacLennan, Bruce J. Functional Programming: Practice and Theory. Addison-Wesley, 1990.
  • Pratt, Terrence, W. and Marvin V. Zelkowitz. Programming Languages: Design and Implementation. 3rd ed. Englewood Cliffs, New Jersey: Prentice Hall, 1996.
  • Salus, Peter H. Functional and Logic Programming Languages. Vol. 4 of Handbook of Programming Languages. Indianapolis, Indiana: Macmillan Technical Publishing, 1998.
  • Thompson, Simon. Haskell: The Craft of Functional Programming. Harlow, England: Addison-Wesley Longman Limited, 1996.
  • Harrop, Jon. Objective CAML for Scientists. Cambridge, England: Flying Frog Consultancy, 2005.

[edit] External links