Perl 6
From Wikipedia, the free encyclopedia
Perl 6 is a new major revision of the Perl programming language.
Larry Wall, the creator of Perl, has called Perl 6 "the community's rewrite of Perl and of the community", because he based the changes largely on 361 "requests for change" submitted by the Perl community in 2000.[1] This has prompted criticisms describing the Perl 6 design process as design by committee. [citation needed] Wall outlined these changes in a series of essays, called Apocalypses, which are numbered to correspond to chapters in Programming Perl ("The Camel Book"). These are now considered historical, and the current, unfinalized, specification of Perl 6 is encapsulated in design documents called "Synopses", which are numbered to correspond to Apocalypses.
Perl 6 is a re-design of the language, introducing elements of many modern and historical languages. Perl 6 grammar is distinct from Perl and not intended to be backwards-compatible, though a compatibility mode is part of the specification.
Perl 6 has been under development for over six years, prompting some Perl users to suggest that Perl 6 may be vaporware. There is currently a slow, but largely complete, implementation written in Haskell called Pugs. Two alternate implementations exist, based on PGE and Perl 5, and Wall announced at a recent conference in Europe that one of these implementations would become a Perl 6 alpha release by late December 2006.[citation needed]
Contents
|
[edit] History
The Perl 6 design process was first announced on July 19, 2000, on day 4 of that year's YAPC Conference,[2] by Larry Wall in his State of the Onion 2000 talk.[3] At that time, the primary goals were to remove "historical warts" from the language; "easy things should stay easy, hard things should get easier, and impossible things should get hard;" a general cleanup of the internal design and APIs. The process began with a series of requests for comments or "RFCs". This process was open to all contributors, and left no aspect of the language closed to change.[4]
Once the RFC process was complete, Wall reviewed and classified each request. He then began the process of writing the Apocalypses, the name of which refers to "a revealing" according to the first such document.[1] While the original goal was to write one Apocalypse for each chapter of Programming Perl, it became obvious that, as each Apocalypse was written, previous Apocalypses were being invaldidated by later changes. For this reason, a set of Synopses were published, each one relating the contents of an Apocalypse, but with any subsequent changes reflected in updates. Today, Perl 6 specification continues almost entirely within the Synopses.[5]
There are also a series of Exegeses written by Damian Conway that explain the content of each Apocalypse in terms of practical usage. Each Exegesis consists of code examples along with discussion of the usage and implications of the examples.[6]
There are three primary methods of communication used in the development of Perl 6 today. The first is the #perl6 IRC channel on freenode. The second is a set of mailing lists on The Perl Foundation's servers at perl.org.[7] The third is the subversion source code repository used by the pugs team. Pugs is an early implementation of Perl 6; see the Implementations section for more detail.
[edit] Goals
The break in compatibility was mandated from the start of the project, and immediately allowed some of the changes that Larry Wall had suggested in his initial speech. "Historical warts" such as the confusion surrounding sigil usage for containers; the ambiguity between the select functions; the syntactic impact of bareword filehandles; and many other problems that Perl programmers had discussed fixing for years were some of the first issues addressed.
Over the years, Perl 6 has undergone several alterations in its direction. The introduction of concepts from Python and Ruby were early influences, but as the pugs interpreter was written in the Haskell programming language, many functional programming influences were absorbed by the Perl 6 design team.
[edit] Implementations
Parrot is a virtual machine designed for interpreted languages, primarily for Perl 6. The self-hosting Perl 6 compiler will (according to plan) target and also run on Parrot.[8][9]
Pugs is an implementation of Perl 6 in the Haskell programming language that will be used for bootstrapping. Pugs's goal is to provide a platform on which to write the Perl 6 compiler in Perl 6. This could, for example, involve translating the Pugs Haskell source code to Perl 6.[10] After that, Perl 6 will be self-hosted—it will be used to compile itself. Much of the implementation of Perl will then be exposed to programmers using the language. For example, this would make it possible to extend the parser from within a program or library.
Pugs can execute Perl 6 code directly, as well as compile Perl 6 to JavaScript, Perl 5 or Parrot bytecode.
v6.pm
is a pure Perl 5 implementation of Perl 6, making liberal use of existing CPAN modules, such as Moose
and Pugs::Compiler::Rule
. It aims to make the existing perl runtime a first-class virtual machine for both Perl 5 and Perl 6.[11]
[edit] Major changes from Perl 5
Perl 5 and Perl 6 differ fundamentally, though in general the intent has been to "keep Perl 6 Perl". Most of the changes are intended to normalize the language, to make it easier for learning and expert programmers alike to understand, and to make "easy things easier and hard things more possible".
[edit] A Specification
A major, but non-technical difference between Perl 5 and Perl 6 is that Perl 6 began as a specification.[8] This means that Perl 6 can be re-implemented if needed, and it also means that programmers don't have to read the source code for the ultimate authority on any given feature. While Perl 5's documentation was regarded as excellent, even outside of the Perl community[12], if the documentation and the source code of the Perl 5 interpreter disagreed, the documentation would be changed.
[edit] A Type System
In Perl 6, the dynamic type system of Perl 5 has been augmented by the addition of static types.[13] For example:
my Int $i = 0; my Num $n = 3.142; my Str $s = "Hello, world";
However, as with Perl 5, programmers can do most things without any explicit typing at all:
my $i = "25" + 10; # $i is 35
Proponents of Static typing claim that it is beneficial to compiler optimization, reducing programming errors and increasing maintainability, especially in large software projects.
On the other hand, Dynamic typing reduces input source code size, which is the bottleneck on smaller projects. It is helpful when writing quick scripts, one-liners or "one-off" code (i.e., code that is written to achieve some temporary purpose and will be run once).
Perl 6 offers a hybrid typing system whereby the programmer may choose to use Static Typing, Dynamic Typing or mix the two.
[edit] Formal Subroutine Parameter Lists
Perl 5 defined subroutines without formal parameter lists at all (though simple parameter counting and some very loose type checking can be done using Perl 5's "prototypes"). Subroutine arguments passed in were aliased into the elements of the array @_. If @_ were modified, the changes would be reflected in the original data.
Perl 6 introduces true formal parameters to the language.[14] In Perl 6, a subroutine declaration looks like this:
sub do_something(Str $thing, Int $other) { ... }
As in Perl 5, the formal parameters (i.e., the variables in the parameter list) are aliases to the actual parameters (the values passed in), but by default, the aliases are constant so they cannot be modified. They may be declared explicitly as read-write aliases for the original value or as copies using the is rw
or is copy
directives should the programmer require them to be modified locally.
[edit] Parameter passing modes
Perl 6 provides three basic modes of parameter passing:
- Positional
- Named
- Slurpy
Positional parameters are the typical ordered list of parameters that most programming languages use. Named parameters may be passed in any order, and are assigned to their respective parameters based on a name tag that is associated with them. Named-only parameters never capture positional arguments, and are indicated with a leading :
character. Slurpy parameters (indicated by an *
before the parameter name) are Perl 6's tool for creating variadic functions. A slurpy hash will capture remaining passed-by-name parameters, whereas a slurpy array will capture remaining passed-by-position parameters.
Here is an example of the use of all three parameter-passing modes:
sub somefunction($a, $b, :$c, :$d, *@e) {...} somefunction(1, 2, :d<3>, 4, 5, 6); # $a=1, $b=2, $d=3, @e=(4,5,6) somefunction(:a<1>, :b<2>); # $a=1, $b=2
Positional parameters, such as those used above are always required, unless followed by ?
to indicate that they are optional. Named parameters are optional by default, but may be marked as required by adding !
after the variable name. Slurpy parameters are always optional.
[edit] Sigil invariance
In Perl 5, sigils—the punctuation characters that precede a variable name—changed depending on how the variable was used:
# Perl 5 code my @array = (1, 2, 3); my $element = $array[1]; # $element equals 2
This was inspired by natural language:
"The apple". # CORRECT "These apples. # CORRECT "The third one of the apples". # CORRECT "These third one of the apples". # WRONG
In Perl 6, sigils are invariant[13]:
my @array = (1, 2, 3); my $element = @array[1]; # $element equals 2
This change is meant to reduce the cognitive load of recognizing that a variable spelled $array...
is actually the variable @array
.
[edit] Object-oriented programming
Perl 5 supported object-oriented programming via a mechanism known as blessing. Any reference could be blessed into being an object of a particular class, a blessed object could have methods invoked on it using the "arrow syntax" which would cause Perl to locate or "dispatch" an appropriate subroutine by name, and call it with the blessed variable as its first argument.
While extremely powerful—virtually any other computer language's object model could be simulated using this simple facility—it made the most common case of object orientation, a struct-like object with some associated code, unnecessarily difficult. In addition, because Perl could make no assumptions about the object model in use, method invocation could not be optimized very well.
In the spirit of making the "easy things easy but hard things possible", Perl 6 retains the blessing model for programmers who desire unusual behavior, but supplies a more robust object model for the common cases.[15] For example, a class to encapsulate a Cartesian point could be defined and used this way:
class Point is rw { has $.x; has $.y; } my Point $point .= new( :x<1.2>, :y<-3.7> ); # Now change x (note method "x" used as lvalue): $point.x = 2; say "Point is at X location: ", $point.x;
The dot replaces the arrow in a nod to the many other languages (e.g. Java, Python, and many others) that have coalesced around dot as the syntax for method invocation.
In the terminology of Perl 6, $.x
is called an "attribute". Some languages call these fields or members. The method used to access an attribute is called an "accessor". Auto-accessors are methods that are created automatically, as the method x
is in the example above. These accessor functions return the value of the attribute. When a class or individual attribute is declared with the is rw
modifier (short for "read/write"), the auto-accessor can be passed a new value to set the attribute to, or it can be directly assigned to as an lvalue (as in the example). Auto-accessors can be replaced by user-defined methods, should the programmer desire a richer interface to an attribute. Attributes can only be accessed directly from within a class definition. All other access must go through the accessor methods.
[edit] Roles
Roles in Perl 6 take on the function of both interfaces in Java and traits[16] in Smalltalk variant, Squeak. These are much like classes, but are entirely abstract. These are used to perform composition when used with classes rather than adding to their inheritance chain. Roles define nominal types; they provide semantic names for collections of behavior and state.
The fundamental difference between a role and a class is that classes are instantiable; roles are not.[17]
[edit] Regular expressions
Perl's regular expression and string-processing support has always been one of its defining features.[18] Since Perl's pattern-matching constructs have exceeded the capabilities of formal regular expressions for some time, Perl 6 documentation will exclusively refer to them as regexes, distancing the term from the formal definition.
Perl 6 provides a superset of Perl 5 features with respect to regexes, folding them into a larger framework called "rules" which provide the capabilities of context-sensitive parsing formalisms (such as the syntactic predicates of parsing expression grammars and ANTLR), as well as acting as a closure with respect to their lexical scope.[19] Rules are introduced with the rule
keyword which has a usage quite similar to subroutine definition. Anonymous rules can also be introduced with the regex
(or rx
) keyword, or they can simply be used inline as regexps were in Perl 5 via the m
(matching) or s
(search and replace) operators.
In Apocalypse 5, Larry Wall enumerated 20 problems with "current regex culture". Among these were that Perl's regexes were "too compact and 'cute'", had "too much reliance on too few metacharacters", "little support for named captures", "little support for grammars", and "poor integration with [the] 'real' language".[20]
[edit] Syntactic simplification
The parentheses (round brackets) required in control flow constructs in Perl 5 are now optional[21]:
if is_true() { for @array { ... } }
This simplification exists in many areas. For example, the ,
operator is now a list constructor, so enclosing parentheses are no longer required around lists.
[edit] Chained comparisons
Perl 6 allows comparisons to "chain". That is, a sequence of comparisons such as the following are allowed:
if C(20) <= $temperature <= C(25) { say "Room temperature!" }
This is treated as if each left-to-right comparison were performed on its own, and the result is logically combined via the and
operation.
[edit] Lazy evaluation
Perl 6 uses the technique of lazy evaluation of lists that has been a feature of some functional programming languages such as Haskell[22]:
@integers = 0..Inf; # integers from 0 to infinity
The code above will not crash by attempting to assign a list of infinite size to the array @integers
, nor will it hang indefinitely in attempting to expand the list if a limited number of slots are searched.
This simplifies many common tasks in Perl 6 including input/output operations, list transformations and parameter passing.
[edit] Junctions
Perl 6 introduces the concept of junctions: values that are composites of other values.[22] In the earliest days of Perl 6's design, these were called "superpositions", by analogy to the concept in quantum physics of quantum superpositions — waveforms that can simultaneously occupy several states until observation "collapses" them. A Perl 5 module released in 2000 by Damian Conway called Quantum::Superpositions
[23] provided an initial proof of concept. While at first, such superpositional values seemed like merely a programmatic curiosity, over time their utility and intuitiveness became widely recognized, and junctions now occupy a central place in Perl 6's design.
In their simplest form, junctions are created by combining a set of values with junctive operators:
my $any_even_digit = 0|2|4|6|8; # any(0, 2, 4, 6, 8) my $all_odd_digits = 1&3&5&7&9; # all(1, 3, 5, 7, 9)
|
indicates a value which is equal to either its left or right-hand arguments. &
indicates a value which is equal to both its left and right-hand arguments. These values can be used in any code that would use a normal value. Operations performed on a junction act on all members of the junction equally, and combine according to the junctive operator. So, ("apple"|"banana") ~ "s"
would yield "apples"|"bananas"
. In comparisons, junctions return a single true or false result for the comparison. |
or any
junctions return true if the comparison is true for any one of the elements of the junction. &
or all
junctions return true if the comparison is true for all of the elements of the junction.
Junctions can also be used to more richly augment the type system by introducing a style of generic programming that is constrained to junctions of types:
sub get_tint ( RGB_Color|CMYK_Color $color, num $opacity) { ... } sub store_record (Record&Storable $rec) { ... }
[edit] Autothreading
Junctions are unordered; 1|2|3
and 3|2|1
represent the same value. This lack of ordering means that the Perl 6 compiler can choose to evaluate junctive expressions in parallel. For instance, the code:
for all(@array) { ... }
would indicate to the compiler that the for
loop should be run in parallel, possibly in separate threads. This feature is dubbed "autothreading"; its usage and implementation is not finalized and is a matter of debate in the development community.[24]
[edit] Macros
In low-level languages, the concept of macros has become synonymous with textual substitution of source-code due to the widespread use of the C preprocessor. However, high-level languages such as Lisp pre-dated C in their use of macros that were far more powerful.[25] It is this Lisp-like macro concept that Perl 6 will take advantage of.[14] The power of this sort of macro stems from the fact that it operates on the program as a high-level data structure, rather than as simple text, and has the full capabilities of the programming language at its disposal.
A Perl 6 macro definition will look like a subroutine or method definition, and can operate on unparsed strings, an AST representing pre-parsed code, or a combination of the two. A macro definition would look like this:
macro hello($what) { q:code { say "Hello { {{{$what}}} }" }; }
In this particular example, the macro is no more complex than a C-style textual substitution, but because parsing of the macro parameter occurs before the macro operates on the calling code, diagnostic messages would be far more informative. However, because the body of a macro is executed at compile time each time it is used, many techniques of optimization can be employed. It is even possible to entirely eliminate complex computations from resulting programs by performing the work at compile-time.
[edit] Hello world
The hello world program is often used in demonstrating a programming language's most routine usage. In Perl 6, hello world can be written:
say "Hello world"
Though, there is more than one way to do it. say
is new to Perl 6 which prints its parameters (like the print
function), but with a trailing newline.
[edit] References
- ^ a b Wall, Larry (2001-04-02). Apocalypse 1: The Ugly, the Bad, and the Good.
- ^ Kline, Joe (2000-08-21). Report from the Perl Conference.
- ^ Wall, Larry (2000). State of the Onion 2000. O'Reilly Network.
- ^ The Perl Foundation (2000). About Perl6 RFCs.
- ^ The Perl Foundation (2002). Perl Development: Synopses.
- ^ The Perl Foundation (2001). Exegeses.
- ^ The Perl Foundation (2002). Perl Development: Mailing Lists.
- ^ a b Wall, Larry (2004-08-10). Synopsis 1: Overview.
- ^ Tang, Autrijus (2005). Perl 6 Is Here Today: Self hosting approach. pugscode.org.
- ^ Tang, Audrey (2005). pugscode: Pugs.
- ^ audreyt. Pugs 6.2.12 and v6.pm released!. use Perl.
- ^ Lee, Xah (2005-09-25). Perl and Python Documentations.
- ^ a b Wall, Larry (2004-08-10). Synopsis 2: Bits and Pieces.
- ^ a b Wall, Larry (2003-03-21). Synopsis 6: Subroutines.
- ^ Wall, Larry (2006-08-18). Synopsis 12: Objects.
- ^ The Software Composition Group (2003). Traits.
- ^ chromatic (2006). Roles: Composable Units of Object Behavior.
- ^ Parlante, Nick (2000). Essential Perl: String Processing with Regular Expressions.
- ^ Wall, Larry (2002-06-24). Synopsis 5: Regexes and Rules.
- ^ Wall, Larry (2002-06-04). Apocalypse 5: Pattern Matching.
- ^ Wall, Larry (2004-08-19). Synopsis 4: Blocks and Statements.
- ^ a b Wall, Larry (2004-09-13). Synopsis 9: Data Structures.
- ^ Conway, Damian (2002). Quantum::Superpositions - QM-like superpositions in Perl.
- ^ Adams, Rod (2005-02-15). Junction Values.
- ^ Lamkins, David B. (2004-12-08). Successful Lisp: How to Understand and Use Common Lisp. bookfix.com.
[edit] External links
[edit] Specification
- The original community RFCs
- Apocalypses - The first-pass drafts from Larry Wall
- Exegeses - Further explanation of each Apocalypse with code samples
- Synopses - The evolving summary of each Apocalypse, updated regularly. These are considered to be the official language specification.
- Official Perl 6 Documentation
[edit] Implementations
- The main site for Perl 6 development
- Pugs - A current implementation of Perl 6, written in Haskell.
- v6.pm - A pure-perl implementation, based on a Perl 6's CPAN stack.
- parrotcode - The main site for Parrot development.
[edit] Other
- Planet Perl 6 - Perl 6's blogs aggregrator.
- Present Continous, Future Perfect - Larry Wall's talk about Perl 5 and Perl 6 at Open Source Developer Conference 2006 in Netanya, Israel (HTML, with links to slides and audio recording)
- Perl 6 FAQ - Answers a wide range of questions about Perl 6.
- Perl 6 Quick-Start - A step-by-step guide to writing and running your first Perl 6 program.