PL/I

PL/I
Paradigm(s) procedural, imperative, structured
Appeared in 1964
Designed by IBM and the SHARE Language Development Committee
Developer IBM
Dialects PL/M, XPL, PL/P, PL/C, PL/S, PL/AS, PL/X, PL/8, EPL
Influenced by COBOL, Fortran, ALGOL
Influenced SP/k, B, REXX
PL/I at Wikibooks

PL/I ("Programming Language One", pronounced "pee-el-one") is a procedural, imperative computer programming language designed for scientific, engineering, business and systems programming applications. It has been used by various academic, commercial and industrial organizations since it was introduced in the 1960s, and is actively used as of 2011.[1][2][3]

PL/I's principal domains are data processing, numerical computation, and scientific computing; it supports recursion, structured programming, linked data structure handling, fixed-point, floating-point, complex, character[4] string handling, and bit string handling. The language syntax is English-like and suited for describing complex data formats, with a wide set of functions available to verify and manipulate them.

Contents

Early history

In the 1950s and early 1960s business and scientific users programmed for different computer hardware using different programming languages. Business users were moving from Autocoders via COMTRAN to COBOL, while scientific users programmed in General Interpretive Programme (GIP), FORTRAN, Algol, GEORGE, and others. The IBM System 360[5] (announced in 1964 but not delivered until 1966) was designed as a common machine architecture for both groups of users, superseding all existing IBM architectures. Similarly, IBM wanted a single programming language for all users. It hoped that FORTRAN could be extended to include the features needed by commercial programmers. In October 1963 a committee was formed[6] composed originally of 3 IBMers from New York and 3 members of SHARE, the IBM scientific users group, to propose these extensions to FORTRAN. Given the constraints of Fortran, they were unable to do this and embarked on the design of a “new programming language” based loosely on Algol labeled “NPL". This acronym conflicted with that of the UK’s National Physical Laboratory and was replaced briefly by MPPL[7] and, in 1965, with PL/I (with a Roman “I” like Fortran’s “IV”). The first definition appeared in April 1964.[8][9]

IBM took NPL as a starting point and completed the design to a level that the first compiler could be written: the NPL definition was incomplete in scope and in detail.[10] Control of the PL/I language[11] was vested initially in the New York Programming Center and later at the IBM UK Laboratory at Hursley. The SHARE and GUIDE user groups were involved in extending the language and had a role in IBM’s process for controlling the language through their PL/I Projects. The experience of defining such a large language showed the need for a formal definition of PL/I. A project was set up in 1967 in IBM Vienna to make an unambiguous and complete specification.[12] This led in turn to one of the first large scale Formal Methods for development, VDM.

The language was first specified in detail in the manual “PL/I Language Specifications. C28-6571” written in New York from 1965 and superseded by “PL/I Language Specifications. GY33-6003” written in Hursley from 1967. IBM continued to develop PL/I in the late sixties and early seventies, publishing it in the GY33-6003 manual. These manuals were used by the Multics group and other early implementers.

The first compiler was delivered in 1966. The Standard for PL/I was approved in 1976.

Goals and principles

The Share 3by3 committee set these goals for “NPL”:

These goals evolved during the early development of the language. Competitiveness with COBOL’s record handling and report writing capabilities was needed. The “scope of usefulness” of the language grew to include system programming and event-driven programming. The additional goals[5] for PL/I were:

To meet these goals PL/I borrowed ideas from contemporary languages while adding substantial new capabilities and casting it with a distinctive concise and readable syntax. A number of principles and capabilities combined to give the language its character and were key in meeting the goals:

These principles inevitably resulted in a large language which would need compilers substantially more complex than those for COBOL or Fortran. This was not seen as a drawback since though the few — the compiler writers — would have more work, the many — the programmers — would have less.

Language summary

The language is designed to be all things to all programmers.[14] The summary is extracted from ANSI PL/I Standard.[15]

A PL/I program consists of a set of procedures, each of which is written as a sequence of statements. The %INCLUDE construct is used to include text from other sources during program translation. All of the statement types are summarized here in groupings which give an overview of the language (the Standard uses this organization).

Category Statement
Structural PROCEDURE (or PROC)
ENTRY
BEGIN
DO
END
Declarative DECLARE (or DCL)
DEFAULT (or DFT)
FORMAT
Flow of control CALL
IF
GO TO
RETURN
STOP
Null statement
Interrupt handling ON
REVERT
SIGNAL
Storage ALLOCATE (or ALLOC)
FREE
Assignment statement
Input/Output OPEN
CLOSE
Stream input/output GET
PUT
Record input/output READ
WRITE
REWRITE
LOCATE
DELETE

(Features such as multi-tasking and the PL/I compile-time preprocessor not in the Standard but supported in the PL/I F compiler and some other implementations are discussed in the Language evolution section.)

Names may be declared to represent data of the following types, either as single values, or as aggregates in the form of arrays, with a lower-bound and upper-bound per dimension, or structures (comprising nested structure, array and scalar variables):

  • Arithmetic (expanded below)
  • CHARACTER
  • PICTURE for Arithmetic data
  • PICTURE for Character data
  • AREA
  • BIT
  • ENTRY
  • FILE
  • FORMAT
  • LABEL
  • OFFSET
  • POINTER

The arithmetic type comprises these attributes:

  • a base (BINARY or DECIMAL), and
  • a scale (FIXED or FLOAT), and
  • a mode (REAL or COMPLEX), and
  • a PRECISION (number of digits, and for fixed point numbers, a scale factor)

The base, scale, precision and scale factor of the Picture-for-arithmetic type is encoded within the picture-specification. The mode is specified separately, with the picture specification applied to both the real and the imaginary parts.

Values are computed by expressions written using a specific set of operations and builtin functions, most of which may be applied to aggregates as well as to single values, together with user-defined procedures which, likewise, may operate on and return aggregate as well as single values. The assignment statement assigns values to one or more variables.

There are no reserved words in PL/I. A statement is terminated by a semi-colon. The maximum length of a statement is implementation defined. A comment may appear anywhere in a program where a space is permitted and is preceded by the characters forward slash, asterisk and is terminated by the characters asterisk, forward slash.
/*This is a comment. */
Statements may have a label-prefix introducing an entry name (ENTRY and PROCEDURE statements) or label name, and a condition prefix enabling or disabling a computational condition - e.g. (NOSIZE)). Entry and label names may be single identifiers or identifiers followed by a subscript list of constants (as in L(12,2):A=0;).

A sequence of statements becomes a group when preceded by a DO statement and followed by an END statement. Groups may include nested groups and begin blocks. The IF statement specifies a group or a single statement as the THEN part and the ELSE part (see the sample program). The group is the unit of iteration. The begin block (BEGIN; stmt-list END;) may contain declarations for names and internal procedures local to the block. A procedure starts with a PROCEDURE statement and is terminated syntactically by an END statement. The body of a procedure is a sequence of blocks, groups, and statements and contains declarations for names and procedures local() to the procedure or EXTERNAL to the procedure.

An on-unit is a single statement or block of statements written to be executed when one or more of these conditions occur:

a computational condition,

  • CONVERSION (CONV)
  • FIXEDOVERFLOW (FOFL)
  • OVERFLOW (OFL)
  • SIZE
  • STRINGRANGE (STRG)
  • STRINGSIZE (STRZ)
  • SUBSCRIPTRANGE (SUBRG)
  • UNDERFLOW (UFL)
  • ZERODIVIDE (ZDIV)

or an Input/Output condition,

  • ENDFILE(file)
  • ENDPAGE(file)
  • KEY(file)
  • NAME(file)
  • RECORD(file)
  • TRANSMIT(file)
  • UNDEFINEDFILE(file) (UNDF)

or one of the conditions:

A declaration of an identifier may contain one or more of the following attributes (but they need to be mutually consistent):

Data Attributes Input/Output Attributes Other Attributes
ALIGNED DIRECT AUTOMATIC or AUTO
AREA[( area-size)] ENVIRONMENT(options) or ENV... BASED[(reference)
BINARY [(precision)] or BIN... INPUT BUILTIN
BIT [(maximum-length)] KEYED CONDITION or COND
CHARACTER[(maximum-length)] or CHAR... OUTPUT CONSTANT
COMPLEX [(precision)] or CPLX... PRINT CONTROLLED or CTL
DECIMAL [(precision)] or DEC... SEQUENTIAL or SEQL DEFINED[( reference)] or DEF...
(dimension-attribute) STREAM EXTERNAL or EXT
ENTRY[(parameter descriptor list] UPDATE GENERIC(criteria list)
FILE RECORD INITIAL(value-list) or INIT...
FIXED [(precision)] INTERNAL or INT
FLOAT [(number of digits)] LIKE unsubscripted reference
FORMAT LOCAL
LABEL OPTIONS(options)
MEMBER PARAMETER or PARM
NONVARYING or NONVAR POSITION [(expression)] or POS...
OFFSET[(reference)] STATIC
PICTURE picture-specification or PIC... VARIABLE
POINTER or PTR
STRUCTURE
UNALIGNED or UNAL
VARYING or VAR

Current compilers from Kednos, Micro Focus, and particularly that from IBM implement many extensions over the standardized version of the language. The IBM extensions are summarised in the Implementation sub-section for the compiler later. Although there are some extensions common to these compilers the lack of a current standard means that compatibility is not guaranteed.

Standardization

Language standardization began in April 1966 in Europe with ECMA TC10. In 1969 ANSI established a "Composite Language Development Committee", nicknamed "Kludge", which fortunately was renamed X3J1 PL/I.[16] Standardization became a joint effort of ECMA TC/10 and ANSI X3J1. A subset of the GY33-6003[17] document was offered to the joint effort by IBM and became the base document for standardization. The major features omitted from the base document were multitasking and the attributes for program optimization (e.g. NORMAL and ABNORMAL).

Proposals to change the base document were voted upon by both committees. In the event that the committees disagreed, the chairs, initially Michael Marcotty of General Motors and C.A.R. Hoare representing ICL had to resolve the disagreement. In addition to IBM - Honeywell, CDC, Data General, Digital Equipment, Prime Computer, Burroughs, RCA, and Univac served on X3J1 along with major users, Eastman Kodak, MITRE, Union Carbide, Bell Laboratories, and various government and university representatives. Further development of the language occurred in the standards bodies, with continuing improvements in structured programming and internal consistency, and with the omission of the more obscure or contentious features.

As language development neared an end, X3J1/TC10 realized that there were a number of problems with a document written in English text. Discussion of a single item might appear in multiple places which might or might not agree. It was difficult to determine if there were omissions as well as inconsistencies. Consequently, David Beech (IBM), Robert Freiburghouse (Honeywell), Milton Barber (CDC), M. Donald MacLaren (Argonne Nat'l Laboratory), Barry Folsom (Data General), Lois Frampton (Digital Equipment), and editor, D.J. Andrews of IBM undertook to rewrite the entire document, each producing one or more complete chapters. The standard is couched as a formal definition[15] using a "PL/I Machine"[18] to specify the semantics. It was the first, and possibly the only, programming language standard to be written as a semi-formal definition.

A "PL/I General-Purpose Subset" standard was issued by ANSI in 1981[19] and a revision published in 1987.[20] The General Purpose subset was widely adopted as the kernel for PL/I implementations.

Implementations

IBM PL/I F and D compilers

PL/I was first implemented by IBM, at its Hursley Laboratories in the United Kingdom, as part of the development of System/360. The first production PL/I compiler was the PL/I F Compiler for the OS/360 Operating System, built by John Nash's team at Hursley in the UK: the runtime library team was managed by I.M.(Nobby)Clarke. Release 1 shipped in 1966. OS/360 was a real-memory environment and the compiler was designed for systems with as little as 64kBytes of real storage – F being 64k in S/360 parlance. To fit a large compiler into the 44kByte memory allowed on a 64kByte machine, the compiler consisted of a control phase and a large number of compiler phases (approaching 100). The phases were brought into memory from disk, and released, one at a time to handle particular language features and aspects of compilation.

Aspects of the language were still being designed as PL/I F was implemented, so some were missed out until later releases. PL/I RECORD I/O was shipped with PL/I F Release 2. The list processing functions[21] - Based Variables, Pointers, Areas and Offsets and LOCATE-mode I/O - were first shipped in Release 4. In a major attempt to speed up PL/I code to compete with Fortran object code, PL/I F Release 5 did substantial program optimization of DO-loops facilitated by the REORDER option on procedures.

A version of PL/I F was released on the TSS/360 timesharing operating system for the S/360 Model 67, adapted at the IBM Mohansic Lab. The IBM La Gaude Lab in France developed “Language Conversion Programs”[22] to convert Fortran, Cobol, and Algol programs to the PL/I F level of PL/I. The PL/I D compiler, using 16 kilobytes of memory, was developed by IBM Germany for the DOS/360 low end operating system. It implemented a subset of the PL/I language requiring all strings and arrays to have fixed extents, thus simplifying the run-time environment.It was shipped within a year of PL/I F.

Multics PL/I and derivatives

Compilers were implemented by several groups in the early 1960s. The Multics project at MIT, one of the first to develop an operating system in a high level language, used Early PL/I (EPL), a subset dialect of PL/I, as their implementation language in 1964. EPL was developed at Bell Labs and MIT by Douglas McIlroy, Robert Morris, and others. The influential Multics PL/I compiler, described on the "Multicians" website,[23] was the source of compiler technology used by a number of manufacturers and software groups.

The Honeywell PL/I compiler (for Series 60) was an implementation of the full ANSI X3J1 standard.[24]

IBM PL/I Optimizing and Checkout compilers

The PL/I Optimizer and Checkout compilers produced in Hursley supported a common level of PL/I language[25] and aimed to replace the PL/I F compiler. The performance objectives set for the compilers are shown in an IBM presentation to the BCS.[26] The compilers had to produce identical results - the Checkout Compiler was used to debug programs that would then be submitted to the Optimizer. Given that the compilers had entirely different designs and were handling the full PL/I language this goal was challenging: it was achieved.

The PL/I Optimizing compiler took over from the PL/I F compiler and was IBM’s workhorse compiler from the 1970s to the 1990s. Like PL/I F, it was a multiple pass compiler with a 44kByte design point, but it was an entirely new design. Unlike the F compiler it had to perform compile time evaluation of constant expressions using the run-time library - reducing the maximum memory for a compiler phase to 28 kilobytes. A second-time around design, it succeeded in eliminating the annoyances of PL/I F such as cascading diagnostics.[27] It was written in S/360 Macro Assembler by a team, led by Tony Burbridge, most of whom had worked on PL/I F. Macros were defined to automate common compiler services and to shield the compiler writers from the task of managing real-mode storage - allowing the compiler to be moved easily to other memory models. The gamut of program optimization techniques developed for the contemporary IBM FORTRAN H compiler were deployed: the Optimizer equaled FORTRAN execution speeds in the hands of good programmers. Announced with the IBM S/370 in 1970, it shipped first for the DOS/360 operating system in Aug 1971, and shortly afterward for OS/360, and the first virtual memory IBM operating systems OS/VS1, MVS and VM/CMS (the developers were unaware that while they were shoehorning the code into 28kB sections, IBM Poughkeepsie was finally ready to ship virtual memory support in OS/360). It supported the batch programming environments and, under TSO and CMS, it could be run interactively. This compiler went through many versions covering all mainframe operating systems including the operating systems of the Japanese PCMs.

The compiler has been superseded by "IBM PL/I for OS/2, AIX, Linux, z/OS" below.

The PL/I Checkout compiler,[28][29] (colloquially "The Checker") announced in August 1970 was designed to speed and improve the debugging of PL/I programs. The team was led by Brian Marks. The three-pass design cut the time to compile a program to 25% of that taken by the F Compiler. It was run from an interactive terminal, converting PL/I programs into an internal format, “H-text”. This format was interpreted by the Checkout compiler at run-time, detecting virtually all types of errors. Pointers were represented in 16 bytes, containing the target address and a description of the referenced item, thus permitting "bad" pointer use to be diagnosed. In a conversational environment when an error was detected, control was passed to the user who could inspect any variables, introduce debugging statements and edit the source program. Over time the debugging capability of mainframe programming environments developed most of the functions offered by this compiler and it was withdrawn (in the 1990s?)

DEC PL/I

Perhaps the most commercially successful implementation outside of IBM's was Digital Equipment's 1988 release of the ANSI PL/I 1987 subset. The implementation is "a strict superset of the ANSI X3.4-1981 PL/I General Purpose Subset and provides most of the features of the new ANSI X3.74-1987 PL/I General Purpose Subset".[30] The front end was designed by Robert Freiburghouse, has a code generator implemented by Dave Cutler, who managed the design and implementation of VAX/VMS. It runs on VMS on VAX and ALPHA and on ALPHA UNIX. UniPrise Systems, Inc., was responsible for the compiler;[31] it is currently supported by Kednos Corporation.[32]

Teaching subset compilers

In the late 1960s and early 1970s, many US and Canadian Universities were establishing campus time-sharing services and needed conversational compiler/interpreters for use in teaching science, mathematics, engineering, and computer science. Dartmouth were developing BASIC, but PL/I was a popular choice, as it was concise and easy to teach. As the IBM offerings were unsuitable,[33] a number of schools built their own subsets of PL/I and their own interactive support. Examples are:

IBM PL/I for OS/2, AIX, Linux, z/OS

In a major revamp of PL/I, IBM Santa Teresa in California launched an entirely new compiler in 1992. The initial shipment was for OS/2 and included most ANSI-G features and many new PL/I features.[38] Subsequent releases covered additional platforms (MVS, VM, OS/390, AIX and Windows)[39] and continued to add functions to make PL/I fully competitive with other languages offered on the PC (particularly C and C++) in areas where it had been overtaken. The corresponding “IBM Language Environment" supports inter-operation of PL/I programs with Database and Transaction systems, and with programs written in C, C++, and COBOL, the compiler supports all the data types needed for intercommunication with these languages.

The PL/I design principles were retained and withstood this major extension comprising several new data types, new statements and statement options, new exceptional conditions, and new organisations of program source. The resulting language is a compatible super-set of the PL/I Standard and of the earlier IBM compilers. Major topics added to PL/I were:

Object orientation

ORDINAL is a new computational data type. The ordinal facilities are like those in Pascal, e.g. DEFINE ORDINAL Colour (red, yellow, green, blue, violet); but in addition the name and internal values are accessible via built-in functions Built-in functions provide access to an ordinal value's predecessor and successor.

The DEFINE-statement (see below) allows additional TYPEs to be declared composed from PL/I's built-in attributes.

The HANDLE(data structure) locator data type is similar to the POINTER data type, but strongly typed to bind only to a particular data structure. The => operator is used to select a data structure using a handle.

The UNION attribute (equivalent to CELL in early PL/I specifications) permits several scalar variables, arrays, or structures to share the same storage in a unit that occupies the amount of storage needed for the largest alternative.

Competitiveness on PC and with C

These attributes were added:

New string-handling functions were added - to centre text, to edit using a picture format, and to trim blanks or selected characters from the head or tail of text, VERIFYR to VERIFY from the right. and SEARCH and TALLY functions.

Compound assignment operators a la C e.g. +=, &=, -=, ¦¦= were added. A+=1 is equivalent to A=A+1.

Additional parameter descriptors and attributes were added for omitted arguments and variable length argument lists.

Program Readability – making intentions explicit

The VALUE attribute declares an identifier as a constant (derived from a specific literal value or restricted expression).

Parameters can have the BYADDR (pass by address) or BYVALUE (pass by value) attributes.

The ASSIGNABLE and NONASSIGNABLE attributes prevent unintended assignments.

DO FOREVER; obviates the need for the contrived construct DO WHILE ( '1'B );.

The DEFINE-statement introduces user-specified names (e.g. INTEGER) for combinations of built-in attributes (e.g. FIXED BINARY(31,0)). Thus DEFINE ALIAS INTEGER FIXED BINARY(31.0) creates the TYPE name INTEGER as an alias for the set of built-in attributes FIXED BINARY(31.0). DEFINE STRUCTURE applies to structures and their members; it provides a TYPE name for a set of structure attributes and corresponding substructure member declarations for use in a structure declaration (a generalisation of the LIKE attribute).

Structured Programming Additions

A LEAVE statement to exit a loop, and an ITERATE to continue with the next iteration of a loop.

UPTHRU and DOWNTHRU options on iterative groups.

The package construct consisting of a set of procedures and declarations for use as a unit. Variables declared outside of the procedures are local to the package, and can use STATIC, BASED or CONTROLLED storage. Procedure names used in the package also are local, but can be made external by means of the EXPORTS option of the PACKAGE-statement.

Interrupt Handling

The RESIGNAL-statement executed in an ON-unit terminates execution of the ON-unit, and raises the condition again in the procedure that called the current one (thus passing control to the corresponding ON-unit for that procedure).

The INVALIDOP condition handles invalid operation codes detected by the PC processor, as well as illegal arithmetic operations such as subtraction of two infinite values.

The ANYCONDITION condition is provided to intercept conditions for which no specific ON-unit has been provided in the current procedure.

The STORAGE condition is raised when an ALLOCATE statement is unable to obtain sufficient storage.

Other Mainframe and minicomputer compilers

A number of vendors produced compilers to compete with IBM PL/I F or Optimizing compiler on mainframes and minicomputers in the 1970s. In the 1980s the target was usually the emerging ANSI-G subset.

PL/I compilers for Personal Computers and UNIX

Special purpose and system PL/I compilers

PL/I dialect compilers

Usage

PL/I implementations were developed for mainframes from the late 1960s, mini computers in the 1970s, and Personal Computers[52] in the 1980s and 1990s. Although its main use has been on mainframes, there are PL/I versions for DOS, Microsoft Windows, OS/2, AIX, OpenVMS, and Unix.

It has been widely used in business data processing[53] and for system use for authoring operating systems on certain platforms. Very complex and powerful systems have been built with PL/I:

PL/I did not fulfill its supporters' hopes that it would displace FORTRAN and COBOL and become the major player on mainframes. It remained a minority but significant player. There cannot be a definitive explanation for this, but some trends in the 1970s and 1980s militated against its success by progressively reducing the territory on which PL/I enjoyed a competitive advantage.

On mainframes there were substantial business issues at stake too:

Evolution of the PL/I language

This article uses the PL/I standard as the reference point for language features. But a number of features of significance in the early implementations were not in the Standard; and some were offered by non-IBM compilers. And the de facto language continued to grow after the standard, ultimately driven by developments on the Personal Computer.

Significant features omitted from the Standard

Multi tasking was implemented by PL/I F, the Optimizer and the newer AIX and Z/OS compilers. It comprised the data types EVENT and TASK, the TASK-option on the CALL-statement (Fork), the WAIT-statement (JOIN), the DELAY(delay-time), EVENT-options on the record I/O statements and the UNLOCK statement to unlock locked records on EXCLUSIVE files. Event data identify a particular event and indicate whether it is complete ('1'B) or incomplete ('0'B): task data items identify a particular task (or process) and indicate its priority relative to other tasks.

The first IBM Compile time preprocessor was built by the IBM Boston Advanced Programming Center located in Cambridge, Mass, and shipped with the PL/I F compiler. The %INCLUDE statement was in the Standard, but the rest of the features were not. The DEC and Kednos[32] PL/I compilers implemented much the same set of features as IBM, with some additions of their own. IBM has continued to add preprocessor features to its compilers. The preprocessor treats the written source program as a sequence of tokens, copying them to an output source file or acting on them. When a % token is encountered the following compile time statement is executed: when an identifier token is encountered and the identifier has been DECLAREd, ACTIVATEd, and assigned a compile time value, the identifier is replaced by this value. Tokens are added to the output stream if they do not require action (e.g. +), as are the values of ACTIVATEd compile time expressions. Thus a compile time variable PI could be declared, activated, and assigned using %PI='3.14159265'. Subsequent occurrences of PI would be replaced by 3.14159265.

The data type supported are FIXED DECIMAL integers and CHARACTER strings of varying length with no maximum length. The structure statements are

and the simple statements, which also may have a [label-list:]

The feature allowed programmers to use identifiers for constants - e.g. product part numbers or mathematical constants - and was superseded in the standard by named constants for computational data. Conditional compiling and iterative generation of souce code, possible with compile-time facilities, was not supported by the standard. Several manufacturers implemented these facilities.

Structured programming additions were made to PL/I during standardization but were not accepted into the standard. These features were the LEAVE-statement to exit from an iterative DO, the UNTIL-option and REPEAT-option added to DO, and a case statement of the general form: SELECT (expression) {WHEN (expression) group}... OTHERWISE group
These features were all included in DEC PL/I.[56]

PL/I F had offered some debug facilities that were not put forward for the standard but were implemented by others - notably the CHECK(variable-list) condition prefix, CHECK on-condition and the SNAP option. The IBM Optimizing and Checkout compilers added additional features appropriate to the conversational mainframe programming environment (e.g. an ATTENTION condition).

Significant features developed since the Standard

Several attempts had been made to design a structure member type that could have one of several datatypes (CELL in early IBM). With the growth of classes in programming theory, approaches to this became possible on a PL/I base - UNION, TYPE etc. have been added by several compilers.

PL/I had been conceived in a single byte character world. With support for Japanese and Chinese language becoming essential, and the developments on International Code Pages, the character string concept was expanded to accommodate wide non-ASCII/EBCDIC strings.

TIME and DATE handling were overhauled to deal with the millennium problem.

Criticisms

Implementation Issues

Programmer Issues

Special Topics in PL/I

Storage classes

PL/I provides several 'storage classes' to indicate how the lifetime of variables' storage is to be managed - AUTOMATIC, STATIC, CONTROLLED and BASED. The simplest to implement is STATIC, which indicates that memory is allocated and initialized at load-time, as is done in COBOL "working-storage" and FORTRAN IV. But this is only the default for EXTERNAL variables. PL/I's default storage class for INTERNAL variables is AUTOMATIC, similar to that of other block-structured languages influenced by ALGOL, like the "auto" storage class in the C language, and default storage allocation in Pascal and "local-storage" in IBM COBOL. Storage for AUTOMATIC variables is allocated upon entry into the BEGIN-block, procedure, or on-unit in which they are declared. The compiler and runtime system allocate memory for a stack frame to contain them and other housekeeping information. If a variable is declared with an INITIAL-attribute, code to set it to an initial value is executed at this time. Care is required to manage the use of initialization properly. Large amounts of code can be executed to initialize variables every time a scope is entered, especially if the variable is an array or structure. Storage for AUTOMATIC variables is freed at block exit: STATIC, CONTROLLED or BASED variables are used to retain variables' contents between invocations of a procedure or block. CONTROLLED storage is also managed using a stack, but the pushing and popping of allocations on the stack is managed by the programmer, using ALLOCATE and FREE statements. Storage for BASED variables is managed using ALLOCATE/FREE, but instead of a stack these allocations have independent lifetimes and are addressed through OFFSET or POINTER variables.

Storage type sharing

There are several ways of accessing allocated storage through different data declarations. Some of these are well defined and safe, some can be used safely with careful programming, and some are inherently unsafe and/or machine dependent.

Passing a variable as an argument to a parameter by reference allows the argument's allocated storage to be referenced using the parameter. The DEFINED attribute (e.g. DCL A(10,10), B(2:9,2:9) DEFINED A) allows part or all of a variable's storage to be used with a different, but consistent, declaration. These two usages are safe and machine independent.

Record I/O and list processing produce situations where the programmer needs to fit a declaration to the storage of the next record or item, before knowing what type of data structure it has. Based variables and pointers are key to such programs. The data structures must be designed appropriately, typically using fields in a data structure to encode information about its type and size. The fields can be held in the preceding structure or, with some constraints, in the current one. Where the encoding is in the preceding structure, the program needs to allocate a based variable with a declaration that matches the current item (using expressions for extents where needed). Where the type and size information are to be kept in the current structure ("self defining structures") the type-defining fields must be ahead of the type dependent items and in the same place in every version of the data structure. The REFER-option is used for self-defining extents (e.g. string lengths as in DCL 1 A BASED, 2 N BINARY, 2 B CHAR(LENGTH REFER A.N), etc - where LENGTH is used to allocate instances of the data structure. For self-defining structures, any typing and REFERed fields are placed ahead of the "real" data. If the records in a data set, or the items in a list of data structures, are organised this way they can be handled safely in a machine independent way.

PL/I implementations do not (except for the PL/I Checkout compiler) keep track of the data structure used when storage is first allocated. Any BASED declaration can be used with a pointer into the storage to access the storage - inherently unsafe and machine dependent. However this usage has become important for "pointer arithmetic" (typically adding a certain amount to a known address). This has been a contentious subject in computer science. In addition to the problem of wild references and buffer overruns, issues arise due to the alignment and length for data types used with particular machines and compilers. Many cases where pointer arithmetic might be needed involve finding a pointer to an element inside a larger data structure. The ADDR function computes such pointers, safely and machine independently.

Pointer arithmetic is accomplished by aliasing a binary variable with a pointer as in
DCL P POINTER, N FIXED BINARY(31) BASED(ADDR(P)); N=N+255;
It relies on pointers being the same length as FIXED BINARY(31) integers and aligned on the same boundaries.

With the prevalence of C and its free and easy attitude to pointer arithmetic, recent IBM PL/I compilers allow pointers to be used with the addition and subtraction operators to giving the simplest syntax (but compiler options can disallow these practices where safety and machine independence are paramount).

On-Units and exception handling

When PL/I was designed, programs only ran in batch mode, with no possible intervention from the programmer at a terminal. An exceptional condition such as division by zero would abort the program yielding only a hexadecimal core dump. PL/I exception handling, via on-units, allowed the program to stay in control in the face of hardware or operating system exceptions and to recover debugging information before closing down more gracefully. As a program became properly debugged most of the exception handling could be removed or disabled: this level of control became less important when conversational execution became commonplace.

Computational exception handling is enabled and disabled per PL/I condition by condition prefixes on statements, blocks(including on-units) and procedures. – e.g. (SIZE, NOSUBSCRIPTRANGE): A(I)=B(I)*C; . Operating system exceptions for Input/Output and storage management are always enabled.

The on-unit is a single statement or BEGIN-block introduced by an ON-statement and is established for a particular condition. When the exception for this condition occurs and the condition is enabled, an on-unit for the condition is executed. On-units are inherited down the call chain. When a block, procedure or on-unit is activated, the on-units established by the invoking activation are inherited by the new activation. They may be over-ridden by another ON-statement and can be reestablished by the REVERT-statement. The exception can be simulated using the SIGNAL-statement – e.g. to help debug the exception handlers. The dynamic inheritance principle for on-units allows a routine to handle the exceptions occurring within the subroutines it uses.

If no on-unit is in effect when a condition is raised a standard system action is taken (often this is to raise the ERROR condition). The system action can be reestablished using the SYSTEM option of the ON-statement. With some conditions it is possible to complete executing an on-unit and return to the point of interrupt (the CONVERT, AREA and FILE conditions) and resume normal execution. With other conditions (ZERODIVIDE, SIZE) the ERROR condition is raised when this is attempted. An on-unit may be terminated with a GO TO preventing a return to the point of interrupt.

An on-unit needs to be designed to deal with exceptions that occur in the on-unit itself. The ON ERROR SYSTEM; statement allows a nested error trap; if an error occurs within an on-unit, control passes to the operating system where a system dump might be produced.

The PL/I RECORD I/O statements have relatively simple syntax as they do not offer options for the many situations from end-of-file to record transmission errors that can occur when a record is read or written. Instead, these complexities are handled in the on-units for the various file conditions. The same approach was adopted for AREA sub-allocation and the AREA condition.

The existence of exception handling on-units makes the task of optimizing PL/I programs particularly difficult. Variables can be inspected or altered in ON-units, and the flow of control may be very hard to analyze. This is discussed in the section on Implementation Issues above.

Sample programs

Hello world program

Hello2: proc options(main);
     put list ('Hello, world!');
end Hello2;

Search for a string

/* Read in a line, which contains a string,
/* and then print every subsequent line that contains that string. */

find_strings: procedure options (main);
   declare pattern character (100) varying;
   declare line    character (100) varying;
   declare (line_no, end_file) fixed binary;

   end_file = 0;
   on endfile (sysin) end_file = 1;

   get edit (pattern) (L);
   line_no = 1;
   do while (end_file = 0);
      if index(line, pattern) > 0 then
         put skip list (line_no, line);
      line_no = line_no + 1;
      get edit (line) (L);
   end;

end find_strings;

See also

Notes

  1. ^ PL/I Resources. members.dodo.com.au
  2. ^ Kednos PL/I for HP OpenVMS VAX and Alpha kednos.com
  3. ^ PL/I for IBM mainframe, Linux on zSeries and Windows. ibm.com
  4. ^ Fred Brooks is credited with ensuring PL/I had the CHARACTER data type. CS.unc.edu
  5. ^ a b IBM's 360 and Early 370 Systems, Emerson W. Pugh, Lyle R Johnson, John H Palmer, MIT Press, 1991
  6. ^ The committee actually had 8 members at the time the report was released. They were:
    • Hans Berg, Lockheed Burbank
    • George Radin, IBM
    • James Cox, Union Carbide
    • Bruce Rosenblatt, Standard Oil of California, Chair.
    • Douglas McIlroy, Bell Laboratories
    • Robert Sheppard, Procter & Gamble. From the GUIDE organization
    • C.W. Medlock, IBM
    • Bernice Weitzenhoffer, IBM.
  7. ^ "MultiPurpose Programming Language"
  8. ^ Report II of the SHARE Advanced Language Development Committee, June 25, 1964
  9. ^ Highlights of a New Programming Language, G.Radin and Y.Paul Rogaway CACM V8 No 1, Jan 1965
  10. ^ a b Radin, G. The Early History and Characteristics of PL/I. August 1978. ACM SIGPLAN Notices, Vol 13, No.8
  11. ^ Control of the language was vested in a dedicated Language Control group and a Language Control Board that included representatives of the compiler groups (6 by 1973) in three countries. Daily communication was by telex. Changes and extensions were made through a Language Log that reached several thousand detailed points. A management level Language Review Board resolved issues escalated to it by the Control Board.
  12. ^ The Universal Language Document (ULD). Technical reports TR25.081, TR25.082, TR25.083, TR25.0S4, TR25.085, TR25.086 and TR25.087, IBM Corp Vienna Austria June 1968
  13. ^ Some compilers chose to reserve these identifiers, or issue warnings if they found them used as identifiers. But the subsequent introduction of a case statement shows the value of the principle.
  14. ^ Vowels, Robin (2001). Introduction to PL/I. pp. x. ISBN 0959638490. ""PL/I is the first language to provide adequate facilities for scientific computations, business data processing and systems programming in a single language" - Jean Sammet, Some Approaches to, and Illustrations of, Programming Language History" 
  15. ^ a b ANS Programming Language PL/I. X3.53-1976,
  16. ^ Reacting to IBM's involvement, a number of manufacturers had been reluctant to endorse an effort on a Standard. The initial remit of CLDG/X3J1 was to investigate whether PL/I was a suitable candidate for standardization. The joint effort processed over 3500 proposals for language or textual changes.
  17. ^ “PL/I Language Specifications. GY33-6003”
  18. ^ The PL/I machine defined in the standard is covered in The Definition Mechanism for Standard PL/I. Michael Marcotty, Frederick G Sayward. cs.yale.edu
  19. ^ ANSI X3.74-1981 (R1998) Information Systems - Programming Language - PL/I General-Purpose Subset
  20. ^ 1987 PL/I General-Purpose Subset (ANSI X3.74-1987)
  21. ^ These were designed in 1966 for an early Optimizing compiler, written in PL/I and to be bootstrapped starting with the PL/I F compiler
  22. ^ There were many delays in shipping these, so a common PL/I joke at the time was that Gen de Gaulle forbade La Gaude from shipping them until the Concorde had its first flight
  23. ^ Multicians.org
  24. ^ Honeywell, Series 60 (Level 66)/6000 PL/I Reference Manual, Order No. DE05, 1976.
  25. ^ OS PL/I Checkout and Optimizing Compilers: Language Reference Manual. GC33-0009 1970
  26. ^ A.M.Krasun. The first 35+ years of Hursley software 1958-1993 with some hardware asides. BCS, The Chartered Institute for IT, Advanced Programming Specialist Group. October 2006 bcs.org
  27. ^ For an entertaining description, see "My First PL/I Program" by Jim Horning. horningtales.blogspot.com
  28. ^ A Conversational Compiler for Full PL/I R.N.Cuff The Computer Journal.1972; 15: 99-104
  29. ^ The design of a checkout compiler, B.L.Marks, IBM Systems Journal Volume 12, Number 3, Page 315 (1973)
  30. ^ Uniprise PL/I for UNIX Reference Manual, Uniprise Systems, Inc., Irvine, California, 1995, p. xxi.
  31. ^ Uniprise PL/I for UNIX Reference Manual, Uniprise Systems, Inc., Irvine, California, 1995
  32. ^ a b Kednos.com
  33. ^ The "Fatal Disease" article - plg.uwaterloo.ca
  34. ^ Department of Computer Science, User's Guide to PL/C - The Cornell Compiler for PL/I, Cornell University, Ithica, 1977.
  35. ^ PLAGO/360 User's Manual, Polytechnic Institute of Brooklyn.
  36. ^ J. N. P. Hume and R. C. Holt, Structured Programming using PL/I and SP/k, Reston, Reston, 1975.
  37. ^ M. V. Zelkowitz, PL/I Programming with PLUM, 2nd Ed., Paladin House, Geneva (Ill.), 1978.
  38. ^ Robin A Vowels. PL/I for OS/2. ACM SIGPLAN Notices, Volume 31, No 3. March 1996.
  39. ^ PL/I for IBM mainframe, Linux on zSeries and Windows. ibm.com
  40. ^ Burroughs Corporation, B 6700/B 7700: PL/I Language Reference Manual, Reference 5001530, Detroit, 1977.
  41. ^ Sperry-Univac Computer Systems, SPERRY UNIVAC 1100 Series PL/I Programmer Reference, Reference UP-8277, 1976.
  42. ^ Data General Corporation, AOS PL/I Reference Manual, Order No. 093-000204, c. 1978.
  43. ^ Control Data Corporation, PL/I Reference Manual, 1978, Publication No. 60388100A.
  44. ^ VOS PL/I User's Guide, Stratus Computer, Inc., Marlboro, Massachusetts, 1995.
  45. ^ The Norton chronicles: Peter Norton, PC magazine May 15, 1984. This article is enthusiastic towards PL/I
  46. ^ Micro Focus Open PL/I
  47. ^ IBM PL/I for OS/2 portal.acm.org
  48. ^ IBM Series/1 PL/I: Language Reference, 2nd Ed., Boca Raton, Florida, 1978.
  49. ^ Proceedings of the 1979 SIGPLAN symposium on Compiler construction. Pages 107-116nts. ISBN 0-89791-002-8
  50. ^ Cocke, John; Markstein, V. (January 1990). "The evolution of RISC technology at IBM" (PDF). IBM Journal of Research & Development (IBM) 34 (1): 4–11. doi:10.1147/rd.341.0004. http://www.research.ibm.com/journal/rd/341/ibmrd3401C.pdf. Retrieved 2008-03-06. 
  51. ^ The compiler is described in: The 801 Minicomputer. George Radin. Nov 1983. IBM Journal of Research and Development. Vol 27, No 3.
  52. ^ The Norton Chronicles Peter Norton, PC Magazine May 15, 1984 This article is enthusiastic towards PL/I
  53. ^ The Article about Liant (now Micro Focus) Open PL/I estimated that in 1995 20% of mainframe legacy applications were in PL/I, with 60% in COBOL: there were 300,000 PL/I programmers worldwide.
  54. ^ Schultz, G.; D.Rose, C.West, J.Gray, (April 1980). "Executable description and validation of SNA". IEEE Xplorer 28 (4): 661. doi:10.1109/TCOM.1980.1094695. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1094695. 
  55. ^ In his slides on IBM Operating System/360 Fred Brooks says OS/360 should have been written in PL/I not PL/S and Assembler. The article is a great summary of the OS/360 program. capgemini-sdm.com, 2001.
  56. ^ See the Kednos Manuals, e.g kednos.com
  57. ^ The compiler could not tell whether a statement was a declaration or a multiple assignment statement until encountering the "=" of the assignment or ";" of the DECLARE - which could be several lines later. The fact that DECLARE/DCL were not reserved is the proximate cause in this example - consider the fragment DECLARE(I,J,K),L= vs DECLARE(I,J,K),L;.
  58. ^ Horningtales.blogspot.com

References

Textbooks

Standards

Reference Manuals

External links