Paradigm | procedural, object-oriented |
---|---|
Appeared in | 1959 |
Designed by | Grace Hopper, William Selden, Gertrude Tierney, Howard Bromberg, Howard Discount, Vernon Reeves, Jean E. Sammet |
Stable release | COBOL 2002 (2002) |
Typing discipline | strong, static |
Major implementations | OpenCOBOL, Micro Focus International |
Dialects | HP3000 COBOL/II, COBOL/2, IBM OS/VS COBOL, IBM COBOL/II, IBM COBOL SAA, IBM Enterprise COBOL, IBM COBOL/400, IBM ILE COBOL, Unix COBOL X/Open, Micro Focus COBOL, Microsoft COBOL, Ryan McFarland RM/COBOL, Ryan McFarland RM/COBOL-85, DOSVS COBOL, UNIVAC COBOL, Realia COBOL, Fujitsu COBOL, ICL COBOL, ACUCOBOL-GT, DEC COBOL-10, DEC VAX COBOL, Wang VS COBOL, Visual COBOL |
Influenced by | FLOW-MATIC, COMTRAN, FACT |
Influenced | PL/I, CobolScript, ABAP |
COBOL at Wikibooks |
COBOL (pronounced /ˈkoʊbɒl/) is one of the oldest programming languages. Its name is an acronym for COmmon Business-Oriented Language, defining its primary domain in business, finance, and administrative systems for companies and governments.
The COBOL 2002 standard includes support for object-oriented programming and other modern language features.[1]
Contents |
The COBOL specification was created by Grace Hopper during the second half of 1959. The scene was set on April 8, 1959 at a meeting of computer manufacturers, users, and university people at the University of Pennsylvania Computing Center. The United States Department of Defense subsequently agreed to sponsor and oversee the next activities. A meeting chaired by Charles A. Phillips was held at the Pentagon on May 28 and 29 of 1959 (exactly one year after the Zürich ALGOL 58 meeting); there it was decided to set up three committees: short, intermediate and long range (the last one was never actually formed). It was the Short Range Committee, chaired by Joseph Wegstein of the US National Bureau of Standards, that during the following months created a description of the first version of COBOL.[2] The committee was formed to recommend a short range approach to a common business language. The committee was made up of members representing six computer manufacturers and three government agencies. The six computer manufacturers were Burroughs Corporation, IBM, Minneapolis-Honeywell (Honeywell Labs), RCA, Sperry Rand, and Sylvania Electric Products. The three government agencies were the US Air Force, the David Taylor Model Basin, and the National Bureau of Standards (now National Institute of Standards and Technology). The intermediate-range committee was formed but never became operational. In the end a sub-committee of the Short Range Committee developed the specifications of the COBOL language. This sub-committee was made up of six individuals:
This subcommittee completed the specifications for COBOL in December 1959. The specifications were to a great extent inspired by the FLOW-MATIC language invented by Grace Hopper - commonly referred to as "the mother of the COBOL language" - the IBM COMTRAN language invented by Bob Bemer, and the FACT language from Honeywell.
The decision to use the name "COBOL" was made at a meeting of the committee held on 18 September 1959.
The first compilers for COBOL were subsequently implemented during the year 1960 and on 6 and 7 December essentially the same COBOL program was run on two different makes of computers, an RCA computer and a Remington-Rand Univac computer, demonstrating that compatibility could be achieved.
After 1959 COBOL underwent several modifications and improvements. In an attempt to overcome the problem of incompatibility between different versions of COBOL, the American National Standards Institute (ANSI) developed a standard form of the language in 1968. This version was known as American National Standard (ANS) COBOL.
In 1974, ANSI published a revised version of (ANS) COBOL, containing a number of features that were not in the 1968 version.
In 1985, ANSI published still another revised version that had new features not in the 1974 standard, most notably structured language constructs ("scope terminators"), including END-IF
, END-PERFORM
, END-READ
, etc.
The language continues to evolve today. In the early 1990s it was decided to add object-orientation in the next full revision of COBOL. The initial estimate was to have this revision completed by 1997 and an ISO CD (Committee Draft) was available by 1997. Some implementers (including Micro Focus, Fujitsu, and IBM) introduced object-oriented syntax based on the 1997 or other drafts of the full revision. The final approved ISO Standard (adopted as an ANSI standard by INCITS) was approved and made available in 2002.
Like the C++ programming language, Java, object-oriented COBOL compilers are available even as the language moves toward standardization. Fujitsu and Micro Focus currently support object-oriented COBOL compilers targeting the .NET framework.[4]
The 2002 (4th revision) of COBOL included many other features beyond object-orientation. These included (but are not limited to):
The specifications approved by the full Short Range Committee were approved by the Executive Committee on January 3, 1960, and sent to the government printing office, which edited and printed these specifications as Cobol 60.
The American National Standards Institute (ANSI) produced several revisions of the COBOL standard, including:
After the Amendments to the 1985 ANSI Standard (which were adopted by ISO), primary development and ownership was taken over by ISO. The following editions and TRs (Technical Reports) have been issued by ISO (and adopted as ANSI) Standards:
From 2002, the ISO standard is also available to the public coded as ISO/IEC 1989.
Work is progressing on the next full revision of the COBOL Standard. It is expected to be approved and available in the early 2010s. For information on this revision, to see the latest draft of this revision, or to see what other works is happening with the COBOL Standard, see the COBOL Standards Website.
COBOL programs are in use globally in governmental and military agencies and in commercial enterprises, and are running on operating systems such as IBM's z/OS, the POSIX families (Unix/Linux etc.), and Microsoft's Windows as well as ICL's VME operating system and Unisys' OS 2200. In 1997, the Gartner Group reported that 80% of the world's business ran on COBOL with over 200 billion lines of code in existence and with an estimated 5 billion lines of new code annually.[5]
Near the end of the twentieth century the year 2000 problem was the focus of significant COBOL programming effort, sometimes by the same programmers who had designed the systems decades before. The particular level of effort required for COBOL code has been attributed both to the large amount of business-oriented COBOL, as COBOL is by design a business language and business applications use dates heavily, and to constructs of the COBOL language such as the PICTURE clause, which can be used to define fixed-length numeric fields, including two-digit fields for years.
COBOL as defined in the original specification included a PICTURE clause for detailed field specification. It did not support local variables, recursion, dynamic memory allocation, or structured programming constructs. Support for some or all of these features has been added in later editions of the COBOL standard. COBOL has many reserved words (over 400), called keywords.
The original COBOL specification supported self-modifying code via the infamous "ALTER X TO PROCEED TO Y" statement. X and Y are paragraph labels, and any "GOTO X" statements executed after such an ALTER statement have the meaning "GOTO Y" instead. Most compilers still support it, but it should not be used in new programs.
COBOL provides an update-in-place syntax, for example
ADD YEARS TO AGE
The equivalent construct in many procedural languages would be
age := age + years
This syntax is similar to the compound assignment operator later adopted by C:
age += years
The abbreviated conditional expression
IF SALARY > 9000 OR SUPERVISOR-SALARY OR = PREV-SALARY
is equivalent to
IF SALARY > 9000 OR SALARY > SUPERVISOR-SALARY OR SALARY = PREV-SALARY
COBOL provides "named conditions" (so-called 88-levels). These are declared as sub-items of another item (the conditional variable). The named condition can be used in an IF statement, and tests whether the conditional variable is equal to any of the values given in the named condition's VALUE clause. The SET statement can be used to make a named condition TRUE (by assigning the first of its values to the conditional variable).
COBOL allows identifiers to be up to 30 characters long. When COBOL was introduced, much shorter lengths (e.g., 6 characters for FORTRAN) were prevalent.
The concept of copybooks was introduced by COBOL; these are chunks of text which can be inserted into a program's code. This is done with the COPY statement, which also allows parts of the copybook's text to be replaced with other text (using the REPLACING ... BY ... clause).
Standard COBOL provides the following data types:
Data type | Sample declaration | Notes |
---|---|---|
Character | PIC X(20) |
Alphanumeric and alphabetic-only Single-byte character set (SBCS) |
Edited character | PIC X99BAXX |
Formatted and inserted characters |
Numeric fixed-point binary | PIC S999V99 or BINARY |
Binary 16, 32, or 64 bits (2, 4, or 8 bytes) Signed or unsigned. Conforming compilers limit the maximum value of variables based on the picture clause and not the number of bits reserved for storage. |
Numeric fixed-point packed decimal | PIC S999V99 |
1 to 18 decimal digits (1 to 10 bytes) Signed or unsigned |
Numeric fixed-point zoned decimal | PIC S999V99 |
1 to 18 decimal digits (1 to 18 bytes) Signed or unsigned Leading or trailing sign, overpunch or separate |
Numeric floating-point | PIC S9V999ES99 |
Binary floating-point |
Edited numeric | PIC +Z,ZZ9.99 |
Formatted characters and digits |
Group (record) | 01 CUST-NAME. |
Aggregated elements |
Table (array) | OCCURS 12 TIMES |
Fixed-size array, row-major order Up to 7 dimensions |
Variable-length table | OCCURS 0 to 12 TIMES |
Variable-sized array, row-major order Up to 7 dimensions |
Renames (variant or union data) | 66 RAW-RECORD |
Character data overlaying other variables |
Condition name | 88 IS-RETIRED-AGE |
Boolean value dependent upon another variable |
Array index | [USAGE] INDEX |
Array subscript |
Most vendors provide additional types, such as:
Data type | Sample declaration | Notes |
---|---|---|
Numeric floating-point single precision |
PIC S9V999ES99 [USAGE] COMPUTATIONAL-1 |
Binary floating-point (IBM extension) |
Numeric floating-point double precision |
PIC S9V999ES99 [USAGE] COMPUTATIONAL-2 |
Binary floating-point (IBM extension) |
Numeric fixed-point packed decimal |
PIC S9V999 [USAGE] COMPUTATIONAL-3 |
same as PACKED DECIMAL (IBM extension) |
Numeric fixed-point binary | PIC S999V99 |
same as COMPUTATIONAL or BINARY (IBM extension) |
Numeric fixed-point binary (native binary) |
PIC S999V99 |
Binary 16, 32, or 64 bits (2, 4, or 8 bytes) Signed or unsigned. The maximum value of variables based on the number of bits reserved for storage and not on the picture clause. (IBM extension) |
Numeric fixed-point binary in native byte order |
PIC S999V99 |
Binary 16, 32, or 64 bits (2, 4, or 8 bytes) Signed or unsigned |
Numeric fixed-point binary in big-endian byte order |
PIC S999V99 |
Binary 16, 32, or 64 bits (2, 4, or 8 bytes) Signed or unsigned |
Wide character | PIC G(20) |
Alphanumeric Double-byte character set (DBCS) |
Edited wide character | PIC G99BGGG |
Formatted and inserted wide characters |
Edited floating-point | PIC +9.9(6)E+99 |
Formatted characters and decimal digits |
Data pointer | [USAGE] POINTER |
Data memory address |
Code pointer | [USAGE] PROCEDURE-POINTER |
Code memory address |
Bit field | PIC 1(n) [USAGE] COMPUTATIONAL-5 |
n can be from 1 to 64, defining an n-bit integer Signed or unsigned |
Index | [USAGE] INDEX |
Binary value corresponding to an occurrence of a table element May be linked to a specific table using INDEXED BY |
An example of the "Hello, world" program in COBOL:
IDENTIFICATION DIVISION. PROGRAM-ID. HELLO-WORLD. PROCEDURE DIVISION. DISPLAY 'Hello, world'. STOP RUN.
In his letter to an editor in 1975 titled "How do we tell truths that might hurt?", which was critical of several programming languages contemporaneous with COBOL, computer scientist and Turing Award recipient Edsger Dijkstra remarked that "The use of COBOL cripples the mind; its teaching should, therefore, be regarded as a criminal offense."[6]
In his dissenting response to Dijkstra's article and the above "offensive statement", computer scientist Howard E. Tompkins defended structured COBOL: "COBOL programs with convoluted control flow indeed tend to 'cripple the mind'", but this was because "there are too many such business application programs written by programmers that have never had the benefit of structured COBOL taught well...".[7]
COBOL 85 was not fully compatible with earlier versions, resulting in the "cesarean birth of COBOL 85". Joseph T. Brophy, CIO, Travelers Insurance, spearheaded an effort to inform users of COBOL of the heavy reprogramming costs of implementing the new standard. As a result the ANSI COBOL Committee received more than 3,200 letters from the public, mostly negative, requiring the committee to make changes. On the other hand, conversion to COBOL 85 was thought to increase productivity in future years, thus justifying the conversion costs.[8]
COBOL syntax has often been criticized for its verbosity. However, this is considered a strength by proponents, because "It has made COBOL the most readable, understandable and self-documenting programming language in use today"... "Not only does this readability generally assist the maintenance process but the older a program gets the more valuable this readability becomes."
Traditional COBOL is a simple language with a limited scope of function, encouraging a straightforward coding style. This has made it well-suited to its primary domain of business computing.
Because the standard does not belong to any particular vendor, programs written in COBOL are highly portable. The language can be used on a wide variety of hardware platforms and operating systems. The rigid hierarchical structure restricts the definition of external references to the Environment Division, which simplifies platform changes.[9]
Standards:
Reference manuals:
Compilers and other products:
|
|