COBOL

COBOL
Paradigm procedural,
object-oriented
Appeared in 1959 (1959)
Designed by Grace Hopper, William Selden, Gertrude Tierney, Howard Bromberg, Howard Discount, Vernon Reeves, Jean E. Sammet
Stable release COBOL 2002 (2002)
Typing discipline strong, static
Major implementations OpenCOBOL, Micro Focus International
Dialects HP3000 COBOL/II, COBOL/2, IBM OS/VS COBOL, IBM COBOL/II, IBM COBOL SAA, IBM Enterprise COBOL, IBM COBOL/400, IBM ILE COBOL, Unix COBOL X/Open, Micro Focus COBOL, Microsoft COBOL, Ryan McFarland RM/COBOL, Ryan McFarland RM/COBOL-85, DOSVS COBOL, UNIVAC COBOL, Realia COBOL, Fujitsu COBOL, ICL COBOL, ACUCOBOL-GT, DEC COBOL-10, DEC VAX COBOL, Wang VS COBOL, Visual COBOL
Influenced by FLOW-MATIC, COMTRAN, FACT
Influenced PL/I, CobolScript, ABAP
Wikibooks logo COBOL at Wikibooks

COBOL (pronounced /ˈkoʊbɒl/) is one of the oldest programming languages. Its name is an acronym for COmmon Business-Oriented Language, defining its primary domain in business, finance, and administrative systems for companies and governments.

The COBOL 2002 standard includes support for object-oriented programming and other modern language features.[1]

Contents

History and specification

The COBOL specification was created by Grace Hopper during the second half of 1959. The scene was set on April 8, 1959 at a meeting of computer manufacturers, users, and university people at the University of Pennsylvania Computing Center. The United States Department of Defense subsequently agreed to sponsor and oversee the next activities. A meeting chaired by Charles A. Phillips was held at the Pentagon on May 28 and 29 of 1959 (exactly one year after the Zürich ALGOL 58 meeting); there it was decided to set up three committees: short, intermediate and long range (the last one was never actually formed). It was the Short Range Committee, chaired by Joseph Wegstein of the US National Bureau of Standards, that during the following months created a description of the first version of COBOL.[2] The committee was formed to recommend a short range approach to a common business language. The committee was made up of members representing six computer manufacturers and three government agencies. The six computer manufacturers were Burroughs Corporation, IBM, Minneapolis-Honeywell (Honeywell Labs), RCA, Sperry Rand, and Sylvania Electric Products. The three government agencies were the US Air Force, the David Taylor Model Basin, and the National Bureau of Standards (now National Institute of Standards and Technology). The intermediate-range committee was formed but never became operational. In the end a sub-committee of the Short Range Committee developed the specifications of the COBOL language. This sub-committee was made up of six individuals:

This subcommittee completed the specifications for COBOL in December 1959. The specifications were to a great extent inspired by the FLOW-MATIC language invented by Grace Hopper - commonly referred to as "the mother of the COBOL language" - the IBM COMTRAN language invented by Bob Bemer, and the FACT language from Honeywell.

The decision to use the name "COBOL" was made at a meeting of the committee held on 18 September 1959.

The first compilers for COBOL were subsequently implemented during the year 1960 and on 6 and 7 December essentially the same COBOL program was run on two different makes of computers, an RCA computer and a Remington-Rand Univac computer, demonstrating that compatibility could be achieved.

ANS COBOL 1968

After 1959 COBOL underwent several modifications and improvements. In an attempt to overcome the problem of incompatibility between different versions of COBOL, the American National Standards Institute (ANSI) developed a standard form of the language in 1968. This version was known as American National Standard (ANS) COBOL.

COBOL 1974

In 1974, ANSI published a revised version of (ANS) COBOL, containing a number of features that were not in the 1968 version.

COBOL 1985

In 1985, ANSI published still another revised version that had new features not in the 1974 standard, most notably structured language constructs ("scope terminators"), including END-IF, END-PERFORM, END-READ, etc.

COBOL 2002 and object-oriented COBOL

The language continues to evolve today. In the early 1990s it was decided to add object-orientation in the next full revision of COBOL. The initial estimate was to have this revision completed by 1997 and an ISO CD (Committee Draft) was available by 1997. Some implementers (including Micro Focus, Fujitsu, and IBM) introduced object-oriented syntax based on the 1997 or other drafts of the full revision. The final approved ISO Standard (adopted as an ANSI standard by INCITS) was approved and made available in 2002.

Like the C++ programming language, Java, object-oriented COBOL compilers are available even as the language moves toward standardization. Fujitsu and Micro Focus currently support object-oriented COBOL compilers targeting the .NET framework.[4]

The 2002 (4th revision) of COBOL included many other features beyond object-orientation. These included (but are not limited to):

History of COBOL standards

The specifications approved by the full Short Range Committee were approved by the Executive Committee on January 3, 1960, and sent to the government printing office, which edited and printed these specifications as Cobol 60.

The American National Standards Institute (ANSI) produced several revisions of the COBOL standard, including:

After the Amendments to the 1985 ANSI Standard (which were adopted by ISO), primary development and ownership was taken over by ISO. The following editions and TRs (Technical Reports) have been issued by ISO (and adopted as ANSI) Standards:

From 2002, the ISO standard is also available to the public coded as ISO/IEC 1989.

Work is progressing on the next full revision of the COBOL Standard. It is expected to be approved and available in the early 2010s. For information on this revision, to see the latest draft of this revision, or to see what other works is happening with the COBOL Standard, see the COBOL Standards Website.

Legacy

COBOL programs are in use globally in governmental and military agencies and in commercial enterprises, and are running on operating systems such as IBM's z/OS, the POSIX families (Unix/Linux etc.), and Microsoft's Windows as well as ICL's VME operating system and Unisys' OS 2200. In 1997, the Gartner Group reported that 80% of the world's business ran on COBOL with over 200 billion lines of code in existence and with an estimated 5 billion lines of new code annually.[5]

Near the end of the twentieth century the year 2000 problem was the focus of significant COBOL programming effort, sometimes by the same programmers who had designed the systems decades before. The particular level of effort required for COBOL code has been attributed both to the large amount of business-oriented COBOL, as COBOL is by design a business language and business applications use dates heavily, and to constructs of the COBOL language such as the PICTURE clause, which can be used to define fixed-length numeric fields, including two-digit fields for years.

Features

COBOL as defined in the original specification included a PICTURE clause for detailed field specification. It did not support local variables, recursion, dynamic memory allocation, or structured programming constructs. Support for some or all of these features has been added in later editions of the COBOL standard. COBOL has many reserved words (over 400), called keywords.

Self-modifying code

The original COBOL specification supported self-modifying code via the infamous "ALTER X TO PROCEED TO Y" statement. X and Y are paragraph labels, and any "GOTO X" statements executed after such an ALTER statement have the meaning "GOTO Y" instead. Most compilers still support it, but it should not be used in new programs.

Syntactic features

COBOL provides an update-in-place syntax, for example

      ADD YEARS TO AGE

The equivalent construct in many procedural languages would be

age := age + years

This syntax is similar to the compound assignment operator later adopted by C:

age += years

The abbreviated conditional expression

      IF SALARY > 9000 OR SUPERVISOR-SALARY OR = PREV-SALARY

is equivalent to

      IF SALARY > 9000
          OR SALARY > SUPERVISOR-SALARY
          OR SALARY = PREV-SALARY

COBOL provides "named conditions" (so-called 88-levels). These are declared as sub-items of another item (the conditional variable). The named condition can be used in an IF statement, and tests whether the conditional variable is equal to any of the values given in the named condition's VALUE clause. The SET statement can be used to make a named condition TRUE (by assigning the first of its values to the conditional variable).

COBOL allows identifiers to be up to 30 characters long. When COBOL was introduced, much shorter lengths (e.g., 6 characters for FORTRAN) were prevalent.

The concept of copybooks was introduced by COBOL; these are chunks of text which can be inserted into a program's code. This is done with the COPY statement, which also allows parts of the copybook's text to be replaced with other text (using the REPLACING ... BY ... clause).

Data types

Standard COBOL provides the following data types:

Data type Sample declaration Notes
Character PIC X(20)
PIC A(4)9(5)X(7)
Alphanumeric and alphabetic-only
Single-byte character set (SBCS)
Edited character PIC X99BAXX Formatted and inserted characters
Numeric fixed-point binary PIC S999V99
[USAGE] COMPUTATIONAL

or
BINARY
Binary 16, 32, or 64 bits (2, 4, or 8 bytes)
Signed or unsigned. Conforming compilers limit the maximum value of variables based on the picture clause and not the number of bits reserved for storage.
Numeric fixed-point packed decimal PIC S999V99
PACKED-DECIMAL
1 to 18 decimal digits (1 to 10 bytes)
Signed or unsigned
Numeric fixed-point zoned decimal PIC S999V99
[USAGE DISPLAY]
1 to 18 decimal digits (1 to 18 bytes)
Signed or unsigned
Leading or trailing sign, overpunch or separate
Numeric floating-point PIC S9V999ES99 Binary floating-point
Edited numeric PIC +Z,ZZ9.99
PIC $***,**9.99CR
Formatted characters and digits
Group (record) 01 CUST-NAME.
  05 CUST-LAST PIC X(20).
  05 CUST-FIRST PIC X(20).
Aggregated elements
Table (array) OCCURS 12 TIMES Fixed-size array, row-major order
Up to 7 dimensions
Variable-length table OCCURS 0 to 12 TIMES
DEPENDING ON CUST-COUNT
Variable-sized array, row-major order
Up to 7 dimensions
Renames (variant or union data) 66 RAW-RECORD
  RENAMES CUST-RECORD
Character data overlaying other variables
Condition name 88 IS-RETIRED-AGE
  VALUES 65 THRU 150
Boolean value
dependent upon another variable
Array index [USAGE] INDEX Array subscript

Most vendors provide additional types, such as:

Data type Sample declaration Notes
Numeric floating-point
single precision
PIC S9V999ES99
[USAGE] COMPUTATIONAL-1
Binary floating-point
(IBM extension)
Numeric floating-point
double precision
PIC S9V999ES99
[USAGE] COMPUTATIONAL-2
Binary floating-point
(IBM extension)
Numeric fixed-point packed decimal
PIC S9V999
[USAGE] COMPUTATIONAL-3
same as PACKED DECIMAL
(IBM extension)
Numeric fixed-point binary PIC S999V99
[USAGE] COMPUTATIONAL-4
same as COMPUTATIONAL or BINARY
(IBM extension)
Numeric fixed-point binary
(native binary)
PIC S999V99
[USAGE] COMPUTATIONAL-5
Binary 16, 32, or 64 bits (2, 4, or 8 bytes)
Signed or unsigned. The maximum value of variables based on the number of bits reserved for storage and not on the picture clause.
(IBM extension)
Numeric fixed-point binary
in native byte order
PIC S999V99
[USAGE] COMPUTATIONAL-4
Binary 16, 32, or 64 bits (2, 4, or 8 bytes)
Signed or unsigned
Numeric fixed-point binary
in big-endian byte order
PIC S999V99
[USAGE] COMPUTATIONAL-5
Binary 16, 32, or 64 bits (2, 4, or 8 bytes)
Signed or unsigned
Wide character PIC G(20) Alphanumeric
Double-byte character set (DBCS)
Edited wide character PIC G99BGGG Formatted and inserted wide characters
Edited floating-point PIC +9.9(6)E+99 Formatted characters and decimal digits
Data pointer [USAGE] POINTER Data memory address
Code pointer [USAGE] PROCEDURE-POINTER Code memory address
Bit field PIC 1(n) [USAGE] COMPUTATIONAL-5 n can be from 1 to 64, defining an n-bit integer
Signed or unsigned
Index [USAGE] INDEX Binary value corresponding to an occurrence of a table element
May be linked to a specific table using INDEXED BY

Hello, world

An example of the "Hello, world" program in COBOL:

      IDENTIFICATION DIVISION.
      PROGRAM-ID. HELLO-WORLD.
      PROCEDURE DIVISION.
          DISPLAY 'Hello, world'.
          STOP RUN.

Criticism

In his letter to an editor in 1975 titled "How do we tell truths that might hurt?", which was critical of several programming languages contemporaneous with COBOL, computer scientist and Turing Award recipient Edsger Dijkstra remarked that "The use of COBOL cripples the mind; its teaching should, therefore, be regarded as a criminal offense."[6]

In his dissenting response to Dijkstra's article and the above "offensive statement", computer scientist Howard E. Tompkins defended structured COBOL: "COBOL programs with convoluted control flow indeed tend to 'cripple the mind'", but this was because "there are too many such business application programs written by programmers that have never had the benefit of structured COBOL taught well...".[7]

COBOL 85 was not fully compatible with earlier versions, resulting in the "cesarean birth of COBOL 85". Joseph T. Brophy, CIO, Travelers Insurance, spearheaded an effort to inform users of COBOL of the heavy reprogramming costs of implementing the new standard. As a result the ANSI COBOL Committee received more than 3,200 letters from the public, mostly negative, requiring the committee to make changes. On the other hand, conversion to COBOL 85 was thought to increase productivity in future years, thus justifying the conversion costs.[8]

Defenses

COBOL syntax has often been criticized for its verbosity. However, this is considered a strength by proponents, because "It has made COBOL the most readable, understandable and self-documenting programming language in use today"... "Not only does this readability generally assist the maintenance process but the older a program gets the more valuable this readability becomes."

Traditional COBOL is a simple language with a limited scope of function, encouraging a straightforward coding style. This has made it well-suited to its primary domain of business computing.

Because the standard does not belong to any particular vendor, programs written in COBOL are highly portable. The language can be used on a wide variety of hardware platforms and operating systems. The rigid hierarchical structure restricts the definition of external references to the Environment Division, which simplifies platform changes.[9]

See also

References

  1. Oliveira, Rui (2006). The Power of Cobol. City: BookSurge Publishing. ISBN 0620346523. 
  2. Garfunkel, Jerome (1987). The Cobol 85 Example Book. New York: Wiley. ISBN 0471804614. 
  3. Wexelblat, Richard (1981). History of Programming Languages. Boston: Academic Press. ISBN 0127450408. 
  4. NetCOBOL for .NET supports COBOL migration and software development in the .NET environment
  5. "What Professionals think of the Future of COBOL?". Micro Focus. http://www.cobolportal.com/developer/future.asp?bhcp=1. Retrieved 2010-05-05. 
  6. Dijkstra (2006). "E. W. Dijkstra Archive: How do we tell truths that might hurt? (EWD498)". University of Texas at Austin. http://www.cs.utexas.edu/users/EWD/transcriptions/EWD04xx/EWD498.html. Retrieved August 29, 2007. 
  7. "In Defense of Teaching Structured COBOL as Computer Science", Howard E. Tompkins, ACM SIGPLAN Notices, Vol. 18, Issue 4, April 1983.
  8. Garfunkel, Jerome (1987). The COBOL 85 example book. New York: Wiley. ISBN 0471804614. 
  9. COBOL Tutorial - Introduction to COBOL

Sources

External links

Standards:

Reference manuals:

Compilers and other products: