Talk:ALGOL 68

From Wikipedia, the free encyclopedia

Contents

[edit] Discussion of Algol 68 wikipedia content/style

I propose to shorten the article considerably. In the present form it has a too defensive character. For my feeling, the references to Perl, Scheme, ML are not really helpful. Some criticisms don't need to be mentioned (operator priority discussion). --Glasreiniger 13:24, 26 Feb 2005 (UTC)

I'm not sure I agree the article needs to be shorter. In many ways it should be longer. But it could certainly be improved. It has that common problem that it has been added to in pieces by many people with new information, and has lost any unity and structure it might have had.
My thoughts are
  • Rename "Overview" (which it isn't) as "Notable language features". Make sure everything there is notable, and make sure it makes sense to someone who never saw the language before.
  • Rework "some vanitas" into "Notable language features" or lose it.

Hmm. Iff the article is shortened, this is a good candidate, too. Note the "iff". In the present state of the article it may expalin some of the shortcomings.

  • Criticisms should not be based on new insights; new insights don't belong.

Why not?

  • Comparisons with other languages may have a place, but are they original research? In any case the comparison with C++ is not factually accurate (e.g. C++ has formatted transput, surely, with printf/scanf) and may not add much to the article.

printf is not part of the language, contrasting to A68.

  • --- This does not change anything. The language together with its standard library is something the user has on start. So this whole set should be considered, if you want to compare. You would also say that Tcl does not provide Object-Oriented features, not even mentioning that it has several libraries that provide it. Please, try not highlighting in comparisons things that are "not supported in the core language" because for a particular language user this does not mean anything; such a user would like to know if he can just do some thing requiring this feature or not. Not only has C++ printf-s derived from C's standard library, but there is also provided Boost::format with richer facilities. Should you have it in a language, you can say that the feature is available, no matter how. I would be rather interrested with facilities, which MUST be done in the core language and, for example, C++ does not have it and it is hard to emulate it.

Ethouris

  • Variants needs to mention the important ALGOL 68C and ALGOL 68R. The first part of that section belongs under notable language features or nowhere. Proceduring is notable, I think, but must be defined.

Proceduring is *the* part of the language where the non-revised language failed, and it was noted. Proceduring, as defined in the in the non-revised language didn't work correctly. But removing proceduring was a really bad cure for that. It is easy to see, that a usable description could have been found, by the simple fact that adding some parentheses and "PROC BOOL:" in the source make the magic work (correctly!).

  • I'd like to see more on history and especially on discussions of the reason for its failure. http://www.cs.ru.nl/~kees/home/papers/psi96.pdf is a very interesting reference. "[The revised report] resulted in an exemplarily clear, precise and consistent description of anelegant and orthogonal language that was at its appearance already classical but dead.

Agreed.

  • An original insight that could be included if someone else published it: why would the commercial world implement a language from academia in the form of a report, which also included a minority report definining a different language? Suicide!

Notinasnaid 12:10, 28 Feb 2005 (UTC)

Let me comment on the last item: It wasn't suicide. --Glasreiniger 21:24, 28 Feb 2005 (UTC)

Let us try to make a good article, at least. If nobody complains, I am going to start making the article more concise, trying to use your hints.

New insights? At one time I found this rule as an adjunct to "no original research", i.e. "no original insights". I can't find it now!
The issue with proceduring is that it is not defined before it is discussed. I know what it is, but I don't think anyone who didn't would be left any the wiser. It is quite an interesting and unusual idea, and worthy of discussion.
Anyway, to follow another Wikipedia rule: "Be bold!" Notinasnaid 21:54, 28 Feb 2005 (UTC)

[edit] Implementation of Algol68

Concerning Algol68 being an academic language and already dead-at-birth: at Control Data Corporation we've built an almost complete implementation (only 1 or 2 items missing), commercially published around 1978 for use on 6000 and Cyber series mainframes. The resulting code was quite efficient, notwithstanding the often called "complexity" of the language.

[edit] Operators

Is the elem operator the same as the ⌷ operator, or was/is the "¤" from EBCDIC used?

Also: I kinda recall "somewhere" that in place of := and =: one could use ← and →. Am I getting mixed up with APL?

NevilleDNZ 02:01, 24 August 2005 (UTC)

The rectangle is called "window symbol" in the RR, it is used as elem operation symbol, yes (RR10.2.3.9b). The report does not define how the symbols are encoded. A different document (Hansen, Boom: The Report on the Standard Representation for ALGOL 68) tries to clarify how the symbols are represented in ISO646 and EBCDIC. This document discourages the use of most symbols not in the intersection of these symbol sets.

(C.5 Other characters: A portable program should be written entirely in worthy characters, becasue only these characters are available in all implementations). The digraph <- is not allowed in a68 as operator symbol, as in S-plus, e.g. because x < -1 and x <- 1 would easily be confounded. ->, though, could be used. Extensive use of exotic symbols is not typical for programs written in A68. --85.22.19.171 19:26, 26 August 2005 (UTC)

Have been skimming throught the Draft Final Report you pointed out online.... It seems that there is a whole pile of operators in the Draft Final Report I never even knew about. Check Pages 109... The FR allows .. as well as :, → in place of "of". Did this "I am guessing that this online version isn't the "Final Final Report". Looks like I have a bit more reading to do. (Ironically a similar -> made it into C/C++)
But it looks as if the semantics were inverse. Personally, I prefer C++ in this respect. You call the document "draft": AFAICS my prionted version of thsi is the final published report, but before revision. I never got in acquaintance with A68 before revision. So I cannot comment mcuh on this, except that I had preferred that proceduring (not as coercion) ahd survived.

--Glasreiniger 20:02, 26 September 2005 (UTC)

The American Standards Association (ASA, later to become ANSI) first published ASCII as a standard in 1963. ASCII-1963 lacked the lowercase letters, and had an up-arrow (↑) instead of the caret (^) and a left-arrow (←) instead of the underscore (_).
"1968 President Johnson issues order adopting the (newer) American Standard Code for Information Exchange (ASCII-1967) as a federal standard".
Hence Algol68 was straddles both ASCII-1963 and ASCII-1967 standards.
Ironically the ¢ (cents) symbol got dropped from the Amercian ASCII standard, but became part of the European Algol 68 standard.
NevilleDNZ 16:56, 25 November 2005 (UTC)
I also found the A*10**B symbols were 10 AND "\". And some strange operators I havn't figured out yet..., i.e. lws, ups, ctb, btb, ::, ct, ctab etc, with related APL characters...
¢ NevilleDNZ 17:23, 26 September 2005 (UTC) ¢

[edit] Hardware character sets and representation

I had look into unicode internals for this case. It looks as if the "base ten" symbol was the only machine written symbol of computer age which didn't make it into unicode. This should be changed. But, personally, I don't like unicode enough to feel engaged. It seems that, if anybody wants to make this into unicode, has to be prepared very well. Of course, the representation using two subscripts is sufficient to publish documents containing. But for this, Unicode subscript were superfluous as well. The origin of this symbol seems from British teletype code as variant to the backslash, and it has been used in the German TR440 computer central code ZC1 at position 141. --Glasreiniger 20:02, 26 September 2005 (UTC)

I used Algol68C on EBCDIC of IBM VM/CMS and ASCII of DEC10. There were no "exotic characters", but fortunately there were upper and lower case, otherwise Algol68 code would have looked worse the COBOL code. Clearly there was no Unicode back in 1968, so did the "exotic characters" only ever exist on paper, or were they sourced from some other pecular computer system? Also, if I recall right, a semicolon was also regarded a exotic to some character sets, hence Algol-W used a comma-dot, instead of a semicolon at the end of statements. (And hence maybe the : used in todays array slices x[1:3], possibly originated from x[1..3] which appears to be more natural)

CHA Koster tells in his memories about the making of ALGOL 68, that the writers of the Report had much fun in finding new symbols on the typewriter wheels, which were state of the art at that time.
The IBM 2741[1] Terminal (c. 1965), this must have been what Adriaan van Wijngaarden had been using before the Aug 68 meeting. (Then in 1975 for $20,000 you could get a IBM 5100 [2]). Maybe it was the IBM Code page was being used in the IBM 2741 alternatly Code page 293[3], or Code page 371[4].
Another Algol68 superset totally missing from wikipedia is Algol68+ But I suspect that was not often used outside of The Netherlands. It would be interesting to lists its character set as Adriaan van Wijngaarden had a particular interest here.
¢ NevilleDNZ 02:07, 24 September 2005 (UTC) ¢

The ¬ clearly entered via EBCDIC. I am kind of surprised the the ¢ was so extensively used in the actual RR, it is in EBCDIC, but not ASCII. But certainly any typewritter can do it with a C^H/.

[edit] Language semantics

On an aside, in fortran a declaration of "REAL REAL REAL" is legal, I am guessing Algol68 was trying to mimic this in its declaration syntax, eg by allowing "real real real;", (which contains spaces in variables, and reuse of "reserved" words/tokens). If this true?

In case of FORTRAN IV, spaces are not significant in any position except in H formats. Position is, though. That spaces in identifiers are allowed, is heritage from Algol60.

And on another aside, why did Algol68 have the IF .. FI, DO .. OD and CASE .. ESAC. But spoil every thing by using CO .. CO, and PR .. PR, no CO .. OC and PR .. RP?

Probably an oversight. Comments do not contain interesting stuff. But TNEMMOC would look rather strange. NIGEB, too. When I program A68, I don't use IF..FI very often. I prefer the short symbols ( | | ), anyway. But for DO..OD there are no short symbols. --Glasreiniger 17:19, 27 August 2005 (UTC)
TNEMMOC & TAMGARP probably wouldn't translate into Dutch very well either. ¢ NevilleDNZ 06:32, 28 August 2005 (UTC) ¢
It is little known, that the ALGOL68 Reports have been translated into German and Russian, possibly other languages, too. The termini technici like MOID have been translated also in the unrevised report, in case of MOID the German translation was MÖSCH, which is a contraction of MODUS and LÖSCH. For the RR only the text was translated. --Glasreiniger

(BTW: Every time I do a "do .. done" loop in borne shell I cannot help but think of someone (long long ago) shooting themself in the foot with the unix "/bin/od" command)

NevilleDNZ 20:03, 26 August 2005 (UTC)

On the CDC 63- and 64-character sets, where ":" was problematic or missing, ".." was a representation substitute. This may have given rise to the use of ".." in Pascal, which was also first developed for the CDC mainframe series. Notinasnaid 22:06, 26 August 2005 (UTC)

The Control Data computers with their 6bit characters are, of course, very restricted. The standard hardware representation for A68 declares 60 characters worthy. --Glasreiniger 17:19, 27 August 2005 (UTC)

[edit] Operators vs "Syntatic elements"

I noticed some laxness in the operator list. In Algol 68 "IS","ISNT" and their shorter forms :=: :/=: are not operators but special syntactical symbols like := . I am not sure whether correcting this would go to much into technical details. For now I just move the short forms from the implementation column to the original language column, where they belong. --Glasreiniger 11:27, 23 September 2005 (UTC)

I have found an argument, why it must be corrected: The priorities are wrong. These constructs don't have PRIO 4,. Effectively their priority is less than that of any opreator. Maybe putting them in a row with PRIO 0 and a comment is a good solution. --Glasreiniger 18:34, 23 September 2005 (UTC)

If I remember right :=: and :/=: were not part of the standard. BTW I dug the is and isnt operator pririties from the Revised Report, so the prio is = 4, isnt = 4; should be OK.

Sorry to disagree. But the is symbol is not an operator, it just loooks like one. The :=: was present in in unrevised report already. For the negated equal sign, two possibilities are provided in the RR: either the mathematical unequal sign or the digraph from slash and equal sign for operator and identity relation symbol. The not symbol is only used to construct operators. --Glasreiniger 18:07, 24 September 2005 (UTC)
Not a problem. I checked the RR that you put online, and you are right. So, I have effectively moved the :~=: "syntactic" elements to priority 0. I left the plusto, but removed the other timesto operators as I could not find them in the RR. I wish I had a copy of the Algol68C manual to see if the timesto were implemented there.
I have a copy of the Algol68C manual. If this is not available on the net already, I could perhaps make scans available. But, with no compiler for this implementation, this is not too interesting. Algol68C does not define or allow to define operators with assignation. These are automatically available for all operators where these are applicable. So, Algol68C has timesto and similar. I am not sure that the authors really thought much about the displace-formulas, how they call these, after looking this up. --Glasreiniger 15:57, 25 September 2005 (UTC)
Do you know the "official" priority of the uniary operators? esp not, as typically (in other languages) its priority is lower then that of arithmetic diadic operators.
In A68, all monadic operators have prio 10. The most likely troublespot is -1^n . Of course, it would be more natural to have all boolean operators at a lower priority than arithmetic.
I guess my interest in the article is not to create a reference, but to answer some of the questions on the origins of Algol68 of the kind a learner may ponder, Why this? why that? etc. And keep the reading relatively "light". Kind of a "extra lite Algol68" version of "The Annontated C++ reference manual". So feel to interject if anything I add is too heavy.

[edit] LALR parsers

BTW: did anyone ever write a parser (like YACC) for the language of the Revised Report eg. "REF to MODE NEST destination{a} : soft REF to MODE NEST TERTIARY"?

¢ NevilleDNZ 01:55, 25 September 2005 (UTC) ¢

Algol68 cannot be parsed by LALR parsers because the natural grammar is not LR1. One of the big troublemakers is, that a comma is allowed between declarations; so a variable declaration may follow a mode declaration; so you don't know whether a mode indicant at this place is a defining or applying one before seeing the equals sign. At least you need to know the priority before use, if you want to parse operator precedence in one run. The Algol68C restriction is sufficient for this case. I have written a YACC parser for Algol68 for the full language with this restriction myself. I cannot really claim that this is a faithful parser, because some of the syntactic restrictions are not enforced. They have to be honored in the sematic routines. --Glasreiniger 14:52, 25 September 2005 (UTC)
I seem to remember (and this is from a very long time ago, when I was thinking about writing a compiler), that one of the most challening constructs to encounter starts
op == ( mode...

where you don't know yet how to parse the ==. Is this a typed operator declaration for ==, or an untyped declaration for =? This may be misremembered, but something like this annoyed me for a long time. Notinasnaid 16:04, 25 September 2005 (UTC)

This is not the actual trouble spot, because in the typed op declaration the declared symbol comes after the modes: OP (INT,INT)BOOL == SKIP . IMHO it is allowed to restrict usage of = as defines-symbol not to be adjacent to operator symbols it can combine with.
But if you have MODE FOO = STRUCT(...), BAR = ... , you need to 2 tokens look-ahead to see the equal sign, before you can decide that the mode declaration continues; otherwise BAR could start a variable or identity definition. In my lexer this leads to the restriction, that after a boldtag in defining context only one blank is allowed. The comment in my Flex lexer calls this a quirk. A hand-written lexer could avoid this.
Note that the keyword MODE is nearly useless, because a parser powerful enough to cope with this is also able to know that FOO = STRUCT ... must be the begin of a mode declaration, anyway. --Glasreiniger 16:42, 25 September 2005 (UTC)

As an aside, Charles Lindsey, one of the editors of the Revised Report, lectured to me in the Manchester Department of Computer Science on Algol 68 around 1970. He worked on one of the early compilers and I seem to remember him saying that they considered parsing the language in reverse, i.e. starting with the last token. This is clearly possible, but I'm not sure if it was ever done nor what the complexity of the reversed grammar might be.

[edit] Resources

[edit] Online reports

I am not a member of ACM, here is the revised report [5], do you know if this PDF is produced from scanning paper, or from original electronic text/input (Maybe EBCDIC)?

The ACM version is just the image from scanning. I happen to be the one who has made the (first) revised report available online. http://vestein.arb-phys.uni-dortmund.de/~wb/RR/rr.pdf e.g.. There are also TeX, .html and .dvi versions. Karl Kleine has published the MR176 paper. I am quite sure that the symbols in my version are the correct ones for the report. Charles Lindsey himself has done some proofreading of my text. --Glasreiniger 17:44, 24 September 2005 (UTC)
One nice thing you could do (from the html), is imbed a URL pointing to the original scanned paper original. That way others can help spot OCR/scanner errors and email you fixes.

¢ NevilleDNZ 01:55, 25 September 2005 (UTC) ¢

I shall follow this hint. But the accessibility of an ACM document is seriously difficult. Though, there is an authentic machine readable RR available now, from Dick Grune in [[6]]. The unrevised report is at [[7]]--Glasreiniger 17:06, 25 September 2005 (UTC)
I mostly converted the RR to html (with a python script) today. ¢ NevilleDNZ 17:23, 26 September 2005 (UTC) ¢

If one looks, there are getting to be a lot of Algol68 resources on the internet.

¢ NevilleDNZ 02:07, 24 September 2005 (UTC) ¢

[edit] Algol68 Final Report special characters

i.e. The ∨, ∧, ¬(?), ≠, ≤, ≥, ×, ÷, ⌷, ↑, ↓, ⌊, ⌈ and ⊥ characters. APL2 keyboard stickers are available from IBM part number SC33-0604. They cost about $13 for a set of two.

¢ NevilleDNZ 16:58, 24 September 2005 (UTC) ¢

Just curious: The number of atoms in 0.012 kilogram of carbon 12 is known as Avogadro's number. It is approximately 6.0221415×10↑23 (2002 CODATA value). Does the RR still allow me to define this constant as follows:

  • real n = 6.0221415₁₀23;

⑽, ⒑, ⑩ do exist in unicode... something to do with the British wanting to support 12 pennys in a shilling. But the subscript 10 needs two characters.

Maybe we need to get the Working Group 2.1 on ALGOL68 to revise the RR to include/loby Unicode for 2005?? :-)

¢ NevilleDNZ 15:44, 25 September 2005 (UTC) ¢

IMHO there are enough reasons to do a formal proposal to unicode.org to include this symbol. It has been included in a German norm (DIN66006 IIRC), it has been discussed in ACM: comm.ACM 7 No 7(424,447), there called "subscript 10". Personally, I don't like Unicode enough to be engaged in this. A nice web page about codes includes it: ALCOR. But a better place to show this stuff should sougth, because it distracts from the programming language. And it is heritage from ALGOL58. --Glasreiniger 08:33, 28 September 2005 (UTC)
I kinda agree with you... for example, even without significant support for the official APL character set, the APL developers continued programming in APL using "other characters". After all character sets don't make a language. Beside if we want to add to wiki all the details of the language (including the 10), then we might as well replace the wiki-page with the Revised Report!! (I am allowed to be harsh as it was my suggestion)
The now realise that the 10 characterm appears in several standards. ALCOR of [ACM] (became DIN 66006 in Germany), GOST 10859 7 bit Cyrillic & Latin.
In GOST 10859/Cyrillic there appears to be various characters subsets giving greater text compression:
BTW: Why was the word "worthy" used... was it a pun?
NevilleDNZ 04:55, 21 October 2005 (UTC) ⍧
Ironically the APL keyboard that appears to be the defacto keyboard for Algol68/FR, can still be obtained from IBM even today.

I can see another mine-field just with Working Group 2.1 adopting a unicode version of Algol68. After all which asterix is the official Algol68 asterix, there are dozens in Unicode. Not to mention many ways of writting the number 10.

BTW: ThanX for telling me where the 10 came from. Needless to say I used Algol68 in ascii, and didn't even realise it existed until I read the whole RR.

NevilleDNZ 17:37, 28 September 2005 (UTC) ⍧

If you look VERY carefully into this part of the report, you may be be able to notice, that the label symbol is a slanted colon, in contrast to the routine symbol and the colon used in slicing. The alternative EXIT symbol should be a fat point. -- Glasreiniger 07:15, 29 September 2005 (UTC)
I did notice in the machine readable document that the label colon had been given a special name "lbl". I wondered if this was just a "throw back" the document formating softawre used. (BTW: does this document foram have a name?).
I also noted that there is something called a "thin". eg in RR at section 3.3 between [ & ] is a "thin":
thinspace, \, in TeX.
[ ] INT q = (1, 4, 9, 16, 25); STRUCT (INT price, STRING category) bike := (150, "sport"). 
My gut feeling was so that [] didn't look too much like a window character ⎕.

Here us a rough match of the RR name, and the unicode equiv that I found. I gave priority to the APL keyboard.

RR Name Unicode Name UC Char A68 base A68 vogon
"ttp" "subscript one"+"subscript zero" "₁₀" "\" E
"sge" "greater-than or equal to" "≥" ">=" GE
"sle" "less-than or equal to" "≤" "<=" LE
"ne" "not equal to" "≠" "/=" "~=" NE
"ct" "apl functional symbol left shoe stile"/"cents" "⍧"/"¢" # CO
"flr" "left floor" "⌊" LWB
"clg" "left ceiling" "⌈" UPB
"box" "apl functional symbol quad" "⎕" ELEM
"slice" ../‥  :
"not" "not sign" "¬" "~" NOT
"sq" "division sign" "÷" "/" OVER
"sx" "multiplication sign" "×" "*" TIMES
"tip" "up tack" "⊥" I
"nl" "left-pointing double angle quotation mark" "«"
"ng" "right-pointing double angle quotation mark" "»"
"nil" "ring operator" "○" NIL
"up" "upwards arrow" "↑" ** UP
"dwn" "downwards arrow" "↓" DOWN
"hy" "hyphen" "-"
"or" "logical or" "∨" OR
"and" "logical and" "∧" & AND
Unicode / Final Report (1968)
"rightwards arrow" "→" OF
"box drawings light arc up and right" "╰" LWS
"box drawings light arc down and right" "╭" UPS
∙/•/∎ EXIT
LWS and UPS are the lower/upper state operators from the Algol681/FR.
BTW: Look at ALL these unicode characters for asterick: "✢✣✤✥✦✧✩✪✫✬✭✮✯✰✱✲✳✴✵✶✷✸✹✺✻✼✽✾✿❀❁❂❃❄❅❆❇❈❉❊❋". Now you know why I priotised first the APL characters. ☺. Me thinks that asking for just 1 "subscript ten"=₁₀ from the unicode guys would be a good idea to counter balance all the astericks!
I had totally forgotten about the "nil" symbol "○" (It is not in Algol68C ascii/ebcdic).
Unicode also has several logical and/or symbols... I guess usage of other codes (eg now Unicode) was seen as site specific. Making Algol68 Unicode Ready. ☺
Can you point me to a scanned/online version of the RR. AND Was SKIP officially a "~", or a "?"?

NevilleDNZ 09:32, 29 September 2005 (UTC) ⍧

The original is very bad; and AFAIK there are no real good prints at all. The report has been published photomechanically from bad sources. Even the microfiche version is not better. The combination of Dick Grune's version and my re-edition (the .dvi or .pdf, not .html) do a much better job. IIRC, the question mark is Algol68C for skip. --Glasreiniger 11:52, 29 September 2005 (UTC)
I could not access the document, a html permission problem. I know Victoria University has the paper Algol Bulletin, maybe they have an original RR. I can search the local universities. Actually the cheif reason I want to look is to check to see what the "strange" characters look like in print. I had to laugh, Algol_60 (http://eli-project.sourceforge.net/a60_html/a60.html) has "subscript 10" also, but they gave up and simply called it "E". As you suggest, somewhere in Europe subscript 10 was important... in the 1960s.
BTW: I tried emailing you, but got no reply.
Sorry. The permission is fixed now. --Glasreiniger 11:01, 21 October 2005 (UTC)
NevilleDNZ 09:39, 5 October 2005 (UTC) ⍧

[edit] Re structuring this document

So far, I have added a tiny section on the transput, and a tiny bit to the section on operators, and some code for parallel processing. They seem to fit in OK. There is not much mention about bits and bytes and such representations. FR vs RR and Extensions etc

Already we have reached the 32k recommended article size. Is it time for a revision/pruning? What have we missed that is notable? Having said that it looks basically OK to me so far.

One book I read years ago on A68 could have its sections read "horizontally and vertically" out of a grid. This highlights that A68 was designed to be orthogonal. Do U know the Author/Title of this book? Could we structure an article like this?

NevilleDNZ 17:37, 28 September 2005 (UTC) ⍧

The book is Lindsay and van der Meulen's Informal Introduction to Algol 68. It would be interesting to structure the article that way, though I'm not sure where one would fit some of the political aspects of the language's history in that structure.

One thing that is definitely missing from the article is a discussion of "triple ref technique" which is possible only because the language allows references to non-heap objects. A Rosetta Stone-style listing of linked list manipulation in ALGOL 68, Pascal, and C might be the way to show the technique's advantages.

A discussion of the triple REF technique is needed, indeed. IMHO this hasn't do too much with heap vs. non-heap, as Algol-68 can easily be implemented without LOC at all (just make all allocations on heap). I have prepared some variations on the theme in [[8]]. The file queuea.a68 is due to Andy Walker (discussion in news:comp.lang.misc) except for the complaint in the commentary, of course. queueb.a68 changes the original file by lifting the REF up to mode LIST. queuee.a68 makes the algorithm parallel-safe, hopefully. I like queuef.a68, because it uses operators instead of functions. In queueg.a68 I modified the program to avoid triple REFs. Yes, this is easy: Just wrap one of the levels into a STRUCT. For syntactical reasons, a second field dummy in this STRUCT is needed. --Glasreiniger 13:57, 14 February 2006 (UTC)
Ack, you're absolutely right. The point is that unlike Pascal, which only allows pointers to records, Algol 68 allows references to things of any mode, including ref ref amode (i.e. variables of type ref amode, in particular those where references to heads of lists or roots of trees are stored), hence the name. My mistake.

[edit] Just a quotation ...

I came across the following, years ago. (I won't vouch for its precision, though.) I don't agree with it, but, here it is anyway:

   "If PL/1 is a fatal disease, then is not Algol 68 just _capital punishment_?"

12.216.178.59 00:06, 20 February 2006

This quote might be a pun on case-stropping where the published form "begin real pi=4*atan(1) end" would be written as case-stropped "BEGIN REAL pi=4*atan(1) END". I remember that case-stropping (changing case for "Syntatic elements", MODEs and OPerators) was a pain on keyboards that had only caps-lock (ansi) and but useful on shift-Lock keyboards (IBM). I finally discovered a keyboard that had both manufactured by Tandem-Computers.

NevilleDNZ 11:48, 10 July 2006 (UTC)

[edit] wiki2wirthy.py - convert html4 markup and unicode to Algol68 "Wirthy" 7-bit characters

#!/usr/bin/env python
## -*- coding: utf-8 -*-
# wiki2wirthy.py - convert wiki markup and unicode to A68 "Wirthy" characters
# use this to dewikify a68 source, and convert unicode operators into UPPERCASE tokens
# 20061115 v0.2: add the original operators from the final report.
# 20070613 v0.3: add the html4 character set
import sys, re
 
import codecs
enc, dec, rwrap, wwrap = codecs.lookup('utf-8')
 
def forall_argv(process_file,argv=sys.argv[1:]):
  if len(argv)==0: # then
    process_file(rwrap(sys.stdin))
  else:
    for file in argv: # do
      file=open(file,"r");
      process_file(rwrap(file))
    # od
  # fi
# end forall_argv
 
re_markup=re.compile("(<[^>]+>|[&]nbsp;)",re.IGNORECASE);
re_ul=re.compile("(^<[bu]>$)",re.IGNORECASE);
re_lu=re.compile("(^</[ub]>$)",re.IGNORECASE);
 
wirthy={ u"¢":"#",
  u"₁₀":"E", r"\\":"E",
  u"≤":"<=", u"≥":">=",
  u"≠":"/=",
  u"[⍧¢£]":"#",
  u"⌈":"UPB", u"⌊":"LWB",
  u"⎕":"ELEM",
  u"[.][.]|‥":":",
  u"¬":"NOT",
  u"÷":"/", u"×":"*",
  u"⊥":"I",
  u"[○∅]":"NIL",
  u"↑":"UP", u"↓":"DOWN",
  u"∧":"AND", u"∨":"OR",
  u"→":"OF",
  u"[☩✠᛭]":"$", # [[ALCOR]]
  u"╭":"UPS", u"╰":"LWS",
}
 
re_wirthy=dict( (re.compile(char),"_"+fix+"_") for char,fix in wirthy.iteritems() );
 
import htmlentitydefs
""" additional ALGOL characters/operators from html4 std
  cent ¢    pound £  yen ¥    euro €    curren ¤
  plusmn ±  minus −  times ×  divide ÷  oplus ⊕  otimes ⊗        
  larr ←    uarr ↑   rarr →   darr ↓    harr ↔   crarr ↵ 
  and ∧     or ∨     not ¬   
  cong ≅    asymp ≈  prop ∝   ne ≠    
  empty ∅   infin ∞ 
  perp ⊥    lceil ⌈  rceil ⌉  lfloor ⌊  rfloor ⌋        
  equiv ≡   sup ⊃    nabla ∇  loz ◊     deg °    # [[GOST 10859]] ALGOL characters
  radic √   part ∂   int ∫    prod ∏    sum ∑    # bonus operators
"""
re_html4=dict((re.compile("&"+name+";"),u"%c"%code) for name,code in htmlentitydefs.name2codepoint.iteritems() );
rev_html4=dict((re.compile(u"%c"%code),"_"+name.upper()+"_") for code,name in htmlentitydefs.codepoint2name.iteritems() if code>127);
re_depad=re.compile("_ *| _|$_")
 
stdout=wwrap(sys.stdout)
stderr=wwrap(sys.stderr)
def wiki2wirthy(file):
  markup=True;
  in_ul=False;
  for line in file.readlines():
    markup=not markup;
    for token in  re_markup.split(line):# do
      if not markup: # then
        if in_ul: token=token.upper() # fi
        for re_char,fix in re_html4.iteritems():  token=re_char.sub(fix,token);
        for re_char,fix in re_wirthy.iteritems(): token=re_char.sub(fix,token);
        for re_char,fix in rev_html4.iteritems(): token=re_char.sub(fix,token);
        token=re_depad.sub(" ",token) # a crude tidy up
        for c in token: # do
          if c>"~":  # detect unconverted unicode and dump
            stderr.write("unknown character: "+c+"\n");
            stderr.write("Error: "+`[ c for c in token]`+"\n");
            break
          # fi
        # od
        stdout.write(token);
      # fi
      # handle wiki <markup>
      elif re_ul.match(token): in_ul=True
      elif re_lu.match(token): in_ul=False
      # fi
      markup=not markup
    # od
  # od
# end wiki2wirthy
 
if __name__=="__main__": forall_argv(wiki2wirthy,argv=sys.argv[1:]) # fi

My apologies to compiler writers (and python unicode experts) for this terribly crude parser... For wot is it wirth... I declare the above code to be GPL.

NevilleDNZ 13:07, 13 June 2007 (UTC) revised for html4

[edit] ALGOL 68 definitely not the ancestor of C, C++, sh or bash

  1. C++ was originally a front end built on top of C and drawing heavily on SIMULA-67 (which obviously predates ALGOL 68).
  2. C was a successor to B, itself a trimmed-down version of BCPL, itself a trimmed-down version of CPL, which again predates ALGOL 68
  3. sh and bash have nothing at all to do with ALGOL 68 apart from being procedural langauges.
  4. C, sh and bash are weakly typed.

99. On the other hand, both SIMULA-67 and CPL draw heavily on ALGOL 60. ALGOL 60 is the ancestor. ALGOL 68 is, as far as I can tell, more of an evolutionary dead end. -Dmh 19:01, 29 January 2007 (UTC)

[edit] re: Is ALGOL 68 an ancestor of C, C++, sh or bash?

1. re: C++

  • "Stroustrup borrowed successful features from other older languages. As a result, C++ incorporates the concepts of classes and virtual functions from Simula67. C++ borrows the idea of operator overloading from Algol 68. These features are an important part of the support that C++ provides for object-oriented programming.[9]"

2. Re: C

Steve talked Dennis into adding void as a type in C. It saved the instruction loading register for return value. Also when Steve got to the lab C types were like PL/1 namely offsets from any base you liked. They changed to A68 types but be the specific reason for this change is now lost. A few other A68isms ended up in C. (Subsequently these A68isms made their way into C++).

3. Re: Bourne Shell

  • The - Bourne shell source telling an interesting story of Algol68's influence on the Bourne shell. Bourne reused portions of ALGOL 68's "if ~ then ~ elif ~ else ~ fi", "case ~ in ~ out ~ esac" and "for ~ while ~ do ~ od" clauses in the common Unix Bourne shell syntax. Moreover - although the v7 shell is written in C - Bourne took advantage of some macros[1] to give the C source code an ALGOL 68 flavor.

I would suspect (but I cannot back it up with a quote) that the parallelism in the Bourne shell was a direct result of the PAR clause in Algol68. eg

  • Shell: (sleep 10 ; date ) & ( sleep 20; date) & (sleep 30; date)
  • Algol68: PAR ( (sleep(10); date ) , ( sleep(20); date) , (sleep(30); date) )

However Algol68 didn't get pipes until arriving on Unix, and the pipe behavior is not part of the standard.

Shell and Algol68 are unusual in that both are early examples of concurrency which is mostly absent from other standard languages of the time. (PL/I had/has the TASK keyword)

4. Re: C, sh and bash are weakly typed.

5. Re: ADA Influences

Extract from: http://archive.adaic.com/pol-hist/history/holwg-93/2.htm (source: portal.acm.org/ft_gateway.cfm?id=1057816&type=pdf)
  1. Without exception, the following languages were found by the evaluators to be inappropriate to serve as base languages for a development of the common language: FORTRAN, COBOL, TACPOL, CMS-2, JOVIAL J-73, JOVIAL J-3B, SIMULA 67, ALGOL 60, and CORAL 66.
  2. Proposals should be solicited from appropriate language designers for modification efforts using any of the languages, Pascal, PL/I, or ALGOL 68 as a base language from which to start. These efforts should be directed toward the production of a language that satisfied the DoD set of language requirements for embedded computer applications.
Ada - DoD HOLWG, Col Wm Whitaker, 1993

I don't know what the complete list of functionality originally from ALGOL 68 is. Maybe concurrency, operators, overloading & strong typing. These were mostly missing from ALGOL 60.

So I can say with certainty that Algol68 influenced C, to quote "there was influence, much of it so subtle that it is hard to recover even when I think hard." -Dennis Ritchie, 18 June 1988

99. re: Evolutionary Dead end.

  • IMHO: Algol68 was an experiment in both language design, and in collaborating in language design. Both of these disiplines were in their early stages of evolution .
ALGOL 68 was the first (and possibly one of the last) major language for which a full formal definition was made before it was implemented.
Certainly the prominent compiler writers for ALGOL 60 were prominent in the ALGOL Bulletin "mailing list", but strangely none of them were on the actual core ALGOL 68 team. Maybe this caused C99's ratification to be delayed 31 years? :-)

NevilleDNZ 23:36, 7 February 2007 (UTC)

[edit] a 2nd re: to dmh

IMHO you are right in A68 being a dead end. Now we know that it was a mistake to introduce pointers into a HLL. But AFAIK till the advent of Java this has not been corrected in the more successful languages.

Sh is not comparable to a programming language. But the facts, which Neville reports, are correct, at least in the main parts. One exception is the parallelism. The & Operator has been in use in V6 Unix, but not the if - fi bracketing construct, which is due to Bourne, who has been one of the first A68 compiler writers.

Agreed, postfix "&" would appear to predate A68's influence on the Bourne shell in v7. A coincidence only! :-( The "&" is certainly unique in both it's simplicity and similarity. It would be interesting to know what the inspiration for the v6 "&" was. NevilleDNZ 19:35, 8 February 2007 (UTC)

It is simply a misunderstanding of the article text, if you read the wording heritage as being ancestor. --Glasreiniger 14:16, 8 February 2007 (UTC)

[edit] References

[edit] Peter Craven of Algol Applications Ltd and A68C

Here is the reference for Peter Craven's Algol68's branch:

Murdoch states: "Peter Craven of Algol Applications Ltd made a version of A68C that was interactive (did incremental compilation, permitted no forward referencing)"

I will try and clarify. But I suspect CJC has the trump card.

NevilleDNZ 07:33, 6 March 2007 (UTC)

Appears the source quote was mis-read. The quote is: "The execution model is stack based, using separate static and dynamic stacks as in Algol68C, and all expressions are evaluated on the stack." i.e. "as in Algol68C."

I will see if I can get exp=667 corrected.

NevilleDNZ 07:47, 6 March 2007 (UTC)

[edit] Recent compiler work

[edit] 2002: Compiler / MMIX / NYU

In 2002 a NYU's Computer Science Ph.D. Candidate Antonio R. Nicolosi created a boot strap compiler for Algol68Nix.

NevilleDNZ 03:38, 13 April 2007 (UTC)

[edit] 2000: Interpretor / Linux&DOS / Rutgers University

Not all the exotic features are implemented. In particular there are no semaphores, formats and parallel-clauses.

NevilleDNZ 01:43, 1 December 2007 (UTC)

[edit] 1993: portable compiler / CWEB / univ-poitiers.fr

A portable Algol 68 compiler written in CWEB.

NevilleDNZ 03:05, 10 May 2007 (UTC)

[edit] A68S

Note, that there are two different beasts coming with this name: The a68s system which is available from the Unix Heritage Society for Unix-V7, see link to ALGOL 68S, and the compiler for Amiga etc. (add Atari ST and Ultrix-32 to the systems supported by this one) distributed by C.H.Lindsey. As I understand Lindsey's description they have a common ancestor. http://mirror.cc.vt.edu/pub/projects/Ancient_Unix/PDP-11/Applications/ --Glasreiniger 15:40, 3 December 2007 (UTC)

[edit] C++

First, the C++ code that has "the similar effect" to the new mode declaration is not exactly idiomatic C++. Why define a POD as a class rather than a struct, therefore requiring "public:", which is almost never written inline as in the example?

Meanwhile, in the comparisons, some of the most important differences are completely missed (e.g., separate compilation, templates, classes, RAII, exceptions, standard library vs. parallel processing, multi-dimensional arrays), while many of the listed differences are either not true or misleading.

It's claimed that C++ doesn't have:

  • garbage collection: But it does have refcounting (e.g., tr1::shared_ptr), with the usual advantages and disadvantages, and it can be used to collect all kinds of garbage (not just memory).
  • formatted transput using complex formatting declarations: C++ (with the standard library) has C's printf/scanf family, which has complex formatting embedded within the format string; these aren't syntactically integrated with the language (and in fact are interpreted at runtime rather than compile), but they're nevertheless roughly equivalent. C++ also has iostreams, which are syntactically integrated, albeit more verbose.
  • assignment operation symbol (to avoid confusion with equal sign): C++ spells the "equal sign" as "==", therefore the assignment operator can be "=" without confusion. (C++ spells constant initialization the same as assignment, while ALGOL 68 spells it the same as equality, but it's debatable which is more confusing.)
  • arrays (and slice operations on them, but in layered libraries): C++ does have arrays; it doesn't have slice operations; I don't know what layered libraries means in this context.
  • CASE expressions: No, but it does have case (switch) statements, which ALGOL 68 doesn't have (and the usual advantages and disadvantages of each apply).
  • nonlocal GOTO: What about setjmp?

While it's claimed that ALGOL 68 doesn't have:

  • overloaded procedures (in contrast to operators): True, but new operators can be defined.
  • explicit memory allocation and deallocation: Actually, it does have explicit allocation, with the "heap" modifier. It just doesn't have explicit deallocation.
  • textual preprocessing (header files): It's not just header files; it doesn't have textual macros or conditional compilation based either.
  • hierarchical classes: In fact, ALGOL 68 doesn't have real classes at all, so this is hardly surprising.

There are other features missing from C++:

  • parallelism (C++ can use platform-specific libraries such as pthreads--but even with these its memory model generally does not allow correct parallel programs to be written)
  • discriminated unions
  • case switching on unions
  • identifiers with spaces (not necessarily a bad thing to lack)
  • statement lists in loop conditions (although comma expressions can often fake this)
  • multiple-valued expressions (although see tr1::tuple and tr1::tie)
  • flexible arrays (although see std::realloc and std::vector)
  • variable-length array parameters (but again with the std::vector, or you can fake it by passing a separate length param)
  • multi-dimensional arrays (arrays of arrays are close, but not the same)
  • uniform initialization syntax (compare aggregates vs. classes with constructors)

And missing from ALGOL 68 (although some of these were included as extensions):

  • separate compilation (which lets you fake modules and libraries, as C and C++ do)
  • a large and useful standard library
  • templates (generics, metaprogramming, etc.)
  • class-oriented features (methods vs. static functions, constructors, etc.)
  • destructors/RAII
  • object-oriented features (dynamic polymorphism, inheritance, etc.)
  • exceptions (try/catch/throw)
  • namespaces
  • default parameter values
  • varargs
  • declaration/definition distinction
  • case switching on non-contiguous integers (and applying the same statement to multiple cases, and fall-through)
  • complex for loops (for (Node *p = head; p; p = p->next) { ... })
  • declarations in control statements (see above, and if (Node *p = (tail && tail->next)) swap(p, tail);)
  • true binary I/O
  • const reference (and pointer and, for that matter, value) parameters
  • overloading on constness
  • wide chars and strings
  • bit fields
  • operators equivalent to ^, ~, etc.

The corresponding section could presumably be fixed to be more complete and correct, but I question how useful it is in the first place. --75.36.138.40 10:38, 10 May 2007 (UTC)

A detailed comparison would be out of place in the ALGOL 68 article. (It is already 54k long)

In April I copied the comparison between ALGOL 68 and C++ into Comparison of ALGOL 68 and C++. I will remove it entirely (save for a link) from the A68 article. You (or I) can to insert your points above into Comparison of ALGOL 68 and C++, maybe with some details indicating if the feature is part of the C standard, C++ standard, a A68 revised standard, in a std/non-std library, or part of an implementation based extension. I am thinking a table format would work best.

You and I are probably the only people on the planet to worry about this, but I am keen if you are?

NevilleDNZ 08:34, 13 May 2007 (UTC)

I dont know what the original source for that particular comparison list. Hence even if we reuse it and expand on it in Comparison of ALGOL 68 and C++ we will have problems with Wikipedia:Attribution. Do you know of any "official" comparison? (eg. There is a Algol68 to Pascal comparison done by Andrew S. Tanenbaum).

NevilleDNZ 09:42, 13 May 2007 (UTC)

Having created this comparison, let me comment on this. As it is not from a verifiable source, it probably has to be deleted from the article. This comparison was solely intended to be a service for the uninitiated reader of the article about a historic language, which was likely to be forgotten at all. For the correctness of the claims you should distinguish between the language and any software system which can be built using the language. Garbage collection, e.g., is part of the language in Algol68 (and other programming languages like Java, Haskell, Lisp etc.), but not of C++. The goal of this list is not to discuss advantages or disadvantages, but just to explain what Algol68 is. A switch statement, e.g., is not at all comparable to a CASE expression, because it cannot be used in a formula. A68 does not have "identifier with spaces", it allows to write every identifier with spaces instead. I consider this form of polymorphism being not a good thing, too. --Glasreiniger 09:19, 14 May 2007 (UTC)
I removed the wrong claim that A68 does not explicitly allocates memory. Regarding the Pascal comparison: The main disadvantage of this is that Pascal was not intended to be used for production programming by its creator, its main design goal was teaching. The many fixes needed in real Pascal use make it difficult to distinguish between the language dialects. In contrast, both C++ and Algol68 claimed to be programming languages to be used.
The unkonwn contributor's comment on separate compilation has a very good point. Indeed, the lack of this is a big deficiency in A68, but: the solution of C++ is very clever, but mostly delegated to the host operating system by name mangling. At the time of A68, external symbols were restricted to be VERY short, sometimes only 6 chars; so it is not surprising that this way was not explored. Certainly A68 doesn't have wide char support, but C++ (as a language) neither did have (at least not in the earlier stages). In A68 it would be trivial to extend the language to allow LONG CHARs, but it can be argued that all characters in A68 could be wide chars as A68 chars are not defined being of length 8 (indeed, on many installations they had only 6 bits) as well.

--Glasreiniger

I always imagined that A68 was CharacterWidth agnostic. Esp it was planned to be used on 6-bit, 7-bit and 8-bit platforms. Will scan the Revised report for hints.

re: "identifier with spaces" - recall a quote (I cannot recall from exactly where) saying that after 2000 years of civilisation humans put spaces between words, and so too machines should be made to recognise spaces. (Ironically we have since evolved into CamelCase :-)

NevilleDNZ 10:03, 14 May 2007 (UTC)

Indeed, A68 ist charwidth agnostic, mostly. The RR defines the operators ABS and REPR to convert to and from INT. There is a prelude constant maxabschar defining the largest convertible integer, and it defines a minimal set of glyphs that must be present in the char set. But OTOH there are the conversions between BYTES (which may be available for different sizes), which assume that the length of CHARs converted to BYTES is a linear function of the charlength. This assumption is not valid for UTF-8. Anyway this could be cured without changing the language. Just supply prelude functions for support of the conversions needed.

One more thought about the attribution problem of this comparison. IMHO the facts reported in the comparison are easily verifiable by going down to the language definitions. So the only original research contained in the comparison is the selection of features mentioned. This should be acceptable for an encyclopedia. --Glasreiniger 12:47, 14 May 2007 (UTC)

[edit] Gomma?

The links to the external page (labelled 'Gomma') is broken. Searching the revised report only mentions that they are not included in the report. Apparently a reference to finnigan's wake, and supposedly allowing the use of semicolon instead of commas, is it an important concept? Is there a better link to replace the (broken) one? Google seems to only return either the revised report, this wikipedia page (or cut-and-pastes to/from it) or references to rubber. If it's not an important concept, the links (and mention of it) should go - if it is, then it needs exanding! Slothie 10:37, 15 June 2007 (UTC)

[edit] ALGOL 68 talk page tag &

I am sure that I am not the only ALGOL 68 programmer still kicking the dust.... So...

ALGOL 68-4 This user programs in ALGOL 68 at an expert level.
Here is a tag that can be put on ALGOL 68 contributors user page.

I have contributed some ALGOL 68 sample/example code to rosetta code... These are nothing fancy. http://www.rosettacode.org/rosettacode/w/index.php?title=ALGOL_68

Does anyone have some real ALGOL 68 code squirreled away on old tapes in the attic that could be contributed?

The 40th anniversary is coming up in December 2008. Anything simple planned in Munich? I hear Munich beer is rather good.

Here are now some online notes by Edsger W.Dijkstra on the subject of Algol-68. It is useful to see ALGOL-68 from his point of view: http://www.google.com.au/search?q=site:www.cs.utexas.edu/~EWD+algol-68

NevilleDNZ (talk) 15:43, 22 February 2008 (UTC)