The lexer hack

From Wikipedia, the free encyclopedia

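In the parsing of computer programming languages, "the lexer hack" (as opposed to "a lexer hack") is the common name for a widely used solution to a problem that arises when a lexer based on a regular grammar must classify identifiers in ANSI C as either variable names or type names. The classification depends on earlier typedef declarations, which a regular grammar cannot track. For example, (A)*B is a cast of the dereferenced B if A names a type, and a multiplication of A by B otherwise.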

Solutions

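The solution generally consists of feeding information from the parser's symbol table back into the lexer, so that the lexer can look up each identifier and report whether it currently names a type. This tight coupling of the lexer and parser, which are otherwise separate stages, is generally regarded as inelegant, which is why it is called a "hack".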
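
The feedback loop can be illustrated with a minimal sketch in Python. It is not taken from any particular compiler: the names lex, scan and typedef_names are hypothetical, the token set is deliberately tiny, and the "parser" only recognizes declarations of the form typedef <base-type> <name>. The point it shows is that the lexer is lazy and consults a table that the parser updates while tokens are still being produced.

```python
import re

# Toy token set (an assumption for this sketch): identifiers and a few punctuators.
TOKEN_RE = re.compile(r"\s*(?:(?P<ident>[A-Za-z_]\w*)|(?P<punct>[()*;]))")
KEYWORDS = {"typedef", "int"}


def lex(source, typedef_names):
    """Lazily yield (kind, text) pairs, consulting typedef_names for each identifier."""
    source = source.rstrip()
    pos = 0
    while pos < len(source):
        m = TOKEN_RE.match(source, pos)
        if m is None:
            raise SyntaxError(f"unexpected character at offset {pos}")
        pos = m.end()
        text = m.group("ident")
        if text is None:
            yield ("PUNCT", m.group("punct"))
        elif text in KEYWORDS:
            yield ("KEYWORD", text)
        elif text in typedef_names:
            # The feedback step: the parser's table decides the token kind.
            yield ("TYPE_NAME", text)
        else:
            yield ("IDENTIFIER", text)


def scan(source):
    """Toy stand-in for a parser: records typedef names while consuming tokens."""
    typedef_names = set()  # shared with the lexer; plays the role of the symbol table
    tokens = []
    gen = lex(source, typedef_names)
    for tok in gen:
        tokens.append(tok)
        if tok == ("KEYWORD", "typedef"):
            # Assumed toy grammar: typedef <base-type> <name>
            base = next(gen)
            name = next(gen)
            tokens.extend([base, name])
            typedef_names.add(name[1])  # later tokens see the new type name
    return tokens


if __name__ == "__main__":
    for tok in scan("typedef int A; (A)*B;"):
        print(tok)
```

In this sketch the A inside the typedef declaration is still lexed as an ordinary identifier, while the A in (A)*B is reported as a type name because the table was updated in between. Without the typedef, the same expression yields an ordinary identifier for A.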

The problem does not arise, and so no "hack" is needed, when lexerless parsing techniques are used.

This page was last modified 23:15, 3 February 2007 by Wikipedia user LouisWins. Based on work by Wikipedia user(s) Megacz, SmackBot, Enlil Ninlil, Velela, Delta G and Daniel Lawrence and Anonymous user(s) of Wikipedia.
All text is available under the terms of the GNU Free Documentation License. (See Copyrights for details.)