A typed, algebraic approach to parsing
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
Krishnaswami, N., & Yallop, J. (2019). A typed, algebraic approach to parsing. Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 379-393. https://doi.org/10.1145/3314221.3314625
In this paper, we recall the definition of the context-free expressions (or µ-regular expressions), an algebraic presentation of the context-free languages. Then, we define a core type system for the context-free expressions, which gives a compositional criterion for identifying those context-free expressions that can be parsed unambiguously by predictive algorithms in the style of recursive descent or LL(1). Next, we show how these typed grammar expressions can be used to derive a parser combinator library that guarantees linear-time parsing with single-token lookahead and no backtracking, and that respects the natural denotational semantics of context-free expressions. Finally, we show how to exploit the type information to write a staged version of this library, which produces dramatic increases in performance, even outperforming code generated by the standard parser generator tool ocamlyacc.
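As a rough illustration of the kind of compositional check the abstract describes, the following Python sketch (not the paper's OCaml library) gives each grammar expression a type recording whether it is nullable, its FIRST set, and a follow-last set. It rejects sequences and alternations that could not be parsed with one token of lookahead. The names `Ty`, `tok`, `seq`, `alt`, and `GrammarTypeError` are hypothetical. The side conditions follow a standard LL(1)-style presentation and are an assumption about the details, not a transcription of the paper. Fixed points (the µ operator) are omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ty:
    """Hypothetical grammar type: nullability, FIRST set, follow-last set."""
    null: bool
    first: frozenset
    flast: frozenset


class GrammarTypeError(Exception):
    """Raised when a grammar expression fails the predictive-parsing check."""


# Type of the empty-string expression: nullable, with no first or follow-last tokens.
EPS = Ty(True, frozenset(), frozenset())


def tok(c):
    """Type of a single-token expression."""
    return Ty(False, frozenset({c}), frozenset())


def seq(t1, t2):
    """Type of 'g1 then g2'; assumed side condition: g1 is not nullable and
    no token that may follow g1's last token can also start g2."""
    if t1.null:
        raise GrammarTypeError("left operand of a sequence must not be nullable")
    clash = t1.flast & t2.first
    if clash:
        raise GrammarTypeError(f"ambiguous sequence on tokens {sorted(clash)}")
    extra = (t2.first | t1.flast) if t2.null else frozenset()
    # t1 is not nullable, so the sequence is not nullable and starts like t1.
    return Ty(False, t1.first, t2.flast | extra)


def alt(t1, t2):
    """Type of 'g1 or g2'; assumed side condition: disjoint FIRST sets and
    at most one nullable branch."""
    if t1.null and t2.null:
        raise GrammarTypeError("both branches of an alternation are nullable")
    clash = t1.first & t2.first
    if clash:
        raise GrammarTypeError(f"ambiguous alternation on tokens {sorted(clash)}")
    return Ty(t1.null or t2.null, t1.first | t2.first, t1.flast | t2.flast)


a, b = tok("a"), tok("b")
ok = alt(seq(a, b), EPS)  # the grammar "ab | ε" passes the check
```

Under these assumptions, `alt(seq(a, b), seq(a, a))` raises `GrammarTypeError`, because both branches begin with the token `a` and a single token of lookahead cannot choose between them. Because the check is computed bottom-up from the types of the subexpressions alone, it is compositional in the sense the abstract describes.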
External DOI: https://doi.org/10.1145/3314221.3314625
This record's URL: https://www.repository.cam.ac.uk/handle/1810/292062
All rights reserved