meta data for this page
This is an old revision of the document!
Tree Building Enhancements in JavaCC21
In JavaCC21, there is no separate program analogous to legacy JavaCC's JJTree. Building an [[AST]] is assumed to be the way that most people will want to use the tool, so parsers generated by JavaCC21 build a tree by default. If you really don't want to build a tree (or you want to build a tree but in your own java code actions) there is a TREE_BUILDING_ENABLED setting that can be set to false. (The default value is true.) Assuming that you leave TREE_BUILDING_ENABLED as its default of true, JavaCC21 supports various options. In addition to the ones that were available in JJTree (most of them, anyway) the following settings are available:
- TREE_BUILDING_DEFAULT
- TOKENS_ARE_NODES
- SPECIAL_TOKENS_ARE_NODES
- SMART_NODE_CREATION
- FREEMARKER_NODES
TREE_BUILDING_DEFAULT
TREE_BUILDING_DEFAULT means that your parser will build a tree by default. If this option is set to false, it means that your code needs to explicitly turn on tree-building. Note the difference between disabling tree building altogether via TREE_BUILDING_ENABLED=false and setting this option to false. If you set TREE_BUILDING_ENABLED to false, no tree building code is inserted in your parser. If you set TREE_BUILDING_DEFAULT to false, the tree building code is present, but must be toggled on. Like so:
MyParser parser = new MyParser(....); parser.setBuildTree(true);
Of course, in the opposite case, where TREE_BUILDING_DEFAULT is on and you want to parse input without building a tree, in the above snippet, you would have parser.setBuildTree(false). Note that this is the situation if you do not specify the TREE_BUILDING_DEFAULT option as tree building is on by default.
TOKENS_ARE_NODES
This option indicates that Tokens should be treated as (terminal) nodes in the abstract syntax tree. Note that, for this reason, The Token.java generated by JavaCC21 contains extra methods that allow it to implement the Node interface. Note that TOKENS_ARE_NODES is the default. If you want the older JJTree behavior, you must explicitly set TOKENS_ARE_NODES=false in your options block.
SPECIAL_TOKENS_ARE_NODES
If you set this option along with TOKENS_ARE_NODES, then any "special" tokens are also treated as terminal nodes and are added to the tree -- as preceding siblings to the regular Token they are associated with. This is off by default. As a practical question, SPECIAL_TOKENs are usually used for comments in the source code, so this would typically amount to deciding whether you want comments included in your AST.
SMART_NODE_CREATION
This option indicates that, by default, a production should result in a new node being created if there is more than one node on the stack. This is the default behavior, since it seems to be what most people would want out of the box, and, if you want the older default, of having every production be a definite node, you must explicitly turn it off via SMART_NODE_CREATION=false in your options block.
FREEMARKER_NODES
This option means that extra code is added so that your Node objects implement core FreeMarker API's, in particular freemarker.template.TemplateScalarModel and freemarker.template.TemplateNodeModel`. This means that if you expose the tree you build to a FreeMarker template, you can walk the tree using a very natural syntax. Note, however, that using the FREEMARKER_NODES option creates a runtime dependency on freemarker.jar.
Enhancements related to Tokens in the AST
By default, JavaCC21 will generate subclasses of Token that correspond to the various lexical elements in your grammar. Consider the following specification:
    TOKEN #Operator : 
    {
       
       |
       
       |
       
       |
        
    }
</pre>
For the above, JavaCC21 will define a Token subclass called Operator and the Tokens for “+”, “-”, “*”, and “/” will be instantiated as instances of Operator.