JavaCC Lexical Analysis Lab Guide

The document outlines Lab 1 for the CS4905 course at UNB, focusing on lexical analysis using JavaCC and lex. It provides step-by-step instructions for downloading example files, compiling them, and testing various lexer programs with specific input. Additionally, it includes modifications to enhance the functionality of the lexers, such as recognizing identifiers and keywords.

Uploaded by

Zerihun Bekele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

104 views3 pages

JavaCC Lexical Analysis Lab Guide

Uploaded by

Zerihun Bekele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

UNB Faculty of Computer Science, CS4905 Introduction to Compiler Construction

Lab 1 January 15, 2007

Purpose: To become familiar with tools for lexical analysis.
1. Log in to a Linux workstation in ITD415. By virtue of being enrolled in CS4905, you should receive a
Computer Science Linux-lab login ID and password via your UNB E-mail.
2. Create a subdirectory in your unix directory space to contain the source code for this lab. For purposes
of illustration, I will assume that you call this subdirectory “L1”.
Part 1. Experiments with JavaCC
3. Download the “[Link]” file into your L1 subdirectory from the CS4905 web site
[Link]
(use e.g. Mozilla). File names ending in .jj are intended to contain JavaCC (Java compiler compiler)
input.

4. Type
javacc [Link]
on the command line to compile the javacc into a Java program. The overall process is shown in Figure 1.
source code

JavaCC Java source Java executable

program e.g. javacc program e.g. javac tokens
program e.g.
[Link] [Link] [Link]

Figure 1. Using JavaCC to generate executable programs for lexical analysis.

5. Type
javac [Link]
on the command line to compile the Java program into an executable (.class) program. This will
automatically compile any dependent classes.

6. Type
java Simple1
at the command line to execute the Java program. When running, the Simple1 program checks for
matching curly braces. Type a series of matching curly braces e.g.
{{}}
at the command line, followed by a “newline” character (press the Enter key to obtain a “\n” character)
and an “end of file” <EOF> character (press “Ctrl – d” to obtain an <EOF> charcacter).
Try entering a set of unmatched curly braces to see what the lexer program does.

7. Repeat the above steps 3, 4, 5 and 6 for the [Link] program. Test this program with input
containing curly braces and some white space characters. What is the difference between [Link]
and [Link]?

8. Repeat the above steps 3, 4, 5 and 6 for the [Link] program. Test this program with input
containing curly braces and some white space characters. Note how the program counts the nesting level
of the curly braces and prints the nesting level after <EOF> is encountered. Change the message printed
by the Simple3 program to “Curly brace nesting level is”. Recompile [Link] and run it
again to see the changed output.
1
9. Repeat the above steps 3, 4, 5 and 6 for the [Link] program. Test this program with input
containing valid and invalid identifiers according to the regular expressions for the TOKEN <Id> in
[Link].

10. Copy the [Link] program to a file called [Link]. We will now modify the
[Link] program to make a parser object called Lexer1. This requires changing all instances of
IdList to Lexer1 in the .jj program. Add an output statement something like the following:
{ [Link]("I recognize ID " ); }
after an ID token is recognized.
Add a [Link] program that constructs a Lexer1 object from a Main program. Do this by
entering a file called [Link] similar to the following:
import [Link].*;
public class Main {

public static void main(String [] args) throws Exception {

try {
new Lexer1([Link]).Input();
[Link]("Lexical analysis successful");
}
catch (ParseException e) {
[Link]("Lexer Error : \n"+ [Link]());
}
}
}
Prepare a test file [Link] containing the following three lines:
if8
Test
7.29
Now, run your Lexer1 parser against the input file using the following steps:
javacc [Link]
javac [Link]
java Main < [Link]
where the input redirection operator “<” redirects the file [Link] to the standard input object
[Link]. Your lexer should print out that it recognizes two ID tokens, and then print an error
message. Change the input data file so that all lines contain valid identifiers, and run your lexer again.

11. Modify your Lexer1 parser to add the recognition of keyword IF tokens and print a message "I
recognize IF" when an IF token is recognized. The JavaCC specification for this is as follows:
< IF: "if" >
Note that this token will need to be specified as the first alternative in a disjunctive list (i.e. separated by
the | operator) inside the TOKEN specifications. The Input() method also needs to be modified to add
the output statement for IF tokens. Modify your [Link] file to include valid if keywords and
ID tokens, and run your lexer program again.

Part 2. Experiments with lex

12. Download the “lex1.l” file into your L1 subdirectory from the CS4905 web site
[Link]
(use e.g. Mozilla). File names ending in .l are intended to contain lex input.

13. Compile the lex1.l program using the instructions posted at the above site; i.e. type
lex name.l
2
on the command line, where name.l is replaced with the name of your lex input file. Then invoke the C
compiler using
cc [Link].c -o name -ll
on the command line. By default, lex always produces output to the [Link].c file. The output
executable file is called name, so to run the program, type
./name < [Link]
where [Link] is the name of an input file.

14. Test your lexer by using the following input file:

if(x < 12.56E-4)
y = x + 7;
else if(x < 0.5 && x >= 0.0)
y = x * x + 4;
else
{
y = x / 2.5;
z = a*(b-3) + 4 / 7.3;
}
The lexer should print out all the numbers it finds in this file.

15. Modify the lex1.l program to add recognition of real numbers (in addition to integers) according
to the regular expression in Figure 2.2 on p.20 of the text. The regular expression for a REAL token in
Figure 2.2 is as follows:
([0-9]+"."[0-9]*)|([0-9]*"."[0-9]+)
Your will need to add an optional part something like
([eE][+-]?[0-9]+)?
to recognize the exponent as part of the real number.

16. Repeat the above step 13 for the lex2.l program. This lexer recognizes html tokens. Test this
lexer using as input the input file of the CS4905 home page; i.e.
[Link]

17. Repeat the above step 13 for the lex3.l program. This lexer program counts the number of lines,
words and characters in a file. Test this lexer using as input the test program given in step 14 above.
Note that the appearance of the circumflex ‘^’ character as the first character in a character class
specification (i.e. characters between square brackets ‘[‘ and ‘]’) changes the meaning to match any
character except those within the brackets.

18. Repeat the above step 13 for the lex5.l program. This lexer prints only words followed by
punctuation. If the following sentence was the input from standard input:
"I was here", they said.
But were they? I cannot tell.
it will recognize and print the words “here”, “said”, “they”, and “tell”. Test that your lexer program
works correctly with the above example. Note that the forward slash character ‘/’ matches the preceding
regular expression but only if followed by the following regular expression; thus the pattern ‘0/1’
matches “0” in the string “01”. The characters matched by the pattern following the forward slash is not
“consumed” and remains to be turned into subsequent tokens. Only one foward slash is permitted per
pattern.

Lexical Analysis in C Programming
No ratings yet
Lexical Analysis in C Programming
73 pages
hw1: miniJava Lexer Implementation
No ratings yet
hw1: miniJava Lexer Implementation
4 pages
Lexical Analyzer and Compiler Design
No ratings yet
Lexical Analyzer and Compiler Design
41 pages
Lexical Analyzer and Compiler Tools Guide
No ratings yet
Lexical Analyzer and Compiler Tools Guide
17 pages
Lexical Analyzer Project for mini-C
No ratings yet
Lexical Analyzer Project for mini-C
4 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
53 pages
Lexical Analyzer Implementation Guide
No ratings yet
Lexical Analyzer Implementation Guide
33 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
73 pages
Javalet Compiler Scanner Implementation
No ratings yet
Javalet Compiler Scanner Implementation
7 pages
Building Compilers with Flex and Bison
No ratings yet
Building Compilers with Flex and Bison
23 pages
Lexical Analyzer Implementation in C
No ratings yet
Lexical Analyzer Implementation in C
82 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
13 pages
Token Generation with FLEX and Polish Notation
No ratings yet
Token Generation with FLEX and Polish Notation
9 pages
C Language Scanner Design Guide
No ratings yet
C Language Scanner Design Guide
28 pages
Lexical Analysis and Analyzer Generators
No ratings yet
Lexical Analysis and Analyzer Generators
69 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
16 pages
CDLabmanual
No ratings yet
CDLabmanual
40 pages
Lexer Implementation for CS321 Lab 2
No ratings yet
Lexer Implementation for CS321 Lab 2
6 pages
Implementing a Lexical Analyzer with Flex
No ratings yet
Implementing a Lexical Analyzer with Flex
16 pages
Lexical Analyzer Using LEX/Flex Tool
No ratings yet
Lexical Analyzer Using LEX/Flex Tool
8 pages
Compiler Design Lab Manual
75% (16)
Compiler Design Lab Manual
55 pages
Compiler Design Lab Experiments Guide
No ratings yet
Compiler Design Lab Experiments Guide
14 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
71 pages
Compiler Design Lab Manual for CSE
No ratings yet
Compiler Design Lab Manual for CSE
40 pages
Lexical Analyzer with Flex and Lex
No ratings yet
Lexical Analyzer with Flex and Lex
8 pages
Practical File Compiler Design
No ratings yet
Practical File Compiler Design
32 pages
Compiler Design Lab: LEX & YACC Experiments
No ratings yet
Compiler Design Lab: LEX & YACC Experiments
74 pages
Lexical Analysis and Tokenization Explained
No ratings yet
Lexical Analysis and Tokenization Explained
6 pages
CD Cse Record
No ratings yet
CD Cse Record
76 pages
Compiler Design Lab Manual V1.X
No ratings yet
Compiler Design Lab Manual V1.X
11 pages
Lexical Analyzer Projects in C and Lex
No ratings yet
Lexical Analyzer Projects in C and Lex
23 pages
Using Flex for Lexical Analysis
No ratings yet
Using Flex for Lexical Analysis
11 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
67 pages
Lex Tool in Compiler Design Lab
No ratings yet
Lex Tool in Compiler Design Lab
15 pages
Lex and Yacc in Compiler Design
No ratings yet
Lex and Yacc in Compiler Design
26 pages
Lexical Analysis Using Jflex: Tokens
No ratings yet
Lexical Analysis Using Jflex: Tokens
39 pages
Lex and Yacc Programs Overview
No ratings yet
Lex and Yacc Programs Overview
169 pages
Compiler Design Lab Report: Lex, Yacc, ANTLR
No ratings yet
Compiler Design Lab Report: Lex, Yacc, ANTLR
52 pages
Finite Automata and Lex Tool Implementation
No ratings yet
Finite Automata and Lex Tool Implementation
55 pages
Lex and Yacc Programming Guide
No ratings yet
Lex and Yacc Programming Guide
34 pages
Compiler Design Lab Manual 2020-2024
No ratings yet
Compiler Design Lab Manual 2020-2024
28 pages
Compiler Design Lab Experiments Guide
No ratings yet
Compiler Design Lab Experiments Guide
50 pages
Compiler Construction and Lexical Analysis
No ratings yet
Compiler Construction and Lexical Analysis
14 pages
TinyJava Lexical Analyzer Assignment
No ratings yet
TinyJava Lexical Analyzer Assignment
3 pages
CS3501 Compiler Design Lab Manual
No ratings yet
CS3501 Compiler Design Lab Manual
53 pages
Compiler Design Lab Experiments KCS-552
No ratings yet
Compiler Design Lab Experiments KCS-552
48 pages
Lexical Analysis in Programming Languages
No ratings yet
Lexical Analysis in Programming Languages
21 pages
C Program for Symbol Table Implementation
No ratings yet
C Program for Symbol Table Implementation
68 pages
Lexical Analyzer in Compiler Design
No ratings yet
Lexical Analyzer in Compiler Design
40 pages
Lexical Analyzer Implementation with Lex
No ratings yet
Lexical Analyzer Implementation with Lex
4 pages
Flex and Bison Grammar Tutorial
No ratings yet
Flex and Bison Grammar Tutorial
7 pages
Compiler Design Lab Manual
No ratings yet
Compiler Design Lab Manual
32 pages
Compiler Design Practical Index
No ratings yet
Compiler Design Practical Index
33 pages
Lexical Analysis in Compiler Design
No ratings yet
Lexical Analysis in Compiler Design
38 pages
Compiler Design Lab Manual Guide
No ratings yet
Compiler Design Lab Manual Guide
80 pages
Compiler Design Lab Manual 2019
No ratings yet
Compiler Design Lab Manual 2019
38 pages
ToT Workshop for Digital Skills in Ethiopia
No ratings yet
ToT Workshop for Digital Skills in Ethiopia
5 pages
MARIE Computer Architecture Overview
100% (1)
MARIE Computer Architecture Overview
22 pages
Java PDF Parsing Techniques
No ratings yet
Java PDF Parsing Techniques
46 pages
Java Abstract Syntax Tree Overview
No ratings yet
Java Abstract Syntax Tree Overview
84 pages
Static vs Dynamic Memory in Java
No ratings yet
Static vs Dynamic Memory in Java
80 pages
Microsoft Word Training Modules
No ratings yet
Microsoft Word Training Modules
9 pages
Computer Architecture and Evolution Notes
No ratings yet
Computer Architecture and Evolution Notes
11 pages
Boolean Algebra and Logic Gates Overview
No ratings yet
Boolean Algebra and Logic Gates Overview
25 pages
Integer Division and Number Representation
No ratings yet
Integer Division and Number Representation
16 pages
Evaluation Functions in AI Coursework
No ratings yet
Evaluation Functions in AI Coursework
1 page
ACSC368: Artificial Intelligence: Course Details
No ratings yet
ACSC368: Artificial Intelligence: Course Details
4 pages
Understanding Internet of Things (IoT)
No ratings yet
Understanding Internet of Things (IoT)
91 pages
Overview of Artificial Intelligence
No ratings yet
Overview of Artificial Intelligence
26 pages
Blocks World Problem Solution Steps
No ratings yet
Blocks World Problem Solution Steps
1 page
VR, AR, and MR: Key Comparisons
No ratings yet
VR, AR, and MR: Key Comparisons
22 pages
Introduction to Emerging Technologies
100% (1)
Introduction to Emerging Technologies
58 pages
Use JavaCC To Build A User Friendly
No ratings yet
Use JavaCC To Build A User Friendly
21 pages
Java Recursive Descent Parsing Guide
No ratings yet
Java Recursive Descent Parsing Guide
82 pages
Compiler Basics and Lexical Analysis
No ratings yet
Compiler Basics and Lexical Analysis
58 pages
MS Word Practical Exercises Guide
No ratings yet
MS Word Practical Exercises Guide
5 pages
Dallol HR Management System Project
No ratings yet
Dallol HR Management System Project
17 pages
HMIS Exercise Answers - Feb 2010
No ratings yet
HMIS Exercise Answers - Feb 2010
30 pages
Moodle2Word Word Template: Startup Menu: Supported Question Types
No ratings yet
Moodle2Word Word Template: Startup Menu: Supported Question Types
6 pages
Fundamentals of Applied Probability and Random Processes: 2 Edition
0% (1)
Fundamentals of Applied Probability and Random Processes: 2 Edition
10 pages
Y-Bus Matrix Formation by Inspection
No ratings yet
Y-Bus Matrix Formation by Inspection
5 pages
CS2204 Results for CSE Students
No ratings yet
CS2204 Results for CSE Students
12 pages
Understanding StringBuilder in Java
No ratings yet
Understanding StringBuilder in Java
43 pages
iPasolink O&M Training Overview
No ratings yet
iPasolink O&M Training Overview
84 pages
MB760 MB770 MPS5502 Service-Manual-Rev1
100% (1)
MB760 MB770 MPS5502 Service-Manual-Rev1
390 pages
Airtel Customer Care Numbers by Region
No ratings yet
Airtel Customer Care Numbers by Region
1 page
Python Programs: Area, Cube, Max Number
No ratings yet
Python Programs: Area, Cube, Max Number
4 pages
Data Quality DMB Ok Dam A Brasil
100% (1)
Data Quality DMB Ok Dam A Brasil
46 pages
Digital Agency Workflow
100% (4)
Digital Agency Workflow
20 pages
Unpaid Dating Accounts for Sale
No ratings yet
Unpaid Dating Accounts for Sale
1 page
C-Strings and String Manipulation in C++
No ratings yet
C-Strings and String Manipulation in C++
30 pages
Counting Sort Algorithm and Complexity
No ratings yet
Counting Sort Algorithm and Complexity
24 pages
D1 Algorithms Assessment Overview
No ratings yet
D1 Algorithms Assessment Overview
20 pages
KeyframeCaddy UserGuide
No ratings yet
KeyframeCaddy UserGuide
13 pages
Intel Gigabit Ethernet Driver Settings
No ratings yet
Intel Gigabit Ethernet Driver Settings
14 pages
Data Residu Vervalpd Kab. Serang
No ratings yet
Data Residu Vervalpd Kab. Serang
39 pages
Virtual Memory Concepts and Techniques
No ratings yet
Virtual Memory Concepts and Techniques
103 pages
Overview of Distributed Systems Concepts
No ratings yet
Overview of Distributed Systems Concepts
48 pages
ArcView 9.3 Installation Guide
No ratings yet
ArcView 9.3 Installation Guide
4 pages
Quality Items Configuration Guide
No ratings yet
Quality Items Configuration Guide
2 pages
Emotion Recognition System Overview
No ratings yet
Emotion Recognition System Overview
29 pages
Holland High School IT Project Guide
No ratings yet
Holland High School IT Project Guide
4 pages
AI Problem Solving and Search Strategies
No ratings yet
AI Problem Solving and Search Strategies
42 pages
Nuvama Wealth: Prasad Kasar's Profile
No ratings yet
Nuvama Wealth: Prasad Kasar's Profile
5 pages
Company Directory and Contact List
0% (1)
Company Directory and Contact List
140 pages
Beginner's Guide to Word Processing Skills
No ratings yet
Beginner's Guide to Word Processing Skills
8 pages
Jetway V266B Manual
No ratings yet
Jetway V266B Manual
49 pages
Introduction to Network Programming
No ratings yet
Introduction to Network Programming
23 pages

JavaCC Lexical Analysis Lab Guide

Uploaded by

JavaCC Lexical Analysis Lab Guide

Uploaded by

UNB Faculty of Computer Science, CS4905 Introduction to Compiler Construction

Lab 1 January 15, 2007

JavaCC Java source Java executable

Figure 1. Using JavaCC to generate executable programs for lexical analysis.

public static void main(String [] args) throws Exception {

Part 2. Experiments with lex

14. Test your lexer by using the following input file:

You might also like