Question: Write C or C++ Program! Write a tokenizer. The tokenizer should input a stream of ASCII characters and output the token, its line number in

Write C or C++ Program!

Write a tokenizer. The tokenizer should input a stream of ASCII characters and output the token, its line number in the stream, its type, and its value. You must write the code from scratch without the use of any lexicographical parsing libraries or utilities.

Tokenizer should recognize the following:

1. A few keywords: "if", "else", "for", "while"

2. A few single-character symbols: '&', '|', '+', '*', ':', ';'

3. Labels (alpha-numerical strings)

4. Integers (including negative integers)

5. Floating-point numbers in radix notation (such as pi, i.e. 3.14159265)

6. Floating-point numbers in exponential notation (such as Avogadro's number, i.e. 6.022140857E23)

Max 100 lines of code and I/O example, I/O has to be read from file!

Code example down

#include #include #include

// --------------------------------------------------------------------- // this trivial program reads a stream and outputs tokens // valid tokens: // 0: error // 1: whitespace (blank, tab, CR, LF) // 2: word (lowercase only) // 3: number (unsigned decimal integer) // --------------------------------------------------------------------- #define STATE_ERROR 0 #define STATE_WHITESPACE 1 #define STATE_WORD 2 #define STATE_NUMBER 3

char szWhiteSpace[]=" \t "; char szWord[]="abcdefghijklmnopqrstuvwxyz"; char szNumber[]="0123456789"; char *szStates[]={"ERROR", "WHITESPACE", "WORD", "NUMBER"};

int main(void){ char c; char szToken[256]; int nTokenSize=0; int nChars=0, nTokens=0, nLines=1; int nCurState=STATE_ERROR, nNextState=STATE_ERROR;

while((c=getc(stdin))) { if(c==EOF) break; nChars++;

if(strchr(szWord, c)) nNextState=STATE_WORD; else if(strchr(szNumber, c)) nNextState=STATE_NUMBER; else if(strchr(szWhiteSpace, c)) nNextState=STATE_WHITESPACE; else nNextState=STATE_ERROR;

if(nChars==1) nCurState=nNextState;

// uncomment the following line to debug // printf("line %4d, char %4d: %c [%02X] (%d -> %d) ", nLines, nChars, c, c, nCurState, nNextState);

if(nNextState==nCurState) { szToken[nTokenSize++]=c; if(c==' ') nLines++; continue; }

szToken[nTokenSize]=0; if((nCurState==STATE_WORD)||(nCurState==STATE_NUMBER)) printf("token %2d, line %2d: %10s (%s) ", ++nTokens, nLines, szToken, szStates[nCurState]);

nTokenSize=0; szToken[nTokenSize++]=c; nCurState=nNextState; if(c==' ') nLines++; }

return(nTokens); } // --------------------------------------------------------------------- // ---------------------------------------------------------------------

------------------------------------------------------------------------ SAMPLE input: ------------------------------------------------------------------------ one two three

1 11 123 13456

one 1 two 2 ------------------------------------------------------------------------ corresponding output: (./cs305_lex < in.txt) ------------------------------------------------------------------------ token 1, line 1: one (WORD) token 2, line 2: two (WORD) token 3, line 3: three (WORD) token 4, line 5: 1 (NUMBER) token 5, line 5: 11 (NUMBER) token 6, line 5: 123 (NUMBER) token 7, line 5: 13456 (NUMBER) token 8, line 7: one (WORD) token 9, line 7: 1 (NUMBER) token 10, line 8: two (WORD) token 11, line 8: 2 (NUMBER)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Read the entire lab instruction before starting This lab is to be completed on BrightSpace any lab worksheets handed in will be discarded. Carefully follow the procedures outlined in this lab...

Write a program in the file freq.cpp which reads text from the user and then computes the frequency of each vowel as well as the number of consonants that appear in the text. A vowel is one of the...

Zipfs Law is a curious observation about the statistical distribution of words in text: the frequency of any word is inversely proportional to its rank in the frequency table. Frequency is the number...

NEED IT ASAP PLEASE!!!!!! 2.3 The summary program problemlines.c The line length check program described above can give a user full information about which of their program lines are too long, and by...

Can use C, C++, Java or Python 3. You are to implement three programs that implement an error-detection mechanism using the standard CRC (Cycle Redundancy Check) algorithm. The first program, named...

Can use C, C++, Java or Python 3. You are to implement three programs that implement an error-detection mechanism using the standard CRC algorithm. The first program, named generator, reads from the...

PLEASE DO NOT ANSWER IN C++ - Write an assembly language program for the JASPer processor. The program should allow the user to convert a four-digit binary number to decimal. The program works as...

Explain how iterative development makes project scheduling more complex.

Journal entries for revising estimate of life. Give the journal entries for the following selected transactions of Florida Manufacturing Corporation. The company uses the straight-line method of...

Enabled: 1 3 Qulz 3 The concept of substance over form infuences the classification of obligations expected to be refinanced. The or False

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

___ 19. I will feel successful in my career only if I achieve complete autonomy and freedom.

___ 20. I seek jobs in organizations that will give me a sense of security and stability.

___ 11. I am most fulfilled in my work when I am completely free to define my own tasks, schedules, and procedures.