flex-help Mailing List for flex: the fast lexical analyser

Hi all,

I'm trying to learn Bison and Flex so I can make a converter between a
markup language of my design and PDF or HTML, depending on which Bison
program I use. After unsuccessfully following several online docs,
I've decided to make the simplest possible parser: my trivial Flex
program tokenizes a document file consisting of paragraphs, each
separated by one blank line, and my trivial Bison program adds "<p>"
to the front of each paragraph and "</p>" to the end of each
paragraph. To make this trivial experiment easier, I can specify
that a blank line consists of two consecutive \n characters without any
space between them, and if necessary a \n at the beginning of the
document file and an extra \n at the end of the document file.

I can see two ways to do this:

1) All Flex to Bison communication happens via the scanner's stdout to
   the parser's stdin, using a distinct executable for the scanner and
   the parser respectively. Both the Flex program and the Bison program
   would each have a main() function.

2) All Flex to Bison communication happens through tokens defined in
   the Bison source file, yylval, $$ and $1 etc. The Flex program would
   be a library without its own main() that gets compiled and linked
   into the Bison-generated parser. The combined program then
   translates the input document coming in through its stdin into
   something else that comes out of its stdout.

#1 is conceptually easy, but I haven't been able to do it. I got close,
   but I couldn't make Bison do the regex necessary to parse its stdin
   input.

#2 seems to be the "best practices" way and I'd imagine it's faster on
   big input files, but I haven't been able to make it work because I
   don't understand how it functions.

I know that converting plain-text paragraphs into <p></p>-wrapped
paragraphs could easily be done in Flex alone, or for that matter in a
five-line AWK program or even an AWK one-liner. My purpose in making
this thing is to have the easiest possible program that actually passes
tokens from the scanner to the parser, which then translates them into
something else. It's the "Hello World" I must slowly build from to
create a real converter.

For #2 I'd imagine my Bison rules would look something like the
following:

input: chunk | input chunk
chunk: paragraph newline
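
As a rough picture of how option 2 fits together, here is a plain C
sketch in which hand-written functions stand in for the code that Flex
and Bison would generate. It only illustrates the contract between the
two: the parser calls yylex() each time it wants a token, yylex()
returns an integer token code, and the matched text travels alongside
it in yylval. Everything specific here is an assumption made for the
sketch: the token names PARAGRAPH and NEWLINE, the src cursor, the
struct used for yylval, the convert() helper, and the fact that
yyparse() takes an output buffer (a generated yyparse() normally takes
no arguments, and a generated yylex() reads from yyin).

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Token codes. Bison would generate these from %token declarations;
   the names and values here are made up. 0 means end of input. */
enum { END = 0, PARAGRAPH = 258, NEWLINE = 259 };

static const char *src;  /* remaining input; stands in for Flex's yyin */

/* Value of the last PARAGRAPH token; stands in for Bison's yylval. */
static struct { const char *text; size_t len; } yylval;

/* Hand-written stand-in for the Flex-generated yylex().
   PARAGRAPH = text up to a '\n' that is followed by '\n' or end of input
   NEWLINE   = the run of '\n' characters found at that point */
int yylex(void)
{
    if (*src == '\0')
        return END;
    if (*src == '\n') {
        while (*src == '\n')
            src++;
        return NEWLINE;
    }
    yylval.text = src;
    while (*src != '\0' &&
           !(src[0] == '\n' && (src[1] == '\n' || src[1] == '\0')))
        src++;
    yylval.len = (size_t)(src - yylval.text);
    return PARAGRAPH;
}

/* Hand-written stand-in for the Bison-generated yyparse(), for
     input: chunk | input chunk
     chunk: PARAGRAPH NEWLINE
   Returns 0 on success, 1 on a syntax error, 2 if out is too small. */
int yyparse(char *out, size_t outsz)
{
    size_t used = 0;
    int tok = yylex();

    out[0] = '\0';
    do {
        if (tok != PARAGRAPH)
            return 1;
        /* Save the token's value, as Bison's value stack would ($1). */
        const char *text = yylval.text;
        size_t len = yylval.len;

        if (yylex() != NEWLINE)
            return 1;
        /* Action for "chunk": wrap the paragraph text in <p>...</p>. */
        int w = snprintf(out + used, outsz - used, "<p>%.*s</p>\n",
                         (int)len, text);
        if (w < 0 || (size_t)w >= outsz - used)
            return 2;
        used += (size_t)w;
        tok = yylex();
    } while (tok != END);
    return 0;
}

/* Convenience wrapper (hypothetical): point the scanner at a string
   and run the parser over it. */
int convert(const char *text, char *out, size_t outsz)
{
    assert(text != NULL && out != NULL && outsz > 0);
    src = text;
    return yyparse(out, outsz);
}
```

In the real setup the scanner rules would live in a Flex file and the
grammar in a Bison file; running bison with -d writes a header of token
codes that the Flex file includes, and the two generated C files are
compiled and linked into one executable. Note that this sketch, like
the grammar above, rejects input whose last paragraph is not followed
by a newline.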

1) Have any of you communicated between your lexer and parser using
stdout/stdin or intermediate files exclusively?

2) For option 2, what would my Flex and Bison files look like, and what
   would be the command to compile and link them together? If the
   executable for option #2 is a.out, would the command to use it be
   cat mytest.txt | ./a.out ?

Also, is there a Flex IRC channel?

I know it sounds like I'm asking you to do my homework, but I've been
trying to do this for two weeks with web research and experimentation,
and haven't gotten to first base, so I'd appreciate any help you could
give.

Thanks,

SteveT

Steve Litt 

Autumn 2023 featured book: Rapid Learning for the 21st Century
http://www.troubleshooters.com/rl21
