A lexical analyzer generator that produces class source code.
ULS is an object factory for lexical analysis with UTF-8 support. It is provided as C/C++ libraries, together with a few tools, for Linux and Windows.
- ULS is an object-oriented, generic lexical analyzer that supports UTF-8 encoded input files. This means you can use your native language for identifiers in program source.
- ULS can tokenize multiple files or (literal) strings as nested input files.
- ULS can stream the sequence of processed tokens to another file in binary format.
- ULS can replace 'flex' as the token provider for a yacc/bison parser. Its lexical configuration is read from a file with the '.ulc' suffix.
- Because the precision of floating-point numbers is irrelevant to lexical analysis, ULS handles real and integer numbers with an arbitrary number of digits.
- ULS provides a unique (varargs-style) logging framework, which can also be used for general purposes. ULS supports dynamic printf: users can define their own formatted *-printf functions by adding or overriding conversion specifications. The default conversion specifications, newly written from scratch, include %s, %c, %d, %u, %f, %Ld, %Lf, and so on. The group of conversion specifications can itself be instantiated per thread. The output port used by a group can be anything, provided it is given a 'puts' interface. By default, ULS provides 'puts' interfaces for string buffers and file pointers, including stdout and stderr.
- There are examples that scan 'css3', 'html5', 'shell-script', and 'Makefile' inputs and dump their tokens.
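
The 'puts' interface idea from the logging item can be illustrated with a minimal C sketch. This is not the ULS API: every name here (out_port, str_buf, strbuf_puts, file_puts, port_printf) is hypothetical, and the formatting is delegated to the standard vsnprintf rather than to ULS's own conversion specifications. It only shows how one formatter can write to any sink that supplies a 'puts'-style callback.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical output port: any sink that can accept a run of bytes. */
typedef struct {
    void *ctx;                                            /* sink-specific state */
    void (*puts_fn)(void *ctx, const char *s, size_t n);  /* the 'puts' interface */
} out_port;

/* Sink 1: a fixed-capacity, NUL-terminated string buffer. */
typedef struct {
    char  *buf;
    size_t cap;   /* total capacity, including the terminating NUL */
    size_t len;   /* bytes currently stored, excluding the NUL */
} str_buf;

void strbuf_puts(void *ctx, const char *s, size_t n)
{
    str_buf *sb = ctx;
    assert(sb->cap > sb->len);            /* room for at least the NUL */
    size_t room = sb->cap - sb->len - 1;
    if (n > room)
        n = room;                         /* silently truncate on overflow */
    memcpy(sb->buf + sb->len, s, n);
    sb->len += n;
    sb->buf[sb->len] = '\0';
}

/* Sink 2: a stdio file pointer such as stdout or stderr. */
void file_puts(void *ctx, const char *s, size_t n)
{
    fwrite(s, 1, n, (FILE *)ctx);
}

/* Format with the standard vsnprintf, then hand the bytes to the port.
 * Returns the number of bytes passed to the port, or a negative value
 * if formatting failed. Output longer than the scratch buffer is cut. */
int port_printf(out_port *port, const char *fmt, ...)
{
    char tmp[256];
    va_list ap;

    va_start(ap, fmt);
    int n = vsnprintf(tmp, sizeof tmp, fmt, ap);
    va_end(ap);
    if (n < 0)
        return n;

    size_t len = (size_t)n < sizeof tmp ? (size_t)n : sizeof tmp - 1;
    port->puts_fn(port->ctx, tmp, len);
    return (int)len;
}
```

To target stdout instead of a buffer, a caller would build the port as `out_port p = { stdout, file_puts };` and call `port_printf(&p, ...)` unchanged. Keeping the sink behind a single callback is what lets the same formatting code serve string buffers and file pointers alike.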