Tokenize is a Julia package that serves a similar purpose to Python's tokenize module, with a similar API, but for Julia code. It takes a string or buffer containing Julia code, performs lexical analysis, and returns a stream of tokens.

Features

  • Fast: it currently lexes all of Julia's source files in ~0.25 seconds (580 files, 2 million tokens)
  • Round-trippable: the original string should be recoverable exactly from a stream of tokens
  • The function tokenize is the main entry point for generating tokens
  • Each Token is represented by where it starts and ends, what string it contains and what type it is
  • Documentation available
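
The features above can be sketched in a few lines. This is a minimal example, not an authoritative reference. It assumes that the Tokenize.jl package is installed, that tokenize and untokenize are exported, and that the accessors startpos, endpos and kind are reached through the Tokens submodule. The input string is purely illustrative.

```julia
using Tokenize

# Illustrative input; any string of Julia code should work.
src = "function f(x) end"

# tokenize returns a lazy stream of tokens; collect materializes it.
toks = collect(tokenize(src))

# Each token knows where it starts and ends, its kind, and its text.
for t in toks
    println(Tokens.startpos(t), " - ", Tokens.endpos(t), "  ",
            Tokens.kind(t), "  ", repr(untokenize(t)))
end

# Round trip: concatenating the text of every token rebuilds the input.
roundtrip = untokenize(toks)
@assert roundtrip == src
```

Whitespace is emitted as tokens of its own, which is what makes the exact round trip possible.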

License

MIT License

Additional Project Details

Programming Language

Julia

Related Categories

Julia 3D Modeling Software, Julia Data Visualization Software, Julia Libraries

Registered

2023-12-07