Standalone. Small. Language-neutral. BudouX is the successor to Budou, the machine learning-powered line break organizer tool. It is standalone. It works with no dependency on third-party word segmenters such as Google cloud natural language API. It is small. It takes only around 15 KB including its machine learning model. It's reasonable to use it even on the client-side. It is language-neutral. You can train a model for any language by feeding a dataset to BudouX’s training script.

Features

  • BudouX supports HTML inputs
  • Documentation available
  • Examples available
  • You can get a list of phrases by feeding a sentence to the parser
  • BudouX supports HTML inputs and outputs HTML strings
  • BudouX uses the AdaBoost algorithm to segment a sentence into phrases

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

License

Apache License V2.0

Follow BudouX

BudouX Web Site

Other Useful Business Software
Atera - an All-in-one platform for IT management Icon
Atera - an All-in-one platform for IT management

Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
Try Atera now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of BudouX!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Machine Learning Software

Registered

2024-08-19