Pycorrector

Pycorrector is a Python toolkit for Chinese text error correction. It focuses on common error types such as similar-sounding characters, visually similar characters, grammar issues, proper noun errors, missing words, extra words, wrong words, and word-order problems. The project implements multiple correction approaches, including KenLM, ConvSeq2Seq, BERT, MacBERT, ELECTRA, ERNIE, GPT-style models, and newer Qwen-based correction models. It is designed for use cases such as input method correction, OCR correction, speech recognition cleanup, search query correction, and general Chinese proofreading. The repository includes usage examples, evaluation materials, datasets, documentation, and model references. It is useful for NLP engineers, researchers, and application developers building Chinese language quality tools.

Features

Chinese text error correction
Phonetic and visual error handling
Grammar and word-order correction
KenLM and transformer model support
Evaluation and dataset resources
Ready-to-use Python toolkit
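
As a rough illustration of the phonetic and visual error handling listed above, the sketch below shows one simple way confusion-set correction can work: swap a character for a similar-sounding or similar-looking one when the swap produces a known word. This is a toy example and not Pycorrector's implementation or API. The names `VOCAB`, `CONFUSION`, `in_known_word`, and `correct` are hypothetical, and the vocabulary and confusion sets are hand-picked for the demo sentence.

```python
# Toy confusion-set corrector. NOT Pycorrector's code or API; all names
# and data here are hypothetical and chosen only for illustration.

# Assumed tiny vocabulary of known multi-character words.
VOCAB = {"少先队员", "应该", "老人", "让座"}

# Assumed confusion sets: character -> similar-sounding or similar-looking candidates.
CONFUSION = {"因": ["应"], "坐": ["座"]}


def in_known_word(chars, i, vocab, max_len):
    """Return True if some substring of length 2..max_len covering index i is in vocab."""
    for length in range(2, max_len + 1):
        for start in range(max(0, i - length + 1), i + 1):
            end = start + length
            if end <= len(chars) and "".join(chars[start:end]) in vocab:
                return True
    return False


def correct(text, vocab=VOCAB, confusion=CONFUSION, max_len=4):
    """Return (corrected_text, edits); each edit is (wrong_char, right_char, index)."""
    chars = list(text)
    edits = []
    for i, ch in enumerate(chars):
        # Characters already inside a known word are left alone.
        if in_known_word(chars, i, vocab, max_len):
            continue
        # Otherwise try each confusable candidate and keep the first
        # one that makes the position part of a known word.
        for cand in confusion.get(ch, []):
            trial = chars[:i] + [cand] + chars[i + 1:]
            if in_known_word(trial, i, vocab, max_len):
                chars = trial
                edits.append((ch, cand, i))
                break
    return "".join(chars), edits


if __name__ == "__main__":
    fixed, changes = correct("少先队员因该为老人让坐")
    print(fixed)
    print(changes)
```

A dictionary lookup like this only catches errors that break a known word. The language-model and transformer approaches named in the description score candidates in context instead, which is what lets them handle grammar and word-order problems.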

License

Apache License V2.0

Additional Project Details

Programming Language

Python

Related Categories

Python Text Editors

Registered

2026-06-18
