Pycorrector is a Python toolkit for Chinese text error correction. It focuses on common error types such as similar-sounding characters, visually similar characters, grammar issues, proper noun errors, missing words, extra words, wrong words, and word-order problems. The project implements multiple correction approaches, including KenLM, ConvSeq2Seq, BERT, MacBERT, ELECTRA, ERNIE, GPT-style models, and newer Qwen-based correction models. It is designed for use cases such as input method correction, OCR correction, speech recognition cleanup, search query correction, and general Chinese proofreading. The repository includes usage examples, evaluation materials, datasets, documentation, and model references. It is useful for NLP engineers, researchers, and application developers building Chinese language quality tools.

Features

  • Chinese text error correction
  • Phonetic and visual error handling
  • Grammar and word-order correction
  • KenLM and transformer model support
  • Evaluation and dataset resources
  • Ready-to-use Python toolkit

Project Samples

Project Activity

See All Activity >

Categories

Text Editors

License

Apache License V2.0

Follow Pycorrector

Pycorrector Web Site

Other Useful Business Software
$300 Free Credits for Your Google Cloud Projects Icon
$300 Free Credits for Your Google Cloud Projects

Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
Start Free Trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Pycorrector!

Additional Project Details

Programming Language

Python

Related Categories

Python Text Editors

Registered

12 hours ago