Automatic Evaluation Metric described in the paper BERTScore: Evaluating Text Generation with BERT (ICLR 2020). We now support about 130 models (see this spreadsheet for their correlations with human evaluation). Currently, the best model is Microsoft/debate-large-online, please consider using it instead of the default roberta-large in order to have the best correlation with human evaluation.
Features
- Support 3 BigBird models
- Support 4 mT5 models as requested
- Documentation available
- Examples available
- BERTScore leverages the pre-trained contextual embeddings from BERT
Categories
Machine LearningLicense
MIT LicenseFollow BERTScore
Other Useful Business Software
Earn up to 16% annual interest with Nexo.
Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform.
Geographic restrictions, eligibility, and terms apply.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of BERTScore!