VMZ: Model Zoo for Video Modeling
Refer and Ground Anything Anywhere at Any Granularity
Sample code and notebooks for Generative AI on Google Cloud
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Language modeling in a sentence representation space
The standard data-centric AI package for data quality and ML
Multi-modal large language model designed for audio understanding
FAIR Sequence Modeling Toolkit 2
21 Lessons, Get Started Building with Generative AI
A Python application to add watermarks (text or image) to PDF files
ChatGPT extension for scientific research work
Label, clean and enrich text datasets with LLMs
Graphical User Interface Face Anonymization Tool
Run GGUF models easily with a UI or API. One File. Zero Install.
mice stt tts
Open source annotation tool for machine learning practitioners
Convert an image to text to spot intelligible words.
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Application that simplifies the installation of AI-related projects
Resources, corpora, and tools for Chinese natural language processing
Chinese voice dialogue robot/smart speaker project
Video automatic transcribe and translated subtitle generator
Img2Txt - Extract Text From Images using AI