Obsei is a low-code, AI-powered automation tool
A full spaCy pipeline and models for scientific/biomedical documents
An Open Source text-to-speech system built by inverting Whisper
Chat & pretrained large vision language model
Towards Human-Sounding Speech
Evaluate and monitor ML models from validation to production
Open Source Document Management System for Digital Archives
Han Language Processing
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Generate Any 3D Scene in Seconds
21 Lessons, Get Started Building with Generative AI
Generate audiobooks from e-books
Standalone, small, language-neutral
Simple, Pythonic building blocks to evaluate LLM applications
TextWorld is a sandbox learning environment for the training
ImageBind: One Embedding Space to Bind Them All
A Python library that makes AMR parsing, generation and visualization
Open-Sora: Democratizing Efficient Video Production for All
Implementation of AudioLM audio generation model in PyTorch
End-to-end speech processing toolkit
A TTS model capable of generating ultra-realistic dialogue
A Repo For Document AI
Scalable data preprocessing and curation toolkit for LLMs