Real-time voice interactive digital human
Capable of understanding text, audio, vision, video
Open-Sora: Democratizing Efficient Video Production for All
Terminal emulator application for Android OS extendible
Efficient few-shot learning with Sentence Transformers
Framework for building, orchestrating, and deploying AI agents
Concatenate a directory full of files into a single prompt
Python framework for adversarial attacks, and data augmentation
Textual is a TUI (Text User Interface) framework for Python
Long-form streaming TTS system for multi-speaker dialogue generation
Marrying Grounding DINO with Segment Anything & Stable Diffusion
HY-Motion model for 3D character animation generation
Model Context Protocol Server for Apache OpenDAL™
Chinese version of Google open source project style guide
Bailing is a voice dialogue robot similar to GPT-4o
Pretrained model hub for Keras 3
Making RAG Simpler with Small and Open-Sourced Language Models
A lightweight approach to removing Google web service dependency
TUI for Ollama and other LLM providers
Translates Django models using a registration approach
Parse files for optimal RAG
Extension of Google Research’s PaperBanana
Retrieval and Retrieval-augmented LLMs
Low-latency REST API for serving text-embeddings
Complete Two-Factor Authentication for Django