Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Generating Immersive, Explorable, and Interactive 3D Worlds
Collection of awesome LLM apps with AI Agents and RAG using OpenAI
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Speech-AI-Forge is a project developed around TTS generation model
The Classical Language Toolkit
Training data (data labeling, annotation, workflow) for all data types
Fast backend for long-term AI user memory via structured profiles
Sample code and notebooks for Generative AI on Google Cloud
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
kaldi-asr/kaldi is the official location of the Kaldi project
Python bindings for llama.cpp
Neural Network Compression Framework for enhanced OpenVINO
Open source personal AI Assistant for Linux, Windows and Mac
Implementation of DeepLabCut
DeepCode: Open Agentic Coding
Programmatic access to the AlphaGenome model
Seamlessly integrate LLMs into scikit-learn
Python library for defining and optimizing mathematical expressions
Automatically translates the text of a video based on a subtitle file
A modular graph-based Retrieval-Augmented Generation (RAG) system
Implementation of Make-A-Video, new SOTA text to video generator
Python tool for converting files and office documents to Markdown
Building an Intelligent Agent from Scratch
Definitions for AI/ML tasks like dataset creation