Real-time voice interactive digital human
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Towards Human-Level Text-to-Speech through Style Diffusion
Concatenate a directory full of files into a single prompt
Unified Multimodal Understanding and Generation Models
code for Mesh R-CNN, ICCV 2019
GPT4V-level open-source multi-modal model based on Llama3-8B
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Designed for text embedding and ranking tasks
Inference framework for 1-bit LLMs
Wraps all package managers with a unifying CLI
Implementation of Video Diffusion Models
A general purpose syntax highlighter in pure Go
Full git and GitHub integration with Sublime Text
CTFs as you need them
A refreshing functional take on deep learning
Tool for visualizing and tracking your machine learning experiments
Library for serving Transformers models on Amazon SageMaker
Build AI-powered semantic search applications
Python implementation of global optimization with gaussian processes
Combination of multiple linters to install as a GitHub Action
The Go support for Google's protocol buffers
The lightweight PyTorch wrapper for high-performance AI research
A Django content management system focused on flexibility & UX
CasADi is a symbolic framework for numeric optimization