Outcome driven agent development framework that evolves
Real-World Centric Foundation GUI Agents
Python binding to the Apache Tika™ REST services
Large Multimodal Models for Video Understanding and Editing
Towards Human-Sounding Speech
Interact with your SQL database, Natural Language to SQL using LLMs
Mice speech to text with MX Cinnamon OS ISO
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
A lightweight control plane for Linux servers
Visual Automation IDE — automate anything you see on screen
Software that uses AI to perform real-time voice conversion
Synapta OS is a preconfigured educational Linux distribution with AI
Real-time behaviour synthesis with MuJoCo, using Predictive Control
AI Suite for upscaling, interpolating & restoring images/videos
dashAI: an interactive platform for training, evaluating and deploying
A dev-first open source autonomous AI agent framework
computer vision projects | Fun AI projects related to computer vision
Multi-Voice and Prompt-Controlled TTS Engine
Build ChatGPT over your data, all with natural language
Official Code for DragGAN (SIGGRAPH 2023)
Let us control diffusion models
Python package for easily interfacing with chat apps
Free AutoGPT enables autonomous AI tasks without paid APIs
A webui for different audio related Neural Networks
A collection of high-quality models for the MuJoCo physics engine