GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Python SDK for the Computer Use model Lux, developed by OpenAGI
Large Multimodal Models for Video Understanding and Editing
Chat & pretrained large audio language model proposed by Alibaba Cloud
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Offline Text To Speech synthesis for python
A sharp, readable, vector-y version of Monocraft
A library of extension and helper modules for Python's data analysis
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Bring your favorite shell wherever you go through the ssh
LLM-based Reinforcement Learning audio edit model
GUI Exploration Lab. One of the best GUI agent solutions
Comprehensive and timely academic information on federated learning
A Model Context Protocol server that provides network asset info
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Sync and Async ODM (Object Document Mapper) for MongoDB
Trained models & code to predict toxic comments
WikiChat is an improved RAG
Automate code reviews, patching and documentation
Lazy Predict help build a lot of basic models without much code
Tools for manipulating datasets
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
CMS framework for Django
Datasets, transforms and models specific to Computer Vision
Animation engine for explanatory math videos