A lightweight audio-to-MIDI converter with pitch bend detection
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
DeepVariant is an analysis pipeline that uses a deep neural networks
Open source demo platform where you can easily showcase your AI models
Language modeling in a sentence representation space
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Data Lake for Deep Learning. Build, manage, and query datasets
Create HTML profiling reports from pandas DataFrame objects
Industrial-strength Natural Language Processing (NLP)
TTS with kokoro and onnx runtime
Synchronized Translation for Videos
NVR with realtime local object detection for IP cameras
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The largest open-source medical AI skills library for OpenClaw
Guiding Instruction-based Image Editing via Multimodal Large Language
Generate short videos with one click using AI LLM
Open source framework for deep learning satellite and aerial imagery
Build cross-modal and multimodal applications on the cloud
Oobabooga - The definitive Web UI for local AI, with powerful features
A refreshing functional take on deep learning
Public repository for Agent Skills
Offline Text To Speech synthesis for python
AI agents autonomously run and improve ML experiments overnight
gpt-4o for windows, macos and linux
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning