Phi-3.5 for Mac: Locally-run Vision and Language Models
Building AI agents, atomically
The easiest and laziest way to build multi-agent LLM applications
The Library for LLM-based multi-agent applications
A lightweight text-to-speech model with zero-shot voice cloning
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Generate high-definition short story videos with one click using AI
Quick illustration of how one can easily read books together with LLMs
The most powerful Android RPA agent framework
A command-line productivity tool powered by AI large language models
Open-source AI agent framework
Document (PDF, Word, PPTX ...) extraction and parsing API
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
The terminal client for Ollama
AI agents running research on single-GPU nanochat training
Build reliable Gen AI solutions without overhead
The 100-line AI agent that solves GitHub issues
Making RAG Simpler with Small and Open-Sourced Language Models
"Big Model" trains a visual multimodal VLM with 26M parameters
Simple and easily configurable grid world environments
A TTS that fits in your CPU (and pocket)
On the Structural Pruning of Large Language Models
Implementation for MatMul-free LM