Implementation of Imagen, Google's Text-to-Image Neural Network
Build cross-modal and multimodal applications on the cloud
A multi-function Discord bot
Generate music based on natural language prompts using LLMs
Synchronized Translation for Videos
An artificial intelligence learning roadmap compiling 200 cases
Private chat with a local GPT over documents, images, video, etc.
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
LLM-based agent for general purpose software engineering tasks
Scalable machine learning for time series forecasting
Neural Search
Making Enterprise Data Intelligent and Responsive for AI
Powering Amazon custom machine learning chips
Python-free Rust inference server
Find the Root Cause in Your Code's Trace
Debug, evaluate, and monitor your LLM apps, RAG systems, and agentic AI
Open-Source AI Camera. Empower any camera/CCTV
Model Context Protocol server that integrates AgentQL's data
Recognition and resolution of numbers, units, date/time, etc.
Documentation for Google's Gen AI site - including Gemini API & Gemma
Just a Better Chatbot. Powered by MCP Client & Workflows
NMA Computational Neuroscience course
An advanced paper search agent powered by large language models