Tensor search for humans
The data structure for multimodal data
Implementation of Imagen, Google's Text-to-Image Neural Network
Open Source Differentiable Computer Vision Library
Hub of ready-to-use datasets for ML models
Build cross-modal and multimodal applications on the cloud
Build AI-powered semantic search applications
Tock, the open source conversational AI toolkit
A multi-function Discord bot
Python-free Rust inference server
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
LLM-based agent for general purpose software engineering tasks
Scalable machine learning for time series forecasting
On-device Speech-to-Intent engine powered by deep learning
Benchmarking synthetic data generation methods
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Powering Amazon custom machine learning chips
Model Context Protocol server that integrates AgentQL's data
Recognition and resolution of numbers, units, date/time, etc.
Documentation for Google's Gen AI site - including Gemini API & Gemma
Just a Better Chatbot. Powered by MCP Client & Workflows
Open platform for building, deploying, and managing LLM agents
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An advanced paper search agent powered by large language models