Build cross-modal and multimodal applications on the cloud
Deep learning library
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA"
LLM-based agent for general purpose software engineering tasks
A minimal yet professional single agent demo project
Real-time voice interactive digital human
An Open Source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
Powering Amazon custom machine learning chips
An advanced paper search agent powered by large language models
Automatically translates the text of a video based on a subtitle file
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Swirl queries any number of data sources with APIs
A python library for easy manipulation and forecasting of time series
Capable of understanding text, audio, vision, video
Genome modeling and design across all domains of life
An AI for Music Generation
Automatically Visualize any dataset, any size
OpenAI-style API for open large language models
An efficient forwarding service designed for LLMs
Private chat with local GPT with documents, images, video, etc.
Inference Llama 2 in one file of pure C