High-Fidelity and Controllable Generation of Textured 3D Assets
Bring the notion of Model-as-a-Service to life
Genome modeling and design across all domains of life
Achieving 3+ generation speedup on reasoning tasks
Faster and easier training and deployments
Run PyTorch LLMs locally on servers, desktop and mobile
4M: Massively Multimodal Masked Modeling
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Official implementation of DreamCraft3D
Reflexion: Language Agents with Verbal Reinforcement Learning
Parallax is a distributed model serving framework
ZAPI by Adopt AI is an open-source Python library
Build Vision Agents quickly with any model or video provider
Controllable & emotion-expressive zero-shot TTS
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Synthetic Data Generation for tabular, relational and time series data
Simplest working implementation of StyleGAN2
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A Personalized LLM-powered Agent Framework
Framework for validating and controlling LLM outputs in AI apps
An agentless approach to automatically solve software development problems
A simple, performant and scalable JAX LLM
LISA: Reasoning Segmentation via Large Language Model
Skywork-R1V is an advanced multimodal AI model series
Code and models for ICML 2024 paper, NExT-GPT