CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Python scraper based on AI
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Contexts Optical Compression
Fast and accurate AI-powered file content type detection
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Python SDK for Claude Agent
Repo for SeedVR2 & SeedVR
FAIR Sequence Modeling Toolkit 2
Official implementation of Watermark Anything with Localized Messages
Example Discord bot written in Python that uses the completions API
Fast and Universal 3D reconstruction model for versatile tasks
MCP server that integrates Confluence and Jira
SoTA open-source TTS
The ChatGPT Retrieval Plugin lets you easily find personal documents
README file generator, powered by AI
Reading book source
LLM-based Reinforcement Learning audio edit model
Chat & pretrained large vision language model
A Python library for audio
A SOTA open-source image editing model
Claude Code skill that researches any topic across Reddit + X
Multi-modal large language model designed for audio understanding