Global weather forecasting model using graph neural networks and JAX
code for Mesh R-CNN, ICCV 2019
Language modeling in a sentence representation space
An AI-powered security review GitHub Action using Claude
Advancing Formal Mathematical Reasoning via Reinforcement Learning
FlashMLA: Efficient Multi-head Latent Attention Kernels
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Open-source large language model family from Tencent Hunyuan
Audio foundation model excelling in audio understanding
Example Discord bot written in Python that uses the completions API
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
A fast, local neural text to speech system
Dataset of GPT-2 outputs for research in detection, biases, and more
Official code for Style Aligned Image Generation via Shared Attention
High-Resolution Image Synthesis with Latent Diffusion Models
Open-source, high-performance Mixture-of-Experts large language model
Open source large language model by Alibaba
A Conversational Speech Generation Model
A CNN model that predicts human joints from RGB images of a person
Blazeface is a lightweight model that detects faces in images
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Latent Diffusion and Stable Diffusion Implementation
Detect faces in an image