Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
The ChatGPT Retrieval Plugin lets you easily find personal documents
Pushing the Limits of Mathematical Reasoning in Open Language Models
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Blazeface is a lightweight model that detects faces in images
Open source large language model by Alibaba
Powerful open source image generation model
DeepSeek LLM: Let there be answers
Open Multilingual Multimodal Chat LMs
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official code for Style Aligned Image Generation via Shared Attention
Official PyTorch Implementation of "Scalable Diffusion Models"
Code release for ConvNeXt V2 model
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
Open-source code agent designed for Lean 4
JetBrains’ 4B parameter code model for completions
Vision-language-action model for robot control via images and text
Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input