Diversity-driven optimization and large-model reasoning ability
Multilingual Document Layout Parsing in a Single Vision-Language Model
Convert codebases into structured prompts optimized for LLM analysis
Helps scientists define testable, modular, self-documenting dataflows
Powering Amazon custom machine learning chips
Omnilingual ASR: Open-Source Multilingual Speech Recognition
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Designed for text embedding and ranking tasks
The official PyTorch implementation of Google's Gemma models
Automatically find issues in image datasets
A curated collection of skills for AI coding agents
LLM-powered fuzzing via OSS-Fuzz
Tools for merging pretrained large language models
Pretrained time-series foundation model developed by Google Research
A lightweight data processing framework built on DuckDB and 3FS
Implementation of 'lightweight' GAN, proposed at ICLR 2021
LLM-based Reinforcement Learning audio edit model
A Python library for self-supervised learning on images
A TTS model capable of generating ultra-realistic dialogue
Deep learning library
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Obsei is a low-code, AI-powered automation tool
Get easy bot lobbies in any game with our bot lobbies tool.
Database system for building simpler and faster AI-powered applications