CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
Scalable machine learning for time series forecasting
Chat & pretrained large audio language model proposed by Alibaba Cloud
Benchmarking synthetic data generation methods
A collection of reference Jupyter notebooks and demo AI/ML application
A Repo For Document AI
ChatGPT interface with better UI
Unified Multimodal Understanding and Generation Models
Gemma open-weight LLM library, from Google DeepMind
Enable AI to control your desktop, mobile and HMI devices
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
kaldi-asr/kaldi is the official location of the Kaldi project
ktrain is a Python library that makes deep learning AI more accessible
Meta Agents Research Environments is a comprehensive platform
Pruna is a model optimization framework built for developers
JAX-based neural network library
A python library for self-supervised learning on images
A Python toolbox for scalable outlier detection
Powering Amazon custom machine learning chips
Official implementation of Watermark Anything with Localized Messages
Video understanding codebase from FAIR for reproducing video models
Multimodal-Driven Architecture for Customized Video Generation
Personalize Any Characters with a Scalable Diffusion Transformer
Superduper: Integrate AI models and machine learning workflows