Diversity-driven optimization and large-model reasoning ability
This repository provides an advanced RAG
Chinese and English multimodal conversational language model
Repo of Qwen2-Audio chat & pretrained large audio language model
AI-Driven Life Cycle (AI-DLC) adaptive workflow steering rules for AI
Project Lyra: Open Generative 3D World Models
Open-weight, large-scale hybrid-attention reasoning model
State-of-the-art (SoTA) text-to-video pre-trained model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Time-lapse Video Generation Models as Metamorphic Simulators
Toolkit to help you get started with Spec-Driven Development
From-scratch PyTorch implementation of Google's TurboQuant
Question and Answer based on Anything
Build Vision Agents quickly with any model or video provider
Python library for building agents that leverages Google Antigravity
Ultra-Efficient LLMs on End Device
Project-scoped Lean workflow orchestrator from Math, Inc.
Open-source Python framework for hybrid quantum-classical ml learning
Linkedin Automation Tool
AI-Driven Exploration in the Space of Code
Hypernetworks that adapt LLMs for specific benchmark tasks
Neural Network architecture based on ideas of the original LSTM
The Cradle framework is a first attempt at General Computer Control
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Tutorial tailored for Chinese babies on rapid fine-tuning