Less Code, Lower Barrier, Faster Deployment
PPTAgent: Generating and Evaluating Presentations
A simple, secure MCP-to-OpenAPI proxy server
Implementation of "MobileCLIP" (CVPR 2024)
Advanced mathematical types and functions for Swift
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
Mobile manipulation research tools for roboticists
CoreNet: A library for training deep neural networks
High-resolution models for human tasks
Ling is a MoE LLM provided and open-sourced by InclusionAI
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Personalize Any Characters with a Scalable Diffusion Transformer
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Check code for common misspellings
Extensible AGI Framework
Fine-tune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
SWE-agent takes a GitHub issue and tries to automatically fix it
Multilingual Automatic Speech Recognition with word-level timestamps
Superfast AI decision making and processing of multi-modal data
Powerful and highly extensible command-line based document