Flexible Photo Recrafting While Preserving Your Identity
MARS5 speech model (TTS) from CAMB.AI
Curl cryptocurrencies exchange rates
Implementation of "MobileCLIP" CVPR 2024
Build a large language model from 0 only with Python foundation
Real-World Centric Foundation GUI Agents
A Systematic Framework for Interactive World Modeling
Sample code and notebooks for Generative AI on Google Cloud
Unified Multimodal Understanding and Generation Models
AI framework for automated short video creation and editing tools
Quick illustration of how one can easily read books together with LLMs
A python tool that uses GPT-4, FFmpeg, and OpenCV
A lightweight framework for building LLM-based agents
Bidirectional token-classification model for identifiable info
Ultra-Efficient LLMs on End Device
Advanced NLP with spaCy: A free online course
Multi-tool for semantic search
SQL-Driven RAG Engine
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
95% token savings. 155x faster queries. 16 languages
Chinese XLNet pre-trained model
Framework for building neural networks
Memory-efficient and performant finetuning of Mistral's models
Large Multimodal Models for Video Understanding and Editing
Generative AI reference workflows