Central interface to connect your LLM's with external data
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Dealing with all unstructured data, such as reverse image search
Algorithms for outlier, adversarial and drift detection
This repo contains the code for 1D tokenizer and generator
Scalable generative AI framework built for researchers and developers
Repo of Qwen2-Audio chat & pretrained large audio language model
The data structure for multimodal data
Build AI-powered semantic search applications
Machine learning, conversational dialog engine for creating chat bots
Audio foundation model excelling in audio understanding
Phi-3.5 for Mac: Locally-run Vision and Language Models
Shared repository for open-sourced projects from the Google AI Lang
Context database designed specifically for AI Agents
Fast-stable-diffusion + DreamBooth
A python tool that uses GPT-4, FFmpeg, and OpenCV
Multimodal Diffusion with Representation Alignment
OpenRecall is a fully open-source, privacy-first alternative
Multi-modal large language model designed for audio understanding
Large Multimodal Models for Video Understanding and Editing
Concatenate a directory full of files into a single prompt
AI-Researcher: Autonomous Scientific Innovation
Solve end to end problems using Llama model family
Pretrained model hub for Keras 3
Efficient few-shot learning with Sentence Transformers