A modular graph-based Retrieval-Augmented Generation (RAG) system
Generate 3D objects conditioned on text or images
Multimodal-Driven Architecture for Customized Video Generation
Capable of understanding text, audio, vision, video
Label, clean and enrich text datasets with LLMs
Adding guardrails to large language models
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
Python package for AutoML on Tabular Data with Feature Engineering
Code for the paper "Language Models are Unsupervised Multitask Learners"
Phi-3.5 for Mac: Locally-run Vision and Language Models
Repo of Qwen2-Audio chat & pretrained large audio language model
The open-source data curation platform for LLMs
Stable Diffusion built into Blender
An open source implementation of CLIP
Qwen2.5-VL is the multimodal large language model series
Underthesea - Vietnamese NLP Toolkit
Aider is AI pair programming in your terminal
Framework that is dedicated to making neural data processing
MII makes low-latency and high-throughput inference possible
Open source no-code system for text annotation and building of text
Language modeling in a sentence representation space
Code for the paper "Evaluating Large Language Models Trained on Code"
Multimodal Diffusion with Representation Alignment
Obsei is a low-code, AI-powered automation tool
A full spaCy pipeline and models for scientific/biomedical documents