Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
When LLM Meets Domain Experts
Designed for text embedding and ranking tasks
Airtable integration for AI-powered applications
Turns Data and AI algorithms into production-ready web applications
A reactive notebook for Python
High-Fidelity and Controllable Generation of Textured 3D Assets
Multilingual sentence & image embeddings with BERT
The official Python Library for the Groq API
Repo of Qwen2-Audio chat & pretrained large audio language model
A unified framework for scalable computing
Solve end to end problems using Llama model family
A fast and lightweight framework for creating decentralized agents
Adding guardrails to large language models
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A library for accelerating Transformer models on NVIDIA GPUs
Open source codebase for Scale Agentex
SOTA Open Source TTS
Offline inference engine for art, real-time voice conversations
MCP integration platforms for AI agents to use tools at any scale
ContextGem: Effortless LLM extraction from documents
Efficient Triton Kernels for LLM Training
Elyra extends JupyterLab with an AI centric approach
Concatenate a directory full of files into a single prompt
PPTAgent: Generating and Evaluating Presentations