Phi-3.5 for Mac: Locally-run Vision and Language Models
High-Resolution Image Synthesis with Latent Diffusion Models
DeepVariant is an analysis pipeline that uses a deep neural network
A Domain-Fronting Relay that routes traffic through GAS
Wan2.1: Open and Advanced Large-Scale Video Generative Model
The RF and reverse engineering framework for everyone
The Modular Platform (includes MAX & Mojo)
Secure local-first microVM sandbox for running untrusted code fast
Find the local LLM that actually runs and performs best
AI agents running research on single-GPU nanochat training
Machine Learning Engineering Open Book
High-performance inference server and API layer for text embedding models
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
Solve puzzles. Learn CUDA
Unified framework for building enterprise RAG pipelines
Z80-μLM is a 2-bit quantized language model
Simplifies the local serving of AI models from any source
Text and image to video generation: CogVideoX and CogVideo
Low-latency AI inference engine optimized for mobile devices
Jupyter notebook tutorials for OpenVINO
Running large language models on a single GPU
High-performance inference framework for large language models
Making large AI models cheaper, faster and more accessible
The repository provides code for running inference with SAM 2
BioNeMo Framework: For building and adapting AI models