Fast and memory-efficient exact attention
NVR with realtime local object detection for IP cameras
An innovative library for efficient LLM inference
DeepSeek Coder: Let the Code Write Itself
Pruna is a model optimization framework built for developers
Toolkit for running TensorFlow training scripts on SageMaker
Blazing-fast Data-Wrangling toolkit
AI gateway with token compression for Claude Code, Codex, and more
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Protect and discover secrets using Gitleaks
Fast, flexible LLM inference
Pre-trained Deep Learning models and demos
Fully private LLM chatbot that runs entirely in a browser
Foundation Models for Time Series
Official implementation of DreamCraft3D
A GPU-accelerated library containing highly optimized building blocks
A simple tool for reading in poorly redacted documents
Towards Real-World Vision-Language Understanding
Text and image to video generation: CogVideoX and CogVideo
State-of-the-art TTS model under 25MB
Book 5 of Statistics in Simplicity
The official Meta Llama 3 GitHub site
C++ inference library for multiple SVC/TTS
From Images to High-Fidelity 3D Assets
Formula recognition based on LaTeX-OCR and ONNXRuntime