Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
lightweight package to simplify LLM API calls
Fast and memory-efficient exact attention
Bidirectional token-classification model for identifiable info
The easiest way to use deep metric learning in your application
Tool for exploring and debugging transformer model behaviors
Easily turn large sets of image urls to an image dataset
A TTS model capable of generating ultra-realistic dialogue
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
The AI toolkit for the AI developer
Open-source tool designed to enhance the efficiency of workloads
AI Agent Evaluator & Red Team Platform
Gracefully face hCaptcha challenge with multimodal llms
Stable Diffusion built-in to Blender
A refreshing functional take on deep learning
Toloka-Kit is a Python library for working with Toloka API
A lightweight, powerful framework for multi-agent workflows
Train machine learning models within Docker containers
Video Object and Interaction Deletion
Reflexion: Language Agents with Verbal Reinforcement Learning
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
A Python library for audio data augmentation
Python binding to the Apache Tika™ REST services
95% token savings. 155x faster queries. 16 languages
OCR expert VLM powered by Hunyuan's native multimodal architecture