Chinese XLNet pre-trained model
Inference script for Oasis 500M
Extract audio and video content and organize it into a Markdown note
Document Image Parsing via Heterogeneous Anchor Prompting”
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Automatic SSRF fuzzer and exploitation tool
Implementation of Vision Transformer, a simple way to achieve SOTA
A best practices guide for day 2 operations
Mini website for testing both general CS knowledge and enforce coding
The best ChatGPT that $100 can buy
Blazing-fast vector DB with similarity search and metadata filtering
Library for reading and writing large multi-dimensional arrays
A Model Context Protocol server for searching and analyzing arXiv
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
Set of tools to assess and improve LLM security
TorchMultimodal is a PyTorch library
ICLR2024 Spotlight: curation/training code, metadata, distribution
A library for differentiable nonlinear optimization
A flexible, high-performance 3D simulator for Embodied AI research
PyTorch code and models for V-JEPA self-supervised learning from video
A PyTorch library for implementing flow matching algorithms
An implementation of a deep learning recommendation model (DLRM)
[CVPR 2025 Best Paper Award] VGGT