The repository provides code for running inference with SAM 2
Educational framework exploring multi-agent orchestration
The Python library for real-time communication
Audiocraft is a library for audio processing and generation
Provides CTP stock options and Zhongtai Securities XTP
Learn all about Digital Forensics and Computer Forensics
End-to-end speech processing toolkit
Video understanding codebase from FAIR for reproducing video models
Multimodal Diffusion with Representation Alignment
A command-line utility for taking automated screenshots of websites
A collection of reference Jupyter notebooks and demo AI/ML applications
Collection of common code shared among different research projects
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Community-maintained approach to improving access to GitHub services
Meta Agents Research Environments is a comprehensive platform
SwarmZero's SDK for building AI agents, swarms of agents and much more
freeCodeCamp.org's open-source codebase and curriculum
Official implementation of DreamCraft3D
Towards Real-World Vision-Language Understanding
Open source platform for the machine learning lifecycle
AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
Code for the paper "Language models can explain neurons in language models"
A C++ binding for the OpenGL API, generated using the gl.xml specification
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Concatenate a directory full of files into a single prompt