Multi-modal large language model designed for audio understanding
Large Multimodal Models for Video Understanding and Editing
OCR expert VLM powered by Hunyuan's native multimodal architecture
Feature-rich yet seamless and simple Rust binding generator
Large language model and vision-language model based on linear attention
Swirl queries any number of data sources with APIs
A library and utilities for processing GIFs
Extensible workflow development framework
Open Source API Gateway written in Go
Self-Modifying Framework from the Future
OpenMMLab Model Deployment Framework
Obsei is a low-code, AI-powered automation tool
CursusDB is an open-source distributed in-memory database
The Open Source CFD Toolbox
A collection of packages providing extra functionality for GNU Octave
Powerful desktop publishing software
Swiss army knife of image processing
Removes backgrounds from pictures. Extension for webui
Spatial data processing for geomodeling
Bilibili video downloader supporting 8K, batch downloads, and a toolbox of utilities
XML editor
Open Source Computer Vision Library
BoofCV is an open source Java library for real-time computer vision