Multi-modal large language model designed for audio understanding
Large-language-model & vision-language-model based on Linear Attention
OpenMMLab Model Deployment Framework
Framework that is dedicated to making neural data processing
Open Source Computer Vision Library
Library of self-supervised methods for visual representation
Real-time, incremental ETL library for ML with record-level depend
Scientific Visualisation Made Easy
Dataset of GPT-2 outputs for research in detection, biases, and more
AI-powered tool to quickly remove watermarks from videos flawlessly
Translate English to Bangla using CSV file format and range wise.
Serving LangChain LLM apps automagically with FastApi
A tool to create the analytical index of a manuscript
Suite with Real-ESRGAN, BSRGAN , RealESRNet, IRCNN, GFPGAN & RIFE.
Database system for building simpler and faster AI-powered application
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Code repo for "WebArena to build Autonomous Agents
A Customizable Image-to-Video Model based on HunyuanVideo
Evaluation code for various unsupervised automated metrics
ML based QSAR Modelling And Translation of Model to Deployable WebApps
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Uma Ferramenta Computacional para Análise e Recuperação de Patentes
A Conversational Speech Generation Model
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Resources, corpora, and tools for Chinese natural language processing