Topic Modelling for Humans
Generate short videos with one click using AI LLM
Innovative user interfaces made easy
Stable-diffusion-webui-pixelization
Chat & pretrained large audio language model proposed by Alibaba Cloud
Talk to Your AI Agents from Anywhere
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Diff JSON and JSON-like structures in Python
The Multi-Agent Framework
A nearly-live implementation of OpenAI's Whisper
Code repository for PDFStitcher, a utility to stitch together PDFs
High-quality multi-lingual text-to-speech library by MyShell.ai
A Python visual Flow Based Programming library
Large Audio Language Model built for natural interactions
A simple tool for reading in poorly redacted documents
Automatic SSRF fuzzer and exploitation tool
LaTeX CV generator from a YAML/JSON input file
Tool for visualizing and tracking your machine learning experiments
State-of-the-art (SoTA) text-to-video pre-trained model
This repository contains the official implementation of FastVLM
Web application fuzzer
An experimental version of DeepSeek model
GPT4V-level open-source multi-modal model based on Llama3-8B
An open sourced end-to-end VLM-based GUI Agent
Designed for text embedding and ranking tasks