GUI for a Vocal Remover that uses Deep Neural Networks
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Context engineering is the new vibe coding
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An extremely simple tool for separating vocals and background music
A 50 million tokens corpus of Classical Arabic.
Efficient 8B multimodal model tuned for advanced reasoning tasks.
High-precision 14B multimodal model built for advanced reasoning tasks
Compact 3B-param multimodal model for efficient on-device reasoning
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research