From Vibe Coding to Agentic Engineering
LTX-Video Support for ComfyUI
Contexts Optical Compression
Open Source Speech Language Model
OCR expert VLM powered by Hunyuan's native multimodal architecture
Encoder of greater-than-word length text trained on a variety of data
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Software that can generate photos from paintings
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
Small 3B-base multimodal model ideal for custom AI on edge hardware