Gemma open-weight LLM library, from Google DeepMind
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Phi-3.5 for Mac: Locally-run Vision and Language Models
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Qwen2.5-VL is the multimodal large language model series
Code and models for ICML 2024 paper, NExT-GPT
A Pioneering Open-Source Alternative to GPT-4o
GPT4V-level open-source multi-modal model based on Llama3-8B
LangChain Apps in Production with Jina & FastAPI