Ferret

Ferret

Apple
HunyuanVideo-Avatar

HunyuanVideo-Avatar

Tencent-Hunyuan
+
+

Related Products

  • Gemini Enterprise Agent Platform
    961 Ratings
    Visit Website
  • Google AI Studio
    12 Ratings
    Visit Website
  • LM-Kit.NET
    28 Ratings
    Visit Website
  • DocketManager
    31 Ratings
    Visit Website
  • KrakenD
    71 Ratings
    Visit Website
  • Stigg
    25 Ratings
    Visit Website
  • SciSure
    298 Ratings
    Visit Website
  • Budgyt
    282 Ratings
    Visit Website
  • ChatD&B
    Visit Website
  • SiteMinder
    257 Ratings
    Visit Website

About

An End-to-End MLLM that Accept Any-Form Referring and Ground Anything in Response. Ferret Model - Hybrid Region Representation + Spatial-aware Visual Sampler enable fine-grained and open-vocabulary referring and grounding in MLLM. GRIT Dataset (~1.1M) - A Large-scale, Hierarchical, Robust ground-and-refer instruction tuning dataset. Ferret-Bench - A multimodal evaluation benchmark that jointly requires Referring/Grounding, Semantics, Knowledge, and Reasoning.

About

HunyuanVideo‑Avatar supports animating any input avatar images to high‑dynamic, emotion‑controllable videos using simple audio conditions. It is a multimodal diffusion transformer (MM‑DiT)‑based model capable of generating dynamic, emotion‑controllable, multi‑character dialogue videos. It accepts multi‑style avatar inputs, photorealistic, cartoon, 3D‑rendered, anthropomorphic, at arbitrary scales from portrait to full body. Provides a character image injection module that ensures strong character consistency while enabling dynamic motion; an Audio Emotion Module (AEM) that extracts emotional cues from a reference image to enable fine‑grained emotion control over generated video; and a Face‑Aware Audio Adapter (FAA) that isolates audio influence to specific face regions via latent‑level masking, supporting independent audio‑driven animation in multi‑character scenarios.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

AI and LLM developers

Audience

Researchers and developers in AI-driven animation looking for a tool to generate emotion‑aligned, multi-character audio‑driven avatar videos

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apple
Founded: 1976
United States
github.com/apple/ml-ferret

Company Information

Tencent-Hunyuan
United States
github.com/Tencent-Hunyuan/HunyuanVideo-Avatar

Alternatives

Selene 1

Selene 1

atla

Alternatives

AvatarFX

AvatarFX

Character.AI
GLM-4.5V

GLM-4.5V

Zhipu AI
Qwen2.5-Max

Qwen2.5-Max

Alibaba

Categories

Categories

Integrations

Gradio

Integrations

Gradio
Claim Ferret and update features and information
Claim Ferret and update features and information
Claim HunyuanVideo-Avatar and update features and information
Claim HunyuanVideo-Avatar and update features and information