python curses module free download

HunyuanVideo-Avatar

Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

HunyuanVideo-Avatar is a multimodal diffusion transformer (MM-DiT) model by Tencent Hunyuan for animating static avatar images into dynamic, emotion-controllable, and multi-character dialogue videos, conditioned on audio. It addresses challenges of motion realism, identity consistency, and emotional alignment. Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces, enabling...

Downloads: 2 This Week

Last Update: 2025-10-16

See Project

HunyuanCustom

Multimodal-Driven Architecture for Customized Video Generation

... reinforcement and modality-specific condition injection. Text-image fusion module based on LLaVA for improved multimodal understanding. Applicable to single- and multi-subject scenarios, video editing/replacement, singing avatars etc.

Downloads: 2 This Week

Last Update: 2025-10-15

See Project

VisualGLM-6B

Chinese and English multimodal conversational language model

VisualGLM-6B is an open-source multimodal conversational language model developed by ZhipuAI that supports both images and text in Chinese and English. It builds on the ChatGLM-6B backbone, with 6.2 billion language parameters, and incorporates a BLIP2-Qformer visual module to connect vision and language. In total, the model has 7.8 billion parameters. Trained on a large bilingual dataset — including 30 million high-quality Chinese image-text pairs from CogView and 300 million English pairs...

Downloads: 0 This Week

Last Update: 2 days ago

See Project

Perception Models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models

Perception Models is a state-of-the-art framework developed by Facebook Research for advanced image and video perception tasks. It introduces two primary components: the Perception Encoder (PE) for visual feature extraction and the Perception Language Model (PLM) for multimodal decoding and reasoning. The PE module is a family of vision encoders designed to excel in image and video understanding, surpassing models like SigLIP2, InternVideo2, and DINOv2 across multiple benchmarks. Meanwhile, PLM...

Downloads: 0 This Week

Last Update: 2 days ago

See Project

Video Pre-Training

Learning to Act by Watching Unlabeled Online Videos

.... for building houses or early-game tasks), and inference scripts that instantiate agents from pretrained weights. Key modules include the behavioral cloning logic, the agent wrapper, and data loading pipelines (with an accessible skeleton for loading Minecraft demonstration data). The repo also includes a run_agent.py script for testing an agent interactively, and an agent.py module encapsulating the control logic.

Downloads: 1 This Week

Last Update: 2025-10-03

See Project

Search Results for "python curses module"

Showing 5 open source projects for "python curses module"

HunyuanVideo-Avatar

HunyuanCustom

VisualGLM-6B

Perception Models

Video Pre-Training

Search Results for "python curses module"

Showing 5 open source projects for "python curses module"

HunyuanVideo-Avatar

HunyuanCustom

VisualGLM-6B

Perception Models

Video Pre-Training

Related Categories