Hunyuan Video Avatar FP8

Recent years have witnessed significant progress in audio-driven human animation. However, critical challenges remain in (i) generating highly dynamic videos while preserving character consistency, (ii) achieving precise emotion alignment between characters and audio, and (iii) enabling multi-character audio-driven animation. To address these challenges, we propose HunyuanVideo-Avatar, a multimodal diffusion transformer (MM-DiT)-based model capable of simultaneously generating dynamic, emotion-controllable, and multi-character dialogue videos. Concretely, HunyuanVideo-Avatar introduces three key innovations: (i) A character image injection module is designed to replace the conventional addition-based character conditioning scheme, eliminating the inherent condition mismatch between training and inference. This ensures dynamic motion and strong character consistency; (ii) An Audio Emotion Module (AEM) is introduced to extract and transfer emotional cues from an emotion reference image to the target generated video, enabling fine-grained and accurate emotion style control; (iii) A Face-Aware Audio Adapter (FAA) is proposed to isolate the audio-driven character with a latent-level face mask, enabling independent audio injection via cross-attention for multi-character scenarios. These innovations empower HunyuanVideo-Avatar to surpass state-of-the-art methods on benchmark datasets and a newly proposed in-the-wild dataset, generating realistic avatars in dynamic, immersive scenarios. The source code and model weights will be released publicly.
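
The FAA described above gates audio-to-video cross-attention with a face mask in latent space, so that in multi-character scenes each character is driven only by its own audio track. Below is a minimal PyTorch sketch of that masked cross-attention idea; the class name, tensor shapes, and gating scheme are illustrative assumptions, not the released implementation.

```python
# Minimal sketch of face-masked audio cross-attention (an assumption, not the
# official HunyuanVideo-Avatar code). Video latent tokens attend to audio
# features, and the update is gated by a latent-level face mask so only the
# driven character's face region receives the audio signal.
import torch
import torch.nn as nn

class FaceAwareAudioAdapter(nn.Module):
    def __init__(self, latent_dim: int, audio_dim: int, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(
            embed_dim=latent_dim, num_heads=num_heads,
            kdim=audio_dim, vdim=audio_dim, batch_first=True)
        self.norm = nn.LayerNorm(latent_dim)

    def forward(self, latents, audio_feat, face_mask):
        # latents:    (B, N, C) flattened video latent tokens
        # audio_feat: (B, T, D) per-frame audio embeddings
        # face_mask:  (B, N, 1) 1 inside the target character's face, else 0
        out, _ = self.attn(self.norm(latents), audio_feat, audio_feat)
        # Gate the audio-driven update with the face mask so the latents of
        # other characters (and the background) are left untouched.
        return latents + face_mask * out

# Illustrative usage: 1024 latent tokens, 32 audio frames.
adapter = FaceAwareAudioAdapter(latent_dim=128, audio_dim=768)
latents = torch.randn(1, 1024, 128)
audio = torch.randn(1, 32, 768)
mask = torch.zeros(1, 1024, 1)
mask[:, :256] = 1.0  # tokens covering one character's face
out = adapter(latents, audio, mask)  # (1, 1024, 128)
```

For a multi-character scene, such an adapter would be applied once per character, each time with that character's own face mask and audio features, which is what makes the audio injection independent per character.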

This model was reposted from an external source (original file: https://huggingface.co/tencent/HunyuanVideo-Avatar/resolve/main/ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states_fp8.pt). If the original author objects to this repost, they may file an appeal; within 24 hours we will edit, delete, or transfer the model to the original author as requested. We sincerely welcome the original author to join this site and help build a community for learning and sharing AI image generation.
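
The checkpoint can also be fetched directly from the Hugging Face repository named above. A minimal sketch using the huggingface_hub client follows; the repo ID and file path are taken from the URL in the notice, while the surrounding script is illustrative.

```python
# Sketch: download the FP8 checkpoint referenced above via huggingface_hub
# (pip install huggingface_hub). The repo ID and filename come from the
# source URL; everything else here is illustrative.
from huggingface_hub import hf_hub_download

ckpt_path = hf_hub_download(
    repo_id="tencent/HunyuanVideo-Avatar",
    filename=(
        "ckpts/hunyuan-video-t2v-720p/transformers/"
        "mp_rank_00_model_states_fp8.pt"
    ),
)
print(ckpt_path)  # local path of the cached weights
```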

Model Info

Model type: Checkpoint
Base model: Other
