总体难度
整体属于中等复杂度:有清晰的官方安装指引,核心依赖为Python 3.9、PyTorch 2.0.1 + CUDA 11.7、PyTorch3D与MMCV,并需准备模型权重与数据。对熟悉 conda、pip 与 GPU 环境的用户,按步骤操作通常可在30–90分钟内完成;若遇到系统与驱动不匹配、编译或依赖冲突,排障成本会明显上升。
关键步骤概览
git clone https://github.com/yerfor/GeneFacePlusPlus.git
conda create -n geneface python=3.9
conda activate geneface
conda install conda-forge::ffmpeg
conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.7 -c pytorch -c nvidia
pip install "git+https://github.com/facebookresearch/pytorch3d.git@stable"
pip install cython openmim==0.3.9
mim install mmcv==2.1
pip install -r requirements.txt
export PYTHONPATH=$(pwd):$PYTHONPATH
python inference/genefacepp_infer.py --a2m_ckpt checkpoints/audio2motion_vae --head_ckpt checkpoints/motion2video_nerf/may_head --torso_ckpt checkpoints/motion2video_nerf/may_torso --drv_aud data/raw/val_wavs/example.wav --out_name demo_output.mp4
常见难点与规避
降低难度的实用建议