mirror of https://github.com/TMElyralab/MuseTalk.git synced 2026-02-04 17:39:20 +08:00

Files

ShowLo 4f69a9cfdd Fix the bug that causes an infinite loop when the total number of frames in the video does not exceed 11.

eg, the video has 11 frames, when select the NO.6 frame, `while abs(random_element - img_idx) <= 5:` will result in an infinite loop

2024-09-19 17:09:35 +08:00

filelists

Update draft training codes

2024-04-28 18:04:22 +08:00

utils

modified dataloader.py and inference.py for training and inference

2024-06-03 11:09:12 +00:00

DataLoader.py

Fix the bug that causes an infinite loop when the total number of frames in the video does not exceed 11.

2024-09-19 17:09:35 +08:00

musetalk.json

Update draft training codes

2024-04-28 18:04:22 +08:00

README.md

fixed mltiple video data preperation

2024-06-17 18:39:15 +00:00

train.py

Fixed bug in train.py where pe was missing

2024-08-08 14:56:25 +08:00

train.sh

fixed mltiple video data preperation

2024-06-17 18:39:15 +00:00

README.md

Data preprocessing

Create two config yaml files, one for training and other for testing (both in same format as configs/inference/test.yaml) The train yaml file should contain the training video paths and corresponding audio paths The test yaml file should contain the validation video paths and corresponding audio paths

Run:

./data_new.sh train output train_video1.mp4 train_video2.mp4
./data_new.sh test output test_video1.mp4 test_video2.mp4

This creates folders which contain the image frames and npy files. This also creates train.json and val.json which can be used during the training.

Data organization

./data/
├── images
│     └──RD_Radio10_000
│         └── 0.png
│         └── 1.png
│         └── xxx.png
│     └──RD_Radio11_000
│         └── 0.png
│         └── 1.png
│         └── xxx.png
├── audios
│     └──RD_Radio10_000
│         └── 0.npy
│         └── 1.npy
│         └── xxx.npy
│     └──RD_Radio11_000
│         └── 0.npy
│         └── 1.npy
│         └── xxx.npy

Training

Simply run after preparing the preprocessed data

cd train_codes
sh train.sh #--train_json="../train.json" \(Generated in Data preprocessing step.)
            #--val_json="../val.json" \

Inference with trained checkpoit

Simply run after training the model, the model checkpoints are saved at train_codes/output usually

python -m scripts.finetuned_inference --inference_config configs/inference/test.yaml --unet_checkpoint path_to_trained_checkpoint_folder

TODO

release data preprocessing codes
release some novel designs in training (after technical report)