mirror of
https://github.com/TMElyralab/MuseTalk.git
synced 2026-02-05 09:59:18 +08:00
Update train_codes/README.md
This commit is contained in:
@@ -2,6 +2,10 @@
|
||||
|
||||
We provde the draft training codes here. Unfortunately, data preprocessing code is still being reorganized.
|
||||
|
||||
## Setup
|
||||
|
||||
We trained our model on an NVIDIA A100 with `batch size=8, gradient_accumulation_steps=4` for 20w+ steps. Using multiple GPUs should accelerate the training.
|
||||
|
||||
## Data preprocessing
|
||||
You could refer the inference codes which [crop the face images](https://github.com/TMElyralab/MuseTalk/blob/main/scripts/inference.py#L79) and [extract audio features](https://github.com/TMElyralab/MuseTalk/blob/main/scripts/inference.py#L69).
|
||||
|
||||
@@ -32,4 +36,8 @@ Finally, the data should be organized as follows:
|
||||
Simply run after preparing the preprocessed data
|
||||
```
|
||||
sh train.sh
|
||||
```
|
||||
```
|
||||
|
||||
## TODO
|
||||
- [ ] release data preprocessing codes
|
||||
- [ ] release some novel designs in training (after technical report)
|
||||
Reference in New Issue
Block a user