33 Commits

Author SHA1 Message Date
czk32611
b3c30c3561 Merge pull request #197 from ShowLo/patch-1
fix bug in preprocessing data
2024-09-23 13:27:15 +08:00
ShowLo
5bd772d7da fix bug in preprocessing data 2024-09-19 09:56:18 +08:00
czk32611
98f0e6f2b1 Fixed bug in train.py where pe was missing 2024-08-08 14:56:25 +08:00
czk32611
1de8261491 Merge pull request #85 from shounakb1/train_codes
initial data script
2024-08-06 18:49:07 +08:00
Shounak Banerjee
b968548131 fixed multiple video data preparation 2024-06-17 18:39:15 +00:00
Shounak Banerjee
af82f3b00f temporary commit to save changes 2024-06-13 14:14:52 +00:00
Shounak Banerjee
d74c4c098b clean code and separate finetuned_inference.py 2024-06-07 18:39:24 +00:00
Shounak Banerjee
b4a592d7f3 modified dataloader.py and inference.py for training and inference 2024-06-03 11:09:12 +00:00
czk32611
6d19f3c0c8 Remove crop_audio_window from DataLoader.py 2024-06-01 22:23:47 +08:00
shounak
7254ca6306 initial data script 2024-05-16 18:24:44 +00:00
czk32611
30dcd5237f Update train_codes/README.md 2024-04-30 15:10:03 +08:00
czk32611
d73daf1808 Update draft training codes 2024-04-28 18:04:22 +08:00
czk32611
6e32247cb1 <BUGFIX>: fix app.py (#47) 2024-04-27 14:57:13 +08:00
czk32611
865a68c60e <enhance>: support using float16 in inference to speed up 2024-04-27 14:55:54 +08:00
czk32611
2c52de01b4 Update README and LICENSE 2024-04-26 09:24:23 +08:00
czk32611
ef36109b06 <enhance>(inference): support using an image as video input(#17 #34) 2024-04-19 09:18:39 +08:00
itechmusic
f5579a95bd Fix fps calculation bug in realtime_inference.py (#35) 2024-04-18 18:03:58 +08:00
czk32611
0387c39a93 Add codes for real time inference 2024-04-18 12:05:22 +08:00
Oli_Zhan
955ca416ea Merge pull request #31 from tobycroft/main 2024-04-17 14:34:35 +08:00
gluttony-10
d611968ba3 Update requirements.txt 2024-04-16 22:36:04 +08:00
Tuuz
ae956e63e0 Update app.py
add force_download to make env build more stable
2024-04-16 19:32:25 +08:00
itechmusic
696ec5aa03 Update README.md 2024-04-16 15:45:01 +08:00
czk32611
4ab40470f7 Add gradio 2024-04-16 09:57:12 +08:00
itechmusic
a8a028690a Merge pull request #5 from hotea/main
fix some cuda related issue when run on M-Series Mac with cpu
2024-04-07 15:21:36 +08:00
itechmusic
a5114f1fcb Update README.md 2024-04-07 09:49:29 +08:00
=
9a5212c8dd fix some cuda related issue when run on M-Series Mac 2024-04-05 22:03:28 +08:00
zkangchen
bc1379abad <enhance>: modified inference codes
1. Can set bbox_shift in configs/inference/test.yaml
2. Do not need to pip install whisper now
2024-04-03 14:56:17 +08:00
czk32611
dde2ee49ef Update demo 2024-04-03 12:32:16 +08:00
itechmusic
6b5fdf2d76 Update README.md 2024-04-03 12:30:36 +08:00
itechmusic
578325be30 Update README.md 2024-04-03 10:57:23 +08:00
itechmusic
4c9c634fc4 Update README.md 2024-04-03 10:44:25 +08:00
phighting
4bb0398b53 Update README.md 2024-04-02 18:04:57 +08:00
czk32611
d861498ae3 initial_commit 2024-04-02 17:56:58 +08:00