Refactor CUDA stream context management in CosyVoice2Model

- Replaced "with torch.cuda.stream(stream_context):" with "with stream_context:", entering the object taken from stream_context_pool directly, for improved clarity and performance during inference.
- This change simplifies the stream handling code while maintaining efficient resource utilization.
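
The one-line change only makes sense if the objects held in stream_context_pool are already context managers. The sketch below illustrates that pattern under stated assumptions: the helper names (make_stream_context, build_pool, run_with_pooled_stream), the nullcontext fallback for machines without CUDA, and the step that returns the context to the pool are hypothetical and do not appear in this diff. It is not the actual CosyVoice2Model code.

```python
import queue
from contextlib import nullcontext

try:
    import torch
    _HAS_CUDA = torch.cuda.is_available()
except ImportError:  # torch is optional for this sketch
    torch = None
    _HAS_CUDA = False


def make_stream_context():
    """Build a context manager that selects a dedicated CUDA stream.

    Assumption: without CUDA we fall back to a no-op context so the
    same calling code still runs.
    """
    if _HAS_CUDA:
        # torch.cuda.stream(...) already returns a context manager,
        # so callers must not wrap it in torch.cuda.stream again.
        return torch.cuda.stream(torch.cuda.Stream())
    return nullcontext()


def build_pool(size):
    """Pre-create `size` stream contexts and store them in a thread-safe queue."""
    pool = queue.Queue()
    for _ in range(size):
        pool.put(make_stream_context())
    return pool


def run_with_pooled_stream(pool, fn):
    """Take a context from the pool, run fn inside it, then return it (hypothetical)."""
    stream_context = pool.get()
    try:
        with stream_context:  # entered directly, as in the new line of the diff
            return fn()
    finally:
        pool.put(stream_context)
```

Called as run_with_pooled_stream(build_pool(2), some_inference_fn), each call borrows one pre-built context and hands it back afterwards, so the pool size bounds how many inferences hold a stream at once.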
2026-02-05 18:09:24 +08:00 · 2025-04-16 16:04:40 +08:00
parent 369f3c2c18
commit e8a26827ae
1 changed file with 1 addition and 1 deletion
--- a/cosyvoice/cli/model.py
+++ b/cosyvoice/cli/model.py
@@ -385,7 +385,7 @@ class CosyVoice2Model(CosyVoiceModel):
        # this_uuid is used to track variables related to this inference thread
        self.synchronize_stream()
        stream_context = self.stream_context_pool.get()
-        with torch.cuda.stream(stream_context):
+        with stream_context:

            this_uuid = str(uuid.uuid1())
            with self.lock: