Merge pull request #714 from Purfview/patch-4

Fix type hint for min_silence_at_max_speech (float -> int)
This commit is contained in:
Alexander Veysov
2025-11-05 08:44:00 +03:00
committed by GitHub

View File

@@ -223,7 +223,7 @@ def get_speech_timestamps(audio: torch.Tensor,
progress_tracking_callback: Callable[[float], None] = None, progress_tracking_callback: Callable[[float], None] = None,
neg_threshold: float = None, neg_threshold: float = None,
window_size_samples: int = 512, window_size_samples: int = 512,
min_silence_at_max_speech: float = 98, min_silence_at_max_speech: int = 98,
use_max_poss_sil_at_max_speech: bool = True): use_max_poss_sil_at_max_speech: bool = True):
""" """
@@ -272,7 +272,7 @@ def get_speech_timestamps(audio: torch.Tensor,
neg_threshold: float (default = threshold - 0.15) neg_threshold: float (default = threshold - 0.15)
Negative threshold (noise or exit threshold). If model's current state is SPEECH, values BELOW this value are considered as NON-SPEECH. Negative threshold (noise or exit threshold). If model's current state is SPEECH, values BELOW this value are considered as NON-SPEECH.
min_silence_at_max_speech: float (default - 98ms) min_silence_at_max_speech: int (default - 98ms)
Minimum silence duration in ms which is used to avoid abrupt cuts when max_speech_duration_s is reached Minimum silence duration in ms which is used to avoid abrupt cuts when max_speech_duration_s is reached
use_max_poss_sil_at_max_speech: bool (default - True) use_max_poss_sil_at_max_speech: bool (default - True)