Skip to content

Adding fine-grained VAD params to Sarvam saaras:v3 STT plugin#5563

Open
dhruvladia-sarvam wants to merge 1 commit intolivekit:mainfrom
dhruvladia-sarvam:feat/VAD-params-sarvam-stt
Open

Adding fine-grained VAD params to Sarvam saaras:v3 STT plugin#5563
dhruvladia-sarvam wants to merge 1 commit intolivekit:mainfrom
dhruvladia-sarvam:feat/VAD-params-sarvam-stt

Conversation

@dhruvladia-sarvam
Copy link
Copy Markdown
Contributor

The Sarvam STT WebSocket endpoint for saaras:v3 now accepts granular
VAD tuning parameters. Add support for: positive_speech_threshold,
negative_speech_threshold, min_speech_frames, first_turn_min_speech_frames,
negative_frames_count, negative_frames_window, start_speech_volume_threshold,
interrupt_min_speech_frames, pre_speech_pad_frames, and
num_initial_ignored_frames.

These are gated behind a new supports_vad_params flag in ModelConfig,
set to true only for saaras:v3. Parameters flow through STT.init,
stream(), and into the WebSocket query string via _build_websocket_url.
All are optional and only sent when explicitly provided.

Copy link
Copy Markdown
Contributor

@devin-ai-integration devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 2 additional findings.

Open in Devin Review

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant