Back to Models
PI

pipecat-ai/smart-turn-v3

pipecat-aigeneral

Smart Turn v3.x

Smart Turn is an open‑source semantic Voice Activity Detection (VAD) model that tells you whether a speaker has finished their turn by analysing the raw waveform, not the transcript.

Links

Model architecture

  • Backbone: Whisper Tiny encoder
  • Head: shallow linear classifier
  • Params: 8M
  • Checkpoint: 8 MB ONNX (int8 quantized), 32MB ONNX (unquantized)

How to use

Please see the blog post and GitHub repo for more information on using the model, either standalone or with Pipecat.

Thanks

Thank you to the following organisations for contributing audio datasets:

Visit Website

0 reviews

5
0
4
0
3
0
2
0
1
0
Likes146
Downloads
📝

No reviews yet

Be the first to review pipecat-ai/smart-turn-v3!

Model Info

Providerpipecat-ai
Categorygeneral
Reviews0
Avg. Rating / 5.0

Community

Likes146
Downloads

Rating Guidelines

★★★★★Exceptional
★★★★Great
★★★Good
★★Fair
Poor