What We Learned Building a Self-Hosted Speech Translation Platform

dhs@lemmy.world · 6 days ago

What We Learned Building a Self-Hosted Speech Translation Platform

artifex@piefed.social · 5 days ago

This is a pretty interesting project! Assuming one wanted to run everything locally, what’s the minimum viable hardware stack for near-realtime performance?

dhs@lemmy.world · 20 hours ago

Thanks! We’re still benchmarking different setups, so I don’t want to give a misleading “minimum spec” number yet. In practice, the hardware requirements depend much more on the STT/translation/TTS models you choose than on PolyTalk itself. For a single-user setup, you don’t necessarily need expensive hardware. As you push for lower latency, larger models, or multiple simultaneous streams, the requirements increase pretty quickly. Proper hardware benchmarks are something we plan to publish once we’ve tested a wider range of configurations.