I am looking to capture the audio of a VoIP call made with ZOIPER on my desktop and stream it in real time to a cloud speech-to-text WebSocket API. Is this possible?
For a proof-of-concept project, we need real-time capture of a VoIP phone conversation made with ZOIPER 3. The necessary steps are:
- Detect call events (outgoing calls, incoming calls or, ideally, both)
- Start capturing the call audio from both sides of the conversation, without interrupting or disrupting the call
- Send the audio stream to the WebSocket API
- Collect the transcribed text returned by the API
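
To make the last two steps concrete, here is a rough sketch of the streaming part as I imagine it. It is not a working integration: the audio format (16 kHz, 16-bit mono PCM), the 100 ms chunk size, the empty-frame end-of-stream marker and the `FakeSpeechApi` class are all assumptions of mine. `FakeSpeechApi` is a hypothetical in-memory stand-in for the real WebSocket connection, and the capture of the ZOIPER audio itself is not covered.

```python
import asyncio
from typing import AsyncIterator, Iterable, List

SAMPLE_RATE = 16000    # assumption: 16 kHz mono audio
BYTES_PER_SAMPLE = 2   # assumption: 16-bit signed PCM
FRAME_MS = 100         # assumption: send audio in 100 ms chunks


def frame_size_bytes(sample_rate: int = SAMPLE_RATE, frame_ms: int = FRAME_MS) -> int:
    """Number of bytes in one chunk of the given duration."""
    return sample_rate * frame_ms // 1000 * BYTES_PER_SAMPLE


def split_frames(pcm: bytes, size: int) -> List[bytes]:
    """Cut a PCM buffer into chunks of at most `size` bytes."""
    return [pcm[i:i + size] for i in range(0, len(pcm), size)]


class FakeSpeechApi:
    """Hypothetical stand-in for the WebSocket connection to the speech API."""

    def __init__(self) -> None:
        self._results: asyncio.Queue = asyncio.Queue()
        self.bytes_received = 0

    async def send(self, frame: bytes) -> None:
        if frame:
            self.bytes_received += len(frame)
            await self._results.put(f"partial after {self.bytes_received} bytes")
        else:
            # Assumption: an empty frame marks end of stream; real APIs differ.
            await self._results.put(None)

    async def results(self) -> AsyncIterator[str]:
        while True:
            item = await self._results.get()
            if item is None:
                return
            yield item


async def stream_call(frames: Iterable[bytes], api: FakeSpeechApi) -> List[str]:
    """Send audio chunks while collecting text results concurrently."""
    transcripts: List[str] = []

    async def collect() -> None:
        async for text in api.results():
            transcripts.append(text)

    collector = asyncio.create_task(collect())
    for frame in frames:
        await api.send(frame)
    await api.send(b"")  # signal end of stream (assumed convention)
    await collector
    return transcripts


async def demo() -> None:
    api = FakeSpeechApi()
    one_second_of_silence = bytes(SAMPLE_RATE * BYTES_PER_SAMPLE)
    frames = split_frames(one_second_of_silence, frame_size_bytes())
    for line in await stream_call(frames, api):
        print(line)


if __name__ == "__main__":
    asyncio.run(demo())
```

In a real setup, the `frames` iterable would presumably be fed from whatever captures the call audio (for example a loopback or virtual audio device), and `FakeSpeechApi` would be replaced by an actual WebSocket client. Sending and receiving run concurrently so that partial results can arrive while the call is still in progress.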
Thanks for any info...