Quote:
Originally Posted by rcooled
The funny part of this - for me at least - is that I’ve been working on some code that listens to speech, transcribes it, and then calls an agent (to do useful things) when it detects commands in the stream.
For the POC I was using “HAL” as a trigger word to denote that what follows is a command. Yes, I’m old. And geeky.
Sadly, speech-to-text kept transcribing it as “How”, and it got so frustrating that I switched to “Agent”. It’s quite surprising how easy it is to get this all working when you are used to writing your own I/O drivers, but even with all this abstraction and sophistication, speech-to-text still isn’t 100% accurate...
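
For anyone curious what the trigger-word step might look like, here is a minimal sketch in Python. It is not the poster's actual code: the function name `extract_commands`, the regex approach, and the rule that a command runs to the end of the sentence are all assumptions for illustration. It only scans text that has already been transcribed and does no audio capture or speech recognition itself.

```python
import re

# Hypothetical trigger word; the post describes switching from "HAL" to "Agent".
TRIGGER = "agent"


def extract_commands(transcript, trigger=TRIGGER):
    """Return the text following each occurrence of the trigger word.

    Assumption: a command runs from the trigger word to the end of the
    sentence (the next '.', '?' or '!') or to the end of the transcript.
    """
    pattern = re.compile(
        rf"\b{re.escape(trigger)}\b[,:]?\s*([^.?!]*)",
        re.IGNORECASE,
    )
    commands = []
    for match in pattern.finditer(transcript):
        command = match.group(1).strip()
        if command:  # ignore a bare trigger word with nothing after it
            commands.append(command)
    return commands


if __name__ == "__main__":
    # Each extracted command would be handed to the agent; here we just print it.
    for cmd in extract_commands("So anyway. Agent, turn on the lights. Thanks."):
        print("command:", cmd)
```

The failure mode from the post shows up directly: if the recogniser writes “How” instead of “HAL”, a matcher looking for “HAL” never fires, and adding “how” as an alias would trigger on ordinary speech. A less common word like “Agent” sidesteps that.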