Alexa vs Black Mirror
www.washingtonpost.com/nation/…d-people
Most likely it will be dropped, just like Google Duplex was, but the tech to make this fairly seamless kind of exists. The key barriers are shit audio quality and the fact that TTS lacks emotion and long-term coherence.
With 50-100 recordings it is definitely possible if you pour a lot of server-side compute into it. Basically, there is a trade-off between the size of your model and the amount of voice data required: the bigger the model, the less voice data you need.
This is how it sounded for my grandparents without large models and large compute:
- soundcloud.com/alexand…nts-demo
The reason I am sharing this is to show how little capital cares about ethics, despite all the AI ethics BS. Also, the reason OECD consumers flooded their homes with such devices eludes me. Good thing the market looks to have cooled a bit.