Ok, so this concept is cool, but has a few problems…
- Privacy: this is far too complex to run on the headphones themselves, so the system will need to connect to a server to do the heavy lifting. What happens to the data once it is used? For legal purposes I suspect it will need to be saved, meaning that anything recorded could be analyzed or monitored.
- Trust: AI models have rules in place to make them act in specific ways. The owner of the AI system could tweak it to change what is spoken or how it is said, which could push political agendas in everyday conversations.
- Reduced language skills: an AI like this would reduce the incentive to learn another language, reducing people’s direct international communication and increasing dependency on the AI service, which would further reduce our language skills.
This is scary…
lakemalcom10@lemm.ee 2 days ago
For point 1, they actually addressed that: The system then translates the speech and maintains the expressive qualities and volume of each speaker’s voice while running on a device, such as mobile devices with an Apple M2 chip like laptops and Apple Vision Pro. (The team avoided using cloud computing because of the privacy concerns with voice cloning.) Finally, when speakers move their heads, the system continues to track the direction and qualities of their voices as they change.
stoy@lemmy.zip 2 days ago
If that is enough power, and you can run it without any internet access, then yes, it would probably address point 1.
Ilovethebomb@lemm.ee 1 day ago
The fact that all this can run on a phone is incredible; this sounds very processor-intensive.
I wonder what it would do to your battery life?