On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don’t require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.
We’re going to need strong digital signatures on everything, and we need it fast, else we won’t be able to believe anything we see. It will be Steve Bannon’s “flood the zone with shit” dream come true.
That won’t help anything considering how easy it is to strip metadata.
I mean the opposite scenario, where if there’s no signature we assume it’s fake.
We’ve had email forgery and signatures to prevent it for decades, but barely anyone does that either.