Are there any legal issues recreating YouTube SponsorBlock for Podcasts?

z3rOR0ne@lemmy.ml · 7 months ago

Are there any legal issues recreating YouTube SponsorBlock for Podcasts?

SomeoneSomewhere@lemmy.nz · 7 months ago

You definitely would have legal issues redistributing the ad-free version.

Sponsor block works partly because it simply automates something the user is already allowed to do - it’s legally very safe. No modification or distribution of the source file is necessary, only some metadata.

It’s an approach that works against the one-off sponsorships read by the actual performers, but isn’t effective against ads dynamically inserted by the download server.

One option could be to crowdsource a database of signatures of audio ads, Shazam style. This could then be used by software controlled by the user (c.f. SB browser extension) to detect the ads and skip them, or have the software cut the ads out of files the user had legitimately downloaded, regardless of which podcast or where the ads appear.

Sponsorships by the actual content producers could then be handled in the same way as SB: check the podcast ID and total track length is right (to ensure no ads were missed) then flag and skip certain timestamps.

z3rOR0ne@lemmy.ml · 7 months ago

One option could be to crowdsource a database of signatures of audio ads, Shazam style. This could then be used by software controlled by the user (c.f. SB browser extension) to detect the ads and skip them, or have the software cut the ads out of files the user had legitimately downloaded, regardless of which podcast or where the ads appear.

That is one of the more unique ideas presented thus far. The other similar approach would be utilizing a trained AI model that would recognize advertisements and sponsor mentions. I’m not exactly sure how Shazam works, but that might be something to research in figuring out how best to approach this. Thanks.

SomeoneSomewhere@lemmy.nz · 7 months ago

Yeah, I have no idea either, but it’s been around for more than a decade so it should be fairly easy to find a library that duplicates it.

I would be wary of AI-based solutions. There’s a risk of it picking up e.g. satirical/spoof sponsorships as actual ads, and perhaps not detecting unusual ads.

I’m slightly terrified of the day someone starts getting AI to reword and read out individual ads for each stream.

z3rOR0ne@lemmy.ml · 7 months ago

Perhaps that would be a good first step then. Figure out how Shazam works, then create a standalone application that catalogues and recognizes the audio of advertisements. An obvious name for such an app would be along the lines of “IsAnAd?”. Then hook that standalone application up to a podcast aggregation client and use the timestamps of that to create the desired sponsor block functionality.

Thanks again. Just hashing this out with others like yourself has been super helpful.