Ahoy m@tes, the scraping bot situation has been escalating recently, as you may have already noticed from the recent site instability and 5xx error responses. @tenchiken@anarchist.nexus has been scrambling to block new scraping subnets as they appear, but these assholes keep jumping providers, so it’s been an endless loop of constant firefighting.
I finally had enough and decided to onboard a Proof-of-Work countermeasure, very much like Anubis, which has been very popular on the fediverse lately. However, I went with Haphash, which was designed specifically around haproxy (our reverse proxy of choice) and is hopefully much more lightweight.
The new PoW shield has already been activated on both Divisions by Zero and on Fediseer. It’s not active on all URLs, but it should be protecting those which have the most impact on our database, which is what was causing the actual issue. You should notice a quick loading screen on occasion while it’s verifying you.
We’ve already seen a significant reduction in 5xx HTTP errors, as well as a slight reduction in traffic, so we’re hoping this will have a good impact on our situation.
Please do let us know if you run into any issues, and also let us know if you feel any difference in responsiveness. The first m@tes already feel it’s all snappier, but that may just be placebo.
And let’s hope the next scraping wave is not pwned residential botnets, or we’re all screwed >_<


Proof of work means that your client has to do some “work” in order to gain access. It typically means a challenge that can’t be trivially solved, but can be trivially verified.
For example, the challenge may be something to the effect of:
“Give me a string that, when hashed with md5, results in a hash that ends in 1234”.
Your browser can then start bruteforcing until it finds a string (should take a few seconds max), and then it can pass the string back to the server. The server can verify with a single hash, and you’re in.
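For the curious, here’s a minimal toy sketch of that flow in Python. It follows the md5-suffix example above; the actual algorithms Anubis and Haphash use differ, and the names and suffix here are made up purely for illustration.

```python
# Toy proof-of-work sketch based on the md5 example above.
# Real systems (Anubis, Haphash) use different hashes and difficulty
# encodings; this is just the general idea.
import hashlib
from itertools import count

SUFFIX = "1234"  # challenge issued by the server

def solve(challenge_suffix: str) -> str:
    """Client side: brute-force a nonce whose md5 ends in the suffix."""
    for n in count():
        candidate = f"nonce-{n}"
        if hashlib.md5(candidate.encode()).hexdigest().endswith(challenge_suffix):
            return candidate

def verify(candidate: str, challenge_suffix: str) -> bool:
    """Server side: a single hash is enough to check the answer."""
    return hashlib.md5(candidate.encode()).hexdigest().endswith(challenge_suffix)

answer = solve(SUFFIX)         # takes many attempts (the "work")
assert verify(answer, SUFFIX)  # verified instantly
```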
It’s not wildly different from crypto mining, but the difficulty is much lower for anti-bot use, as it needs to be solvable in seconds even by low-end devices.
What stops the bots from just solving it?
Two things: First, bots typically don’t run JavaScript. No JS, no entry. A user can temporarily enable JS if they’re stuck on an endless loading screen. But a scraper won’t.
Second, the fact that they’d need to solve it for every single bot and every single site they scrape. It’s a low barrier for regular users, but it’s astronomical for scrapers who are running hundreds of thousands of bots.
Cost of electricity, for the most part. Having a scraper visit hundreds of URLs per second isn’t unheard of; adding this should reduce the speed of the same scraper by 30-70%, depending on the request.
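A rough back-of-envelope, with made-up numbers (a ~2-second challenge and one challenge per scraped page; the real figures depend on the difficulty setting and how long a solved challenge stays valid):

```python
# Back-of-envelope only: both numbers below are illustrative guesses,
# not Haphash's actual settings.
challenge_seconds = 2        # CPU time to solve one challenge
human_challenges = 1         # a person solves it once, then browses
scraper_pages = 100_000      # a modest crawl, one challenge per page

print(challenge_seconds * human_challenges, "s of work for a person")         # 2 s
print(challenge_seconds * scraper_pages / 3600, "CPU-hours for the scraper")  # ~55.6
```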
Funny, HTTPS is computationally expensive for similar reasons, but I guess this system works across sessions, with a front-loaded cost.
I think they’re on different scales; there’s no brute-forcing involved in HTTPS/SSL.
I guess the bots don’t know how to rainbow table.
It’s usually designed so that you can’t rainbow-table it.
That can’t be rainbow-tabled, as the server can force a different salt.
(Note: I don’t know the exact algorithms involved, just the general theory.)
You can uniquely salt every request trivially, so rainbow tables are effectively useless.
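Continuing the md5 toy example from above (again, not the actual Haphash scheme), per-request salting looks something like this: the salt is random for every challenge, so a precomputed table of hashes can’t be reused.

```python
# Sketch of per-request salting for the toy md5 challenge above.
# The salt changes every time, so a rainbow table of fixed inputs is useless.
import hashlib
import secrets
from itertools import count

def new_challenge(suffix: str = "1234") -> tuple[str, str]:
    """Server: issue a fresh random salt alongside the target suffix."""
    return secrets.token_hex(16), suffix

def solve(salt: str, suffix: str) -> str:
    """Client: brute-force a nonce for *this* salt; nothing can be precomputed."""
    for n in count():
        if hashlib.md5(f"{salt}:{n}".encode()).hexdigest().endswith(suffix):
            return str(n)

def verify(salt: str, suffix: str, nonce: str) -> bool:
    """Server: one hash to check the client's answer against its own salt."""
    return hashlib.md5(f"{salt}:{nonce}".encode()).hexdigest().endswith(suffix)

salt, suffix = new_challenge()
nonce = solve(salt, suffix)
assert verify(salt, suffix, nonce)
```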