Show HN: Motie – Replit for Web Scraping

4 pointsposted 2 months ago
by jb_hn

4 Comments

theanonymousone

2 months ago

> we’ve noticed a very long tail of websites that don’t require proxies

That tail seems to be getting harshly slaughtered by Cloudflare.

jb_hn

2 months ago

Good point – we’ve definitely noticed a lot more Cloudflare representation these days. That said, there seems to be tiers in terms of the protection they offer (and thus the protection used by the websites in this long-tail), where lower tiers (so far) haven’t required proxies.

Curious if you’ve noticed any particularly well defined, obscure websites? Would love to take a look if so.

xmcp123

2 months ago

Ya know, I was ready to downvote this (AI scraping is not my favorite) but I’m not going to.

It really does have its niche - one off complex scrapes where it’s kind of questionable if it’s worth writing a scraper.

jb_hn

2 months ago

Haha I appreciate that! And that’s exactly right. Our goal is to make it so that you don’t have to ask the question “but is it worth the time and effort…” when you want to use or explore a new dataset.