Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

How does that work if the scraper takes a screenshot to feed to a LLM or OCR?
 help



That seems like a very expensive way to crawl the internet

Scrape normally collect emails, if no email seen take screenshot and OCR OCR is cheap and REGEX is cheap

It would be interesting to show bf code rather than the actual email on the webpage. A lot of OCR systems struggle with this kind of repeated symbols where the exact count is required.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: