
What approach would you take initially for deduplicating identical URLs?


You might look at a real data processing system, like something from the Apache projects.
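For a small-scale starting point before reaching for a full data processing system, one common approach is to normalize each URL and track the normalized forms in a set. The sketch below is only an illustration under assumptions not stated in the thread: the helper names `normalize_url` and `dedupe_urls` are hypothetical, and the normalization rules (lowercase the scheme and host, drop default ports and fragments, treat an empty path as `/`) are one plausible choice, not a standard everyone agrees on.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str) -> str:
    """Return a canonical form of `url` for comparison (hypothetical helper).

    Assumed rules: lowercase scheme and host, drop default ports,
    drop the fragment, and treat an empty path as "/".
    Userinfo (user:pass@) is discarded; IPv6 hosts are not handled.
    """
    parts = urlsplit(url.strip())
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    port = parts.port
    is_default_port = (scheme == "http" and port == 80) or (
        scheme == "https" and port == 443
    )
    # Keep the port only when it is not the default for the scheme.
    if port and not is_default_port:
        host = f"{host}:{port}"
    path = parts.path or "/"
    # The last tuple element is the fragment, deliberately left empty.
    return urlunsplit((scheme, host, path, parts.query, ""))


def dedupe_urls(urls):
    """Keep the first occurrence of each URL, comparing normalized forms."""
    seen = set()
    unique = []
    for url in urls:
        key = normalize_url(url)
        if key not in seen:
            seen.add(key)
            unique.append(url)
    return unique
```

An in-memory set like this only works while the URL list fits in memory on one machine; past that point, a distributed system of the kind suggested above is likely the better fit.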



