Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Ask HN: What are some good resources to get started with Information Retrieval?
2 points by ideamonk on Nov 27, 2010 | hide | past | favorite | 4 comments
I haven't been in touch with hardcore academics since around 2 years. Recently coming across a few IEEE papers on twitter(finding hot trends, ranking tweets, etc), I'm thinking of teaching myself some information retrieval from scratch, but from this standpoint topics like NLP, IR seem quite hard to fathom, or even to begin with in the first place.

Any pointers to make the journey smoother, essential foundations? trusted/good books? fun experiments for self learning? IR specific wikis/discussion groups?

Thanks in advance.



Managing Gigabytes is one of the more well-known textbooks...

Really though - if you want to get started with IR, your best bet is probably to get a job at Google. This is one of those fields where industrial practice is light years beyond academic research, and once you get a job at Google in basically anything, it's relatively easy to transfer to Search and learn how the system works.


I am not convinced though. I don't think nooglers are assigned to Google Search team. They won't expose their search-algorithm to a rookie.


It's significantly easier to get into Google and then transfer to search quality than it would be to get into a graduate program, get the Ph.D, and then get the necessary data and machines to do top-level work in IR. Just look at the number of tenured professors that accept jobs at Google.

FWIW, I've been working in Google Search Quality since I joined the company nearly two years ago. I don't work in ranking, but I have high-level knowledge of how the whole system works that's better than you'd get out of a textbook.


After a bit of googling I found out that I was wrong in assuming that search-team of Google is reserved only for veterans and freshers are not allowed. Also it's a fact that you learn more from industrial exposure than that of textbooks.

As for OP, I got a nice resource, which may help him currently: http://www.ims.uni-stuttgart.de/ir/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: