I am working on a financial software project. I have written Python code to download the full historical price data for each stock in the S&P 500. I tested it with an input file of five ticker symbols and it runs perfectly. I would now like to get data for all 500 stocks; however, I don't know whether Yahoo would tolerate that many requests. The program sends only one request per minute, but I am still worried about turning my system loose.
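For reference, the one-request-per-minute throttling amounts to something like the minimal sketch below. This is an illustration, not the actual program: `throttled_fetch` and `fetch_one` are hypothetical names, and `fetch_one` stands in for whatever function downloads one ticker's history. The `sleep` and `clock` parameters are only there so the pacing can be checked without really waiting.

```python
import time
from typing import Callable, Dict, Iterable


def throttled_fetch(
    tickers: Iterable[str],
    fetch_one: Callable[[str], object],  # hypothetical: downloads one ticker's history
    min_interval: float = 60.0,          # seconds between request starts (one per minute)
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, object]:
    """Call fetch_one for each ticker, leaving at least min_interval
    seconds between the start of consecutive requests."""
    results: Dict[str, object] = {}
    last_start = None
    for ticker in tickers:
        if last_start is not None:
            # Wait only for the part of the interval that has not already elapsed.
            wait = min_interval - (clock() - last_start)
            if wait > 0:
                sleep(wait)
        last_start = clock()
        results[ticker] = fetch_one(ticker)
    return results
```

At this rate, 500 tickers means 499 one-minute waits, so a full run takes a little over eight hours.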
Has anyone else done something similar? For the people who own their own sites, how do you view scraping? Should I contact someone at Yahoo first?