I have a high-traffic news website that I need to add a search feature to. I need something that's free or cheap, adds new news stories to search results immediately, handles high traffic, and allows customized styling of results. I looked at Google Custom Search, and it didn't seem like a good option: you have to pay for high traffic, and new articles aren't added immediately unless I pay to index them every time an article is added.
Does anyone know of either a different search API that would meet my needs or, better yet, a library or script of some sort I could use? For example, do Sphinx, Lucene, Solr, etc. have the capability to index web pages? If so, would this have huge data requirements that would make my Amazon Web Services bill skyrocket? What's the best solution?