Originally Posted by Barjack
Then I parse them all using another Ruby script using Nokogiri (XML/HTML parser) and simple regular expressions. The site uses AJAX often so there are often JSON objects sitting around on various pages that you can load into memory with a JSON parser, too. These are sometimes more convenient than scraping the HTML.
|
Out of sheer curiosity - could you share that script? (I am trying to learn Regex, and Nokogiri :P )