The following example shows you how to use Hpricot to parse HTML
# $Id$ # # /home/madann/learn/ruby/ht.rb # require 'net/http' require 'rubygems' require 'hpricot' HOST="www.estately.com" http = Net::HTTP.new(HOST,80) homepage = http.get('/', nil) doc = Hpricot(homepage.body) # p doc.class # p doc.methods.sort inner = Hpricot(doc.at("body").to_html) divs = inner.search("div") p divs