kevin.kempter at dataintellect.com
2005-Sep-21 20:48 UTC
[CentOS] OT - Tool to pull data from the web?
Hi all; I would like to pull data from some specific web pages that have for sale ads and insert the data into a database. Anyone know ofany tools that can help me with this?
I assume you are looking for a "simple" way of doing this just real quick so you're going to want a scripting language. Were I to do this, I would use Python and the urlgrabber. http://linux.duke.edu/projects/urlgrabber urlgrabber has some benefits over urllib2 or pycURL. You could also use jython to get quick access to lots of fun Java libraries that handle XML/HTML and database (JDBC) and HTTP. Regards, Greg
HTMLParser? Perl? Python? A -- Zope Managed Hosting Software Engineer Zope Corporation (540) 361-1700> -----Original Message----- > From: centos-bounces at centos.org [mailto:centos-bounces at centos.org] On > Behalf Of kevin.kempter at dataintellect.com > Sent: Wednesday, September 21, 2005 4:49 PM > To: CentOS mailing list > Subject: [CentOS] OT - Tool to pull data from the web? > > Hi all; > > I would like to pull data from some specific web pages that have for sale > ads > and insert the data into a database. Anyone know ofany tools that can help > me > with this? > > _______________________________________________ > CentOS mailing list > CentOS at centos.org > http://lists.centos.org/mailman/listinfo/centos