Linux.com

data mining - additional options

Posted by: Anonymous Coward on March 30, 2006 05:30 AM
I do a lot of data mining from websites for a variety of reasons. I tend to use a combination of Perl and wget mostly. I call wget from within Perl scripts and use Perl's powerful text manipulation methods to locate the data I'm interested in. I find that massaging the text within a perl script rather than saving to multiple files or piping constantly is more efficient and easier to understand.

I'm not saying there's anything wrong with the above method, just offering another solution.

#

Return to Extract data from the Internet with Web scraping