|
Data Mining Tutorial complete
with Data Mining Tools (PHP
Functions) to parse data and
match based on regular
expressions. Basic Data
Mining Steps: Fetch the HMTL
page(s) of Interest using
the Snoopy PHP Class,
Split the page HTML into a
more managable portion,
Remove un-wanted HTML tag
attributes, Reformat HTML,
adjust spacing and remove
entities, Match content with
regular expressions and
Store content into a MySQL
database for future use.
Data mining services
available for online
resources such as Google,
DMOZ, Yahoo, Yellow Pages
and several others.
Date: Jan, 29 2012 |