Targeted
Navigation:

Technology All The Way
Iodid Weblogs
 

Standard
Navigation:

Home
FAQ
Search
Table of Contents

Email:

TryandGuess
-at-
iodid.com

Search this weblog:


Visitor Count:
Money Grubbing:

Amazon Honor System Click Here to Pay Learn More

Terms of Use

June 20, 2001

Basic Web Crawler

his would be a web crawler that would be able to gather information in an automated fashion and enter this information into a database for other uses. While I have a few ideas for uses of this information, I do not yet want to describe those plans fully.
Information gathered would include total display characters, dimensions of images displayed, quantity of images, links contained on the page and whether those links are within the site or to external destinations.
I envision that the process would be recursive and controllable in how many links deep the crawler would go, whether to go outside the initial target site and whether to gather the information in concentric rings or whether it will follow each branch of the tree.

Posted by tmichael at June 20, 2001 04:21 PM
Comments
Post a comment