This is a discussion on "My pet crawler" within the Webforumz Cafe section. This forum, and the thread "My pet crawler are both part of the Community category.
|
|
|
|
|
![]() |
||
My pet crawler
|
||
| Notices |
![]() |
|
|
LinkBack | Thread Tools |
|
#1
|
|||
|
|||
|
My pet crawler
My pet project over the years has been building a web crawler. I just recently built a page that outputs some of the info being collected when it crawls a page instead of just storing the info. I get a kick out of using it on my websites and I thought some other people on webforumz might like to check it out.
http://www.mysitetrack.com/crawler/index.php Just enter a url and click Generate Report. |
|
|
|
#2
|
|||
|
|||
|
Re: My pet crawler
Hey! Thats nifty...even comes with a to do list....
|
|
#3
|
|||
|
|||
|
Re: My pet crawler
That's pretty cool. I love stuff like that.
Nice one. Pete. |
|
#4
|
|||
|
|||
|
Re: My pet crawler
Thanks guys, I figured some people would enjoy it.
I just started to combine it with my content analysis script to display more info about a page |
|
#5
|
|||
|
|||
|
Re: My pet crawler
Wow, I like that kinda handy. I think I have some sites I am going to check out. Thanks Man!!
|
|
#6
|
|||
|
|||
|
Re: My pet crawler
Neat. You deserve a pat on the old back for this. Yaaaaah!
Last Blog Entry: More Sara Blogging (Nov 29th, 2007)
|
|
#7
|
|||
|
|||
|
Re: My pet crawler
Thank you... Thank you very much... (said in an Elvis voice)
|
|
#8
|
|||
|
|||
|
Re: My pet crawler
thats sweet ..... its been added to my bookmarks ... its mine now muahha ha ha ha
ahem .... very nice well done |
|
#9
|
||||
|
||||
|
Re: My pet crawler
*Nicks a rabbit of accurax*
Last Blog Entry: Assassin's Creed (Nov 22nd, 2007)
|
|
#10
|
||||
|
||||
|
Re: My pet crawler
Very nice... I found something quite interesting.
At the end of the report, you analyse keywords within the content. one of my sites (pinesandneedles.com) had this: Keywords appearing once: one word: christmas, trees, london (and many more unrelated) two word: christmas tree, christmas trees (and many more unrelated) Keywords appearing twice: one word: christmas, trees, london (and a few more related) two word: christmas tree, christmas trees (and a few more related) Keywords appearing 3 times: one word: christmas, trees, london (and only) two word: christmas tree, christmas trees (and only) Keywords appearing 4 times: one word: christmas, trees, london (and only) two word: christmas tree, christmas trees (and only) Which is why we do so well on searches for "christmas trees london" and respectably well on searches for "christmas trees", considering the competition and the fact we're the only site on the top 10 results that doesn't have the words "christmas" or "tree" or even "london" in the url.... All this means is that your analyser is obviously imitating the behaviour of search engines very well and you should indeed be very proud of it. Nice work ;-)
Last Blog Entry: Random String in Javascript (Apr 21st, 2008)
|
|
#11
|
|||
|
|||
|
Re: My pet crawler
Thats all i do on rabbitdom mate.... i have no idea why... but its amusing to steal ur carrots
|
|
#12
|
|||
|
|||
|
Re: My pet crawler
Cool! I'll be following your project's progress. I'll be bookmarking your site.
ArdRigh |
|
#13
|
|||
|
|||
|
Re: My pet crawler
nice i like it
|
![]() |
| Tags |
| crawler |
| Thread Tools | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| web crawler not following links | nate2099 | Web Page Design | 1 | Feb 17th, 2008 01:59 |
| Links a crawler should ignore | Don Logan | Web Page Design | 0 | Oct 6th, 2006 18:55 |