Prelimenary results from Blogroll Ranking

Who are the influential bloggers? Which blogs matter? What metrics would you use to even begin to answer these questions?

I’ve been exploring alternate methods of ranking in the past months. The best results are coming from examining Blogrolls. When you think about it, blogrolls compromise the links in a huge implicit trust network. For now I’m calling the calculated score “PeopleRank”. It’s kinda like PageRank, in that blogroll links from higher PeopleRank-ed blogs count more. E.g. if Om Malik has you on his blogroll, that counts a lot more for your ranking than the blogroll of your niece on Livejournal. (No offense to your niece.)

So here are the top 50 blogs as ranked by the preliminary algorithm: (Commentary and caveats follow)

Blog Name URL People Rank Blogroll Count
TechCrunch (Arrington & Friends) 16.88550 74
Fred Wilson 13.65663 59
Om Malik 10.90295 51
Subscribe to Posts (RSS) 10.35721 58
Battelle, John 9.43316 36
kottke 9.30745 23
Micro Persuasion 9.05083 35
dooce 8.75597 24 8.24951 14
Advertise on this blog 8.24951 14
Creating Passionate Users 8.05627 51
Instapundit 8.01555 30
Brad Feld – Feld Thoughts 7.76376 57
BuzzMachine 7.68799 31
Seth’s Blog 7.64178 44
Full Content 7.39462 10
Comments 7.39462 10
How to Change the World 7.36782 39
Read/WriteWeb 7.32572 27
Canuckflack 7.25962 11
Slashdot 7.22526 32
Gizmodo 7.22314 19
Movable Type 6.92314 15
David Jones/PR Works 6.67162 11
GestureBank 6.61738 20
Hugh Macleod 6.58896 19
Michelle Malkin 6.53256 28
New World Notes 6.47961 6
Bad Astronomy 6.34440 9
Talking Points Memo: by Joshua Micah Marshall 6.30786 23
James Governor 6.11552 23
Three Kid Circus 6.10842 109
Sweetney 6.08445 107
Rain City Real Estate Guide 6.06087 11
Fussy 6.00416 16
SpiffyJapan 5.97301 5
Jottings By An Employer's Lawyer 5.95257 7
VentureBlog 5.91916 24
Joho the Blog 5.85586 23
Jeneane Sessum – Allied 5.73544 91
Her Bad Mother 5.73306 108
George’s Emplt 5.71551 7
B.L. Ochman's Weblog 5.69226 11
Captain's Quarters 5.65295 28
Techdirt (Mike Maznick) 5.64693 21
Venture Chronicles 5.63134 33
This Blog Sits at the 5.50986 9
Shel Holtz 5.49340 10

Caveats of this calculation:

  • Results with ~5K blogs crawled.
  • Blogroll Count = Number of blogrolls this blog appears on = How many people publicly admit to reading this blog.
  • The interesting datapoints are where the PeopleRank ordering puts a blog higher in the list than one with a higher blogroll count — those fewer subscribers must be “more important”.
  • This crawl took Lijit user blogs as the starting seeds giving an overall tech bias.
  • However, there was a period when the crawler went unchecked into what can only be called “The Mommy-o-sphere” so there is an over representation of Mom-blogs in teh dataset.
  • Our blogroll detector algorithm still gets false positives, thus the high rank for “Subscribe to Feedburner” and the multiple listings.
  • Some blogs use a Blogrolling widget for a “Web Ring” functionality, thus erroneously appearing as blogrolls. This explains most of the 100+ blogroll counts.
  • We need better de-duping. Several blogs appeared until multiple URL’s, reducing the overall score.

So how is this different from existing rankings? Til now, the most common methods have fallen into one of two camps:

  1. Number of subscribers. I.e. a pure democracy. Use some combination of Feedburner (for RSS readers) and some web analytics (for web readers) to count the raw number of people reading a blog.
  2. Raw number of incoming links (citations). This is similar, except that links are counted instead of subscribers.

Note that neither method discriminates between the blogs “casting the votes”. It doesn’t matter if that 24th reader of your blog happens to be Scoble. Nor does it matter if those 3 citations to your blog in the last month (Technorati defines this as “very low authority”) came from Seth Godin, Fred Wilson, and Guy Kawasaki.

Initial results are encouraging, and I hope to do more analysis this week. What do you think? If you have any suggestions or ideas, please get in touch with me.

Comments are closed.