May 25, 2006 WWW 2006 – Random Sampling from a Search Engine's Index: a new method to benchmark the relative sizes of search engines; only about 45% of Yahoo's index is in Google, and vice versa.