WWW 2006 – Random Sampling from a Search Engine's Index

new method to benchmark relative sizes of search engines; only about 45% of Yahoo’s index is in Google, and vice-versa