Examining network properties using breadth-first sampling: A case study of the network spanned by the kth.se domain

University essay from KTH/Skolan för datavetenskap och kommunikation (CSC)

Abstract: Many real-life complex networks consist of a tremendous number of nodes and edges, which makes them difficult to extract and analyze. This thesis examines which network properties can be deduced from small samples of a complex network and how well they correspond to the characteristics of the complete network. This is important because sampling will most likely be the de facto method for analyzing complex networks in the future. The study examines the scale-free property, the small-world property and the community structure of the network spanned by the KTH domain. The data was gathered by sampling the network in a breadth-first manner using a web crawler. The samples were then compared with respect to each property. The results show that good approximations of the scale-free property could be made from small samples of the KTH network. However, no good approximation of the small-world property could be made using the sampling technique. Good approximations of a node's community affiliation could be observed, but general conclusions about the complete network's community structure could not be drawn. To summarize, the results indicate that small samples can be used to approximate some properties of the complete KTH network. However, more research is necessary to determine whether the result holds in the general case.
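
As an illustration of the breadth-first sampling idea described in the abstract, the following is a minimal sketch and not the crawler used in the thesis. It assumes the link structure is already available as an in-memory adjacency dictionary. The function names (`bfs_sample`, `induced_degree_counts`) and the toy page names are hypothetical, chosen only for this example.

```python
from collections import Counter, deque


def bfs_sample(adjacency, seed, max_nodes):
    """Return the nodes reached breadth-first from `seed`, capped at `max_nodes`."""
    visited = {seed}
    queue = deque([seed])
    while queue and len(visited) < max_nodes:
        node = queue.popleft()
        for neighbour in adjacency.get(node, ()):
            if neighbour not in visited:
                visited.add(neighbour)
                queue.append(neighbour)
                if len(visited) >= max_nodes:
                    break  # sample budget reached
    return visited


def induced_degree_counts(adjacency, nodes):
    """Count how many sampled nodes have each out-degree inside the sample."""
    degrees = {
        n: sum(1 for m in adjacency.get(n, ()) if m in nodes)
        for n in nodes
    }
    return Counter(degrees.values())


# Toy directed link graph (hypothetical pages, not data from the thesis).
toy_web = {
    "home": ["a", "b", "c"],
    "a": ["home", "d"],
    "b": ["home"],
    "c": ["e"],
    "d": [],
    "e": ["home"],
}

sample = bfs_sample(toy_web, "home", 4)
degree_counts = induced_degree_counts(toy_web, sample)
```

Comparing `degree_counts` across samples of increasing size with the counts for the full graph is one simple way to see how well a sample reflects the degree distribution, which is the quantity behind the scale-free property.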
