Network Probability Matrix
Encyclopedia
The network probability matrix describes the probability structure of a network based on the historical presence or absence of edges in a network. For example, individuals in a social network are not connected to other individuals with uniform random probability. The probability structure is much more complex. Intuitively, there are some people whom a person will communicate with or be connected more closely than others. For this reason, real-world networks tend to have clusters or cliques of nodes that are more closely related than others (Albert and Barabasi, 2002, Carley [year], Newmann 2003). This can be simulated by varying the probabilities that certain nodes will communicate.
The edge probabilities can be derived from empirical data in several ways. Given network data collected over multiple time periods on a group of subjects, the edge probabilities can be estimated by the proportion of edge occurrences, eij, for each cell in the adjacency matrix, . In the case of communication networks, statistical distributions can be fitted to the time between messages for each potential edge in the network. For a specified period of time t, the edge probability p for each set of entities i and j can be found. Under certain assumptions, the following is true:
In practice, the function must be estimated using techniques such as maximum likelihood estimation; it is the probability density function for the time between communications from node vi to vj, and represent the parameters of the density. It may be desirable to construct a network based on a restriction such as, “two emails within a time period demonstrate a relationship, but one does not.” In this case, it is necessary to compose a function of random variables. If [?] represents the probability density function
of time between two sets of two emails and [?] represents the probability density function of time between one set of two emails, then the following is true under certain assumptions:
It is possible to generalize this idea; if the probability that x or more communications occur within time t, then the following is true:
This newly proposed framework for viewing the probability space of a social network
preserves the same flexibility for modeling dyadic relationships, however, it provides researchers with a means to understand the probability space of the network and thus devise more robust and appropriate statistical tests for social network analysis.
The network probability matrix was originally proposed by Ian McCulloh.
The edge probabilities can be derived from empirical data in several ways. Given network data collected over multiple time periods on a group of subjects, the edge probabilities can be estimated by the proportion of edge occurrences, eij, for each cell in the adjacency matrix, . In the case of communication networks, statistical distributions can be fitted to the time between messages for each potential edge in the network. For a specified period of time t, the edge probability p for each set of entities i and j can be found. Under certain assumptions, the following is true:
- [line missing, possibly as a result of copying and pasting]
In practice, the function must be estimated using techniques such as maximum likelihood estimation; it is the probability density function for the time between communications from node vi to vj, and represent the parameters of the density. It may be desirable to construct a network based on a restriction such as, “two emails within a time period demonstrate a relationship, but one does not.” In this case, it is necessary to compose a function of random variables. If [?] represents the probability density function
Probability density function
In probability theory, a probability density function , or density of a continuous random variable is a function that describes the relative likelihood for this random variable to occur at a given point. The probability for the random variable to fall within a particular region is given by the...
of time between two sets of two emails and [?] represents the probability density function of time between one set of two emails, then the following is true under certain assumptions:
- [line missing, possibly as a result of copying and pasting]
It is possible to generalize this idea; if the probability that x or more communications occur within time t, then the following is true:
- [line missing, possibly as a result of copying and pasting]
This newly proposed framework for viewing the probability space of a social network
Social network
A social network is a social structure made up of individuals called "nodes", which are tied by one or more specific types of interdependency, such as friendship, kinship, common interest, financial exchange, dislike, sexual relationships, or relationships of beliefs, knowledge or prestige.Social...
preserves the same flexibility for modeling dyadic relationships, however, it provides researchers with a means to understand the probability space of the network and thus devise more robust and appropriate statistical tests for social network analysis.
The network probability matrix was originally proposed by Ian McCulloh.