Algorithm | Higher-order network

Compatibility

In a first-order network, the probability that a ship at Singapore goes to a given port next is proportional to the weight of the edge leading to that port, regardless of where the ship came from. This is the Markov property of the first-order network representation.
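As a worked form of this statement, the next-step probability can be written as a normalized edge weight. The symbols here are assumptions introduced for illustration: $X_t$ is the node visited at step $t$, and $W(i \to j)$ is the weight of the edge from node $i$ to node $j$.

```latex
P(X_{t+1} = j \mid X_t = i) = \frac{W(i \to j)}{\sum_{k} W(i \to k)}
```

The right-hand side depends only on the current node $i$, which is exactly why the first-order representation cannot distinguish ships that reached Singapore from different ports.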

In HON, we break down Singapore into two nodes: Singapore given Tokyo, and Singapore given Shanghai. Each of these nodes has its own edge weights to LA and Seattle. Ships still perform random walks, but ships that reach Singapore along different paths now have different probabilities for their next step.

Everything in the formula is unchanged except the labeling of nodes, so the higher-order network keeps the same data structure as a first-order network and is directly compatible with existing tools.
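A minimal sketch of this compatibility, under stated assumptions: the edge weights are made up, and the `"Current|Previous"` label format is only an illustrative naming choice. The point is that the function that normalizes edge weights and the random-walk step never inspect the labels, so they work on the split nodes without modification.

```python
import random

# Hypothetical weights; the "Current|Previous" labels are an assumed naming
# scheme for this sketch, not taken from the source.
hon = {
    "Singapore|Tokyo": {"LA": 9, "Seattle": 1},
    "Singapore|Shanghai": {"LA": 1, "Seattle": 4},
}

def transition_probabilities(network, node):
    """Normalize the outgoing edge weights of `node` into probabilities."""
    out_edges = network[node]
    total = sum(out_edges.values())
    return {target: weight / total for target, weight in out_edges.items()}

def random_walk_step(network, node, rng):
    """Pick the next node with probability proportional to the edge weight."""
    probs = transition_probabilities(network, node)
    targets = list(probs)
    weights = [probs[t] for t in targets]
    return rng.choices(targets, weights=weights, k=1)[0]

# The same code handles both higher-order nodes; only the labels differ.
from_tokyo = transition_probabilities(hon, "Singapore|Tokyo")
from_shanghai = transition_probabilities(hon, "Singapore|Shanghai")
next_port = random_walk_step(hon, "Singapore|Tokyo", random.Random(0))
```

With these made-up weights, a ship that arrived from Tokyo mostly continues to LA, while one that arrived from Shanghai mostly continues to Seattle.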

Variable orders means scalability

What if movement on the network depends on more than two steps, say, on what happened five steps ago? One potential approach is to break down every node repeatedly to embed more of the path history, creating a fifth-order network. While such a fixed-order network is easy to build, it does not scale well. Forcibly breaking down every node to a fixed high order makes the network exponentially more complex, given how many combinations of five previous steps are possible.
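To make the growth concrete, here is a rough upper-bound calculation. It assumes that a node in a network of fixed order k is labeled by a sequence of k locations and that every such sequence can occur; real data would produce fewer nodes, and the figure of 100 ports is hypothetical.

```python
def max_fixed_order_nodes(num_locations, order):
    """Upper bound on node count: one node per sequence of `order` locations."""
    return num_locations ** order

# Hypothetical network with 100 ports, orders 1 through 5.
counts = {k: max_fixed_order_nodes(100, k) for k in range(1, 6)}

for k, n in counts.items():
    print(f"order {k}: up to {n:,} nodes")
```

Under these assumptions, each additional order multiplies the bound by the number of locations.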

Instead, we propose a variable-order representation that uses the first order where it is sufficient and higher orders only where necessary. As a result, we can represent dependencies of different orders in the same network. Best of all, the resulting network is orders of magnitude smaller than a fixed-order network, making it scalable to big data.
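The idea can be sketched as follows. This is an illustration only: the trajectories are made up, and the test for "necessary" (a threshold on the largest difference between the first-order and second-order next-step distributions) is a stand-in chosen for simplicity, not necessarily the criterion HON uses. All function names and the `threshold` parameter are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical trajectories (sequences of ports); not real shipping data.
trajectories = [
    ["Tokyo", "Singapore", "LA"],
    ["Tokyo", "Singapore", "LA"],
    ["Shanghai", "Singapore", "Seattle"],
    ["Shanghai", "Singapore", "Seattle"],
    ["Tokyo", "LA", "Seattle"],
    ["Shanghai", "LA", "Seattle"],
]

def next_step_counts(paths, order):
    """Count next steps after each observed context of `order` consecutive ports."""
    counts = defaultdict(Counter)
    for path in paths:
        for i in range(order - 1, len(path) - 1):
            context = tuple(path[i - order + 1 : i + 1])
            counts[context][path[i + 1]] += 1
    return counts

def normalize(counter):
    """Turn raw counts into a probability distribution."""
    total = sum(counter.values())
    return {key: value / total for key, value in counter.items()}

def differs(dist_a, dist_b, threshold):
    """Stand-in test: largest absolute probability difference exceeds threshold."""
    targets = set(dist_a) | set(dist_b)
    gap = max(abs(dist_a.get(t, 0.0) - dist_b.get(t, 0.0)) for t in targets)
    return gap > threshold

def higher_order_contexts(paths, threshold=0.2):
    """Return the (previous, current) contexts that need a second-order node."""
    first = next_step_counts(paths, 1)
    second = next_step_counts(paths, 2)
    needed = set()
    for context, counter in second.items():
        current = context[-1:]  # the matching first-order context
        if differs(normalize(first[current]), normalize(counter), threshold):
            needed.add(context)
    return needed

higher_order = higher_order_contexts(trajectories, threshold=0.2)
```

In this toy data, only Singapore is split by previous port, because ships leaving LA go to Seattle regardless of where they came from; LA stays a single first-order node.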

Workflow

Rule extraction

Network wiring

Scalability

HON+: a fundamentally improved HON construction algorithm