Attributed multilayer networks

Perform inference on networks with metadata on both edges and nodes.

Advancements in data collection techniques have led to the acquisition of more comprehensive data, particularly by gathering additional information that characterizes the units and their interactions within real-world systems. These enriched data are effectively represented by attributed multilayer networks, which are complex network representations that describe multiple types of interactions among the same set of nodes, while also incorporating node information such as attributes or covariates. To properly analyze such data, models must integrate various sources of information in a convenient and principled manner, leveraging both the network topology and the node metadata.

We developed two distinct probabilistic models, named MTCOV (Contisciani et al., 2020) and PIHAM (Contisciani* et al., 2024), designed to perform community detection and broader inference in attributed multilayer networks. Both models posit the existence of a hidden mixed-membership community structure that drives the generation of both interactions and node attributes, but they differ in their model specification and inference.

Specifically, MTCOV is designed to handle categorical attributes and nonnegative discrete weights, specifying the likelihood of the data through a linear combination of the likelihoods from the two sources of information. Additionally, inference is performed using an efficient EM algorithm. On the other hand, PIHAM flexibly adapts to any combination of input data and employs a Bayesian framework, along with Laplace approximations and automatic differentiation techniques for parameter inference.

In addition to applying these methods to already explored real-world data, such as social and biological networks, we employed this methodology for the first time in the analysis of patent citation networks (Higham et al., 2022). In this context, we not only illustrated the importance of using a multilayer framework for patent citation data analysis but also emphasized the role of a node covariate in driving the inference, alongside the structural information embedded within the network.

Main takeaways

  • Effectively integrating node attributes with topological information can significantly enhance network inference, by for instance boosting prediction performance.
  • Better outcomes are achieved when the node metadata are more informative and exhibit some degree of correlation with the information conveyed by the interactions.
  • When the node metadata offer valuable insights, our models identify communities that align with this information. This approach leads to more interpretable results, where attributes actively influence the inference process.

References

  1. [1]
    Community detection with node attributes in multilayer networks
    Martina Contisciani, Eleanor A Power, and Caterina De Bacco
    Scientific Reports, 2020
  2. [P1]
    Flexible inference in heterogeneous and attributed multilayer networks
    Martina Contisciani*, Marius Hobbhahn*, Eleanor A Power, Philipp Hennig, and Caterina De Bacco
    2024
  3. [4]
    Multilayer patent citation networks: A comprehensive analytical framework for studying explicit technological relationships
    Kyle Higham, Martina Contisciani, and Caterina De Bacco
    Technological Forecasting and Social Change, 2022