# active_learning_based_structural_inference__d1d28187.pdf Active Learning based Structural Inference Aoran Wang 1 Jun Pang 1 2 In this paper, we propose a novel framework Active Learning based Structural Inference (ALa SI), to infer the existence of directed connections from observed agents states over a time period in a dynamical system. With the help of deep active learning, ALa SI is competent in learning the representation of connections with a relatively small pool of prior knowledge. Moreover, based on information theory, the proposed interand outof-scope message learning pipelines are remarkably beneficial to structural inference for large dynamical systems. We evaluate ALa SI on various large datasets including simulated systems and real-world networks, to demonstrate that ALa SI is able to outperform previous methods in precisely inferring the existence of connections in large systems under either supervised learning or unsupervised learning. 1 Introduction Dynamical systems are commonly observed in real-world, including physical systems (Kwapie n & Dro zd z, 2012; Ha & Jeong, 2021), biological systems (Tsubaki et al., 2019; Pratapa et al., 2020), and multi-agent systems (Bras o & Leal Taix e, 2020; Li et al., 2022). A dynamical system can be described as a set of three core elements: (a) the state of the system in a time period, including the state of the individual agents, and can be viewed as time series; (b) the state-space of the system; and (c) the state-transition function (Irwin & Wang, 2017). Knowing these core elements, we can describe and predict how a dynamical system behaves. Yet the three elements are not independent of each other. The evolution of the state is affected by the state-transition function, which suggests that the future state may be predicted based on the current state and the entities which affect the agents 1Faculty of Science, Technology and Medicine, University of Luxembourg, Luxembourg 2Institute for Advanced Studies, University of Luxembourg, Luxembourg. Correspondence to: Aoran Wang , Jun Pang . Proceedings of the 40 th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023. Copyright 2023 by the author(s). (i.e. connectivity). Moreover, the state-transition function is often deterministic (Katok & Hasselblatt, 1995), which simplifies the derivation of the future state as a Markovian transition function. However, in most cases, we hardly have access to the connectivity, or only have limited knowledge about the connectivity. Is it possible to infer the connectivity from the observed states of the agents over a time period? We formulate it as the problem of structural inference, and several machine learning frameworks have been proposed to address it (Kipf et al., 2018; Webb et al., 2019; Alet et al., 2019; Chen et al., 2021; L owe et al., 2022; Wang & Pang, 2022). Although these frameworks can accurately infer the connectivity, as they perform representation learning on a fully connected graph, these methods can only work for small systems (up to dozens of agents), and cannot scale well to real-world large dynamical systems, for example, with hundreds of agents. Besides, as we show in the experiment and appendix sections in this work, the integration of prior knowledge about partial connectivity of the system is quite problematic among these methods. In this work, we propose a novel structural inference framework, namely, Active Learning based Structural Inference (ALa SI), which is designed for the structural inference of large dynamical systems based on Deep Active Learning (Deep AL) (Ren et al., 2022), and is suitable for the integration of prior knowledge. In order to perform structural inference on large dynamical systems, unlike ordinal deep active learning methods that build feature pools on batches (Kirsch et al., 2019; Zhdanov, 2019; Ash et al., 2020; Gentile et al., 2022), the pools of ALa SI are built on agents, and the framework can consequently infer the existence of directed connections with a little prior knowledge of the connections. ALa SI leverages query strategy with dynamics for agent-wise selection to update the pool with the most informative partial system, which encourages ALa SI to infer the connections efficiently and accurately with partial prior knowledge of the connectivity (named scope ). Based on information theory, ALa SI learns both inter-scope (IS) and out-of-scope (OOS) messages from the current scope to distinguish the information which represents connections from agents within the scope and from agents out of the scope, which reserves redundancy when new agents come into scope. Moreover, with oracle such as Active Learning based Structural Inference Partial Information Decomposition (PID) (Williams & Beer, 2010), ALa SI can infer the connectivity even without prior knowledge and be trained in an unsupervised way. We show with extensive experiments that ALa SI can infer the directed connections of dynamical systems with up to 1.5K agents with either supervised learning or unsupervised learning. The main contribution of this paper is the following: We propose a novel structural inference algorithm, ALa SI, tailored to infer the connection of large dynamical systems based on Deep AL. It is the first attempt to structural inference with Deep AL to the best of our knowledge. We design a novel dynamic query strategy, which queries the most informative agents to be labeled based on the dynamic error, and enables ALa SI to learn efficiently on prior knowledge of the partial dynamical system. Based on information theory, we propose IS and OOS representation learning pipelines, which facilitate the learning of OOS connections from the current scope of the system, and reserve redundancy for new agents to be added to the current scope. We experimentally evaluate ALa SI with seven large dynamical systems, and show that ALa SI manages to precisely and efficiently infer the connections under both supervised and unsupervised settings. 2 Related Work Structural inference. The aim of structural inference is to accurately reconstruct the connections between the agents in a dynamical system with observational agents states. Among the wide variety of methods, Neural Relational Inference (NRI) (Kipf et al., 2018) was the first to address the problem of structural inference based on observational agents states with the help of a Variational Auto-encoder (VAE) operating on a fixed fully connected graph structure. Several works have been proposed based on further improvement on NRI. Such as extending to multi-interaction systems (Webb et al., 2019), integrating efficient messagepassing mechanisms (Chen et al., 2021), using modular meta-learning (Alet et al., 2019), and eliminating indirect connections with iterative process (Wang & Pang, 2022). From the aspect of Granger-causality, amortized causality discovery (ACD) (L owe et al., 2022) attempted to infer a latent posterior graph from temporal conditional dependence, while Wu et al. (2020) proposed the Minimum Predictive Information Regularization (MPIR) model and used a learnable noise mask on nodes to reduce the computational cost. In addition to the work mentioned above, several frameworks inferred the connectivity with different problem settings. Some approaches fitted a dynamics model and then produced a causal graph estimate of the model by using recurrent models (Tank et al., 2021; Khanna & Tan, 2020), or inferred the connections by generating edges sequen- tially (Johnson, 2017; Li et al., 2018), or were specially designed to infer the connections of dynamic graphs (Ivanovic & Pavone, 2019; Graber & Schwing, 2020; Li et al., 2022). However, because of the fixed latent space in VAE or exponential computational efficiency, most of the methods mentioned above are incapable of structural inference on large dynamical systems and have difficulties in the efficient utilization of prior knowledge. Deep Active learning. ALa SI follows the strategy of Deep AL (Gal et al., 2017; Pop & Fulop, 2018; Kirsch et al., 2019; Tran et al., 2019; Ren et al., 2022), attempting to combine the strong learning capability of deep learning in the context of high-dimensional data processing and the significant potential of Active Learning (AL) in effectively reducing labeling costs. To solve the problem of insufficient labeled sample data, (Tran et al., 2019) leveraged generative networks for data augmentation, and (Wang et al., 2016) expanded the labeled training set with pseudo-labels. Moreover, Hossain & Roy (2019) and Sim eoni et al. (2020) used labeled and unlabeled datasets to combine supervised and semisupervised training with AL methods. Several works have been proposed on how to improve the batch sample query strategy (Shi & Yu, 2019; Kirsch et al., 2019; Zhdanov, 2019; Ash et al., 2020). As we will show, by leveraging the advantages of Deep AL, ALa SI is competent in efficiently and accurately inferring the existence of directed connections with a small labeled pool of prior knowledge. Partial Information Decomposition. Partial Information Decomposition (PID) explicitly quantifies the information associated with two or more information sources that is not present in any subset of those sources (Williams & Beer, 2010; Lizier et al., 2013; Pakman et al., 2021). Therefore, PID is widely utilized to uncover the underlying connections between the agents in the dynamical systems in the field of physics (Barrett, 2015; Makkeh et al., 2018) and biology (Chan et al., 2017; Cang & Nie, 2020). Moreover, Lizier et al. (2013) extended the ordinary PID to cases with two or more sources and also considered past ego state as a source. Based on (Lizier et al., 2013), we derive a novel method for learning OOS messages from the current scope. Besides that, we also extend the original symmetric formulation of PID to unsymmetric cases by integrating temporal information, to enable ALa SI to infer the existence of directed connections even without any prior knowledge. 3 Preliminaries 3.1 Notations and General Problem Definition We view a dynamical system S as S = {V, E}, in which V represents the set of n agents in the system: V = {vi, 1 i n}, and E denotes the directed connections between the agents: (vi, vj) E V V. We focus on the cases Active Learning based Structural Inference where we have recordings of the agents states over a time period: V = {V t, 0 t T}, where T is the total number of time steps, and V t is the set of features of all the n agents at time step t: V t = {vt 1, vt 2, . . . , vt n}. We name the recordings as trajectories. Based on the trajectories, we aim to infer the existence of directed connections between any agent-pair in the system. The connections are represented as E = {eij {0, 1}}, where eij = 1 (or 0) denotes the existence of connection from agent i to j (or not). We sample a total number of K trajectories. With the notations above, the dynamics for agents within the system is: vt+1 i = vt i + X j Ui f ||vi, vj||α , (1) where denotes a time interval, Ui is the set of agents connected with agent i, and f( ) is the state-transition function deriving to dynamics caused by the edge from agent j to i, and || , ||α denotes the α-distance. We state the problem of structural inference as searching for a combinatorial distribution to describe the existence of a directed connection between any agent pair in the dynamical system. 3.2 Problem Definition in the Context of Deep AL Assume we have two sets of trajectories, the set of trajectories without knowing connectivity Dpool = {Vpool, E }, and the set of trajectories for training Dtrain = {Vtrain, Etrain}, where E denotes the empty set of connectivity. We consider two scenarios: in the first scenario we have access to the ground truth of connectivity E in the system, and we perform a supervised-learning-based Deep AL with ALa SI: min s L:|s L|