Sample Quality Heterogeneity-aware Federated Causal Discovery through Adaptive Variable Space Selection

Xianjie Guo; Kui Yu; Hao Wang; Lizhen Cui; Han Yu; Xiaoxiao Li

doi:10.24963/ijcai.2024/450

Sample Quality Heterogeneity-aware Federated Causal Discovery through Adaptive Variable Space Selection

Xianjie Guo, Kui Yu, Hao Wang, Lizhen Cui, Han Yu, Xiaoxiao Li

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence

Main Track. Pages 4071-4079. https://doi.org/10.24963/ijcai.2024/450

PDF BibTeX

Federated causal discovery (FCD) aims to uncover causal relationships among variables from decentralized data across multiple clients, while preserving data privacy. In practice, the sample quality of each client's local data may vary across different variable spaces, referred to as sample quality heterogeneity. Thus, data from different clients might be suitable for learning different causal relationships among variables. Model aggregated under existing FCD methods requires the entire model parameters from each client, thereby being unable to handle the sample quality heterogeneity issue. In this paper, we propose the Federated Adaptive Causal Discovery (FedACD) method to bridge this gap. During federated model aggregation, it adaptively selects the causal relationships learned under the "good" variable space (i.e., one with high-quality samples) from each client, while masking those learned under the "bad" variable space (i.e., one with low-quality samples). This way, each client only needs to send the optimal learning results to the server, achieving accurate FCD. Extensive experiments on various types of datasets demonstrate significant advantages of FedACD over existing methods. The source code is available at https://github.com/Xianjie-Guo/FedACD.

Keywords:

Machine Learning: ML: Federated learning

Knowledge Representation and Reasoning: KRR: Causality

Machine Learning: ML: Causality

Uncertainty in AI: UAI: Causality, structural causal models and causal inference