# Compositional Fairness Constraints for Graph Embeddings

Avishek Joey Bose (1, 2), William L. Hamilton (1, 2, 3)

(1) McGill University, (2) Mila, (3) Facebook AI Research. Correspondence to: Avishek Joey Bose.

Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019. Copyright 2019 by the author(s).

Abstract

Learning high-quality node embeddings is a key building block for machine learning models that operate on graph data, such as social networks and recommender systems. However, existing graph embedding techniques are unable to cope with fairness constraints, e.g., ensuring that the learned representations do not correlate with certain attributes, such as age or gender. Here, we introduce an adversarial framework to enforce fairness constraints on graph embeddings. Our approach is compositional, meaning that it can flexibly accommodate different combinations of fairness constraints during inference. For instance, in the context of social recommendations, our framework would allow one user to request that their recommendations are invariant to both their age and gender, while also allowing another user to request invariance to just their age. Experiments on standard knowledge graph and recommender system benchmarks highlight the utility of our proposed framework.

1. Introduction

Learning low-dimensional embeddings of the nodes in a graph is a fundamental technique underlying state-of-the-art approaches to link prediction and recommender systems (Hamilton et al., 2017b). However, in many applications, especially those involving social graphs, it is desirable to exercise control over the information contained within learned node embeddings. For instance, we may want to ensure that recommendations are fair or balanced with respect to certain attributes (e.g., that they do not depend on a user's race or gender), or we may want to ensure privacy by not exposing certain attributes through learned node representations. In this work we investigate the feasibility of enforcing such invariance constraints on (social) graph embeddings.

Figure 1. Overview of our approach: Our goal is to generate graph embeddings that are invariant to particular sensitive attributes (e.g., age or gender). We train a set of filters to prevent adversarial discriminators from classifying the sensitive information from the filtered embeddings. After training, these filters can be composed together in different combinations, allowing the flexible generation of embeddings that are invariant w.r.t. any subset of the sensitive attributes.

While enforcing invariance constraints on general classification models (Chouldechova, 2017; Gajane & Pechenizkiy, 2017; Kamishima et al., 2012) and collaborative filtering algorithms (Yao & Huang, 2017) has received considerable attention in recent years, these techniques have yet to be considered within the context of graph embeddings, a setting that introduces particular challenges due to the non-i.i.d. and non-Euclidean nature of relational graph data. Moreover, in the case of social graphs and large-scale recommender systems, it is often the case that there are many possible sensitive attributes over which we may want to enforce invariance constraints.
Previous work on enforcing invariance (or "fairness") in social applications has generally focused on situations that involve one sensitive attribute (e.g., age in the context of credit or loan decisions; Zemel et al., 2013), but in the context of social graph embeddings there can be an extremely large number of possible sensitive attributes. In fact, in extreme settings we may even want to be fair with respect to the existence of individual edges. For instance, a user on a social networking platform might want that platform's recommender system to ignore the fact that they are friends with a certain other user, or that they engaged with a particular piece of content.

Our contributions. We introduce an adversarial framework to enforce compositional fairness constraints on graph embeddings for multiple sensitive attributes. The key idea behind our approach is that we learn a set of adversarial filters that remove information about particular sensitive attributes. Crucially, each of these learned filters can be optionally applied after training, so the model can flexibly generate embeddings that are invariant with respect to different combinations of sensitive attributes. As the space of possible combinations of sensitive attributes can be combinatorially large, we demonstrate that our compositional strategy can generate invariant embeddings even on unseen combinations at test time.

Our contribution is at the intersection of research on (social) graph embedding and algorithmic fairness. We build upon the success of recent adversarial approaches to fairness (Edwards & Storkey, 2015), disentanglement (Mathieu et al., 2016), and transfer learning (Madras et al., 2018), extending these approaches to the domain of graph representation learning and introducing new algorithmic techniques to accommodate compositional constraints during inference.

2. Related work

We now briefly highlight core related work on (social) graph embeddings and algorithmic fairness, which our research builds upon.

2.1. Graph embedding

At the core of our proposed methodology is the notion of learning low-dimensional embeddings of graph-structured data, especially social data. Graph embedding techniques have a long history in the social sciences, with connections to early research on sociograms (small hand-constructed social networks) and latent variable models of social interactions (Faust, 1988; Majone, 1972). In more recent years, the task of embedding graph-structured data has received increasing attention from the machine learning and data mining communities (Cai et al., 2018; Hamilton et al., 2017b). Generally, the goal of these works is to map graph nodes to low-dimensional vector embeddings, such that the original graph can be reconstructed from these embeddings. Traditional approaches to this problem include Laplacian eigenmaps (Belkin & Niyogi, 2002) and matrix factorization techniques (Ng et al., 2001), with recent years witnessing a surge in methods that rely on random-walk based objectives (Grover & Leskovec, 2016; Perozzi et al., 2014), deep autoencoders (Wang et al., 2016), and graph neural networks (Hamilton et al., 2017a; Kipf & Welling, 2016). Learned graph embeddings can be used for a wide variety of tasks, including node classification, relation prediction, and clustering (Hamilton et al., 2017b).
Here, we focus on the relation prediction task, i.e., using the learned representations to predict previously unobserved relationships between the input nodes. The relation prediction task is exceptionally general; for example, it generalizes basic recommender systems, knowledge base completion, and even node classification (ibid.).

2.2. Algorithmic fairness

Unlike previous research on graph embedding, in this work we focus on the challenge of enforcing fairness or invariance constraints on the learned representations. Recent work on fairness in machine learning, including work on fairness in collaborative filtering, involves making predictions that are balanced or invariant with respect to certain sensitive variables (e.g., age or gender) (Chouldechova, 2017; Gajane & Pechenizkiy, 2017; Kamishima et al., 2012; Madras et al., 2018; Zemel et al., 2013; Yao & Huang, 2017). Formally, in the standard fair classification setting we consider a data point $x \in \mathbb{R}^n$, its class label $y \in \mathcal{Y}$, and a binary sensitive attribute $a \in \{0, 1\}$ (e.g., indicating gender). The high-level goal is then to train a model to predict $y$ from $x$, while making this prediction invariant or fair with respect to $a$ (Madras et al., 2018). There are many specific definitions of fairness, such as whether fairness refers to parity or to satisfying certain preferences (see Gajane & Pechenizkiy (2017) for a detailed discussion). In the context of fair machine learning, our core contribution is motivating, implementing, and evaluating an approach to enforce fairness within the context of graph embeddings. There are a number of complications introduced by this setting; for instance, rather than one classification task, we instead have thousands or even millions of interdependent edge relationships.

3. Preliminaries

We consider the general case of embedding a heterogeneous or multi-relational (social) graph $\mathcal{G} = (\mathcal{V}, \mathcal{E})$, which consists of a set of directed edge triples $e = \langle u, r, v \rangle \in \mathcal{E}$, where $u, v \in \mathcal{V}$ are nodes and $r \in \mathcal{R}$ is a relation type. We further assume that each node is of a particular type and that relations may have constraints regarding the types of nodes that they can connect.

Relation prediction. The general relation prediction task on such a graph is as follows. Let $\mathcal{E}_{\text{train}} \subset \mathcal{E}$ denote a set of observed training edges and let $\mathcal{E}^- = \{\langle v_i, r, v_j \rangle : v_i, v_j \in \mathcal{V}, r \in \mathcal{R}\} \setminus \mathcal{E}$ denote the set of negative edges that are not present in the true graph $\mathcal{G}$. Given $\mathcal{E}_{\text{train}}$, we aim to learn a scoring function $s$ such that

$$s(e) > s(e^-), \quad \forall e \in \mathcal{E}, \, e^- \in \mathcal{E}^-. \qquad (1)$$

In other words, the learned scoring function should ideally score any true edge higher than any negative edge.

Embedding-based models. In the context of graph embeddings, we aim to solve this relation prediction task by learning a function $\mathrm{ENC} : \mathcal{V} \to \mathbb{R}^d$ that maps each node $v \in \mathcal{V}$ to an embedding $z_v = \mathrm{ENC}(v)$. In this case, the signature of the score function becomes $s : \mathbb{R}^d \times \mathcal{R} \times \mathbb{R}^d \to \mathbb{R}$, i.e., it takes two node embeddings $z_u, z_v \in \mathbb{R}^d$ and a relation $r \in \mathcal{R}$ and scores the likelihood that the edge $e = \langle u, r, v \rangle$ exists in the graph. Generally, the intuition in embedding-based approaches is that the distance between two node embeddings should encode the likelihood that there is an edge between the nodes.
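As a concrete illustration of this setup, the following is a minimal sketch of an embedding-lookup encoder and a bilinear score function of the form $s(z_u, r, z_v)$. It is written in PyTorch with illustrative module names and dimensions; it is not the implementation used in this paper.

```python
import torch
import torch.nn as nn

class EmbeddingEncoder(nn.Module):
    """ENC: maps node IDs to d-dimensional embeddings via a simple lookup table."""
    def __init__(self, num_nodes: int, dim: int):
        super().__init__()
        self.emb = nn.Embedding(num_nodes, dim)

    def forward(self, node_ids: torch.Tensor) -> torch.Tensor:
        return self.emb(node_ids)

class BilinearScorer(nn.Module):
    """s(z_u, r, z_v): bilinear score with one trainable matrix per relation type."""
    def __init__(self, num_relations: int, dim: int):
        super().__init__()
        self.rel = nn.Parameter(torch.randn(num_relations, dim, dim) * 0.01)

    def forward(self, z_u: torch.Tensor, rel_ids: torch.Tensor, z_v: torch.Tensor) -> torch.Tensor:
        Q = self.rel[rel_ids]                               # (batch, d, d)
        return torch.einsum("bd,bde,be->b", z_u, Q, z_v)    # one score per edge

# Usage: true edges should receive higher scores than negative (non-existent) edges.
enc = EmbeddingEncoder(num_nodes=1000, dim=32)
scorer = BilinearScorer(num_relations=4, dim=32)
u, r, v = torch.tensor([0, 1]), torch.tensor([2, 0]), torch.tensor([5, 7])
scores = scorer(enc(u), r, enc(v))                          # shape: (2,)
```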
Following standard practice, we consider the optimization of these scoring functions using contrastive learning methods that make use of a corruption distribution, such as noise contrastive estimation (Dyer, 2014; Mnih & Teh, 2012) and similar variants (Bose et al., 2018), where the loss over a batch of edges $\mathcal{E}_{\text{batch}} \subseteq \mathcal{E}_{\text{train}}$ is given by:

$$\sum_{e \in \mathcal{E}_{\text{batch}}} L_{\text{edge}}(s(e), s(e^-_1), \ldots, s(e^-_m)), \qquad (2)$$

where $L_{\text{edge}}$ is a per-edge loss function and $e^-_1, \ldots, e^-_m \in \mathcal{E}^-$ are "negative samples", i.e., randomly sampled edges that do not exist in the graph. Loss functions of this form generally attempt to maximize the likelihood of true edges compared to the negative samples (a short code sketch of a loss of this form appears at the end of Section 4.1).

Fairness. In order to incorporate the notion of fairness into the graph embedding setup, we assume that for exactly one node type $\mathcal{T}$, all nodes of this type, i.e., all $u \in \mathcal{T}$, have $K$ categorical sensitive attributes, $a^k_u \in \mathcal{A}_k$, $k = 1, \ldots, K$. For simplicity, we assume that there are no other features or attributes associated with the nodes and edges in the graph (though this assumption can easily be relaxed). The challenge in enforcing fairness is thus to ensure that the learned node embeddings, $z_u$, are not biased or unfair with respect to these sensitive attributes, a point which we formalize in the next section.

4. Invariant graph embeddings

We first motivate and argue in favor of a particular form of fairness (or rather invariance) within the context of graph embeddings. Following this, we outline our compositional and adversarial approach for enforcing these invariance constraints on graph embeddings.

4.1. Pragmatic fairness as invariance

In this paper we consider a simple, user-centric formulation of fairness within the context of social graph embeddings. Using gender as an example of a sensitive attribute and movie recommendation as an example relation prediction task, our approach is guided by the following question: If one gives a user a button that says "Please ignore my gender when recommending movies", what does a user expect from the system after this button is pressed? Here, we accept it as non-controversial that the expectation from the user is that the recommendation does not depend in any way on their gender, i.e., that the recommendation would be the same regardless of their gender.

Formally, given a user $u$, this expectation amounts to an assumption of independence,

$$s(e) \perp a_u \quad \forall v \in \mathcal{V}, r \in \mathcal{R}, \qquad (3)$$

between the recommendation, i.e., the score of the edge, $s(e) = s(\langle z_u, r, z_v \rangle)$, and the sensitive attribute $a_u$. One issue in directly enforcing Equation (3) is that there are many (potentially millions of) possible edges that we might want to score for every node $u \in \mathcal{T}$, making it intractable to enforce independence on each of these decisions individually. However, if we assume that the score function $s(\langle z_u, r, z_v \rangle)$ depends on $u$ only through $u$'s embedding, $z_u$, then we can guarantee the independence in Equation (3) for all edge predictions by enforcing what we call representational invariance:

$$z_u \perp a_u, \quad \forall u \in \mathcal{V}. \qquad (4)$$

In other words, we require that the mutual information $I(z_u, a_u)$ is 0. Generalizing to the setting of multiple sensitive attributes, for a given set of sensitive attributes $S \subseteq \{1, \ldots, K\}$, we would require that

$$I(z_u, a^k_u) = 0, \quad \forall k \in S, \forall u \in \mathcal{V}, \qquad (5)$$

which amounts to the assumption of $|S|$ independent invariance constraints on the $|S|$ distinct sensitive attributes (note that this does not necessarily imply subgroup fairness on the joint distribution; Kearns et al., 2017). Importantly, we assume that the set $S$ is not fixed (e.g., different users might request different invariance constraints). In the language of algorithmic fairness, the representational invariance of Equation (5) implies that traditional demographic parity constraints are satisfied on the sensitive attributes and recommendations.
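Returning to the edge losses of Equation (2), here is a minimal sketch of one common instantiation: a max-margin loss with $m$ negative samples per observed edge (the experiments in Section 5 use a max-margin loss of this form). The function names and the uniform tail-corruption sampler are illustrative assumptions rather than the paper's exact implementation.

```python
import torch

def margin_edge_loss(pos_scores: torch.Tensor,
                     neg_scores: torch.Tensor,
                     margin: float = 1.0) -> torch.Tensor:
    """Max-margin per-edge loss: encourages s(e) > s(e^-) + margin.

    pos_scores: (batch,)   scores of observed edges
    neg_scores: (batch, m) scores of m negative samples per observed edge
    """
    return torch.clamp(margin - pos_scores.unsqueeze(1) + neg_scores, min=0).mean()

def corrupt_tails(edges: torch.Tensor, num_nodes: int, m: int) -> torch.Tensor:
    """Draw m negative samples per <u, r, v> triple by replacing the tail with a
    uniformly random node (an illustrative corruption distribution)."""
    neg = edges.unsqueeze(1).repeat(1, m, 1)                      # (batch, m, 3)
    neg[:, :, 2] = torch.randint(num_nodes, (edges.size(0), m))
    return neg
```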
4.2. Model definition

In this work, we enforce representational invariance constraints on the node embeddings (Equation 5) by introducing an adversarial loss and a technique to filter the embeddings generated by the ENC function. Note, again, that a unique challenge here is that $S$, the set of sensitive attributes we want to be invariant with respect to, is not fixed across nodes; i.e., we may want to enforce invariance on different sets of sensitive attributes for different nodes. Note also that the framework presented in this section is quite general and can function with arbitrary combinations of base node embedding functions ENC and edge-prediction losses $L_{\text{edge}}$ (see Equation 2). We discuss three concrete instantiations of this framework in Section 5.

Compositional encoder. The first key component of our model is generalizing the ENC embedding function to optionally filter out the information about certain sensitive attributes. In particular, for every sensitive attribute $k \in \{1, \ldots, K\}$ we define a filter function $f_k : \mathbb{R}^d \to \mathbb{R}^d$ that is trained to remove the information about the $k$th sensitive attribute. If we want the node embedding to be invariant w.r.t. some set of sensitive attributes $S \subseteq \{1, \ldots, K\}$, we then generate its embedding by composing the output of the $|S|$ filtered embeddings using a compositional encoder:

$$\text{C-ENC}(u, S) = \frac{1}{|S|} \sum_{k \in S} f_k(\mathrm{ENC}(u)). \qquad (6)$$

To train $\text{C-ENC}(u, S)$ we sample a binary mask to determine the set $S$ at every iteration. In this work, we sample the binary mask as a sequence of $K$ independent Bernoulli draws with a common fixed probability $p = 0.5$; however, other application-specific distributions (e.g., incorporating dependencies between the attributes) could be employed. Sampling random binary masks forces the model to produce invariant embeddings for different combinations of sensitive attributes during training, with the hope of generalizing to unseen combinations at inference time, a phenomenon that we empirically validate in Section 5.2.

Adversarial loss. To train the compositional encoder, we employ an adversarial regularizer. For each sensitive attribute $k \in \{1, \ldots, K\}$, we define a discriminator $D_k : \mathbb{R}^d \times \mathcal{A}_k \to [0, 1]$, which attempts to predict the $k$th sensitive attribute from the node embeddings. Assuming we are given an edge-prediction loss function $L_{\text{edge}}$ (as in Equation 2), we can then define our new adversarially regularized per-edge loss as

$$L(e) = L_{\text{edge}}(s(e), s(e^-_1), \ldots, s(e^-_m)) + \lambda \sum_{k \in S} \sum_{a_k \in \mathcal{A}_k} \log(D_k(\text{C-ENC}(u, S), a_k)), \qquad (7)$$

where $\lambda$ is a hyperparameter controlling the strength of the adversarial regularization. To optimize this loss in a minibatch setting, we alternate between two types of stochastic gradient descent updates: (1) $T$ minibatch updates minimizing $L(e)$ with respect to C-ENC (with all the $D_k$ fixed), and (2) $T$ minibatch updates minimizing $L(e)$ with respect to $D_k$, $k = 1, \ldots, K$ (with C-ENC fixed).
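The following is a minimal sketch of how the compositional encoder and the alternating adversarial updates can be implemented. It is illustrative PyTorch, not the released code: the filter architecture, the optimizers, and the particular way the adversarial term is realized (discriminators trained with a cross-entropy objective on the true attribute, encoder penalized by the negative of that objective) are assumptions in the spirit of Equation (7).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CompositionalEncoder(nn.Module):
    """C-ENC(u, S): average of the filtered embeddings f_k(ENC(u)) for k in S."""
    def __init__(self, encoder: nn.Module, num_attrs: int, dim: int):
        super().__init__()
        self.encoder = encoder
        self.filters = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, dim), nn.LeakyReLU(), nn.Linear(dim, dim))
             for _ in range(num_attrs)]
        )

    def forward(self, node_ids: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
        z = self.encoder(node_ids)                               # (batch, d)
        active = [f(z) for k, f in enumerate(self.filters) if mask[k]]
        return torch.stack(active).mean(dim=0)                   # 1/|S| * sum_{k in S} f_k(z)

def train_step(c_enc, discriminators, edge_batch, attrs, edge_loss_fn,
               opt_enc, opt_disc, lam=1.0, p=0.5):
    """One alternating update: (1) encoder/filters step, (2) discriminators step."""
    # Sample which sensitive attributes to filter on this iteration (Bernoulli mask).
    mask = torch.bernoulli(torch.full((len(discriminators),), p)).bool()
    if not mask.any():
        mask[torch.randint(len(discriminators), (1,))] = True
    u, r, v = edge_batch  # node / relation ID tensors for a minibatch of edges

    # (1) Update C-ENC: edge loss minus lambda * discriminator log-likelihood,
    # so the filters are pushed to make the sensitive attributes unpredictable.
    z_u = c_enc(u, mask)
    adv = sum(F.cross_entropy(D(z_u), attrs[:, k])
              for k, D in enumerate(discriminators) if mask[k])
    loss_enc = edge_loss_fn(z_u, r, v) - lam * adv   # edge_loss_fn: batch edge loss given filtered embeddings
    opt_enc.zero_grad(); loss_enc.backward(); opt_enc.step()

    # (2) Update the discriminators on the (detached) filtered embeddings.
    z_u = c_enc(u, mask).detach()
    loss_disc = sum(F.cross_entropy(D(z_u), attrs[:, k])
                    for k, D in enumerate(discriminators) if mask[k])
    opt_disc.zero_grad(); loss_disc.backward(); opt_disc.step()
```

The `.mean(dim=0)` over the active filters implements the $1/|S|$ averaging of Equation (6), and the Bernoulli mask with $p = 0.5$ mirrors the mask-sampling procedure described above.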
Theoretical considerations. For clarity and simplicity, we consider the case of a single binary sensitive attribute, with the theoretical intuitions naturally generalizing to the multi-attribute and multi-class settings. Assuming a single binary sensitive attribute $a_u$, by a simple application of Proposition 2 in Goodfellow et al. (2014), we have the following result (Theorem 1 holds as a consequence of Proposition 2 in Goodfellow et al. (2014) if we simply replace the task of distinguishing real/fake data with classifying a binary sensitive attribute):

Theorem 1. If C-ENC and $D_k$ have enough capacity, $T$ is large enough so that $D_k$ is allowed to reach its optimum on $L(e)$ (with C-ENC fixed), and C-ENC is optimized according to $L(e)$ (with $D_k$ fixed), then $I(z_u, a_u) = 0, \forall u \in \mathcal{T}$.

That is, if we increase the weight of the adversarial regularizer to infinity, the equilibrium of the minimax game in Equation (7) occurs when there is zero mutual information between the sensitive attribute and the embeddings. Of course, as $\lambda \to \infty$, trivial solutions to this game exist (e.g., C-ENC simply outputting a constant value), and in practice setting $\lambda < \infty$ leads to a tradeoff between performance on edge prediction and representational invariance.

5. Experiments

We investigated the impact of enforcing invariance on graph embeddings using three datasets: Freebase15k-237 (www.microsoft.com/en-us/download/details.aspx?id=52312), MovieLens-1M (grouplens.org/datasets/movielens/1m/), and an edge-prediction dataset derived from Reddit (reddit.com). The dataset statistics are given in Table 1. Our experimental setup closely mirrors that of Madras et al. (2018): we jointly train the main model with the adversaries, but when testing invariance, we train a new classifier (with the same capacity as the discriminator) to predict the sensitive attributes from the learned embeddings. The goal of our experiments was to answer three questions:

(Q1) The cost of invariance. What is the tradeoff between enforcing invariance on the representations and accuracy on the main edge prediction task?

(Q2) The impact of compositionality. How does the performance of a compositional approach, which jointly enforces fairness over a set of sensitive attributes, compare to a more traditional model that only enforces fairness on a single attribute?

(Q3) Invariance on unseen combinations. In settings with many sensitive attributes, is our approach able to enforce invariance even on combinations of sensitive attributes that it never saw during training?

Throughout these experiments, we rely on two baselines. First, we compare against baselines that do not include any invariance constraints, i.e., models with $\lambda = 0$. Second, we compare against a non-compositional adversarial approach where we separately train $K$ distinct encoders and $K$ distinct adversaries for each of the $K$ sensitive attributes in the data. This non-compositional adversary is essentially an extension of Edwards & Storkey's (2015) approach to the graph embedding domain.

5.1. Setup and datasets

Before describing our experimental results, we first outline the key properties of the datasets we used, as well as the specific encoders and edge-prediction loss functions used. In all experiments, we used multi-layer perceptrons (MLPs) with leaky ReLU activation functions (Xu et al., 2015) as the discriminators $D_k$ and filters $f_k$. The Appendix contains details on the exact hyperparameters (e.g., number of layers and sizes) used for all the different experiments, as well as details on the training procedures (e.g., number of epochs and data splits). Code to reproduce our results is available at: https://github.com/joeybose/Flexible-Fairness-Constraints.
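For concreteness, a filter $f_k$ and discriminator $D_k$ of this kind might be defined as follows. This is a sketch only; the hidden widths and depths shown here are placeholders rather than the values listed in the Appendix.

```python
import torch.nn as nn

def make_filter(dim: int, hidden: int = 128) -> nn.Module:
    """f_k: R^d -> R^d, an MLP with leaky ReLU activations."""
    return nn.Sequential(
        nn.Linear(dim, hidden), nn.LeakyReLU(0.2),
        nn.Linear(hidden, dim),
    )

def make_discriminator(dim: int, num_classes: int, hidden: int = 128) -> nn.Module:
    """D_k: predicts the k-th sensitive attribute from a filtered embedding."""
    return nn.Sequential(
        nn.Linear(dim, hidden), nn.LeakyReLU(0.2),
        nn.Linear(hidden, hidden), nn.LeakyReLU(0.2),
        nn.Linear(hidden, num_classes),   # logits; softmax/cross-entropy is applied in the loss
    )
```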
FREEBASE15K-237

Freebase15k-237 is a standard benchmark used for knowledge base completion (Toutanova et al., 2015). In this work, we use Freebase15k-237 as a semi-synthetic testbed to evaluate the impact of adversarial regularization. Taking the entity attribute labels from Moon et al. (2017), we used the 3 most common attribute labels (e.g., /award/award_nominee) as sensitive attributes. The goal in this dataset is to perform the standard knowledge base completion task, while having the entity embeddings be invariant with respect to these sensitive attribute labels. While synthetic, this dataset provides a useful reference point due to its popularity in the graph embedding literature.

For our encoder and edge-prediction loss function, we follow Ji et al.'s (2015) TransD approach, since we found this approach gave significant performance boosts compared to simpler models (e.g., TransE). In this model, the encoding of a node/entity depends on the edge relation being predicted, as well as on whether the entity is the head or tail in a relation (i.e., the edge direction matters). In particular, the embedding of the head node (i.e., the source node) in an edge relation is given by:

$$\mathrm{ENC}(u, \langle u, r, v \rangle) = (\mathbf{r}_p \mathbf{u}_p^\top + I_{d \times d})\mathbf{u}, \qquad (8)$$

where $\mathbf{u}, \mathbf{u}_p, \mathbf{r}_p \in \mathbb{R}^d$ are trainable embedding parameters and $I_{d \times d}$ is a $d$-dimensional identity matrix. The encoding function for the tail node is defined analogously. The score function for this approach is given by

$$s(\langle u, r, v \rangle) = -\lVert \mathrm{ENC}(u, \langle u, r, v \rangle) + \mathbf{r} - \mathrm{ENC}(v, \langle u, r, v \rangle) \rVert_2,$$

where $\mathbf{r} \in \mathbb{R}^d$ is another trainable embedding parameter (one per relation). Finally, we use a standard max-margin loss with a single negative sample per positive edge:

$$L_{\text{edge}}(s(e), s(e^-)) = \max(0, 1 - s(e) + s(e^-)). \qquad (9)$$

MOVIELENS-1M

Our second dataset is derived from the MovieLens-1M recommender system benchmark (Harper & Konstan, 2016). This is a standard recommender system benchmark, where the goal is to predict the rating that users assign to movies. However, unlike previous work, in our experiments we treat the user features (age, gender, and occupation) as sensitive attributes (rather than as additional feature information for the recommendation task). Following Berg et al. (2017), we treat this recommendation task as an edge prediction problem between users and movies, viewing the different possible ratings as different edge relations.

For this dataset we use a simple embedding-lookup encoder, where each user and movie is associated with a unique embedding vector in $\mathbb{R}^d$. As a scoring function, we follow Berg et al. (2017) and use a log-likelihood approach:

$$s(\langle u, r, m \rangle) = z_u^\top Q_r z_m - \log\Big(\sum_{r' \in \mathcal{R}} \exp(z_u^\top Q_{r'} z_m)\Big),$$

where the relation matrices $Q_r \in \mathbb{R}^{d \times d}$ are computed as $Q_r = a_{r,1}P_1 + a_{r,2}P_2$, with $a_{r,1}, a_{r,2} \in \mathbb{R}$ and $P_1, P_2 \in \mathbb{R}^{d \times d}$ trainable parameters. In this case, the loss function is simply the negative of the log-likelihood score.
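A minimal sketch of this rating scorer is shown below (illustrative PyTorch; the class and parameter names are placeholders, and the log-sum-exp term assumes the log-likelihood is taken under a softmax over the possible rating relations).

```python
import torch
import torch.nn as nn

class RatingScorer(nn.Module):
    """Log-likelihood rating scorer: s = z_u^T Q_r z_m - log sum_{r'} exp(z_u^T Q_{r'} z_m),
    where each Q_r is a weighted combination of two shared basis matrices P_1, P_2."""
    def __init__(self, num_ratings: int, dim: int):
        super().__init__()
        self.P = nn.Parameter(torch.randn(2, dim, dim) * 0.01)      # P_1, P_2
        self.a = nn.Parameter(torch.randn(num_ratings, 2) * 0.01)   # a_{r,1}, a_{r,2}

    def rating_logits(self, z_u: torch.Tensor, z_m: torch.Tensor) -> torch.Tensor:
        Q = torch.einsum("rk,kde->rde", self.a, self.P)             # Q_r for every rating r
        return torch.einsum("bd,rde,be->br", z_u, Q, z_m)           # (batch, num_ratings)

    def forward(self, z_u: torch.Tensor, rating_ids: torch.Tensor, z_m: torch.Tensor) -> torch.Tensor:
        logits = self.rating_logits(z_u, z_m)
        observed = logits.gather(1, rating_ids.unsqueeze(1)).squeeze(1)
        return observed - torch.logsumexp(logits, dim=1)            # log-softmax score of the observed rating
```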
REDDIT

The final dataset we consider is based on the social media website Reddit, a popular, discussion-based website where users can post and comment on content in different topical communities, called "subreddits". For this dataset, we consider a traditional edge prediction task, where the goal is to predict interactions between users and subreddit communities. To construct the edge prediction task, we examined all comments from the month of November 2017, and we placed an edge between a user and a community if this user commented on that community at least once within this time period. We then took the 10-core of this graph to remove low-degree nodes, which resulted in a graph with approximately 366K users, 18K communities, and 7M edges. Given this graph, the main task is to train an edge-prediction model on 90% of the user-subreddit edges and then predict missing edges in a held-out test set of the remaining edges.

Reddit is a pseudonymous website with no public user attributes. Thus, to define sensitive attributes, we treat certain subreddit nodes as sensitive nodes, and the sensitive attributes for users are whether or not they have an edge connecting to these sensitive nodes. In other words, the fairness objective in this setting is to force the model to be invariant to whether or not a user commented on a particular community. To select the sensitive subreddit communities, we randomly sampled 10 from the top-100 communities by degree (excluding the top-5 highest-degree outlying communities). Note that this setting represents the extreme case where we want the model to be invariant with respect to the existence of particular edges in the input graph.

As with MovieLens-1M, we use a simple embedding-lookup encoder. In this case, there is only a single relation type, indicating whether a Reddit user has commented on a subreddit community. Thus, we employ a simple dot-product based scoring function, $s(\langle u, r, v \rangle) = z_u^\top z_v$, and we use a max-margin loss as in Equation (9).

Table 1. Statistics for the three datasets, including the total number of nodes ($|\mathcal{V}|$), the number of nodes with sensitive attributes ($|\mathcal{T}|$), the number of sensitive attributes, and the total number of edges in the graph.

| Dataset | Total nodes | Nodes with sensitive attributes | # Sensitive attributes | Edges |
|---|---|---|---|---|
| FB15k-237 | 14,940 | 14,940 | 3 | 168,618 |
| MovieLens-1M | 9,940 | 6,040 | 3 | 1,000,209 |
| Reddit comments | 385,735 | 366,797 | 10 | 7,255,096 |

5.2. Results

We now address the core experimental questions (Q1-Q3).

Q1: THE COST OF INVARIANCE

In order to quantify the extent to which the learned embeddings are invariant to the sensitive attributes (e.g., after adversarial training), we freeze the trained compositional encoder C-ENC and train a new MLP classifier to predict each sensitive attribute from the filtered embeddings (i.e., we train one new classifier per sensitive attribute). We also evaluate the performance of these filtered embeddings on the original prediction tasks. In the best case, a newly trained MLP classifier should have random accuracy when attempting to predict the sensitive attributes from the filtered embeddings, but these embeddings should still provide strong performance on the main edge prediction task. Thus, for binary sensitive attributes, an ideal result is an AUC score of 0.5 when attempting to predict the sensitive attributes from the learned embeddings.
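A minimal sketch of this audit is shown below, assuming the filtered embeddings have already been computed and using a scikit-learn logistic classifier as the probe for a binary attribute; the experiments instead use an MLP probe with the same capacity as the discriminator.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

def audit_invariance(filtered_embeddings: np.ndarray, sensitive_attr: np.ndarray) -> float:
    """Train a fresh classifier to predict a binary sensitive attribute from frozen,
    filtered embeddings; an AUC close to 0.5 indicates the attribute was removed."""
    X_tr, X_te, y_tr, y_te = train_test_split(
        filtered_embeddings, sensitive_attr, test_size=0.2, random_state=0)
    clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    return roc_auc_score(y_te, clf.predict_proba(X_te)[:, 1])
```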
Overall, we found that on the more realistic social recommendation datasets, i.e., the MovieLens-1M and Reddit datasets, our approach was able to achieve a reasonable tradeoff, with the near-complete removal of the sensitive information leading to a roughly 10% relative error increase on the edge prediction tasks. In other words, on these two datasets the sensitive attributes were nearly impossible to predict from the filtered embeddings, while the accuracy on the main edge prediction task was roughly 10% worse than a baseline approach that does not include the invariance constraints.

Table 2 and Figure 2 summarize these results for the MovieLens data, where we can see that the accuracy of classifying the sensitive attributes is on par with a majority-vote classifier (Table 2), while the RMSE degrades from 0.865 to 1.01 with the compositional adversary. Figures 5 and 6 illustrate this tradeoff and show how the RMSE for the edge prediction task and the ability to predict the sensitive attributes change as we vary the regularization strength $\lambda$. As expected, increasing $\lambda$ does indeed produce more invariant embeddings, but at the cost of higher RMSE values. Figures 3 and 4 similarly summarize these results on Reddit.

Figure 2. Performance on the edge prediction (i.e., recommendation) task on MovieLens, using RMSE as in Berg et al. (2017).

Figure 3. Performance on the edge prediction (i.e., recommendation) task on the Reddit data. Evaluation uses the AUC score, since there is only one edge/relation type.

Figure 4. Ability to predict sensitive attributes on the Reddit data when using various embedding approaches. Bar plots correspond to the average AUC across the 10 binary sensitive attributes.

Figure 5. Tradeoff of the gender AUC score for a compositional adversary versus different values of $\lambda$.

Figure 6. Tradeoff of RMSE for a compositional adversary versus different values of $\lambda$.

Table 2. Ability to predict sensitive attributes on the MovieLens data when using various embedding approaches. For the gender attribute the score is AUC, while for the age and occupation attributes the score is micro-averaged F1. The columns represent the different embedding approaches (e.g., with or without adversarial regularization), while the rows are the attribute being classified.

| Attribute | Baseline (no adversary) | Gender adversary | Age adversary | Occupation adversary | Comp. adversary | Majority classifier | Random classifier |
|---|---|---|---|---|---|---|---|
| Gender | 0.712 | 0.532 | 0.541 | 0.551 | 0.511 | 0.5 | 0.5 |
| Age | 0.412 | 0.341 | 0.333 | 0.321 | 0.313 | 0.367 | 0.141 |
| Occupation | 0.146 | 0.141 | 0.108 | 0.131 | 0.121 | 0.126 | 0.05 |

Interestingly, we found that on the Freebase15k-237 dataset it was not possible to completely remove the sensitive information without incurring a significant cost on the original edge prediction task (Table 3). This result is not entirely surprising, since for this dataset the sensitive attributes were synthetically constructed from entity type annotations, which are presumably very relevant to the main edge/relation prediction task. However, it is an interesting point of reference that demonstrates the potential limitations of removing sensitive information from learned graph embeddings.

Table 3. Ability to predict sensitive attributes on the Freebase15k-237 data when using various embedding approaches. AUC scores are reported, since all the sensitive attributes are binary. The mean rank on the main edge-prediction task is also reported.

| | Baseline (no adversary) | Non-comp. adversary | Comp. adversary |
|---|---|---|---|
| Attribute 0 | 0.97 | 0.82 | 0.77 |
| Attribute 1 | 0.99 | 0.81 | 0.79 |
| Attribute 2 | 0.98 | 0.81 | 0.81 |
| Mean rank | 285 | 320 | 542 |
Q2: THE IMPACT OF COMPOSITIONALITY

In all our experiments, we observed that our compositional approach performed favorably compared to an approach that individually enforced fairness on each attribute. In fact, on the MovieLens-1M data (and the synthetic Freebase15k-237 data), the compositionally trained adversary outperformed the individually trained adversaries in terms of removing information about the sensitive attributes (Table 2). In other words, training a model to jointly remove information about the sensitive attributes using the compositional encoder (Equation 6) removed more information about the sensitive attributes than training separate adversarially regularized embedding models for each sensitive attribute. This result is not entirely surprising, as it essentially indicates that the different sensitive attributes (age, gender, and occupation) are correlated in this dataset. Nonetheless, it is a positive result indicating that the extra flexibility afforded by the compositional approach does not necessarily lead to a decrease in performance. That said, on the Reddit data we observed the opposite trend and found that the compositional approach performed worse in terms of its ability to remove information about the sensitive attributes (Figure 4), as well as incurring a small drop in performance on the main edge prediction task (Figure 3).

Q3: INVARIANCE ON UNSEEN COMBINATIONS

One of the key benefits of the compositional encoder is that it can flexibly generate embeddings that are invariant to any subset $S \subseteq \{1, \ldots, K\}$ of the sensitive attributes in a domain. In other words, at inference time, it is possible to generate $2^K$ distinct embeddings for an individual node, depending on the exact set of invariance constraints. However, given this combinatorially large output space, a natural question is whether this approach performs well when generalizing to unseen combinations of sensitive attributes. We tested this phenomenon on the Reddit dataset, since it has the largest number of sensitive attributes (10, compared to 3 sensitive attributes for the other two datasets). During training we held out 10% of the combinations of sensitive attributes, and we then evaluated the model's ability to enforce invariance on this held-out set. As we can see in Figure 4, the performance drop for the held-out combinations is very small (0.025), indicating that our compositional approach is capable of effectively generalizing to unseen combinations. The Appendix contains further results demonstrating how this trend scales gracefully when we increase the number of sensitive attributes from 10 to 50.

QUANTIFYING BIAS

In all of the above results, we used the ability to classify the sensitive attributes as a proxy for bias being contained within the embeddings. While this is a standard approach (e.g., see Edwards & Storkey, 2015) and an intuitive method for evaluating representational invariance, a natural question is whether the adversarial regularization also decreases bias in the edge prediction tasks. Ideally, after filtering the embeddings, we would have that the edge predictions themselves are not biased according to the sensitive attributes. To quantify this issue, we computed a prediction bias score for the MovieLens-1M dataset: for each movie, we computed the absolute difference between the average rating predicted for each possible value of a sensitive attribute, and we then averaged these scores over all movies. Thus, for example, the bias score for gender corresponds to the average absolute difference in predicted ratings for male vs. female users, across all movies.
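A minimal sketch of this bias score is given below for a binary attribute such as gender; for multi-valued attributes one would average the pairwise gaps. It assumes a pandas table of predicted ratings with one row per (user, movie) pair, and the column names are illustrative.

```python
import pandas as pd

def prediction_bias(preds: pd.DataFrame, attr_col: str) -> float:
    """Average over movies of the absolute gap between the mean predicted rating
    for the two values of a binary sensitive attribute (e.g., gender).

    `preds` is assumed to have one row per (user, movie) pair with columns
    'movie_id', 'predicted_rating', and a user-level sensitive attribute column."""
    per_movie = preds.pivot_table(index="movie_id", columns=attr_col,
                                  values="predicted_rating", aggfunc="mean")
    g0, g1 = per_movie.columns[:2]
    return (per_movie[g0] - per_movie[g1]).abs().mean()
```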
From the perspective of fairness, our adversary imposes a soft demographic parity constraint on the main task. A reduction in prediction bias across the different subgroups thus represents an empirical measure of achieving demographic parity. Figure 7 highlights these results, which show that adversarial regularization does indeed drastically reduce prediction bias. Interestingly, using a compositional adversary works better than a single adversary for a specific sensitive attribute, which we hypothesize is due to correlation between the sensitive attributes.

Figure 7. Prediction bias for different sensitive attributes under three settings in MovieLens-1M.

6. Discussion and conclusion

Our work sheds light on how fairness can be enforced in graph representation learning, a setting that is highly relevant to large-scale social recommendation and networking platforms. We found that using our proposed compositional adversary allows us to flexibly accommodate unseen combinations of fairness constraints without explicitly training on them. Crucially, this highlights how fairness could be deployed in a real-world, user-driven setting, where it is necessary to optionally enforce a large number of possible invariance constraints over learned graph representations.

In terms of limitations and directions for future work, one important limitation is that we only consider one type of adversarial loss to enforce fairness. While this adversarial loss is theoretically motivated and known to perform well, there are other recent variations in the literature (e.g., Madras et al., 2018) as well as related non-adversarial regularizers (e.g., Zemel et al., 2013). Also, while we considered imposing fairness over sets of attributes, we did not explicitly model subgroup-level fairness (Kearns et al., 2017). Extending and testing our framework with these alternatives is a natural direction for future work.

There are also important questions about how our framework translates to real-world production systems. For instance, in this work we enforced fairness with respect to randomly sampled sets of attributes, but in real-world environments these sets of attributes would come from user preferences, which may themselves be biased; e.g., it might be more common for female users to request fairness than male users, potentially leading to new kinds of demographic inequalities. Understanding how these preference biases could impact our framework is an important direction for future inquiry.

Acknowledgements

The authors would like to thank the anonymous ICML reviewers for their helpful comments. In addition, we would like to thank Koustuv Sinha, Riashat Islam, and Andre Cianflone for helpful feedback on earlier drafts of this work. This research was funded in part by an academic grant from Microsoft Research, as well as a Canada CIFAR Chair in AI, held by Prof. Hamilton.

References

Belkin, M. and Niyogi, P. Laplacian eigenmaps and spectral techniques for embedding and clustering. In NIPS, 2002.

Berg, R. v. d., Kipf, T. N., and Welling, M. Graph convolutional matrix completion. arXiv preprint arXiv:1706.02263, 2017.
Bose, A. J., Ling, H., and Cao, Y. Adversarial contrastive estimation. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Long Papers), 2018. URL https://arxiv.org/abs/1805.03642.

Cai, H., Zheng, V. W., and Chang, K. A comprehensive survey of graph embedding: problems, techniques and applications. IEEE Transactions on Knowledge and Data Engineering, 2018.

Chouldechova, A. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2):153-163, 2017.

Dyer, C. Notes on noise contrastive estimation and negative sampling. arXiv preprint arXiv:1410.8251, 2014.

Edwards, H. and Storkey, A. Censoring representations with an adversary. In ICLR, 2015.

Faust, K. Comparison of methods for positional analysis: Structural and general equivalences. Soc. Net., 10(4):313-341, 1988.

Gajane, P. and Pechenizkiy, M. On formalizing fairness in prediction with machine learning. arXiv preprint arXiv:1710.03184, 2017.

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. Generative adversarial nets. In NIPS, pp. 2672-2680, 2014.

Grover, A. and Leskovec, J. node2vec: Scalable feature learning for networks. In KDD, 2016.

Hamilton, W., Ying, R., and Leskovec, J. Inductive representation learning on large graphs. In NIPS, 2017a.

Hamilton, W., Ying, R., and Leskovec, J. Representation learning on graphs: Methods and applications. IEEE Data Engineering Bulletin, 2017b.

Harper, F. M. and Konstan, J. A. The MovieLens datasets: History and context. ACM Transactions on Interactive Intelligent Systems (TiiS), 5(4):19, 2016.

Ji, G., He, S., Xu, L., Liu, K., and Zhao, J. Knowledge graph embedding via dynamic mapping matrix. In ACL, 2015.

Kamishima, T., Akaho, S., Asoh, H., and Sakuma, J. Fairness-aware classifier with prejudice remover regularizer. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 35-50. Springer, 2012.

Kearns, M., Neel, S., Roth, A., and Wu, Z. Preventing fairness gerrymandering: Auditing and learning for subgroup fairness. arXiv:1711.05144, 2017.

Kipf, T. and Welling, M. Variational graph auto-encoders. In NIPS Workshop on Bayesian Deep Learning, 2016.

Madras, D., Creager, E., Pitassi, T., and Zemel, R. Learning adversarially fair and transferable representations. 2018.

Majone, G. Social space and social distance: some remarks on metric methods in data analysis. Quality and Quantity, 6(2):239-252, 1972.

Mathieu, M., Zhao, J., Zhao, J., Ramesh, A., Sprechmann, P., and LeCun, Y. Disentangling factors of variation in deep representation using adversarial training. In NIPS, 2016.

Mnih, A. and Teh, Y. W. A fast and simple algorithm for training neural probabilistic language models. 2012.

Moon, C., Jones, P., and Samatova, N. F. Learning entity type embeddings for knowledge graph completion. In CIKM, 2017.

Ng, A., Jordan, M., Weiss, Y., et al. On spectral clustering: Analysis and an algorithm. In NIPS, 2001.

Perozzi, B., Al-Rfou, R., and Skiena, S. DeepWalk: Online learning of social representations. In KDD, 2014.

Toutanova, K., Chen, D., Pantel, P., Poon, H., Choudhury, P., and Gamon, M. Representing text for joint embedding of text and knowledge bases. In EMNLP, 2015.

Wang, D., Cui, P., and Zhu, W. Structural deep network embedding. In KDD, 2016.

Xu, B., Wang, N., Chen, T., and Li, M. Empirical evaluation of rectified activations in convolutional network. Deep Learning Workshop, ICML 2015, 2015.
Yao, S. and Huang, B. New fairness metrics for recommendation that embrace differences. In Workshop on Fairness, Accountability, and Transparency in Machine Learning (FATML), June 2017.

Zemel, R., Wu, Y., Swersky, K., Pitassi, T., and Dwork, C. Learning fair representations. In International Conference on Machine Learning, pp. 325-333, 2013.