# subset_selection_via_implicit_utilitarian_voting__462494e2.pdf

Subset Selection Via Implicit Utilitarian Voting

Ioannis Caragiannis University of Patras, Greece

caragian@ceid.upatras.gr

Swaprava Nath Carnegie Mellon University, USA

swapravn@cs.cmu.edu

Ariel D. Procaccia Carnegie Mellon University, USA

arielpro@cs.cmu.edu

Nisarg Shah Carnegie Mellon University, USA

nkshah@cs.cmu.edu

How should one aggregate ordinal preferences expressed by voters into a measurably superior social choice? A well-established approach which we refer to as implicit utilitarian voting assumes that voters have latent utility functions that induce the reported rankings, and seeks voting rules that approximately maximize utilitarian social welfare. We extend this approach to the design of rules that select a subset of alternatives. We derive analytical bounds on the performance of optimal (deterministic as well as randomized) rules in terms of two measures, distortion and regret. Empirical results show that regret-based rules are more compelling than distortion-based rules, leading us to focus on developing a scalable implementation for the optimal (deterministic) regret-based rule. Our methods underlie the design and implementation of an upcoming social choice website.

1 Introduction

We are interested in the classic social choice problem of aggregating the preferences of a set of voters represented as rankings over a set of alternatives into a collective decision. Traditional social choice theory typically takes a normative approach, by specifying desirable axioms that the aggregation method (also known as a voting rule) should satisfy [Arrow, 1951]. In contrast, researchers in computational social choice [Brandt et al., 2016] often advocate quantitative approaches to the same problem. The high-level idea is to identify a compelling objective function, and design voting rules that optimize this function.

Here we focus on a speciﬁc objective function: utilitarian social welfare. Speciﬁcally, we assume that each voter assigns a utility to each possible outcome, and the socially optimal outcome maximizes the sum of utilities. This sounds simple enough at ﬁrst glance, but there is a major obstacle we must overcome: voters preferences are expressed as ordinal preferences (rankings), rather than cardinal preferences (utilities). While this reduces the cognitive load on voters, and

makes preference elicitation much easier, it does seem to be at odds with the utilitarian viewpoint.

Procaccia and Rosenschein [2006] reconcile these differences via an approach that we refer to as implicit utilitarian voting.1 They propose that voters have latent utility functions, and report rankings that are consistent with these utilities, that is, the voters rank the alternatives by their utility. The performance of a voting rule which can only access the submitted rankings, not the implicit utility functions can then be quantiﬁed via a measure called distortion: the worst-case (over utility functions consistent with the reported proﬁle of rankings) ratio between the social welfare of the optimal (welfare-maximizing) alternative, and the social welfare of the alternative selected by the voting rule. While Procaccia and Rosenschein focus on analyzing the distortion of existing voting rules, Boutilier et al. [2015] design voting rules that minimize distortion. In particular, they bound the worst-case distortion, and show that the distortion-minimizing (randomized) voting rule can be implemented in polynomial time.

The work of Boutilier et al. [2015] provides a good understanding of optimized aggregation of rankings from the utilitarian viewpoint but only when a single alternative is selected by the voting rule. Indeed, this understanding does not extend to common applications that require selection of a subset of alternatives, such as choosing a committee, or selecting restaurants for the next four group lunches. Our goal is therefore to

...build on the utilitarian approach to design optimal voting rules for selecting a subset of alternatives, and understand the guarantees they provide, as well as their performance in practice.

We make four main contributions. First, on a conceptual level, we introduce the additive notion of regret into the implicit utilitarian voting setting, as an alternative to the multiplicative notion of distortion. Second, in Section 3, we derive worst-case bounds on the distortion and regret of optimal deterministic and randomized voting rules. Third, in Section 4, we compare the worst-case-optimal deterministic voting rules with respect to distortion and regret denoted

1Cf. utilitarian voting, which has sporadically been used to refer to both approval voting and range voting.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16)

reg, respectively with a slew of well-known voting rules, in terms of average-case distortion and regret, using experiments on synthetic and real data. We ﬁnd that f

reg outperforms all other rules on average, even when measuring distortion! Fourth, in Section 5, we develop a scalable implementation for f

reg (which, we show, is NP-hard to compute).

1.1 Direct Real-World Implications

Research in computational social choice has frequently been justiﬁed by potential applications in multiagent systems. But recently researchers have begun to realize that, arguably, the most exciting products of this research are computer programs that help humans make decisions via AI-driven algorithms. One example is Spliddit (www.spliddit.org), a fair division website [Goldman and Procaccia, 2014]. In the voting space, existing examples include Whale (whale3.noiraudes.net/whale3/) and Pnyx (pnyx.dss.in.tum.de) but these websites generally adopt the axiomatic viewpoint.

Since May 2015, some of us have been working on the design and implementation of a new not-for-proﬁt social choice website, Robo Vote (www.robovote.org), which is scheduled to launch in May 2016. The novelty of Robo Vote is that it relies on optimization-based approaches. For the case of objective votes when a ground truth ranking of the alternatives exists (e.g., the order of different stocks by the relative change in their prices tomorrow) Robo Vote implements voting rules that pinpoint the most likely best alternative [Young, 1988], or the set most likely to contain it [Procaccia et al., 2012]. For the case of subjective votes the classic setting which is the focus of this paper, with applications to everyday scenarios such as a group of friends selecting a movie to watch or a restaurant to go to we use the results of Boutilier et al. [2015] to select a single alternative. But, previously, the extension to subset selection was unavailable this is precisely the motivation for the work described herein. Based on the results of Sections 4 and 5, we have implemented the deterministic regret minimization rule on Robo Vote.

1.2 Related Work

In addition to the aforementioned papers [Procaccia and Rosenschein, 2006; Boutilier et al., 2015], several other papers employ the notion of distortion to quantify how close one can get to maximizing utilitarian social welfare when only ordinal preferences are available [Caragiannis and Procaccia, 2011; Anshelevich et al., 2015; Anshelevich and Sekar, 2016]. In particular, Anshelevich et al. [2015] study the same setting as Boutilier et al. [2015], but in addition assume the preferences of voters are consistent with distances in a metric space. We refer the reader to the paper by Boutilier et al. [2015, Section 1.2] for a thorough discussion of work (in philosophy, economics, and social choice theory) related to implicit utilitarian voting more broadly.

There is quite a bit of work in computational social choice on voting rules that select subsets of alternatives. Typically it is assumed that ordinal preferences are translated into a position-based score for each alternative (in contrast to our

work). Just to give a few examples, under the Chamberlin Courant method, each voter assigns a score to a set equal to the highest score of any alternative in the set, and the (computationally hard) objective is to choose a subset of size k that maximizes the sum of scores [Chamberlin and Courant, 1983; Procaccia et al., 2008]. Skowron et al. [2015] generalize the way in which the score of a voter for a subset of alternatives is computed. The budgeted social choice framework of Lu and Boutilier [2011a] is more general in that the number of alternatives to be selected is not ﬁxed; rather, each alternative has a cost that must be paid to add it to the selection.

2 The Model

Let [t] = {1, . . . , t}. Let A be the set of alternatives, and denote m = |A|. Let N = [n] be the set of voters. Let L = L(A) denote the set of rankings over the alternatives. Each voter i 2 [n] submits a ranking σi 2 L over the alternatives, and which can alternatively be seen as a permutation of A. Therefore, σi(a) is the position in which voter i ranks alternative a (1 is best, m is worst). Moreover, a σi b denotes that voter i prefers alternative a to alternative b. The collection of voters (submitted) rankings is called the preference proﬁle, and denoted by ~σ 2 Ln.

We assume the rankings are induced by comparisons between the voters underlying utilities. For i 2 N and a 2 A, let ui(a) 2 [0, 1] be the utility of voter i for alternative a. As in previous papers [Boutilier et al., 2015; Caragiannis and Procaccia, 2011], we assume that the utilities are normalized such that P

a2A ui(a) = 1 for all i 2 N. The collection of voter utilities, denoted ~u, is called the utility proﬁle. We say that utility proﬁle ~u is consistent with preference proﬁle ~σ denoted ~u . ~σ if for all a, b 2 A and i 2 N, a σi b implies ui(a) ui(b).

Next we need to deﬁne the utility of a voter for a set of alternatives. For S A, we deﬁne ui(S) = maxa2S ui(a), that is, each voter derives utility for his favorite alternative in the set; this is in the same spirit as previous papers on set selection [Chamberlin and Courant, 1983; Monroe, 1995; Procaccia et al., 2008; Lu and Boutilier, 2011a]. Then, the (utilitarian) social welfare of S given the utility proﬁle ~u is sw(S, ~u) = Pn

i=1 ui(S). We are interested in voting rules that, given a preference proﬁle, select a subset of given cardinality k.2 Therefore, it will be useful to denote Ak = {S A : |S| = k}. In order to unify notation, we directly deﬁne a randomized voting rule as a function f : Ln ! (Ak), that is, the rule is allowed to select alternatives randomly, and formally f(~σ) is a probability distribution over Ak. A deterministic voting rule simply gives probability 1 to a speciﬁc subset.

A voting rule can only access the preference proﬁle ~σ, yet the goal is to maximize social welfare with respect to the latent utility function ~u .~σ. We study two notions that quantify how well a rule achieves this goal: distortion and regret.

The distortion [Procaccia and Rosenschein, 2006] of a

2Formally, this is a special case of social choice correspondences with ﬁxed output cardinality [Campbell and Kelly, 1996].

(randomized) voting rule f on a preference proﬁle ~σ is

dist(f,~σ) = sup

max S2Ak sw(S, ~u)

E[sw(f(~σ), ~u)] .

In words, it is the worst-case over utility proﬁles consistent with the given preference proﬁle ratio between the social welfare of the best subset, and the expected social welfare of the subset selected by the voting rule. We deﬁne the distortion of a voting rule f by taking the worst case over preference proﬁles: dist(f) = max~σ2Ln dist(f,~σ).

The second measure is regret. While it has not been studied as part of the agenda of implicit utilitarian voting, it has been explored in other social choice settings, especially partial preferences [Lu and Boutilier, 2011b]; similar measures have been extensively studied in decision theory and machine learning [Blum and Mansour, 2007; Bubeck and Cesa Bianchi, 2012]. The regret of a (randomized) voting rule f on a preference proﬁle ~σ is given by

reg(f,~σ) = 1

max S2Ak sw(S, ~u) E[sw(f(~σ), ~u)]

As before, deﬁne the regret of a rule f to be reg(f) = max~σ2Ln reg(f,~σ). We divide by n because the total (worst-case) regret of any voting rule f is provably linear in n (so this is per vote regret). Note that distortion is a multiplicative measure of loss, whereas regret is its additive version.

3 Worst-Case Bounds In this section we provide bounds on worst-case distortion and regret, for both deterministic and randomized voting rules. Boutilier et al. [2015] show that for selecting a single winner (k = 1), we can achieve O(pm log m) distortion using a randomized rule, where log m is the iterated logarithm of m (the number of alternatives). This bound is asymptotically almost tight: they also show that the worstcase distortion is always (pm).

For a large k, though, one can hope for a better bound. Clearly, when k = m there is only one voting rule (which selects every alternative), and its distortion is 1. More generally, it is easy to show that the voting rule f that selects a subset from Ak uniformly at random has dist(f) m/k. However, since we can already achieve O(pm log m) distortion for k = 1, a bound of m/k provides an improvement only for k = (pm/ log m). Can we achieve better distortion for smaller values of k as well? It is not even clear whether the optimal worst-case distortion should monotonically decrease in k, because as our ﬂexibility grows with k, so does the ﬂexibility of the welfare-maximizing solution. In fact, a part of our main result shows that the worst-case distortion remains (pm) for all values of k up to (pm). Theorem 1. Let m = |A|, and let k be the number of alternatives to be selected.

1. Distortion, deterministic rules: There exists a determin-

istic voting rule f with dist(f ) 1 + m (m k)/k. Moreover, for every deterministic voting rule f,

1 + m(m 3k)

9 , 1 + m if m

2 , 1 + m(m k)

k otherwise.

These bounds are tight up to a constant factor of 7.3. 2. Distortion, randomized rules: There exists a random-

ized voting rule f such that

2pm Hm if k 2 m Hm

m k if 2 m Hm

k otherwise,

where Hm = (log m) is the mth harmonic number. Moreover, for every randomized voting rule f,

2 if k m (pm 1)

m k+m/k otherwise.

These bounds are tight up to a factor of 9.4 m1/6. 3. Regret, deterministic rules: There exists a deterministic

voting rule f such that

m otherwise,

and this upper bound is completely tight. 4. Regret, randomized rules: There exists a randomized

voting rule f such that reg(f ) 1/2

. Moreover, for every randomized voting rule f,

( 1 4 if k m/2

All the upper bounds above can be achieved via polynomialtime algorithms.

Det-UB Det-LB Rand-UB Rand-LB

0 50 100 100

(a) Distortion

Figure 1: The upper and lower bounds on worst-case distortion and regret for m = 100.

The bounds presented above are simpliﬁed forms of the exact bounds that we derive. Figure 1 shows our exact bounds for m = 100.3

The intricate proof of Theorem 1 appears in the full version of the paper.4 Below, we just sketch one part of the proof that we ﬁnd especially interesting.

3The second upper bound in part 2 of Theorem 1 (which increases with k) does not play a role unless m is very large.

4Available at: www.cs.cmu.edu/ arielpro/papers.html

Proof sketch of the upper bound in part 2 of Theorem 1. Our construction builds on the one used by Boutilier et al. [2015] for k = 1, but uses additional tools and introduces novel techniques. As mentioned at the beginning of this section, choosing a set uniformly at random from Ak (under which the marginal probability of every alternative being chosen is k/m) has distortion at most m/k. However, this approach does not work well if some alternatives are signiﬁcantly better than others.

In that case, one may wish to choose the alternatives with probabilities proportional to their quality . For a 2 A, let us deﬁne its quality by its harmonic score har(a) = P

i2[n] 1/σi(a). Then, we wish to choose alternative a with marginal probability k har(a)/ P

b2A har(b). Note that this quantity may be greater than 1. Moreover, this approach fails when all sets are almost equally good. Hence, we employ a combination of the two approaches.

Fix 0 1, and for an alternative a 2 A deﬁne

m + (1 ) k har(a) P

b2A har(b). (1)

Using the bihierarchy extension [Budish et al., 2013] of the Birkhoff-von Neumann theorem [Birkhoff, 1946; von Neumann, 1953], we can show that there exists a distribution over Ak under which the marginal probabilities of selected alternatives are consistent with Equation (1) if and only if

8a 2 A, 0 pa 1 and

Suppose such a distribution D exists. Consider a preference proﬁle ~σ and a utility proﬁle ~u . ~σ. Let S 2 arg max S2Ak sw(S, ~u). Deﬁne

where Hm = Pm

t=1 1/t is the mth harmonic number. Note that P

a2A har(a) = n Hm. Now, consider two cases.

Case 1: Suppose sw(S , ~u) n X. Then,

ES D[sw(S, ~u)] =

ui(a) Pr S D[a 2 S]

Hence, the distortion is

sw(S , ~u) ES D[sw(S, ~u)] n X n/m = X m

Case 2: Suppose sw(S , ~u) > n X. Then, for each alternative a 2 S , let Na denote the subset of voters who rank a above any other alternative of S , i.e.,

Na = {i 2 [n] : 8b 2 S \ {a}, a σi b, }.

Let sw Na(S, ~u) denote the welfare of the voters in Na for the set of alternatives S under the utility proﬁle ~u. Let Ta denote the total utility that agents in Na have for alternative a, i.e., Ta = P

i2Na ui(a). It can be shown (although it is nontrivial) that har(a) Ta for all a 2 A. Because {Na}a2S is a partition of the set of voters, we have

ES D[sw(S, ~u)] = ES D

sw Na(S, ~u)

Ta Pr S D[a 2 S]

Ta (1 ) k har(a) P

(sw(S , ~u))2 .

Here, the fourth transition uses har(a) Ta, the ﬁfth transition uses the power-mean inequality, and the ﬁnal transition uses sw(S , ~u) = P

a2S Ta. Now, the distortion is

sw(S , ~u) ES D[sw(S, ~u)] n Hm (1 ) sw(S , ~u) <

where the ﬁnal transition uses our assumption sw(S , ~u) > n X along with the deﬁnition of X.

Combined analysis: In both cases, the distortion is at most p

m Hm/( (1 )). The ﬁnal step involves choosing the optimal value of by minimizing this quantity subject to our constraints: pa 1 for all a 2 A. Simplifying the bound obtained along with our universal distortion bound of m/k yields the required upper bound.

4 Empirical Comparisons In Section 3 we provided analytical results for both deterministic and randomized rules. In our view, randomized rules are especially practicable when the output distribution is sampled multiple times, or when the voters are well-informed, or when the voters are indifferent about the outcome (e.g., they are software agents). Moreover, we believe that the results for randomized rules are of substantial theoretical interest. But our work is partly driven by its direct applications in Robo Vote (see Section 1.1), which does not satisfy the above conditions. This leads us to use deterministic voting rules, which is what we focus on hereinafter.

reg be the deterministic rules that minimize the worst-case distortion and regret, respectively, on every given preference proﬁle. The deterministic results of Section 3 establish upper and lower bounds on their worst-case

dist Plurality Borda STV Other

1 2 3 4 1.05

(a) Avg. Distortion

(b) Avg. Regret

Figure 2: Uniformly random utility proﬁles.

(a) Avg. Distortion

1 2 3 4 0.02

(b) Avg. Regret

Figure 3: Utility proﬁles from the Jester dataset.

(a) Sushi: Avg. Distortion

(b) Sushi: Avg. Regret

(c) T Shirt: Avg. Distortion

(d) T Shirt: Avg. Regret

Figure 4: Preference proﬁles from the Sushi and the T-Shirt datasets, uniformly random consistent utility proﬁles.

distortion/regret. In this section, we evaluate their averagecase performance on simulated as well as real data, and compare them against nine well-known voting rules: plurality, approval voting, Borda count, STV, Kemeny s rule, the maximin rule, Copeland s rule, Bucklin s rule, and Tideman s rule.5

We perform three experiments: (i) choosing a utility proﬁle uniformly at random from the simplex of all utility proﬁles, (ii) drawing a real-world utility proﬁle from the Jester datasets [Goldberg et al., 2001], and (iii) drawing a realworld preference proﬁle from the Pref Lib datasets [Mattei and Walsh, 2013], and choosing a consistent utility proﬁle uniformly at random. For each experiment, we have 8 voters and 10 alternatives, and test for k 2 [4].6 For each setting, we perform 10 000 random simulations, and measure both distortion and regret for the actual utility proﬁle, as opposed to the worst-case utility proﬁle. The ﬁgures show the average performance with 95% conﬁdence intervals.

In all of our simulations, we observed that three of the classical voting rules stand out: Borda count performs well for choosing a single alternative (but not for choosing larger

5For the score-based rules, the k-subset is selected by picking the top k alternatives based on their scores.

6In Robo Vote, we expect typical instances to have few voters and alternatives. But we chose m > 2k because otherwise the problem would be trivial: for k m/2, picking the top k alternatives based on plurality scores is optimal for both distortion and regret.

subsets) whereas plurality and STV perform well for choosing larger subsets (but not for choosing a single alternative). Hence, all of our graphs speciﬁcally distinguish these three rules in addition to f

reg. Figure 2 shows the results for the ﬁrst experiment where we choose the utility proﬁle uniformly at random. Figure 3 shows the results for the second experiment where real-world utility proﬁles are drawn from one of the Jester datasets, in which more than 50 000 voters rated 150 jokes on a realvalued scale; the results from the other Jester dataset are almost identical. Finally, Figure 4 shows the results for the third experiment where real-world preference proﬁles are drawn from the Sushi dataset (5 000 voters ranking 100 different kinds of sushi) and the T-Shirt dataset (30 voters ranking 11 T-shirt designs) from Pref Lib. Experiments on other datasets from Pref Lib (AGH Course Selection, Netﬂix, Skate, and Web Search) yielded similar results.

Right off the bat, one can observe that the average-case distortion and regret values are much lower than their worst-case counterparts. For example, average regret is generally lower than 0.1 compare with the tight worst-case deterministic bound of 1/2 for k m/2.

Much to our surprise, in all of our experiments, f

reg outperforms f

dist in terms of both average-case distortion (multiplicative loss) and regret (additive loss). While both measures of loss have been studied extensively in the literature,

we are not aware of any previous work that compares the two approaches. At least in our social choice domain, the regretbased approach is clearly better on average.

Moreover, in all cases but one (k = 1 in the Jester experiment), f

reg also outperforms all the classical voting rules under consideration. We therefore conclude that, on random as well as on real-world instances, f

reg provides superior performance in terms of social welfare maximization.

5 Computation and Implementation In this section, we analyze and compare the two deterministic optimal rules f

reg from a computational viewpoint. Selecting optimal subsets turns out to be challenging, as both rules are NP-hard to compute; the proof of this nontrivial result appears in the full version. Theorem 2. Given a preference proﬁle ~σ and an integer k, computing a k-subset of alternatives that has the minimum distortion or the minimum regret on ~σ is NP-hard.

Given that f

reg outperforms f

dist in the experiments of Section 4, and that both rules are computationally hard, f

reg stands out as the clear choice for implementation in our website Robo Vote. We therefore devoted our efforts to developing a scalable implementation for f

reg. The ﬁrst step is to simplify the description of f

reg. Given a ranking σ and an alternative a 2 A, recall that σ(a) denotes the position of a in σ. For a set S A, let σ(S) = mina2S σ(a). For sets S, T A, we say T σ S if σ(T) < σ(S), i.e., if there exists an alternative in T that is preferred in σ to every alternative in S. Using these notations, it is relatively straightforward to prove that

reg,k(~σ) = arg min

1 σi(S). (2)

To better understand this equation, we consider the special case of k = 1. In this case,

reg(~σ) 2 arg min

i2[n]:b σia

Note that this voting rule is very similar to the classical maximin rule: replacing 1/σi(b) with 1 would yield the maximin rule. Thus, in some sense, this is a smooth version of the maximin rule, where the victory of b over a in voter i s vote is weighted by the strength of b in this vote (measured by 1/σi(b)). In our view, this intuitive structure makes f

reg even more compelling.

We now brieﬂy describe six approaches we have developed for computing f

1. Na ıve: This uses Equation (2), and requires (n

)2) operations, which is prohibitive even for small m. 2. Submodular: The regret for set S in choosing set T, i.e., P

i2[n]:S σi T 1/σi(S), is submodular in S. Hence, for each T 2 Ak we can optimize over S 2 Ak using any algorithm for the submodular maximization subject to cardinality constraint (SMCC) problem. We use the SFO toolbox for Matlab [Krause, 2010].

3. Submodular+Greedy: This improves the previous ap-

proach by ﬁrst computing a 1 1/e greedy approximation to the SMCC instance for set T, and pruning T if this is already greater than the best regret found so far.

4. Multi ILP: Instead of using SMCC, for each T 2 Ak

we optimize over S 2 Ak by solving an integer linear program (ILP) with roughly n m variables and n m2 constraints. Note that

such ILPs need to be solved.

5. Multi ILP+Greedy: This improves the Multi ILP approach by using a greedy pruning procedure as before.

6. Single ILP: This approach solves a single but huge ILP

additional constraints.

Figure 5 shows the average running times of these approaches (and 95% conﬁdence intervals) over 10 000 instances with n = 15, k = 3, and m varying from 10 to 50.7 The experiments were performed on a single machine with quad-core 2.9 GHz CPU and 32 GB RAM. A time limit of 2 minutes was set because a running time greater than this would not be helpful for our website, where the results need to be delivered quickly to the users. While the greedy pruning procedure does help reduce the running time of both the Submodular and Multi ILP approaches, Single ILP still computes f

reg much faster than any other approach, solving instances with 50 alternatives in less than 10 seconds. We have therefore implemented Single ILP on Robo Vote.

10 20 30 40 50 0

Naive Submodular Submodular+Greedy Multi ILP Multi ILP+Greedy Single ILP Time Limit

Figure 5: Running times of six approaches to computing f

6 Discussion

We ﬁnd it exciting that new theoretical questions in computational social choice are driven by concrete real-world applications. And while research in the ﬁeld is often motivated by potential applications to multiagent systems, we focus on helping people not software agents make joint decisions.

We also remark that we consider the empirical dominance of f

reg, in terms of both regret and (surprisingly) distortion, to be especially signiﬁcant. It would be interesting to understand, on a theoretical level, why this happens. A promising starting point is to derive analytical bounds on the averagecase distortion of f

reg under uniformly random utility proﬁles.

7The running time scales linearly in n, and increases with

Acknowledgments This work was supported in part by NSF grants CCF1215883, CCF-1525932, and IIS-1350598; and by a Sloan Research Fellowship.

References [Anshelevich and Sekar, 2016] E. Anshelevich and S. Sekar.

Blind, greedy, and random: Algorithms for matching and clustering using only ordinal information. In Proceedings of the 30th AAAI Conference on Artiﬁcial Intelligence (AAAI), 2016. Forthcoming. [Anshelevich et al., 2015] E. Anshelevich, O. Bhardwaj, and

J. Postl. Approximating optimal social choice under metric preferences. In Proceedings of the 29th AAAI Conference on Artiﬁcial Intelligence (AAAI), pages 777 783, 2015. [Arrow, 1951] K. Arrow. Social Choice and Individual Val-

ues. Wiley, 1951. [Birkhoff, 1946] G. Birkhoff. Three observations on linear

algebra. Universidad Nacional de Tucum an, Revista A, 5:147 151, 1946. [Blum and Mansour, 2007] A. Blum and Y. Mansour. Learn-

ing, regret minimization, and equilibria. In N. Nisan, T. Roughgarden, E. Tardos, and V. Vazirani, editors, Algorithmic Game Theory, chapter 4. Cambridge University Press, 2007. [Boutilier et al., 2015] C. Boutilier, I. Caragiannis, S. Haber,

T. Lu, A. D. Procaccia, and O. Sheffet. Optimal social choice functions: A utilitarian view. Artiﬁcial Intelligence, 227:190 213, 2015. [Brandt et al., 2016] F. Brandt, V. Conitzer, U. Endress,

J. Lang, and A. D. Procaccia, editors. Hanbook of Computational Social Choice. Cambridge University Press, 2016. [Bubeck and Cesa-Bianchi, 2012] S. Bubeck and N. Cesa-

Bianchi. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends in Machine Learning, 5(1):1 122, 2012. [Budish et al., 2013] E. Budish, Y.-K. Che, F. Kojima, and

P. Milgrom. Designing random allocation mechanisms: Theory and applications. American Economic Review, 103(2):585 623, 2013. [Campbell and Kelly, 1996] D. E. Campbell and J. S. Kelly.

Arrovian social choice correspondences. International Economic Review, 37(4):803 823, 1996. [Caragiannis and Procaccia, 2011] I. Caragiannis and A. D.

Procaccia. Voting almost maximizes social welfare despite limited communication. Artiﬁcial Intelligence, 175(9 10):1655 1671, 2011. [Chamberlin and Courant, 1983] J. R. Chamberlin and P. N.

Courant. Representative deliberations and representative decisions: Proportional representation and the Borda rule. American Political Science Review, 77(3):718 733, 1983. [Goldberg et al., 2001] K. Y. Goldberg, T. Roeder, D. Gupta,

and C. Perkins. Eigentaste: A constant time collaborative ﬁltering algorithm. Information Retrieval, 4(2):133 151, 2001.

[Goldman and Procaccia, 2014] J. Goldman and A. D. Pro-

caccia. Spliddit: Unleashing fair division algorithms. SIGecom Exchanges, 13(2):41 46, 2014. [Krause, 2010] A. Krause. SFO: A toolbox for submodular

function optimization. Journal of Machine Learning Research, 11:1141 1144, 2010. [Lu and Boutilier, 2011a] T. Lu and C. Boutilier. Budgeted

social choice: From consensus to personalized decision making. In Proceedings of the 22nd International Joint Conference on Artiﬁcial Intelligence (IJCAI), pages 280 286, 2011. [Lu and Boutilier, 2011b] T. Lu and C. Boutilier. Robust ap-

proximation and incremental elicitation in voting protocols. In Proceedings of the 22nd International Joint Conference on Artiﬁcial Intelligence (IJCAI), pages 287 293, 2011. [Mattei and Walsh, 2013] N. Mattei and T. Walsh. Preﬂib:

A library of preference data. In Proceedings of the 3rd International Conference on Algorithmic Decision Theory (ADT), pages 259 270, 2013. [Monroe, 1995] B. L. Monroe. Fully proportional represen-

tation. American Political Science Review, 89(4):925 940, 1995. [Procaccia and Rosenschein, 2006] A. D. Procaccia and J. S.

Rosenschein. The distortion of cardinal preferences in voting. In Proceedings of the 10th International Workshop on Cooperative Information Agents (CIA), pages 317 331, 2006. [Procaccia et al., 2008] A. D. Procaccia, J. S. Rosenschein,

and A. Zohar. On the complexity of achieveing proportional representation. Social Choice and Welfare, 30(3):353 362, 2008. [Procaccia et al., 2012] A. D. Procaccia, S. J. Reddi, and

N. Shah. A maximum likelihood approach for selecting sets of alternatives. In Proceedings of the 28th Annual Conference on Uncertainty in Artiﬁcial Intelligence (UAI), pages 695 704, 2012. [Skowron et al., 2015] P. Skowron, P. Faliszewski, and J. Lang. Finding a collective set of items: From proportional multirepresentation to group recommendation. In Proceedings of the 29th AAAI Conference on Artiﬁcial Intelligence (AAAI), pages 2131 2137, 2015. [von Neumann, 1953] J. von Neumann. A certain zero-sum

two-person game equivalent to the optimal assignment problem. In W. Kuhn and A. W. Tucker, editors, Contributions to the Theory of Games, volume 2, pages 5 12. 1953. [Young, 1988] H. P. Young. Condorcet s theory of voting.

The American Political Science Review, 82(4):1231 1244, 1988.