# samplingbased_belief_revision__13eba587.pdf Sampling-Based Belief Revision Michael Thielscher University of New South Wales, Australia mit@unsw.edu.au Model sampling has proved to be a practically viable method for decision-making under uncertainty, for example in imperfect-information games with large state spaces. In this paper, we examine the logical foundations of sampling-based belief revision. We show that it satisfies six of the standard AGM postulates but not Vacuity nor Subexpansion. We provide a corresponding representation theorem that generalises the standard result from a single to a family of faithful assignments for a given belief set. We also provide a formal axiomatisation of sampling-based belief revision in the Situation Calculus as an alternative way of reasoning about actions, sensing, and beliefs. 1 Introduction Model sampling is a viable approach to automated decisionmaking under uncertainty in a variety of domains; examples include the control of autonomous robots [Fox et al., 1999; Hollinger and Sukhatme, 2014] and computer game playing, where sampling is the method of choice to cope with highly incomplete knowledge of the game state in most imperfectinformation games [Frank and Basin, 1998; Ginsberg, 2001; Richards and Amir, 2012]. In all of these applications, agents base their decisions on a small set of samples from the set of all complete models that are consistent with their observations. When further information is acquired, models that turn out to be inconsistent with the new observations are resampled in order to maintain both consistency as well as a uniform size of the sample set [Frank and Basin, 1998; Edelkamp et al., 2012; Schofield et al., 2012]. From a logical perspective, when agents base their decisions on model sampling then they can be said to believe that the world is fully characterised by the samples. If, for instance, all models randomly sampled by a Bridge-playing agent [Ginsberg, 2001] happen to assume that their partner carries at least one card of each suit, then the agent concludes that the player will have to follow suit on any lead card. However, rational agents operating with beliefs (rather than knowledge) ought to take into account the possibility that their beliefs may be wrong, for example when a subsequent observation implies that a player does in fact not have a card of a specific suit. When this happens, the agent needs to resample inconsistent models [Frank and Basin, 1998], thereby causing him to revise his beliefs in the light of new information. In this paper, we study the formal properties of this kind of belief revision operators that follow from the sampling-based approach to handling incomplete information. The purpose of this study is two-fold: 1. A formal analysis of how model sampling affects the be- liefs of an agent provides for a deeper understanding of the formal properties of an approach that has proved successful in a range of applications including robot control and computer game playing. 2. A formalisation of model sampling as a general princi- ple for belief revision enriches the suite of traditional operators based on the AGM postulates [Alchourr on et al., 1985], especially since it promises to be a viable method for applications with large state spaces for which classical Belief Revision operators are not practical. Our investigation into the logical foundations for samplingbased belief revision will show that it satisfies six of the eight standard AGM postulates but not Vacuity nor Subexpansion. The intuitive reason is that when some but not all sampled models are consistent with new observation φ, then those that contradict φ get resampled. These new models may then invalidate previously held beliefs despite the fact that the presence of other samples that are consistent with φ imply that the new observation is consistent with the agent s original beliefs. As a simple example, consider a situation in an unspecified card game where there are the 6 outstanding cards |2, 2, |3, ~3, }3, }4, and suppose that an agent has randomly generated the following two samples for the belief about the remaining hand (consisting of two cards) of another player: K = {{|3, }3}, {|2, ~3}}. Suppose further that the player is observed not to follow suit when hearts is played. This observation, which we shall write φ = ~, renders the second model inconsistent (but not the first). If we assume that resampling this model results in the new sample set K φ = {{|3, }3}, { 2, }4}}, then the following can be said about the beliefs of the agent before and after observing φ: 1. In K, the agent believes that the player has a |-card. 2. K is consistent with φ = ~. Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16) (AGM-1) K φ = Cn(K φ) (Closure) (AGM-2) φ 2 K φ (Success) (AGM-3) K φ K + φ (Inclusion) (AGM-4) If φ 62 K then K φ K + φ (Vacuity) (AGM-5) If φ is consistent, then K φ is consistent (Consistency) (AGM-6) If φ , then K φ = K (Extensionality) (AGM-7) K (φ ) (K φ) + (Superexpansion) (AGM-8) If 62 (K φ), then K (φ ) (K φ) + (Subexpansion) Figure 1: The eight AGM postulates. 3. In K φ, the agent no longer believes that the player has These three properties together violate the Vacuity postulate, which satipulates that if φ is consistent with K, then every belief in K should also be a belief in K φ. We will show that for a similar reason, the Subexpansion postulate is not satisfied either. We will provide a representation theorem for belief revision operators that satisfy all but these two AGM postulates. The result will be a generalisation of the classical representation theorem [G ardenfors and Makinson, 1988] from single to a family of faithful assignments for a given belief set. The remainder of the paper is organised as follows. In the next section, we recall the necessary notions and notations for belief revision and logical reasoning about actions. In the section that follows, we define and analyse model sampling for belief revision in the context of the AGM framework. Thereafter, we state and prove the representation result. Finally, we apply our general concept of model sampling for belief revision to develop a variant of the classical Situation Calculus for reasoning about actions, sensing and beliefs based on (re-)sampling. 2 Background We will formulate and analyse sampling-based belief revision in the context of the classical AGM-based approach and assuming a logic language L generated from a finite set P of propositions. By Cn(φ) we denote all classical logical consequences of a sentence, or a set of sentences, φ. K + φ stands for Cn(K [ {φ}). As usual, a propositional interpretation (world) is a mapping from P to {>, ?}. The set of all interpretations is denoted by W. If an interpretation w truthfunctionally maps a sentence, or a set of sentences, φ to >, then w is called a model of φ (denoted by w |= φ). Mods(φ) denotes the set of all models of φ, and the latter is consistent iff Mods(φ) 6= ;. Formulas, or sets of formulas, φ and are equivalent, written φ , iff Mods(φ) = Mods( ). AGM Postulates Let the beliefs of an agent be represented by a set of sentences K in L. Any new evidence is a sentence φ in L, and the result of revising K with φ, denoted by K φ, is also a belief set over L. Alchourr on et al. [1985] put forward eight postulates for belief revision operators, which are given in Figure 1. Readers are referred to G ardenfors and Makinson [1988] for a detailed discussion of the motivation and interpretation of these postulates. A Representation Theorem The classical representation theorem [G ardenfors and Makinson, 1988] characterises belief revision operators that satisfy the AGM postulates with the help of preference relations over worlds (i.e. models). A total pre-order (possibly indexed) is a reflexive, transitive binary relation such that either β or β holds for any , β. The strict part of is denoted by <, that is, < β iff β and β 6 . As usual, = β abbreviates β and β . Given any total pre-order over a set S, by min(S, ) we denote the set { 2 S : β for all β 2 S}. Definition 1 A function that maps each belief set K to a total pre-order K on W is called a faithful assignment if it satisfies the following conditions. If w1, w2 |= K, then w1 =K w2. If w1 |= K and w2 6|= K, then w1 0. Consider an arbitrary but fixed order w1 0 and / the operation of resampling wrt. any given order for generating samples. The corresponding belief revision operator defined by K|S φ = K|S/φ (8) satisfies (AGM-1) (AGM-3) and (AGM-5) (AGM-7). Proof: We prove each postulate in turn. (AGM-1) follows by Definition 2, equation (6). (AGM-2) holds by Definition 3, equation (7), which im- plies that S /φ Mods(φ), hence φ 2 K|S/φ according to equation (6). For (AGM-3) we observe that if 2 K|S φ then w |= for all w 2 S / φ according to (8), hence w |= for all w 2 S \ Mods(φ) according to (7). It follows that 2 K|S + φ according to (6). (AGM-5) holds since Mods(φ) 6= ; implies S / φ 6= ; according to (7); hence, if φ is consistent then by (8), K|S φ is consistent too. 3If |Mods(φ)| = 0 then S = ;, i.e., no consistent sample exists. (AGM-6) follows from Definition 3, equation (7), since φ implies Mods(φ) = Mods( ), hence S / φ = S / , which implies K|S φ = K|S by (8). For (AGM-7), recall Definition 3, equation (7), and con- sider any sample w 2 S / φ that satisfies not only φ but also . By Definition 3 and because the same sampling strategy