
Editing Boolean Classifiers: A Belief Change Perspective

Nicolas Schwind1, Katsumi Inoue2,3, Pierre Marquis4,5

1 National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
2 National Institute of Informatics, Tokyo, Japan
3 The Graduate University for Advanced Studies, SOKENDAI, Tokyo, Japan
4 Univ. Artois, CNRS, CRIL, F-62300 Lens, France
5 Institut Universitaire de France
nicolas-schwind@aist.go.jp, inoue@nii.ac.jp, marquis@cril.fr

This paper is about editing Boolean classifiers, i.e., determining how a Boolean classifier should be modified when new pieces of evidence must be incorporated. Our main goal is to delineate the rational ways of making such edits. To this end, we introduce a number of rationality postulates inspired by those considered so far for belief revision. We give a representation theorem and present some families of edit operators satisfying the postulates.

Introduction

Alice, a bank employee, receives Bob, a customer who wants to obtain a loan. Bob has a low income, but no debts. His record shows that he had already requested a loan in the past, and had fully reimbursed it. The bank management has recently provided Alice with an AI algorithm (a pre-trained predictor) to help her decide the outcome of each loan application. Alice is asked to use this algorithm, which recommends against granting Bob the requested loan because he does not own his principal residence. However, Alice is experienced and remembers two customers, Cindy and Dan, with a profile similar to Bob's, who both had previously been granted a loan without any issue. Hence, Alice's expertise led her not to follow the recommendation of the AI algorithm and to grant Bob the requested loan. But Alice would like to do more to prevent the same problem from arising again with future clients having similar profiles. She wonders what could be done to this end.

The research question tackled in this paper is relevant to Alice's concern. We focus on Boolean classifiers $\varphi$: given an instance represented as a world, i.e., a truth assignment of all the variables of interest, $\varphi$ classifies the instance as positive when it is a model of $\varphi$, and as negative when it is a countermodel of $\varphi$. The concept associated with $\varphi$ is the set of all positive instances. Our purpose is to determine how a Boolean classifier $\varphi$ that has already been learned should be modified when new pieces of positive evidence / negative evidence $\mu$ (that may conflict with predictions of the classifier) are considered. We call such change operations on Boolean classifiers positive edits / negative edits (respectively), and we denote them by $+$ and $-$.

Copyright 2023, Association for the Advancement of Artiﬁcial Intelligence (www.aaai.org). All rights reserved.

We assume that both inputs $\varphi$ and $\mu$ are represented as propositional formulae. Doing so, $\varphi$ identifies both the classifier under consideration and, through its set of models, the associated concept. On the other hand, the models of $\mu$ represent new positive (resp. negative) pieces of evidence in the case of a positive (resp. negative) edit. This representation choice allows one to deal with a number of existing ML classifiers: from an eXplainable AI perspective, many works have shown recently how ML classifiers $C$ of various types can be associated with Boolean circuits $\varphi_C$ exhibiting the same input-output behaviours (i.e., the predictions made using $C$ are precisely the same as those made using $\varphi_C$). The ML models concerned include not only decision trees (Izza, Ignatiev, and Marques-Silva 2020; Audemard et al. 2021) and decision lists (Ignatiev and Silva 2021), but also a number of ML models that are usually considered as less interpretable, like random forests (Audemard, Koriche, and Marquis 2020; Izza and Marques-Silva 2021), gradient boosted trees (Ignatiev 2020), some Bayes nets (Shih, Choi, and Darwiche 2018, 2019), and binary neural networks (Narodytska et al. 2018; Shi et al. 2020). Accordingly, classifiers $C$ from those families can be taken into account in our framework, using $\varphi_C$ as a representation of $C$ since the two are prediction-equivalent.

Edit operations are connected to incremental concept learners, like Mitchell's Candidate Elimination Algorithm (Mitchell 1977), Schlimmer and Granger's STAGGER (Schlimmer and Fisher 1986), Fisher's COBWEB (Fisher 1987), and Gallant's Pocket Algorithm (Gallant 1988). Such systems, also referred to as on-line learning systems, are suited to learning scenarios where a whole training set is not available a priori but examples arrive over time. Borrowing the criteria used in (Maloof and Michalski 2000) to draw a typology of such systems, edit operations characterize on-line learning systems with (full) concept memory (the role played by the classifier $\varphi$), temporal batch (the set of models of $\mu$ is a new set of examples in the case of a positive edit, and a new set of counter-examples in the case of a negative edit), and no instance memory (the examples and counter-examples used to induce $\varphi$ are not stored). However, previous work about incremental concept learners was typically centered on aspects that are not considered in this paper. These included the design of a number of on-line learners (based on specific concept representations, e.g., decision trees or decision rules), the evaluation of their empirical accuracy but also of their run-time efficiency (this can be a critical aspect since items in a data stream can be received at such a high rate that real-time guarantees are required to handle all of them (Domingos and Hulten 2000)), and finally the choice of examples that must be kept at each learning step (Maloof and Michalski 2004).

In contrast, our work focuses on an axiomatic approach. We do not consider any specific concept representation, and do not make any assumption about how the batch of new examples or counter-examples is represented. We nevertheless suppose that the new piece of evidence $\mu$ that triggers the edit operation of $\varphi$ is certain, i.e., not pervaded by any noise. Thus, stepping back to the loan scenario, Alice is sure that Bob should be granted the loan. Here, our main goal is to delineate the rational ways of making such edits, by means of a number of rationality postulates. To determine such postulates, we look back at the core principles of belief revision, which aims to incorporate, in a rational way, a new piece of information into the belief set of an agent (Alchourrón, Gärdenfors, and Makinson 1985; Alchourrón and Makinson 1985; Gärdenfors 1988). The AGM postulates (for Alchourrón, Gärdenfors and Makinson 1985) aim to formalize a set of rationality conditions based on three main principles: primacy of update (the new information must be believed after the change), consistency (the resulting belief set must be kept consistent when the new information is consistent), and minimal change (if simply adding the new information to the belief set raises no conflict, then nothing else should be added or removed).

Adapting the postulates of belief revision to edit is not a trivial task. Provided that the beliefs of an agent $\varphi$ and the new information $\mu$ are represented by propositional formulae, when the conjunction of $\varphi$ and $\mu$ is consistent, the revision of $\varphi$ by $\mu$ corresponds to that conjunction (Katsuno and Mendelzon 1991). However, when a Boolean classifier $\varphi$ and a set of incoming examples $\mu$ are represented by two propositional formulae, one cannot reasonably require the edited classifier to be the conjunction of $\varphi$ and $\mu$ whenever it is consistent: this process would unconditionally remove positive instances not explicitly questioned by $\mu$, while also failing to incorporate the examples from $\mu$ previously classified negatively by $\varphi$. Edit differs from belief revision in that the objects under consideration, although all represented by propositional formulae, are of a different nature. An agent's beliefs (represented by $\varphi$) correspond to a set of possible worlds to which the one actual true world is believed to belong, while in edit, it makes perfect sense for several instances to all be members of the concept represented by a Boolean classifier. Likewise, every Boolean classifier $\varphi$ is essentially "consistent": when $\varphi$ has no model, it simply represents the empty concept. This also explains why the consistency principle is irrelevant to an edit operation. Nevertheless, the primacy of update and minimality of change principles can be adapted to the edit context. For this purpose, after some formal preliminaries, we introduce the edit postulates in the context of positive edit first (incorporating a batch of positive instances into a Boolean classifier).

The Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23)

We also give a representation theorem and present some examples of positive edit operators. Then, we show how these postulates can be adapted to the case of a negative edit (i.e., when the arriving batch is interpreted as a set of negative instances) and make precise how a correspondence between the two operations can be formalized through a duality result. We then consider the case of a full edit, where both positive and negative instances can be considered in the same batch. Lastly, related work is discussed just before the conclusion. The proofs of propositions are available online.1

Formal Preliminaries

We consider a propositional language $\mathcal{L}_{PS}$ built from a finite set $PS$ of variables and the standard connectives. A world is a truth assignment of all variables from $PS$. The set of all worlds is denoted by $\Omega$, and the set of models of a propositional formula $\varphi \in \mathcal{L}_{PS}$ (i.e., the set of worlds that make $\varphi$ true) is denoted by $[\varphi]$. Given two formulae $\alpha, \beta$, we write $\alpha \models \beta$ whenever $[\alpha] \subseteq [\beta]$ and $\alpha \equiv \beta$ when $[\alpha] = [\beta]$. Belief revision aims to incorporate into the beliefs of an agent (a formula $\varphi$) a new piece of information (a formula $\mu$). Thus a revision operator $\circ$ associates formulae $\varphi, \mu$ with a revised formula $\varphi \circ \mu$, and is expected to satisfy a set of rationality postulates:2

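Definition 1 (KM revision operator). A revision operator $\circ$ is said to be a KM revision operator if it satisfies the following postulates:

(R1) $\varphi \circ \mu \models \mu$
(R2) If $[\varphi \wedge \mu] \neq \emptyset$, then $\varphi \circ \mu \equiv \varphi \wedge \mu$
(R3) If $[\mu] \neq \emptyset$, then $[\varphi \circ \mu] \neq \emptyset$
(R4) If $\varphi \equiv \varphi'$ and $\mu \equiv \mu'$, then $\varphi \circ \mu \equiv \varphi' \circ \mu'$
(R5) $(\varphi \circ \mu) \wedge \mu' \models \varphi \circ (\mu \wedge \mu')$
(R6) If $[(\varphi \circ \mu) \wedge \mu'] \neq \emptyset$, then $\varphi \circ (\mu \wedge \mu') \models (\varphi \circ \mu) \wedge \mu'$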

(R1) is the success postulate; it relates to the primacy of update principle: the new information must be believed after revision. (R3) is the consistency postulate. (R4) is the syntax-irrelevance postulate. Finally, (R2), (R5) and (R6) express the minimality of change conditions. We refer the reader to (Alchourrón, Gärdenfors, and Makinson 1985; Katsuno and Mendelzon 1991) for a deeper discussion of the rationale behind these postulates.

Positive Edit

We now intend to define a change operation $+$ that consists in editing an (already learned) Boolean classifier $\varphi$ according to new information $\mu$. We assume that $\varphi$ is represented by a propositional formula. In this context, each world represents an instance, and a world $\omega$ is a model of $\varphi$ if and only if it is a positive instance of the concept represented by $\varphi$ (so each instance is classified as either positive or negative by $\varphi$). The new information $\mu$ is called a positive dataset and is also represented by a propositional formula. The set of models of $\mu$ represents a batch of arriving positive instances, also called positive examples (i.e., when referring to the models of $\mu$). We do not make any further assumption on the way $\varphi$ and $\mu$ are represented (e.g., $\varphi$ could be a decision tree and $\mu$ a DNF formula, but it does not have to be the case).

1 https://nicolas-schwind.github.io/SIM-AAAI23-proofs.pdf
2 We give here the KM postulates (Katsuno and Mendelzon 1991), which are the translation of the AGM postulates in finite propositional logic.

Example 1. Let us formalize the scenario provided in the introduction. We set $PS = \{p, q, r, s\}$ where $p$ means that the applicant "has a high income", $q$ stands for "owns her principal residence", $r$ means "has no debts", and $s$ means "has reimbursed a previous loan". We assume that $\varphi = p \wedge q \wedge r$, i.e., the predictor recommends granting a loan precisely to those residence owners having a high income and no debts. Worlds are written as bit strings giving the values of $p, q, r, s$ in this order. Then, let Bob have the profile $\omega_1 = 0011$, i.e., he has a low income and does not own his residence, but has no debts and has reimbursed a previous loan; and let Cindy and Dan be identified with the same profile $\omega_2 = 0101$. The positive dataset $\mu$ is then defined as any propositional formula such that $[\mu] = \{\omega_1, \omega_2\}$, e.g., $\mu = \neg p \wedge s \wedge (q \oplus r)$.

An edit operator $+$ associates every Boolean classifier $\varphi$ and every positive dataset $\mu$ with an edited Boolean classifier $\varphi + \mu$. Our key assumption is that the new piece of evidence $\mu$ that triggers the edit operation of $\varphi$ is provided by a domain expert: it is therefore certain, i.e., not pervaded by any noise. This can be ensured in a number of scenarios (thus, stepping back to the example given in the introduction, Alice is sure that applicants with the same profiles as Bob, Cindy and Dan should be granted the loan). We are ready to introduce our postulates for positive edit:

Definition 2 (Positive Edit operator). An operator $+$ is said to be a positive edit operator (PE operator for short) if it satisfies the following postulates:

(P1) $\mu \models \varphi + \mu$
(P2) If $\mu \models \varphi$, then $\varphi + \mu \equiv \varphi$
(P3) If $\varphi_1 \equiv \varphi_2$ and $\mu_1 \equiv \mu_2$, then $\varphi_1 + \mu_1 \equiv \varphi_2 + \mu_2$
(P4) If $\psi \models \varphi + \mu$, then $\varphi + \mu \equiv \varphi + (\mu \vee \psi)$

(P1) relates to the primacy of update principle. Since the incoming positive dataset $\mu$ is assumed to be certain, (P1) requires the edited classifier to comply with $\mu$, i.e., to correctly classify all examples from $\mu$ as positive instances. This can be viewed as the counterpart of (R1) in belief revision. (P2) is a minimality of change postulate: if the initial classifier already complies with $\mu$, then there is no need to change it. It is reminiscent of (R2) in belief revision, but (P2) and (R2) differ in their premise. Indeed, (P2) does not say anything when $\mu \not\models \varphi$ and $[\varphi \wedge \mu] \neq \emptyset$: if $\varphi$ does not comply with $\mu$ (i.e., some positive examples from $\mu$ were previously classified as negative instances by $\varphi$), then it makes perfect sense to question the concept membership of any instance $\omega \notin [\mu]$. Note that when $\mu \models \varphi$, the conclusion of (P2) can equivalently be stated as $\varphi + \mu \equiv \varphi \vee \mu$, from which the similarity with (R2) is clearer: $\mu$ is simply added to $\varphi$, which results in not changing $\varphi$ at all. (P3) is the syntax-independence postulate, which is the direct counterpart of (R4). (P4) is another minimality of change postulate. Its counterparts in belief revision are (R5) and (R6), which together express that if $\varphi$ revised by a first piece of information $\mu_1$ is consistent with another piece of information $\mu_2$, then revising $\varphi$ by both pieces of information taken together (i.e., by $\mu_1 \wedge \mu_2$) boils down to adding $\mu_2$ to the revision of $\varphi$ by $\mu_1$. Likewise, in our setting, (P4) says that if the edit of a classifier $\varphi$ by a first positive dataset $\mu$ complies with another positive dataset $\psi$, then its edit by the two batches taken together (i.e., by $\mu \vee \psi$) boils down to adding $\psi$ to the edit of $\varphi$ by $\mu$: indeed, if $\psi \models \varphi + \mu$, then $\varphi + \mu \equiv (\varphi + \mu) \vee \psi$ and thus the conclusion of (P4) can equivalently be written as $(\varphi + \mu) \vee \psi \equiv \varphi + (\mu \vee \psi)$.

Please note that (R3), the consistency postulate in belief revision, is the only postulate with no counterpart in our setting, since every Boolean classifier is essentially "consistent": if $[\varphi] = \emptyset$, then $\varphi$ characterizes an empty concept.

Notably, the edit postulates (P1-P4) are also reminiscent of properties that can be sought for incremental concept learners in the absence of noise. Thus, (P1) states that once the edit operation has been performed, the resulting concept $\varphi + \mu$ must be consistent (in the sense of (Mitchell 1982)) with the new examples given by $\mu$, which precisely means that those examples must be positive instances of $\varphi + \mu$. (P2) requires not changing the concept $\varphi$ when it correctly classifies the new examples given by $\mu$. This condition is achieved, for instance, by the perceptron update rule (Rosenblatt 1958). (P3) requires syntax not to play any role in the on-line learning process, which makes sense if the specific representation $\varphi$ of the concept at hand is irrelevant (this is one of our starting assumptions). Finally, provided that (P2) holds, (P4) can be viewed as a relaxation of an order-independence condition that is satisfied by some on-line learners. This last property roughly states that although the new examples arrive over time, once the whole input sequence has been processed, the classifier has been transformed in the same way as if all pieces of evidence had been available as a whole. Such an order-independence condition is ensured by ID5 (Utgoff 1989), which has been shown to compute the same decision tree as the one that would be generated by ID3 (Quinlan 1986) if the whole set of examples were available at the start. Formally, in our setting, the order-independence condition can be stated as $(\varphi + \mu) + \psi \equiv \varphi + (\mu \vee \psi)$. Note that this condition is quite demanding and not satisfied by every on-line learner (e.g., the perceptron update rule may easily question the way an instance $\mu$ has been classified when editing further the linear classifier by taking a new instance $\psi$ into account). Accordingly, we focused on a weaker condition. It is easy to show that (P4) is a logical consequence of the order-independence condition when (P2) holds: if $\psi \models \varphi + \mu$, then (P2) gives $(\varphi + \mu) + \psi \equiv \varphi + \mu$, and order-independence then yields $\varphi + \mu \equiv \varphi + (\mu \vee \psi)$. At this point, one can already identify a few simple operators from the class:

Definition 3 (Some PE operators). The trivial, basic, and drastic operators, respectively denoted by $+_T$, $+_B$, and $+_D$, are defined for each classifier $\varphi$ and each positive dataset $\mu$ as:

$\varphi +_T \mu = \varphi$ if $\mu \models \varphi$, otherwise $\varphi +_T \mu = \top$
$\varphi +_B \mu = \varphi \vee \mu$
$\varphi +_D \mu = \varphi$ if $\mu \models \varphi$, otherwise $\varphi +_D \mu = \mu$

The trivial operator $+_T$ just requires a classifier to classify all worlds as positive instances as soon as it does not initially comply with the input positive dataset. The basic operator $+_B$ is simply defined as disjunction: it adds as positive instances all the examples provided by $\mu$ which were not already classified as positive. The drastic operator $+_D$ leaves the classifier $\varphi$ unchanged if it is already compliant with $\mu$. Otherwise, similarly to the trivial operator, it forgets everything, but classifies as positive instances precisely the ones explicitly provided as examples by $\mu$. This can be viewed as the PE counterpart of the drastic revision operator $\circ_D$ defined as $\varphi \circ_D \mu = \varphi \wedge \mu$ if $[\varphi \wedge \mu] \neq \emptyset$, otherwise $\varphi \circ_D \mu = \mu$. It is quite easy to check that these operators satisfy (P1-P4) (the proof is direct):

Proposition 1. $+_T$, $+_B$ and $+_D$ are PE operators.
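As a sanity check of this claim on a small case, the sketch below represents a classifier semantically, by its set of models, over a two-variable vocabulary and tests (P1), (P2) and (P4) exhaustively. This is an illustration only and not the authors' proof. The function names and the choice of two variables are assumptions made here, and (P3) holds by construction because operators act on model sets rather than on syntax.

```python
from itertools import combinations, product

# Assumption: a tiny vocabulary (2 variables, 4 worlds) so that the
# exhaustive check over all classifiers and datasets stays cheap.
WORLDS = frozenset(product((0, 1), repeat=2))

def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

# The three operators of Definition 3, acting on sets of models.
def trivial(phi, mu):
    return phi if mu <= phi else WORLDS

def basic(phi, mu):
    return phi | mu

def drastic(phi, mu):
    return phi if mu <= phi else mu

def satisfies_pe(op):
    """Exhaustively test (P1), (P2) and (P4) for op over WORLDS."""
    sets = subsets(WORLDS)
    for phi, mu in product(sets, repeat=2):
        edited = op(phi, mu)
        if not mu <= edited:                 # (P1): mu entails phi + mu
            return False
        if mu <= phi and edited != phi:      # (P2): no change if compliant
            return False
        for psi in sets:                     # (P4): absorbing compliant psi
            if psi <= edited and op(phi, mu | psi) != edited:
                return False
    return True

if __name__ == "__main__":
    for name, op in [("trivial", trivial), ("basic", basic), ("drastic", drastic)]:
        print(name, satisfies_pe(op))
    # Conjunction, the natural candidate from belief revision, is rejected.
    print("conjunction", satisfies_pe(lambda phi, mu: phi & mu))
```

The last line illustrates the remark made in the introduction: plain conjunction fails (P1) as soon as some example of the batch was previously classified as negative.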

A Representation Theorem. Let us now show how PE operators can be characterized in terms of so-called positive assignments:

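Definition 4 (Positive assignment). A positive assignment is a mapping associating every classifier $\varphi$ with a mapping $f_\varphi : \mathcal{P}(\Omega) \to \mathcal{P}(\Omega)$, such that for all classifiers $\varphi, \varphi'$ and all subsets of worlds $W, W' \in \mathcal{P}(\Omega)$, the following properties are satisfied:

1. $W \subseteq f_\varphi(W)$
2. If $W \subseteq [\varphi]$, then $f_\varphi(W) = [\varphi]$
3. If $\varphi \equiv \varphi'$, then $f_\varphi = f_{\varphi'}$
4. If $W \subseteq W'$ and $W' \subseteq f_\varphi(W)$, then $f_\varphi(W) = f_\varphi(W')$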

Proposition 2. An operator $+$ is a PE operator if and only if there is a positive assignment $\varphi \mapsto f_\varphi$ such that for each classifier $\varphi$ and each positive dataset $\mu$, $[\varphi + \mu] = f_\varphi([\mu])$.

This is a strong representation result, in the sense that different positive assignments define different PE operators. Interestingly, every mapping $f_\varphi$ satisfies the property of idempotence:

Proposition 3. For each positive assignment $\varphi \mapsto f_\varphi$ and each $W \subseteq \Omega$, we have that $f_\varphi(f_\varphi(W)) = f_\varphi(W)$.
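One way to see why idempotence holds is to instantiate condition 4 of Definition 4 with $W' = f_\varphi(W)$. The derivation below is a short sketch written for this purpose; the authors' own proof is in the online appendix and may proceed differently. It only uses condition 1 ($W \subseteq f_\varphi(W)$) and condition 4 (if $W \subseteq W'$ and $W' \subseteq f_\varphi(W)$, then $f_\varphi(W) = f_\varphi(W')$).

```latex
\begin{align*}
  W &\subseteq f_\varphi(W)
    && \text{(condition 1, so } W \subseteq W' \text{ for } W' = f_\varphi(W)\text{)} \\
  f_\varphi(W) &\subseteq f_\varphi(W)
    && \text{(trivially, so } W' \subseteq f_\varphi(W)\text{)} \\
  f_\varphi(W) &= f_\varphi\bigl(f_\varphi(W)\bigr)
    && \text{(condition 4 applied to } W \text{ and } W' = f_\varphi(W)\text{)}
\end{align*}
```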

Accordingly, a consequence of the PE postulates is that $(\varphi + \mu) + \mu \equiv \varphi + \mu$. This idempotence property reflects a very simple form of minimal change and is standard in belief change: it is satisfied by belief revision operators but also by other forms of change operations, e.g., contraction (Caridroit, Konieczny, and Marquis 2017). Notably, condition 4 corresponds to the condition of Irrelevance of Rejected Contracts (IRC) in matching theory (Hatfield and Milgrom 2005). In that context, this property requires the removal of rejected contracts not to affect a choice set, and is a necessary condition to guarantee the existence of stable allocations (Aygün and Sönmez 2013), without implying rationalizability3 (Yang 2020).

Distance-based PE operators. We now introduce two classes of operators, called dilation operators and min-generalization operators. These operators are parameterized by a distance between worlds, i.e., a mapping $d : \Omega \times \Omega \to \mathbb{N}$ such that $d(\omega, \omega') = 0$ if and only if $\omega = \omega'$, and that satisfies the triangular inequality property, i.e., $d(\omega, \omega'') \leq d(\omega, \omega') + d(\omega', \omega'')$, for all worlds $\omega, \omega', \omega''$. Let us start with dilation operators, whose definition is inspired by the notion of formula dilation from (Bloch and Lang 2002; Dalal 1988). Given a classifier $\varphi$ such that $[\varphi] \neq \emptyset$, and an integer $k$, the $k$-dilation of $\varphi$ w.r.t. $d$, denoted by $D^d_\varphi(k)$, is defined by $D^d_\varphi(k) = \{\omega \in \Omega \mid d(\omega, \varphi) \leq k\}$, where $d(\omega, \varphi) = \min\{d(\omega, \omega') \mid \omega' \models \varphi\}$.

3 A choice function $\sigma : \mathcal{P}(E) \to \mathcal{P}(E)$, i.e., a mapping such that $\sigma(S) \subseteq S$, is rationalizable if it can be characterized in terms of a preference relation over elements from $E$.

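Definition 5 (Dilation operator). The dilation operator $+_{dil,d}$ induced by $d$ is defined for each classifier $\varphi$ and each positive dataset $\mu$ by $[\varphi +_{dil,d} \mu] = [\mu]$ if $[\varphi] = \emptyset$, otherwise $[\varphi +_{dil,d} \mu] = \arg\min_k(\{D^d_\varphi(k) \mid [\mu] \subseteq D^d_\varphi(k)\})$, that is, the $k$-dilation of $\varphi$ for the least $k$ such that $[\mu] \subseteq D^d_\varphi(k)$.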
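The definition can be sketched on the loan scenario of Example 1, taking the Hamming distance (the number of variables on which two worlds differ) as an assumed choice of $d$. Worlds are tuples giving the values of $p, q, r, s$; all function names are illustrative and not taken from the paper. The least suitable $k$ is computed as the largest distance from a model of $\mu$ to $\varphi$.

```python
from itertools import product

# Worlds are tuples (p, q, r, s) of 0/1 values, as in the loan example.
WORLDS = list(product((0, 1), repeat=4))

def models(formula):
    """Set of worlds satisfying a Python predicate over (p, q, r, s)."""
    return frozenset(w for w in WORLDS if formula(*w))

def hamming(w1, w2):
    """Number of variables on which two worlds differ."""
    return sum(a != b for a, b in zip(w1, w2))

def dist_to(world, phi):
    """Distance from a world to the closest model of a non-empty classifier."""
    return min(hamming(world, m) for m in phi)

def dilate(phi, k):
    """k-dilation: all worlds within distance k of some model of phi."""
    return frozenset(w for w in WORLDS if dist_to(w, phi) <= k)

def dilation_edit(phi, mu):
    """Smallest dilation of phi covering mu (mu itself when phi is empty)."""
    if not phi:
        return frozenset(mu)
    # The least k with [mu] inside the k-dilation is the largest
    # distance from an example of mu to phi (0 if mu has no model).
    k = max((dist_to(w, phi) for w in mu), default=0)
    return dilate(phi, k)

phi = models(lambda p, q, r, s: p and q and r)
mu = frozenset({(0, 0, 1, 1), (0, 1, 0, 1)})   # Bob; Cindy and Dan
edited = dilation_edit(phi, mu)

if __name__ == "__main__":
    print(sorted(edited))
    print(edited == models(lambda p, q, r, s: p or q or r))
```

On these inputs both examples are at distance 2 from $\varphi$, so the edit is the 2-dilation of $\varphi$, whose models are exactly those of $p \vee q \vee r$.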

A number of dilation operators can be defined depending on the choice of $d$. For instance, consider the Hamming distance between worlds, denoted by $d_H$, defined for all worlds $\omega, \omega' \in \Omega$ as $d_H(\omega, \omega') = |\{x \in PS \mid \omega(x) \neq \omega'(x)\}|$ (Dalal 1988). Then the Hamming-based dilation operator $+_{dil,d_H}$ consists in $k$-dilating $\varphi$ w.r.t. $d_H$ where $k$ is the least integer for which the resulting set of models includes every model of $\mu$ (see Example 1 below). Let us now introduce the class of min-generalization operators. Given a distance $d$ and a world $\omega$, let $\leq^d_\omega$ be the total preorder over worlds induced by $\omega$ and $d$, defined by $\omega' \leq^d_\omega \omega''$ iff $d(\omega', \omega) \leq d(\omega'', \omega)$. Given a classifier $\varphi$ such that $[\varphi] \neq \emptyset$, the set $\min([\varphi], \leq^d_\omega)$ denotes the set of models of $\varphi$ whose distance to $\omega$ is minimal among all models of $\varphi$, i.e., $\min([\varphi], \leq^d_\omega) = \{\omega' \in [\varphi] \mid \forall \omega'' \in [\varphi], d(\omega', \omega) \leq d(\omega'', \omega)\}$.

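Definition 6 (Min-generalization operator). The min-generalization operator $+_{gen,d}$ induced by $d$ is defined for each classifier $\varphi$ and each positive dataset $\mu$ by $[\varphi +_{gen,d} \mu] = [\mu]$ if $[\varphi] = \emptyset$, otherwise $[\varphi +_{gen,d} \mu] = \{\omega \in \Omega \mid \exists \omega', \omega'' \in \Omega, \omega' \models \mu, \omega'' \in \min([\varphi], \leq^d_{\omega'}), d(\omega, \omega') + d(\omega, \omega'') \leq d(\omega', \omega'')\}$.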

The min-generalization operator consists in considering as positive instances for $\varphi +_{gen,d} \mu$ every world $\omega$ that is in-between (w.r.t. $d$) a model $\omega'$ of $\mu$ and a model $\omega''$ of $\varphi$ that is among the closest ones (w.r.t. $d$) to $\omega'$. When $d = d_H$, the min-generalization operator can be characterized using the most specific generalization (msg) of the worlds involved.4 Let $msg(\omega, \omega')$ be the term

$$msg(\omega, \omega') = \bigwedge_{x \in PS \mid \omega(x) = \omega'(x) = 1} x \;\wedge \bigwedge_{x \in PS \mid \omega(x) = \omega'(x) = 0} \neg x.$$

Then one can check that $\varphi +_{gen,d_H} \mu \equiv \bigvee \{msg(\omega, \omega') \mid \omega \in [\mu], \omega' \in \min([\varphi], \leq^{d_H}_\omega)\}$. All the operators from these classes satisfy (P1-P4):

Proposition 4. For every distance $d$, the operators $+_{dil,d}$ and $+_{gen,d}$ are PE operators.
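The set-builder expression of the min-generalization operator can be sketched in the same style, again assuming the Hamming distance and the loan data of Example 1, with illustrative function names. The code follows the expression literally: for each example and each closest model of $\varphi$, it collects the worlds lying in-between the two. It is only exercised here on the loan data, where $\mu \not\models \varphi$, and it does not attempt to treat the case $\mu \models \varphi$ separately.

```python
from itertools import product

# Worlds are tuples (p, q, r, s) of 0/1 values, as in the loan example.
WORLDS = list(product((0, 1), repeat=4))

def models(formula):
    """Set of worlds satisfying a Python predicate over (p, q, r, s)."""
    return frozenset(w for w in WORLDS if formula(*w))

def hamming(w1, w2):
    """Number of variables on which two worlds differ."""
    return sum(a != b for a, b in zip(w1, w2))

def closest_models(phi, world):
    """Models of phi at minimal Hamming distance from the given world."""
    best = min(hamming(m, world) for m in phi)
    return [m for m in phi if hamming(m, world) == best]

def min_generalization_edit(phi, mu):
    """Worlds in-between an example of mu and one of its closest models of phi."""
    if not phi:
        return frozenset(mu)
    result = set()
    for example in mu:
        for anchor in closest_models(phi, example):
            gap = hamming(example, anchor)
            # In-between condition: d(w, example) + d(w, anchor) <= d(example, anchor).
            result.update(w for w in WORLDS
                          if hamming(w, example) + hamming(w, anchor) <= gap)
    return frozenset(result)

phi = models(lambda p, q, r, s: p and q and r)
mu = frozenset({(0, 0, 1, 1), (0, 1, 0, 1)})   # Bob; Cindy and Dan
edited = min_generalization_edit(phi, mu)

if __name__ == "__main__":
    print(sorted(edited))
    print(edited == models(lambda p, q, r, s: s and (q or r)))
```

For the Hamming distance, the worlds in-between two worlds are exactly those that agree with both wherever the two agree, which is why each inner loop collects the models of the corresponding msg term. On the loan data the result has the models of $s \wedge (q \vee r)$, so the former positive instance 1110 is no longer classified as positive.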

Example 1 (continued). Let us go back to our loan scenario, and recall that $PS = \{p, q, r, s\}$, $\varphi = p \wedge q \wedge r$, and $\mu = \neg p \wedge s \wedge (q \oplus r)$. Figure 1 depicts through Karnaugh maps the models of $\varphi$ and $\mu$ (Fig. 1a), the 1-dilation of $\varphi$ (Fig. 1b), the Hamming-based dilation edit of $\varphi$ by $\mu$, which corresponds to the 2-dilation of $\varphi$ (Fig. 1c), and the Hamming-based min-generalization edit of $\varphi$ by $\mu$ (Fig. 1f), which corresponds to the disjunction of the two msgs given in Fig. 1d and 1e. Accordingly, we get that $\varphi +_{dil,d_H} \mu \equiv p \vee q \vee r$ and $\varphi +_{gen,d_H} \mu \equiv s \wedge (q \vee r)$.

4 Most specific generalization is a key concept in machine learning, see e.g., (Plotkin 1970; Mitchell 1977).

[Figure 1 (Karnaugh maps): (a) $[\varphi]$ and $[\mu]$; (b) $D^{d_H}_\varphi(1)$; (c) $[\varphi +_{dil,d_H} \mu]$, which equals $D^{d_H}_\varphi(2)$; (d) $[msg(\omega, \omega_1)]$; (e) $[msg(\omega, \omega_2)]$; (f) $[\varphi +_{gen,d_H} \mu]$.]

Figure 1: An example of Hamming-based dilation (Fig. 1c) and min-generalization (Fig. 1f) positive edits.

As can be verified on the example, dilation and min-generalization PE operators can easily add to $\varphi$ models that are neither models of $\varphi$ nor models of $\mu$, thus questioning negative instances (the counter-models of $\varphi$). Such a generalization power (required by incremental learning) is neither forbidden nor mandatory for PE operators (e.g., consider the basic and the drastic PE operators in Definition 3). Min-generalization PE operators may also question positive instances (again, this is expected when used for learning).

Negative Edit

Let us now consider the incorporation of negative instances into a Boolean classifier, i.e., negative edits. This time, the models of the change formula $\mu$ represent counter-examples of the target concept (a negative dataset).

Definition 7 (Negative Edit operator). An operator $-$ is said to be a negative edit operator (NE operator for short) if it satisfies the following postulates:

(N1) $\mu \models \neg(\varphi - \mu)$
(N2) If $\mu \models \neg\varphi$, then $\varphi - \mu \equiv \varphi$
(N3) If $\varphi_1 \equiv \varphi_2$ and $\mu_1 \equiv \mu_2$, then $\varphi_1 - \mu_1 \equiv \varphi_2 - \mu_2$
(N4) If $\psi \models \neg(\varphi - \mu)$, then $\varphi - \mu \equiv \varphi - (\mu \vee \psi)$

Similarly to the Harper and Levi identities, which show how a revision operator can be defined from a contraction operator and vice-versa (see e.g., (Caridroit, Konieczny, and Marquis 2017)), one can identify a correspondence between PE and NE operators. With an operator $\star : \mathcal{L} \times \mathcal{L} \to \mathcal{L}$, let us associate an operator $\sigma(\star) : \mathcal{L} \times \mathcal{L} \to \mathcal{L}$ defined as:

$$\varphi \;\sigma(\star)\; \mu = \neg(\neg\varphi \star \mu),$$

for every classifier $\varphi$ and every formula $\mu$. Then:

Proposition 5. $\sigma$ is an involution, that is, for each operator $\star : \mathcal{L} \times \mathcal{L} \to \mathcal{L}$, $\sigma(\sigma(\star)) = \star$. Moreover, $\sigma(\star)$ is an NE operator if and only if $\star$ is a PE operator.

We say that the operator $\sigma(\star)$ is the dual of $\star$. For instance, consider again the trivial, basic and drastic PE operators introduced in Definition 3. Then it is easy to see that the duals of these operators, i.e., the trivial, basic, and drastic NE operators, respectively denoted by $-_T = \sigma(+_T)$, $-_B = \sigma(+_B)$, and $-_D = \sigma(+_D)$, are defined for each classifier $\varphi$ and each negative dataset $\mu$ by:

$\varphi -_T \mu = \varphi$ if $\mu \models \neg\varphi$, otherwise $\varphi -_T \mu = \bot$
$\varphi -_B \mu = \varphi \wedge \neg\mu$
$\varphi -_D \mu = \varphi$ if $\mu \models \neg\varphi$, otherwise $\varphi -_D \mu = \neg\mu$

Dual operators of dilation operators and min-generalization operators can also be easily defined. Interestingly, the operators dual to dilation operators involve an operation of formula erosion (Bloch and Lang 2002), which is an operation on formulae dual to dilation. For instance, the Hamming-based erosion operator, denoted by $-_{ero,d_H}$, is defined for each classifier $\varphi$ and each negative dataset $\mu$ as $\varphi -_{ero,d_H} \mu = \neg(\neg\varphi +_{dil,d_H} \mu)$.
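The duality mapping $\sigma$ is easy to sketch on model sets, where negation becomes set complement. The code below is an illustration under that assumption, with hypothetical function names and a two-variable vocabulary chosen to keep the exhaustive check small. It compares the duals of the trivial, basic and drastic PE operators with the closed forms given above and checks that applying $\sigma$ twice gives back the original operator.

```python
from itertools import combinations, product

# Assumption: two variables (4 worlds) keep the exhaustive check tiny.
WORLDS = frozenset(product((0, 1), repeat=2))

def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

# PE operators of Definition 3 on sets of models.
def trivial_pe(phi, mu):
    return phi if mu <= phi else WORLDS

def basic_pe(phi, mu):
    return phi | mu

def drastic_pe(phi, mu):
    return phi if mu <= phi else mu

def dual(op):
    """sigma(op): complement the classifier, apply op, complement the result."""
    def dual_op(phi, mu):
        return WORLDS - op(WORLDS - phi, mu)
    return dual_op

def check_duals():
    """Compare the duals with their closed forms and test the involution."""
    for phi in subsets(WORLDS):
        for mu in subsets(WORLDS):
            compliant = not (mu & phi)      # mu entails the negation of phi
            assert dual(trivial_pe)(phi, mu) == (phi if compliant else frozenset())
            assert dual(basic_pe)(phi, mu) == phi - mu
            assert dual(drastic_pe)(phi, mu) == (phi if compliant else WORLDS - mu)
            for op in (trivial_pe, basic_pe, drastic_pe):
                assert dual(dual(op))(phi, mu) == op(phi, mu)
    return True

if __name__ == "__main__":
    print(check_duals())
```

Working with model sets makes the involution immediate: complementing twice is the identity, both on the input classifier and on the output.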

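Full Edit

Let us finally consider the more general case where the new piece of evidence consists of both positive and negative instances that must be incorporated into the classifier. We call such a piece of evidence a dataset, i.e., a pair $(\mu^+, \mu^-)$ such that $\mu^+$ is a positive dataset (a set of examples), $\mu^-$ is a negative dataset (a set of counter-examples), and $[\mu^+ \wedge \mu^-] = \emptyset$. The set $\mathcal{D}$ denotes the set of all datasets.

Definition 8 (Full Edit operator). An operator $\pm : \mathcal{L} \times \mathcal{D} \to \mathcal{L}$ is said to be a full edit operator (FE operator for short) if for each classifier $\varphi$ and each dataset $(\mu^+, \mu^-)$, it satisfies the following postulates:

(F1) $\mu^+ \models \varphi \pm (\mu^+, \mu^-)$
(F2) $\mu^- \models \neg(\varphi \pm (\mu^+, \mu^-))$
(F3) If $\mu^+ \models \varphi$ and $\mu^- \models \neg\varphi$, then $\varphi \pm (\mu^+, \mu^-) \equiv \varphi$
(F4) If $\varphi_1 \equiv \varphi_2$, $\mu^+_1 \equiv \mu^+_2$ and $\mu^-_1 \equiv \mu^-_2$, then $\varphi_1 \pm (\mu^+_1, \mu^-_1) \equiv \varphi_2 \pm (\mu^+_2, \mu^-_2)$
(F5) If $\psi \models \varphi \pm (\mu^+, \mu^-)$ and $\alpha \models \neg(\varphi \pm (\mu^+, \mu^-))$, then $\varphi \pm (\mu^+, \mu^-) \equiv \varphi \pm (\mu^+ \vee \psi, \mu^- \vee \alpha)$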

The postulate (F1) (resp. (F2)) corresponds to (P1) (resp. (N1)), while (F3) (resp. (F4), (F5)) is a (weak) combination of (P2) and (N2) (resp. (P3) and (N3), (P4) and (N4)). A number of FE operators can be defined by means of a PE operator or an NE operator. Given an operator $+ : \mathcal{L} \times \mathcal{L} \to \mathcal{L}$, let us define the operator $\pm_{+} : \mathcal{L} \times \mathcal{D} \to \mathcal{L}$ for each classifier $\varphi$ and each dataset $(\mu^+, \mu^-)$ as: $\varphi \pm_{+} (\mu^+, \mu^-) = (\varphi + \mu^+) \wedge \neg\mu^-$.
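The construction just given, $(\varphi + \mu^+) \wedge \neg\mu^-$, can be sketched on model sets and tested exhaustively against (F1), (F2), (F3) and (F5) over a two-variable vocabulary. This is an illustration with hypothetical names, not the authors' proof; (F4) holds by construction because model sets are used instead of formulae.

```python
from itertools import combinations, product

# Assumption: two variables (4 worlds) keep the exhaustive check cheap.
WORLDS = frozenset(product((0, 1), repeat=2))

def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

SETS = subsets(WORLDS)
# A dataset is a pair of disjoint sets (positive examples, negative examples).
DATASETS = [(pos, neg) for pos in SETS for neg in SETS if not pos & neg]

def basic_pe(phi, mu):
    return phi | mu

def drastic_pe(phi, mu):
    return phi if mu <= phi else mu

def induced_from(pe):
    """Full edit built from a positive edit: (phi + pos) minus neg."""
    return lambda phi, pos, neg: pe(phi, pos) - neg

def is_full_edit(fe):
    """Exhaustively test (F1), (F2), (F3) and (F5) for fe over WORLDS."""
    for phi in SETS:
        for pos, neg in DATASETS:
            out = fe(phi, pos, neg)
            if not pos <= out or out & neg:                    # (F1), (F2)
                return False
            if pos <= phi and not neg & phi and out != phi:    # (F3)
                return False
            for psi in subsets(out):                           # (F5)
                for alpha in subsets(WORLDS - out):
                    if fe(phi, pos | psi, neg | alpha) != out:
                        return False
    return True

if __name__ == "__main__":
    print(is_full_edit(induced_from(basic_pe)))
    print(is_full_edit(induced_from(drastic_pe)))
    # Starting from conjunction, which is not a PE operator, (F1) fails.
    print(is_full_edit(induced_from(lambda phi, mu: phi & mu)))
```

In the (F5) loop, the extended pair is always a valid dataset: $\psi$ ranges over subsets of the edited classifier and $\alpha$ over subsets of its complement, so the two extended batches stay disjoint.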

We say that $\pm_{+}$ is positively induced by $+$. Then:

Proposition 6. $\pm_{+}$ is an FE operator if and only if $+$ is a PE operator.

Proposition 6 gives us a constructive way to define an FE operator from a PE operator. Consider for instance the dilation operator $+_{dil,d}$, where $d$ is any distance between worlds (cf. Definition 5). Then the operator $\pm_{+_{dil,d}}$ consists in first dilating an input classifier $\varphi$ so as to include all positive examples from $\mu^+$, and then removing all instances introduced in the dilation step according to $\mu^-$. As a consequence of Proposition 6, this operator satisfies (F1-F5). Likewise, each NE operator also defines an FE operator. Given an operator $- : \mathcal{L} \times \mathcal{L} \to \mathcal{L}$, the operator $\pm_{-}$ is said to be negatively induced by $-$ if it is defined for each classifier $\varphi$ and each dataset $(\mu^+, \mu^-)$ by $\varphi \pm_{-} (\mu^+, \mu^-) = (\varphi - \mu^-) \vee \mu^+$. Echoing Proposition 6, we get that:

Proposition 7. $\pm_{-}$ is an FE operator if and only if $-$ is an NE operator.

Remark that inducing an operator $\pm$ by a PE operator and an NE operator, e.g., as $\varphi \pm (\mu^+, \mu^-) = (\varphi + \mu^+) - \mu^-$, does not always define an FE operator, even when $+$ and $-$ are dual. To give an example where this kind of construction does not work, let us consider our loan scenario again:

Example 1 (continued). Assume now that Alice receives an additional applicant, Emir, with profile $\omega_3 = 0100$. Since Emir has a low income, debts, and has not yet reimbursed his previous loan, Alice is certain that Emir is not eligible for a new loan. We are then given a dataset $\mu = (\mu^+, \mu^-)$, where $\mu^+ = \neg p \wedge s \wedge (q \oplus r)$ with $[\mu^+] = \{\omega_1, \omega_2\}$ (Bob / Cindy and Dan are positive examples), and $\mu^- = \neg p \wedge q \wedge \neg r \wedge \neg s$, i.e., $[\mu^-] = \{\omega_3\}$ (Emir is a negative example). Let us consider the operator $\pm$ defined by $\varphi \pm \mu = (\varphi +_{dil,d_H} \mu^+) -_{ero,d_H} \mu^-$, i.e., the classifier is first edited according to $\mu^+$ using the Hamming-based dilation edit $+_{dil,d_H}$, and is then edited again according to $\mu^-$ using the Hamming-based erosion edit $-_{ero,d_H}$, that is, the negative edit operator dual to $+_{dil,d_H}$. Recall first that $\varphi' = \varphi +_{dil,d_H} \mu^+ \equiv p \vee q \vee r$ (cf. Fig. 1c). Then we get that $\varphi'' = \varphi \pm \mu = \varphi' -_{ero,d_H} \mu^- \equiv (p \wedge q) \vee (r \wedge (p \vee q))$, with $[\varphi''] = D^{d_H}_\varphi(1)$ (cf. Fig. 1b). Yet $\{\omega_1, \omega_2\} \cap [\varphi''] = \emptyset$, i.e., Bob / Cindy and Dan are not classified as positive instances by the edited classifier $\varphi''$. Hence, $\pm$ does not satisfy (F1), i.e., $\pm$ is not an FE operator.

At this stage, a natural question is whether one can find an FE operator that is not induced by a PE operator or an NE operator. We provide below a positive answer to this question. In fact, we introduce an operator which is not decomposable in any way by means of a combination of a PE operator and an NE operator. Formally, given an FE operator $\pm$, a PE operator $+$ and an NE operator $-$, we say that the pair $(+, -)$ is faithful to $\pm$ if for each classifier $\varphi$ and each dataset $(\mu^+, \mu^-)$, $\varphi \pm (\mu^+, \mu^-) \equiv (\varphi + \mu^+) - \mu^-$ or $\varphi \pm (\mu^+, \mu^-) \equiv (\varphi - \mu^-) + \mu^+$. An FE operator $\pm$ is then said to be decomposable if $\pm$ admits a faithful pair $(+, -)$. In particular, positively induced FE operators $\pm_{+}$ are decomposable: it can be easily verified that for each PE operator $+$, the FE operator $\pm_{+}$ admits the faithful pair $(+, -_B)$, where $-_B$ is the basic NE operator (recall that $\varphi -_B \mu = \varphi \wedge \neg\mu$, for each classifier $\varphi$ and each negative dataset $\mu$). Similarly, negatively induced FE operators $\pm_{-}$ are decomposable as well since they admit the faithful pair $(+_B, -)$. Now, let us consider the operator $\pm$ defined for each classifier $\varphi$ and each dataset $(\mu^+, \mu^-)$ as:

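$$\varphi \pm (\mu^+, \mu^-) = \begin{cases} \varphi, & \text{if } \mu^+ \models \varphi \text{ and } \mu^- \models \neg\varphi, \\ \top, & \text{if } \mu^+ \not\models \varphi \text{ and } [\mu^-] = \emptyset, \\ \mu^+, & \text{otherwise.} \end{cases}$$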

This operator simply leaves the classifier unchanged when it already complies with the input dataset (as required by (F3)). In the remaining cases, it behaves like the trivial PE operator $+_T$ if the negative batch is empty, otherwise it behaves like the drastic PE operator $+_D$ (cf. Definition 3). We can show that:

Proposition 8. The operator $\pm$ defined above is an FE operator that is not decomposable.

This leaves open the interesting question of whether a characterization result for FE operators can be found, which we leave for further research.
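The loan counterexample and the case-defined operator discussed above can be replayed on model sets. The sketch below assumes the Hamming distance, encodes worlds as tuples $(p, q, r, s)$, and uses illustrative function names. It only evaluates the two operators on the loan data; it does not establish that either one satisfies or violates the postulates in general.

```python
from itertools import product

WORLDS = frozenset(product((0, 1), repeat=4))   # worlds are (p, q, r, s)

def hamming(w1, w2):
    """Number of variables on which two worlds differ."""
    return sum(a != b for a, b in zip(w1, w2))

def dist_to(world, phi):
    """Distance from a world to the closest model of a non-empty classifier."""
    return min(hamming(world, m) for m in phi)

def dilation_edit(phi, mu):
    """Hamming-based dilation edit: least dilation of phi covering mu."""
    if not phi:
        return frozenset(mu)
    k = max((dist_to(w, phi) for w in mu), default=0)
    return frozenset(w for w in WORLDS if dist_to(w, phi) <= k)

def erosion_edit(phi, mu):
    """Dual of dilation: complement, dilate by the negative batch, complement."""
    return WORLDS - dilation_edit(WORLDS - phi, mu)

def chained(phi, pos, neg):
    """Dilation by the positive batch followed by erosion by the negative one."""
    return erosion_edit(dilation_edit(phi, pos), neg)

def case_operator(phi, pos, neg):
    """The operator defined by cases on (phi, pos, neg)."""
    if pos <= phi and not neg & phi:
        return phi                  # already compliant with the dataset
    if not pos <= phi and not neg:
        return WORLDS               # empty negative batch: trivial behaviour
    return pos                      # otherwise: keep exactly the examples

phi = frozenset(w for w in WORLDS if w[0] and w[1] and w[2])   # p and q and r
pos = frozenset({(0, 0, 1, 1), (0, 1, 0, 1)})                  # Bob; Cindy and Dan
neg = frozenset({(0, 1, 0, 0)})                                # Emir

chained_result = chained(phi, pos, neg)
case_result = case_operator(phi, pos, neg)

if __name__ == "__main__":
    print("chained keeps the examples:", pos <= chained_result)
    print("case operator keeps the examples:", pos <= case_result)
    print("case operator excludes Emir:", not (neg & case_result))
```

On these inputs the chained edit ends up with the worlds in which at least two of $p, q, r$ hold, which contains neither Bob's profile nor Cindy and Dan's, whereas the case-defined operator returns exactly the positive batch.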

Related Work

Theory revision is a change operation studied by ML researchers in the nineties that is connected to edit. Theory revision is an important component of concept formation, and as such it has been investigated and implemented as part of knowledge acquisition and machine learning systems (see, e.g., MOBAL (Morik et al. 1994) and EITHER (Ourston and Mooney 1994)). Typically, in theory revision, a theory $\Sigma$ is a logical representation (most of the time, a FOL formula) linking together atoms that denote the features used for describing instances and the targeted concepts. An instance $x$ is classified by $\Sigma$ as an element of a concept $y$ (represented by an atom) whenever $y$ can be deduced from $\Sigma$ and $x$. When an instance together with its right concept (given by the change formula) is not classified by $\Sigma$ as expected, a theory revision operator can be exploited to modify $\Sigma$ so that the instance is no longer classified incorrectly by the revised theory. AGM contraction operators (Alchourrón, Gärdenfors, and Makinson 1985) can be used to this end (Wrobel 1993). Note that an instance $x$ may not be classified by $\Sigma$ as an element of any concept. Accordingly, the representation $\Sigma$ to be changed does not necessarily represent a full classifier as in the edit case. Furthermore, the basic operations that are used to derive the revised theory are usually not syntax-independent. This is typically the case, e.g., in the learning from interpretations setting (De Raedt 1997; De Raedt and Dehaspe 1997), where both the set of examples and the revised theory (called hypothesis) are full clausal theories, and in various other formalizations of concept learning in logical settings, including inductive logic programming (Muggleton and De Raedt 1994; Flach 1997). This also makes them distinct from edit operations. Finally, works on theory revision typically focus on defining specific approaches to achieve a revision of the input theory (possibly using few basic operations (Goldsmith et al. 2004; Goldsmith and Sloan 2005)), but they do not adopt an axiomatic perspective for delineating all the rational theory revision operators.

More recently, Zhou (2019) emphasized again the importance of integrating learning and reasoning in modern learning systems. The idea is to improve the decisions made by an underlying ML system $C$ (the classifier), taking advantage of a reasoning module $M$. Roughly speaking, whenever a prediction $P$ is made by the classifier $C$, it is transmitted to the reasoning module, which checks whether the prediction is correct. If it is correct, nothing is changed; otherwise, the corrected prediction $P'$ found by $M$ is transmitted back to $C$, which is trained again using $P'$. Our edit framework is similar in essence: a classifier $\phi$ plays the role of $C$, and the positive / negative dataset $\mu$ is provided by an underlying expert module $M$. One of the strengths of Zhou's approach is that it is model-agnostic: the ML system can be any black-box function. This is reminiscent of our edit framework, where no assumption is made on the representation of the input classifier $\phi$ and batch $\mu$ besides being propositional formulae. However, in (Zhou 2019), the correction step is achieved by learning, which means that there is no guarantee that the repair is effective in the general case. In comparison, our framework, by its principled nature, guarantees that the classifier becomes fully compliant with the input batch after the edit (cf. (P1), (N1), (F1) and (F2)).

Modifying a predictor so as to better take into account instances that are misclassified, as done with edit operations, is also at the core of boosting, a key principle in ML. In adaptive boosting for binary classification (Freund and Schapire 1997), the predictor has the form of an ensemble of weak learners, often decision trees reduced to decision stumps.
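As an illustration of how such an ensemble is adjusted, the following Python sketch implements one reweighting round of AdaBoost in its standard textbook form, where instance weights are updated deterministically instead of being used to resample the training set. The names `adaboost_round` and `boosted_predict` are hypothetical helpers introduced here for illustration; they come neither from the paper nor from any library, and labels are assumed to be encoded as +1 / -1.

```python
import math


def adaboost_round(weights, labels, predictions):
    """One AdaBoost reweighting round; labels and predictions are +1 or -1."""
    # Weighted error of the weak learner under the current distribution.
    error = sum(w for w, y, h in zip(weights, labels, predictions) if y != h)
    if not 0.0 < error < 0.5:
        raise ValueError("sketch assumes a weighted error strictly between 0 and 0.5")
    # Weight (vote) of the weak learner in the final weighted sum.
    alpha = 0.5 * math.log((1.0 - error) / error)
    # Raise the weights of misclassified instances, lower the others.
    raw = [w * math.exp(-alpha * y * h)
           for w, y, h in zip(weights, labels, predictions)]
    total = sum(raw)
    return alpha, [r / total for r in raw]


def boosted_predict(alphas, weak_outputs):
    """Sign of the weighted vote of the weak learners for one instance."""
    score = sum(a * h for a, h in zip(alphas, weak_outputs))
    return 1 if score >= 0 else -1


# Toy run: four instances, the weak learner errs on the last one only.
labels = [1, 1, -1, -1]
predictions = [1, 1, -1, 1]
alpha, new_weights = adaboost_round([0.25] * 4, labels, predictions)
print(new_weights)  # the weight of the last instance rises to about 0.5
print(boosted_predict([alpha], [predictions[3]]) == labels[3])  # False
```

In this toy run, the misclassified instance gains weight for the next round, yet the ensemble obtained after the round still labels it incorrectly: the update only changes how much attention the instance receives later, it does not force a correct classification.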
The output of those weak learners is combined into a weighted sum that represents the final output of the boosted classifier. AdaBoost is an iterative learning algorithm: at each iteration, the algorithm samples the training set according to the distribution given by the weights associated with the instances (at start, the uniform distribution is considered), then it looks for a weak classifier that minimizes the total weighted error, uses it to calculate the error rate and the weight of the weak classifier that has been generated, and finally updates the weights of the instances so as to favor, at the next step, the selection of instances that have been misclassified by the generated weak learner. After a preset number of iterations, the algorithm stops. It turns out that the boosted classifier generated after an iteration may still misclassify instances that were already misclassified by the boosted classifier before the iteration. Accordingly, the update operation at work in AdaBoost for improving the current boosted tree at each iteration is not a positive edit operator: (P1) is not satisfied.

Lastly, closely related to our work is a recent paper about classifier rectification (Coste-Marquis and Marquis 2021). Unlike the present paper, more than two classes can be considered in (Coste-Marquis and Marquis 2021) (classes are explicitly represented). Thus, two subsets $X$ and $Y$ of $PS$ are used to encode, on the one hand, instances (positive ones and negative ones) and, on the other hand, classes. When only two classes are targeted (the class of positive instances, a subset of $\Omega_X$, the worlds over $X$, and its complementary set in $\Omega_X$ containing the negative instances), a singleton $Y = \{y\}$ is enough. Coste-Marquis and Marquis (2021) point out rules to be obeyed by any rational change operation on Boolean classifiers $\Sigma$ when new pieces of evidence $T$ must be taken into account. Boolean classifiers $\Sigma$ are formulae from $L$ satisfying the so-called $XY$-classification property. When $Y = \{y\}$, this precisely means that $\Sigma$ is equivalent to $\phi_X \Leftrightarrow y$, where $\phi_X$ is a formula over $X$. Thus, $\Sigma$ classifies a given instance $x \in \Omega_X$ as positive (resp. negative) whenever $x \models \phi_X$ (resp. $x \models \neg\phi_X$). Accordingly, every PE operation (resp. NE operation) of $\phi_X$ by a change formula $\mu_X$ corresponds to a rectification operation of $\Sigma = \phi_X \Leftrightarrow y$ by $T = \mu_X \Rightarrow y$ (resp. $T = \mu_X \Rightarrow \neg y$). Postulates for the rectification operation have been provided in (Coste-Marquis and Marquis 2021). Though some connections between rectification postulates and PE / NE postulates exist, it is not the case that every PE (or NE) operator induces a rectification operator. Indeed, the rectification postulate (RE2) (see (Coste-Marquis and Marquis 2021) for details) formalizes a very demanding view of minimal change: when a change concerning an example (positive instance) $\mu_X$ is triggered, the classifications achieved by the rectified classifier coincide with those achieved by the classifier $\Sigma$ at start, except possibly for $\mu_X$ (accordingly, rectification operators do not allow any generalization to take place and thus are not convenient for incremental learning).

Conclusion

This paper focused on the question of editing Boolean classifiers, i.e., determining how a Boolean classifier should be modified when new pieces of evidence must be incorporated, an issue at the crossroads of ML and KR. Though the accuracy of ML models is impressive most of the time (especially when classifiers are learned from a sufficient amount of data), the risk of error cannot be totally removed (this is intrinsic to inductive generalization). Thus, it is important to design and study approaches to determine how a classifier should be modified whenever it does not label an instance in the right way. The reported work is a step in this direction, centered on the identification of first principles (postulates) for characterizing what a rational change could be when dealing with Boolean classifiers.

One of our key assumptions in this paper is that the input dataset is fully reliable, which is reflected by the success postulate (P1) (in the case of positive edit). However, a number of standard learning algorithms take noisy examples into account, e.g., the k-NN algorithm, the perceptron algorithm, and algorithms for generating decision trees with pruning; and those algorithms do not satisfy (P1). To extend the edit setting to noisy data, we plan to investigate how (P1) could be relaxed, so that an example is incorporated only if the corresponding piece of evidence is considered sufficiently often by the learning algorithm. For capturing such a behavior, improvement appears as a promising candidate (Konieczny, Medina Grespan, and Pino Pérez 2010), and determining the extent to which edit and improvement could be combined is a valuable perspective for further work.

Acknowledgments

This work has benefited from the support of the JSPS KAKENHI Grant Number JP21H04905 and JST CREST Grant Number JPMJCR22D3, Japan. Pierre Marquis has benefited from the support of the AI Chair EXPEKCTATION (ANR-19-CHIA-0005-01) of the French National Research Agency. He was also partially supported by TAILOR, a project funded by EU Horizon 2020 research and innovation programme under GA No. 952215.

References

Alchourrón, C. E.; Gärdenfors, P.; and Makinson, D. 1985. On the logic of theory change: partial meet contraction and revision functions. Journal of Symbolic Logic, 50: 510–530.
Alchourrón, C. E.; and Makinson, D. 1985. On the logic of theory change: safe contraction. Studia Logica, 44(4): 405–422.
Audemard, G.; Bellart, S.; Bounia, L.; Koriche, F.; Lagniez, J.-M.; and Marquis, P. 2021. On the Computational Intelligibility of Boolean Classifiers. In Proceedings of the 18th International Conference on Principles of Knowledge Representation and Reasoning (KR'21), 74–86.
Audemard, G.; Koriche, F.; and Marquis, P. 2020. On Tractable XAI Queries based on Compiled Representations. In Proceedings of the 17th International Conference on Principles of Knowledge Representation and Reasoning (KR'20), 838–849.
Aygün, O.; and Sönmez, T. 2013. Matching with Contracts: Comment. American Economic Review, 103(5): 2050–2051.
Bloch, I.; and Lang, J. 2002. Towards mathematical morpho-logics, 367–380. Heidelberg: Physica-Verlag HD.
Caridroit, T.; Konieczny, S.; and Marquis, P. 2017. Contraction in propositional logic. International Journal of Approximate Reasoning, 80: 428–442.
Coste-Marquis, S.; and Marquis, P. 2021. On Belief Change for Multi-Label Classifier Encodings. In Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI'21), 1829–1836.
Dalal, M. 1988. Investigations into a theory of knowledge base revision: preliminary report. In Proceedings of the 7th National Conference on Artificial Intelligence (AAAI'88), 475–479.
De Raedt, L. 1997. Logical Settings for Concept-Learning. Artificial Intelligence, 95(1): 187–201.
De Raedt, L.; and Dehaspe, L. 1997. Clausal Discovery. Machine Learning, 26(2-3): 99–146.
Domingos, P. M.; and Hulten, G. 2000. Mining high-speed data streams. In Proceedings of the 6th International Conference on Knowledge Discovery and Data Mining (SIGKDD'00), 71–80.
Fisher, D. H. 1987. Knowledge Acquisition via Incremental Conceptual Clustering. Machine Learning, 2(2): 139–172.
Flach, P. A. 1997. Normal Forms for Inductive Logic Programming. In Proceedings of the 7th International Workshop of Inductive Logic Programming (ILP'97), 149–156.
Freund, Y.; and Schapire, R. E. 1997. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. Journal of Computer and System Sciences, 55(1): 119–139.
Gallant, S. I. 1988. Connectionist Expert Systems. Communications of the ACM, 31(2): 152–169.
Gärdenfors, P. 1988. Knowledge in flux. MIT Press.
Goldsmith, J.; and Sloan, R. H. 2005. New Horn Revision Algorithms. Journal of Machine Learning Research, 6: 1919–1938.
Goldsmith, J.; Sloan, R. H.; Szörényi, B.; and Turán, G. 2004. Theory revision with queries: Horn, read-once, and parity formulas. Artificial Intelligence, 156(2): 139–176.
Hatfield, J. W.; and Milgrom, P. R. 2005. Matching with Contracts. American Economic Review, 95(4): 913–935.
Ignatiev, A. 2020. Towards Trustable Explainable AI. In Proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI'20), 5154–5158.
Ignatiev, A.; and Silva, J. M. 2021. SAT-Based Rigorous Explanations for Decision Lists. In Proceedings of the 24th International Conference on Theory and Applications of Satisfiability Testing (SAT'21), 251–269.
Izza, Y.; Ignatiev, A.; and Marques-Silva, J. 2020. On Explaining Decision Trees. CoRR, abs/2010.11034.
Izza, Y.; and Marques-Silva, J. 2021. On Explaining Random Forests with SAT. In Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI'21), 2584–2591.
Katsuno, H.; and Mendelzon, A. O. 1991. Propositional knowledge base revision and minimal change. Artificial Intelligence, 52: 263–294.
Konieczny, S.; Medina Grespan, M.; and Pino Pérez, R. 2010. Taxonomy of Improvement Operators and the Problem of Minimal Change. In Proceedings of the 12th International Conference on Principles of Knowledge Representation and Reasoning (KR'10), 161–170.
Maloof, M. A.; and Michalski, R. S. 2000. Selecting Examples for Partial Memory Learning. Machine Learning, 41(1): 27–52.
Maloof, M. A.; and Michalski, R. S. 2004. Incremental learning with partial instance memory. Artificial Intelligence, 154(1-2): 95–126.
Mitchell, T. M. 1977. Version Spaces: A Candidate Elimination Approach to Rule Learning. In Proceedings of the 5th International Joint Conference on Artificial Intelligence (IJCAI'77), 305–310.
Mitchell, T. M. 1982. Generalization as Search. Artificial Intelligence, 18(2): 203–226.
Morik, K.; Potamias, G.; Moustakis, V.; and Charissis, G. 1994. Knowledgeable learning using MOBAL: A medical case study. Applied Artificial Intelligence, 8(4): 579–592.
Muggleton, S.; and De Raedt, L. 1994. Inductive Logic Programming: Theory and Methods. Journal of Logic Programming, 19/20: 629–679.
Narodytska, N.; Kasiviswanathan, S. P.; Ryzhyk, L.; Sagiv, M.; and Walsh, T. 2018. Verifying Properties of Binarized Deep Neural Networks. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI'18), 6615–6624.
Ourston, D.; and Mooney, R. 1994. Theory Refinement Combining Analytical and Empirical Methods. Artificial Intelligence, 66: 273–309.
Plotkin, G. D. 1970. A Note on Inductive Generalization. Machine Intelligence, 5: 153–163.
Quinlan, J. R. 1986. Induction of Decision Trees. Machine Learning, 1(1): 81–106.
Rosenblatt, F. 1958. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, 65(6): 386–408.
Schlimmer, J. C.; and Fisher, D. H. 1986. A Case Study of Incremental Concept Induction. In Proceedings of the 5th National Conference on Artificial Intelligence (AAAI'86), 496–501.
Shi, W.; Shih, A.; Darwiche, A.; and Choi, A. 2020. On Tractable Representations of Binary Neural Networks. In Proceedings of the 17th International Conference on Principles of Knowledge Representation and Reasoning (KR'20), 882–892.
Shih, A.; Choi, A.; and Darwiche, A. 2018. A Symbolic Approach to Explaining Bayesian Network Classifiers. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18), 5103–5111.
Shih, A.; Choi, A.; and Darwiche, A. 2019. Compiling Bayesian Networks into Decision Graphs. In Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI'19), 7966–7974.
Utgoff, P. E. 1989. Incremental Induction of Decision Trees. Machine Learning, 4: 161–186.
Wrobel, S. 1993. On the Proper Definition of Minimality in Specialization and Theory Revision. In Proceedings of the European Conference on Machine Learning (ECML'93), 65–82.
Yang, Y.-Y. 2020. Rationalizable choice functions. Games and Economic Behavior, 123: 120–126.
Zhou, Z. 2019. Abductive learning: towards bridging machine learning and logical reasoning. Science China Information Sciences, 62(7): 76101:1–76101:3.