Published as a conference paper at ICLR 2019

GENERATIVE CODE MODELING WITH GRAPHS

Marc Brockschmidt, Miltiadis Allamanis, Alexander Gaunt
Microsoft Research, Cambridge, UK
{mabrocks,miallama,algaunt}@microsoft.com

Oleksandr Polozov
Microsoft Research, Redmond, WA, USA
polozov@microsoft.com

ABSTRACT

Generative models for source code are an interesting structured prediction problem, requiring reasoning about both hard syntactic and semantic constraints as well as about natural, likely programs. We present a novel model for this problem that uses a graph to represent the intermediate state of the generated output. Our model generates code by interleaving grammar-driven expansion steps with graph augmentation and neural message passing steps. An experimental evaluation shows that our new model can generate semantically meaningful expressions, outperforming a range of strong baselines.

1 INTRODUCTION

Learning to understand and generate programs is an important building block for procedural artificial intelligence and more intelligent software engineering tools. It is also an interesting task in the research of structured prediction methods: while imbued with formal semantics and strict syntactic rules, natural source code carries aspects of natural languages, since it acts as a means of communicating intent among developers. Early works in the area have shown that approaches from natural language processing can be applied successfully to source code (Hindle et al., 2012), whereas the programming languages community has had success in focusing exclusively on formal semantics. More recently, methods handling both modalities (i.e., the formal and the natural language aspects) have shown successes on important software engineering tasks (Raychev et al., 2015; Bichsel et al., 2016; Allamanis et al., 2018b) and semantic parsing (Yin & Neubig, 2017; Rabinovich et al., 2017).
However, current generative models of source code mostly focus on only one of these modalities at a time. For example, program synthesis tools based on enumeration and deduction (Solar-Lezama, 2008; Polozov & Gulwani, 2015; Feser et al., 2015; Feng et al., 2018) are successful at generating programs that satisfy some (usually incomplete) formal specification, but are often obviously wrong on manual inspection, as they cannot distinguish unlikely from likely, natural programs. On the other hand, learned code models have succeeded in generating realistic-looking programs (Maddison & Tarlow, 2014; Bielik et al., 2016; Parisotto et al., 2017; Rabinovich et al., 2017; Yin & Neubig, 2017). However, these programs often fail to be semantically relevant, for example because variables are not used consistently.

In this work, we try to overcome these challenges for generative code models and present a general method for generative models that can incorporate structured information that is deterministically available at generation time. We focus our attention on generating source code and follow the ideas of program graphs (Allamanis et al., 2018b), which have been shown to learn semantically meaningful representations of (pre-existing) programs. To achieve this, we lift grammar-based tree decoder models into the graph setting, where the diverse relationships between various elements of the generated code can be modeled. For this, the syntax tree under generation is augmented with additional edges denoting known relationships (e.g., last use of variables). We then interleave the steps of the generative procedure with neural message passing (Gilmer et al., 2017) to compute more precise representations of the intermediate states of the program generation.
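The augmentation step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the edge names follow the program-graph style (Child, NextToken, LastUse), but the node ids and the tiny AST are invented for the example.

```python
# Sketch: deterministically augmenting a partial syntax tree with extra
# edges, in the spirit of program graphs. All names here are illustrative.

def augment(ast_edges, token_order, last_use_pairs):
    """Return the edge set of the augmented graph.

    ast_edges:      (parent, child) pairs from the tree -> Child edges
    token_order:    terminal node ids in source order  -> NextToken edges
    last_use_pairs: (use, previous_use) variable pairs  -> LastUse edges
    """
    edges = [("Child", p, c) for p, c in ast_edges]
    edges += [("NextToken", a, b) for a, b in zip(token_order, token_order[1:])]
    edges += [("LastUse", u, prev) for u, prev in last_use_pairs]
    return edges

# Toy partial AST for `x = x + 1`: Assign -> (x, Plus -> (x, 1)).
tree = [("assign", "x1"), ("assign", "plus"), ("plus", "x2"), ("plus", "one")]
graph = augment(tree, ["x1", "x2", "one"], [("x2", "x1")])
```

The key property is that no edge is ever predicted: given the partial tree, the extra edges are computed by deterministic program analyses.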
This is fundamentally different from sequential generative models of graphs (Li et al., 2018; Samanta et al., 2018), which aim to generate all edges and nodes, whereas our graphs are deterministic augmentations of generated trees. To summarize, we present a) a general graph-based generative procedure for highly structured objects, incorporating rich structural information; b) ExprGen, a new code generation task focused on generating small, but semantically complex expressions conditioned on source code context; and c) a comprehensive experimental evaluation of our generative procedure and a range of baseline methods from the literature.

Algorithm 1 Pseudocode for Expand
Input: Context c, partial AST a, node v to expand
 1: h_v ← getRepresentation(c, a, v)
 2: rhs ← pickProduction(v, h_v)
 3: for child node type ℓ ∈ rhs do
 4:   (a, u) ← insertChild(a, ℓ)
 5:   if ℓ is nonterminal type then
 6:     a ← Expand(c, a, u)
 7: return a

  int ilOffsetIdx = Array.IndexOf(sortedILOffsets, map.ILOffset);
  int nextILOffsetIdx = ilOffsetIdx + 1;
  int nextMapILOffset = nextILOffsetIdx < sortedILOffsets.Length
      ? sortedILOffsets[nextILOffsetIdx] : int.MaxValue;

Figure 1: Example for ExprGen; the target expression to be generated is marked. Taken from BenchmarkDotNet, lightly edited for formatting.

2 BACKGROUND & TASK

The most general form of the code generation task is to produce a (partial) program in a programming language given some context information c. This context information can be natural language (as in, e.g., semantic parsing), input-output examples (e.g., inductive program synthesis), partial program sketches, etc. Early methods generate source code as a sequence of tokens (Hindle et al., 2012; Hellendoorn & Devanbu, 2017) and sometimes fail to produce syntactically correct code.
More recent models sidestep this issue by using the target language's grammar to generate abstract syntax trees (ASTs) (Maddison & Tarlow, 2014; Bielik et al., 2016; Parisotto et al., 2017; Yin & Neubig, 2017; Rabinovich et al., 2017), which are syntactically correct by construction. In this work, we follow the AST generation approach. The key idea is to construct the AST a sequentially, by expanding one node at a time using production rules from the underlying programming language grammar. This simplifies the code generation task to a sequence of classification problems, in which an appropriate production rule has to be chosen based on the context information and the partial AST generated so far. In this work, we simplify the problem further, similar to Maddison & Tarlow (2014) and Bielik et al. (2016), by fixing the order of the sequence to always expand the left-most, bottom-most nonterminal node. Alg. 1 illustrates the common structure of AST-generating models. Then, the probability of generating a given AST a given some context c is

    p(a | c) = ∏_t p(a_t | c, a_{<t}),

where a_t is the expansion decision made at step t and a_{<t} is the partial AST built before step t.

Note that comparison to the ground truth is purely syntactic, e.g., a generated j > i will not match the equivalent i < j.

Results. We show the results of our evaluation in Tab. 1. Overall, the graph encoder architecture seems to be best suited for this task. All models learn to generate syntactically valid code (which is relatively simple in our domain). However, the different encoder models perform very differently on semantic measures such as well-typedness and the retrieval of the ground truth expression. Most of the type errors are due to the use of an UNK literal (for example, the G → NAG model has only 4% type errors when filtering out such unknown literals). The results show a clear trend correlating better semantic results with the amount of information about the partially generated programs employed by the generative models. Transferring a trained model to unseen projects with a new project-specific vocabulary substantially worsens results, as expected.
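The left-most, bottom-most expansion order of Alg. 1 can be made concrete with a small runnable sketch. The toy grammar and the `pick` stub below are assumptions for illustration: `pick` stands in for the learned classifier that models p(a_t | c, a_{<t}); it is not the paper's model.

```python
# Minimal sketch of Alg. 1's Expand loop over a toy expression grammar.
# `pick` is a stand-in for the neural production chooser.

GRAMMAR = {  # nonterminal -> list of possible right-hand sides
    "Expr": [["Expr", "+", "Expr"], ["Var"], ["Lit"]],
}

def expand(node, pick, depth=0):
    """Expand `node` depth-first, left-to-right; return a nested tree."""
    if node not in GRAMMAR:            # terminal: nothing to expand
        return node
    rhs = pick(node, depth)            # one classification problem per step
    return (node, [expand(child, pick, depth + 1) for child in rhs])

# Deterministic stub: Expr -> Expr + Expr at the root, Var at the leaves.
def pick(node, depth):
    return GRAMMAR["Expr"][0] if depth == 0 else GRAMMAR["Expr"][1]

tree = expand("Expr", pick)  # builds the AST of `v1 + v2`-shaped expressions
```

A learned model replaces `pick` with a softmax over the grammar's productions, so log p(a | c) decomposes into a sum of per-step classification log-probabilities.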
Overall, our NAG model, which combines all additional signal sources, seems to perform best on most measures, and seems to be least impacted by the transfer.

  int methParamCount = 0;
  if (paramCount > 0) {
      IParameterTypeInformation[] moduleParamArr =
          GetParamTypeInformations(Dummy.Signature, paramCount);
      methParamCount = moduleParamArr.Length;
  }
  if ( paramCount > methParamCount ) {
      IParameterTypeInformation[] moduleParamArr =
          GetParamTypeInformations(Dummy.Signature,
              paramCount - methParamCount);
  }

  G → NAG: paramCount > methParamCount (34.4%)
           paramCount == methParamCount (11.4%)
           paramCount < methParamCount (10.0%)
  G → ASN: paramCount == 0 (12.7%)
           paramCount < 0 (11.5%)
           paramCount > 0 (8.0%)

  public static String URItoPath(String uri) {
      if (System.Text.RegularExpressions
              .Regex.IsMatch(uri, "file:\\\\[a-z,A-Z]:")) {
          return uri.Substring(6);
      }
      if ( uri.StartsWith(@"file:") ) {
          return uri.Substring(5);
      }
      return uri;
  }

  G → NAG: uri.Contains(UNK_STRING_LITERAL) (32.4%)
           uri.StartsWith(UNK_STRING_LITERAL) (29.2%)
           uri.HasValue() (7.7%)
  G → Syn: uri == UNK_STRING_LITERAL (26.4%)
           uri == "" (8.5%)
           uri.StartsWith(UNK_STRING_LITERAL) (6.7%)

Figure 3: Two lightly edited examples from our test set and expressions predicted by different models. More examples can be found in the supplementary material.

5.2 QUALITATIVE EVALUATION

As the results in the previous section suggest, the proposed ExprGen task is hard even for the strongest models we evaluated, which achieve no more than 50% accuracy on the top prediction. It is also unsolvable for classical logico-deductive program synthesis systems, as the provided code context does not form a precise specification. However, we do know that most instances of the task are (easily) solvable for professional software developers, and thus believe that machine learning systems can have considerable success on the task. Fig.
3 shows two (abbreviated) samples from our test set, together with the predictions made by the two strongest models we evaluated. In the first example, we can see that the G → NAG model correctly identifies that the relationship between paramCount and methParamCount is important (as they appear together in the block guarded by the expression to generate), and thus generates comparison expressions between the two variables. The G → ASN model lacks the ability to recognize that paramCount (or any variable) was already used and thus fails to insert both relevant variables. We found this to be a common failure, often leading to suggestions using only one variable (possibly repeatedly). In the second example, both G → NAG and G → Syn have learned the common if (var.StartsWith(...)) { ... var.Substring(num) ... } pattern, but of course fail to produce the correct string literal in the condition. We show results for all of our models on these examples, as well as on additional examples, in the supplementary material (Appendix B).

6 DISCUSSION & CONCLUSIONS

We presented a generative code model that leverages known semantics of partially generated programs to direct the generative procedure. The key idea is to augment partial programs to obtain a graph, and then use graph neural networks to compute a precise representation for the partial program. This representation then helps to better guide the remainder of the generative procedure. We have shown that this approach can be used to generate small but semantically interesting expressions from very imprecise context information. The presented model could be useful in program repair scenarios (where repair proposals need to be scored based on their context) or in the code review setting (where it could highlight very unlikely expressions). We also believe that similar models could have applications in related domains, such as semantic parsing, neural program synthesis, and text generation.
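The "compute a precise representation for the partial program" step relies on neural message passing over the augmented graph. The following is a deliberately simplified sketch: real GGNN-style models use learned, edge-type-specific transformations and a GRU state update, whereas here each edge type applies a fixed scalar weight and the update is state plus mean of incoming messages, purely to make the propagation pattern concrete. All node ids, weights, and states are invented.

```python
# One round of message passing over a (tiny) augmented program graph.
# Edge-type weights stand in for learned per-edge-type transformations.

def message_passing_step(states, edges, weights):
    """states: node -> state; edges: (etype, src, dst); weights: etype -> float."""
    incoming = {v: [] for v in states}
    for etype, src, dst in edges:
        # Each edge sends a message from src to dst, modulated by its type.
        incoming[dst].append(weights[etype] * states[src])
    # New state = old state + mean of incoming messages (GRU stand-in).
    return {v: s + (sum(m) / len(m) if (m := incoming[v]) else 0.0)
            for v, s in states.items()}

states = {"x1": 0.0, "x2": 1.0, "plus": 2.0}
edges = [("Child", "plus", "x2"), ("LastUse", "x2", "x1")]
new = message_passing_step(states, edges, {"Child": 0.5, "LastUse": 0.25})
```

Interleaving such propagation rounds with the expansion steps is what lets information like "this variable was already used" flow into the representation of the node being expanded.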
Miltiadis Allamanis, Earl T. Barr, Premkumar Devanbu, and Charles Sutton. A survey of machine learning for big code and naturalness. ACM Computing Surveys, 2018a.

Miltiadis Allamanis, Marc Brockschmidt, and Mahmoud Khademi. Learning to represent programs with graphs. In International Conference on Learning Representations (ICLR), 2018b.

Matthew Amodio, Swarat Chaudhuri, and Thomas W. Reps. Neural attribute machines for program generation. arXiv preprint arXiv:1705.09231, 2017.

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In International Conference on Learning Representations (ICLR), 2014.

Benjamin Bichsel, Veselin Raychev, Petar Tsankov, and Martin Vechev. Statistical deobfuscation of Android applications. In Conference on Computer and Communications Security (CCS), 2016.

Pavol Bielik, Veselin Raychev, and Martin Vechev. PHOG: probabilistic model for code. In International Conference on Machine Learning (ICML), 2016.

Kyunghyun Cho, Bart van Merriënboer, Dzmitry Bahdanau, and Yoshua Bengio. On the properties of neural machine translation: Encoder-decoder approaches. In Workshop on Syntax, Semantics and Structure in Statistical Translation, 2014.

Yu Feng, Ruben Martins, Osbert Bastani, and Isil Dillig. Program synthesis using conflict-driven learning. In Programming Language Design and Implementation (PLDI), 2018.

John K. Feser, Swarat Chaudhuri, and Isil Dillig. Synthesizing data structure transformations from input-output examples. In Programming Language Design and Implementation (PLDI), 2015.

Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl. Neural message passing for quantum chemistry. In International Conference on Machine Learning (ICML), 2017.

Vincent J. Hellendoorn and Premkumar Devanbu. Are deep neural networks the best choice for modeling source code? In Foundations of Software Engineering (FSE), 2017.
Abram Hindle, Earl T. Barr, Zhendong Su, Mark Gabel, and Premkumar Devanbu. On the naturalness of software. In International Conference on Software Engineering (ICSE), 2012.

Donald E. Knuth. Semantics of context-free languages. Mathematical Systems Theory, 2(2):127–145, 1967.

Yujia Li, Daniel Tarlow, Marc Brockschmidt, and Richard Zemel. Gated graph sequence neural networks. In International Conference on Learning Representations (ICLR), 2016.

Yujia Li, Oriol Vinyals, Chris Dyer, Razvan Pascanu, and Peter Battaglia. Learning deep generative models of graphs. CoRR, abs/1803.03324, 2018.

Cristina V. Lopes, Petr Maj, Pedro Martins, Vaibhav Saini, Di Yang, Jakub Zitny, Hitesh Sajnani, and Jan Vitek. DéjàVu: a map of code duplicates on GitHub. In Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA), 2017.

Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. Effective approaches to attention-based neural machine translation. In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015.

Chris J. Maddison and Daniel Tarlow. Structured generative models of natural source code. In International Conference on Machine Learning (ICML), 2014.

Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, and Pushmeet Kohli. Neuro-symbolic program synthesis. In International Conference on Learning Representations (ICLR), 2017.

Oleksandr Polozov and Sumit Gulwani. FlashMeta: a framework for inductive program synthesis. In Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA), 2015.

Maxim Rabinovich, Mitchell Stern, and Dan Klein. Abstract syntax networks for code generation and semantic parsing. In Annual Meeting of the Association for Computational Linguistics (ACL), 2017.

Veselin Raychev, Martin Vechev, and Eran Yahav. Code completion with statistical language models. In Programming Language Design and Implementation (PLDI), 2014.
Veselin Raychev, Martin Vechev, and Andreas Krause. Predicting program properties from "Big Code". In Principles of Programming Languages (POPL), 2015.

Bidisha Samanta, Abir De, Niloy Ganguly, and Manuel Gomez-Rodriguez. Designing random graph models using variational autoencoders with applications to chemical design. CoRR, abs/1802.05283, 2018.

Armando Solar-Lezama. Program synthesis by sketching. PhD thesis, University of California, Berkeley, 2008.

Oriol Vinyals, Meire Fortunato, and Navdeep Jaitly. Pointer networks. In Advances in Neural Information Processing Systems (NIPS), 2015.

Pengcheng Yin and Graham Neubig. A syntactic neural model for general-purpose code generation. In Annual Meeting of the Association for Computational Linguistics (ACL), 2017.

A DATASET SAMPLES

Below we list some sample snippets from the training set for our ExprGen task. The highlighted expressions are to be generated.

  for (int i = 0; i < 3*timeSpanUnits + 1 ; ++i) {
      consolidator.Update(new TradeBar { Time = refDateTime });
      if (i < timeSpanUnits) {
          // before initial consolidation happens
          Assert.IsNull(consolidated);
      } else {
          Assert.IsNotNull(consolidated);
          if ( i % timeSpanUnits == 0 ) { // i = 3, 6, 9
              Assert.AreEqual(refDateTime.AddMinutes(-timeSpanUnits),
                  consolidated.Time);
          }
      }
      refDateTime = refDateTime.AddMinutes(1);
  }

Figure 4: Sample snippet from the Lean project. Formatting has been modified.

  var words = (from word in phrase.Split(' ')
               where word.Length > 0
               select word.ToLower()).ToArray();

Figure 5: Sample snippet from the BotBuilder project. Formatting has been modified.

  _hasHandle = _mutex.WaitOne(
      timeOut < 0 ? Timeout.Infinite : timeOut,
      exitContext: false);

Figure 6: Sample snippet from the Chocolatey project. Formatting has been modified.
  public static T retry<T>(int numberOfTries, Func<T> function,
      int waitDurationMilliseconds = 100, int increaseRetryByMilliseconds = 0) {
      if (function == null) return default(T);
      if (numberOfTries == 0)
          throw new ApplicationException("You must specify a number"
              + " of retries greater than zero.");
      var returnValue = default(T);
      var debugging = log_is_in_debug_mode();
      var logLocation = ChocolateyLoggers.Normal;
      for (int i = 1; i <= numberOfTries ; i++) {

Figure 7: Sample snippet from the Chocolatey project. Formatting has been modified and the snippet has been abbreviated.

  while ( count >= startIndex ) {
      c = s[count];
      if ( c != ' ' && c != '\n' ) break;
      count--;
  }

Figure 8: Sample snippet from the CommonMark.NET project. Formatting has been modified.

  private string GetResourceForTimeSpan(TimeUnit unit, int count) {
      var resourceKey =
          ResourceKeys.TimeSpanHumanize.GetResourceKey(unit, count);
      return count == 1 ? Format(resourceKey) : Format(resourceKey, count);
  }

Figure 9: Sample snippet from the Humanizer project. Formatting has been modified.

  var indexOfEquals = segment.IndexOf('=') ;
  if ( indexOfEquals == -1 ) {
      var decoded = UrlDecode(segment, encoding);
      return new KeyValuePair<string, string>(decoded, decoded);
  }

Figure 10: Sample snippet from the Nancy project. Formatting has been modified.

  private bool ResolveWritableOverride(bool writable) {
      if (!Writable && writable)
          throw new StorageInvalidOperationException(
              "Cannot open writable storage" + " in readonly storage.");
      bool openWritable = Writable;
      if ( openWritable && !writable )
          openWritable = writable;
      return openWritable;
  }

Figure 11: Sample snippet from the Open Live Writer project. Formatting has been modified.

  char c = html[j];
  if ( c == ';' || (!(c >= 'a' && c <= 'z') && !(c >= 'A' && c <= 'Z')
          && !(c >= '0' && c <= '9')) ) {

Figure 12: Sample snippet from the Open Live Writer project. Formatting has been modified.
  string entityRef = html.Substring(i + 1, j - (i + 1)) ;

Figure 13: Sample snippet from the Open Live Writer project. Formatting has been modified.

B SAMPLE GENERATIONS

On the following pages, we list some sample snippets from the test set for our ExprGen task, together with suggestions produced by different models. The highlighted expressions are the ground truth expressions that should be generated.

  if (context.Context == _MARKUP_CONTEXT_TYPE.CONTEXT_TYPE_Text
          && !String.IsNullOrEmpty(text)) {
      idx = originalText.IndexOf(text) ;
      if (idx == 0) {
          // Drop this portion from the expected string
          originalText = originalText.Substring(text.Length);
          // Update the current pointer
          beginDamagePointer.MoveToPointer(currentRange.End);
      } else if (idx > 0 && originalText.Substring(0, idx)
              .Replace("\r\n", string.Empty).Length == 0) {
          // Drop this portion from the expected string
          originalText = originalText.Substring(text.Length + idx);
          // Update the current pointer
          beginDamagePointer.MoveToPointer(currentRange.End);
      } else {
          return false;
      }
  }

Sample snippet from Open Live Writer.
The following suggestions were made:

  Seq→Seq: UNK_TOKEN[i] (0.6%)
           input[inputOffset + 1] (0.3%)
           UNK_TOKEN & UNK_NUM_LITERAL (0.3%)

  MarshalUrlSupported.IndexOf(UNK_CHAR_LITERAL) (0.9%)
  IsEditFieldSelected.IndexOf(UNK_CHAR_LITERAL) (0.8%)
  marshalUrlSupported.IndexOf(UNK_CHAR_LITERAL) (0.7%)

  UNK_TOKEN.IndexOf(UNK_CHAR_LITERAL) (21.6%)
  UNK_TOKEN.LastIndexOf(UNK_CHAR_LITERAL) (14.9%)
  UNK_TOKEN.GetHashCode() (8.1%)

  UNK_CHAR_LITERAL.IndexOf(UNK_CHAR_LITERAL) (8.1%)
  UNK_CHAR_LITERAL.IndexOf(originalText) (8.1%)
  originalText.IndexOf(UNK_CHAR_LITERAL) (8.1%)

  originalText.GetHashCode() (37.8%)
  originalText.IndexOf(UNK_CHAR_LITERAL) (14.8%)
  originalText.LastIndexOf(UNK_CHAR_LITERAL) (6.2%)

  text.IndexOf(UNK_CHAR_LITERAL) (20.9%)
  text.LastIndexOf(UNK_CHAR_LITERAL) (12.4%)
  originalText.IndexOf(UNK_CHAR_LITERAL) (11.6%)

  originalText.IndexOf(UNK_CHAR_LITERAL) (32.8%)
  originalText.LastIndexOf(UNK_CHAR_LITERAL) (12.4%)
  originalText.IndexOf(text) (8.7%)

  caretPos--;
  if (caretPos < 0) {
      caretPos = 0;
  }
  int len = inputString.Length;
  if (caretPos >= len) {
      caretPos = len - 1 ;
  }

Sample snippet from acat. The following suggestions were made:

  Seq→Seq: UNK_TOKEN+1 (2.1%)
           UNK_TOKEN+UNK_TOKEN] (1.8%)
           UNK_TOKEN.IndexOf(UNK_CHAR_LITERAL) (1.3%)

  wordToReplace - 1 (3.2%)
  insertOrReplaceOffset - 1 (2.9%)
  inputString - 1 (1.9%)

  len + 1 (35.6%)
  len - 1 (11.3%)
  len >> UNK_NUM_LITERAL (3.5%)

  len + len (24.9%)
  len - len (10.7%)
  1 + len (3.7%)

  len + 1 (22.8%)
  len - 1 (10.8%)
  len + len (10.3%)

  len + 1 (13.7%)
  len - 1 (11.5%)
  len - len (11.0%)

  len++ (33.6%)
  len-1 (21.9%)
  len+1 (14.6%)

  public static String URItoPath(String uri) {
      if (System.Text.RegularExpressions
              .Regex.IsMatch(uri, "file:\\\\[a-z,A-Z]:")) {
          return uri.Substring(6);
      }
      if ( uri.StartsWith(@"file:") ) {
          return uri.Substring(5);
      }
      return uri;
  }

Sample snippet from acat.
The following suggestions were made:

  Seq→Seq: !UNK_TOKEN (11.1%)
           UNK_TOKEN == 0 (3.6%)
           UNK_TOKEN != 0 (3.4%)

  !uri (7.6%)
  !MyVideos (4.7%)
  !MyDocuments (4.7%)

  action == UNK_STRING_LITERAL (22.6%)
  label == UNK_STRING_LITERAL (14.8%)
  file.Contains(UNK_STRING_LITERAL) (4.6%)

  uri == uri (7.4%)
  uri.StartsWith(uri) (5.5%)
  uri.Contains(uri) (4.3%)

  uri == UNK_STRING_LITERAL (11.7%)
  uri.Contains(UNK_STRING_LITERAL) (11.7%)
  uri.StartsWith(UNK_STRING_LITERAL) (8.3%)

  uri == UNK_STRING_LITERAL (26.4%)
  uri == "" (8.5%)
  uri.StartsWith(UNK_STRING_LITERAL) (6.7%)

  uri.Contains(UNK_STRING_LITERAL) (32.4%)
  uri.StartsWith(UNK_STRING_LITERAL) (29.2%)
  uri.HasValue() (7.7%)

  startPos = index + 1;
  int count = endPos - startPos + 1;
  word = (count > 0) ? input.Substring(startPos, count) : String.Empty;

Sample snippet from acat. The following suggestions were made:

  Seq→Seq: UNK_TOKEN.Trim() (3.4%)
           UNK_TOKEN.Replace(UNK_STRING_LITERAL, UNK_STRING_LITERAL) (2.1%)
           UNK_TOKEN.Replace(UNK_CHAR, UNK_CHAR) (3.4%)

  input[index] (1.4%)
  startPos[input] (0.9%)
  input[count] (0.8%)

  val.Trim() (6.6%)
  input.Trim() (6.5%)
  input.Substring(UNK_NUM_LITERAL) (4.0%)

  UNK_STRING_LITERAL + UNK_STRING_LITERAL (8.4%)
  UNK_STRING_LITERAL + startPos (7.8%)
  startPos + UNK_STRING_LITERAL (7.8%)

  input.Trim() (15.6%)
  input.Substring(0) (6.4%)
  input.Replace(UNK_STRING_LITERAL, UNK_STRING_LITERAL) (2.8%)

  input.Trim() (7.8%)
  input.ToLower() (6.4%)
  input + UNK_STRING_LITERAL (5.6%)

  input+StartPos (11.8%)
  input+count (9.5%)
  input.Substring(startPos, endPos - count) (6.3%)

  protected virtual void CrawlSite() {
      while ( !_crawlComplete ) {
          RunPreWorkChecks();
          if (_scheduler.Count > 0) {
              _threadManager.DoWork(() =>
                  ProcessPage(_scheduler.GetNext()));
          } else if (!_threadManager.HasRunningThreads()) {
              _crawlComplete = true;
          } else {
              _logger.DebugFormat("Waiting for links to be scheduled...");
              Thread.Sleep(2500);
          }
      }
  }

Sample snippet from Abot. The following suggestions were made:

  Seq→Seq: !UNK_TOKEN (9.4%)
           UNK_TOKEN > 0 (2.6%)
           UNK_TOKEN != value (1.3%)

  !_maxPagesToCrawlLimitReachedOrScheduled (26.2%)
  !_crawlCancellationReported (26.0%)
  !_crawlStopReported (21.8%)

  !UNK_TOKEN (54.9%)
  !done (18.8%)
  !throwOnError (3.3%)

  !_crawlCancellationReported (23.6%)
  !_crawlStopReported (23.3%)
  !_maxPagesToCrawlLimitReachedOrScheduled (18.9%)

  !_crawlStopReported (26.6%)
  !_crawlCancellationReported (26.5%)
  !_maxPagesToCrawlLimitReachedOrScheduled (25.8%)

  !_crawlStopReported (19.6%)
  !_maxPagesToCrawlLimitReachedOrScheduled (19.0%)
  !_crawlCancellationReported (15.7%)

  !_crawlStopReported (38.4%)
  !_crawlCancellationReported (31.8%)
  !_maxPagesToCrawlLimitReachedOrScheduled (27.0%)

  char character = originalName[i];
  if ( character == '<' ) {
      ++startTagCount;
      builder.Append( );
  } else if (startTagCount > 0) {
      if (character == '>' ) {
          --startTagCount;
      }
Sample snippet from StyleCop. The following suggestions were made:

  Seq→Seq: x == UNK_CHAR_LITERAL (5.9%)
           UNK_TOKEN == 0 (3.3%)
           UNK_TOKEN > 0 (2.7%)

  !i == 0 (5.1%)
  character < 0 (2.7%)
  character (2.2%)

  character == UNK_CHAR_LITERAL (70.8%)
  character == UNK_CHAR_LITERAL || character == UNK_CHAR_LITERAL (5.8%)
  character != UNK_CHAR_LITERAL (3.1%)

  character == character (9.9%)
  UNK_CHAR_LITERAL == character (8.2%)
  character == UNK_CHAR_LITERAL (8.2%)

  character == UNK_CHAR_LITERAL (43.4%)
  character || character (3.3%)
  character == UNK_CHAR_LITERAL == UNK_CHAR_LITERAL (3.0%)

  character == UNK_CHAR_LITERAL (39.6%)
  character || character == UNK_STRING_LITERAL (5.2%)
  character == UNK_STRING_LITERAL (2.8%)

  character == UNK_CHAR_LITERAL (75.5%)
  character == (2.6%)
  character != UNK_CHAR (2.5%)

  public void AllowAccess(string path) {
      if (path == null)
          throw new ArgumentNullException("path");
      if ( !path.StartsWith("~/") )
          throw new ArgumentException(
              string.Format(
                  "The path \"{0}\" is not application relative."
                  + " It must start with \"~/\".", path),
              "path");
      paths.Add(path);
  }

Sample snippet from cassette.
The following suggestions were made:

  Seq→Seq: UNK_TOKEN < 0 (14.6%)
           !UNK_TOKEN (7.5%)
           UNK_TOKEN == 0 (3.3%)

  path == UNK_STRING_LITERAL (18.1%)
  path <= 0 (5.6%)
  path == "" (4.8%)

  !UNK_TOKEN (48.0%)
  !discardNulls (6.3%)
  !first (2.7%)

  !path (67.4%)
  path && path (8.4%)
  !!path (5.5%)

  !path (91.5%)
  !path && !path (0.9%)
  !path.Contains(UNK_STRING_LITERAL) (0.7%)

  !path (89.6%)
  !path && !path (1.5%)
  !path.Contains(UNK_STRING_LITERAL) (0.5%)

  !path (42.9%)
  !path.StartsWith(UNK_STRING_LITERAL) (23.8%)
  !path.Contains(UNK_STRING_LITERAL) (5.9%)

  int methodParamCount = 0;
  IEnumerable<IParameterTypeInformation> moduleParameters =
      Enumerable.Empty<IParameterTypeInformation>();
  if (paramCount > 0) {
      IParameterTypeInformation[] moduleParameterArr =
          this.GetModuleParameterTypeInformations(Dummy.Signature, paramCount);
      methodParamCount = moduleParameterArr.Length;
      if (methodParamCount > 0)
          moduleParameters = IteratorHelper.GetReadonly(moduleParameterArr);
  }
  IEnumerable<IParameterTypeInformation> moduleVarargsParameters =
      Enumerable.Empty<IParameterTypeInformation>();
  if ( paramCount > methodParamCount ) {
      IParameterTypeInformation[] moduleParameterArr =
          this.GetModuleParameterTypeInformations(
              Dummy.Signature, paramCount - methodParamCount);
      if (moduleParameterArr.Length > 0)
          moduleVarargsParameters = IteratorHelper.GetReadonly(moduleParameterArr);
  }

Sample snippet from Afterthought.
The following suggestions were made:

  Seq→Seq: !UNK_TOKEN (10.9%)
           UNK_TOKEN == UNK_TOKEN (4.6%)
           UNK_TOKEN == UNK_STRING_LITERAL (3.3%)

  dummyPinned != 0 (2.2%)
  paramCount != 0 (2.1%)
  dummyPinned == 0 (1.5%)

  newValue > 0 (9.7%)
  zeroes > 0 (9.0%)
  paramCount > 0 (6.0%)

  methodParamCount == methodParamCount (3.4%)
  0 == methodParamCount (2.8%)
  methodParamCount == paramCount (2.8%)

  paramCount == 0 (12.7%)
  paramCount < 0 (11.5%)
  paramCount > 0 (8.0%)

  methodParamCount > 0 (10.9%)
  paramCount > 0 (7.9%)
  methodParamCount != 0 (5.6%)

  paramCount > methodParamCount (34.4%)
  paramCount == methodParamCount (11.4%)
  paramCount < methodParamCount (10.0%)

  public CodeLocation(int index, int endIndex, int indexOnLine,
      int endIndexOnLine, int lineNumber, int endLineNumber) {
      Param.RequireGreaterThanOrEqualToZero(index, "index");
      Param.RequireGreaterThanOrEqualTo(endIndex, index, "endIndex");
      Param.RequireGreaterThanOrEqualToZero(indexOnLine, "indexOnLine");
      Param.RequireGreaterThanOrEqualToZero(endIndexOnLine, "endIndexOnLine");
      Param.RequireGreaterThanZero(lineNumber, "lineNumber");
      Param.RequireGreaterThanOrEqualTo(endLineNumber, lineNumber,
          "endLineNumber");

      // If the entire segment is on the same line,
      // make sure the end index is greater or equal to the start index.
      if ( lineNumber == endLineNumber ) {
          Debug.Assert(endIndexOnLine >= indexOnLine,
              "The end index must be greater than the start index,"
              + " since they are both on the same line.");
      }

      this.startPoint = new CodePoint(index, indexOnLine, lineNumber);
      this.endPoint = new CodePoint(endIndex, endIndexOnLine, endLineNumber);
  }

Sample snippet from StyleCop.
The following suggestions were made:

  Seq→Seq: !UNK_TOKEN (14.0%)
           UNK_TOKEN == 0 (4.4%)
           UNK_TOKEN > 0 (3.5%)

  endIndex < 0 (3.8%)
  endIndex > 0 (3.4%)
  endIndex == 0 (2.2%)

  lineNumber < 0 (9.4%)
  lineNumber == 0 (7.4%)
  lineNumber <= 0 (5.1%)

  lineNumber == lineNumber (3.4%)
  0 == lineNumber (2.5%)
  lineNumber > lineNumber (2.5%)

  endLineNumber == 0 (9.6%)
  endLineNumber < 0 (7.9%)
  endLineNumber > 0 (6.1%)

  lineNumber > 0 (11.3%)
  lineNumber == 0 (7.3%)
  lineNumber != 0 (6.7%)

  lineNumber > endLineNumber (20.7%)
  lineNumber < endLineNumber (16.5%)
  lineNumber == endLineNumber (16.2%)

  public static Bitmap RotateImage(Image img, float angleDegrees,
      bool upsize, bool clip) {
      // Test for zero rotation and return a clone of the input image
      if (angleDegrees == 0f)
          return (Bitmap)img.Clone();

      // Set up old and new image dimensions, assuming upsizing not wanted
      // and clipping OK
      int oldWidth = img.Width;
      int oldHeight = img.Height;
      int newWidth = oldWidth;
      int newHeight = oldHeight;
      float scaleFactor = 1f;

      // If upsizing wanted or clipping not OK calculate the size of the
      // resulting bitmap
      if ( upsize || !clip ) {
          double angleRadians = angleDegrees * Math.PI / 180d;
          double cos = Math.Abs(Math.Cos(angleRadians));
          double sin = Math.Abs(Math.Sin(angleRadians));
          newWidth = (int)Math.Round((oldWidth * cos) + (oldHeight * sin));
          newHeight = (int)Math.Round((oldWidth * sin) + (oldHeight * cos));
      }

      // If upsizing not wanted and clipping not OK need a scaling factor
      if (!upsize && !clip) {
          scaleFactor = Math.Min((float)oldWidth / newWidth,
              (float)oldHeight / newHeight);
          newWidth = oldWidth;
          newHeight = oldHeight;
      }

Sample snippet from ShareX.
The following suggestions were made:

  Seq→Seq: UNK_TOKEN > 0 (8.3%)
           !UNK_TOKEN (4.4%)
           UNK_TOKEN == 0 (2.6%)

  newHeight > 0 (5.1%)
  clip > 0 (3.2%)
  oldWidth > 0 (2.9%)

  UNK_TOKEN && UNK_TOKEN (15.0%)
  UNK_TOKEN || UNK_TOKEN (13.6%)
  trustedForDelegation && !appOnly (12.1%)

  upsize && upsize (21.5%)
  upsize && clip (10.9%)
  clip && upsize (10.9%)

  upsize && clip (13.9%)
  upsize && !clip (9.8%)
  clip && clip (9.3%)

  upsize && !upsize (6.9%)
  clip && !upsize (6.3%)
  upsize || upsize (5.7%)

  upsize || clip (19.1%)
  upsize && clip (18.8%)
  upsize && !clip (12.2%)