# Meka: A Multi-label/Multi-target Extension to Weka

*Journal of Machine Learning Research 17 (2016) 1–5. Submitted 5/12; Revised 6/15; Published 2/16.*

**Jesse Read** (jesse.read@aalto.fi), Helsinki Institute for Information Technology (HIIT), and Aalto University, Dept. of Computer Science, Espoo, Finland

**Peter Reutemann** (fracpete@waikato.ac.nz), **Bernhard Pfahringer** (bernhard@cs.waikato.ac.nz), **Geoff Holmes** (geoff@cs.waikato.ac.nz), Department of Computer Science, University of Waikato, New Zealand

**Editor:** Balázs Kégl

## Abstract

Multi-label classification has rapidly attracted interest in the machine learning literature, and there are now a large number and considerable variety of methods for this type of learning. We present Meka: an open-source Java framework based on the well-known Weka library. Meka provides interfaces to facilitate practical application, and a wealth of multi-label classifiers, evaluation metrics, and tools for multi-label experiments and development. It supports multi-label and multi-target data, including in incremental and semi-supervised contexts.

**Keywords:** classification, learning, multi-label, multi-target, incremental

## 1. Introduction

In multi-label learning a data instance may be associated with multiple binary class labels. This is in contrast to the traditional task of single-label (i.e., multi-class or binary) classification, where each instance is associated with only a single class label. The multi-label context is receiving a lot of attention and is relevant to a wide variety of domains, including text, music, images and video, and bioinformatics; see the review by Tsoumakas et al. (2010). The multi-label learning problem is in fact a special case of multi-target learning (also known as multi-dimensional or multi-objective learning), where each label can take multiple values, as opposed to binary labels indicating relevance or not (Bielza et al., 2011; Appice and Džeroski, 2007).
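To make the distinction concrete, the two label representations can be sketched as follows (a toy illustration only; the class and method names are hypothetical and not part of Meka):

```java
import java.util.Arrays;
import java.util.List;

public class MultiLabelExample {
    // Encode a label subset as a 0/1 relevance vector over the full label set.
    static int[] toRelevanceVector(List<String> labelSet, List<String> relevant) {
        int[] y = new int[labelSet.size()];
        for (int j = 0; j < y.length; j++)
            y[j] = relevant.contains(labelSet.get(j)) ? 1 : 0;
        return y;
    }

    public static void main(String[] args) {
        // Multi-label: an instance is associated with a subset of the labels,
        // e.g. a song tagged {rock, happy} out of {rock, jazz, happy}:
        List<String> labels = List.of("rock", "jazz", "happy");
        int[] y = toRelevanceVector(labels, List.of("rock", "happy"));
        System.out.println(Arrays.toString(y)); // [1, 0, 1]

        // Multi-target generalises this: each target takes one of several values,
        // e.g. targets (genre, decade) with values ("rock", "1990s") rather than 0/1 flags.
    }
}
```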
The context of multiple target variables has important implications for classification (how to model dependencies between variables) and evaluation (how to score multiple target classifications for each instance) that traditional single-label frameworks do not deal with. Meka has been designed specifically for this context.

## 2. The Meka Framework

Meka was created to perform and evaluate multi-label classification using the popular and effective family of problem transformation methods, which make use of existing off-the-shelf single-label (binary or multi-class) methods as base classifiers. The Weka framework (Hall et al., 2009) contains a plethora of well-documented single-label classifiers, used by the machine learning community for many years, that can serve in this role. Meka contains all the basic problem transformation methods reviewed by Tsoumakas et al. (2010); advanced methods, including many of those surveyed by Madjarov et al. (2012) and Zhang and Zhou (2013); many varieties of classifier chains (Read et al., 2011), which have often been used as a benchmark in the recent multi-label literature; and also algorithm adaptations such as multi-label neural networks and deep neural networks (Hinton and Salakhutdinov, 2006). It includes two strategies for automatic threshold calibration, and a variety of evaluation metrics from the literature.

Figure 1: Meka's GUI interface, having loaded and set up the Music data set (left) and selecting and configuring a classifier (right).

Meka is easily used from either the command line interface (CLI) or graphical user interface (GUI). Thus no programming is required to parameterize, run, and evaluate classifiers, making it suitable for practitioners unfamiliar with Java. However, it is also straightforward to extend Meka with new classifiers and to integrate it in other frameworks.
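The problem transformation idea is simple enough to sketch in plain Java (a toy illustration, not Meka's actual code): binary relevance, the most basic transformation, splits an L-label problem into L independent binary problems, each handled by an off-the-shelf single-label learner. Here a trivial majority-class stub stands in for a real Weka base classifier:

```java
import java.util.ArrayList;
import java.util.List;

public class BinaryRelevanceSketch {
    // Stand-in for a single-label base classifier (e.g. a Weka learner):
    // it simply predicts the majority class seen for its label during training.
    static class MajorityStub {
        boolean prediction;
        void train(boolean[] labelColumn) {
            int ones = 0;
            for (boolean b : labelColumn) if (b) ones++;
            prediction = ones * 2 >= labelColumn.length;
        }
        boolean predict(double[] x) { return prediction; } // ignores the features
    }

    final List<MajorityStub> perLabel = new ArrayList<>();

    // Y[i][j] = relevance of label j for training instance i.
    void train(double[][] X, boolean[][] Y) {
        int L = Y[0].length;
        for (int j = 0; j < L; j++) {
            boolean[] column = new boolean[Y.length];
            for (int i = 0; i < Y.length; i++) column[i] = Y[i][j];
            MajorityStub m = new MajorityStub();
            m.train(column);               // one independent binary problem per label
            perLabel.add(m);
        }
    }

    boolean[] predict(double[] x) {
        boolean[] y = new boolean[perLabel.size()];
        for (int j = 0; j < y.length; j++) y[j] = perLabel.get(j).predict(x);
        return y;
    }

    public static void main(String[] args) {
        double[][] X = {{0.1}, {0.9}, {0.5}};
        boolean[][] Y = {{true, false}, {true, true}, {true, false}};
        BinaryRelevanceSketch br = new BinaryRelevanceSketch();
        br.train(X, Y);
        // Label 0 is relevant in 3/3 training instances, label 1 in 1/3:
        System.out.println(java.util.Arrays.toString(br.predict(new double[]{0.3})));
    }
}
```

Meka's classifier chains extend this scheme by feeding earlier label predictions into the later binary models, which is how dependencies between labels are captured.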
Those familiar with Weka will have almost no learning curve (much of Weka's documentation and modus operandi is directly applicable). Any new Meka classifier can be combined with any of Meka's existing ensemble schemes and any Weka base classifier without writing extra code, and be compared easily with benchmark and state-of-the-art methods. Meka also offers support for semi-supervised and incremental classification in the multi-label context (Read et al., 2015), as well as the general multi-target context considered by Appice and Džeroski (2007), making it a good platform for working in these areas.

### 2.1 Using Meka

Meka can be run on any machine with Java installed (version 1.7 or above). Everything needed to run Meka is included in the distribution package, and detailed instructions and examples of how to set up data sets and run experiments are included in the tutorial. Figure 1 shows screen captures of Meka's GUI.

Meka uses Weka's thoroughly-documented ARFF file format,[1] using multiple binary attributes to specify the label relevances (or nominal attributes in the multi-target case). The target attributes can easily be configured using the GUI, or directly from within the ARFF file (optionally using Weka's filters).

Meka's CLI follows the style of Weka. For example, to run five-fold cross-validation of an ensemble of 50 chain classifiers (Read et al., 2011) on the Music data set, with support vector machines as the base classifier (as also shown on the right of Figure 1):

```
java meka.classifiers.multilabel.meta.BaggingML -x 5 -t data/Music.arff -I 50 \
    -W meka.classifiers.multilabel.CC -- -W weka.classifiers.functions.SMO
```

A list of methods implemented in Meka, along with examples, is given at http://meka.sourceforge.net/methods.html. Further examples and documentation can be found in the tutorial and API reference, where relevant publications are also mentioned.
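For illustration, a minimal multi-label ARFF file might look as follows (a hypothetical toy data set; by Meka's convention the number of target attributes is declared with a `-C` option embedded in the `@relation` name, and the targets are listed first):

```
@relation 'toy: -C 3'

@attribute rock {0,1}
@attribute jazz {0,1}
@attribute happy {0,1}
@attribute tempo numeric
@attribute loudness numeric

@data
1,0,1,120,0.8
0,1,0,90,0.3
1,1,0,140,0.6
```

With this header, loading the file in the GUI (or passing it via `-t` on the CLI) treats the first three attributes as the labels and the rest as input features.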
Meka also includes a wrapper for Mulan (the `MULAN` class), allowing additional classifiers to be run from Meka's interfaces. Meka includes over a dozen evaluation metrics, including Hamming loss, 0/1 loss, the Jaccard index, rank loss, log loss, and the F1-measure (macro and micro averages). Several metrics can be output both overall and per label.

### 2.2 Extending Meka

Meka can be extended by writing classifiers that implement the `MultiLabelClassifier` interface. A multi-label classifier uses the same methods as a Weka classifier, namely `buildClassifier(Instances)` and `distributionForInstance(Instance)`. The difference in Meka is that `classIndex()` indicates the number of target attributes. As in Weka, the `distributionForInstance` method returns a vector, which may contain votes or posterior probabilities. In the latter case, a threshold (or thresholds) can be calibrated automatically, or defined ad hoc according to user preference, to produce 0/1 label relevances. The tutorial deals with the general multi-target case.

Incremental classifiers simply implement `IncrementalMultiLabelClassifier` and thus, as with Weka's `UpdateableClassifier` interface, have an `updateClassifier(Instance)` method. Meka automatically carries out incremental learning and evaluation for any updateable classifier. Semi-supervised classifiers implement the `SemisupervisedClassifier` interface and must implement the method `setUnlabelledData(Instances)`, which is called by the evaluation routine.

## 3. Related Frameworks

Mulan (Tsoumakas et al., 2011) is a programmatic API, also based on Weka, providing a number of multi-label algorithms implemented from the literature. There is some overlap in terms of the classifiers available, but the usage and performance is different.
Mulan is an API, whereas Meka provides both graphical and command-line interfaces; this allows novel classifier and ensemble combinations, and parameterizations thereof, to be trialled rapidly without writing any code, allowing much flexibility when tackling a multi-label problem. A single command line is often sufficient to reproduce the results of publications. Both frameworks have attracted considerable interest and both are mentioned in the literature.

[1] See, for example, http://weka.wikispaces.com/ARFF.

|                          | Clus                 | Mulan       | Meka           |
|--------------------------|----------------------|-------------|----------------|
| interface                | config. file         | API         | CLI, GUI       |
| multi-target             | ✓                    | regression  | classification |
| hierarchical             | ✓                    | ✓           |                |
| incremental              |                      |             | ✓              |
| semi-supervised          |                      |             | ✓              |
| num. of classifiers      | 2+                   | 20+         | 20+            |
| classification paradigms | decision trees/rules | Weka+others | Weka+others    |

Table 1: Comparing Meka with related frameworks.

| Data set  | Meka | Mulan |
|-----------|------|-------|
| Enron     | 8    | 33    |
| Corel-5k  | 10   | 35    |
| MediaMill | 449  | 2104  |

Table 2: Running time (s) of RAkEL under Meka and Mulan, with SMO, k = 3, m = 10.

Meka specializes in the problem transformation methods, in particular the classifier chains paradigm, implementing at least 10 variations, along with meta methods for combining them together in many ways.

Clus is a decision tree and rule learning system (Appice and Džeroski, 2007) that can also carry out multi-label and multi-target classification. Classifiers are configured in a text file. Clus's implementations of decision tree and rule algorithms are scalable and competitive, but it does not include many of the popular problem transformation methods.

Table 1 roughly compares the frameworks in terms of features. Clus has fewer classification paradigms, but can be used in many different contexts. Mulan has a similar number of classifiers to Meka, but the two offer different methods. Meka is noticeably faster than Mulan in some implementations under the same configurations, for example RAkEL; see Table 2. Meka itself integrates well in other frameworks.
For example, it is already part of the Moa (data streams), DKPro (text classification), and ADAMS (workflow engine) frameworks, and is available via Maven Central.[2]

## 4. Summary and Further Notes

Meka provides a multi-label and multi-target framework with comfortable command-line and graphical interfaces. It includes traditional, benchmark, and state-of-the-art algorithms, along with a multitude of multi-label-specific evaluation measures. It also includes methods for incremental, multi-target, and semi-supervised learning, and is easy to use and extend. Meka is released under the GNU GPL 3 licence and can run on any machine with Java (1.7 or above). The software, with source code, API reference, data sets, a list of methods with examples, and a tutorial, can be found at http://meka.sourceforge.net, along with links to additional material about learning from multi-label and multi-target data.

[2] See http://moa.cms.waikato.ac.nz/, https://code.google.com/p/dkpro-tc/, https://adams.cms.waikato.ac.nz/, and https://search.maven.org, respectively.

## References

Annalisa Appice and Sašo Džeroski. Stepwise induction of multi-target model trees. In ECML '07: Proceedings of the 18th European Conference on Machine Learning, pages 502–509, Berlin, Heidelberg, 2007. Springer-Verlag. ISBN 978-3-540-74957-8. doi: 10.1007/978-3-540-74958-5_46.

Concha Bielza, Guangdi Li, and Pedro Larrañaga. Multi-dimensional classification with Bayesian networks. International Journal of Approximate Reasoning, 52:705–727, 2011.

Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. The WEKA data mining software: An update. SIGKDD Explorations, 11(1), 2009.

Geoffrey Hinton and Ruslan Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504–507, 2006.

Gjorgji Madjarov, Dragi Kocev, Dejan Gjorgjevikj, and Sašo Džeroski.
An extensive experimental comparison of methods for multi-label learning. Pattern Recognition, 45(9):3084–3104, 2012. ISSN 0031-3203. doi: 10.1016/j.patcog.2012.03.004.

Jesse Read, Bernhard Pfahringer, Geoffrey Holmes, and Eibe Frank. Classifier chains for multi-label classification. Machine Learning, 85(3):333–359, 2011.

Jesse Read, Albert Bifet, and Fernando Pérez-Cruz. Deep learning in partially-labelled data streams. In SAC 2015: 30th ACM Symposium on Applied Computing. ACM, 2015.

Grigorios Tsoumakas, Ioannis Katakis, and Ioannis P. Vlahavas. Mining multi-label data. In O. Maimon and L. Rokach, editors, Data Mining and Knowledge Discovery Handbook, 2nd edition. Springer, 2010.

Grigorios Tsoumakas, Jozef Vilcek, Eleftherios Spyromitros-Xioufis, and Ioannis Vlahavas. MULAN: A Java library for multi-label learning. Journal of Machine Learning Research, 12:2411–2414, 2011.

Min-Ling Zhang and Zhi-Hua Zhou. A review on multi-label learning algorithms. IEEE Transactions on Knowledge and Data Engineering, 99(PrePrints):1, 2013. ISSN 1041-4347. doi: 10.1109/TKDE.2013.39.