Show simple item record

dc.contributor.authorPalczewska, Anna Maria*
dc.contributor.authorPalczewski, J.*
dc.contributor.authorMarchese-Robinson, R.M.*
dc.contributor.authorNeagu, Daniel*
dc.date.accessioned2016-12-09T15:41:39Z
dc.date.available2016-12-09T15:41:39Z
dc.date.issued2014-02-18
dc.identifier.citationPalczewska A, Palczewski J, Marchese-Robinson RM et al (2014) Interpreting Random Forest Classification Models Using a Feature Contribution Method. In: Bouabana-Tebibel T and Rubin SH (Eds.) Integration of Reusable Systems. Volume 263 of the series Advances in Intelligent Systems and Computing: 193-218.en_US
dc.identifier.urihttp://hdl.handle.net/10454/10917
dc.descriptionNoen_US
dc.description.abstractModel interpretation is one of the key aspects of the model evaluation process. The explanation of the relationship between model variables and outputs is relatively easy for statistical models, such as linear regressions, thanks to the availability of model parameters and their statistical significance . For “black box” models, such as random forest, this information is hidden inside the model structure. This work presents an approach for computing feature contributions for random forest classification models. It allows for the determination of the influence of each variable on the model prediction for an individual instance. By analysing feature contributions for a training dataset, the most significant variables can be determined and their typical contribution towards predictions made for individual classes, i.e., class-specific feature contribution “patterns”, are discovered. These patterns represent a standard behaviour of the model and allow for an additional assessment of the model reliability for new data. Interpretation of feature contributions for two UCI benchmark datasets shows the potential of the proposed methodology. The robustness of results is demonstrated through an extensive analysis of feature contributions calculated for a large number of generated random forest models.en_US
dc.language.isoenen_US
dc.relation.isreferencedbyhttp://dx.doi.org/10.1007/978-3-319-04717-1_9en_US
dc.subjectRandom forest; Classification; Variable importance; Feature contribution; Cluster analysisen_US
dc.titleInterpreting Random Forest Classification Models Using a Feature Contribution Methoden_US
dc.status.refereedYesen_US
dc.typeBook chapteren_US
dc.type.versionNo full-text in the repositoryen_US


Item file(s)

Thumbnail
Name:
palczewska_et_al_2014.pdf
Size:
437.1Kb
Format:
PDF
Description:
Keep suppressed - no permission ...

This item appears in the following Collection(s)

Show simple item record