Logo for the University of Illinois at Chicago
    • Login
    View Item 
    •   INDIGO Home
    • Business Administration, College of
    • Information & Decision Sciences, Department of
    • Publications - Information & Decision Sciences
    • View Item
    •   INDIGO Home
    • Business Administration, College of
    • Information & Decision Sciences, Department of
    • Publications - Information & Decision Sciences
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Confidence in Predictions from Random Tree Ensembles

    Thumbnail
    View/Open
    Main Article (997.2Kb)
    Date
    2013-05
    Author
    Bhattacharyya, Siddhartha
    Publisher
    Springer Verlag
    Metadata
    Show full item record
    Abstract
    Obtaining an indication of confidence of predictions is desirable for many data mining applications. Predictions complemented with confidence levels can inform on the certainty or extent of reliability that may be associated with the prediction. This can be useful in varied application contexts where model outputs form the basis for potentially costly decisions, and in general across risk sensitive applications. The conformal prediction framework presents a novel approach for obtaining valid confidence measures associated with predictions from machine learning algorithms. Confidence levels are obtained from the underlying algorithm, using a non-conformity measure which indicates how 'atypical' a given example set is. The non-conformity measure is key to determining the usefulness and efficiency of the approach. This paper considers inductive conformal prediction in the context of random tree ensembles like random forests, which have been noted to perform favorably across problems. Focusing on classification tasks, and considering realistic data contexts including class imbalance, we develop non-conformity measures for assessing the confidence of predicted class labels from random forests. We examine the performance of these measures on multiple datasets. Results demonstrate the usefulness and validity of the measures, their relative differences, and highlight the effectiveness of conformal prediction random forests for obtaining predictions with associated confidence.
    Subject
    prediction confidence
    random forests
    classification
    Type
    Article
    Date available in INDIGO
    2014-01-09T20:42:02Z
    URI
    http://hdl.handle.net/10027/11056
    Collections
    • Publications - Information & Decision Sciences

    DSpace software copyright © 2002-2015  DuraSpace
    Contact Us | Send Feedback | Privacy Statement
    Theme by 
    Atmire NV

    Browse

    All of INDIGOCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    DSpace software copyright © 2002-2015  DuraSpace
    Contact Us | Send Feedback | Privacy Statement
    Theme by 
    Atmire NV