Article details

【Title】 The quest for conditional independence in prospectivity modeling: weights-of-evidence, boost weights-of-evidence, and logistic regression
【Journal】 Frontiers of Earth Science
【Journal abbreviation】 Front. Earth Sci.
【ISSN】 2095-0195
【EISSN】 2095-0209
【DOI】 10.1007/s11707-016-0595-y
【Publisher】
【Publication year】 2016
【Volume/Issue】 Vol. 10, No. 3
【Pages】 389-408 (20 pages)
【Authors】 Helmut SCHAEBEN; Georg SEMMLER
【Keywords】 general weights of evidence|joint conditional independence|naïve Bayes model|Hammersley–Clifford theorem|interaction terms|statistical significance

【Abstract】

The objective of prospectivity modeling is prediction of the conditional probability of the presence T=1 or absence T=0 of a target T given favorable or prohibitive predictors B, or construction of a two classes {0,1} classification of T. A special case of logistic regression called weights-of-evidence (WofE) is geologists’ favorite method of prospectivity modeling due to its apparent simplicity. However, the numerical simplicity is deceiving as it is implied by the severe mathematical modeling assumption of joint conditional independence of all predictors given the target. General weights of evidence are explicitly introduced which are as simple to estimate as conventional weights, i.e., by counting, but do not require conditional independence. Complementary to the regression view is the classification view on prospectivity modeling. Boosting is the construction of a strong classifier from a set of weak classifiers. From the regression point of view it is closely related to logistic regression. Boost weights-of-evidence (BoostWofE) was introduced into prospectivity modeling to counterbalance violations of the assumption of conditional independence even though relaxation of modeling assumptions with respect to weak classifiers was not the (initial) purpose of boosting. In the original publication of BoostWofE a fabricated dataset was used to “validate” this approach. Using the same fabricated dataset it is shown that BoostWofE cannot generally compensate lacking conditional independence whatever the consecutively processing order of predictors. Thus the alleged features of BoostWofE are disproved by way of counterexamples, while theoretical findings are confirmed that logistic regression including interaction terms can exactly compensate violations of joint conditional independence if the predictors are indicators.
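
The abstract's central claim can be illustrated with a minimal sketch, which is not taken from the paper. It assumes two indicator predictors B1 and B2 and a small table of counts invented purely for illustration. Conventional weights of evidence are estimated by counting and combined under the conditional independence assumption, and the result is compared with a logistic model that includes the interaction term B1·B2. With two indicators this logistic model is saturated, so its coefficients can be read directly off the empirical cell logits instead of being fitted iteratively. All function and variable names here are hypothetical.

```python
from math import exp, log

# Hypothetical counts n[(b1, b2, t)], invented for illustration only.
# They are chosen so that B1 and B2 are NOT conditionally independent given T.
counts = {
    (0, 0, 0): 40, (0, 0, 1): 2,
    (0, 1, 0): 20, (0, 1, 1): 4,
    (1, 0, 0): 20, (1, 0, 1): 4,
    (1, 1, 0): 5,  (1, 1, 1): 15,
}


def logit(p):
    return log(p / (1.0 - p))


def sigmoid(x):
    return 1.0 / (1.0 + exp(-x))


def total(pred):
    """Sum the counts of all cells (b1, b2, t) that satisfy pred."""
    return sum(n for cell, n in counts.items() if pred(*cell))


def empirical(b1, b2):
    """Observed P(T=1 | B1=b1, B2=b2), obtained by counting."""
    n1 = counts[(b1, b2, 1)]
    n0 = counts[(b1, b2, 0)]
    return n1 / (n1 + n0)


def weight(i, b):
    """Conventional weight: log of P(B_i=b | T=1) / P(B_i=b | T=0)."""
    n_t1 = total(lambda x1, x2, t: t == 1)
    n_t0 = total(lambda x1, x2, t: t == 0)
    hit1 = total(lambda x1, x2, t: (x1, x2)[i] == b and t == 1)
    hit0 = total(lambda x1, x2, t: (x1, x2)[i] == b and t == 0)
    return log((hit1 / n_t1) / (hit0 / n_t0))


def wofe(b1, b2):
    """Posterior from prior logit plus summed weights (assumes cond. independence)."""
    prior = total(lambda x1, x2, t: t == 1) / total(lambda x1, x2, t: True)
    return sigmoid(logit(prior) + weight(0, b1) + weight(1, b2))


# Saturated logistic model: logit P = b0 + b1_*B1 + b2_*B2 + b12*B1*B2.
b0 = logit(empirical(0, 0))
b1_ = logit(empirical(1, 0)) - b0
b2_ = logit(empirical(0, 1)) - b0
b12 = logit(empirical(1, 1)) - b0 - b1_ - b2_


def logistic_with_interaction(b1, b2):
    return sigmoid(b0 + b1_ * b1 + b2_ * b2 + b12 * b1 * b2)


if __name__ == "__main__":
    for cell in [(0, 0), (0, 1), (1, 0), (1, 1)]:
        print(cell, empirical(*cell), wofe(*cell), logistic_with_interaction(*cell))
```

On these made-up counts the weights-of-evidence posterior departs from the observed conditional probability in the cell B1=1, B2=1, while the logistic model with the interaction term reproduces every cell. The sketch covers conventional weights only; the general weights of evidence and BoostWofE discussed in the abstract are not implemented here.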
