{"id":1018,"date":"2020-06-11T10:53:00","date_gmt":"2020-06-11T10:53:00","guid":{"rendered":"https:\/\/blog.uantwerpen.be\/business-economics\/?p=1018"},"modified":"2023-06-14T12:41:39","modified_gmt":"2023-06-14T12:41:39","slug":"predictive-models-on-big-data-mining-a-pool-of-evidence","status":"publish","type":"post","link":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/","title":{"rendered":"Explaining predictive models:  the Evidence Counterfactual"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"1018\" class=\"elementor elementor-1018\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-e905341 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"e905341\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-475c481\" data-id=\"475c481\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-92db627 elementor-widget elementor-widget-text-editor\" data-id=\"92db627\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Imagine being targeted with an advertisement for this blog. You\u2019d like to know: why did the AI model predict you\u2019d be interested in the Faculty of Business and Economics\u2019 blog, based on the hundreds of web pages you visited? The answer could be: because you visited <a href=\"http:\/\/www.great-business-faculties.com\">www.great-business-faculties.com<\/a>, <a href=\"http:\/\/www.datascience-in-business.org\">www.datascience-in-business.org<\/a> and <a href=\"http:\/\/www.living-in-antwerp.com\">www.living-in-antwerp.com<\/a>: if you would not have visited these pages, you\u2019d no longer be targeted with this specific ad. This explanation is an example of an imaginary \u201c<strong>Evidence Counterfactual\u201d. <\/strong><\/p><p>The use of such browsing data, as well as Facebook likes or location data is often used in targeted advertising systems, and beyond. The predictive models are increasingly complex, while end users become more vocal in their wish for <strong>transparent and explainable systems<\/strong>.<\/p><p>When being the subject of such predictive models, businesses and citizens might ask: why I am being rejected credit? Why I am being targeted with this ad? Etc. The European GDPR even provides the \u201cright to obtain meaningful information about the logic involved\u201d to data subjects who are involved in automated decision making.<\/p><p>In this post, you\u2019ll learn more about the <strong>Evidence Counterfactual, an increasingly important approach within the \u201cexplainable AI\u201d research domain, that helps understand the decisions of predictive systems that use Big Data.<\/strong> The Applied Data Mining research group has developed algorithms to provide such explanations, and validated them in a variety of business domains.\u00a0<\/p><h2><strong>Behavioral big data <\/strong><\/h2><p>More and more companies are tapping into a large pool of humanly-generated data, or &#8220;behavioral big data&#8221;. Think of a person liking Instagram posts, visiting different locations captured by their mobile GPS, searching Google, making online payments, connecting to people on LinkedIn, and so on. All these <strong>behavioral traces lead to artificial intelligent (AI) systems with very high predictive performance in a variety of application areas,<\/strong> ranging from finance to risk to marketing<sup>1<\/sup> .<\/p><p>The goal of these AI systems is to use this data to predict a variable of interest, for example, a person\u2019s personality traits, product interests, creditworthiness and so on. The model uses a large number of small pieces of evidence to make predictions. Let&#8217;s refer to all that data as the &#8220;<strong>evidence pool<\/strong>&#8220;. The pieces of evidence are either &#8220;<strong>present<\/strong>&#8221; or &#8220;<strong>missing<\/strong>\u201d. All pieces that are present can be used to make predictions.<sup>2<\/sup><\/p><h2><strong>Tourist or citizen?<\/strong><\/h2><p>To illustrate how behavioral big data can be seen as a &#8220;pool of evidence,&#8221; imagine a model that uses location data of people in New York City to predict if someone is a tourist or a citizen. Out of all possible places to go to (the &#8220;evidence pool&#8221;), a person will only visit a relatively small number of places each month.\u00a0<\/p><p>These are the pieces of evidence that are \u201cpresent\u201d and are represented by a value of 1 (see <strong>Figure 1<\/strong>). All places that are not visited by that person are &#8220;missing&#8221; and get a corresponding zero value in the data matrix.<\/p><p><img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/1.jpg\" alt=\"\" width=\"700\" height=\"740\" \/><\/p><p>In\u00a0<strong>Figure 1<\/strong>, for example, Anna visited 85 places out of the 50,000 possible places used by the model. She visited\u00a0<em>Times Square\u00a0<\/em>and<em>\u00a0Dumbo<\/em>, but she did not visit\u00a0<em>Columbia University<\/em>, making this a missing piece of evidence. The model decides she\u2019s a tourist.<\/p><h2><strong><br \/>Intuition behind the Evidence Counterfactual<\/strong><\/h2><p>Explaining how predictive systems make decisions based on big data is challenging. Evidence Counterfactuals helps understand the reasons behind individual model predictions. This explanation approach<sup>3<\/sup> identifies a <strong>causal relationship between two events: event A causes event B, only if we observe a difference in B after changing A, while keeping everything else constant.<\/strong><sup>4<\/sup><\/p><p>The Evidence Counterfactual shows a subset of evidence (event A) that causally drives the model&#8217;s decision (event B). We imagine two worlds, identical in every way up until the point where the evidence set is present in one world, but not in the other. <strong>The first world is the &#8220;factual&#8221; world, the unobserved world is the &#8220;counterfactual&#8221; world<\/strong>. To help clarify this, consider the following:<\/p><p style=\"padding-left: 40px;\"><strong>IF<\/strong>\u00a0Anna did\u00a0<strong>not<\/strong>\u00a0visit\u00a0<em>Times<\/em>\u00a0<em>Square<\/em>\u00a0and\u00a0<em>Dumbo<\/em>,\u00a0<strong>THEN<\/strong>\u00a0the model&#8217;s prediction changes from\u00a0<em>tourist<\/em>\u00a0to\u00a0<em>NY citizen.<\/em><\/p><p>The pieces of evidence {Times Square, Dumbo} are a subset of the evidence of Anna (all\u00a0the places she visited) and explain the model\u2019s decision. Simply removing Times Square or Dumbo from her visited locations would not change the predicted class. <strong>Both locations need to be \u201cremoved\u201d (feature value set to zero) to change the model\u2019s decision.<\/strong><\/p><p>The &#8220;factual world&#8221; is the one that&#8217;s observed and includes\u00a0<em>all<\/em>\u00a0the places Anna visited. The &#8220;counterfactual world&#8221; that results in a predicted class change is identical to the factual world in every way up until the two locations Times Square and Dumbo.<\/p><p>An important advantage of counterfactuals is that <strong>they do not require all features that are used in the model (the &#8220;evidence pool&#8221;) or all the evidence (e.g., all places Anna visited) to be part of the explanation<\/strong>. This is especially interesting in the context of humanly-generated big data: it allows us to explain predictions using concise and comprehensible explanations.<\/p><h2><strong>Computing Evidence Counterfactuals<\/strong><\/h2><p>The huge dimensionality of the behavioral data makes it infeasible to compute counterfactual explanations using a complete search algorithm (this search strategy would check all subsets of evidence as candidate explanations).<\/p><p>Alternatively, a<strong> heuristic search algorithm can be used to efficiently find counterfactuals<\/strong>. One existing approach is based on a <strong>best-first search <\/strong>and makes use of the model\u2019s scoring function to first consider subsets of evidence that, when removed, reduce the predicted score the most in the direction of the opposite predicted class.<\/p><p>There are at least <strong>two weaknesses<\/strong> of this strategy:<\/p><p><img decoding=\"async\" src=\"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/markus-spiske-Skf7HxARcoc-unsplash-1024x683.jpg\" alt=\"\" width=\"525\" height=\"350\" \/><\/p><p style=\"padding-left: 40px;\">1) for some nonlinear models, <strong>removing one feature does not result in a predicted score change<\/strong>, which results in the search algorithm picking a random feature to expand in the first iteration;<\/p><p style=\"padding-left: 40px;\">2) the search time is very sensitive to the size of the counterfactual explanation: the more evidence that needs to be removed, the <strong>longer it takes the algorithm to find the explanation.<\/strong><\/p><p>As an alternative to the best-first search, we proposed\u00a0a <strong>search strategy that chooses features according to their overall importance<\/strong> for the predicted score.<sup>5<\/sup> The idea is that the more accurate the importance rankings are, the more likely it is to find a counterfactual explanation starting from removing the top-ranked feature up until a counterfactual explanation is found. <strong>The hybrid algorithm LIME-Counterfactual (LIME-C) seems a favorable alternative<\/strong> to the best-first search because of its good overall effectiveness and efficiency.<\/p><h2><strong>Other data and models<\/strong><\/h2><p>Evidence Counterfactuals can address <strong>various data types, from textual data and tabular data (e.g. standard Excel files) to image data<\/strong>. The issue is to define what it means for evidence to be &#8220;present&#8221; or &#8220;missing.&#8221; To compute counterfactuals, we need to define the notion of &#8220;removing evidence&#8221; or setting evidence to &#8220;missing.&#8221; In this post, we focused on behavioral big data. For these data, which is very sparse (a lot of zero values in the data matrix), it makes sense to represent evidence that&#8217;s \u201cpresent\u201d by those features (e.g., word or behavior) having a nonzero value.<sup>\u00a0<\/sup><\/p><h2><strong>Key takeaways<\/strong><\/h2><ul><li>Predictive systems\u00a0that are trained from\u00a0humanly-generated Big Data have <strong>high predictive performance, however,\u00a0explaining\u00a0them is challenging<\/strong>.<\/li><li>Explaining data-driven decisions is <strong>important for a variety of reasons<\/strong> (increase trust and acceptance, improve models, gain insights, etc.), <strong>and for many stakeholders<\/strong> (data scientists, managers, decision subjects, etc.).<\/li><li>The <strong>Evidence Counterfactual is an explanation approach that can be applied across many relevant applications<\/strong> and highlights a key subset of evidence that led to\u00a0a particular model decision.<\/li><\/ul><p><strong>By\u00a0<\/strong><a href=\"https:\/\/www.uantwerpen.be\/nl\/personeel\/yanou-ramon\/\"><strong>Yanou Ramon<\/strong><\/a><strong> and Prof. <a href=\"https:\/\/www.uantwerpen.be\/nl\/personeel\/david-martens\/\">David Martens<\/a><\/strong><strong>, <a href=\"https:\/\/www.uantwerpen.be\/en\/research-groups\/applied-data-mining\/\">Applied Data Mining Research Group, University of Antwerp<\/a><\/strong>.<\/p><ol><li>Junqu\u00e9 de Fortuny, E., Martens, D., Provost, F., Predictive Modeling with Big Data: Is Bigger Really Better?, Big Data, 1(4), pp215-226, 2013<\/li><li>Provost, F., Understanding decisions driven by big data: from analytics management to privacy-friendly cloaking devices, Keynote Lecture, Strate Europe, https:\/\/learning.oreilly.com\/library\/view\/stratahadoop\/9781491917381\/video203329.html (2014)<\/li><li>Martens, D., Provost, F., Explaining data-driven document classifications, MIS Quarterly, 38(1), pp73-99 (2014)<\/li><li><a href=\"https:\/\/causalinference.gitlab.io\/causal-reasoning-book-chapter1\/\" target=\"_blank\" rel=\"noopener\">https:\/\/causalinference.gitlab.io\/causal-reasoning-book-chapter1\/<\/a><\/li><li>Ramon, Y., Martens, D., Provost, F., Evgeniou, T., Counterfactual Explanation Algorithms for Behavioral and Textual Data, arXiv:1912.01819 (2019). Available\u00a0<a href=\"https:\/\/arxiv.org\/abs\/1912.01819\" target=\"_blank\" rel=\"noopener\">online<\/a><\/li><\/ol>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Big Data generated by people provide enormous prediction value for Artificial Intelligent systems. However, explaining how these models use the data to make predictions is quite challenging. This Evidence Counterfactual explanation approach considers how a model would behave if it didn&#8217;t have the original set of data to work with.<\/p>\n","protected":false},"author":26,"featured_media":1021,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[1],"tags":[53,59,35],"coauthors":[60,61],"class_list":["post-1018","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-no-category","tag-tag-academic-news","tag-bigdata","tag-research"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Explaining predictive models: the Evidence Counterfactual - Business and Economics<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Explaining predictive models: the Evidence Counterfactual - Business and Economics\" \/>\n<meta property=\"og:description\" content=\"Big Data generated by people provide enormous prediction value for Artificial Intelligent systems. However, explaining how these models use the data to make predictions is quite challenging. This Evidence Counterfactual explanation approach considers how a model would behave if it didn&#039;t have the original set of data to work with.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/\" \/>\n<meta property=\"og:site_name\" content=\"Business and Economics\" \/>\n<meta property=\"article:published_time\" content=\"2020-06-11T10:53:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-06-14T12:41:39+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1340\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Yanou Ramon, David Martens\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Yanou Ramon, David Martens\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/\"},\"author\":{\"name\":\"Yanou Ramon\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#\\\/schema\\\/person\\\/2d5507d91c6779c198c5b72e9206d27a\"},\"headline\":\"Explaining predictive models: the Evidence Counterfactual\",\"datePublished\":\"2020-06-11T10:53:00+00:00\",\"dateModified\":\"2023-06-14T12:41:39+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/\"},\"wordCount\":1396,\"publisher\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/wp-content\\\/uploads\\\/2020\\\/06\\\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg\",\"keywords\":[\"Academic news\",\"bigdata\",\"research\"],\"articleSection\":[\"No category\"],\"inLanguage\":\"en-GB\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/\",\"url\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/\",\"name\":\"Explaining predictive models: the Evidence Counterfactual - Business and Economics\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/wp-content\\\/uploads\\\/2020\\\/06\\\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg\",\"datePublished\":\"2020-06-11T10:53:00+00:00\",\"dateModified\":\"2023-06-14T12:41:39+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/#primaryimage\",\"url\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/wp-content\\\/uploads\\\/2020\\\/06\\\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg\",\"contentUrl\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/wp-content\\\/uploads\\\/2020\\\/06\\\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg\",\"width\":2560,\"height\":1340,\"caption\":\"data better idea\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/predictive-models-on-big-data-mining-a-pool-of-evidence\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Explaining predictive models: the Evidence Counterfactual\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#website\",\"url\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/\",\"name\":\"Business and Economics\",\"description\":\"Business and Economics\",\"publisher\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#organization\",\"name\":\"Business and Economics\",\"url\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/wp-content\\\/uploads\\\/2021\\\/02\\\/cropped-cropped-UA-eng-hor-1-WHITE.png\",\"contentUrl\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/wp-content\\\/uploads\\\/2021\\\/02\\\/cropped-cropped-UA-eng-hor-1-WHITE.png\",\"width\":846,\"height\":250,\"caption\":\"Business and Economics\"},\"image\":{\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/#\\\/schema\\\/person\\\/2d5507d91c6779c198c5b72e9206d27a\",\"name\":\"Yanou Ramon\",\"url\":\"https:\\\/\\\/blog.uantwerpen.be\\\/business-economics\\\/author\\\/yanou-ramon\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Explaining predictive models: the Evidence Counterfactual - Business and Economics","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/","og_locale":"en_GB","og_type":"article","og_title":"Explaining predictive models: the Evidence Counterfactual - Business and Economics","og_description":"Big Data generated by people provide enormous prediction value for Artificial Intelligent systems. However, explaining how these models use the data to make predictions is quite challenging. This Evidence Counterfactual explanation approach considers how a model would behave if it didn't have the original set of data to work with.","og_url":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/","og_site_name":"Business and Economics","article_published_time":"2020-06-11T10:53:00+00:00","article_modified_time":"2023-06-14T12:41:39+00:00","og_image":[{"width":2560,"height":1340,"url":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg","type":"image\/jpeg"}],"author":"Yanou Ramon, David Martens","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Yanou Ramon, David Martens","Estimated reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/#article","isPartOf":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/"},"author":{"name":"Yanou Ramon","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#\/schema\/person\/2d5507d91c6779c198c5b72e9206d27a"},"headline":"Explaining predictive models: the Evidence Counterfactual","datePublished":"2020-06-11T10:53:00+00:00","dateModified":"2023-06-14T12:41:39+00:00","mainEntityOfPage":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/"},"wordCount":1396,"publisher":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#organization"},"image":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/#primaryimage"},"thumbnailUrl":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg","keywords":["Academic news","bigdata","research"],"articleSection":["No category"],"inLanguage":"en-GB"},{"@type":"WebPage","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/","url":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/","name":"Explaining predictive models: the Evidence Counterfactual - Business and Economics","isPartOf":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#website"},"primaryImageOfPage":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/#primaryimage"},"image":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/#primaryimage"},"thumbnailUrl":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg","datePublished":"2020-06-11T10:53:00+00:00","dateModified":"2023-06-14T12:41:39+00:00","breadcrumb":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/#primaryimage","url":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg","contentUrl":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2020\/06\/franki-chamaki-1K6IQsQbizI-unsplash-scaled.jpg","width":2560,"height":1340,"caption":"data better idea"},{"@type":"BreadcrumbList","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/predictive-models-on-big-data-mining-a-pool-of-evidence\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/blog.uantwerpen.be\/business-economics\/"},{"@type":"ListItem","position":2,"name":"Explaining predictive models: the Evidence Counterfactual"}]},{"@type":"WebSite","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#website","url":"https:\/\/blog.uantwerpen.be\/business-economics\/","name":"Business and Economics","description":"Business and Economics","publisher":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/blog.uantwerpen.be\/business-economics\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Organization","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#organization","name":"Business and Economics","url":"https:\/\/blog.uantwerpen.be\/business-economics\/","logo":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#\/schema\/logo\/image\/","url":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2021\/02\/cropped-cropped-UA-eng-hor-1-WHITE.png","contentUrl":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-content\/uploads\/2021\/02\/cropped-cropped-UA-eng-hor-1-WHITE.png","width":846,"height":250,"caption":"Business and Economics"},"image":{"@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/blog.uantwerpen.be\/business-economics\/#\/schema\/person\/2d5507d91c6779c198c5b72e9206d27a","name":"Yanou Ramon","url":"https:\/\/blog.uantwerpen.be\/business-economics\/author\/yanou-ramon\/"}]}},"_links":{"self":[{"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/posts\/1018","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/users\/26"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/comments?post=1018"}],"version-history":[{"count":5,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/posts\/1018\/revisions"}],"predecessor-version":[{"id":4092,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/posts\/1018\/revisions\/4092"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/media\/1021"}],"wp:attachment":[{"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/media?parent=1018"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/categories?post=1018"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/tags?post=1018"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/blog.uantwerpen.be\/business-economics\/wp-json\/wp\/v2\/coauthors?post=1018"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}