HOME

TheInfoList



OR:

Uplift modelling, also known as incremental modelling, true lift modelling, or net modelling is a
predictive modelling Predictive modelling uses statistics to predict outcomes. Most often the event one wants to predict is in the future, but predictive modelling can be applied to any type of unknown event, regardless of when it occurred. For example, predictive mod ...
technique that directly models the incremental impact of a treatment (such as a direct marketing action) on an individual's behaviour. Uplift modelling has applications in
customer relationship management Customer relationship management (CRM) is a process in which a business or other organization administers its interactions with customers, typically using data analysis to study large amounts of information. CRM systems compile data from a ra ...
for up-sell, cross-sell and retention modelling. It has also been applied to political election and personalised medicine. Unlike the related Differential Prediction concept in psychology, Uplift Modelling assumes an active agent.


Introduction

Uplift modelling uses a randomised
scientific control A scientific control is an experiment or observation designed to minimize the effects of variables other than the independent variable (i.e. confounding variables). This increases the reliability of the results, often through a comparison betwe ...
to not only measure the effectiveness of an action but also to build a predictive model that predicts the incremental response to the action. The response could be a binary variable (for example, a website visit) or a continuous variable (for example, customer revenue). Uplift modelling is a data mining technique that has been applied predominantly in the financial services, telecommunications and retail direct marketing industries to up-sell,
cross-sell Cross-selling is a sales technique involving the selling of an additional product or service to an existing customer. In practice, businesses define cross-selling in many different ways. Elements that might influence the definition might include ...
,
churn Churn may refer to: * Churn drill, large-diameter drilling machine large holes appropriate for holes in the ground Dairy-product terms * Butter churn, device for churning butter * Churning (butter), the process of creating butter out of mil ...
and retention activities.


Measuring uplift

The uplift of a marketing campaign is usually defined as the difference in response rate between a ''treated'' group and a randomized ''control'' group. This allows a marketing team to isolate the effect of a marketing action and measure the effectiveness or otherwise of that individual marketing action. Honest marketing teams will only take credit for the incremental effect of their campaign. However, many marketers define lift (rather than uplift) as the difference in response rate between treatment and control, so uplift modeling can be defined as improving (upping) lift through predictive modeling. The table below shows the details of a campaign showing the number of responses and calculated response rate for a hypothetical marketing campaign. This campaign would be defined as having a response rate uplift of 5%. It has created 50,000 incremental responses (100,000 - 50,000).


Traditional response modelling

Traditional response modelling typically takes a group of ''treated'' customers and attempts to build a predictive model that separates the likely responders from the non-responders through the use of one of a number of
predictive modelling Predictive modelling uses statistics to predict outcomes. Most often the event one wants to predict is in the future, but predictive modelling can be applied to any type of unknown event, regardless of when it occurred. For example, predictive mod ...
techniques. Typically this would use
decision trees A decision tree is a decision support tool that uses a tree-like model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. It is one way to display an algorithm that only contains condit ...
or
regression analysis In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one ...
. This model would only use the treated customers to build the model. In contrast uplift modeling uses both the treated and control customers to build a predictive model that focuses on the incremental response. To understand this type of model it is proposed that there is a fundamental segmentation that separates customers into the following groups (their names were suggested by N. Radcliffe and explained in ) * ''The Persuadables'' : customers who only respond to the marketing action because they were targeted * ''The Sure Things'' : customers who would have responded whether they were targeted or not * ''The Lost Causes'' : customers who will not respond irrespective of whether or not they are targeted * ''The Do Not Disturbs or Sleeping Dogs'' : customers who are less likely to respond because they were targeted The only segment that provides true incremental responses is the ''Persuadables''. Uplift modelling provides a scoring technique that can separate customers into the groups described above. Traditional response modelling often targets the ''Sure Things'' being unable to distinguish them from the ''Persuadables''.


Return on investment

Because uplift modelling focuses on incremental responses only, it provides very strong return on investment cases when applied to traditional demand generation and retention activities. For example, by only targeting the persuadable customers in an outbound marketing campaign, the contact costs and hence the return per unit spend can be dramatically improved.


Removal of negative effects

One of the most effective uses of uplift modelling is in the removal of negative effects from retention campaigns. Both in the telecommunications and financial services industries often retention campaigns can trigger customers to cancel a contract or policy. Uplift modelling allows these customers, the Do Not Disturbs, to be removed from the campaign.


Application to A/B and multivariate testing

It is rarely the case that there is a single treatment and control group. Often the "treatment" can be a variety of simple variations of a message or a multi-stage contact strategy that is classed as a single treatment. In the case of A/B or multivariate testing, uplift modelling can help in understanding whether the variations in tests provide any significant uplift compared to other targeting criteria such as behavioural or demographic indicators.


History of uplift modelling

The first appearance of ''true response modelling'' appears to be in the work of Radcliffe and Surry. Victor Lo also published on this topic in ''The True Lift Model'' (2002), and later Radcliffe again with ''Using Control Groups to Target on Predicted Lift: Building and Assessing Uplift Models'' (2007). Radcliffe also provides a very useful frequently asked questions (FAQ) section on his web site, Scientific Marketer. Lo (2008) provides a more general framework, from program design to predictive modeling to optimization, along with future research areas. Independently uplift modelling has been studied by Piotr Rzepakowski. Together with Szymon Jaroszewicz he adapted
information theory Information theory is the scientific study of the quantification (science), quantification, computer data storage, storage, and telecommunication, communication of information. The field was originally established by the works of Harry Nyquist a ...
to build multi-class uplift
decision trees A decision tree is a decision support tool that uses a tree-like model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. It is one way to display an algorithm that only contains condit ...
and published the paper in 2010. And later in 2011 they extended the algorithm to multiple treatment case. Similar approaches have been explored in personalised medicine. Szymon Jaroszewicz and Piotr Rzepakowski (2014) designed uplift methodology for
survival analysis Survival analysis is a branch of statistics for analyzing the expected duration of time until one event occurs, such as death in biological organisms and failure in mechanical systems. This topic is called reliability theory or reliability analysi ...
and applied it to randomized controlled trial analysis. Yong (2015) combined a mathematical optimization algorithm via dynamic programming with machine learning methods to optimally stratify patients. Uplift modelling is a special case of the older psychology concept of Differential Prediction. In contrast to differential prediction, uplift modelling assumes an active agent, and uses the uplift measure as an optimization metric. Uplift modeling has been recently extended and incorporated into diverse
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
algorithms, like Inductive Logic Programming,
Bayesian Network A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Bay ...
, Statistical relational learning,
Support Vector Machines In machine learning, support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laboratorie ...
,
Survival Analysis Survival analysis is a branch of statistics for analyzing the expected duration of time until one event occurs, such as death in biological organisms and failure in mechanical systems. This topic is called reliability theory or reliability analysi ...
and
Ensemble learning In statistics and machine learning, ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone. Unlike a statistical ensemble in statisti ...
. Even though uplift modeling is widely applied in marketing practice (along with political elections), it has rarely appeared in marketing literature. Kane, Lo and Zheng (2014) published a thorough analysis of three data sets using multiple methods in a marketing journal and provided evidence that a newer approach (known as the Four Quadrant Method) worked quite well in practice. Lo and Pachamanova (2015) extended uplift modeling to prescriptive analytics for multiple treatment situations and proposed algorithms to solve large deterministic optimization problems and complex stochastic optimization problems where estimates are not exact. Recent research analyses the performance of various state-of-the-art uplift models in benchmark studies using large data amounts. A detailed description of uplift modeling, its history, the way uplift models are built, differences to classical model building as well as uplift-specific evaluation techniques, a comparison of various software solutions and an explanation of different economical scenarios can be found here. R. Michel, I. Schnakenburg, T. von Martens (2019). „Targeting Uplift“. Springer,


Implementations


In Python


CausalML



EconML

UpliftML

PyLift

scikit-uplift


In R




Other languages



for R * JMP by SAS * Portrait Uplift by
Pitney Bowes Pitney Bowes Inc. is an American technology company most known for its postage meters and other mailing equipment and services, and with expansions into e-commerce, software, and other technologies. The company was founded by Arthur Pitney, who i ...
* Uplift node for
KNIME KNIME (), the Konstanz Information Miner, is a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining "Building Blocks ...
by Dymatrix * Uplift Modelling i
Miró
b
Stochastic Solutions


Datasets




Criteo Uplift Prediction dataset




* ttps://www.uplift-modeling.com/en/latest/api/datasets/fetch_megafon.html#megafon-uplift-competition-dataset MegaFon Uplift Competition Dataset


Notes and references

{{reflist


See also

*
Lift (data mining) In data mining and association rule learning, lift is a measure of the performance of a targeting model (association rule) at predicting or classifying cases as having an enhanced response (with respect to the population as a whole), measured ag ...


External links


Abby Johnson explains how it works in this video broadcast

Introductory white paper with full references

Eric Siegel: Uplift Modeling


Quantitative marketing research Business intelligence