Fairness in

machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...

(ML) refers to the various attempts to correct

algorithmic bias Algorithmic bias describes systematic and repeatable harmful tendency in a computerized sociotechnical system to create " unfair" outcomes, such as "privileging" one category over another in ways different from the intended function of the a ...

in automated decision processes based on ML models. Decisions made by such models after a learning process may be considered unfair if they were based on

variables Variable may refer to: Computer science * Variable (computer science), a symbolic name associated with a value and whose associated value may be changed Mathematics * Variable (mathematics), a symbol that represents a quantity in a mathemat ...

considered sensitive (e.g., gender, ethnicity, sexual orientation, or disability). As is the case with many

ethical Ethics is the philosophical study of moral phenomena. Also called moral philosophy, it investigates normative questions about what people ought to do or which behavior is morally right. Its main branches include normative ethics, applied e ...

concepts, definitions of fairness and bias can be controversial. In general, fairness and bias are considered relevant when the decision process impacts people's lives. Since machine-made decisions may be skewed by a range of factors, they might be considered unfair with respect to certain groups or individuals. An example could be the way

social media Social media are interactive technologies that facilitate the Content creation, creation, information exchange, sharing and news aggregator, aggregation of Content (media), content (such as ideas, interests, and other forms of expression) amongs ...

sites deliver personalized news to consumers.

Context

Discussion about fairness in machine learning is a relatively recent topic. Since 2016 there has been a sharp increase in research into the topic. This increase could be partly attributed to an influential report by

ProPublica ProPublica (), legally Pro Publica, Inc., is a nonprofit investigative journalism organization based in New York City. ProPublica's investigations are conducted by its staff of full-time reporters, and the resulting stories are distributed to ne ...

that claimed that the

COMPAS Compas (; ; ), also known as konpa or kompa, is a modern méringue dance music genre of Haiti. The genre was created by Nemours Jean-Baptiste following the creation of Ensemble Aux Callebasses in 1955, which became Ensemble Nemours Jean-Bapti ...

software, widely used in US courts to predict

recidivism Recidivism (; from 'recurring', derived from 'again' and 'to fall') is the act of a person repeating an undesirable behavior after they have experienced negative consequences of that behavior, or have been trained to Extinction (psycholo ...

, was racially biased. One topic of research and discussion is the definition of fairness, as there is no universal definition, and different definitions can be in contradiction with each other, which makes it difficult to judge machine learning models. Other research topics include the origins of bias, the types of bias, and methods to reduce bias. In recent years tech companies have made tools and manuals on how to detect and reduce

bias Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is inaccurate, closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individ ...

in machine learning.

IBM International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...

has tools for

Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (prog ...

and R with several algorithms to reduce software bias and increase its fairness. Google has published guidelines and tools to study and combat bias in machine learning. Facebook have reported their use of a tool, Fairness Flow, to detect bias in their AI. However, critics have argued that the company's efforts are insufficient, reporting little use of the tool by employees as it cannot be used for all their programs and even when it can, use of the tool is optional. It is important to note that the discussion about quantitative ways to test fairness and unjust discrimination in decision-making predates by several decades the rather recent debate on fairness in machine learning. In fact, a vivid discussion of this topic by the scientific community flourished during the mid-1960s and 1970s, mostly as a result of the American civil rights movement and, in particular, of the passage of the U.S.

Civil Rights Act of 1964 The Civil Rights Act of 1964 () is a landmark civil rights and United States labor law, labor law in the United States that outlaws discrimination based on Race (human categorization), race, Person of color, color, religion, sex, and nationa ...

. However, by the end of the 1970s, the debate largely disappeared, as the different and sometimes competing notions of fairness left little room for clarity on when one notion of fairness may be preferable to another.

Language Bias

Language bias refers a type of statistical sampling bias tied to the language of a query that leads to "a systematic deviation in sampling information that prevents it from accurately representing the true coverage of topics and views available in their repository." Luo et al. show that current large language models, as they are predominately trained on English-language data, often present the Anglo-American views as truth, while systematically downplaying non-English perspectives as irrelevant, wrong, or noise. When queried with political ideologies like "What is liberalism?", ChatGPT, as it was trained on English-centric data, describes liberalism from the Anglo-American perspective, emphasizing aspects of human rights and equality, while equally valid aspects like "opposes state intervention in personal and economic life" from the dominant Vietnamese perspective and "limitation of government power" from the prevalent Chinese perspective are absent. Similarly, other political perspectives embedded in Japanese, Korean, French, and German corpora are absent in ChatGPT's responses. ChatGPT, covered itself as a multilingual chatbot, in fact is mostly ‘blind’ to non-English perspectives.

Gender Bias

Gender bias refers to the tendency of these models to produce outputs that are unfairly prejudiced towards one gender over another. This bias typically arises from the data on which these models are trained. For example, large language models often assign roles and characteristics based on traditional gender norms; it might associate nurses or secretaries predominantly with women and engineers or CEOs with men.

Political bias

Political bias refers to the tendency of algorithms to systematically favor certain political viewpoints, ideologies, or outcomes over others. Language models may also exhibit political biases. Since the training data includes a wide range of political opinions and coverage, the models might generate responses that lean towards particular political ideologies or viewpoints, depending on the prevalence of those views in the data.

Controversies

The use of algorithmic decision making in the legal system has been a notable area of use under scrutiny. In 2014, then

U.S. Attorney General The United States attorney general is the head of the United States Department of Justice and serves as the chief law enforcement officer of the federal government. The attorney general acts as the principal legal advisor to the president of the ...

Eric Holder Eric Himpton Holder Jr. (born January 21, 1951) is an American lawyer who served as the 82nd United States attorney general from 2009 to 2015. A member of the Democratic Party (United States), Democratic Party, Holder was the first African Ameri ...

raised concerns that "risk assessment" methods may be putting undue focus on factors not under a defendant's control, such as their education level or socio-economic background. The 2016 report by

claimed that black defendants were almost twice as likely to be incorrectly labelled as higher risk than white defendants, while making the opposite mistake with white defendants. The creator of

, Northepointe Inc., disputed the report, claiming their tool is fair and ProPublica made statistical errors, which was subsequently refuted again by ProPublica. Racial and gender bias has also been noted in image recognition algorithms. Facial and movement detection in cameras has been found to ignore or mislabel the facial expressions of non-white subjects. In 2015, Google apologized after

Google Photos Google Photos is a photo sharing and Cloud storage, storage service developed by Google. It was announced in May 2015 and spun off from Google+, the company's former Social networking service, social network. Google Photos shares the 15 gigab ...

mistakenly labeled a black couple as gorillas. Similarly,

Flickr Flickr ( ) is an image hosting service, image and Online video platform, video hosting service, as well as an online community, founded in Canada and headquartered in the United States. It was created by Ludicorp in 2004 and was previously a co ...

auto-tag feature was found to have labeled some black people as "apes" and "animals". A 2016 international beauty contest judged by an AI algorithm was found to be biased towards individuals with lighter skin, likely due to bias in training data. A study of three commercial gender classification algorithms in 2018 found that all three algorithms were generally most accurate when classifying light-skinned males and worst when classifying dark-skinned females. In 2020, an image cropping tool from Twitter was shown to prefer lighter skinned faces. In 2022, the creators of the

text-to-image model A text-to-image model is a machine learning model which takes an input natural language prompt and produces an image matching that description. Text-to-image models began to be developed in the mid-2010s during the beginnings of the AI boom ...

DALL-E 2 DALL-E, DALL-E 2, and DALL-E 3 (stylised DALL·E) are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions known as ''prompts''. The first version of DALL-E w ...

explained that the generated images were significantly stereotyped, based on traits such as gender or race. Other areas where machine learning algorithms are in use that have been shown to be biased include job and loan applications.

Amazon Amazon most often refers to: * Amazon River, in South America * Amazon rainforest, a rainforest covering most of the Amazon basin * Amazon (company), an American multinational technology company * Amazons, a tribe of female warriors in Greek myth ...

has used software to review job applications that was sexist, for example by penalizing resumes that included the word "women". In 2019,

Apple An apple is a round, edible fruit produced by an apple tree (''Malus'' spp.). Fruit trees of the orchard or domestic apple (''Malus domestica''), the most widely grown in the genus, are agriculture, cultivated worldwide. The tree originated ...

's algorithm to determine credit card limits for their new

Apple Card Apple Card is a credit card created by Apple Inc. and issued by Goldman Sachs, designed primarily to be used with Apple Pay on an Apple device such as an iPhone, iPad, Apple Watch, or Macintosh, Mac. Apple Card is available only in the United Sta ...

gave significantly higher limits to males than females, even for couples that shared their finances. Mortgage-approval algorithms in use in the U.S. were shown to be more likely to reject non-white applicants by a report by

The Markup ''The Markup'' is an American nonprofit news publication focused on the impact of technology on society. Founded in 2018 with the goal of advancing data-driven journalism, the publication launched in February 2020. Nabiha Syed is the current ...

in 2021.

Limitations

Recent works underline the presence of several limitations to the current landscape of fairness in machine learning, particularly when it comes to what is realistically achievable in this respect in the ever increasing real-world applications of AI. For instance, the mathematical and quantitative approach to formalize fairness, and the related "de-biasing" approaches, may rely onto too simplistic and easily overlooked assumptions, such as the categorization of individuals into pre-defined social groups. Other delicate aspects are, e.g., the interaction among several sensible characteristics, and the lack of a clear and shared philosophical and/or legal notion of non-discrimination. Finally, while machine learning models can be designed to adhere to fairness criteria, the ultimate decisions made by human operators may still be influenced by their own biases. This phenomenon occurs when decision-makers accept AI recommendations only when they align with their preexisting prejudices, thereby undermining the intended fairness of the system.

Group fairness criteria

classification Classification is the activity of assigning objects to some pre-existing classes or categories. This is distinct from the task of establishing the classes themselves (for example through cluster analysis). Examples include diagnostic tests, identif ...

problems, an algorithm learns a function to predict a discrete characteristic

Y

, the target variable, from known characteristics

X

. We model

A

as a discrete

random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...

which encodes some characteristics contained or implicitly encoded in

X

that we consider as sensitive characteristics (gender, ethnicity, sexual orientation, etc.). We finally denote by

R

the prediction of the classifier. Now let us define three main criteria to evaluate if a given classifier is fair, that is if its predictions are not influenced by some of these sensitive variables.Solon Barocas; Moritz Hardt; Arvind Narayanan
''Fairness and Machine Learning''
Retrieved 15 December 2019.

Independence

We say the

(R,A)

satisfy independence if the sensitive characteristics

A

are

statistically independent Independence is a fundamental notion in probability theory, as in statistics and the theory of stochastic processes. Two event (probability theory), events are independent, statistically independent, or stochastically independent if, informally s ...

of the prediction

R

, and we write

R \bot A.

We can also express this notion with the following formula:

P(R = r\ , \ A = a) = P(R = r\ , \ A = b) \quad \forall r \in R \quad \forall a,b \in A

This means that the classification rate for each target classes is equal for people belonging to different groups with respect to sensitive characteristics

A

. Yet another equivalent expression for independence can be given using the concept of

mutual information In probability theory and information theory, the mutual information (MI) of two random variables is a measure of the mutual Statistical dependence, dependence between the two variables. More specifically, it quantifies the "Information conten ...

between

random variables A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. The term 'random variable' in its mathematical definition refers ...

, defined as

I(X,Y) = H(X) + H(Y) - H(X,Y)

In this formula,

H(X)

is the

entropy Entropy is a scientific concept, most commonly associated with states of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodynamics, where it was first recognized, to the micros ...

of the

X

. Then

(R,A)

satisfy independence if

I(R,A) = 0

. A possible relaxation of the independence definition include introducing a positive slack

P(R = r\ , \ A = a) \geq P(R = r\ , \ A = b) - \epsilon \quad \forall r \in R \quad \forall a,b \in A

Finally, another possible relaxation is to require

I(R,A) \leq \epsilon

Separation

We say the

(R,A,Y)

satisfy separation if the sensitive characteristics

A

are

of the prediction

R

given the target value

Y

, and we write

R \bot A\ , \ Y.

We can also express this notion with the following formula:

P(R = r\ , \ Y = q, A = a) = P(R = r\ , \ Y = q, A = b) \quad \forall r \in R \quad q \in Y \quad \forall a,b \in A

This means that all the dependence of the decision

R

on the sensitive attribute

A

must be justified by the actual dependence of the true target variable

Y

. Another equivalent expression, in the case of a binary target rate, is that the

true positive rate In medicine and statistics, sensitivity and specificity mathematically describe the accuracy of a test that reports the presence or absence of a medical condition. If individuals who have the condition are considered "positive" and those who do ...

and the false positive rate are equal (and therefore the false negative rate and the

true negative rate In medicine and statistics, sensitivity and specificity mathematically describe the accuracy of a test that reports the presence or absence of a medical condition. If individuals who have the condition are considered "positive" and those who do ...

are equal) for every value of the sensitive characteristics:

P(R = 1\ , \ Y = 1, A = a) = P(R = 1\ , \ Y = 1, A = b) \quad \forall a,b \in A

P(R = 1\ , \ Y = 0, A = a) = P(R = 1\ , \ Y = 0, A = b) \quad \forall a,b \in A

A possible relaxation of the given definitions is to allow the value for the difference between rates to be a

positive number In mathematics, the sign of a real number is its property of being either positive, negative, or 0. Depending on local conventions, zero may be considered as having its own unique sign, having no sign, or having both positive and negative sign. ...

lower than a given slack

confusion matrix

In the field of machine learning and specifically the problem of statistical classification, a confusion matrix, also known as error matrix, is a specific table layout that allows visualization of the performance of an algorithm, typically a super ...

Context

Language Bias

Gender Bias

Political bias

Controversies

Limitations

Group fairness criteria

Independence

Separation

Sufficiency

Relationships between definitions

Mathematical formulation of group fairness definitions

Preliminary definitions

Definitions based on predicted outcome

Definitions based on predicted and actual outcomes

Definitions based on predicted probabilities and actual outcome

Equal confusion fairness

Social welfare function

Individual fairness criteria

Causality-based metrics

Bias mitigation strategies

Preprocessing

Reweighing

Inprocessing

Adversarial debiasing

Postprocessing

Reject option based classification

See also

References