Big Data�s Disparate Impact
Advocates of algorithmic techniques like data mining argue that these techniques eliminate human biases from the decision-making process. But an algorithm is only as good as the data it works with. Data is frequently imperfect in ways that allow these algorithms to inherit the prejudices of prior decision makers. In other cases, data may simply reflect the widespread biases that persist in society at large. In still others, data mining can discover surprisingly useful regularities that are really just preexisting patterns of exclusion and inequality. Unthinking reliance on data mining can deny historically disadvantaged and vulnerable groups full participation in society. Worse still, because the resulting discrimination is almost always an unintentional emergent property of the algorithm’s use rather than a conscious choice by its programmers, it can be unusually hard to identify the source of the problem or to explain it to a court.This Essay examines these concerns through the lens
Advocates of algorithmic techniques like data mining argue that these techniques eliminate human biases from the decision-making process. But an algorithm is only as good as the data it works with. Data is frequently imperfect in ways that allow these algorithms to inherit the prejudices of prior decision makers. In other cases, data may simply reflect the widespread biases that persist in society at large. In still others, data mining can discover surprisingly useful regularities that are really just preexisting patterns of exclusion and inequality. Unthinking reliance on data mining can deny historically disadvantaged and vulnerable groups full participation in society. Worse still, because the resulting discrimination is almost always an unintentional emergent property of the algorithm’s use rather than a conscious choice by its programmers, it can be unusually hard to identify the source of the problem or to explain it to a court.This Essay examines these concerns through the lens of American antidiscrimination law — more particularly, through Title VII’s prohibition of discrimination in employment. In the absence of a demonstrable intent to discriminate, the best doctrinal hope for data mining’s victims would seem to lie in disparate impact doctrine. Case law and the Equal Employment Opportunity Commission’s Uniform Guidelines, though, hold that a practice can be justified as a business necessity when its outcomes are predictive of future employment outcomes, and data mining is specifically designed to find such statistical correlations. Unless there is a reasonably practical way to demonstrate that these discoveries are spurious, Title VII would appear to bless its use, even though the correlations it discovers will often reflect historic patterns of prejudice, others’ discrimination against members of protected groups, or flaws in the underlying dataAddressing the sources of this unintentional discrimination and remedying the corresponding deficiencies in the law will be difficult technically, difficult legally, and difficult politically. There are a number of practical limits to what can be accomplished computationally. For example, when discrimination occurs because the data being mined is itself a result of past intentional discrimination, there is frequently no obvious method to adjust historical data to rid it of this taint. Corrective measures that alter the results of the data mining after it is complete would tread on legally and politically disputed terrain. These challenges for reform throw into stark relief the tension between the two major theories underlying antidiscrimination law: anticlassification and antisubordination. Finding a solution to big data’s disparate impact will require more than best efforts to stamp out prejudice and bias; it will require a wholesale reexamination of the meanings of “discrimination” and “fairness.”
Executive Summary
The article 'Big Data’s Disparate Impact' explores the unintended discriminatory outcomes that can arise from the use of algorithmic techniques like data mining in employment decisions. It argues that while these techniques are designed to eliminate human biases, they often inherit prejudices from historical data or reflect societal biases. The article examines these concerns through the lens of Title VII of the Civil Rights Act, particularly the disparate impact doctrine, and highlights the challenges in addressing these issues technically, legally, and politically. It calls for a reexamination of the meanings of 'discrimination' and 'fairness' to find solutions to big data’s disparate impact.
Key Points
- ▸ Algorithmic techniques can inherit biases from historical data or reflect societal biases.
- ▸ Disparate impact doctrine under Title VII may not adequately address unintended discrimination from data mining.
- ▸ Addressing these issues requires technical, legal, and political solutions.
- ▸ A reexamination of the meanings of 'discrimination' and 'fairness' is necessary.
Merits
Comprehensive Analysis
The article provides a thorough examination of the intersection between big data and antidiscrimination law, offering a nuanced understanding of the challenges involved.
Interdisciplinary Approach
The article effectively bridges the gap between technical, legal, and societal issues, making it relevant to a wide audience.
Demerits
Complexity
The article's complexity may make it less accessible to readers without a background in both data science and legal theory.
Lack of Concrete Solutions
While the article identifies problems and challenges, it does not offer concrete solutions, which might leave readers seeking more practical guidance.
Expert Commentary
The article 'Big Data’s Disparate Impact' presents a critical and timely analysis of the unintended consequences of data mining in employment decisions. By highlighting the inherent biases that can be perpetuated through algorithmic techniques, the article underscores the need for a more nuanced understanding of discrimination in the digital age. The author's examination of Title VII and the disparate impact doctrine is particularly insightful, as it reveals the limitations of current legal frameworks in addressing these issues. The call for a reexamination of the meanings of 'discrimination' and 'fairness' is a significant contribution to the ongoing debate about how to ensure that technological advancements do not exacerbate existing inequalities. However, the article's complexity and lack of concrete solutions may limit its immediate practical impact. Overall, this article is a valuable addition to the literature on big data and antidiscrimination law, and it should prompt further research and policy discussions on these critical issues.
Recommendations
- ✓ Further research should be conducted to develop practical methods for identifying and mitigating biases in data mining techniques.
- ✓ Policymakers should engage in interdisciplinary dialogues with data scientists, legal experts, and ethicists to update antidiscrimination laws and regulations to better address the challenges posed by big data.