Unit 8: AI Ethics, Privacy & Responsible Use

Generative AI & Prompt Engineering

This chapter examines the critical ethical and societal challenges accompanying artificial intelligence. It explores how AI systems inherit and amplif...

AI-Powered

TL;DR — Quick Summary

Click Generate Summary to get a quick AI-powered overview of this chapter.

MCQ Practice

Generate MCQ

Lesson 8.1: AI Ethics, Privacy & Responsible Use

Module Overview

This module explores the critical ethical, privacy, and societal challenges posed by artificial intelligence systems. As AI becomes increasingly integrated into daily life, understanding its potential harms—and how to mitigate them—is essential for responsible development and deployment. The module covers six key areas: AI bias, copyright, data privacy, deepfakes, misinformation, and responsible AI use, complemented by real-world case studies.

1. AI Bias

Definition

AI bias refers to the systematic distortion in AI system outputs where models reflect or amplify existing societal stereotypes and prejudices related to gender, race, culture, political views, and other social categories. Machine learning models are not inherently objective—they are trained on human-generated data, and human involvement in data selection and curation can make model predictions susceptible to bias. This bias can lead to discrimination, the spread of misinformation, and the erosion of trust in technology.

Types of Bias

Historical Bias occurs when historical data reflects inequities that existed in the world at that time. Example: A city housing dataset from the 1960s containing home-price data that reflects discriminatory lending practices (redlining).

Reporting Bias occurs when the frequency of events captured in a dataset does not accurately reflect their real-world frequency, because people tend to document unusual or memorable circumstances. Example: A sentiment-analysis model trained on book reviews from a website where most reviews reflect extreme opinions, making the model less able to predict sentiment for reviews using subtle language.

Selection Bias occurs if a dataset's examples are chosen in a way that is not reflective of their real-world distribution. This includes coverage bias (data not selected representatively) and non-response bias (unrepresentative data due to participation gaps).

Automation Bias is a tendency to favor results generated by automated systems over those from non-automated systems, irrespective of error rates.

Gender Bias: Models tend to reproduce traditional gender stereotypes, associating professions and characteristics with specific genders. A 2024 UNESCO study showed LLMs are four times more likely to describe women in domestic roles than men.

Racial and Ethnic Bias: LLMs can exhibit subtle discrimination against ethnic groups. A Bloomberg study found ChatGPT 3.5 preferred resumes from Asian candidates over Black candidates.

Political and Ideological Bias: Despite claims of neutrality, many LLMs show political leaning. A study found left-liberal bias in 23 out of 24 tested LLMs.

Case Study: AI in Hiring

A company deploys an AI recruitment tool trained on historical hiring data from a period when the company predominantly hired male candidates. The AI learns to penalize resumes containing "women's" keywords (e.g., "women's chess club captain"). This demonstrates how historical bias in training data perpetuates discrimination.

2. Copyright

The Core Challenge

AI systems trained on vast amounts of copyrighted material raise fundamental questions: Who owns AI-generated works? Does training AI on copyrighted content constitute fair use?

Key Legal Developments

AI-Generated Works and Copyright: The U.S. Supreme Court and D.C. Circuit have confirmed that purely AI-generated works cannot receive copyright protection. Courts are also beginning to reject the assumption that training AI on copyrighted material is automatically fair use.

The GEMA vs. OpenAI Case: The German music rights association GEMA sued OpenAI for alleged copyright infringement when training ChatGPT with protected song lyrics. The court followed GEMA's theory that at least certain older versions of ChatGPT contained reproductions inside the model. The case concerned lyrics of famous German songs including "Atemlos" and "Über den Wolken".

Key Legal Issues:

Using original works to train, test, or develop AI systems
Whether AI models store or copy specific training data versus reflecting learned patterns
Who owns copyright in AI-generated creative works

Case Study: AI Plagiarism in Design

The owner of Dat Lanh Ao Dai fabric company was prosecuted for using AI to plagiarize design samples from a copyrighted collection. This illustrates how integrating original works into AI systems can give rise to copyright infringement claims.

3. Data Privacy

The Privacy Challenge

AI systems require vast amounts of data, often including personal information. The use of such data poses risks to individuals' rights and freedoms, including the right to privacy. Responsible AI use must consider concerns over data protection and privacy.

Regulatory Framework

GDPR and AI Development: Where personal data is used for AI system development, both the GDPR and the EU AI Act apply. Key requirements include:

Defining a clear, explicit, and legitimate purpose for the AI system
Ensuring AI training data is processed lawfully—based on consent, contractual necessity, legitimate interest, or other legal justifications
Implementing robust safeguards to prevent misuse of personal information

Privacy by Design: Data protection authorities emphasize embedding privacy-by-design principles into AI systems. AI should be developed and deployed in accordance with data protection and privacy rules.

Global Privacy Laws: New comprehensive privacy laws continue to emerge—three U.S. states (Indiana, Kentucky, and Rhode Island) transitioned from planning to enforcement in January 2026.

Case Study: AI Training Data

A health-tech company develops an AI diagnostic tool using patient records. Without proper anonymization and consent mechanisms, the system violates patient privacy. This highlights the tension between AI innovation and data protection—training datasets often include "personal data" that must be handled with care.

4. Deepfakes

Definition and Technology

Deepfakes are AI-generated digital content—typically video, audio, or images—manipulated using deep learning algorithms that alter original content with fake content that looks authentic. The term combines "deep learning" and "fake". These manipulations use Generative AI techniques trained on extensive datasets.

Risks and Concerns

Deepfakes pose serious threats including:

Privacy violations and identity theft
Political manipulation and election interference
Fraud and impersonation
Non-consensual intimate imagery
Large-scale societal disruption threatening trust and information integrity

Detection Challenge: Alarmingly, 27%–50% of people cannot distinguish deepfake-generated content from real content. Detection with human accuracy remains relatively low.

Emerging Detection Methods

Current research focuses on multimodal detection systems, real-time forensics, explainable AI, and diverse high-quality datasets. Techniques include diffusion models, blockchain, federated learning, and explainable AI systems.

Case Study: Hungarian Election Deepfakes

In Hungary's 2026 parliamentary elections, AI-generated content was widely used to manipulate public opinion. A fabricated video showed opposition leader Peter Magyar allegedly calling President Trump "a senile grandpa"—the original audio was swapped with AI-generated voiceover. Perpetrators superimposed news logos and linked to counterfeit websites to lend credibility. Another fake claimed Magyar was withdrawing his candidacy. The use of deepfakes and counterfeit news sites posed significant risks to democratic transparency.

5. Misinformation

Defining Misinformation vs. Disinformation

Misinformation is false information shared without the intent to deceive. Disinformation is false information deliberately created and spread to manipulate and mislead. Regardless of intent, both can cause significant harm to society.

AI's Role in Misinformation

AI technologies have accelerated the spread of false information, fueling hate, political manipulation, and financial exploitation. Key concerns include:

AI-Generated Content: Deepfakes, synthetic voices, and falsified documents introduce new security and ethical threats
Counterfeit News Sites: Fake websites mimicking legitimate news outlets
AI Chatbot Disinformation: Pro-Russian misinformation has been found to seep into responses generated by conversational AI agents
Visual Misinformation: Recycled or AI-generated images are increasingly blurring the line between fact and fiction

Real-World Examples

Venezuela Misinformation: After US forces seized Venezuelan leader Nicolás Maduro, a surge of visual misinformation inundated social media. NewsGuard identified seven fabricated images and videos related to the operation that collectively garnered more than 21 million views on X alone.

AI-Generated "News": Massive numbers of fake news accounts use misleading content, including AI-generated animations of real events, flooding social media platforms.

Case Study: AI-Generated Disaster Footage

During flooding in Kenya, a fake "disaster" video showing cars being swept away was created by repurposing AI-generated content from another creator and falsely labeling it as Kenyan灾情. This demonstrates how AI-generated content can be easily repurposed for misinformation.

6. Responsible AI Use

What is Responsible AI?

Responsible AI (RAI) focuses on how to develop, evaluate, deploy, and monitor AI systems in a safe, trustworthy, and ethical manner. It seeks to harness AI's potential while anchoring it in ethical principles.

Core Principles

The Responsible AI framework typically includes three main components: Ethical Principles, Workplace Impact, and Regulatory Framework, each linked to specific strategies like Bias Prevention, Reskilling Programs, and Legal Frameworks.

Key principles include:

Safety: AI systems should be protected against adversarial threats and misuse, and aligned to the public good
Transparency: Clear understanding of how AI models reach decisions
Accountability: Clear ownership and responsibility for AI outcomes
Fairness: Prevention of bias and discrimination

Practical Implementation

Stanford Law School's AI Life Cycle Core Principles (AILCCP) framework provides 48 actionable controls—specific mechanisms, policies, and technical safeguards that translate abstract AI principles into concrete, implementable measures. These controls span domains including security, technical, governance, monitoring, testing, regulatory, documentation, safety, transparency, and maintenance.

Key Practices for Responsible AI

Define clear objectives: AI systems must have well-defined, explicit, and legitimate purposes
Document human creative contributions: When using AI for content creation, document human involvement
Audit training data sourcing: Ensure data is collected and used lawfully
Conduct bias and fairness assessments: Regularly evaluate models for bias
Implement security measures: Protect against adversarial attacks and misuse

Case Study: AI Tutors and Academic Integrity

An institution deployed an AI tutor platform across postgraduate programs. Students earned "readiness scores" by engaging with pre-class material. However, cracks appeared quickly:

High scores didn't align with actual understanding
Students gamed the system by copying generic answers
Administrative decisions were increasingly shaped by AI dashboards, not human dialogue
The tool drifted from being a reflective support system to a performative signal

Simultaneously, another committee found that unsupervised assessments powered by generative AI were the source of nearly 73 percent of reported ethical violations. Students used AI tools to generate answer templates and sold them through encrypted messaging platforms—no detection system flagged the activity.

Lessons Learned: Don't confuse metrics with meaning. AI-powered metrics are useful but not definitive—human insights and faculty judgment remain essential.

Activities: Case Studies for Discussion

Case Study 1: The Biased Hiring Algorithm

Scenario: A company uses an AI recruitment tool to screen job applications. The tool consistently ranks male candidates higher than female candidates for technical roles. Discussion Questions:

What type(s) of bias are present?
How might historical data have contributed?
What steps could mitigate this bias?
Who is responsible for ensuring fairness?

Case Study 2: The AI-Generated Deepfake Scandal

Scenario: During an election campaign, a deepfake video of a candidate making inflammatory statements goes viral. The video is viewed millions of times before it's debunked. Discussion Questions:

What are the ethical implications?
How should platforms respond?
What legal recourse exists?
How can society build resilience against such manipulation?

Case Study 3: Copyright in the Age of AI

Scenario: An artist discovers that their copyrighted artwork was used to train a commercial AI image generator without permission or compensation. Discussion Questions:

Should training AI on copyrighted works require permission?
Who owns AI-generated derivatives of copyrighted works?
How should copyright law evolve?

Case Study 4: Privacy vs. Innovation

Scenario: A healthcare AI startup wants to train a diagnostic model using patient data. They have anonymized the data but did not obtain explicit patient consent. Discussion Questions:

What privacy principles are at stake?
Is anonymization sufficient?
What regulatory requirements apply?
How can innovation be balanced with privacy protection?

Case Study 5: Academic Integrity and AI

Scenario: A university discovers that students are using AI to generate essays and selling them to other students. Nearly 73% of reported ethical violations involve AI-assisted assessments. Discussion Questions:

How should educational institutions respond?
What role should AI detection tools play?
How can AI be ethically integrated into education?
What are the responsibilities of students, faculty, and administrators?

Lesson Summary

This module highlights that AI is not neutral—it reflects the data, values, and intentions of its creators. Understanding AI bias, copyright complexities, data privacy requirements, deepfake risks, misinformation dynamics, and responsible AI principles is essential for anyone working with or affected by AI systems. The case studies demonstrate that ethical challenges are not abstract—they have real-world consequences that demand thoughtful, proactive responses from developers, policymakers, educators, and society at large.