Leaf decoration

AI Ethics, Privacy & Responsible Use

Unit: 8
Book Icon

Generative AI & Prompt Engineering

This chapter examines the critical ethical and societal challenges accompanying artificial intelligence. It explores how AI systems inherit and amplif...

AI-Powered
TL;DR — Quick Summary
Click Generate Summary to get a quick AI-powered overview of this chapter.
Gemini is reading the chapter...
    Could not generate summary. Please try again.
    Explain This
    AI Explanation
    Explaining...

    Could not explain. Try again.
    MCQ Practice

    Lesson 8.1: AI Ethics, Privacy & Responsible Use

    Module Overview

    This module explores the critical ethical, privacy, and societal challenges posed by artificial intelligence systems. As AI becomes increasingly integrated into daily life, understanding its potential harms—and how to mitigate them—is essential for responsible development and deployment. The module covers six key areas: AI bias, copyright, data privacy, deepfakes, misinformation, and responsible AI use, complemented by real-world case studies.


    1. AI Bias

    Definition

    AI bias refers to the systematic distortion in AI system outputs where models reflect or amplify existing societal stereotypes and prejudices related to gender, race, culture, political views, and other social categories. Machine learning models are not inherently objective—they are trained on human-generated data, and human involvement in data selection and curation can make model predictions susceptible to bias. This bias can lead to discrimination, the spread of misinformation, and the erosion of trust in technology.

    Types of Bias

    Historical Bias occurs when historical data reflects inequities that existed in the world at that time. Example: A city housing dataset from the 1960s containing home-price data that reflects discriminatory lending practices (redlining).

    Reporting Bias occurs when the frequency of events captured in a dataset does not accurately reflect their real-world frequency, because people tend to document unusual or memorable circumstances. Example: A sentiment-analysis model trained on book reviews from a website where most reviews reflect extreme opinions, making the model less able to predict sentiment for reviews using subtle language.

    Selection Bias occurs if a dataset's examples are chosen in a way that is not reflective of their real-world distribution. This includes coverage bias (data not selected representatively) and non-response bias (unrepresentative data due to participation gaps).

    Automation Bias is a tendency to favor results generated by automated systems over those from non-automated systems, irrespective of error rates.

    Gender Bias: Models tend to reproduce traditional gender stereotypes, associating professions and characteristics with specific genders. A 2024 UNESCO study showed LLMs are four times more likely to describe women in domestic roles than men.

    Racial and Ethnic Bias: LLMs can exhibit subtle discrimination against ethnic groups. A Bloomberg study found ChatGPT 3.5 preferred resumes from Asian candidates over Black candidates.

    Political and Ideological Bias: Despite claims of neutrality, many LLMs show political leaning. A study found left-liberal bias in 23 out of 24 tested LLMs.

    Case Study: AI in Hiring

    A company deploys an AI recruitment tool trained on historical hiring data from a period when the company predominantly hired male candidates. The AI learns to penalize resumes containing "women's" keywords (e.g., "women's chess club captain"). This demonstrates how historical bias in training data perpetuates discrimination.


    2. Copyright

    The Core Challenge

    AI systems trained on vast amounts of copyrighted material raise fundamental questions: Who owns AI-generated works? Does training AI on copyrighted content constitute fair use?

    Key Legal Developments

    AI-Generated Works and Copyright: The U.S. Supreme Court and D.C. Circuit have confirmed that purely AI-generated works cannot receive copyright protection. Courts are also beginning to reject the assumption that training AI on copyrighted material is automatically fair use.

    The GEMA vs. OpenAI Case: The German music rights association GEMA sued OpenAI for alleged copyright infringement when training ChatGPT with protected song lyrics. The court followed GEMA's theory that at least certain older versions of ChatGPT contained reproductions inside the model. The case concerned lyrics of famous German songs including "Atemlos" and "Über den Wolken".

    Key Legal Issues:

    1. Using original works to train, test, or develop AI systems
    2. Whether AI models store or copy specific training data versus reflecting learned patterns
    3. Who owns copyright in AI-generated creative works

    Case Study: AI Plagiarism in Design

    The owner of Dat Lanh Ao Dai fabric company was prosecuted for using AI to plagiarize design samples from a copyrighted collection. This illustrates how integrating original works into AI systems can give rise to copyright infringement claims.


    3. Data Privacy

    The Privacy Challenge

    AI systems require vast amounts of data, often including personal information. The use of such data poses risks to individuals' rights and freedoms, including the right to privacy. Responsible AI use must consider concerns over data protection and privacy.

    Regulatory Framework

    GDPR and AI Development: Where personal data is used for AI system development, both the GDPR and the EU AI Act apply. Key requirements include:

    • Defining a clear, explicit, and legitimate purpose for the AI system
    • Ensuring AI training data is processed lawfully—based on consent, contractual necessity, legitimate interest, or other legal justifications
    • Implementing robust safeguards to prevent misuse of personal information

    Privacy by Design: Data protection authorities emphasize embedding privacy-by-design principles into AI systems. AI should be developed and deployed in accordance with data protection and privacy rules.

    Global Privacy Laws: New comprehensive privacy laws continue to emerge—three U.S. states (Indiana, Kentucky, and Rhode Island) transitioned from planning to enforcement in January 2026.

    Case Study: AI Training Data

    A health-tech company develops an AI diagnostic tool using patient records. Without proper anonymization and consent mechanisms, the system violates patient privacy. This highlights the tension between AI innovation and data protection—training datasets often include "personal data" that must be handled with care.


    4. Deepfakes

    Definition and Technology

    Deepfakes are AI-generated digital content—typically video, audio, or images—manipulated using deep learning algorithms that alter original content with fake content that looks authentic. The term combines "deep learning" and "fake". These manipulations use Generative AI techniques trained on extensive datasets.

    Risks and Concerns

    Deepfakes pose serious threats including:

    • Privacy violations and identity theft
    • Political manipulation and election interference
    • Fraud and impersonation
    • Non-consensual intimate imagery
    • Large-scale societal disruption threatening trust and information integrity

    Detection Challenge: Alarmingly, 27%–50% of people cannot distinguish deepfake-generated content from real content. Detection with human accuracy remains relatively low.

    Emerging Detection Methods

    Current research focuses on multimodal detection systems, real-time forensics, explainable AI, and diverse high-quality datasets. Techniques include diffusion models, blockchain, federated learning, and explainable AI systems.

    Case Study: Hungarian Election Deepfakes

    In Hungary's 2026 parliamentary elections, AI-generated content was widely used to manipulate public opinion. A fabricated video showed opposition leader Peter Magyar allegedly calling President Trump "a senile grandpa"—the original audio was swapped with AI-generated voiceover. Perpetrators superimposed news logos and linked to counterfeit websites to lend credibility. Another fake claimed Magyar was withdrawing his candidacy. The use of deepfakes and counterfeit news sites posed significant risks to democratic transparency.


    5. Misinformation

    Defining Misinformation vs. Disinformation

    Misinformation is false information shared without the intent to deceive. Disinformation is false information deliberately created and spread to manipulate and mislead. Regardless of intent, both can cause significant harm to society.

    AI's Role in Misinformation

    AI technologies have accelerated the spread of false information, fueling hate, political manipulation, and financial exploitation. Key concerns include:

    • AI-Generated Content: Deepfakes, synthetic voices, and falsified documents introduce new security and ethical threats
    • Counterfeit News Sites: Fake websites mimicking legitimate news outlets
    • AI Chatbot Disinformation: Pro-Russian misinformation has been found to seep into responses generated by conversational AI agents
    • Visual Misinformation: Recycled or AI-generated images are increasingly blurring the line between fact and fiction

    Real-World Examples

    Venezuela Misinformation: After US forces seized Venezuelan leader Nicolás Maduro, a surge of visual misinformation inundated social media. NewsGuard identified seven fabricated images and videos related to the operation that collectively garnered more than 21 million views on X alone.

    AI-Generated "News": Massive numbers of fake news accounts use misleading content, including AI-generated animations of real events, flooding social media platforms.

    Case Study: AI-Generated Disaster Footage

    During flooding in Kenya, a fake "disaster" video showing cars being swept away was created by repurposing AI-generated content from another creator and falsely labeling it as Kenyan灾情. This demonstrates how AI-generated content can be easily repurposed for misinformation.


    6. Responsible AI Use

    What is Responsible AI?

    Responsible AI (RAI) focuses on how to develop, evaluate, deploy, and monitor AI systems in a safe, trustworthy, and ethical manner. It seeks to harness AI's potential while anchoring it in ethical principles.

    Core Principles

    The Responsible AI framework typically includes three main components: Ethical Principles, Workplace Impact, and Regulatory Framework, each linked to specific strategies like Bias Prevention, Reskilling Programs, and Legal Frameworks.

    Key principles include:

    • Safety: AI systems should be protected against adversarial threats and misuse, and aligned to the public good
    • Transparency: Clear understanding of how AI models reach decisions
    • Accountability: Clear ownership and responsibility for AI outcomes
    • Fairness: Prevention of bias and discrimination

    Practical Implementation

    Stanford Law School's AI Life Cycle Core Principles (AILCCP) framework provides 48 actionable controls—specific mechanisms, policies, and technical safeguards that translate abstract AI principles into concrete, implementable measures. These controls span domains including security, technical, governance, monitoring, testing, regulatory, documentation, safety, transparency, and maintenance.

    Key Practices for Responsible AI

    1. Define clear objectives: AI systems must have well-defined, explicit, and legitimate purposes
    2. Document human creative contributions: When using AI for content creation, document human involvement
    3. Audit training data sourcing: Ensure data is collected and used lawfully
    4. Conduct bias and fairness assessments: Regularly evaluate models for bias
    5. Implement security measures: Protect against adversarial attacks and misuse

    Case Study: AI Tutors and Academic Integrity

    An institution deployed an AI tutor platform across postgraduate programs. Students earned "readiness scores" by engaging with pre-class material. However, cracks appeared quickly:

    • High scores didn't align with actual understanding
    • Students gamed the system by copying generic answers
    • Administrative decisions were increasingly shaped by AI dashboards, not human dialogue
    • The tool drifted from being a reflective support system to a performative signal

    Simultaneously, another committee found that unsupervised assessments powered by generative AI were the source of nearly 73 percent of reported ethical violations. Students used AI tools to generate answer templates and sold them through encrypted messaging platforms—no detection system flagged the activity.

    Lessons Learned: Don't confuse metrics with meaning. AI-powered metrics are useful but not definitive—human insights and faculty judgment remain essential.


    Activities: Case Studies for Discussion

    Case Study 1: The Biased Hiring Algorithm

    Scenario: A company uses an AI recruitment tool to screen job applications. The tool consistently ranks male candidates higher than female candidates for technical roles. Discussion Questions:

    • What type(s) of bias are present?
    • How might historical data have contributed?
    • What steps could mitigate this bias?
    • Who is responsible for ensuring fairness?

    Case Study 2: The AI-Generated Deepfake Scandal

    Scenario: During an election campaign, a deepfake video of a candidate making inflammatory statements goes viral. The video is viewed millions of times before it's debunked. Discussion Questions:

    • What are the ethical implications?
    • How should platforms respond?
    • What legal recourse exists?
    • How can society build resilience against such manipulation?

    Case Study 3: Copyright in the Age of AI

    Scenario: An artist discovers that their copyrighted artwork was used to train a commercial AI image generator without permission or compensation. Discussion Questions:

    • Should training AI on copyrighted works require permission?
    • Who owns AI-generated derivatives of copyrighted works?
    • How should copyright law evolve?

    Case Study 4: Privacy vs. Innovation

    Scenario: A healthcare AI startup wants to train a diagnostic model using patient data. They have anonymized the data but did not obtain explicit patient consent. Discussion Questions:

    • What privacy principles are at stake?
    • Is anonymization sufficient?
    • What regulatory requirements apply?
    • How can innovation be balanced with privacy protection?

    Case Study 5: Academic Integrity and AI

    Scenario: A university discovers that students are using AI to generate essays and selling them to other students. Nearly 73% of reported ethical violations involve AI-assisted assessments. Discussion Questions:

    • How should educational institutions respond?
    • What role should AI detection tools play?
    • How can AI be ethically integrated into education?
    • What are the responsibilities of students, faculty, and administrators?

    Lesson Summary

    This module highlights that AI is not neutral—it reflects the data, values, and intentions of its creators. Understanding AI bias, copyright complexities, data privacy requirements, deepfake risks, misinformation dynamics, and responsible AI principles is essential for anyone working with or affected by AI systems. The case studies demonstrate that ethical challenges are not abstract—they have real-world consequences that demand thoughtful, proactive responses from developers, policymakers, educators, and society at large.

    Share Now

    Share to help more learners!

    Resources
    Lesson Contents