Unit 4: Working with Popular AI Tools

Generative AI & Prompt Engineering

This chapter explores essential AI platforms transforming creative and professional workflows. Learners gain practical skills with leading text tools...

AI-Powered

TL;DR — Quick Summary

Click Generate Summary to get a quick AI-powered overview of this chapter.

MCQ Practice

Generate MCQ

Lesson 4.1: Working with Popular AI Tools

This module provides hands-on exploration of leading AI tools across categories. You'll learn their core features, strengths, limitations, access methods, best practices, and real-world applications. The focus is practical proficiency and understanding when to choose one tool over another.

Learning Objectives:

Understand capabilities and interfaces of major AI tools.
Compare tools for text, image, video, presentation, and research tasks.
Apply tools effectively through prompting and workflow integration.
Complete cross-tool exercises to evaluate outputs and select the best fit.

Lesson 4.2: Popular tools AI Tools

1. Text AI

a. ChatGPT (OpenAI)
ChatGPT is a versatile conversational AI with strong general capabilities, multimodal support (text, images, voice), and features like memory, Canvas for editing, Deep Research, file analysis, and advanced models (e.g., GPT-5.x series in 2026).

Key Features (2026): Model selection (e.g., reasoning vs. fast models), web search integration, voice mode, Projects for file management, Codex for coding, image/video generation, persistent memory, and integrations (Slack, etc.). Large pastes as attachments; advanced data analysis.
How to Use: Access via chatgpt.com or apps. Start with clear prompts; use "Thinking" mode for complex tasks. Upload files for analysis. Enable Canvas for iterative editing.
Strengths: Balanced performance, voice, broad integrations, value in Plus/Pro plans. Good for creative writing, coding, and research.
Limitations: Can hallucinate without search; usage limits on free tier.
Best For: Everyday tasks, brainstorming, content generation.

b. Gemini (Google)
Google's multimodal AI excels in integration with Google Workspace (Gmail, Docs, Sheets), real-time search, image/video generation (via models like Veo/Omni), and analysis of files/tabs.

Key Features: Canvas mode, Gems (custom experts), deep integration with Google apps, video generation/editing (Gemini Omni), image creation, and proactive task automation.
How to Use: gemini.google.com or mobile app. Leverage context from Drive/Gmail. Use for research with live web data.
Strengths: Strong multimodal (audio/video analysis), Google ecosystem, speed in Flash models.
Limitations: Coding sometimes weaker than competitors.
Best For: Productivity in Google apps, visual/media tasks, research.

c. Claude (Anthropic)
Claude prioritizes safety, reasoning, and reliability. It shines in complex tasks, coding, long-context analysis, and structured outputs. Features like Artifacts, Projects, and computer use.

Key Features (2026): Opus models for advanced reasoning/coding, file handling, vision, structured responses, and tools like Claude Code/Design. Strong instruction-following.
How to Use: claude.ai. Upload documents for analysis; use Artifacts for interactive previews.
Strengths: Excellent for deep reasoning, coding, nuanced writing, and following detailed prompts.
Limitations: May be more cautious/restrictive; higher cost for heavy use.
Best For: Professional writing, coding, analysis requiring accuracy.

Tips for Text AI: Use role-playing in prompts (e.g., "Act as a expert editor"). Chain tools: Research in Perplexity/Claude, refine in ChatGPT/Gemini.

2. Image AI

a. Midjourney
A leading text-to-image tool known for artistic, high-quality outputs. Primarily accessed via Discord.

Key Features: /imagine command, parameters (e.g., --v for version, --ar for aspect ratio), upscaling, variations, style references.
How to Use: Join Midjourney Discord server, go to newbie channels, type /imagine prompt: [detailed description]. Refine with U/V buttons or remixing.
Strengths: Creative, cinematic styles; strong community.
Limitations: Discord-based; subscription for full access; learning curve for parameters.
Best For: Concept art, illustrations, fantasy/surreal images.

b. Adobe Firefly
Integrated generative AI in Adobe ecosystem, focused on professional, ethical (trained on licensed data) creation.

Key Features: Text-to-image/video, Generative Fill/Expand, Firefly Boards for ideation, integration with Photoshop/Express, brand kits, and agentic workflows (e.g., product videos).
How to Use: firefly.adobe.com or within Creative Cloud apps. Use prompts with references; edit non-destructively.
Strengths: Seamless Adobe workflow, commercial safety, high control for professionals.
Limitations: Best with Adobe subscription; credit-based usage.
Best For: Photo-realistic edits, marketing assets, professional design.

Tips: Craft detailed prompts (subject, style, lighting, composition). Use references for consistency.

3. Video AI

a. Veo (Google)
Google's advanced video generation model, integrated in Gemini/Vids, with native audio and high realism.

Key Features: Text/image-to-video, native audio/dialogue, physics simulation, style references, editing controls (e.g., Omni for multimodal).
How to Use: Via Gemini app or Google Vids. Prompt with cinematic terms (e.g., "timelapse, aerial shot").
Strengths: Realism, audio integration, control.
Limitations: Access via Google tools; potential generation limits.
Best For: Short clips, storytelling, social media.

b. Runway ML
Powerful platform for text-to-video, image-to-video, and advanced editing (Gen-3/Gen-4 models).

Key Features: Video-to-video, motion brush, keyframes, style presets, generative audio, workflows for consistency.
How to Use: runwayml.com. Upload images/videos, add prompts, adjust parameters like duration/motion.
Strengths: Cinematic control, professional tools for filmmakers.
Limitations: Learning curve; compute-intensive.
Best For: High-quality video production, VFX-style edits.

Tips: Start with short clips. Combine image AI outputs as starting frames.

4. Presentation AI

a. Gamma
AI-powered tool for rapid creation of presentations, documents, and webpages.

Key Features: Prompt-to-deck, file/URL import, templates, branding customization, multi-format export (PPT, PDF), media integration.
How to Use: gamma.app. Enter topic/prompt or upload content; AI generates structure, then customize cards/slides.
Strengths: Speed, visual polish, versatility beyond slides (websites/docs).
Limitations: May need manual tweaks for complex branding.
Best For: Quick professional decks, reports, interactive content.

5. Research AI

a. Perplexity
AI search engine with real-time web access, citations, and deep research modes.

Key Features: Search mode (fast answers with sources), Deep Research (multi-step reports), Labs for apps/files, model selection.
How to Use: perplexity.ai. Ask questions; use "Deep Research" for complex topics. Follow citations.
Strengths: Accurate, sourced info; reduces hallucinations.
Limitations: Less creative than pure LLMs.
Best For: Fact-checking, literature reviews, informed decision-making.

Module Assessment

Exercise 1: Content Creation Task
Choose a topic (e.g., "Create a marketing plan for a new eco-friendly product").

Generate outline/research with Perplexity/Claude.
Draft content with ChatGPT/Gemini.
Create visuals with Midjourney/Firefly.
Build a presentation with Gamma.
(Optional) Add video clip with Veo/Runway.
Compare outputs: quality, speed, accuracy, creativity. Note strengths/weaknesses.

Exercise 2: Image/Video Workflow

Prompt the same scene in Midjourney and Firefly.
Animate one output in Veo and Runway.
Evaluate consistency, realism, and ease of editing.

Exercise 3: Research + Presentation
Research a current topic with Perplexity. Summarize/refine with text AIs. Build deck in Gamma. Test in different tools and iterate.

Discussion/Reflection (15-30 min):

Which tool excelled for which part of the task?
Prompting lessons learned.
Ethical considerations (copyright, bias, disclosure).
Integration ideas (e.g., Firefly in Photoshop + Gamma export).

Additional Resources & Best Practices:

Prompt engineering: Be specific, iterative, use examples.
Combine tools: Perplexity for research → Claude for reasoning → Gamma for output.
Stay updated via official sites/docs, as features evolve rapidly.
Practice daily; track what works for your workflows.

Unit 4: Working with Popular AI Tools

TL;DR — Quick Summary

Lesson 4.1: Working with Popular AI Tools

Lesson 4.2: Popular tools AI Tools

1. Text AI

2. Image AI

3. Video AI

4. Presentation AI

5. Research AI

Module Assessment

Share Now

Resources

Lesson Contents

Unit 4: Working with Popular AI Tools

TL;DR — Quick Summary

Lesson 4.1: Working with Popular AI Tools

Lesson 4.2: Popular tools AI Tools

1. Text AI

2. Image AI

3. Video AI

4. Presentation AI

5. Research AI

Module Assessment

Share Now

Resources

Lesson Contents

Share Lesson