Leaf decoration

Working with Popular AI Tools

Unit: 4
Book Icon

Generative AI & Prompt Engineering

This chapter explores essential AI platforms transforming creative and professional workflows. Learners gain practical skills with leading text tools...

AI-Powered
TL;DR — Quick Summary
Click Generate Summary to get a quick AI-powered overview of this chapter.
Gemini is reading the chapter...
    Could not generate summary. Please try again.
    Explain This
    AI Explanation
    Explaining...

    Could not explain. Try again.
    MCQ Practice

    Lesson 4.1: Working with Popular AI Tools

    This module provides hands-on exploration of leading AI tools across categories. You'll learn their core features, strengths, limitations, access methods, best practices, and real-world applications. The focus is practical proficiency and understanding when to choose one tool over another.

    Learning Objectives:

    • Understand capabilities and interfaces of major AI tools.
    • Compare tools for text, image, video, presentation, and research tasks.
    • Apply tools effectively through prompting and workflow integration.
    • Complete cross-tool exercises to evaluate outputs and select the best fit.

    Lesson 4.2: Popular tools AI Tools

    1. Text AI

    a. ChatGPT (OpenAI)
    ChatGPT is a versatile conversational AI with strong general capabilities, multimodal support (text, images, voice), and features like memory, Canvas for editing, Deep Research, file analysis, and advanced models (e.g., GPT-5.x series in 2026).

    • Key Features (2026): Model selection (e.g., reasoning vs. fast models), web search integration, voice mode, Projects for file management, Codex for coding, image/video generation, persistent memory, and integrations (Slack, etc.). Large pastes as attachments; advanced data analysis.
    • How to Use: Access via chatgpt.com or apps. Start with clear prompts; use "Thinking" mode for complex tasks. Upload files for analysis. Enable Canvas for iterative editing.
    • Strengths: Balanced performance, voice, broad integrations, value in Plus/Pro plans. Good for creative writing, coding, and research.
    • Limitations: Can hallucinate without search; usage limits on free tier.
    • Best For: Everyday tasks, brainstorming, content generation.

    b. Gemini (Google)
    Google's multimodal AI excels in integration with Google Workspace (Gmail, Docs, Sheets), real-time search, image/video generation (via models like Veo/Omni), and analysis of files/tabs.

    • Key Features: Canvas mode, Gems (custom experts), deep integration with Google apps, video generation/editing (Gemini Omni), image creation, and proactive task automation.
    • How to Use: gemini.google.com or mobile app. Leverage context from Drive/Gmail. Use for research with live web data.
    • Strengths: Strong multimodal (audio/video analysis), Google ecosystem, speed in Flash models.
    • Limitations: Coding sometimes weaker than competitors.
    • Best For: Productivity in Google apps, visual/media tasks, research.

    c. Claude (Anthropic)
    Claude prioritizes safety, reasoning, and reliability. It shines in complex tasks, coding, long-context analysis, and structured outputs. Features like Artifacts, Projects, and computer use.

    • Key Features (2026): Opus models for advanced reasoning/coding, file handling, vision, structured responses, and tools like Claude Code/Design. Strong instruction-following.
    • How to Use: claude.ai. Upload documents for analysis; use Artifacts for interactive previews.
    • Strengths: Excellent for deep reasoning, coding, nuanced writing, and following detailed prompts.
    • Limitations: May be more cautious/restrictive; higher cost for heavy use.
    • Best For: Professional writing, coding, analysis requiring accuracy.

    Tips for Text AI: Use role-playing in prompts (e.g., "Act as a expert editor"). Chain tools: Research in Perplexity/Claude, refine in ChatGPT/Gemini.

    2. Image AI

    a. Midjourney
    A leading text-to-image tool known for artistic, high-quality outputs. Primarily accessed via Discord.

    • Key Features: /imagine command, parameters (e.g., --v for version, --ar for aspect ratio), upscaling, variations, style references.
    • How to Use: Join Midjourney Discord server, go to newbie channels, type /imagine prompt: [detailed description]. Refine with U/V buttons or remixing.
    • Strengths: Creative, cinematic styles; strong community.
    • Limitations: Discord-based; subscription for full access; learning curve for parameters.
    • Best For: Concept art, illustrations, fantasy/surreal images.

    b. Adobe Firefly
    Integrated generative AI in Adobe ecosystem, focused on professional, ethical (trained on licensed data) creation.

    • Key Features: Text-to-image/video, Generative Fill/Expand, Firefly Boards for ideation, integration with Photoshop/Express, brand kits, and agentic workflows (e.g., product videos).
    • How to Use: firefly.adobe.com or within Creative Cloud apps. Use prompts with references; edit non-destructively.
    • Strengths: Seamless Adobe workflow, commercial safety, high control for professionals.
    • Limitations: Best with Adobe subscription; credit-based usage.
    • Best For: Photo-realistic edits, marketing assets, professional design.

    Tips: Craft detailed prompts (subject, style, lighting, composition). Use references for consistency.

    3. Video AI

    a. Veo (Google)
    Google's advanced video generation model, integrated in Gemini/Vids, with native audio and high realism.

    • Key Features: Text/image-to-video, native audio/dialogue, physics simulation, style references, editing controls (e.g., Omni for multimodal).
    • How to Use: Via Gemini app or Google Vids. Prompt with cinematic terms (e.g., "timelapse, aerial shot").
    • Strengths: Realism, audio integration, control.
    • Limitations: Access via Google tools; potential generation limits.
    • Best For: Short clips, storytelling, social media.

    b. Runway ML
    Powerful platform for text-to-video, image-to-video, and advanced editing (Gen-3/Gen-4 models).

    • Key Features: Video-to-video, motion brush, keyframes, style presets, generative audio, workflows for consistency.
    • How to Use: runwayml.com. Upload images/videos, add prompts, adjust parameters like duration/motion.
    • Strengths: Cinematic control, professional tools for filmmakers.
    • Limitations: Learning curve; compute-intensive.
    • Best For: High-quality video production, VFX-style edits.

    Tips: Start with short clips. Combine image AI outputs as starting frames.

    4. Presentation AI

    a. Gamma
    AI-powered tool for rapid creation of presentations, documents, and webpages.

    • Key Features: Prompt-to-deck, file/URL import, templates, branding customization, multi-format export (PPT, PDF), media integration.
    • How to Use: gamma.app. Enter topic/prompt or upload content; AI generates structure, then customize cards/slides.
    • Strengths: Speed, visual polish, versatility beyond slides (websites/docs).
    • Limitations: May need manual tweaks for complex branding.
    • Best For: Quick professional decks, reports, interactive content.

    5. Research AI

    a. Perplexity
    AI search engine with real-time web access, citations, and deep research modes.

    • Key Features: Search mode (fast answers with sources), Deep Research (multi-step reports), Labs for apps/files, model selection.
    • How to Use: perplexity.ai. Ask questions; use "Deep Research" for complex topics. Follow citations.
    • Strengths: Accurate, sourced info; reduces hallucinations.
    • Limitations: Less creative than pure LLMs.
    • Best For: Fact-checking, literature reviews, informed decision-making.

    Module Assessment

    Exercise 1: Content Creation Task
    Choose a topic (e.g., "Create a marketing plan for a new eco-friendly product").

    • Generate outline/research with Perplexity/Claude.
    • Draft content with ChatGPT/Gemini.
    • Create visuals with Midjourney/Firefly.
    • Build a presentation with Gamma.
    • (Optional) Add video clip with Veo/Runway.
      Compare outputs: quality, speed, accuracy, creativity. Note strengths/weaknesses.

    Exercise 2: Image/Video Workflow

    • Prompt the same scene in Midjourney and Firefly.
    • Animate one output in Veo and Runway.
    • Evaluate consistency, realism, and ease of editing.

    Exercise 3: Research + Presentation
    Research a current topic with Perplexity. Summarize/refine with text AIs. Build deck in Gamma. Test in different tools and iterate.

    Discussion/Reflection (15-30 min):

    • Which tool excelled for which part of the task?
    • Prompting lessons learned.
    • Ethical considerations (copyright, bias, disclosure).
    • Integration ideas (e.g., Firefly in Photoshop + Gamma export).

    Additional Resources & Best Practices:

    • Prompt engineering: Be specific, iterative, use examples.
    • Combine tools: Perplexity for research → Claude for reasoning → Gamma for output.
    • Stay updated via official sites/docs, as features evolve rapidly.
    • Practice daily; track what works for your workflows.

    Share Now

    Share to help more learners!

    Resources
    Lesson Contents