TL;DR — Quick Summary
Lesson 4.1: Working with Popular AI Tools
This module provides hands-on exploration of leading AI tools across categories. You'll learn their core features, strengths, limitations, access methods, best practices, and real-world applications. The focus is practical proficiency and understanding when to choose one tool over another.
Learning Objectives:
- Understand capabilities and interfaces of major AI tools.
- Compare tools for text, image, video, presentation, and research tasks.
- Apply tools effectively through prompting and workflow integration.
- Complete cross-tool exercises to evaluate outputs and select the best fit.
Lesson 4.2: Popular tools AI Tools
1. Text AI
a. ChatGPT (OpenAI)
ChatGPT is a versatile conversational AI with strong general capabilities, multimodal support (text, images, voice), and features like memory, Canvas for editing, Deep Research, file analysis, and advanced models (e.g., GPT-5.x series in 2026).
- Key Features (2026): Model selection (e.g., reasoning vs. fast models), web search integration, voice mode, Projects for file management, Codex for coding, image/video generation, persistent memory, and integrations (Slack, etc.). Large pastes as attachments; advanced data analysis.
- How to Use: Access via chatgpt.com or apps. Start with clear prompts; use "Thinking" mode for complex tasks. Upload files for analysis. Enable Canvas for iterative editing.
- Strengths: Balanced performance, voice, broad integrations, value in Plus/Pro plans. Good for creative writing, coding, and research.
- Limitations: Can hallucinate without search; usage limits on free tier.
- Best For: Everyday tasks, brainstorming, content generation.
b. Gemini (Google)
Google's multimodal AI excels in integration with Google Workspace (Gmail, Docs, Sheets), real-time search, image/video generation (via models like Veo/Omni), and analysis of files/tabs.
- Key Features: Canvas mode, Gems (custom experts), deep integration with Google apps, video generation/editing (Gemini Omni), image creation, and proactive task automation.
- How to Use: gemini.google.com or mobile app. Leverage context from Drive/Gmail. Use for research with live web data.
- Strengths: Strong multimodal (audio/video analysis), Google ecosystem, speed in Flash models.
- Limitations: Coding sometimes weaker than competitors.
- Best For: Productivity in Google apps, visual/media tasks, research.
c. Claude (Anthropic)
Claude prioritizes safety, reasoning, and reliability. It shines in complex tasks, coding, long-context analysis, and structured outputs. Features like Artifacts, Projects, and computer use.
- Key Features (2026): Opus models for advanced reasoning/coding, file handling, vision, structured responses, and tools like Claude Code/Design. Strong instruction-following.
- How to Use: claude.ai. Upload documents for analysis; use Artifacts for interactive previews.
- Strengths: Excellent for deep reasoning, coding, nuanced writing, and following detailed prompts.
- Limitations: May be more cautious/restrictive; higher cost for heavy use.
- Best For: Professional writing, coding, analysis requiring accuracy.
Tips for Text AI: Use role-playing in prompts (e.g., "Act as a expert editor"). Chain tools: Research in Perplexity/Claude, refine in ChatGPT/Gemini.
2. Image AI
a. Midjourney
A leading text-to-image tool known for artistic, high-quality outputs. Primarily accessed via Discord.
- Key Features: /imagine command, parameters (e.g., --v for version, --ar for aspect ratio), upscaling, variations, style references.
- How to Use: Join Midjourney Discord server, go to newbie channels, type
/imagine prompt: [detailed description]. Refine with U/V buttons or remixing. - Strengths: Creative, cinematic styles; strong community.
- Limitations: Discord-based; subscription for full access; learning curve for parameters.
- Best For: Concept art, illustrations, fantasy/surreal images.
b. Adobe Firefly
Integrated generative AI in Adobe ecosystem, focused on professional, ethical (trained on licensed data) creation.
- Key Features: Text-to-image/video, Generative Fill/Expand, Firefly Boards for ideation, integration with Photoshop/Express, brand kits, and agentic workflows (e.g., product videos).
- How to Use: firefly.adobe.com or within Creative Cloud apps. Use prompts with references; edit non-destructively.
- Strengths: Seamless Adobe workflow, commercial safety, high control for professionals.
- Limitations: Best with Adobe subscription; credit-based usage.
- Best For: Photo-realistic edits, marketing assets, professional design.
Tips: Craft detailed prompts (subject, style, lighting, composition). Use references for consistency.
3. Video AI
a. Veo (Google)
Google's advanced video generation model, integrated in Gemini/Vids, with native audio and high realism.
- Key Features: Text/image-to-video, native audio/dialogue, physics simulation, style references, editing controls (e.g., Omni for multimodal).
- How to Use: Via Gemini app or Google Vids. Prompt with cinematic terms (e.g., "timelapse, aerial shot").
- Strengths: Realism, audio integration, control.
- Limitations: Access via Google tools; potential generation limits.
- Best For: Short clips, storytelling, social media.
b. Runway ML
Powerful platform for text-to-video, image-to-video, and advanced editing (Gen-3/Gen-4 models).
- Key Features: Video-to-video, motion brush, keyframes, style presets, generative audio, workflows for consistency.
- How to Use: runwayml.com. Upload images/videos, add prompts, adjust parameters like duration/motion.
- Strengths: Cinematic control, professional tools for filmmakers.
- Limitations: Learning curve; compute-intensive.
- Best For: High-quality video production, VFX-style edits.
Tips: Start with short clips. Combine image AI outputs as starting frames.
4. Presentation AI
a. Gamma
AI-powered tool for rapid creation of presentations, documents, and webpages.
- Key Features: Prompt-to-deck, file/URL import, templates, branding customization, multi-format export (PPT, PDF), media integration.
- How to Use: gamma.app. Enter topic/prompt or upload content; AI generates structure, then customize cards/slides.
- Strengths: Speed, visual polish, versatility beyond slides (websites/docs).
- Limitations: May need manual tweaks for complex branding.
- Best For: Quick professional decks, reports, interactive content.
5. Research AI
a. Perplexity
AI search engine with real-time web access, citations, and deep research modes.
- Key Features: Search mode (fast answers with sources), Deep Research (multi-step reports), Labs for apps/files, model selection.
- How to Use: perplexity.ai. Ask questions; use "Deep Research" for complex topics. Follow citations.
- Strengths: Accurate, sourced info; reduces hallucinations.
- Limitations: Less creative than pure LLMs.
- Best For: Fact-checking, literature reviews, informed decision-making.
Module Assessment
Exercise 1: Content Creation Task
Choose a topic (e.g., "Create a marketing plan for a new eco-friendly product").
- Generate outline/research with Perplexity/Claude.
- Draft content with ChatGPT/Gemini.
- Create visuals with Midjourney/Firefly.
- Build a presentation with Gamma.
- (Optional) Add video clip with Veo/Runway.
Compare outputs: quality, speed, accuracy, creativity. Note strengths/weaknesses.
Exercise 2: Image/Video Workflow
- Prompt the same scene in Midjourney and Firefly.
- Animate one output in Veo and Runway.
- Evaluate consistency, realism, and ease of editing.
Exercise 3: Research + Presentation
Research a current topic with Perplexity. Summarize/refine with text AIs. Build deck in Gamma. Test in different tools and iterate.
Discussion/Reflection (15-30 min):
- Which tool excelled for which part of the task?
- Prompting lessons learned.
- Ethical considerations (copyright, bias, disclosure).
- Integration ideas (e.g., Firefly in Photoshop + Gamma export).
Additional Resources & Best Practices:
- Prompt engineering: Be specific, iterative, use examples.
- Combine tools: Perplexity for research → Claude for reasoning → Gamma for output.
- Stay updated via official sites/docs, as features evolve rapidly.
- Practice daily; track what works for your workflows.