AI image generation for SEO assets: OG/social preview images, blog hero images, schema images, product photography, infographics. Powered by Gemini via nanobanana-mcp. Requires banana extension installed. Use when user says \"generate image\", \"OG image\", \"social preview\", \"hero image\", \"blog image\", \"product photo\", \"infographic\", \"seo image\", \"create visual\", \"image-gen\", \"favicon\", \"schema image\", \"pinterest pin\", \"generate visual\", \"banner\", or \"thumbnail\".
- 📁 references/
- 📁 scripts/
- 📁 tmp/
- 📄 SKILL.md
Generate PNG images using AI (multiple models via OpenRouter including Gemini, FLUX.2, Riverflow, SeedDream, GPT-5 Image, proxied through Cloudflare AI Gateway BYOK). Also analyze/describe existing images using multimodal AI vision. Use when user asks to "generate an image", "create a PNG", "make an icon", "make it transparent", "describe this image", "analyze this image", "what's in this image", "explain this image", or needs AI-generated visual assets for the project. Supports model selection via keywords (gemini, riverflow, flux2, seedream, gpt5), configurable aspect ratios/resolutions, transparent backgrounds (-t), reference image editing (-r), image analysis (--analyze), and per-project cost tracking (--costs).
Interact with IIB (Infinite Image Browsing) service for searching, browsing, tagging, and organizing AI-generated images. Use when the user needs to search images by prompt/keyword, manage image tags, organize files into folders, get image generation parameters, or work with an image library.
- 📁 references/
- 📁 scripts/
- 📄 REFERENCES.md
- 📄 SKILL.md
Generate or edit images using the Antigravity-hosted Gemini image model via the local gateway. Use when the user asks to create an image, generate an avatar, or edit/transform an existing image with text instructions. Supports text-to-image and image-to-image editing.
Produce a background-removed portrait image for a requested person and send it through Telegram. Primary surface: Google Chrome, Pixelmator Pro, Telegram. Inputs: Person or image search query, Telegram recipient/chat, Optional existing image file.... Trigger cues: Google Chrome Search | Google Chrome Person or image search query, Telegram recipient/chat, Optional existing image... | Produce a background-removed portrait image for a requested person and send it through Telegram.
Research a topic deeply, plan a blog step by step, and write a long-form article with structured Markdown outputs in the current working directory. Supports multilingual writing, source tracking, metadata generation, and optional image insertion via available image tools.
- 📁 references/
- 📁 scripts/
- 📄 SKILL.md
AI image generation Creative Director powered by Google Gemini Nano Banana models. Use this skill for ANY request involving image creation, editing, visual asset production, or creative direction. Triggers on: generate an image, create a photo, edit this picture, design a logo, make a banner, visual for my anything, and all /banana commands. Handles text-to-image, image editing, multi-turn creative sessions, batch workflows, and brand presets.
- 📁 references/
- 📁 scenarios/
- 📁 scripts/
- 📄 SKILL.md
Generate or edit images via backend Skywork Image API. Use for any image creation, poster design, logo design, visual asset generation, or image modification request. Supports text-to-image and image-to-image editing with aspect ratio and resolution control.
Iteratively fix a SwiftUI View to match a design image. Captures previews with axe, compares against the target design, and modifies the code until the UI matches. Use when the user provides a design image and wants the View adjusted to match it.
Use when the user wants OCR on images, screenshots, scans, receipts, diagrams, or image files; extract text from a local image path, image URL, or base64 image; convert OCR output to plain text, markdown table, structured JSON, or code comments; or rename, summarize, or post-process files based on recognized text. Prefer this skill for image-to-text workflows backed by the local ocrtool-mcp binary.
- 📁 meitu-tools/
- 📁 scripts/
- 📁 skills/
- 📄 .gitignore
- 📄 LICENSE
- 📄 package.json
Comprehensive Meitu AI toolkit for image and video editing. Features include AI poster design, precise background cutout, virtual try-on, e-commerce product swap, image upscaling and restoration, ID photo generation, smart object removal, portrait beauty enhancement, and motion-transfer dance videos. The ultimate creative assistant.
- 📁 .claude/
- 📁 db/
- 📁 mnt/
- 📄 .gitattributes
- 📄 .gitignore
- 📄 CHANGELOG.md
Use when the user asks which model to use, wants to compare models, asks about credit costs, or needs guidance selecting between any Higgsfield image or video model including Kling 3.0/Omni/Motion Control, Sora 2, Veo 3.1, WAN 2.5/2.6, Seedance Pro, Minimax Hailuo 02, Higgsfield DoP (Lite/Standard/Turbo), Soul 2.0, Soul Cinema Preview, Soul Cast, Nano Banana/Pro/2, Kling Image 3.0/Omni, Seedream 4.0, GPT Image 1.5, Flux 2/Kontext, Face Swap, Character Swap, or Photodump.