oma-image

Solid

Multi-vendor AI image generation with authentication-aware parallel dispatch. Routes to Codex (gpt-image-2 via ChatGPT OAuth), Antigravity (gemini-2.5-flash-image aka nano-banana via `agy` CLI + Gemini Code Assist), and Pollinations (flux/zimage, free with signup). Use for image generation, image creation, visual asset generation, and AI art.

AI & Automation | 1,042 stars | 119 forks | Updated today | MIT

Quality Score: 93/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# Image Agent - Multi-Vendor Image Router

## Scheduling

### Goal

Generate images and visual assets through authenticated multi-vendor routing while preserving prompt clarity, reference-image handling, cost controls, and reproducible output manifests.

### Intent signature

- User asks to generate images, visual assets, illustrations, product photos, concept art, mockups, or AI art.
- Another skill needs shared image-generation infrastructure.
- User provides reference images or asks for vendor comparison.

### When to use

- Generating images, visual assets, illustrations, product photos, concept art
- Comparing output between multiple image models for the same prompt
- Producing images from prompts within editor workflows (Claude Code, Codex, Gemini CLI)
- Other skills needing image generation infrastructure (shared invocation)

### When NOT to use

- Editing an existing image or photo manipulation -> out of scope
- Generating videos or audio -> out of scope
- Inline vector art / SVG composition from structured data -> use a templating skill
- Simple asset resizing or format conversion -> use a dedicated image library

### Expected inputs

- Image prompt or creative brief
- Optional vendor, size, quality, count, output directory, and reference images
- Authentication/environment state for Codex, Pollinations, or Gemini

### Expected outputs

- Generated image files under `.agents/results/images/` or requested output directory
- `manifest.json` with prompt, vendor, model, and ...
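
The authentication-aware parallel dispatch described above can be illustrated with a minimal sketch. It is not the skill's actual implementation: the `Vendor` type, the `dispatch` function, the result fields, and the stub backends are all hypothetical names chosen for illustration, and the example is written in Python for brevity although the repository itself is TypeScript. The idea is to skip vendors that are not authenticated, run the remaining ones concurrently, and keep one vendor's failure from affecting the others.

```python
from __future__ import annotations

from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Vendor:
    """Hypothetical vendor descriptor; not the skill's real interface."""
    name: str
    is_authenticated: Callable[[], bool]  # e.g. checks a token or env var
    generate: Callable[[str], bytes]      # returns image bytes for a prompt


def dispatch(prompt: str, vendors: list[Vendor]) -> dict[str, dict]:
    """Run the prompt on every authenticated vendor in parallel.

    Each vendor gets one entry whose status is "ok", "error", or "skipped".
    """
    results: dict[str, dict] = {}
    ready = []
    for v in vendors:
        if v.is_authenticated():
            ready.append(v)
        else:
            results[v.name] = {"status": "skipped", "reason": "not authenticated"}

    if not ready:
        return results

    with ThreadPoolExecutor(max_workers=len(ready)) as pool:
        futures = {v.name: pool.submit(v.generate, prompt) for v in ready}
        for name, fut in futures.items():
            try:
                results[name] = {"status": "ok", "bytes": len(fut.result())}
            except Exception as exc:  # one vendor failing must not sink the rest
                results[name] = {"status": "error", "reason": str(exc)}
    return results


def _fail(prompt: str) -> bytes:
    """Stub backend that always fails, to exercise the error path."""
    raise RuntimeError("quota exceeded")


# Stub vendors standing in for real backends (no network calls are made).
demo = [
    Vendor("codex", lambda: True, lambda p: b"png:" + p.encode()),
    Vendor("antigravity", lambda: False, lambda p: b""),
    Vendor("pollinations", lambda: True, _fail),
]
outcome = dispatch("a red fox", demo)
```

In this sketch the unauthenticated vendor is reported as skipped rather than attempted, which avoids spending time or quota on calls that cannot succeed, and the per-vendor status record is the kind of information a run manifest could carry.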

Details

Author: first-fluke
Repository: first-fluke/oh-my-agent
Created: 4 months ago
Last Updated: today
Language: TypeScript
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

imagegen

Generates or edits images for the current project (website assets, game assets, UI mockups, product mockups, wireframes, logos, photorealistic scenes, infographics) using gpt-image-2 via the local Codex CLI authenticated with the user's ChatGPT subscription. Use when the user asks to generate, edit, restyle, mockup, or remix images, or mentions gpt-image-2 / GPT Image 2 / ChatGPT Images 2.0.

1 Updated today
Firzus
AI & Automation Solid

imagegen

Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.

2,210 Updated 1 week ago
foryourhealth111-pixel
AI & Automation Solid

imagegen

Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.

27,705 Updated today
davila7
AI & Automation Listed

imagegen

Use when the user asks to generate or edit images via the OpenAI Image API (for example: generate image, edit/inpaint/mask, background removal or replacement, transparent background, product shots, concept art, covers, or batch variants); run the bundled CLI (`scripts/image_gen.py`) and require `OPENAI_API_KEY` for live calls.

1 Updated today
HGGodhand33
AI & Automation Solid

ai-image-generator

Generate AI images using Gemini or GPT APIs directly. Covers model selection (Gemini for scenes, GPT for transparent icons), the 5-part prompting framework, API calling patterns, multi-turn editing, and quality assurance. Produces photorealistic scenes, icons, illustrations, OG images, and product shots. Use when building websites that need images, creating marketing assets, or generating visual content. Triggers: 'generate image', 'ai image', 'create hero image', 'make an icon', 'generate illustration', 'create og image', 'ai art', 'image generation'.

813 Updated 2 weeks ago
jezweb