image-studio

Featured

Create images from a text description, or edit photos and graphics the user provides (remove backgrounds or watermarks, retouch, restyle, in-paint). Can produce multiple variants when the user wants options to choose from.

AI & Automation 954 stars 145 forks Updated today MIT

Install

View on GitHub

Quality Score: 92/100

Stars 20%

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

Use the `media_generate_image` tool via `skill_execute` to create or edit images. ## Modes - **generate** (default): Create a new image from a text prompt. - **edit**: Modify an existing image. Requires one or more source images via `source_paths`. ## Models Do not pass the `model` parameter unless you need a specific tier. Omitting it uses the configured default, which is correct for most requests. When you do need to choose, use an alias, not a concrete model ID. Aliases always resolve to the current model for that tier: - `fast`: quickest, good quality (default tier) - `quality`: higher fidelity, slower - `openai`: OpenAI's model; most permissive on photo edits Pass a concrete model ID only if the user names one explicitly. If the tool rejects an unknown model ID, the error lists the currently available models and aliases. ## Example calls Generate (no model parameter, default is correct): ```json { "tool": "media_generate_image", "input": { "prompt": "A sunset over the ocean, golden hour, soft haze, 35mm photo style", "variants": 2 } } ``` Edit: ```json { "tool": "media_generate_image", "input": { "prompt": "Remove the watermark text from the background. Keep the subject, framing, lighting, and colors exactly identical. Change nothing else.", "mode": "edit", "source_paths": ["conversations/<conv-id>/attachments/photo.jpeg"], "model": "openai" } } ``` `source_paths` is a flat array of file path strings. Do NOT pass objects: - Wrong: `"source_paths": [{ "path"...

Details

Author: vellum-ai
Repository: vellum-ai/vellum-assistant
Created: 5 months ago
Last Updated: today
Language: TypeScript
License: MIT

Integrates with

OpenAI · AI Anthropic · AI

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

image-generation

Generate and edit images via OpenAI's gpt-image-2 (ChatGPT Images 2.0) API. Use when the user asks to "generate an image", "create an image", "make a picture of X", "draw X", "edit this image", "remove the background from this image", "extend this image", "regenerate this image with Y", or invokes /image-generation. Outputs PNG/JPEG/WebP files to disk. Requires OPENAI_API_KEY in the environment.

2 Updated today

magnusrodseth

AI & Automation Featured

image-generation

Generate images and iteratively edit saved image artifacts.

46,322 Updated today

HKUDS

AI & Automation Featured

image-generation

Generate or edit images from text prompts. Use when the user asks to create, draw, design, or edit an image, illustration, photo, icon, poster, or any visual content.

46,174 Updated today

zhayujie