image-to-video

Solid

FFmpeg-based video creation from image and audio.

AI & Automation 392 stars 36 forks Updated today MIT

Install

View on GitHub

Quality Score: 93/100

Stars 20%
86
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Image to Video Skill Combine a static image with an audio file to produce an MP4 video using FFmpeg. Supports resolution presets (1080p, 720p, square, vertical), optional audio visualization overlays (waveform, spectrum, cqt, bars), and batch processing of matched image+audio pairs. For image generation, use `gemini-image-generator` instead. ## Reference Loading Table | Signal | Load These Files | Why | |---|---|---| | tasks related to this reference | `ffmpeg-filters.md` | Loads detailed guidance from `ffmpeg-filters.md`. | ## Instructions ### Phase 1: VALIDATE Confirm all prerequisites before attempting video creation. **Step 1: Check FFmpeg installation** Always run this check first -- many systems lack FFmpeg or have minimal builds, and skipping it produces confusing subprocess errors instead of clear install guidance. ```bash ffmpeg -version ``` If FFmpeg is not installed, provide platform-specific install instructions and stop. **Step 2: Verify input files exist** Both the image and audio files must be confirmed present before processing. Use absolute paths for all arguments -- relative paths break silently when the script executes from a different working directory. ```bash ls -la /absolute/path/to/image.png /absolute/path/to/audio.mp3 ``` Confirm both files exist and have non-zero size. Supported formats: - **Images**: PNG, JPG, JPEG, GIF, WEBP, BMP - **Audio**: MP3, WAV, M4A, OGG, FLAC **Step 3: Determine parameters** Re-read the user's request befo...

Details

Author
notque
Repository
notque/vexjoy-agent
Created
2 months ago
Last Updated
today
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

gemini-image-generator

Generate images from text prompts via Google Gemini.

392 Updated today
notque
AI & Automation Solid

video-editing

Video editing pipeline: cut footage, assemble clips via FFmpeg and Remotion.

392 Updated today
notque
Code & Development Listed

video-processor

Process video files with audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions video conversion, audio extraction, transcription, mp4, webm, ffmpeg, or whisper transcription.

335 Updated today
aiskillstore
AI & Automation Listed

image-to-video

Still-to-video conversion guide: model selection, motion prompting, and camera movement. Covers Wan 2.5 i2v, Seedance, Fabric, Grok Video with when to use each. Use for: animating images, creating video from stills, adding motion, product animations. Triggers: image to video, i2v, animate image, still to video, add motion to image, image animation, photo to video, animate still, wan i2v, image2video, bring image to life, animate photo, motion from image

335 Updated today
aiskillstore
AI & Automation Listed

video-loop

Create a looping background video — crossfade the loop point, encode optimized MP4 + WebM, optional poster frame. Use whenever the user has a video that needs to loop for web use (hero sections, backgrounds, landing-page ambience), wants to fix a visible jump at the loop point, or needs to optimize a clip for web delivery — even if they just say "make this loop smoothly" or "compress this for the site".

0 Updated 3 days ago
coroboros