media-processing
SolidIngest and process media files (video, audio, image)
Install
Quality Score: 91/100
Skill Content
Details
- Author
- vellum-ai
- Repository
- vellum-ai/vellum-assistant
- Created
- 4 months ago
- Last Updated
- today
- Language
- TypeScript
- License
- MIT
Integrates with
Similar Skills
Semantically similar based on skill content — not just same category
media-processor
Process multimedia content — audio transcription, video analysis, PDF data extraction, image generation. Use for deeper image analysis when implementing from UI designs, analyzing charts for data, reading dense screenshots, or studying artworks and visual references.
media-memory
Multimodal long-term memory. Ingest, describe, embed (Gemini Embedding 2 / gemini-embedding-001), and search any media (images, video, audio, documents) stored under /media-memory. Supports semantic similarity search plus structured metadata filtering by type, source, date range, and tag. Use when the user shares any media file, when the assistant generates any media, or when a past asset might be relevant to the current task. Triggers on: log this image, save this media, find that screenshot, do we have a recording of, search media memory, what was that file about.
metamedia
Multimodal memory — ingest, embed, and search media (images, video, audio, files) with Gemini Embedding 2 + ChromaDB