videodb

Featured

See, Understand, Act on video and audio. See- ingest from local files, URLs, RTSP/live feeds, or live record desktop; return realtime context and playable stream links. Understand- extract frames, build visual/semantic/temporal indexes, and search moments with timestamps and auto-clips. Act- transcode and normalize (codec, fps, resolution, aspect ratio), perform timeline edits (subtitles, text/image overlays, branding, audio overlays, dubbing, translation), generate media assets (image, audio, video), and create real time alerts for events from live streams or desktop capture.

AI & Automation 201,447 stars 30903 forks Updated yesterday MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# VideoDB Skill **Perception + memory + actions for video, live streams, and desktop sessions.** ## When to use ### Desktop Perception - Start/stop a **desktop session** capturing **screen, mic, and system audio** - Stream **live context** and store **episodic session memory** - Run **real-time alerts/triggers** on what's spoken and what's happening on screen - Produce **session summaries**, a searchable timeline, and **playable evidence links** ### Video ingest + stream - Ingest a **file or URL** and return a **playable web stream link** - Transcode/normalize: **codec, bitrate, fps, resolution, aspect ratio** ### Index + search (timestamps + evidence) - Build **visual**, **spoken**, and **keyword** indexes - Search and return exact moments with **timestamps** and **playable evidence** - Auto-create **clips** from search results ### Timeline editing + generation - Subtitles: **generate**, **translate**, **burn-in** - Overlays: **text/image/branding**, motion captions - Audio: **background music**, **voiceover**, **dubbing** - Programmatic composition and exports via **timeline operations** ### Live streams (RTSP) + monitoring - Connect **RTSP/live feeds** - Run **real-time visual and spoken understanding** and emit **events/alerts** for monitoring workflows ## How it works ### Common inputs - Local **file path**, public **URL**, or **RTSP URL** - Desktop capture request: **start / stop / summarize session** - Desired operations: get context for understanding, transco...

Details

Author
affaan-m
Repository
affaan-m/everything-claude-code
Created
4 months ago
Last Updated
yesterday
Language
JavaScript
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category