What did you build?
One sentence. What is the output?
Livepeer Workflow Kit — the first framework that lets AI agents compose and run Livepeer media workflows — plus two new open-source Livepeer runners (diarized/streaming audio transcription and Florence-2 vision) that are deployed as live capabilities on the Livepeer Modules Gateway.
Why does it matter?
One to two sentences. What problem does it solve, and for whom on the network?
It is the first outside application to integrate Mike Zoop’s DAO-funded Livepeer Modules Gateway, and it breaks Livepeer out of being primarily for media creators into a media-intelligence backend for anyone driving an agent through everyday or professional tasks. This framework enables users/agents to use Livepeer to address tasks like:
- “Extract the slides from this video,”
- “Capture and transcribe this Google Meet,”
- “Cut these home videos into scenes.”
Because Roboflow is the workflow core, every new Livepeer module becomes a reusable block, so the framework — and the network’s reach into agent ecosystems like OpenClaw and Hermes — scales without rewrites; the two new runners also give orchestrators reusable diarized-transcription and vision capabilities that are essential primitives for agentic tasks.
Link to the work
Direct link to the merged PR, deployed tool, published docs, or equivalent. No link = no review.
- Livepeer Workflow Kit (open source, Apache-2.0) — GitHub - moatus/livepeer-workflow-kit · GitHub
- Audio diarized transcription runner — GitHub - moatus/audio-diarized-transcription-runner · GitHub
- Florence-2 vision runner — GitHub - moatus/florence-2-runner · GitHub
- Both runners live in the Livepeer Open Clearinghouse catalog — Livepeer Open Clearinghouse · Portal
Evidence of impact
How is this being used? Who benefits? Quantify where possible (e.g. number of users, PRs merged, integrations adopted).
- First outside application integrated with the Livepeer Modules Gateway: first Roboflow → Livepeer gateway calls succeeded, with paid usage and billing observed through the gateway.
- 2 new network runners deployed and published live by orchestrators (
openai:audio-transcriptionsdiarized/streaming model;florence-2vision), reusable by any caller via OpenAI- and Roboflow-compatible APIs. - Agent-native: given the framework with no example, multiple different agents independently figured out how to capture and transcribe a meeting from a one-line “take this stream and get this data” prompt.
- Beneficiaries: agent/end users gain Livepeer access for use in everyday task based workflows; the gateway operator gains a first live outside integration with paid usage; orchestrators gain two new deployable runner capabilities.
Community proof points
Link to the Discord thread or other signal. Reminder: you should also get 2-3 Orchestrators commenting their support for this work.
- Watercooler Call Presentation May 26th — https://www.youtube.com/live/A48k95jH21M?si=aRpU0w634Nho3I04&t=1143
- Roadmap Session — Discord
- Suggestions Thread — Discord
- Watercooler Call Presentation June 2nd — https://www.youtube.com/live/dhi0Qe3Q7YA?si=ffsuPUbIxqIQcRy2&t=2174
- Project Thread — Discord
- Watercooler Call Presentation June 16th (Demo #1) — https://www.youtube.com/live/IOvOIOKiVxY?si=hJBg7Hs-ufrK32pF&t=2004
- Open Source Livepeer runners (collaboration with Mike Zoop) — GitHub - moatus/florence-2-runner · GitHub and GitHub - moatus/audio-diarized-transcription-runner · GitHub
- Watercooler Call June 23rd (Demo #2) — https://www.youtube.com/live/_QYsoTqaXOE?si=mo-bLxsypg4havpy&t=582
- Open Source Livepeer Workflow Kit (collaboration with Mike Zoop)— GitHub - moatus/livepeer-workflow-kit · GitHub
Amount requested
USD-equivalent (max $5,000). Include a brief breakdown if the amount is above $2,000.
$5,000 USD-equivalent in LPT. Development work breakdown:
- Livepeer Workflow Kit (85%)
- Core framework
- Livepeer-aware Roboflow blocks
- CloudSPE integration
- Ingest paths
- Session runner
- Agent skill
- Audio diarized transcription runner (10%) —
openai:audio-transcriptionscapability- NeMo diarized ASR
- OpenAI-compatible bounded transcription
- True streaming (WebSocket) transcription
- Speaker diarization with segment/word timestamps
- Florence-2 vision runner (5%) —
florence-2vision capability- Florence-2 screen, slide, image, and visual-text understanding
- OpenAI-compatible vision chat route
- Direct vision analysis route
- Roboflow LMM-compatible routes