A common metric all Orchestrators wish they had access to is demand-by-region and current avg and max streams + number of broadcasters by region. These appear innocuous enough, but starting a community driven effort to collate this bears scrutiny, discussion and a majority agreement within the Orchestrator community.
Having discussed issues around this with other Orchestrator’s (O’s), it appears, besides the two metrics above, we all seek different metrics from the Livepeer network. Thankfully Grafana’s remote write feature enables zero trust and selective metric sharing and is part of the stack used by all O’s today.
-
A minimal set that can answer the two most sought after metrics (above) should be all required of any orchestrator to opt-in. Any other metrics are optional. No metric should require root access or new untrusted binaries to collect. No O should be expected to open a port to share metrics.
-
This effort should be “By Orchestrator’s, For Orchestrator’s”.
O’s should decide what information casual visitors to this site are allowed to view, it should be anonymized and geo-location accuracy reduced by 50-100 miles.
Only O’s should be able to login (Metamask?) and see the rest of the metrics, with accuracy.
This effort should not be collecting this data to share or sell to anybody, ever.
It should collect the least metrics by default, enabling O’s to share more if they so desire. -
If this is to serve as a community driven effort, we ought to collect metrics from sources other than the
livepeer
binary, like the kernel / NIC driver etc. and ensure that some metric types, common to all of them, do indeed converge as expected.
For example, iflivepeer
reports it has transcoded 1 hour of video (10 streams at 1080p + 10 streams at 720p), the amount of traffic reported on the Orchestrator port reported by the kernel / NIC driver should converge with the data extrapolated fromlivepeer
.
The data transferred should be within a 1-2% error margin as reported by all sources.
Introducing Prometheus Agent Mode, an Efficient and Cloud-Native Way for Metric Forwarding | Prometheus is an interesting read that touches on some of the challenges and pitfalls for a global effort like this.