algorithm simulations and metrics #39

olderTaoist · 2021-05-21T01:33:08Z

i use hived on product environment with 20+ nivida V100， so i want to get some reporters on reducing fragmentation of GPU and job wait time like hived paper in fifth chapter，how can i simulations. meantime, Is there a plan to add some metrics of the hived scheduler???

olderTaoist · 2021-05-26T09:52:45Z

@fanyangCS

fanyangCS · 2021-05-28T03:59:16Z

hived is designed to remove external fragmentation, it relies on Intra-VC scheduler to reduce internal fragmentation.
we don't have a plan expose specific metrics, but we have APIs to query the current scheduling status, which can be used to derive metrics like fragmentation.

olderTaoist · 2021-06-03T01:57:01Z

hived is designed to remove external fragmentation, it relies on Intra-VC scheduler to reduce internal fragmentation.
we don't have a plan expose specific metrics, but we have APIs to query the current scheduling status, which can be used to derive metrics like fragmentation.

the APIs of scheduling status is /v1/inspect/clusterstatus/physicalcluster and /v1/inspect/clusterstatus/virtualclusters? the external fragmentation is mean that virutal cluster mapping to physical cluster？the Intra-VC scheduler is somewhere like Binpack scheduling one virtual cell by one。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

algorithm simulations and metrics #39

algorithm simulations and metrics #39

olderTaoist commented May 21, 2021

olderTaoist commented May 26, 2021

fanyangCS commented May 28, 2021

olderTaoist commented Jun 3, 2021

algorithm simulations and metrics #39

algorithm simulations and metrics #39

Comments

olderTaoist commented May 21, 2021

olderTaoist commented May 26, 2021

fanyangCS commented May 28, 2021

olderTaoist commented Jun 3, 2021