Graphcore FAQ
Graphcore Questions
How do I delete a running/terminated pod?
IPUJobs
manages the launcher and worker pods
, therefore the pods will be deleted when the IPUJob
is deleted, using kubectl delete ipujobs <IPUJob-name>
. If only the pod
is deleted via kubectl delete pod
, the IPUJob
may respawn the pod
.
To see running or terminated IPUJobs
, run kubectl get ipujobs
.
My IPUJob died with a message: 'poptorch_cpp_error': Failed to acquire X IPU(s)
. Why?
This error may appear when the IPUJob name is too long.
We have identified that for IPUJobs with metadata:name
length over 36 characters, this error may appear. A solution is to reduce the name to under 36 characters.