The Kubeflow Training Operator allows users to easily distribute the training
process of PyTorch models. Users can create and submit
Resources (CRs), and manage PyTorch jobs like other built-in resources in
Kale provides a simple way to translate your Python code into a
CR and a client which facilitates the monitoring and management of the running
This API is in beta and is subject to change.
What You’ll Need¶
- An Arrikto EKF or MiniKF deployment.