Train and Deploy on GCP from a Local Notebook

Train and Deploy on GCP from a Local Notebook

Use Kubeflow Fairing to train and deploy a model on Google Cloud Platform (GCP) from a local notebook.

This guide introduces you to using Kubeflow Fairing to train and deploy a model to Kubeflow on Google Kubernetes Engine (GKE), and Google Cloud ML Engine. As an example, this guide uses a local notebook to demonstrate how to:

Train an XGBoost model in a local notebook,
Use Kubeflow Fairing to train an XGBoost model remotely on Kubeflow,
Use Kubeflow Fairing to train an XGBoost model remotely on Cloud ML Engine,
Use Kubeflow Fairing to deploy a trained model to Kubeflow, and
Call the deployed endpoint for predictions.

This guide has been tested on Linux and Mac OS X. Currently, this guide has not been tested on Windows.

Clone the Kubeflow Fairing repository

Clone the Kubeflow Fairing repository to download the files used in this example.

git clone https://github.com/kubeflow/fairing 
cd fairing

Set up Python, Jupyter Notebook, and Kubeflow Fairing

You need Python 3.6 or later to use Kubeflow Fairing. To check if you have Python 3.6 or later installed, run the following command:
```
python3 -V
```
The response should be something like this:
```
Python 3.6.5
```
If you do not have Python 3.6 or later, you can download Python from the Python Software Foundation.
Use virtualenv to create a virtual environment to install Kubeflow Fairing in. To check if you have virtualenv installed, run the following command:
```
which virtualenv
```
The response should be something like this.
```
/usr/bin/virtualenv
```
If you do not have virtualenv, use pip3 to install virtualenv.
```
pip3 install --upgrade virtualenv
```
Create a new virtual environment, and activate it.
```
virtualenv venv --python=python3
source venv/bin/activate
```
Install Jupyter Notebook.
```
pip3 install --upgrade jupyter
```
Install Kubeflow Fairing from the cloned repository.
```
pip3 install --upgrade .
```
Install the Python dependencies for the XGBoost demo notebook.
```
pip3 install -r examples/prediction/requirements.txt
```

Install and configure the Google Cloud SDK

In order to use Kubeflow Fairing to train or deploy to Kubeflow on GKE, or Cloud Machine Learning Engine, you must configure your development environment with access to GCP.

If you do not have the Cloud SDK installed, install the Cloud SDK.

Use gcloud to set a default project.

export PROJECT_ID=<your-project-id>
gcloud config set project ${PROJECT_ID}

Kubeflow Fairing needs a service account to make API calls to GCP. The recommended way to provide Fairing with access to this service account is to set the GOOGLE_APPLICATION_CREDENTIALS environment variable. To check for the GOOGLE_APPLICATION_CREDENTIALS environment variable, run the following command:
```
ls "${GOOGLE_APPLICATION_CREDENTIALS}"
```
The response should be something like this:
```
/.../.../key.json
```
If you do not have a service account, then create one and grant it access to the required roles.
```
export SA_NAME=<your-sa-name>
gcloud iam service-accounts create ${SA_NAME}
gcloud projects add-iam-policy-binding ${PROJECT_ID} \
    --member serviceAccount:${SA_NAME}@${PROJECT_ID}.iam.gserviceaccount.com \
    --role 'roles/editor'
```
Create a key for your service account.
```
gcloud iam service-accounts keys create ~/key.json \
    --iam-account ${SA_NAME}@${PROJECT_ID}.iam.gserviceaccount.com
```
Create the GOOGLE_APPLICATION_CREDENTIALS environment variable.
```
export GOOGLE_APPLICATION_CREDENTIALS=~/key.json
```

Set up Docker

You need to have Docker installed to use Kubeflow Fairing. Fairing packages your code as a Docker image and executes it in the remote cluster. To check if your local Docker daemon is running, run the following command:

docker ps

If you get a message like docker: command not found, then install Docker.
If you get an error like Error response from daemon: Bad response from Docker engine, then restart your docker daemon.
If you are using Linux and you use sudo to access Docker, follow these steps to add your user to the docker group. Note, the docker group grants privileges equivalent to the root user. To learn more about how this affects security in your system, see the guide to the Docker daemon attack surface.

Authorize Docker to access your GCP Container Registry.

gcloud auth configure-docker

Set up Kubeflow

Use the following instructions to set up and configure your Kubeflow and development environments for training and prediction from Kubeflow Fairing.

If you do not have a Kubeflow environment, follow the guide to deploying Kubeflow on GKE to set up your Kubeflow environment. The guide provides two options for setting up your environment:
- The Kubeflow deployment user interface is an easy way for you to set up a GKE cluster with Kubeflow installed, or
- You can deploy Kubeflow using the command line.
Update your kubeconfig with appropriate credentials and endpoint information for your Kubeflow cluster. To find your cluster’s name, run the following command to list the clusters in your project:
```
gcloud container clusters list
```
Update the following command with your cluster’s name and GCP zone, then run the command to update your kubeconfig to provide it with credentials to access this Kubeflow cluster.
```
export CLUSTER_NAME=kubeflow
export ZONE=us-central1-a
gcloud container clusters get-credentials ${CLUSTER_NAME} --region ${ZONE}
```

Use Kubeflow Fairing to train a model locally and on GCP

Launch the XGBoost quickstart in a local Jupyter notebook.

jupyter notebook examples/prediction/xgboost-high-level-apis.ipynb

Follow the instructions in the notebook to train a model locally, on Kubeflow, and on Cloud ML Engine. Then deploy the trained model to Kubeflow for predictions and send requests to the prediction endpoint.

Last modified 21.04.2021: Components v external add ons (#2630) (42f08be0)