Introduction
TensorFlow is an open-source machine learning framework developed by Google. It is widely used for building and deploying machine learning models. Google Cloud Platform (GCP) provides a robust environment for running TensorFlow applications, offering various services that can help streamline the development, training, and deployment of machine learning models.
Key Concepts
- TensorFlow Overview
- TensorFlow: An open-source library for numerical computation and machine learning.
- TensorFlow Models: Pre-trained models or custom models that can be trained using TensorFlow.
- TensorFlow Serving: A flexible, high-performance serving system for machine learning models designed for production environments.
- GCP Services for TensorFlow
- AI Platform: A managed service that allows you to train and deploy machine learning models.
- Compute Engine: Virtual machines that can be used to run TensorFlow training jobs.
- Kubernetes Engine: Managed Kubernetes service for deploying TensorFlow models in containers.
- Cloud Storage: Scalable object storage for storing datasets and model checkpoints.
- BigQuery: A fully-managed data warehouse for analyzing large datasets.
Setting Up TensorFlow on GCP
Step 1: Setting Up Your Environment
- Create a GCP Project:
- Go to the GCP Console.
- Click on the project drop-down and select "New Project".
- Enter a project name and click "Create".
- Enable Billing:
- Navigate to the Billing section in the GCP Console.
- Link a billing account to your project.
- Enable Required APIs:
- Go to the API Library in the GCP Console.
- Enable the following APIs (they can also be enabled from the command line, as shown after this list):
- AI Platform Training & Prediction API
- Compute Engine API
- Kubernetes Engine API
- Cloud Storage API
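To enable the same four APIs from the command line, you can use gcloud with their standard service identifiers:
gcloud services enable ml.googleapis.com \
  compute.googleapis.com \
  container.googleapis.com \
  storage.googleapis.com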
Step 2: Installing TensorFlow
- Local Installation:
pip install tensorflow
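To confirm the installation, print the installed version:
python -c "import tensorflow as tf; print(tf.__version__)"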
- Using AI Platform Notebooks:
- Navigate to the AI Platform section in the GCP Console.
- Select "Notebooks" and create a new instance.
- Choose a TensorFlow image and configure the instance.
Step 3: Preparing Your Data
- Upload Data to Cloud Storage:
gsutil cp local-file-path gs://your-bucket-name/
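The bucket must exist before you copy into it; a one-time creation command, assuming the us-central1 region:
gsutil mb -l us-central1 gs://your-bucket-name/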
- Accessing Data in TensorFlow:
import tensorflow as tf

def load_data_from_gcs(bucket_name, file_path):
    gcs_path = f'gs://{bucket_name}/{file_path}'
    return tf.io.read_file(gcs_path)

data = load_data_from_gcs('your-bucket-name', 'data.csv')
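For anything larger than a small file, tf.data can stream records from Cloud Storage rather than reading the whole object into memory. A minimal sketch, assuming data.csv has a header row to skip:
import tensorflow as tf

# Stream the CSV line by line directly from Cloud Storage
dataset = tf.data.TextLineDataset('gs://your-bucket-name/data.csv').skip(1)
for line in dataset.take(3):  # peek at the first three records
    print(line.numpy())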
Training a TensorFlow Model on GCP
Using AI Platform
- Create a Training Job:
- Prepare a Python script for training your model.
- Package the script as a Python package (gcloud stages it to Cloud Storage for you when you submit; a minimal setup.py sketch follows the command below).
- Submit the training job to AI Platform:
gcloud ai-platform jobs submit training job_name \
  --module-name trainer.task \
  --package-path ./trainer \
  --region us-central1 \
  --python-version 3.7 \
  --runtime-version 2.3 \
  --job-dir gs://your-bucket-name/job-dir
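Note that --package-path expects the code laid out as a Python package (for example trainer/__init__.py and trainer/task.py). A minimal setup.py sketch for that package; the name and empty dependency list are assumptions to adapt:
from setuptools import find_packages, setup

setup(
    name='trainer',
    version='0.1',
    packages=find_packages(),
    install_requires=[],  # add any extra pip packages that task.py imports
)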
- Monitor the Training Job:
- Navigate to the AI Platform section in the GCP Console.
- Check the status of your training job (or use the gcloud commands shown below).
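From the command line, these commands report the job's status and stream its logs, using the job name from the submit step:
gcloud ai-platform jobs describe job_name
gcloud ai-platform jobs stream-logs job_name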
Using Compute Engine
- Create a VM Instance:
- Go to the Compute Engine section in the GCP Console.
- Click "Create Instance" and configure the VM.
- SSH into the VM and install TensorFlow (a CLI sketch follows below).
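The same setup from the CLI; the instance name, zone, and machine type here are illustrative assumptions:
gcloud compute instances create tf-training-vm \
  --zone us-central1-a \
  --machine-type n1-standard-4
gcloud compute ssh tf-training-vm --zone us-central1-a
# Then, on the VM:
pip install tensorflow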
- Run the Training Script:
- Upload your training script to the VM (one option is gcloud compute scp, shown below).
- Execute the script:
python train.py
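A sketch of copying the script up with gcloud compute scp, reusing the hypothetical VM name from above:
gcloud compute scp train.py tf-training-vm:~ --zone us-central1-a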
Deploying a TensorFlow Model on GCP
Using AI Platform
- Export the Model:
model.save('gs://your-bucket-name/model-dir')
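model.save writes the model in SavedModel format; you can confirm the export by listing the bucket path, which should contain saved_model.pb and a variables/ directory:
gsutil ls gs://your-bucket-name/model-dir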
- Create a Model on AI Platform:
gcloud ai-platform models create model_name
- Create a Model Version:
gcloud ai-platform versions create v1 \
  --model model_name \
  --origin gs://your-bucket-name/model-dir \
  --runtime-version 2.3 \
  --python-version 3.7
- Make Predictions:
from google.cloud import aiplatform
from google.protobuf import json_format
from google.protobuf.struct_pb2 import Value

# Note: this client targets Vertex AI endpoints, so it needs the regional
# API endpoint and instances encoded as protobuf Values.
client = aiplatform.gapic.PredictionServiceClient(
    client_options={'api_endpoint': 'us-central1-aiplatform.googleapis.com'}
)
endpoint = client.endpoint_path(
    project='your-project-id', location='us-central1', endpoint='your-endpoint-id'
)
# input_data is a placeholder for your JSON-serializable model input
instances = [json_format.ParseDict(input_data, Value())]
response = client.predict(endpoint=endpoint, instances=instances)
print(response.predictions)
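Since the model above was deployed with the legacy gcloud ai-platform commands rather than to a Vertex AI endpoint, it can also be queried directly from the CLI; instances.json here is a hypothetical file with one JSON instance per line:
gcloud ai-platform predict \
  --model model_name \
  --version v1 \
  --json-instances instances.json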
Using Kubernetes Engine
- Create a Kubernetes Cluster:
- Navigate to the Kubernetes Engine section in the GCP Console.
- Click "Create Cluster" and configure the cluster.
- Deploy the Model:
- Create a Docker image for your model.
- Push the image to Google Container Registry.
- Deploy the image to your Kubernetes cluster using a deployment YAML file (a CLI sketch follows below).
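A sketch of that workflow; the image tag, cluster name, and deployment.yaml are placeholders to adapt to your project:
gcloud auth configure-docker
docker build -t gcr.io/your-project-id/tf-model:v1 .
docker push gcr.io/your-project-id/tf-model:v1
gcloud container clusters get-credentials your-cluster --region us-central1
kubectl apply -f deployment.yaml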
Practical Exercise
Exercise: Train and Deploy a TensorFlow Model on AI Platform
- Objective: Train a simple TensorFlow model on AI Platform and deploy it for predictions.
- Steps:
- Create a GCP project and set up the environment.
- Prepare a dataset and upload it to Cloud Storage.
- Write a training script and upload it to Cloud Storage.
- Submit a training job on AI Platform.
- Export the trained model to Cloud Storage.
- Create a model and version on AI Platform.
- Make predictions using the deployed model.
Solution:
- Training Script (train.py):
import tensorflow as tf
from tensorflow.keras import layers

def train_model():
    # Load and preprocess data
    (x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
    x_train, x_test = x_train / 255.0, x_test / 255.0

    # Build the model
    model = tf.keras.Sequential([
        layers.Flatten(input_shape=(28, 28)),
        layers.Dense(128, activation='relu'),
        layers.Dropout(0.2),
        layers.Dense(10)
    ])

    # Compile the model
    model.compile(optimizer='adam',
                  loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
                  metrics=['accuracy'])

    # Train the model
    model.fit(x_train, y_train, epochs=5)

    # Evaluate the model
    model.evaluate(x_test, y_test, verbose=2)

    # Save the model to Cloud Storage in SavedModel format
    model.save('gs://your-bucket-name/model-dir')

if __name__ == '__main__':
    train_model()
- Submit Training Job:
gcloud ai-platform jobs submit training mnist_training \
  --module-name trainer.task \
  --package-path ./trainer \
  --region us-central1 \
  --python-version 3.7 \
  --runtime-version 2.3 \
  --job-dir gs://your-bucket-name/job-dir
- Create Model and Version:
gcloud ai-platform models create mnist_model
gcloud ai-platform versions create v1 \
  --model mnist_model \
  --origin gs://your-bucket-name/model-dir \
  --runtime-version 2.3 \
  --python-version 3.7
- Make Predictions:
from google.cloud import aiplatform
from google.protobuf import json_format
from google.protobuf.struct_pb2 import Value

client = aiplatform.gapic.PredictionServiceClient(
    client_options={'api_endpoint': 'us-central1-aiplatform.googleapis.com'}
)
endpoint = client.endpoint_path(
    project='your-project-id', location='us-central1', endpoint='your-endpoint-id'
)
# input_data is a placeholder: for this MNIST model, a 28x28 image as nested lists
instances = [json_format.ParseDict(input_data, Value())]
response = client.predict(endpoint=endpoint, instances=instances)
print(response.predictions)
Conclusion
In this section, you learned how to set up TensorFlow on GCP, train a model using AI Platform, and deploy the model for predictions. You also completed a practical exercise to reinforce these concepts. In the next module, we will explore other machine learning and AI services provided by GCP.