Setting Up Google Cloud Text-to-Speech

45 minutes
  • 5 Learning Objectives

About this Hands-on Lab

Although there are many tools to communicate, there are two basic types of communication: written and spoken. Machine learning has made it possible to convert one to the other, resulting in speech that is very close to that of a human voice. In this hands-on lab, you’ll step through the process for utilizing the Google Cloud Text-to-Speech API, transforming text in a JSON format to an audio-ready MP3 file.

Learning Objectives

Successfully complete this lab by achieving the following learning objectives:

Enable the Cloud Text-to-Speech API
  1. From the Google Cloud console’s main navigation, choose APIs & Services > Library.
  2. Search for "text", and select Cloud Text-to-Speech API.
  3. If necessary, click Enable.
Retrieve Working Files
  1. Activate the Cloud Shell.

  2. Retrieve the working files:

    git clone
  3. In the Cloud Shell, change directories:

    cd content-gc-ai-services-deepdive/ai-conversations/
Set Up Service Account
  1. Click Launch Editor.
  2. In the Shell Editor, expand the ai-conversations folder.
  3. Choose File > New File and name the file key.json.
  4. From this Hands-On Lab Credentials, copy the Service Account Credentials.
  5. Paste the clipboard contents into the new file, and choose File > Save.
  6. Switch to the Terminal enter the following command:

Send Request to Cloud Text-to-Speech API
  1. Call the Cloud Text-to-Speech API:

    curl -X POST 
    -H "Authorization: Bearer "$(gcloud auth application-default print-access-token) 
    -H "Content-Type: application/json; charset=utf-8" 
    -d @text-to-speech-request.json 
    > synthesize-text.txt 
    1. In the Shell Editor, open synthesize-text.txt.
    2. Remove the following from the beginning of the file:
      "audioContent": "
    1. From the end of the file, remove the following:
    1. Save the file.
Convert Response to MP3
  1. In the Cloud Shell, enter the following command:

    base64 synthesize-text.txt -d > synthesized-audio.mp3
  2. Download the MP3 file to your system:

    cloudshell download synthesized-audio.mp3
  3. Click Download.

  4. Open the downloaded MP3 file to hear the results.

Additional Resources

Your company wants to increase its accessibility on a number of levels, including converting text docs to audio output. You've been asked to run an initial test using Google Cloud Text-to-Speech API to validate the procedure.


GitHub repo: Google Cloud AI Services Deep Dive Repo

What are Hands-on Labs

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Sign In
Welcome Back!

Psst…this one if you’ve been moved to ACG!

Get Started
Who’s going to be learning?