Accelerate Your Journey to Production-Ready Data with Amazon SageMaker Ground Truth Plus

We are pleased to announce the introduction of new built-in interfaces within Ground Truth Plus. This enhanced functionality permits multiple users under the same AWS account to initiate new projects and batches, share data, and receive data seamlessly through self-serve interfaces. This development significantly speeds up the creation of high-quality training datasets by minimizing the time required for project setup. Furthermore, you can fine-tune access to your data by adjusting your AWS Identity and Access Management (IAM) role permissions in accordance with your level of Amazon Simple Storage Service (Amazon S3) access, and you always have the option to revoke access to specific buckets.

Previously, initiating new data labeling projects and batches required contacting your Ground Truth Plus operations program manager (OPM). This system had limitations, as it restricted project requests to only one user at a time within the same AWS account. This led to delays in starting the labeling process due to multiple manual interactions and troubleshooting when issues arose. Additionally, all projects shared the same IAM role for data access, necessitating reliance on your Ground Truth Plus OPM for tailored S3 policies, which you then had to manually implement on your S3 buckets. This manual approach created unnecessary operational overhead.

This article will guide you through the steps to create a new project and batch, share data, and receive data using the innovative self-serve interfaces to effectively initiate the labeling process. It’s assumed that you are already familiar with Ground Truth Plus.

Solution Overview

We will cover the following topics:

  • Updating existing projects
  • Requesting a new project
  • Setting up a project team
  • Creating a batch

Prerequisites

Before proceeding, ensure you have the following:

  • An AWS account
  • An IAM user with permissions to create IAM roles
  • The Amazon S3 URI of the bucket containing your labeling objects

Updating Existing Projects

If you possess a Ground Truth Plus project established before the release of these new features (December 9, 2022), you must create and share an IAM role to utilize these enhancements. New users of Ground Truth Plus can skip this section.

To create an IAM role, follow these steps:

  1. Access the IAM console and select “Create role.”
  2. Choose “Custom trust policy.”
  3. Define the following trust relationship for the role:
{
   "Version": "2012-10-17",
   "Statement": [
       {
           "Effect": "Allow",
           "Principal": {
               "Service": "sagemaker-ground-truth-plus.amazonaws.com"
           },
           "Action": "sts:AssumeRole"
       }
   ]
}
  4. Click “Next.”
  5. Select “Create policy.”
  6. On the JSON tab, enter the following policy. Update the Resource property by specifying two entries for each bucket: one with just the bucket ARN, and another with the bucket ARN followed by /*. For instance, replace <your-input-s3-arn> with arn:aws:s3:::my-bucket/myprefix/ and <your-input-s3-arn>/* with arn:aws:s3:::my-bucket/myprefix/*.
{
   "Version": "2012-10-17",
   "Statement": [
       {
           "Effect": "Allow",
           "Action": [
               "s3:GetObject",
               "s3:PutObject",
               "s3:GetBucketLocation",
               "s3:ListBucket"
           ],
           "Resource": [
               "<your-input-s3-arn>",
               "<your-input-s3-arn>/*",
               "<your-output-s3-arn>",
               "<your-output-s3-arn>/*"
           ]
       }
   ]
}
  7. Click “Next: Tags” and then “Next: Review.”
  8. Enter a name for the policy and an optional description.
  9. Select “Create policy.”
  10. Return to the previous tab to create your role.

On the Add permissions tab, you should see the new policy you created (refresh the page if it doesn’t appear).

  11. Select the newly created policy and click “Next.”
  12. Enter a name (e.g., GTPlusExecutionRole) and an optional description for the role.
  13. Click “Create role.”
  14. Provide the role ARN to your Ground Truth Plus OPM, who will then update your existing project with this new role.
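
The console steps above can also be scripted. The following is a minimal sketch, not an official procedure: it assumes the AWS SDK for Python (boto3) and an IAM client obtained from boto3.client("iam"), which is passed in as a parameter so the code can be read and exercised without credentials. The helper names (s3_uri_to_arn, build_trust_policy, build_permissions_policy, create_gt_plus_role) are hypothetical and not part of Ground Truth Plus.

```python
import json

GT_PLUS_PRINCIPAL = "sagemaker-ground-truth-plus.amazonaws.com"
S3_ACTIONS = ["s3:GetObject", "s3:PutObject", "s3:GetBucketLocation", "s3:ListBucket"]


def s3_uri_to_arn(s3_uri):
    """Turn s3://bucket/prefix/ into arn:aws:s3:::bucket/prefix/ (hypothetical helper)."""
    if not s3_uri.startswith("s3://"):
        raise ValueError(f"not an S3 URI: {s3_uri!r}")
    return "arn:aws:s3:::" + s3_uri[len("s3://"):]


def build_trust_policy():
    """Trust policy that lets the Ground Truth Plus service assume the role."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Principal": {"Service": GT_PLUS_PRINCIPAL},
                "Action": "sts:AssumeRole",
            }
        ],
    }


def build_permissions_policy(s3_arns):
    """Permissions policy with two Resource entries per ARN: the ARN and the ARN plus /*."""
    resources = []
    for arn in s3_arns:
        resources.append(arn)
        # Drop a trailing slash first so "prefix/" becomes "prefix/*", not "prefix//*".
        resources.append(arn.rstrip("/") + "/*")
    return {
        "Version": "2012-10-17",
        "Statement": [
            {"Effect": "Allow", "Action": list(S3_ACTIONS), "Resource": resources}
        ],
    }


def create_gt_plus_role(iam, role_name, policy_name, s3_arns):
    """Create the role, create the policy, attach it, and return the role ARN.

    `iam` is assumed to be a boto3 IAM client.
    """
    role = iam.create_role(
        RoleName=role_name,
        AssumeRolePolicyDocument=json.dumps(build_trust_policy()),
    )
    policy = iam.create_policy(
        PolicyName=policy_name,
        PolicyDocument=json.dumps(build_permissions_policy(s3_arns)),
    )
    iam.attach_role_policy(RoleName=role_name, PolicyArn=policy["Policy"]["Arn"])
    return role["Role"]["Arn"]
```

With credentials configured, a call such as create_gt_plus_role(boto3.client("iam"), "GTPlusExecutionRole", "GTPlusS3Access", [s3_uri_to_arn("s3://my-bucket/myprefix/")]) would return the role ARN to pass to your OPM. The policy name GTPlusS3Access is only an illustration.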

Requesting a New Project

To initiate a new project, follow these steps:

  1. Access the Ground Truth Plus console and navigate to the Projects section, where all your projects are listed.
  2. Click on “Request project.”
  3. Fill out the Request project page with details that will assist in scheduling an initial consultation call and setting up your project.

Along with general information like the project name and description, specify the project’s task type and whether it contains personally identifiable information (PII).

To label your data, Ground Truth Plus requires temporary access to your raw data in an S3 bucket. Once the labeling process is complete, Ground Truth Plus will return the labeled output to your S3 bucket through an IAM role. You can either use the built-in tool to create a new role or create one in the IAM console (see the previous section for guidance).

If you created the role yourself in the IAM console, select “Enter a custom IAM role ARN” and input your IAM role ARN, which follows the format arn:aws:iam::<account-id>:role/<role-name>.
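
Before pasting a role ARN into the form, it can help to check its shape. An IAM role ARN generally looks like arn:aws:iam::<12-digit account ID>:role/<role name>, optionally with a path before the name. The sketch below is a rough format check only: the function name is_iam_role_arn is hypothetical, the regular expression is an approximation of the allowed characters, and a match does not prove that the role exists.

```python
import re

# Approximate shape of an IAM role ARN: partition, 12-digit account ID,
# then "role/" followed by an optional path and the role name.
ROLE_ARN_PATTERN = re.compile(
    r"arn:aws[a-z-]*:iam::\d{12}:role/[\w+=,.@/-]+",
    re.ASCII,
)


def is_iam_role_arn(value):
    """Return True if `value` looks like an IAM role ARN (format check only)."""
    return ROLE_ARN_PATTERN.fullmatch(value) is not None
```

For example, is_iam_role_arn("arn:aws:iam::123456789012:role/GTPlusExecutionRole") passes, while an ARN with the account ID or role name left out does not.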

To use the built-in tool, from the drop-down menu under IAM Role, choose “Create a new role.” Specify the bucket location of your labeling data. If you’re uncertain about the location of your labeling data or have not uploaded any labeling data yet, select “Any S3 bucket,” which grants Ground Truth Plus access to all buckets in your account.

Select “Create” to establish the role.

Your IAM role will enable Ground Truth Plus, identified as sagemaker-ground-truth-plus.amazonaws.com in the role’s trust policy, to perform the following actions on your S3 buckets:

[
"s3:GetObject",
"s3:PutObject",
"s3:GetBucketLocation",
"s3:ListBucket"
]
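
If you bring your own role, you may want to confirm that its policy document covers these four actions. The following is a simplified sketch under stated assumptions: the function name missing_actions is hypothetical, it only inspects Allow statements, and it ignores Deny statements, NotAction, and Condition elements, so it is not a substitute for IAM's own policy evaluation. Wildcards such as s3:* are matched with Python's fnmatch, which approximates IAM wildcard behavior.

```python
from fnmatch import fnmatchcase

REQUIRED_ACTIONS = [
    "s3:GetObject",
    "s3:PutObject",
    "s3:GetBucketLocation",
    "s3:ListBucket",
]


def _as_list(value):
    """IAM allows a single item where a list is expected; normalize to a list."""
    return [value] if isinstance(value, (str, dict)) else list(value)


def missing_actions(policy_document, required=REQUIRED_ACTIONS):
    """Return the required actions that no Allow statement in the policy covers."""
    allowed_patterns = []
    for statement in _as_list(policy_document.get("Statement", [])):
        if statement.get("Effect") == "Allow":
            # Action names are compared case-insensitively.
            allowed_patterns += [a.lower() for a in _as_list(statement.get("Action", []))]
    return [
        action
        for action in required
        if not any(fnmatchcase(action.lower(), pattern) for pattern in allowed_patterns)
    ]
```

An empty list means every required action is allowed somewhere in the document; any names returned are the ones to add.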
