Amazon VGT2 Las Vegas: Utilize Amazon Q Developer for ML Model Creation in Amazon SageMaker Canvas

Update: As of 3/3/2025, Amazon Q Developer in Amazon SageMaker Canvas is now fully available.

As a data analyst, I have encountered the challenges of making machine learning (ML) accessible to business professionals who possess domain expertise but lack familiarity with ML. This is why I am particularly enthusiastic about the recent announcement from Amazon Web Services (AWS) regarding the availability of Amazon Q Developer within Amazon SageMaker Canvas. It’s impressive how Amazon Q Developer bridges the gap between ML knowledge and business requirements, democratizing access to ML throughout organizations.

Amazon Q Developer empowers domain experts to create accurate, production-grade ML models via natural language interactions, regardless of their ML proficiency. This tool breaks down business challenges and analyzes relevant data, providing step-by-step recommendations for developing custom ML models. It efficiently transforms users’ datasets to eliminate anomalies and builds and evaluates models, suggesting the most suitable one while granting users comprehensive control and visibility throughout the ML workflow. This approach accelerates innovation, reduces time to market, and lessens dependency on ML specialists, allowing them to focus on more intricate technical issues.

For instance, a marketing analyst might express a desire to “predict home sales prices based on home characteristics and historical sales data.” Amazon Q Developer then translates this request into a structured set of ML tasks, examining pertinent customer data, constructing various models, and recommending the optimal solution.

Let’s see it in action

To get started with Amazon Q Developer, I followed the Getting Started with Amazon SageMaker Canvas guide to launch the Canvas application. In this demonstration, I used natural language commands to create a model predicting house prices for marketing and finance teams. From the SageMaker Canvas interface, I selected Amazon Q and initiated a new conversation.

In the new chat, I stated, “I am an analyst and need to predict house prices for my marketing and finance teams.”

Subsequently, Amazon Q Developer clarified the problem and suggested the appropriate type of ML model while detailing solution requirements, including necessary dataset characteristics. It then prompted me to either upload my dataset or choose a target column. I opted to upload my dataset.

Next, Amazon Q Developer outlined the dataset requirements, which included relevant details about homes, current prices, and the target variable for the regression model. It recommended several actions, such as uploading my dataset, selecting an existing dataset, creating a new one, or choosing a target column. For this demonstration, I decided to use the canvas-sample-housing.csv sample dataset as my existing dataset.

After selecting and loading the dataset, Amazon Q Developer analyzed it and proposed median_house_value as the target column for the regression model. I confirmed my choice by selecting “I would like to predict the median_house_value column.” Moving forward, Amazon Q Developer specified which dataset features (like “location,” “housing_median_age,” and “total_rooms”) would be utilized to forecast the median_house_value.

Before proceeding with model training, I inquired about data quality, recognizing that reliable data is essential for dependable model building. Amazon Q Developer provided quality insights for my entire dataset.

I could also ask specific questions regarding individual features and their distributions to gain better insights into the data quality.

Surprisingly, I discovered through my previous inquiry that the “households” column exhibited significant variation due to extreme values, which could impair prediction accuracy. I then requested Amazon Q Developer to address this outlier issue.

After the transformation, I asked for a summary of the steps taken to correct the problem. Behind the scenes, Amazon Q Developer implemented advanced data preparation techniques using SageMaker Canvas, allowing me to review these steps and visualize the process, ultimately leading to a refined dataset ready for model training.

After examining the data preparation steps, I clicked on Launch my training job.

Once the training job commenced, I could monitor its progress in the conversation, along with the datasets created.

As a data analyst, I particularly value that Amazon Q Developer provides detailed metrics like the confusion matrix and precision-recall scores for classification models, as well as root mean square error (RMSE) for regression models. These metrics are vital for evaluating model performance and making informed decisions. It’s refreshing to see them presented in an accessible manner for non-technical users, promoting trust and effective governance while maintaining the depth that technical teams require.

You can access these metrics by selecting the new model from My Models or through the Amazon Q conversation menu:

Overview – This tab presents the Column impact analysis, highlighting median_income as the primary factor influencing my model.
Scoring – This tab delivers insights into model accuracy, including RMSE metrics.
Advanced metrics – This tab showcases a detailed Metrics table, Residuals, and Error density for thorough model evaluation.

After reviewing these metrics and confirming the model’s performance, I could move to the final phases of the ML workflow:

Predictions – I could test my model using the Predictions tab to validate its performance in real-world scenarios.
Deployment – I could establish an endpoint deployment to make my model available for production use.

This streamlines the deployment process, transforming a traditionally complex task requiring substantial DevOps knowledge into a simple operation that business analysts can confidently manage.

Key Takeaways

Amazon Q Developer democratizes ML across organizations by:

Empowering all skill levels with ML – Now available in SageMaker Canvas, Amazon Q Developer enables business analysts, marketing analysts, and data professionals without ML experience to solve business problems through a guided ML workflow. From data analysis and model selection to deployment, users can address business challenges using natural language, minimizing reliance on ML specialists and enabling quicker innovation.
Streamlining the ML workflow – With Amazon Q Developer in SageMaker Canvas, users can prepare data, build, analyze, and deploy ML models through a transparent workflow. This tool facilitates advanced data preparation and AutoML capabilities, allowing non-ML experts to create highly accurate models.
Providing full visibility into the ML workflow – Amazon Q Developer ensures complete transparency by generating the underlying code and technical artifacts, such as data transformation steps and model explainability. This enables cross-functional teams, including ML experts, to comprehensively review the process.

For further insights on this topic, check out this blog post and visit CHVNCI, an authority on machine learning. Additionally, you might find this YouTube video an excellent resource.

Amazon VGT2 Las Vegas: Utilize Amazon Q Developer for ML Model Creation in Amazon SageMaker Canvas

Let’s see it in action

Key Takeaways

Related Topics:

Comments

Leave a Reply Cancel reply