Amazon IXD – VGT2 Las Vegas: The Future of Data, Analytics, and AI

In the keynotes at AWS re:Invent 2024, speakers presented the latest version of Amazon SageMaker and positioned it as the central hub for data, analytics, and AI work. The new version reflects the growing overlap between analytics and AI workloads and is designed to simplify how customers work with their data. It is intended to help organizations collaborate more effectively, reduce data silos, and speed up the development of AI-driven applications while maintaining strong governance and security.

Empowering Domain Teams with Federated Data Platforms

by Rachel Green, Omar Jenkins, and Priya Desai
on 04 DEC 2024
in Amazon Athena, Amazon DataZone, Amazon Managed Workflows for Apache Airflow (Amazon MWAA), Amazon QuickSight, Amazon Redshift, Amazon Simple Storage Service (S3), Analytics, Architecture, AWS Glue, AWS Lake Formation

The ANZ Institutional Division has revamped its approach to data management by adopting a federated data platform grounded in data mesh principles. This transformation aims to unleash hidden data potential, enhance operational efficiency, and boost agility. The new approach allows domain teams to develop and manage their own data products, treating data as a crucial asset rather than an afterthought. This article delves into the implementation of a data product mentality, the challenges encountered, and the early successes influencing the future of data management in the division.

Automating Table Statistics with AWS Glue Data Catalog

by Ethan Brown, Sarah Lee, and Tom Reynolds
on 03 DEC 2024
in Amazon Redshift, Analytics, Announcements, AWS Glue

AWS Glue Data Catalog now automates statistics collection for new tables. These statistics integrate with the cost-based optimizers (CBO) in Amazon Redshift Spectrum and Amazon Athena, which can improve query performance and reduce cost. This piece discusses how the Data Catalog automates the gathering of table statistics and how that can make your data platform more effective.

Integrating HubSpot Data with AWS Glue

by Rachel Martin, Jacob Weller, and Hannah Smith
on 02 DEC 2024
in Advanced (300), Analytics, AWS Glue

This article introduces the newly launched HubSpot managed connector for AWS Glue, showcasing how to seamlessly integrate HubSpot data into your AWS data lake. By merging HubSpot data with information from AWS accounts and other SaaS applications, businesses can improve their data analysis capabilities and even write data back to HubSpot, thus creating a coherent data experience.

Optimizing SAP Data Transfers with AWS Glue

by David Kim, Emily Johnson, and Keith Brown
on 29 NOV 2024
in Analytics, AWS Glue

The AWS Glue OData connector for SAP uses the SAP ODP framework and the OData protocol for data extraction. ODP follows a provider-subscriber model that supports data transfers between SAP systems and non-SAP data targets. In this blog post, we explain how to extract data from SAP and implement incremental data transfer using the SAP ODP OData framework with source delta tokens.
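The delta-token idea can be illustrated with a small, self-contained sketch. It does not call SAP, OData, or AWS Glue. The `Provider` and `Subscriber` classes, and the use of an integer log position as the token, are assumptions made purely for illustration; real delta tokens are opaque values issued by the source system.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Provider:
    """Hypothetical stand-in for a source system with an ordered change log."""
    changes: List[dict] = field(default_factory=list)

    def write(self, record: dict) -> None:
        # Every change is appended, so a log position can act as a token.
        self.changes.append(record)

    def read(self, delta_token: Optional[int] = None) -> Tuple[List[dict], int]:
        # No token means an initial full extract; otherwise return only the
        # changes recorded after the position the token points to.
        start = 0 if delta_token is None else delta_token
        return self.changes[start:], len(self.changes)


@dataclass
class Subscriber:
    """Hypothetical consumer that stores the last token between runs."""
    provider: Provider
    delta_token: Optional[int] = None
    target: List[dict] = field(default_factory=list)

    def sync(self) -> int:
        # Fetch changes since the stored token, then remember the new token.
        records, self.delta_token = self.provider.read(self.delta_token)
        self.target.extend(records)
        return len(records)


provider = Provider()
provider.write({"id": 1, "status": "new"})
provider.write({"id": 2, "status": "new"})

subscriber = Subscriber(provider)
first = subscriber.sync()    # initial load picks up both records

provider.write({"id": 3, "status": "new"})
second = subscriber.sync()   # incremental run picks up only the new record
third = subscriber.sync()    # nothing changed, so nothing is transferred
```

The design point is that the subscriber, not the provider, carries the token from one run to the next, so each run transfers only what changed since the previous one.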

Streamlining Big Data Processing with Amazon EMR and S3 Glacier

by Jane Doe, Chris Lee, and Mark Taylor
on 27 NOV 2024
in Amazon EMR, Amazon S3 Glacier, AWS Big Data

This article shows how to set up and use Amazon EMR on EC2 with S3 Glacier for cost-effective data processing.

Implementing a Chargeback Model with Amazon Redshift Multi-Warehouse Writes

by Lily Wong, Alex Chen, and Daniel Patel
on 27 NOV 2024
in Advanced (300), Amazon Redshift, Analytics, Technical How-to

We are pleased to announce the general availability of Amazon Redshift multi-data warehouse writes through data sharing. This new functionality enables scaling of write workloads, enhancing performance for extract, transform, and load (ETL) processes by utilizing different types and sizes of warehouses tailored to specific workload requirements.
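When each write workload runs on its own warehouse, one simple way to think about chargeback is to attribute each warehouse's compute cost to the team that owns it. The sketch below is only an illustration of that arithmetic under stated assumptions: it uses no Amazon Redshift API, and the `WarehouseUsage` type, team names, hours, and hourly rates are all hypothetical sample values.

```python
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass(frozen=True)
class WarehouseUsage:
    """Hypothetical usage record for one warehouse over a billing period."""
    team: str
    warehouse: str
    compute_hours: float
    hourly_rate_usd: float


def chargeback(usages: Iterable[WarehouseUsage]) -> Dict[str, float]:
    """Sum compute cost per owning team (hours multiplied by hourly rate)."""
    totals: Dict[str, float] = {}
    for usage in usages:
        cost = usage.compute_hours * usage.hourly_rate_usd
        totals[usage.team] = totals.get(usage.team, 0.0) + cost
    return {team: round(cost, 2) for team, cost in totals.items()}


# Illustrative sample values only, not real pricing or measurements.
usages = [
    WarehouseUsage("etl", "etl-large", compute_hours=10.0, hourly_rate_usd=3.0),
    WarehouseUsage("etl", "etl-small", compute_hours=4.0, hourly_rate_usd=1.5),
    WarehouseUsage("bi", "bi-small", compute_hours=8.0, hourly_rate_usd=1.5),
]
report = chargeback(usages)
```

In practice the usage figures would come from whatever billing or monitoring data is available for each warehouse; the sketch only shows the per-team aggregation step.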

Achieving Near Real-Time Analytics with Amazon Aurora Zero-ETL Integration

by Sean Carter, Mia Thompson, and Rachel Green
on 27 NOV 2024
in Amazon Redshift, Analytics, Architecture, Best Practices, Learning Levels, Technical How-to

This article shows how to use the zero-ETL integration between Aurora MySQL-Compatible Edition and Amazon Redshift, together with dbt Cloud, to support near real-time analytics. By using dbt Cloud for data transformation, teams can concentrate on writing business rules that derive insights from transaction data and respond promptly to time-sensitive events.

Improving Vector Search Performance with Intel Accelerators on Amazon OpenSearch Service

by Amir Khan, Kevin Roberts, and Sarah Williams
on 27 NOV 2024
in Amazon OpenSearch Service

By employing Intel Accelerators, Amazon OpenSearch Service enhances price-performance for vector searches by up to 51%.

For more information, visit us at Amazon IXD – VGT2, 6401 E Howdy Wells Ave, Las Vegas, NV 89115.