Streamlining Amazon EMR Step Logs from EC2 Instances to CloudWatch Logs
Amazon EMR is a big data service from AWS that runs Apache Spark and other open-source applications in the cloud, enabling scalable, cost-effective data pipelines. Effective monitoring of the logs generated by jobs running on EMR clusters is crucial for detecting significant issues in real time and quickly identifying root causes.
Connecting to Amazon MSK Serverless from Your On-Premises Network
By Mia Johnson and Ethan Carter
On 07 APR 2023
In Advanced (300), Amazon Managed Streaming for Apache Kafka (Amazon MSK), Technical How-to
Amazon Managed Streaming for Apache Kafka (Amazon MSK) is a fully managed, highly available, and secure service for Apache Kafka. It simplifies the setup, scaling, and management of Kafka in production environments. With Amazon MSK, users can establish a cluster in mere minutes and start transmitting data. Furthermore, Amazon MSK Serverless provides the ability to scale resources dynamically as needed.
Utilizing Tag-Based Access Controls in AWS Lake Formation for Amazon Redshift Permissions
By Sarah Thompson, Michael Williams, and Rachel Brown
On 06 APR 2023
In Advanced (300), AWS Glue, AWS Lake Formation, Customer Solutions
This article was collaboratively written by Michael Williams, Rachel Brown, and Sarah Thompson from Morningstar, alongside Jason Reed from AWS. Morningstar’s mission, “Empowering Investor Success,” focuses on providing investors and advisors with the tools necessary for informed investment decisions. In this discussion, the Data Lake Team at Morningstar explores how they effectively manage permissions for their Amazon Redshift data warehouse using tag-based access controls.
Updating Index Settings and Mappings in Amazon OpenSearch Service
By Leo Kim and Jonah Lee
On 06 APR 2023
In Advanced (300), Amazon OpenSearch Service, Technical How-to
Amazon OpenSearch Service is widely utilized for various use cases, including real-time application monitoring, log analytics, and large-scale website search. As your domain evolves and additional consumers are added, it becomes necessary to reassess and modify the domain’s configuration to meet the growing storage and compute demands. Minimizing downtime and ensuring performance stability are key considerations during this process.
Implementing Column-Level Encryption in Amazon Redshift
By Kelly Smith
On 05 APR 2023
In Amazon Redshift, Analytics, AWS Glue, Technical How-to
Amazon Redshift is a massively parallel processing (MPP), fully managed petabyte-scale data warehouse that allows for straightforward and cost-effective analysis of extensive datasets using existing business intelligence tools. As companies transition to Amazon Redshift for their data warehousing solutions, it is essential to implement robust data protection measures for sensitive information, such as personally identifiable information (PII) or financial data.
Accelerating Data Maturity with Amazon QuickSight at Showpad
By David Lee
On 05 APR 2023
In Amazon QuickSight, Customer Solutions, Foundational (100)
Showpad enhances collaboration between sales and marketing teams through impactful content and comprehensive training, enabling sellers to engage effectively with buyers while generating insights to improve conversion rates. In 2021, Showpad aimed to leverage data to drive innovation and inform business decisions throughout the organization. Their previous solution was fragmented, necessitating a more cohesive approach.
Creating Threshold Alerts on Tables in Amazon QuickSight
By Emily Davis
On 04 APR 2023
In Amazon QuickSight, Analytics, Foundational (100)
Amazon QuickSight has previously introduced threshold alerts on KPIs and gauge charts. Now, it expands its capabilities to include threshold alerts on tables and pivot tables—two of the most utilized visual types. This functionality allows both readers and authors to track goals or key performance indicators (KPIs) and receive email notifications when those goals are met.
Developing a Generic Orchestration Framework for Data Warehousing with Amazon Redshift RSQL
By Oliver Green, Bella White, and James Black
On 03 APR 2023
In Advanced (300), Amazon Redshift, Analytics
Thousands of customers depend on Amazon Redshift, AWS’s fast, petabyte-scale cloud data warehouse, for their mission-critical workloads. Users can query data across data warehouses, operational data stores, and data lakes using standard SQL. Moreover, integration with AWS services such as Amazon EMR, Amazon Athena, and AWS Lambda enhances the data processing capabilities.
Conducting Accent-Insensitive Searches with OpenSearch
By Amanda Taylor
On 30 MAR 2023
In Amazon OpenSearch Service, Intermediate (200), Technical How-to
Text searches sometimes need to ignore accent marks. An accent-insensitive search, also known as diacritics-agnostic search, returns consistent results whether or not a query includes accented Latin characters such as à, è, Ê, ñ, and ç. Because these diacritics can change the meaning of words, search functionality needs to handle them deliberately.
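As a rough illustration of the idea (not the method from the post itself), the sketch below folds accents in plain Python using only the standard library. The helper names `fold_accents` and `accent_insensitive_match` are hypothetical and exist only for this example. In OpenSearch, this kind of folding would normally happen at analysis time, for instance with an ASCII folding token filter, rather than in application code.

```python
import unicodedata


def fold_accents(text: str) -> str:
    """Return text with combining diacritical marks removed (hypothetical helper)."""
    # NFD splits each accented character into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks and keep the base letters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Recompose whatever remains into the usual composed form.
    return unicodedata.normalize("NFC", stripped)


def accent_insensitive_match(query: str, document: str) -> bool:
    """Substring match that ignores both accents and letter case (hypothetical helper)."""
    return fold_accents(query).casefold() in fold_accents(document).casefold()


if __name__ == "__main__":
    print(fold_accents("àèÊñç"))
    print(accent_insensitive_match("nino", "El Niño"))
```

Folding both the query and the indexed text the same way is what keeps results consistent. The trade-off is that words distinguished only by an accent become indistinguishable, which is why the choice needs to be made deliberately.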