Integrating Custom Libraries and Dependencies in Spark and Hive on Amazon EMR Serverless
Amazon EMR Serverless lets you run open-source big data frameworks such as Apache Spark and Apache Hive without managing clusters or servers. Many users who run Spark and Hive applications want to add their own libraries and dependencies to the runtime environment. For instance, you may want to integrate popular open-source extensions to improve Spark performance.
Easily Execute Benchmarks on Amazon Redshift Serverless with AWS Data Exchange
by Maria Smith
on 09 JAN 2023
in Amazon Redshift, AWS Data Exchange, Expert (400)
Amazon Redshift is a fast, easy-to-use, secure, and cost-effective cloud data warehousing service built for analytics. Amazon Redshift Serverless, launched in July 2022, makes Amazon Redshift simpler to operate: you can run and scale analytics without managing data warehouse infrastructure.
Transforming Code from Greenplum to Amazon Redshift: Addressing Arrays, Dates, and Regular Expressions
by Kevin Lee, Sarah Patel, and David Kim
on 09 JAN 2023
in Advanced (300), Amazon Redshift, Analytics, Intermediate (200), Technical How-to
Amazon Redshift is a fully managed service for data lakes, analytics, and data warehouses, used by organizations ranging from startups to large enterprises. Thousands of companies worldwide have adopted it to modernize their data analytics capabilities. Greenplum is an open-source, massively parallel database used primarily for analytics on on-premises infrastructure.
Creating a Search Application with Amazon OpenSearch Serverless
by Emily White, Chris Brown, and Tina Green
on 06 JAN 2023
in Advanced (300), Amazon OpenSearch Service, Analytics
In this article, we show how to build a simple web-based search application using the newly introduced Amazon OpenSearch Serverless, a serverless option that runs petabyte-scale search and analytics workloads without requiring you to manage clusters.
Empowering Data Accessibility at Green Flag with Amazon QuickSight
by Mark Taylor
on 05 JAN 2023
in Amazon QuickSight, Customer Solutions
This is a guest blog post by Mark Taylor, Head of Product at Green Flag, a roadside assistance service for stranded motorists in the UK. Much as calling Triple A is common in the US, Green Flag has become a household name for roadside assistance.
Enhancing Data Exploration with the AWS Analytics Reference Architecture Library
by Nora Adams and Ben Clark
on 05 JAN 2023
in Advanced (300), Amazon EMR, Amazon EMR on EKS, Best Practices, Technical How-to
Organizations leverage their data to tackle complex issues, often beginning with small-scale experiments to refine their solutions. While the potential of experimentation is significant, organizations must remain vigilant regarding the cost-effectiveness of these initiatives. If substantial time is devoted to establishing the necessary infrastructure for experiments, it can inadvertently lead to increased costs.
Implementing Near-Real-Time Fraud Detection with Amazon Redshift Streaming Ingestion
by Richard Thompson, Emily Henderson, and Jake Wilson
on 04 JAN 2023
in Amazon Kinesis, Amazon Machine Learning, Amazon Managed Streaming for Apache Kafka (Amazon MSK), Amazon Redshift, Intermediate (200), Kinesis Data Streams
Data warehouses, and the analytics run on them, have become increasingly vital over the years, with many companies relying on them for both short-term operational decisions and long-term strategic planning. Traditionally, data warehouses are refreshed in batch cycles (monthly, weekly, or daily) to keep the data current and actionable.
Novo Nordisk’s Modern Data Architecture on AWS
by Olivia Carter, Liam Johnson, and Sophia Martinez
on 04 JAN 2023
in Advanced (300), Analytics, AWS Glue, AWS Lake Formation, Customer Solutions, Healthcare
Novo Nordisk, a leading global pharmaceutical firm, is dedicated to producing life-saving medications for over 34 million patients daily. Their commitment to sustainability—environmentally, socially, and financially—is supported by their use of AWS and data.
Convoy’s Data-Driven Decision Making with Amazon QuickSight
by Rachel Lee
on 04 JAN 2023
in Amazon QuickSight, Customer Solutions
Convoy is the leading digital freight network in the United States, moving millions of truckloads through a connected network of carriers. This lowers costs for shippers and improves operational efficiency.