Member-only story

What is Amazon EMR MindMap

Prem Vishnoi(cloudvala)
3 min readJun 10, 2024

Amazon EMR (Elastic MapReduce) is a cloud-based big data platform provided by Amazon Web Services (AWS) that allows users to process and analyse large amounts of data quickly and cost-effectively using AWS.

It simplified running big data frameworks such as Apache Hadoop and Apache Spark on the AWS cloud.

Key feature for EMR

Big Data Processing:

  • Amazon EMR enables you to process large amount of data using distributed computing frameworks.
  • It supports frameworks like eg Apache Hadoop, Apache Spark, Apache HBase, Apache Flink, Apache Hudi, and Presto and many more.

Scalability:

  • EMR can scale to handle data processing tasks ranging from GB to PB. You can easily add or remove instances to meet your processing needs of your demand

Cost-Effective:

  • EMR is designed to be cost-efficient, allowing you to pay only for the resources you use.
  • You can optimize costs further by using spot instances and reserved instances (avoid use of on demand instances)

--

--

Prem Vishnoi(cloudvala)
Prem Vishnoi(cloudvala)

Written by Prem Vishnoi(cloudvala)

Head of Data and ML experienced in designing, implementing, and managing large-scale data infrastructure. Skilled in ETL, data modeling, and cloud computing

No responses yet