Member-only story
What is Amazon EMR MindMap
3 min readJun 10, 2024
Amazon EMR (Elastic MapReduce) is a cloud-based big data platform provided by Amazon Web Services (AWS) that allows users to process and analyse large amounts of data quickly and cost-effectively using AWS.
It simplified running big data frameworks such as Apache Hadoop and Apache Spark on the AWS cloud.
Key feature for EMR
Big Data Processing:
- Amazon EMR enables you to process large amount of data using distributed computing frameworks.
- It supports frameworks like eg Apache Hadoop, Apache Spark, Apache HBase, Apache Flink, Apache Hudi, and Presto and many more.
Scalability:
- EMR can scale to handle data processing tasks ranging from GB to PB. You can easily add or remove instances to meet your processing needs of your demand
Cost-Effective:
- EMR is designed to be cost-efficient, allowing you to pay only for the resources you use.
- You can optimize costs further by using spot instances and reserved instances (avoid use of on demand instances)