How to increase the HDFS capacity of an AWS Elastic MapReduce (EMR) cluster

emr hdfs

In this tutorial, we’re going to see how to increase the HDFS capacity of a running EMR cluster. Some time back, we received an alert that HDFS utilization was high on one of our clusters. Upon checking, the usage was expected, but we had under-provisioned the storage capacity during the creation of the cluster and …

AWS EMR Uniform Instance Groups

In this post, I give an overview of AWS EMR uniform instance groups, along with the advantages and caveats of using them. The AWS EMR architecture consists of a master node, core node(s), and task nodes. If you’re new to EMR, refer to https://www.hadoopandcloud.com/aws/amazon-emr/ for a quick introduction. While creating the cluster, you have two configuration options for the nodes - instance …