Analyze

Key Features

  • Processing and generating large data sets.
  • Exploits a restricted programming model to parallelize the user program automatically and to provide transparent fault-tolerance.

Details

By distributing the workload to many machines and let them execute the tasks in parallel.

Specifically, input files are split into $M$ pieces and master assign each one to an idle worker. Worker process it with the map function and save each key/value pair to one of the $R$ files according to the partioning function. When finishing processing all $M$ pieces, $R$ reduce workers will read data from the corresponding intermediate files, process it with the reduce function and save to output file eventually.

image-20231029205513215

Limit

Task has to fit in the form of map and reduce.

Reliability

It can tolerant both worker and master failures.

How

The master will send heartbeat to the workers, a worker will be marked as failed when there is no response received by the master in a certain amount of time, and its task will be rescheduled.

MapReduce is resilient to large-scale worker failures, if there are workers alive, the progress can be prompted.

However, if master fails, Google’s choice is to abort the MapReduce computation. To achieve reliability, we can store the state on ZooKeeper or etcd, or just write periodic checkpoints to local disk.

When the master has seen more than one failure on a particular record, it indicates that the record should be skipped when it issues the next re-execution of the corresponding Map or Reduce task.

Performance

High

Several Tricks

Backup Tasks

When a MapReduce operation is close to completion, the master schedules backup executions of the remaining in-progress tasks.

Locality

Network bandwidth is a relatively scarce resource. When running large MapReduce operations on a significant fraction of the workers in a cluster, most input data is read locally and consumes no network bandwidth.

Scalability

High.

More machines, better performance.

References