TEXT 24
Parallel & Distributed Guest on 3rd January 2023 02:00:26 PM
  1. Distributed File System
  2. when file is being uploaded, it will select the chunks based on the file size, by default, 64 mb
  3. Master declares the nodes, it determines which node will be made.
  4. Namespace stores the information about the replication of the data
  5. Google file system or GFS is distributed by default.
  6.  
  7. YARN - Yet Another Resource Negotiator
  8.  
  9. MapReduce - Makes the link between the master and the slave. And determines which job to assign to the slave. If a job is
  10. assigned to slave, the slave itself informs about the progress of the work through MPI.
  11.  
  12. Queries are run in Hive in Hadoop. SQL + HIVE = HQL
  13.  
  14. Hadoop is an eco-system.

Coding Base is for source code and general debugging text.

Login or Register to edit, delete and keep track of your pastes and more.

Raw Paste

Login or Register to edit or fork this paste. It's free.