TEXT 19
Cloud Computing 2 Guest on 18th May 2023 10:59:54 AM
  1. Distribution Models
  2. Oriented - Makes the distribution of data easier
  3.  
  4. Distributed Memory - Assign the memory using the network
  5. Shared Memory - The memory which is assigned to all system using the same memory
  6.  
  7. Sharding - Distributing the data across multiple servers. No data redundancy
  8. Replication: The data is copied data across multiple servers
  9. Master-server replication
  10. Peer to peer replication
  11.  
  12. Master-slave replication will reduce chance of update conflicts.
  13. Peer to peer avoid loading writes onto a single server
  14.  
  15. Master-slave is no code
  16. Salesforce is utilizing both the models
  17.  
  18. CAP = Consistency, Availability, Partition
  19.  
  20. MTTR - MAIN TIME TO REPAIR - How long did it take to repair
  21. MTBF - MAIN TIME BETWEEN FAILURES - How long did it stay up between the failures
  22.  
  23. MTTR is inversely proportional to MTBF.
  24.  
  25. This is the availability of the cloud.
  26.  
  27. Partition Tolerance - Atomicity of the transaction - Threads that can be used in the tasks.
  28.  
  29. Three variables, r, w, n
  30. r = read, w = write, n = replication factor
  31.  
  32. Apache Hadoop & Spark
  33. The database which can handle the growth of large data
  34.  
  35. Hadoop Ecosystem - Scalable, Fault Tolerance, Handle Variety of Data
  36.  
  37. Hadoop Distributed File System - It uses the master-slave design.
  38.  
  39. Master Node - Stores and manages the meta data
  40. Slave Node - Storing the data
  41.  
  42.  
  43. HBase  
  44. Horizontal Scalable - Ability to add more resources from pool
  45.  
  46. HBase not good for relational database model, like transactional applications, data analytics.
  47. It is not good for text processing.
  48.  
  49. MapReduce - it is a simple programming for Hadoop ecosystem.
  50. Runs the code in parallel.
  51.  
  52. Spark uses memory/caching instead of disk for data sharing, whereas MapReduce uses hard drive to share the data.
  53. MapReduce - Data is stored in hard drive, so it is better with fault tolerance.

Coding Base is for source code and general debugging text.

Login or Register to edit, delete and keep track of your pastes and more.

Raw Paste

Login or Register to edit or fork this paste. It's free.