Shared Flashcard Set

Details

COMP3017 - Lecture 7 (C) - Distributed Databases
COMP3017 - Lecture 7 (C) - Distributed Databases
21
Computer Science
Undergraduate 2
05/03/2014

Additional Computer Science Flashcards

 


 

Cards

Term
What are the characteristics of WAIT-DIE and WOUND-WAIT?
Definition

Ti requests a lock on a data item that is already locked by Tj

  • The WAIT-DIE rule:
    • IF ts(Ti) < ts(Tj) 
      • THEN Ti waits
      • ELSE Ti dies (abort and restart with same timestamp)
  • The WOUND-WAIT rule:
    • IF ts(Ti) < ts(Tj)
      • THEN Tj is wounded (aborts and restarts with same timestamp)
      • ELSE Ti waits

Note: WOUND-WAIT pre-empts active transactions 

Term
What are some characteristics of Detection and Resolution?
Definition
  1. Study the GWFG for cycles (detection)
  2. Break cycles by aborting transactions (resolution)

Selecting minimum total cost sets of transactions to abort is NP-compelte

 

Three main approaches to deadlock detection:

  • Centralized
  • Hierarchical 
  • Distributed
Term
What are some characteristics of Centralized Deadlock Detection?
Definition
  • One site is designed as the deadlock detector (DD) for the system
  • Each site sends its LWFG (or changes to its LWFG) to the DD at intervals
  • DD constructs the GWFG and looks for cycles
Term
What are some characteristics of Hierarchical Deadlock Detection?
Definition
  • Each site has a DD, which looks in the site's LWFG for cycles
  • Each site sends its LWFG at the next level, which merges the LGFW to the DD at the next level, which merges the LWFGs send to it and looks for cycles
  • These DDs send the merged WFGs to the next level and so on
Term
What are some characteristics of Distributed Deadlock Detection?
Definition
  • Responsibility for detecting deadlocks is delegated to sites
  • LWFGs are modified to show relationships between local transactions and remote transactions
  • LWFG containsa  cycle not involving external edges (local deadlocks, resolve locally)
  • LWFG contains a cycle involving external edges
    • Potential deadlock - communicate to other sites
    • Sites must then agree on a victim transaction to abort
Term
What are some reliability characteristics on distribution and ACID in DDBMSs?
Definition
  • Non distributed databases aim to maintain atomicity and durability of transactions
    • Atomicity: A transaction is either performed completely or not at all
    • Durability: Once a transaction has been committed, changes should not be lost because of failure
  • As with parallel databases, distributed databases use the two phase commit protocol (2PC) to preserve atomicity
Term
What is the main concern of 2PC?
Definition
  • Distinguish between global transaction and local transactions into which the global transaction is decomposed
Term
What are the main phases of 2PC?
Definition
  1. Voting
    • Coordinator sends "prepare T" message to all participants
    • Participants respond with either "vote-commit T" or vote-abort T"
    • Coordinator waits for participants to respond with a timeout period
  2. Decision
    • If all participants return "vote-commit T" (to commit), send "commit T" to all participants. Wait for acknowledgements within timeout period.
    • If any participant returns "vote-abort T", send "abort T" to all participants. Wait for all acknowledgements within timeout period
    • When all acknowledgements received, trnsaction is completed
    • If a site does not acknowledge, resend global decision until it is acknowledged
Term
Show a picture for a normal operation in 2PC
Definition
[image]
Term
Show a picture of a 2pc aborted transaction
Definition

[image]

Term
Draw the state diagram for a coordinator during 2pc
Definition
[image]
Term
Draw the state diagram for a participant during 2pc
Definition
[image]
Term
What are some key principles when dealing with failures in 2PC?
Definition
  • If coordinator or participant fails during commit, two things happen:
    • The other sites will time out while waiting for the next message from the failed site and invoke a termination protocol
    • When the failed site restarts, it tries to work out the state of the commit by invoking a recovery protocol
  • The behaviour of the sites under these protocols depends on the state they were in when the site failed
Term
What is the termination protocol for the coordinator?
Definition
  • Timeout in WAIT
    • Coordinator is waiting for participants to vote on whether they're going to commit or abort
    • A missing vote means that the coordinator cannot commit the global transaction
    • Coordinator may abort the global transaction
  • Timeout in COMMIT/ABORT
    • Coordinator is waiting for participants to acknowledge successful commit or abort
    • Coordinator resends global decision to participants who have not acknowledged
Term
What is the termination protocol for the participant?
Definition
  • Timeout in INITIAL
    • Participant is waiting for a "prepare T"
    • May unilaterally abort the transaction after a timeout
    • If "Prepare T" arrives after the unilateral abort, either:
      • Resend the "volte-abort T" message or
      • ignore (coordinator then times out in WAIT)
  • Timeout in READY
    • Participant is waiting for the instruction to commit or abort - blocked without further information
    • Participant can contact other participants to find one that knows the decision - cooperative termination protocol
Term
What is the recovery protocol for the coordinator?
Definition
  • Failure in INITIAL
    • Commit not yet begun, restart commit procedure
  • Failure in WAIT
    • Coordinator not yet begun, restart commit procedure
    • Recovery restarts commit procedure by resending "prepare T"
  • Failure in COMMIT/ABORT
    • If coordinator has received all "ack" messages, complete successfully
    • Otherwise, terminate
Term
What is the recovery protocol for the participant?
Definition
  • Failure in INITIAL
    • Participant has not yet voted
    • Coordinator cannot have reached a decision
    • Participant should unilaterally abort by sending "vote-abort T"
  • Failure in READY
    • Participant has voted, but doesn't know what the global decision was
    • Cooperative termination protocol
  • Failure in COMMIT/ABORT
    • Resend "Ack" message
Term
Show a diagram of communications between coordinator and participants in centralized 2pc
Definition
[image]
Term
What are the characteristics of linear 2PC?
Definition
  • First phase from the coordinator to the participants
  • Second phase from the participants to the coordinator
  • Participants may unilaterally abort

[image]

Term
Give a comparison of the centralized vs the linear 2pc
Definition
  • Linear 2PC involves fewer messages
  • Centralised 2PC provides opportunities for parallelism
  • Linear 2PC has worse reponse time performance
Term
What are the main characteristics of the CAP theorem?
Definition
  • The Consistency-Availability-PartitionTolerance theorem is an example of the trade-off between safety and liveness in an unreliable system
    • Safety: nothing bad ever happens
    • Liveness: Eventually something good happens
  • We can only manage two of the three form C, A, P (Typically we sacrifice either availability (liveness) or consistency (safety)
  • In any distributed system, there is a tradeoff between:
    • Consistency - Each server always returns the correct response to each request
    • Availability - Each request eventually receives a response
    • Partition tolerance - Communication may be unreliable (messages delayed, lost, servers partitioned into groups that cannot communicate with each other) but the system as a whole should continute to function
Supporting users have an ad free experience!