Question: We have a shared-nothing system with 4 processing nodes (PNs). The TIJ algorithm is used to join two relations R
We have a shared-nothing system with 4 processing nodes (PNs). The TIJ algorithm is used to join
two relations, R and S. R has pages and S has pages. The two relations are distributed
among the PNs: PN has of the R and S tuples; PN, PN, and PN each have of the tuples.
Hashing R and S results in bucket skew: the first of the hash buckets together have of the
tuples. Each of the remaining three has of the tuples.
Each PN can access only one page from its disks at a time, i.e., there is no parallel I/O within a PN.
Computation and communication times are negligible, i.e., the analysis is based on I/O costs.
Assume that the joins of the bucket pairs do not encounter memory overflow, i.e., at least one of the
operands fits in memory. Estimate the computation costs, in terms of number of I/Os, for each
of the four phases using the following table. What is the total cost of this join operation?
Now calculate the reading cost (number of disk accesses) and the writing cost (number of disk accesses) for Hashing, Partition Tuning, and Bucket Tuning. What is the reading cost of the Join? Please explain the calculation.
Hint: use the Grace algorithm calculation.
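The Grace-style calculation that the hint refers to can be sketched in Python. This is only a rough model, not the TIJ algorithm itself: it leaves out the Partition Tuning and Bucket Tuning phases, and it assumes that each PN reads its local fragment once during hashing, that each PN writes the buckets assigned to it once, and that each PN reads those buckets once during the join. The page counts and percentages in the example are hypothetical placeholders, since the question's actual figures are not shown here; the function names are likewise made up for illustration.

```python
def pages_for(total_pages, percent):
    """Pages held by a node that owns `percent` % of a relation (rounded up)."""
    return -(-total_pages * percent // 100)


def grace_io_per_node(r_pages, s_pages, data_pct, bucket_pct):
    """Per-node I/O counts under a simplified Grace hash join model.

    data_pct[i]   : % of R and S tuples initially stored on node i (assumed)
    bucket_pct[i] : % of R and S tuples hashed into node i's buckets (assumed)
    """
    costs = []
    for d, b in zip(data_pct, bucket_pct):
        local = pages_for(r_pages, d) + pages_for(s_pages, d)
        bucketed = pages_for(r_pages, b) + pages_for(s_pages, b)
        costs.append({
            "hash_read": local,      # read the local fragments of R and S
            "hash_write": bucketed,  # write the buckets received by this node
            "join_read": bucketed,   # read the buckets back to join them
        })
    return costs


def elapsed_per_phase(costs):
    """Nodes run in parallel with no parallel I/O inside a node, so each
    phase takes as long as its most heavily loaded node."""
    return {phase: max(c[phase] for c in costs) for phase in costs[0]}


# Hypothetical inputs, NOT the values from the question.
example = grace_io_per_node(
    r_pages=1000, s_pages=2000,
    data_pct=[40, 20, 20, 20],
    bucket_pct=[40, 20, 20, 20],
)
phase_cost = elapsed_per_phase(example)
total_cost = sum(phase_cost.values())
```

Under this model the skewed node dominates every phase, which is the imbalance that the tuning phases of TIJ are meant to reduce; once the real page counts and percentages are known they can be substituted for the placeholders.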
