Question: Suffix tree, suffix array and BWT: A). Show the suffix tree for string actgctcggct. B). Compute the suffix array for the same string. C). Compute
Suffix tree, suffix array and BWT:
A). Show the suffix tree for string actgctcggct.
B). Compute the suffix array for the same string.
C). Compute the BWT transformation for the same string.
D). Assume that an implementation of suffix tree for DNA sequences uses 20 bytes per base. Estimate the memory requirement for a suffix tree representation of the human genome, which has 3 billion bases.
E). Estimate the memory requirement for a suffix array representation of the human genome.
F). Estimate the memory requirement for a BWT representation of the human genome.
G). Considering that we are indexing a large database of strings with 3 billions characters, where each character is an integer in the range of 1 to 216. Does this change the memory requirement of suffix array and BWT? What impact it might have on suffix trees?
Step by Step Solution
There are 3 Steps involved in it
Sure lets tackle each part systematically A Building the Suffix Tree for the String actgctcggct A suffix tree for a given string is a compressed trie of all the suffixes of the string Heres how you ca... View full answer
Get step-by-step solutions from verified subject matter experts
