Question: A human chromosome is represented as a long string. Write a C program that counts the occurrences of all words of length 10 in human Chromosome 1:
http://hgdownload.soe.ucsc.edu/goldenPath/hg38/chromosomes/chr1.fa.gz
In this context, a word is a substring of length 10 that starts at any nucleotide. Each nucleotide marks the beginning of a word, so two consecutive words overlap in 9 nucleotides; words are NOT separated by spaces.
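The word definition can be illustrated with a minimal sketch. Because the alphabet has four letters, one possible approach (an assumption, not something the question prescribes) is to read each 10-letter window as a base-4 number between 0 and 4^10 - 1, which later serves as an index into a table of counters. The names `base_code`, `encode_word`, and `count_valid_words` are hypothetical, and this sketch accepts uppercase letters only.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define WORD_LEN 10

/* Map a nucleotide to 0..3, or -1 for any other character (e.g. N). */
static int base_code(char c)
{
    switch (c) {
    case 'A': return 0;
    case 'C': return 1;
    case 'G': return 2;
    case 'T': return 3;
    default:  return -1;
    }
}

/* Encode the WORD_LEN characters starting at s as a base-4 number.
 * Returns 1 and stores the index in *out on success, or 0 if any
 * character is not A, C, G, or T. The caller must guarantee that
 * at least WORD_LEN characters are readable at s. */
static int encode_word(const char *s, uint32_t *out)
{
    uint32_t code = 0;
    for (int i = 0; i < WORD_LEN; i++) {
        int b = base_code(s[i]);
        if (b < 0)
            return 0;
        code = code * 4 + (uint32_t)b;
    }
    *out = code;
    return 1;
}

/* Slide a window of WORD_LEN over seq (length n), one nucleotide at a
 * time, and count the windows that consist of A, C, G, and T only. */
static size_t count_valid_words(const char *seq, size_t n)
{
    size_t valid = 0;
    uint32_t code;

    assert(seq != NULL);
    for (size_t i = 0; i + WORD_LEN <= n; i++)
        if (encode_word(seq + i, &code))
            valid++;
    return valid;
}
```

A sequence of n nucleotides has n - 9 windows, so the 13-letter string `ACGTACGTACGTA` contains 4 words. In `ACGTACGTACNTA` only the first window avoids the `N`, so a single word would be counted.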
The chromosome file must be passed to your program as a command-line argument (look up how C programs receive command-line arguments if needed). Human chromosome sequences may contain additional letters where the identity of a nucleotide could not be determined precisely. Count only words that consist entirely of A, C, G, and T; ignore all other words.
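One way to meet these requirements is sketched below; it is not a reference solution. It rests on several assumptions that the question does not settle: the file has already been decompressed (for example with `gunzip`), FASTA description lines starting with `>` are skipped, line breaks do not interrupt a word, lowercase letters (UCSC uses them for soft-masked repeats) are treated as their uppercase nucleotides, and the output is one tab-separated word and count per line for words that occur at least once. The names `count_words`, `decode_word`, and the `WORDCOUNT_MAIN` macro are hypothetical; the macro only keeps `main` optional so the counting function can be reused or tested separately.

```c
#include <assert.h>
#include <ctype.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

#define WORD_LEN  10
#define NUM_WORDS (1u << (2 * WORD_LEN))   /* 4^10 = 1,048,576 words */
#define WORD_MASK (NUM_WORDS - 1u)

/* Stream a FASTA file and increment counts[] for every valid word.
 * counts must hold NUM_WORDS zero-initialised entries.
 * Returns the total number of words counted. */
static uint64_t count_words(FILE *fp, uint32_t *counts)
{
    uint32_t code = 0;      /* 2 bits per base for the last WORD_LEN bases */
    int valid = 0;          /* consecutive A/C/G/T seen, capped at WORD_LEN */
    int in_header = 0;      /* currently inside a '>' description line */
    int at_line_start = 1;
    uint64_t total = 0;
    int c, b;

    assert(fp != NULL && counts != NULL);
    while ((c = getc(fp)) != EOF) {
        if (c == '\n') { in_header = 0; at_line_start = 1; continue; }
        if (in_header || c == '\r')
            continue;
        if (at_line_start && c == '>') {    /* new record: restart window */
            in_header = 1;
            valid = 0;
            continue;
        }
        at_line_start = 0;
        switch (toupper(c)) {               /* assumption: fold lowercase */
        case 'A': b = 0; break;
        case 'C': b = 1; break;
        case 'G': b = 2; break;
        case 'T': b = 3; break;
        default:  b = -1; break;
        }
        if (b < 0) { valid = 0; continue; } /* e.g. N: words across it are skipped */
        /* Shift the new base in and drop the base that left the window. */
        code = ((code << 2) | (uint32_t)b) & WORD_MASK;
        if (valid < WORD_LEN)
            valid++;
        if (valid == WORD_LEN) { counts[code]++; total++; }
    }
    return total;
}

/* Turn a table index back into its 10-letter word (buf needs WORD_LEN + 1 bytes). */
static void decode_word(uint32_t code, char *buf)
{
    for (int i = WORD_LEN - 1; i >= 0; i--) {
        buf[i] = "ACGT"[code & 3u];
        code >>= 2;
    }
    buf[WORD_LEN] = '\0';
}

#ifdef WORDCOUNT_MAIN
int main(int argc, char *argv[])
{
    if (argc != 2) {
        fprintf(stderr, "usage: %s <chromosome.fa>\n", argv[0]);
        return EXIT_FAILURE;
    }
    FILE *fp = fopen(argv[1], "r");
    if (fp == NULL) { perror(argv[1]); return EXIT_FAILURE; }
    uint32_t *counts = calloc(NUM_WORDS, sizeof *counts);
    if (counts == NULL) { perror("calloc"); fclose(fp); return EXIT_FAILURE; }

    count_words(fp, counts);
    fclose(fp);

    char word[WORD_LEN + 1];
    for (uint32_t i = 0; i < NUM_WORDS; i++) {
        if (counts[i] != 0) {
            decode_word(i, word);
            printf("%s\t%lu\n", word, (unsigned long)counts[i]);
        }
    }
    free(counts);
    return EXIT_SUCCESS;
}
#endif
```

Assuming the code is saved as `wordcount.c` (a hypothetical file name), it could be built with `cc -O2 -DWORDCOUNT_MAIN wordcount.c -o wordcount` and run as `./wordcount chr1.fa`. The rolling 2-bit encoding updates the index in constant time per nucleotide instead of re-reading 10 characters for every word, and the table of 4^10 32-bit counters occupies 4 MiB.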
