Question:
Please write a function called calculateGCContent that accepts a single argument: the name of a text file. This text file will be in FASTA format. Each entry in the FASTA file represents the DNA sequence of a chromosome. Each entry starts with a ">" character and may be split across multiple lines in the file. Your function should calculate the overall proportion of nucleotides that are "G" or "C" nucleotides across all chromosomes. Please round your answer to two decimal places.
Test code (to be executed after your solution):
print(calculateGCContent("DNA1.fa"))
print(calculateGCContent("DNA2.fa"))
FILE 1:
>chr1
AGTGTCCTTAAGACTTAAGATAACGCCTCTCGTAGACTCG
ACAAAGTCGACTCCGCCTGCTCAAATCGACACCTTGGTGA
>chr2
TCACAACAATGAGACGTGGCCAGAAGTACGACATTGTGAA
GTGTAGGTCTGAAAACATTATCCGCATCAGGCCT
>chr3
CCAGAAGCTACAGGCTTGAGTTGCCTGGAGATTCAGTTTG
GCGAGCATCCCACCAGTGCTTACTAGCATACCGTCTGTAT
CCTAGTGCATGAG
>chr4
GCAAAAAACATTTTTCCGCTAACTAACTACTTGCGACGCG
TGAACCATAAGCGGAGATGTACCTC
>chr5
AAACAGTTATCTGTTAACCCCCCGTCTTTTTACCGGCCGC
CGGAACGCAACTAACCCCTGTCGGAGCCCAAAGATCGTCA
CCGTGCCTCCTAAGGCGGTGGCACTACGGAAAGTC
FILE 2:
>chr1
TACCTCCAGGGGGCATCCTCCCCCCCAATTCGAAACACAT
CGTAGCCCCTGGCACTACCTATGTGTGTCAATTCGGAGAA
GAGAGATTCACGAAAAAAAAGTCTGGACTCAACTAGGATC
ACACATTCGGCTACAGATACCAAAAAAAAAAAAAAAA
>chr2
CTCGTCTCAGCGAAGTATAGGAGACACCCCAGTATTTGTG
CGAAACAGTTATAAGAGGAACGCGACTACGACCTAGAGTG
TAAGCCCGGTTAGTCATAATATGCAGAAGAGAATCAATTG
ATTCCGGGCTTCTTCTTTAATAACATGAGTGGTTCCATAG
ATATAGGTCCGCCTAGGTTAACTCGCCAGTAGAGAGCAAC
GTATGGCAGCACATTCCGATCTTGATCAACGGCCTGCCGA
ACTTCCGACCAGTACGATGCTAGTTCAAATCCCCAGGTGG
TATAGTCTTATCTGCACGTCAAA
>chr3
AAATTTTCACCATTGAGGCACCACCTTCTCGTCGCTGCGC
GCTCTGCTCGCTTCGGCTAAAAATTCGCGCAATACATTCG
CTACAGATACCAAA
>chr4
GTCTCTGCTGTCCGGTAGGCAGGGCTTTGACTGCTGGATG
GTGAATTACATGCCCGATTGTACTCACGTCCACTCAAGCG
CTTGAAAGACTGGGCAATATGATCCCGTATATTGCGTAAT
CTCGGTGGGTCAGGACTCCTAAAATGCTCATGGTAAGTTG
TAGACGGGGGAAATCCTACAACCACCC
>chr5
CTATCTTACCAGGAGCTAGCTTACTAGCCCTTTAAAGGAA
AAATTGGCACGCGGAGACAGACAG
OUTPUT:
0.5
0.47
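
One possible approach to the question, offered as a minimal sketch rather than a reference answer: read the file line by line, skip every header line that starts with ">", and keep running totals of G/C characters and of all sequence characters. The sketch makes a few assumptions that the question does not state: lowercase letters are treated like uppercase ones, every non-header character (including ambiguity codes such as "N") counts toward the total, and a file with no sequence data returns 0.0.

```python
def calculateGCContent(filename):
    """Return the overall proportion of G/C nucleotides in a FASTA file,
    rounded to two decimal places."""
    gc_count = 0      # number of G or C characters seen so far
    total_count = 0   # number of sequence characters seen so far

    with open(filename) as fasta:
        for line in fasta:
            line = line.strip()
            # Skip blank lines and header lines such as ">chr1"
            if not line or line.startswith(">"):
                continue
            # Assumption: treat lowercase bases the same as uppercase
            seq = line.upper()
            gc_count += seq.count("G") + seq.count("C")
            total_count += len(seq)

    # Assumption: avoid division by zero for a file with no sequence data
    if total_count == 0:
        return 0.0
    return round(gc_count / total_count, 2)
```

Because an entry may be split across multiple lines, the sketch never needs to reassemble whole chromosomes; summing over lines gives the same overall proportion. Returning a rounded float (instead of a formatted string) is consistent with the expected output showing 0.5 rather than 0.50. To try it, save the two file listings as DNA1.fa and DNA2.fa in the working directory and run the test code from the question.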
