1. Write a regular expression that replaces the cumbersome header lines of a FASTA-formatted sequence file from GenBank, e.g.,
>gi|63028387|gb|AAY27075.1| Pax6[Oikopleura dioica]
with the more elegant and tractable:
>_AAY27075_Pax6_Oikopleura_dioica
while retaining the sequences themselves.
Below are examples of four FASTA-formatted protein sequence files for Pax6 to practice on.
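
One possible approach, sketched in Python: capture the accession (without its version suffix), the protein name, and the bracketed organism name, then join them with underscores. The pattern assumes every header follows the layout of the example above (`>gi|number|database|accession.version| name[Genus species]`). The names `HEADER_RE`, `_rewrite`, and `simplify_headers` are made up for this illustration, and headers that do not fit the assumed layout are left unchanged.

```python
import re

# Assumed header layout (taken from the single example in the question):
#   >gi|<number>|<db>|<accession>.<version>| <name>[<Genus> <species>]
HEADER_RE = re.compile(
    r"^>gi\|\d+\|[^|\n]+\|"      # ">gi|63028387|gb|"
    r"([^|.\n]+)(?:\.\d+)?\|"    # accession, dropping the ".1" version suffix
    r"[ \t]*(.*?)[ \t]*"         # protein name, e.g. "Pax6"
    r"\[([^\]\n]+)\][ \t]*$",    # organism inside square brackets
    re.MULTILINE,                # let ^ and $ match at every line boundary
)


def _rewrite(match):
    """Build the short header from the three captured groups."""
    accession, name, organism = match.groups()
    # Split on whitespace so multi-word names also end up underscore-joined.
    parts = [accession] + name.split() + organism.split()
    return ">_" + "_".join(parts)


def simplify_headers(fasta_text):
    """Rewrite matching header lines; sequence lines never match and pass through."""
    return HEADER_RE.sub(_rewrite, fasta_text)


if __name__ == "__main__":
    # The sequence line is a made-up placeholder, not real Pax6 data.
    sample = (
        ">gi|63028387|gb|AAY27075.1| Pax6[Oikopleura dioica]\n"
        "MKTAYIAK\n"
    )
    print(simplify_headers(sample), end="")
```

Because the pattern is anchored to lines that start with `>gi|`, the sequence lines are retained as they are. To process a file, read its contents into a string, pass it to `simplify_headers`, and write the result back out.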
