DNA and Protein Sequences for Analysis
The following DNA sequence contains a gene.
TCTTGAAATCCATTTTTAGCCCAACCAGATCATCCCGCGCGAAGTGCTCGAACGAGGCTT CGCACCCGTCCGGATTGGTCATCCGGTCCGGGATGGGGAACAAGATCATCAATGTTTGTC
CATGGAGGAAAACATGGCGTTTCGACCGCTTCATGATCGTATTCTCGTCCGCCGCGTCGA
GTCCGAAGAGAAGACCAAAGGCGGCATCATCATCCCCGACACTGCCAAGGAGAAGCCCCA
GGAAGGCGAAGTCCTCGCTGTAGGTCCCGGCGCGCGCGGCGAACAGGGTCAGATCCAGCC GCTCGACGTCAAGGTGGGCGACCGCATCCTGTTCGGCAAGTGGTCCGGCACCGAGATCAA GATCGACGGAGAAGATCTCCTGATCATGAAGGAAAGCGATGTCATGGGAATCATCGAGGC CCGGGCGCCGAGAAGATAGCCGCCTGATAACGCGAAGATACAGTCAACAAGCTGCCTATCHere is a protein sequence.
MAQLSLQHIQKIYDNQVHVVKDFNLEIADKEFIVFVAASGCGKSTTLRMIAGLEEISGGDLLIDG
KRMNDVPAKARNIAMVFQNYALYPHMTVYDNMAFGLKMQKIAKEVIDERVNWAAQILGLREYL
KRKPGALSGGQRQRVALGRAIVREAGVFLMDEPLSNLDAKLRVQMRAEISKLHQKLNTTMIYV
THDQTEAMTMATRIVIMKDGIVQQVGAPKTVYNQPANMFVSGFIGSPAMNFIRGTIDGDKFVT
ETLKLTIPEEKLAVLKTQESLHKPIVMGIRPEDIHPDAQEENNISAKISVAELTGAEFMLYTTVG
GTSThe following four related proteins are presented in FASTA format.
>A protein
MNIRPLHDRVIIKREEVETRSAGGIVLTGSAATKSTRAKVLAVGKGRILENGTVQPLDVKVGDVIF
NDGYGVKAEKIDGEEVLIISENDILAIVE
>B protein
MADIKFRPLHDRVVVRRVESEAKTAGGIIIPDTAKEKPQEGEVVAAGAGARDEAGKLVPLDVKAG
DRVLFGKWSGTEVKIGGEDLLIMKESDILGIVG
>C protein
MNIRPLHDRVIVKRKEVETKSAGGIVLTGSAAAKSTRGEVLAVGNGRILENGEVKPLDVKVGDIVI
FNDGYGVKSEKIDNEEVLIMSENDILAIVEA
>D protein
MKLRPLHDRVVIRRSEEETKTAGGIVLPGSAAEKPNRGEVVAVGTGRVLDNGEVRALAVKVGDK
VVFGPYSGSNAIKVDGEELLVMGESEILAVLED