GENSCAN 1.0 Date run: 15-Jan-104 Time: 04:52:02 Sequence KRT5_1 : 8282 bp : 49.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 102 274 173 0 2 62 91 129 0.679 10.19 tttgcagaggggatctcaaagggccctccctaaagcagctggcagattgtcatgccagct gcctgggcccacaggcctgctctggctagcctttctggcagGACAATCCATAGACTTAGAATCCCGTTTTCCATTGCCTTGGCCCTGAAGGAGGCTGTGGG CTACAGCATAGAGAAAGCAGAGCAAGTCCCGCTGCTTGGAACAGGGTGGTTCGCCGTTAT AACAGGCCGTCAGACTCTCAACATGTTCCTGATGAGATTAGAAGCTCCTGCAG gtatgtgtttgtggcggccttcagcctgtatcaacacatacgatgactcatttcttccct agtggaatagagcttgctggaacacacctggggggctggggaaccggcagagtagctacc cccaaagagagacgctatagcccatgagtctcagggtttttttttaagggattgcatggg 1.02 Intr + 967 1147 181 2 1 54 105 67 0.371 3.93 caacctgctgcctggaaaagtgtaagagcagatcactggggaatcgtttgccccccgctg atggacagcttccccaagctggaagggcaggtgctcagcatgtaccgtactgggatggtt gtcaatactcctggtcctgtaagagtccccaggacactggccatgccaatgccccctcag TTCCTGGCATCCTTTTTGGGCTGCTCACAGCCCCCAGCCTCTATGGTGAAGACATACTTG CTAGCAGCGTCACCAACTTGCTGCCAAGAGATCAGTGCTGCAAGGCAAGGTTATTTCTAA CTGAGCAGAGCCTGCCAGGAAGAAAGCGTTTGCACCCCACACCACTGTGCAGGTGTGACC G gtgagctcacagctgccccccaggcatgcccagcccacttaatcattcacagctcgacag ctctctcgcccagcccagttctggaagggataaaaagggggcatcaccgttcctgggtaa cagagccaccttctgcgtcctgctgagctctgttctctccagcacctcccaacccactag 1.03 Intr + 1456 1918 463 1 1 81 105 556 0.585 49.76 ccttctgcgtcctgctgagctctgttctctccagcacctcccaacccactagtgcctggt tctcttgctccaccaggaacaagccaccatgtctcgccagtcaagtgtgtccttccggag cgggggcagtcgtagcttcagcaccgcctctgccatcaccccgtctgtctcccgcaccag CTTCACCTCCGTGTCCCGGTCCGGGGGTGGCGGTGGTGGTGGCTTCGGCAGGGTCAGCCT TGCGGGTGCTTGTGGAGTGGGTGGCTATGGCAGCCGGAGCCTCTACAACCTGGGGGGCTC CAAGAGGATATCCATCAGCACTAGTGGTGGCAGCTTCAGGAACCGGTTTGGTGCTGGTGC TGGAGGCGGCTATGGCTTTGGAGGTGGTGCCGGTAGTGGATTTGGTTTCGGCGGTGGAGC TGGTGGTGGCTTTGGGCTCGGTGGCGGAGCTGGCTTTGGAGGTGGCTTCGGTGGCCCTGG CTTTCCTGTCTGCCCTCCTGGAGGTATCCAAGAGGTCACTGTCAACCAGAGTCTCCTGAC TCCCCTCAACCTGCAAATCGACCCCAGCATCCAGAGGGTGAGGACCGAGGAGCGCGAGCA GATCAAGACCCTCAACAATAAGTTTGCCTCCTTCATCGACAAG gtgagctacgatcttttgtaaaaaatcactgtgggtctgaaataaatgccaaagagagag aaagaaggaagatgttttggctttgtgcaaatacttttatagttgtacagttctgtgttt ccgtttgtttctgtgccctggatttgtggacacgttctgaattagactggcagctgggaa 1.04 Intr + 2500 2714 215 0 2 93 89 414 0.999 39.51 ctccccctggaaaagtgagtttgggtgcctcagtctgcacctcccctcctggggcccagg gccaggcacagtgcacagaaaagctttagagggacggaaagaggtgggaggcaccttagt gagttgatcatagaacatgaaatccaagtttctctatcttcaaaccctgctcccctccag GTGCGGTTCCTGGAGCAGCAGAACAAGGTTCTGGACACCAAGTGGACCCTGCTGCAGGAG CAGGGCACCAAGACTGTGAGGCAGAACCTGGAGCCGTTGTTCGAGCAGTACATCAACAAC CTCAGGAGGCAGCTGGACAGCATCGTGGGGGAACGGGGCCGCCTGGACTCAGAGCTGAGA AACATGCAGGACCTGGTGGAAGACTTCAAGAACAA gtgagttggggtggagggtggacacaggggagggtggtgtcttcttggtaccagatgggc tttgttactagtctactaccatgccttcctttggggctgggaggatataccttccatgga cacctccagtactaaaacaaacaaacaaatacacaagccaggctcatgtttagaaagttc 1.05 Intr + 3497 3557 61 2 1 98 100 102 0.986 11.14 agtatttaaatggggagggaccttggccagaggttcatgctaccagtgtcccctccctct ctgcctatgaggagaagccccttcccactgcaaaagtaggcttacagttgactgcataat aaagaccccttacctgttgctcatcctctgacaaacacttcccatccctttcaccaccag GTATGAGGATGAAATCAACAAGCGTACCACTGCTGAGAATGAGTTTGTGATGCTGAAGAA G gtgcgtgtgggtgggagagaaccagcagcctgcagctatgctctctaagcgtggagctca cttgagtagggtgacggtgtgtgcagtgccaatcatccctgtttccccaggatgtagatg ctgcctacatgaacaaggtggagctggaggccaaggttgatgcactgatggatgagatta 1.06 Intr + 3668 3763 96 1 0 83 95 193 0.982 19.71 caccaccaggtatgaggatgaaatcaacaagcgtaccactgctgagaatgagtttgtgat gctgaagaaggtgcgtgtgggtgggagagaaccagcagcctgcagctatgctctctaagc gtggagctcacttgagtagggtgacggtgtgtgcagtgccaatcatccctgtttccccag GATGTAGATGCTGCCTACATGAACAAGGTGGAGCTGGAGGCCAAGGTTGATGCACTGATG GATGAGATTAACTTCATGAAGATGTTCTTTGATGCG gtaagaaacttatctaaatttttcacatgggtgggttttttttcctcaagttgtcatatc acaatgagtgaaagatttgaatggaactgacattaaataacacaacacagaaccagatga ccgactccaaatctccctgcaggagctgtcccagatgcagacgcatgtctctgacacctc 1.07 Intr + 3906 4070 165 2 0 80 98 314 0.998 31.76 tggatgagattaacttcatgaagatgttctttgatgcggtaagaaacttatctaaatttt tcacatgggtgggttttttttcctcaagttgtcatatcacaatgagtgaaagatttgaat ggaactgacattaaataacacaacacagaaccagatgaccgactccaaatctccctgcag GAGCTGTCCCAGATGCAGACGCATGTCTCTGACACCTCAGTGGTCCTCTCCATGGACAAC AACCGCAACCTGGACCTGGATAGCATCATCGCTGAGGTCAAGGCCCAGTATGAGGAGATT GCCAACCGCAGCCGGACAGAAGCCGAGTCCTGGTATCAGACCAAG gtgggtgctctgatgactgtctcctgagtgaggggacacacccatgtctaggattctagg ccatgacgacactaagaatggggctcctggaacctgctgaagcccattgggatgctttat aggaatggctgtgtagacaatttccatctaaacccaaggacctgagaaataatctcactc 1.08 Intr + 4428 4553 126 2 0 27 91 208 0.997 15.88 ctctgaatctcacaccagtgtctccaggagagcaaaggcactagactagctcagccttgg aaaagcctgaaccagccccacactatttgccaccacttagtactcactgcctgtgaactt tgggaaagttcttcccattctcatataaacagaatctttcctctttcctggctgcatcag TATGAGGAGCTGCAGCAGACAGCTGGCCGGCATGGCGATGACCTCCGCAACACCAAGCAT GAGATCTCTGAGATGAACCGGATGATCCAGAGGCTGAGAGCCGAGATTGACAATGTCAAG AAACAG gtagggtgagattgaaagagggcaaggaaggggcctgagttctaaaagaaacacctactt tgttttattttgttttacttttgctaaacacacgcagctagatccagtaccagtgtttca gtgtcctgccacccacgatgtactggtttctctctgggattcatgatagtttggtttgtc 1.09 Intr + 4803 5023 221 2 2 79 109 591 0.994 57.40 ttgttttacttttgctaaacacacgcagctagatccagtaccagtgtttcagtgtcctgc cacccacgatgtactggtttctctctgggattcatgatagtttggtttgtctgacccaga aactcagaaggagacacctcaggtcccaagcaaccccactctcctcctttctatctgtag TGCGCCAATCTGCAGAACGCCATTGCGGATGCCGAGCAGCGTGGGGAGCTGGCCCTCAAG GATGCCAGGAACAAGCTGGCCGAGCTGGAGGAGGCCCTGCAGAAGGCCAAGCAGGACATG GCCCGGCTGCTGCGTGAGTACCAGGAGCTCATGAACACCAAGCTGGCCCTGGACGTGGAG ATCGCCACTTACCGCAAGCTGCTGGAGGGCGAGGAATGCAG gtgagtagacagcatgaactcaagatggccttcagctgataaagcgaagctgctctactg tggggtgtacaacacacatacatgagatcagtgacttgtgcgtgataatgacacatcatc aacactatttcagtctgactcatggccatatagctgacctcaactcacttttctggtctc 1.10 Intr + 5828 5862 35 2 2 133 84 41 0.998 6.07 ccctcttcattggaaaatccctctggagagttctcccttcctttaacttaagcagctttt gggtgtacagactcctggcttatggaatgaactcgaatcatgaggatgggagttagccac atagactaatgctgtctttttgggagctgttaacccttaattcaatttttcccctttcag ACTCAGTGGAGAAGGAGTTGGACCAGTCAACATCT gtaagtagctttgaacagacattaacaacgacaataatatgggatatatttagtgccaac tcagaattctgctgtttctagatccaaacttttcccatcccagcatatggttatttataa taatacacttagtaagttgtgggtggtggaggggaaggacagattgggacaggaagcaat 1.11 Term + 6420 6718 299 1 2 141 48 147 0.972 11.43 cctcttacactcacccacttttttagggaccttaattaaatgacagttcttccgggcctt gtttgctactctgtaaagggggtccagtagagtgctccaacaccagcagatcaaataaat gggccatgcaggatcagcctggcagatggtctcactgagtcctccctcctttccctgcag CTGTTGTCACAAGCAGTGTTTCCTCTGGATATGGCAGTGGCAGTGGCTATGGCGGTGGCC TCGGTGGAGGTCTTGGCGGCGGCCTCGGTGGAGGTCTTGCCGGAGGTAGCAGTGGAAGCT ACTACTCCAGCAGCAGTGGGGGTGTCGGCCTAGGTGGTGGGCTCAGTGTGGGGGGCTCTG GCTTCAGTGCAAGCAGTGGCCGAGGGCTGGGGGTGGGCTTTGGCAGTGGCGGGGGTAGCA GCTCCAGCGTCAAATTTGTCTCCACCACCTCCTCCTCCCGGAAGAGCTTCAAGAGCTAA gaacctgctgcaagtcactgccttccaagtgcagcaacccagcccatggagattgcctct tctaggcagttgctcaagccatgttttatccttttctggagagtagtctagaccaagcca attgcagaaccacattctttggttcccaggagagccccattcccagcccctggtctcccg Predicted peptide sequence(s): >KRT5_1|GENSCAN_predicted_peptide_1|678_aa XTIHRLRIPFSIALALKEAVGYSIEKAEQVPLLGTGWFAVITGRQTLNMFLMRLEAPAVP GILFGLLTAPSLYGEDILASSVTNLLPRDQCCKARLFLTEQSLPGRKRLHPTPLCRCDRF TSVSRSGGGGGGGFGRVSLAGACGVGGYGSRSLYNLGGSKRISISTSGGSFRNRFGAGAG GGYGFGGGAGSGFGFGGGAGGGFGLGGGAGFGGGFGGPGFPVCPPGGIQEVTVNQSLLTP LNLQIDPSIQRVRTEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVR QNLEPLFEQYINNLRRQLDSIVGERGRLDSELRNMQDLVEDFKNKYEDEINKRTTAENEF VMLKKDVDAAYMNKVELEAKVDALMDEINFMKMFFDAELSQMQTHVSDTSVVLSMDNNRN LDLDSIIAEVKAQYEEIANRSRTEAESWYQTKYEELQQTAGRHGDDLRNTKHEISEMNRM IQRLRAEIDNVKKQCANLQNAIADAEQRGELALKDARNKLAELEEALQKAKQDMARLLRE YQELMNTKLALDVEIATYRKLLEGEECRLSGEGVGPVNISVVTSSVSSGYGSGSGYGGGL GGGLGGGLGGGLAGGSSGSYYSSSSGGVGLGGGLSVGGSGFSASSGRGLGVGFGSGGGSS SSVKFVSTTSSSRKSFKS