GENSCAN 1.0	Date run: 15-Jan-104	Time: 04:52:02

Sequence KRT5_1 : 8282 bp : 49.64% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    102    274  173  0  2   62   91   129 0.679  10.19
tttgcagaggggatctcaaagggccctccctaaagcagctggcagattgtcatgccagct
gcctgggcccacaggcctgctctggctagcctttctggcagGACAATCCATAGACTTAGAATCCCGTTTTCCATTGCCTTGGCCCTGAAGGAGGCTGTGGG
CTACAGCATAGAGAAAGCAGAGCAAGTCCCGCTGCTTGGAACAGGGTGGTTCGCCGTTAT
AACAGGCCGTCAGACTCTCAACATGTTCCTGATGAGATTAGAAGCTCCTGCAG
gtatgtgtttgtggcggccttcagcctgtatcaacacatacgatgactcatttcttccct
agtggaatagagcttgctggaacacacctggggggctggggaaccggcagagtagctacc
cccaaagagagacgctatagcccatgagtctcagggtttttttttaagggattgcatggg

 1.02 Intr +    967   1147  181  2  1   54  105    67 0.371   3.93
caacctgctgcctggaaaagtgtaagagcagatcactggggaatcgtttgccccccgctg
atggacagcttccccaagctggaagggcaggtgctcagcatgtaccgtactgggatggtt
gtcaatactcctggtcctgtaagagtccccaggacactggccatgccaatgccccctcag
TTCCTGGCATCCTTTTTGGGCTGCTCACAGCCCCCAGCCTCTATGGTGAAGACATACTTG
CTAGCAGCGTCACCAACTTGCTGCCAAGAGATCAGTGCTGCAAGGCAAGGTTATTTCTAA
CTGAGCAGAGCCTGCCAGGAAGAAAGCGTTTGCACCCCACACCACTGTGCAGGTGTGACC
G
gtgagctcacagctgccccccaggcatgcccagcccacttaatcattcacagctcgacag
ctctctcgcccagcccagttctggaagggataaaaagggggcatcaccgttcctgggtaa
cagagccaccttctgcgtcctgctgagctctgttctctccagcacctcccaacccactag

 1.03 Intr +   1456   1918  463  1  1   81  105   556 0.585  49.76
ccttctgcgtcctgctgagctctgttctctccagcacctcccaacccactagtgcctggt
tctcttgctccaccaggaacaagccaccatgtctcgccagtcaagtgtgtccttccggag
cgggggcagtcgtagcttcagcaccgcctctgccatcaccccgtctgtctcccgcaccag
CTTCACCTCCGTGTCCCGGTCCGGGGGTGGCGGTGGTGGTGGCTTCGGCAGGGTCAGCCT
TGCGGGTGCTTGTGGAGTGGGTGGCTATGGCAGCCGGAGCCTCTACAACCTGGGGGGCTC
CAAGAGGATATCCATCAGCACTAGTGGTGGCAGCTTCAGGAACCGGTTTGGTGCTGGTGC
TGGAGGCGGCTATGGCTTTGGAGGTGGTGCCGGTAGTGGATTTGGTTTCGGCGGTGGAGC
TGGTGGTGGCTTTGGGCTCGGTGGCGGAGCTGGCTTTGGAGGTGGCTTCGGTGGCCCTGG
CTTTCCTGTCTGCCCTCCTGGAGGTATCCAAGAGGTCACTGTCAACCAGAGTCTCCTGAC
TCCCCTCAACCTGCAAATCGACCCCAGCATCCAGAGGGTGAGGACCGAGGAGCGCGAGCA
GATCAAGACCCTCAACAATAAGTTTGCCTCCTTCATCGACAAG
gtgagctacgatcttttgtaaaaaatcactgtgggtctgaaataaatgccaaagagagag
aaagaaggaagatgttttggctttgtgcaaatacttttatagttgtacagttctgtgttt
ccgtttgtttctgtgccctggatttgtggacacgttctgaattagactggcagctgggaa

 1.04 Intr +   2500   2714  215  0  2   93   89   414 0.999  39.51
ctccccctggaaaagtgagtttgggtgcctcagtctgcacctcccctcctggggcccagg
gccaggcacagtgcacagaaaagctttagagggacggaaagaggtgggaggcaccttagt
gagttgatcatagaacatgaaatccaagtttctctatcttcaaaccctgctcccctccag
GTGCGGTTCCTGGAGCAGCAGAACAAGGTTCTGGACACCAAGTGGACCCTGCTGCAGGAG
CAGGGCACCAAGACTGTGAGGCAGAACCTGGAGCCGTTGTTCGAGCAGTACATCAACAAC
CTCAGGAGGCAGCTGGACAGCATCGTGGGGGAACGGGGCCGCCTGGACTCAGAGCTGAGA
AACATGCAGGACCTGGTGGAAGACTTCAAGAACAA
gtgagttggggtggagggtggacacaggggagggtggtgtcttcttggtaccagatgggc
tttgttactagtctactaccatgccttcctttggggctgggaggatataccttccatgga
cacctccagtactaaaacaaacaaacaaatacacaagccaggctcatgtttagaaagttc

 1.05 Intr +   3497   3557   61  2  1   98  100   102 0.986  11.14
agtatttaaatggggagggaccttggccagaggttcatgctaccagtgtcccctccctct
ctgcctatgaggagaagccccttcccactgcaaaagtaggcttacagttgactgcataat
aaagaccccttacctgttgctcatcctctgacaaacacttcccatccctttcaccaccag
GTATGAGGATGAAATCAACAAGCGTACCACTGCTGAGAATGAGTTTGTGATGCTGAAGAA
G
gtgcgtgtgggtgggagagaaccagcagcctgcagctatgctctctaagcgtggagctca
cttgagtagggtgacggtgtgtgcagtgccaatcatccctgtttccccaggatgtagatg
ctgcctacatgaacaaggtggagctggaggccaaggttgatgcactgatggatgagatta

 1.06 Intr +   3668   3763   96  1  0   83   95   193 0.982  19.71
caccaccaggtatgaggatgaaatcaacaagcgtaccactgctgagaatgagtttgtgat
gctgaagaaggtgcgtgtgggtgggagagaaccagcagcctgcagctatgctctctaagc
gtggagctcacttgagtagggtgacggtgtgtgcagtgccaatcatccctgtttccccag
GATGTAGATGCTGCCTACATGAACAAGGTGGAGCTGGAGGCCAAGGTTGATGCACTGATG
GATGAGATTAACTTCATGAAGATGTTCTTTGATGCG
gtaagaaacttatctaaatttttcacatgggtgggttttttttcctcaagttgtcatatc
acaatgagtgaaagatttgaatggaactgacattaaataacacaacacagaaccagatga
ccgactccaaatctccctgcaggagctgtcccagatgcagacgcatgtctctgacacctc

 1.07 Intr +   3906   4070  165  2  0   80   98   314 0.998  31.76
tggatgagattaacttcatgaagatgttctttgatgcggtaagaaacttatctaaatttt
tcacatgggtgggttttttttcctcaagttgtcatatcacaatgagtgaaagatttgaat
ggaactgacattaaataacacaacacagaaccagatgaccgactccaaatctccctgcag
GAGCTGTCCCAGATGCAGACGCATGTCTCTGACACCTCAGTGGTCCTCTCCATGGACAAC
AACCGCAACCTGGACCTGGATAGCATCATCGCTGAGGTCAAGGCCCAGTATGAGGAGATT
GCCAACCGCAGCCGGACAGAAGCCGAGTCCTGGTATCAGACCAAG
gtgggtgctctgatgactgtctcctgagtgaggggacacacccatgtctaggattctagg
ccatgacgacactaagaatggggctcctggaacctgctgaagcccattgggatgctttat
aggaatggctgtgtagacaatttccatctaaacccaaggacctgagaaataatctcactc

 1.08 Intr +   4428   4553  126  2  0   27   91   208 0.997  15.88
ctctgaatctcacaccagtgtctccaggagagcaaaggcactagactagctcagccttgg
aaaagcctgaaccagccccacactatttgccaccacttagtactcactgcctgtgaactt
tgggaaagttcttcccattctcatataaacagaatctttcctctttcctggctgcatcag
TATGAGGAGCTGCAGCAGACAGCTGGCCGGCATGGCGATGACCTCCGCAACACCAAGCAT
GAGATCTCTGAGATGAACCGGATGATCCAGAGGCTGAGAGCCGAGATTGACAATGTCAAG
AAACAG
gtagggtgagattgaaagagggcaaggaaggggcctgagttctaaaagaaacacctactt
tgttttattttgttttacttttgctaaacacacgcagctagatccagtaccagtgtttca
gtgtcctgccacccacgatgtactggtttctctctgggattcatgatagtttggtttgtc

 1.09 Intr +   4803   5023  221  2  2   79  109   591 0.994  57.40
ttgttttacttttgctaaacacacgcagctagatccagtaccagtgtttcagtgtcctgc
cacccacgatgtactggtttctctctgggattcatgatagtttggtttgtctgacccaga
aactcagaaggagacacctcaggtcccaagcaaccccactctcctcctttctatctgtag
TGCGCCAATCTGCAGAACGCCATTGCGGATGCCGAGCAGCGTGGGGAGCTGGCCCTCAAG
GATGCCAGGAACAAGCTGGCCGAGCTGGAGGAGGCCCTGCAGAAGGCCAAGCAGGACATG
GCCCGGCTGCTGCGTGAGTACCAGGAGCTCATGAACACCAAGCTGGCCCTGGACGTGGAG
ATCGCCACTTACCGCAAGCTGCTGGAGGGCGAGGAATGCAG
gtgagtagacagcatgaactcaagatggccttcagctgataaagcgaagctgctctactg
tggggtgtacaacacacatacatgagatcagtgacttgtgcgtgataatgacacatcatc
aacactatttcagtctgactcatggccatatagctgacctcaactcacttttctggtctc

 1.10 Intr +   5828   5862   35  2  2  133   84    41 0.998   6.07
ccctcttcattggaaaatccctctggagagttctcccttcctttaacttaagcagctttt
gggtgtacagactcctggcttatggaatgaactcgaatcatgaggatgggagttagccac
atagactaatgctgtctttttgggagctgttaacccttaattcaatttttcccctttcag
ACTCAGTGGAGAAGGAGTTGGACCAGTCAACATCT
gtaagtagctttgaacagacattaacaacgacaataatatgggatatatttagtgccaac
tcagaattctgctgtttctagatccaaacttttcccatcccagcatatggttatttataa
taatacacttagtaagttgtgggtggtggaggggaaggacagattgggacaggaagcaat

 1.11 Term +   6420   6718  299  1  2  141   48   147 0.972  11.43
cctcttacactcacccacttttttagggaccttaattaaatgacagttcttccgggcctt
gtttgctactctgtaaagggggtccagtagagtgctccaacaccagcagatcaaataaat
gggccatgcaggatcagcctggcagatggtctcactgagtcctccctcctttccctgcag
CTGTTGTCACAAGCAGTGTTTCCTCTGGATATGGCAGTGGCAGTGGCTATGGCGGTGGCC
TCGGTGGAGGTCTTGGCGGCGGCCTCGGTGGAGGTCTTGCCGGAGGTAGCAGTGGAAGCT
ACTACTCCAGCAGCAGTGGGGGTGTCGGCCTAGGTGGTGGGCTCAGTGTGGGGGGCTCTG
GCTTCAGTGCAAGCAGTGGCCGAGGGCTGGGGGTGGGCTTTGGCAGTGGCGGGGGTAGCA
GCTCCAGCGTCAAATTTGTCTCCACCACCTCCTCCTCCCGGAAGAGCTTCAAGAGCTAA
gaacctgctgcaagtcactgccttccaagtgcagcaacccagcccatggagattgcctct
tctaggcagttgctcaagccatgttttatccttttctggagagtagtctagaccaagcca
attgcagaaccacattctttggttcccaggagagccccattcccagcccctggtctcccg

Predicted peptide sequence(s):


>KRT5_1|GENSCAN_predicted_peptide_1|678_aa
XTIHRLRIPFSIALALKEAVGYSIEKAEQVPLLGTGWFAVITGRQTLNMFLMRLEAPAVP
GILFGLLTAPSLYGEDILASSVTNLLPRDQCCKARLFLTEQSLPGRKRLHPTPLCRCDRF
TSVSRSGGGGGGGFGRVSLAGACGVGGYGSRSLYNLGGSKRISISTSGGSFRNRFGAGAG
GGYGFGGGAGSGFGFGGGAGGGFGLGGGAGFGGGFGGPGFPVCPPGGIQEVTVNQSLLTP
LNLQIDPSIQRVRTEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVR
QNLEPLFEQYINNLRRQLDSIVGERGRLDSELRNMQDLVEDFKNKYEDEINKRTTAENEF
VMLKKDVDAAYMNKVELEAKVDALMDEINFMKMFFDAELSQMQTHVSDTSVVLSMDNNRN
LDLDSIIAEVKAQYEEIANRSRTEAESWYQTKYEELQQTAGRHGDDLRNTKHEISEMNRM
IQRLRAEIDNVKKQCANLQNAIADAEQRGELALKDARNKLAELEEALQKAKQDMARLLRE
YQELMNTKLALDVEIATYRKLLEGEECRLSGEGVGPVNISVVTSSVSSGYGSGSGYGGGL
GGGLGGGLGGGLAGGSSGSYYSSSSGGVGLGGGLSVGGSGFSASSGRGLGVGFGSGGGSS
SSVKFVSTTSSSRKSFKS