GENSCAN 1.0	Date run: 15-Jan-2004	Time: 19:25:12

Sequence THOP1_1 : 30492 bp : 58.82% C+G : Isochore 4 (57 - 100 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1198   1371  174  2  0   70   86    54 0.181   4.51
gcggctctggaatggtagagcaagaggctccgcccacgcaccgccgagaccaataacgga
ccgggagggttacggcgagtgcgcagagggctccagcgcactcccggccctcctccttta
gctgtgggcggggctcaggggcgcgtgcgtcctgccctccctgtgcgcgcgccgccccag
CATGCCCCGGGAGCGCGGGCGGCGGGCCCCTTGGTCCTCAGGCGGCCGTGGCGGCGGTGG
CGGCGGTTGGGCCGAGGCAGGCGGCCTCAGTGGCCGAGGTGGCTGGACGCGTAGCAGGTG
GAAGGAGGGAGGGAGCCGCAGGCGCAGACCCACCCGCCATGAAGCCCCCCGCAG
gtaccgactaccccgctcgccggacccgggcgtcccctgcaccctcgctcctcccggggt
cgcgacttgggcctaaggctggagcgaagcgacccccgagccaccctcggccgggctgga
ggggcagcgggggaggtggacgaccctgctctctcgtttgccgaatgaatgaaccagctt

 1.02 Intr +   1675   1820  146  2  2   55   83     6 0.248  -2.01
gcagcgggggaggtggacgaccctgctctctcgtttgccgaatgaatgaaccagcttttc
cagcgcttcattcattctgcaaagttgatggcgcttcccttccggatgggtcgggccccg
tggcggaccctggggaggccgtgatttaacgactccagtcccgttttcctggaattctag
AGCCACAGACAGACATTCAAAAATTAAATAAGAGAGGATCGAAGATAGGGAGGGATCGGT
GCTCTGGTGCAGATAAACACGAACGGGGCAGTCCCAGGGACACGTTTTTAGAGAGAGGGT
GGTTCCTGGGGGGCTTTTCTGAAGAG
gtgagcgtcgacttgcttgattgggagaagcaagtgtcctgccaagaccggggaaggagg
cgtcccgggaggagggaacgtgtttggagagcagaaagtcagtttggctggaacgtaacg
aggggaccttttaattttgttatttatttactttgtctcccaagcgggagtgcagtggtg

 1.03 Intr +   3381   3597  217  2  1   72   78    66 0.276   3.30
tgagcccctgaagctcttctttaccccacggtttaatgtagatccacagtcccttttttg
cagttctaaaatcctaaatactctgtaaactgaaaggtttgccttaagtttacagcaaag
tcatttggccagaaaatgacctcaactgaccgactgagagcctttctttttcccacttag
CGAATATCATTGATTCTCAGCTGGGGTGATTGTGCTACCTCTGGGGACACTGGACGGTGT
CTGGGGACATTTGTGGTTGTCACAGTTGGGGGGCGCTCCTGGTATGGAATGGATGGAGGC
CGGGGACGCTGCTCAGCACCCTGCAGTGCCCAGGACAGCTCCACCCCAGAAAACGATCCG
GCCCCAATGGCCACAGTGCAGACAGAGAGACACCCTG
gtgcgtattcctgcatctcactgcagaaagagtaatttgatttcggagtgctgcctcagc
ctccctagtgatttggaatacatggcacttggactctattaatactgttctacaatgggg
ggaaattctgaatgttaaaatgtccccggccccaagactttttggacaggagagcatggt

 1.04 Intr +   4920   4982   63  1  0   87   37    58 0.117   0.43
caccgtacctggccaggcctgcctatttttgtacacccctggaactgaagatgatttttg
gatgagacagaaaccatatgtggcccacaaagctgaaaatccttggccctttgcagaact
gtttgttggccccctgggtcctctgcagaactgtttgctggtcccctgggtcctctgcag
AACTGTTTGCTGGCCCCCTGGACCCTCTGCAGAACTGTTTGCTGGCCCCCTGGGCCCTGC
ACG
gttaggtgtttccaggtggttttgccaccttctctgtgccgagccgcactgacttcacat
agtcgcccttcccgcagctgctctcgctgccaggagcccccgcctgccattcttgttttc
aggctccatttacccgtttctcagagaggccctggtgtgagtggccgggccacatatgac

 1.05 Intr +   6114   6326  213  1  0   51  111   467 0.157  45.21
atggcagagtagcctcctgctctagaggactggtcagtctgtgcttccctacaagagccc
tggacctcactcttctggatgagaagcgggtgatcccttcctttccctctgccctcctga
ctttgaccctaactgaaccgaaagcagacccgcccggcactgggttttgtttctgcgtag
CCTGTGCAGGAGACATGGCGGACGCAGCATCTCCGTGCTCTGTGGTAAACGACCTGCGGT
GGGACCTGAGTGCCCAGCAGATAGAGGAGCGCACCAGGGAGCTCATCGAGCAGACCAAGC
GCGTGTATGACCAGGTTGGCACCCAGGAGTTTGAGGACGTGTCCTACGAGAGCACGCTCA
AGGCGCTGGCCGATGTGGAGGTCACCTACACAG
gtaagtcccaggcagggtctgtgcgtgggccgcaggtgccgagggaggtggcaccgcagg
cgggagtagcccagcccgtggagccggttcagaacctggcttgaagtgtcaccagcaccc
tgcccctgagcacagggcagcgccacctgctcccaggctggtcgtggtggctaaatcaga

 1.06 Intr +  10457  10605  149  0  2   91   94   231 0.996  25.14
gggcccagcctggaggggacggtggcgtgtgcctctagtcccagctacttgggaggctga
ggcaggagagagttcacggggggattcagcagacaggcctgggagaggggtccgggcaac
acctgccagttgtactggggcctgttctcacacctgtctccctggtctcccctgttttag
TTCAGAGGAATATCCTTGACTTCCCCCAGCATGTTTCCCCCTCCAAGGACATCCGGACAG
CCAGCACAGAGGCCGACAAGAAGCTCTCTGAGTTCGACGTGGAGATGAGCATGAGGGAGG
ACGTGTACCAGAGGATCGTGTGGCTCCAG
gtgagggggccctgcggggagtgcaaatagcctcccaagtaactaggattacaggcgccc
gccaccacacccggctaatttttgtatttttagtagagacggggtttcaacgtgttggtc
aggctagtctcgaactcctgacctcaagtgatccgcccacctcggcctcccacagtgctg

 1.07 Intr +  11774  11881  108  1  0   66   81    69 0.936   5.65
cgaacgtttgcacaggttttatgtagagacaggtcgtcatttctctgagataaatgccca
ggagtgcagttgccaagtcacacgggggccggatgatcaggtttttaagaaactgccaaa
ctgctctttggagcggctgcactggcccacgtcctacatgctttgatgtttttattccag
GAGAAAGTTCAGAAGGACTCACTGAGGCCCGAGGCTGCGCGGTACCTGGAGCGGCTAATC
AAGCTGGGCCGGAGAAATGGGCTTCACCTCCCCAGAGAGACTCAGGAA
gtgagtgctgggtgtagggagtgctgggcgtgggcaatggtcgatcccggggagtgctgg
gcacagggagtgctcagtcccagggagtggtgggagcaagaagcgcagggcagggggagt
gctgggctcggggagtgctcggctcagggagtgcttggggcatcaggagtgctgggttca

 1.08 Intr +  15382  15484  103  0  1  121   63   275 0.999  29.21
cctccgtgtctgtctgtcctccgcctcccgtctcactctttcctcctcggttgcctcttc
tttcggcctggttcttggggaaatccctcagccctcggcgagagggttcccaggcgttgc
gtctccccgcggagcccgccccggtctctccctcccctcacgccccgcctttctctccag
AACATCAAACGCATCAAGAAGAAGCTGAGCCTTCTGTGCATCGACTTCAACAAGAACCTG
AACGAGGACACGACCTTCCTGCCCTTCACGCTCCAGGAGCTAG
gtaggggccgagcagtggggcacgggtggccatcggtcctgggactcagtgcaggcctgc
caggccctcgggggccgccgcttcctcctgtgggcttctgtgctgaagggcggcgggagg
ccgaagtacgtggacctgaccgccctgggctgaggactttcctcaccggaagcgctccag

 1.09 Term +  15534  15637  104  1  2  -10   55   132 0.989  -0.53
tcccctcacgccccgcctttctctccagaacatcaaacgcatcaagaagaagctgagcct
tctgtgcatcgacttcaacaagaacctgaacgaggacacgaccttcctgcccttcacgct
ccaggagctaggtaggggccgagcagtggggcacgggtggccatcggtcctgggactcag
TGCAGGCCTGCCAGGCCCTCGGGGGCCGCCGCTTCCTCCTGTGGGCTTCTGTGCTGAAGG
GCGGCGGGAGGCCGAAGTACGTGGACCTGACCGCCCTGGGCTGA
ggactttcctcaccggaagcgctccagggccgtttacagccaaaagctctgagccctcct
tggggagggccatggtctgtcccaccaagattccaccctgatagaaaatgtccctggcag
tggcggaagcccaggccggacggggtggccaagggtgccagtctcagccatggccctgag

 2.02 PlyA -  15894  15889    6                               1.05
 2.01 Sngl -  17173  16928  246  1  0   66   43   270 0.388  15.28
 2.00 Prom -  18579  18540   40                              -5.25

 3.01 Init +  19314  19365   52  2  1   96   36    20 0.468  -1.00
ccattgcagcctgggcaacagagcaagaccctttttctaaaaaaataaaaacttccagga
gtccactttgcagcttctcttccagtccagcgctgctgggccccccagctgtctcccttg
ctgtcctttaactcccgtctccgaccgccagccgtcactcctgggtgacacactgtcgag
ATGGAGAGCTGCTGTTTGAAGGCCCGTGGTCCTTGGAAAGGACGATGTGCAG
gtcccgggaggcctccccctcatgctgcagtcactcggcagacaattgccaggcccacgc
tggtgctggggtacaaacgggaaggcgggggctctgccggccacaggactctcagctccc
tcatcaccacccacacccacgggctggcctcccaaaatttcccccatctgcctctggccc

 3.02 Intr +  20709  20869  161  1  2   98   78   257 0.964  26.60
tgatggggttagatgggggatgtaagaattgcagcgggtggtggccaggctgcagctgct
cgctgagggcccccttgtagctttccaggcagaggggaagcccaccccttccctcacacg
ctgtgtgcaggctgtgggcccatcagtcctgtcaaagccaccctggttctgtccccatag
GAGGGCTCCCCGAGGACTTTCTGAACTCCCTGGAGAAGATGGAGGACGGCAAGTTGAAGG
TCACCCTCAAGTACCCCCATTACTTCCCCCTCCTGAAGAAATGCCACGTGCCTGAGACCA
GGAGGAAAGTGGAGGAGGCCTTCAACTGCCGGTGCAAGGAG
gtgagaaggcacggccagggggccccagaacagtgggtctggagcttgtggggcccgtct
gctccatgtgtgtgaggcacctccaggctttgcacttggatggcctcccaatctccctgg
ccaggcccagtggccagcagaggcctccctgtggggctctgccgtgccctggaggtgtct

 3.03 Intr +  22215  22379  165  2  0   61   39    72 0.607   0.95
tggcggtcggcctggcccctatttggctggctgctctgcacgcacctgctggttacagct
gggtttcctgtggacccagctggtgggcacatctgtcctgaacaggcctctgtttctcct
cgtaaacgcctcacgtggctttcatgcgtcctcccccacccccactggcttggttggcag
TGCCAGTCCTCAGAAGTGAGGCTGTGTGCATGTCATTTGCTGGTGGCGATAGGCAGAGCC
CGTGAGACTGGGAACCAAGGCCTGGACATGGAGGTGTCCCCACATGCAGCAGTGATGCCC
TCGGAGGCTCCTCGGACAGCCCCTGGTCCCCATACCACGTTCCGT
gtgaggccttagagtcatctgcaccccttctggttccaaaaaggccttgggattatgata
acgctggtggcatgaccagaggagttgtgtagtaacacaggccgagaggggcagggaccg
gaggtcagacggagctggatatgagcctttggccctgggacagggcgtgctggccctgga

 3.04 Intr +  22610  22745  136  1  1   77   36   275 0.989  22.52
gattatgataacgctggtggcatgaccagaggagttgtgtagtaacacaggccgagaggg
gcagggaccggaggtcagacggagctggatatgagcctttggccctgggacagggcgtgc
tggccctggaggtggagggagtggcgcccctgggtgacagtgtggtggtgtcggttgcag
GAGAACTGCGCTATCCTCAAGGAGCTGGTGACGCTGCGGGCCCAGAAGTCCCGCCTGCTG
GGGTTCCACACGCACGCCGACTATGTCCTGGAGATGAACATGGCCAAGACCAGCCAGACC
GTGGCCACCTTCCTAG
gtagcccttccttcctcctccactggggtccctgtggggaaggttctcagggtcacccca
ggggacagtcggtgctacccctagagccttgtcctttccctccatgaagagggacttctg
acgcctgccctggctccctcactccccccgagggacccttacagaacagggaagcgcccg

 3.05 Intr +  23135  23501  367  0  1   97   64   797 0.937  73.70
cggccccgccgccccctcaccctgagcgtgaggatgctcggcccctcgaggggttgccgg
gaactgcgggtcctcccttccaaatggggagcctgacgaagtcgcggccgcgggcgggcg
agcccagagccatggaggagcccgcgaggcgagaggcccacctttctgccctccccgcag
ATGAGCTGGCGCAGAAGCTGAAGCCCCTGGGGGAGCAGGAGCGTGCGGTGATTCTGGAGC
TGAAGCGTGCGGAGTGCGAGCGCCGGGGCCTGCCCTTCGACGGCCGCATCCGTGCCTGGG
ACATGCGCTACTACATGAACCAGGTGGAGGAGACGCGCTACTGCGTGGACCAGAACCTGC
TCAAGGAGTACTTCCCCGTGCAGGTGGTCACGCACGGGCTGCTGGGCATCTACCAGGAGC
TCCTGGGGCTGGCCTTCCACCACGAGGAGGGCGCCAGTGCCTGGCATGAGGACGTGCGGC
TCTACACCGCGAGGGACGCGGCCTCGGGGGAGGTGGTCGGCAAGTTCTACCTGGACCTGT
ACCCGCG
gtgggtgagggcagcgggggcggggggcgcaccccggccctgggtctcctcggagcccgg
tgtgcccgtctgagatgggaaggaagtcgttgcctgctccatggggcctcggggatgaac
cccgagacgtagcacccgtgggcacaccacaggggccgaaaatgcagctgtccccgtttc

 3.06 Intr +  23936  24137  202  2  1  101   94   425 0.994  44.90
gggcctggccgtggttttcatgctgccctgcgccaagccacctctgctgagagggaggtt
tagaacctcagaggtgcccggcggtctgggttagcagtcaggtgggctgagggatgcgga
gtcagggactcttgcggtgtctctagtccctgcggggcctgacgctgcctccctccccag
GGAAGGAAAGTACGGGCACGCGGCCTGCTTTGGCCTGCAGCCCGGCTGCCTGCGGCAGGA
TGGGAGCCGCCAGATCGCCATCGCGGCCATGGTGGCCAACTTCACCAAGCCCACAGCCGA
CGCGCCCTCGCTGCTGCAGCATGACGAGGTGGAGACCTACTTCCATGAGTTTGGCCACGT
GATGCACCAGCTCTGCTCCCAG
gtgggtgcgggcccgggcaggggcaggggcaggggcaggggcaggggctgcctgtggtca
gcgaggcccaagcctggggcttcggcttccggctctctctgcacccgtccggtggctcct
tcatccaatgccacccaaagatggtgactccctgtcatgcccgtgtcctggggctgcccc

 3.07 Intr +  25997  26183  187  1  1   86   55   404 0.999  37.37
agcagcagcccaggggcctccgggtcccggtcgttttccagttttggggtgggagggccg
ggcccagcgcatcacctgtgcactcgccctgtttgcactgtggcagctgacacgggccag
gccggcgggaacgggctaggcaggacctgggcactctgaggctctgccccatccctgcag
GCGGAGTTCGCCATGTTCAGCGGGACCCACGTGGAGCGGGACTTTGTGGAGGCGCCGTCG
CAGATGCTGGAGAACTGGGTGTGGGAGCAGGAGCCGCTGCTGCGGATGTCGCGGCACTAC
CGCACAGGCAGCGCCGTGCCCCGGGAGCTCCTGGAGAAGCTCATTGAGTCCCGGCAGGCC
AACACAG
gtgcacccgccccgtccggggaagggtgctaacctcggggggcggcacacagctggggcc
tggcatgtgccccgggtgggggtcggagctctgggcagagtgggccggggcaggtgcaga
ggccagctctctccttccctcccgcccaggcctcttcaacctgcgccagatcgtcctcgc

 3.08 Intr +  26333  26461  129  0  0  102   68   288 0.999  30.04
aagctcattgagtcccggcaggccaacacaggtgcacccgccccgtccggggaagggtgc
taacctcggggggcggcacacagctggggcctggcatgtgccccgggtgggggtcggagc
tctgggcagagtgggccggggcaggtgcagaggccagctctctccttccctcccgcccag
GCCTCTTCAACCTGCGCCAGATCGTCCTCGCCAAGGTGGACCAGGCCCTGCACACGCAGA
CGGACGCAGACCCCGCCGAGGAGTATGCGCGGCTCTGCCAGGAGATCCTCGGGGTCCCGG
CCACGCCAG
gtagccacccttgagccgggcacaccctggaatttggagcctcagctgcttccctgggaa
ggtgaagggagttggtccccatgcttcacttagcaagtgcaggagtccctgctggggagc
cacggagcgcgtcccaggctcctcgggggacagccccagctgagatgcagctgcaggccc

 3.09 Intr +  27291  27470  180  1  0   96  107   255 0.746  29.24
ccctcgctttgtggctgactctggttctagaagtatccagctggtctcggccctcaccgt
gatcctccagcttctccctgttccgggcaggggtctcaggagtctggctggttgtcttct
cctggaggaggagcatggggtgggggctacagcgtgaaccctgccatgtgtccgccccag
GAACCAACATGCCTGCAACCTTCGGCCATCTGGCAGGTGGCTACGACGCCCAGTACTACG
GGTACCTGTGGAGCGAGGTGTATTCCATGGACATGTTCCACACGCGCTTCAAGCAGGAGG
GTGTCCTGAACAGCAAGGTACGCGGGGACTGGGGACAGGGAGGGCGTCCTGAACGGCAAG

gtacgcggggactggggacagggagggcgtcctgaatggcaaggtacgcggggactgggg
acagggagggcgtcctgaacggcaaggtacgcggggactggggacagggagggcgtcctc
aacagcagggtggggccgagaaaagcacacctttggcctgggcctgggtgtgccgggtga

 3.10 Intr +  27720  27924  205  1  1   83   58    94 0.707   5.37
gcgtcctgaacggcaaggtacgcggggactggggacagggagggcgtcctcaacagcagg
gtggggccgagaaaagcacacctttggcctgggcctgggtgtgccgggtgagccatcccg
gccgaggcacctgctggtcttggtgcggtgctgcccacagctcctccctgggccctgcag
GAAGCTCCACACTGGCCCCTCAGGCCTCGGGCTGTGAGTGCTGGGAGCTCCGTGTCTTCC
CATACGAGTCCTTCCGGGATCTGCGTGATCGGCCTCAGAGCCATGGGCTGAGACAGACCC
TTCTCTGGAGAAGGCTTGGCGTTTGCCATTCATTTGCTCCTCCAACCTCCAGTTTCCTCA
CTATGAAGTGCGGCCGTTACGCCAT
gtgagcctgtgcctcccgctgacttggtgtgacccaggctcgggacagccgccagagtcc
ctcacaaggaacaggtggctaccagctttgagggccggccctgccctgcctgggtgcaac
attgcactgggcagcctgcaggtacctcggagccaccaagcaggcactcagctgctccct

 3.11 Intr +  28789  28960  172  1  1   66   46   202 0.015  14.94
gcacacccccgctctgagattctgatgaaaggcacctgcgctctcccacctgtgctgatg
caggcggcccccgggcctccctgggaacctggaaggggttgctccgaggcccccctgggc
gagggtgtgcagactcctggggtcctctgccttcctccctgggtccccacccggccacag
TGCCCTGTCTCCTCGGCAGGTTGGCATGGATTACAGAAGCTGCATCCTGAGACCCGGCGG
TTCCGAGGATGCCAGCGCCATGCTGAGGCGCTTCCTGGGCCGTGACCCCAAGCAGGACGC
CTTCCTCCTGAGCAAGGGGCTGCAGGTCGGGGGCTGCGAGCCCGAGCCGCAG
gtctgctgaggcctggcactgcgactgcccagtctggcctgcgctcccgccgccctggtg
ccttagcccccggcacaggatggggcaggctctggcacagtgcctgggactggcagggtg
gctgagcggctgtcttgcctcttgtcattgtctgtccccacccggtcgtggcccacccgg

Predicted peptide sequence(s):


>THOP1_1|GENSCAN_predicted_peptide_1|425_aa
XCPGSAGGGPLGPQAAVAAVAAVGPRQAASVAEVAGRVAGGRREGAAGADPPAMKPPAEP
QTDIQKLNKRGSKIGRDRCSGADKHERGSPRDTFLERGWFLGGFSEERISLILSWGDCAT
SGDTGRCLGTFVVVTVGGRSWYGMDGGRGRCSAPCSAQDSSTPENDPAPMATVQTERHPE
LFAGPLDPLQNCLLAPWALHACAGDMADAASPCSVVNDLRWDLSAQQIEERTRELIEQTK
RVYDQVGTQEFEDVSYESTLKALADVEVTYTVQRNILDFPQHVSPSKDIRTASTEADKKL
SEFDVEMSMREDVYQRIVWLQEKVQKDSLRPEAARYLERLIKLGRRNGLHLPRETQENIK
RIKKKLSLLCIDFNKNLNEDTTFLPFTLQELVQACQALGGRRFLLWASVLKGGGRPKYVD
LTALG

>THOP1_1|GENSCAN_predicted_peptide_2|81_aa
MTVSPGHMGLKPPARETPQARQMERHPKELDTKIHKHSSLQDGKREKKQAAETPTRTKLA
ETGWNPHEPAESVQNQLADAN

>THOP1_1|GENSCAN_predicted_peptide_3|652_aa
MESCCLKARGPWKGRCAGGLPEDFLNSLEKMEDGKLKVTLKYPHYFPLLKKCHVPETRRK
VEEAFNCRCKECQSSEVRLCACHLLVAIGRARETGNQGLDMEVSPHAAVMPSEAPRTAPG
PHTTFRENCAILKELVTLRAQKSRLLGFHTHADYVLEMNMAKTSQTVATFLDELAQKLKP
LGEQERAVILELKRAECERRGLPFDGRIRAWDMRYYMNQVEETRYCVDQNLLKEYFPVQV
VTHGLLGIYQELLGLAFHHEEGASAWHEDVRLYTARDAASGEVVGKFYLDLYPREGKYGH
AACFGLQPGCLRQDGSRQIAIAAMVANFTKPTADAPSLLQHDEVETYFHEFGHVMHQLCS
QAEFAMFSGTHVERDFVEAPSQMLENWVWEQEPLLRMSRHYRTGSAVPRELLEKLIESRQ
ANTGLFNLRQIVLAKVDQALHTQTDADPAEEYARLCQEILGVPATPGTNMPATFGHLAGG
YDAQYYGYLWSEVYSMDMFHTRFKQEGVLNSKVRGDWGQGGRPERQGSSTLAPQASGCEC
WELRVFPYESFRDLRDRPQSHGLRQTLLWRRLGVCHSFAPPTSSFLTMKCGRYAIALSPR
QVGMDYRSCILRPGGSEDASAMLRRFLGRDPKQDAFLLSKGLQVGGCEPEPQ