GENSCAN 1.0	Date run: 15-Jan-104	Time: 14:30:40

Sequence SHMT1_1 : 38068 bp : 49.95% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1498   1639  142  0  1  101   76   186 0.286  17.01
agcgggtctgggactggtggcaccggcggcggcgtaggacggaggcgtcgctaggtacgt
gcgcgggccgtgttccggtaggtgggcggcttcgggtccgagggccgcggcgtcccagag
cccggggggtgcttcggcgtcctcgctgtcccccgccgtaccccaccctctgcagccgca
ATGACGGGAGGGGAGCGCTTGGGTCGCGCCTGGGCTGGGGATCGGGCGGCGGCCAGCCGC
AGCGCCCCAGTTCCAGGCACATTCGGTATGATGGGGTGGCGGCCCGTGGCCTTTGGCAAG
AATGCGAAGGCTTATCGGGGAT
gtgagtagtagctctccactgaacacttttctcgttgagcattttcgatttaaacaatcc
tagctccttcacaacaaagctgcaagtcaagtgctactgtttcccccattttacaggtga
ggaaggtggtggggccctgagaaaggaggcaacttaaatggcagagcccggaatagaacc

 1.02 Intr +   3940   3946    7  2  1   78   69     0 0.040 -10.11
tggccaggctggtctcaaactcctgacctcaagtgatccacccacctcggcctcccaaag
tgctgggattacaggcgtcagccaccgcgcctgaccgcttttttttctttctttagagag
acggggtctccctatgttgcccagggtggtcttgactcttgtggtcctgtgactccccag
ACTTCAG
gtgtgcaagcctgatttctgacccttggccagtcacactttctcctttgtctggcctttg
aacttggggtgctctgcacttgtttactctgcgtagattgctctcatcccagatttcatg
gttggctcctttccacagctcagatttcagctccgggacgttccctgacccagtctaaag

 1.03 Intr +   8743   8857  115  1  1   79   78   151 0.552  13.65
gtcagaagattgaggcaggagaatctcttgaacctgggagacggaggttgcagtgagccg
agatcatgccactgcactctagcctgggcaacaaagactacatctcaaaaataaataaaa
aataaataaaataaataaatttaattctggaagctacacatgtttttcccatttttttag
GCAGCTTCGAACCAGTGCAATGACGATGCCAGTCAACGGGGCCCACAAGGATGCTGACCT
GTGGTCCTCACATGACAAGATGCTGGCACAACCCCTCAAAGACAGTGATGTTGAG
gtgagatttttggggtcttcacagatttttttatgttggaggccttcatttaatctttag
ttctaattacaaattaattagggacagccttgaaatgagtattatcctgctggatttaga
ggtggtggcagacaaaatggctacaaatcctttgagggtaaatttaaagattgctgggtt

 1.04 Intr +  10926  11071  146  2  2   49   87   137 0.971   8.78
attttgtttttgttaattcattgattagtgctttctcagatactaaaacctttgaggagg
attcattctattccataatgtgaaagagtgtcttttctctttaggaggttgtaatttgga
tcagagcaaaattttaaactaaaatttccaaatatcaagcatctgttttcttctcataag
GTTTACAACATCATTAAGAAGGAGAGTAACCGGCAGAGGGTTGGATTGGAGCTGATTGCC
TCGGAGAATTTCGCCAGCCGAGCAGTTTTGGAGGCCCTAGGCTCTTGCTTAAATAACAAA
TACTCTGAGGGGTACCCGGGCCAGAG
gtatgtgaatatcttcaaaggcctggttctgacacagtcatagggagtactcagtcagcc
taggacctaaacaatttggaaatttcagaaaacaataagcttcccagctgtaggatttct
gttccacgcctgctctctgacagaactcttttattcccagcatgtgataacactgaagag

 1.05 Intr +  16303  16418  116  1  2  112   49   164 0.988  14.99
tccagaggcattggtggaacacttgatcatgctggcttcagcatctgcatgttaacaagc
tcttctcattacttctctccactttagtttcactgtcccacccctaagcttctttggcct
ctgtgaagcacagactttctgggtctgagtggatctaagtctcctgctcttctcccacag
ATACTATGGCGGGACTGAGTTTATTGATGAACTGGAGACCCTCTGTCAGAAGCGAGCCCT
GCAGGCCTATAAGCTGGACCCACAGTGCTGGGGGGTCAACGTCCAGCCCTACTCAG
gtgcttgtctcatccatatggcctggtgcagcctcttcatgtgcatcccccagggccgac
atgcaccaagaagagtcctaagatggactgccatggactgccataggccgggcccagtgg
ctcacgcctgtaatcccagcactttgggaggcggaggcaggcagaccacctgaggccggc

 1.06 Intr +  17087  17247  161  0  2  101  121   197 0.999  24.01
tgccctctctctgaacttcacctctcaaggctggtggtgaaggggcacctcctggagctc
agggaaggtacaaggcctgttctcgccactcctgggtgaaggcttagtggccaaggtccc
ggatgccacttggtacctcagttagaaataatcatgtctcctctggtccactcgtttcag
GCTCCCCTGCAAACTTTGCTGTGTACACTGCCCTGGTGGAACCCCATGGGCGCATCATGG
GCCTGGACCTTCCGGATGGGGGCCACCTGACCCATGGGTTCATGACAGACAAGAAGAAAA
TCTCTGCCACGTCCATCTTCTTTGAATCTATGCCCTACAAG
gtaagcatgtgtttctctcctagcttttatttccttgggaaacctctgatgcttggctgt
gcctgcaggggtaagtttggtttatgtgtcactacaggaagtccccacctcgcaccgttt
ttaatttccttgcagcggggactcctcaggttttcaggacccgctggggcagctgggatg

 1.07 Intr +  23930  24011   82  1  1   79   64   180 0.999  13.91
tgtgagccatacttggttgagcaggctgaacacaccagtatccatcctgtctcaccctca
agtcctcatctaggacatcattcctcttgtcatggtgggtggagccagttcaagtgggag
gacaggaggcaggtgtggttggaggaagcagcctgaacctgcctccctgacattccacag
GTGAACCCAGATACTGGCTACATCAACTATGACCAGCTGGAGGAGAACGCACGCCTCTTC
CACCCGAAGCTGATCATCGCAG
gtgatgcgcggggcggaaagtcaccttgttcaccagtagcctggtgcctcttacagaatg
aggatggaaagatgttcgcaccctgttgagcccactgctagtttctgccctggataactg
agtttctggggtctacatatattagtttctttcaggatacagatgtgtgtttgctgcttt

 1.08 Intr +  24488  24700  213  0  0   99  105   263 0.999  27.79
gttagagtggttgcctttagcacagaggtgctttttgagggcattctccatttttgctct
gagactttaagggaagaaatctctgggctttcagatctgcacatctagaggccactgtga
agccaactcagctggccggtggacatctctgatgcagctgtgtcacctttcctgtctcag
GAACCAGCTGCTACTCCCGAAACCTGGAATATGCCCGGCTACGGAAGATTGCAGATGAGA
ACGGGGCGTATCTCATGGCGGACATGGCTCACATCAGCGGGCTGGTGGCGGCTGGCGTGG
TGCCCTCCCCATTTGAACACTGCCATGTGGTGACCACCACCACTCACAAGACCCTGCGAG
GCTGCCGAGCTGGCATGATCTTCTACAGGAAAG
gtgagctcccgaatgttgggtacagatgggtaccatggtttacatttctcctgtgggggt
tcagattcagttgagggagctgggaaaaactctgacctgggatccttggaaaatattaag
atcccagtcccttaaagagctgcttaggaaggagcagtgacaggctttacagaaaggatt

 1.09 Intr +  29068  29184  117  2  0  118   58   101 0.987  10.54
ctggtctgccctcaggaggcaagggtcccctctgctccgctgtgttacaaatcagttctc
catgccaggataaaggccaggcttgtgcttgattcttccctggccagtctgggtttgagc
ctaaaaagaaaaagaggggacagcccctatgatcccactgtgatgtttttcttcctccag
GAGTGAAAAGTGTGGATCCCAAGACTGGCAAAGAGATTCTGTACAACCTGGAGTCTCTTA
TCAATTCTGCTGTGTTCCCTGGCCTGCAGGGAGGTCCCCACAACCACGCCATTGCTG
gtaaaacatcatctgcctctgctcatcctgtagcccctaattcccatcccacctgtcctc
caaaatcgtgtgccaggacttccacgctggggaaaaagctggagggctgtctctgtgcag
gagatgctaacacagagtagggcatggttgagactggcctaaggtagggacagtagggaa

 1.10 Intr +  31455  31577  123  1  0  115   80   110 0.995  13.46
taataaataataaaaaatttttaaaaatatatcattttggaggaaaagataatccaacgc
aaaagcaaggggtgacattgaaaagcaaggaagcttgagtaaaaacaggaatgttgtttg
actcatcatctgtgtatgattctatcctagcctcctgacccattgtctcttcttgtccag
GGGTTGCTGTGGCACTGAAGCAAGCTATGACTCTGGAATTTAAAGTTTATCAACACCAGG
TGGTGGCCAACTGCAGGGCTCTGTCTGAGGCCCTGACGGAGCTGGGCTACAAAATAGTCA
CAG
gtagagacacagatggtgttcagcaggcctgttcttgtggttgtattaaggcttgcttct
cagtttgtgcaaccaggatgtggcccaggctctgctgctgcagctgcaatgatgggccca
ccttgggaggaggtcagcctcgcctccaactggaaagcctccccctgcctcagaccagcc

 1.11 Intr +  34072  34188  117  2  0   77   94    74 0.963   7.34
tttcacattcttaggcacgggcatctctcgtggggattagaatcccaggccatgtggtgg
gggtgctcaccttcctgacctcatctgctttgactgccttcatcctcgccttcgtttctc
agagattccttcacggcatagaattcgcctgcatttctacatgtccaactttgtttccag
GTGGTTCTGACAACCATTTGATCCTTGTGGATCTCCGTTCCAAAGGCACAGATGGTGGAA
GGGCTGAGAAGGTGCTAGAAGCCTGTTCTATTGCCTGCAACAAGAACACCTGTCCAG
gtgagaatcatccttgccttctccttcactcctctcccatctgcatttcttctggccaaa
gttgtagctgatgaatatattggctccggggacagacctcacataaaaactgtagtgggg
gctgggcacggtggctcacgcctgtaatcccggcactttgggaggccgaggcaggcggat

 1.12 Intr +  35355  35465  111  1  0   75   94    65 0.977   6.15
cgtggttgtgcctccttgtgccttgggttctactttagtttctgaatcagttgtactcct
tgctagggacagccagtatcctcaggccagttctcttttgcccacatgtccctttttaaa
tgcaccaccatcacagtggtgacaatgtgacttagggacagagccccgtttatcttgtag
GTGACAGAAGCGCTCTGCGGCCCAGTGGACTGCGGCTGGGGACCCCAGCACTGACGTCCC
GTGGACTTTTGGAAAAAGACTTCCAAAAAGTAGCCCACTTTATTCACAGAG
gtaaggataaaagtttggaagttgccacatcttattgtgtgtatttagtttctgattgtg
atgagtagctgagaccctgccactgaaagacaattctgcaccaaggctgagtgttggggt
gggagacagcaaaaaacacatgtaaatcaccttgtctgagagggttataaatttgggctg

 1.13 Term +  35824  35993  170  2  2   68   43   174 0.777   8.94
tgcccaggacatctgtccaagggtgggtgcatctcagagtaattgaggtaagcatggaga
ccttgtgccctgccacactccccagctgggggacattctgacatttgtgcctcggccccc
agccattgtaccccctttggtgtgtagtgtggggtgactcatttgtgtcttgtggcacag
GGATAGAGCTGACCCTGCAGATCCAGAGCGACACTGGTGTCAGAGCCACCCTGAAAGAGT
TCAAGGAGAGACTGGCAGGGGATAAGTACCAGGCGGCCGTGCAGGCTCTCCGGGAGGAGG
TTGAGAGCTTCGCCTCTCTCTTCCCTCTGCCTGGCCTGCCTGACTTCTAA
aggagcgggcccactctggacccacctggcgccacagaggaagctgcctgccggaggacc
cccacctgagagatggatgagctgctccaaaggggaactgttgacactcgggccctttga
gggggtttcttttggacttttttcatgttttcttcacaaatcaaaatttgtttaagtctc

Predicted peptide sequence(s):


>SHMT1_1|GENSCAN_predicted_peptide_1|539_aa
MTGGERLGRAWAGDRAAASRSAPVPGTFGMMGWRPVAFGKNAKAYRGYFRQLRTSAMTMP
VNGAHKDADLWSSHDKMLAQPLKDSDVEVYNIIKKESNRQRVGLELIASENFASRAVLEA
LGSCLNNKYSEGYPGQRYYGGTEFIDELETLCQKRALQAYKLDPQCWGVNVQPYSGSPAN
FAVYTALVEPHGRIMGLDLPDGGHLTHGFMTDKKKISATSIFFESMPYKVNPDTGYINYD
QLEENARLFHPKLIIAGTSCYSRNLEYARLRKIADENGAYLMADMAHISGLVAAGVVPSP
FEHCHVVTTTTHKTLRGCRAGMIFYRKGVKSVDPKTGKEILYNLESLINSAVFPGLQGGP
HNHAIAGVAVALKQAMTLEFKVYQHQVVANCRALSEALTELGYKIVTGGSDNHLILVDLR
SKGTDGGRAEKVLEACSIACNKNTCPGDRSALRPSGLRLGTPALTSRGLLEKDFQKVAHF
IHRGIELTLQIQSDTGVRATLKEFKERLAGDKYQAAVQALREEVESFASLFPLPGLPDF