GENSCAN 1.0	Date run: 16-Jan-2004	Time: 14:50:42

Sequence SLC22A2_1 : 44569 bp : 41.89% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    262    461  200  1  2   79   72   125 0.389   8.07
gacacaggagccttcattcagaatgaagacccacagatacagggaaaactggatggcagt
atagaactgtagttggacaaaaagggcagcagcccatgttcgcaggctgaggggaaaacc
cagcaaggcctgtctgttcagatccgtcttggcccctctgtgcagcactccttcctccag
GCACCGGGGACAAGACTCCTCTGGAATGCGGGTCTGGATTTCTTTACGGCCCACTGTTAC
ACAGAAAGGCAGCGGGGAAGTTACAGTGGTATTTCTAGGCTTTCTGGCTGGCTTTGGGGA
GAAAAGAGTCTGGTTTCCACGAGCTGCTTTGAGGAAGAAGGATTCTCAGTTCTATGGCTT
GCCCCGGGGGAGAATGATGG
gtgagagaagagacaggagggcaggagaaggtcagagagagagactttgcttctgaggcc
tccaccttggggcatggatttctgagccccaacagaccttgacagaaaaatctaggacac
aaagatagtggcttggacacacctgcctgcatttacacttgacctgtctgcgacgtaaac

 1.02 Intr +   1288   1784  497  2  2   62   94   503 0.440  40.08
accagttataataaacacgacaggcatcctgggagtgagctcagggcatttgggaagtgc
agaaggacatgcacccccgctggaggggtgcacctttgaagtcagctggaccaaggaaag
gccctgccctgaaggctggtcacttgcagaggtaaactcccctctttgacttctggccag
GGTTTGTGCTGAGCTGGCTGCAGCCGCTCTCAGCCTCGCTCCGGGCACGTCGGGCAGCCT
CGGGCCCTCCTGCCTGCAGGATCATGCCCACCACCGTGGACGATGTCCTGGAGCATGGAG
GGGAGTTTCACTTTTTCCAGAAGCAAATGTTTTTCCTCTTGGCTCTGCTCTCGGCTACCT
TCGCGCCCATCTACGTGGGCATCGTCTTCCTGGGCTTCACCCCTGACCACCGCTGCCGGA
GCCCCGGAGTGGCCGAGCTGAGTCTGCGCTGCGGCTGGAGTCCTGCAGAGGAACTGAACT
ACACGGTGCCGGGCCCAGGACCTGCGGGCGAAGCCTCCCCAAGACAGTGTAGGCGCTACG
AGGTGGACTGGAACCAGAGCACCTTCGACTGCGTGGACCCCCTGGCCAGCCTGGACACCA
ACAGGAGCCGCCTGCCACTGGGCCCCTGCCGGGACGGCTGGGTGTACGAGACGCCTGGCT
CGTCCATCGTCACCGAG
gtaagagagtgagctcagggctcagatggagaagcaaatggtggagattttctcaaggaa
gcaagcagggaaggcccggctcttataaaccctgcctggacatctctttggcctgcatgt
atcttacaaaatcctagtttccaaatgagaacatgcagcttctaatttgtctggagaata

 1.03 Intr +   3411   3514  104  2  2   47   96    70 0.857   1.75
agagagagaagaagaagaaaattggcatgatcaagaaacaatggcttattgccacttaac
tatacagttcaattcaaggaatatttaatgagtaccctagctgagttatgtccaacagga
ttctaacaggatttcatttcctttctctcccaacttggaacacttctcccctgctggcag
TTTAACCTGGTATGTGCCAACTCCTGGATGTTGGACCTATTCCAGTCATCAGTGAATGTA
GGATTCTTTATTGGCTCTATGAGTATCGGCTACATAGCAGACAG
gtaggttgaatcacctgtggtggaatttaaacaatcccaaaggtttggagaacacttgga
tgcacctgcccctgcttgaatcccatcctttcccaagcacacagaccacaatctcctggc
cttcatgctggtgtttttacaatcttaaatttcttctgctgcttggataaccatgaccaa

 1.04 Intr +   9426   9580  155  0  2  128   43    91 0.641   7.39
gtgctacctggacctggagtcacagattccctttgtggctatcagtctgtgcctcctgga
ttcataaaatttaaagttaacctctaactttgtttagatttccataacttccatgcattt
tccacaatcttcacaaatcaaaattaattcaattccctctctttgtttttcttcctgcag
GTTTGGCCGTAAGCTCTGCCTCCTAACTACAGTCCTCATAAATGCTGCAGCTGGAGTTCT
CATGGCCATTTCCCCAACCTATACGTGGATGTTAATTTTTCGCTTAATCCAAGGACTGGT
CAGCAAAGCAGGCTGGTTAATAGGCTACATCCTGA
gtaagaatgtttgtgcttgcaactgtgagaacaaagcaacctcgctgccaaaatagagtc
aggtggggggcggcatgcaagaaaagggacccagcttcaagacaatattgagagattttt
ttatttctgtatcaaaagaattcctaaaagcatatcaactcattcagcattcctccttat

 1.05 Intr +  10744  10912  169  2  1   78  119    28 0.870   2.98
aaaacagcaatagttataacaaaataaaaaccaaaaaacacttaaaaatattcagagagt
tgcgtagaatattctagaatttgtctatatttttgaggttggtgttctagtttcctgata
gctggacagccaactcattattgtattgaattgcattaaccattcaaattctctctgcag
TTACAGAATTTGTTGGGCGGAGATATCGGAGAACAGTGGGGATTTTTTACCAAGTTGCCT
ATACAGTTGGGCTCCTGGTGCTAGCTGGGGTGGCTTACGCACTTCCTCACTGGAGGTGGT
TGCAGTTCACAGTTTCTCTGCCCAACTTCTTCTTCTTGCTCTATTACTG
gtaagtccatttggaataaaaaggaaataaatcaaagtattttattctgctaagaagtct
gccttccttaatgcacactatctttctgattctgagattgctttcactctccagacccat
ttcacaaagagcccaattccatgtttcaggaataaaaaccacaagcttcacctctccgtc

 1.06 Intr +  12830  12944  115  2  1   84   99    59 0.898   6.13
tccctgaggagggatttagcatttgagggaggaaagaaccacatatccccctacacgagt
gttattttctttggagatccaactgtattaacatcctcagagaaatctgattataaaaaa
atgggggatggggtaaggaggattcagtaagagttgccctccgctcaccttgtaccctag
GTGCATACCTGAGTCTCCCAGGTGGCTGATCTCCCAGAATAAGAATGCTGAAGCCATGAG
AATCATTAAGCACATCGCAAAGAAAAATGGAAAATCTCTACCCGCCTCCCTTCAG
gtgagccagggccttaagtatcaaatcaggggatggagaaaagggaggctctggtaagat
tcatgcatgccagtgatctcagcagggaaccatggaagcacccacccccttgcattttgg
cttacctgtagcggaatgcaacagagaattttatatgctaagatttgtggttccgtggga

 1.07 Intr +  14740  14822   83  0  2   39   82    77 0.549  -0.28
gtcctttctattccttttcttagcgcctgagacttgaagaggaaactggcaagaaattga
acccttcatttcttgacttggtcagaactcctcagataaggaaacatactatgatattga
tgtacaactggtaaggaatatttttcactttgaaatgcctccaaattgttttaatcacag
TCATCCATAGTAAGGAAGGGAAGAAAACATGACCTCATCCCGGTATTAGCGCTGACGGTG
GGCAGCAATGTGGGTTGTGTCTG
gtaggttttcctgatcgtgtttttcttgggcaaatattaagatcggtgtagcccctaagc
acagatcaaggcaatggcaacctcggcttcacattggaatcacctgggagcttacaaaat
accaaagccaaaacccccagaaatactgatttaattgttctagggtagggcttggaaacc

 1.08 Intr +  16342  16556  215  1  2  112  103   202 0.886  21.54
ccagccacagccagccactgaagtagatcagagattataaaagcagtgggatgggggact
agcaaggagatggtcacagggaaaggtgacctacaatccacattatttttccagaagcct
ctcttttttggtgccctgtgtttggtgcttgacttgacctgaactctcctctttgctcag
GTTCACGAGCTCTGTGCTCTACCAGGGCCTCATCATGCACATGGGCCTTGCAGGTGACAA
TATCTACCTGGATTTCTTCTACTCTGCCCTGGTTGAATTCCCAGCTGCCTTCATGATCAT
CCTCACCATCGACCGCATCGGACGCCGTTACCCTTGGGCTGCATCAAATATGGTTGCAGG
GGCAGCCTGTCTGGCCTCAGTTTTTATACCTGGTG
gtaagtttcaggtgaagttggagtcttatctccaagaccctggagaaagggagtgtcacg
gcccattgataggaaaaccatgtaatttgtcatccaaatcaatttggaaagacaaattgg
agggcagtatctgggacccttctgagcaaattcatatggccagcctccctttaggaaaaa

 1.09 Intr +  17726  17834  109  0  1  100   95     5 0.975   0.82
aggttaagatgaacaaacgtgagaatctgctgacattaacatgagcctatagaagaagtc
atttttctttcttgtcccattctgggatggggaatttgtccttacagtcccactctggag
cactgtctgaagatgaggaatcatctgtgtacggataagtactgttcttttccctcttag
ATCTACAATGGCTAAAAATTATTATCTCATGCTTGGGAAGAATGGGGATCACAATGGCCT
ATGAGATAGTCTGCCTGGTCAATGCTGAGCTGTACCCCACATTCATTAG
gtgagtgcattttccttagaatgacgagggtgggtgagacatttgttcattggctggaac
caccacaagtgcagacaaaggatatcttaccttcaaaacacagtgtaagggaaggtgaga
acacgtaaaggtccagccaggcccagcagttcacctctcacacttacaggctggggtaaa

 1.10 Intr +  18542  18654  113  2  2   84  103    80 0.993   8.20
tgtatgaataagttcttcagtggtgatttctgagattttggtgcacccgtcacctgagca
gtgtacactatagctcttgaacattttaattatttgaggaagtgtttattcaggggtgga
tgggagataactactgcactgtgataagttaacaagttaaatcagtttcctgcattctag
GAATCTTGGCGTCCACATCTGTTCCTCAATGTGTGACATTGGTGGCATCATCACGCCATT
CCTGGTCTACCGGCTCACTAACATCTGGCTTGAGCTCCCGCTGATGGTTTTCG
gtaagaaatcttcacagatgtctttaagtgaaaacttatttttctagagaccttcacttc
tcttctacttcattcaaaacaggtgtcatgcctattagtaactactaattactactcata
gtagtaatgatgtgtctgtggcttgcgttcttcctggatgtgcctagtaattagtaatat

 1.11 Term +  21919  21998   80  2  2   90   41   104 0.934   2.85
tcttgtttgcattacttacaatacaataaccaagtgaatcttgggaggcttggagacatt
ttgagaggttaattatttaataggcacttccactgtatataccatgtgccaggcactatt
ctaagcctttgacaatattaattcatttcagcctcataacaagcctgtctctgttaccag
AAAGGGGTCCCAATCCAGACCCCAAGAGAGGGTTCTTGGATCTCACACAAGAAGGAATTC
AGGGCGAGTTCACAGAGTGA
tgtgaaagcaagtttattaagaaagtaaacaaataaagaatggctgctccataggcaggg
cagccctgagggctgcttgttgcccatttttatggttatttcttaattatatgctaaaca
aagtgtagattatttatgagttccccgtttttagaccatataaggtaacttcctgatgtt

 2.06 PlyA -  22291  22286    6                               1.05
 2.05 Term -  25400  25314   87  2  0   21   37   125 0.848  -2.62
 2.04 Intr -  27161  26699  463  1  1   68   93   158 0.281   6.53
 2.03 Intr -  27515  27370  146  2  2   42   65   131 0.483   4.36
 2.02 Intr -  28871  28453  419  0  2   57   35   201 0.344   4.52
 2.01 Init -  29125  29086   40  1  1   78  110    35 0.570   5.11
 2.00 Prom -  29742  29703   40                              -2.55

 3.01 Sngl +  40654  41874 1221  0  0   51   34  1932 0.843 179.65
tgaaataggcgttattattattaaccccactgacaactggaaaaaaaaaaaaaacaggcc
agagaaagtaaaggacttgctcaaaaccacacagccaatcagtgacaaagccaaagtttc
aatcctggcagctagattttgtggcccatgaatttgtctcaccatcagagggttgggctc
ATGAATGGGACAGTGCGACGAAGCACCCGTCGCAGTGCTTGGTATTTAGTTAGTAGTCAC
TCACCGGTGGCTGATGTTGTTGCTGCTGCTGCTGTTGTTGCTGCTGCTATTGTCATTTTT
CTTGTTGCTGTTGTTGCTGCTATTGCTGTTGTTATTGTTACTGCTGCTATTGTTTTTGCT
GTTGCTATTACTGTTGTTATTGCTATTGTACTTATTTCTACTGTTGTTACTGTTGTTGCT
GCTGCTGTTATAATTATTGCTGCTGTTCCTACTGCTGCTACTGTTGTTGCTGCTGTTGTT
GCTGATGTTGCTGCTGCTGTTGTTACTGCTGGTGTTCGTACTGCTGTTGTTGCTGCTGCT
GTTATTGCTTCTGTTGCTGTTATTGTTGCTGATGCTGCTACTGTTGCTATTGCTGTTGCT
GCTACTATTACTGTTGCTATTGCTGCTGCTACTGCTGCTGGTGTTGCTGCTACTGTTGTT
GCTACTATGGTAATGATTGCTGCTGCTGCTGCTGCTGCTGTTGCTGCTATTACTACTGCT
ACTTCTGTTGCTGCTGCTGCTGTTATTGCTGCGACTTCTGTTGTTGCTACTGCTGATTTT
GCTATTTCTGTGGCTATTGTTGCTGCTGTTGTTGCTCTTAATTGTGATGGTGTTGATTCT
GTTGCTATTGTTGCTGCTATTGCTAAAATTGTTGCTGCTGTTGTTGCTATTGTTGGTTTT
GTTGTTACTGTTTCTGTTGCTTTTGTTATTGTTGCTGCTGCTGTTGCTGTTGCTGCTGCT
GCTGATGCTGTTACTGTTACTATTGCTATTACTGATACTATTGTTGCTGCTGCTGTTGCT
CGTGTTGCTACTGTTGCTATTGCTGTTACTACTGAGGATCCTATTGTTGCTATTGCTGTT
GCTGCTGATGGTATTACTGTTGCCATATCTGTTGCTGCTACTGTTACTGTTGCTGTTGCT
ATTACTATTGTTGCTGCTGTTGCTTTTGTTGTTGTTACTGCTGCTGTTGTCACTGATGCT
TCTACTGTTGCTATTGCAGTTGCTGCTGTAATTACTGTTGTTGCTGCTGTTGCTACTATT
GCTGCTACTGTTGCTGCTGTTCTTGCTTTTGTTGTTCCTGCTGCTGCTTCTATTGCTGCT
ATTTCTACTGTTGTTGTTTCTGCTGCTGTAGTTGTTGTTATTGTTGTTGGCAGAGAGACC
AGAGGAGCTCAACTGACTTGA
ttttacctccccatctggagctttagagaattccaaaaccatagggtgatttgggtaagc
ttctgccatccctcgtctcagattggattggtccagaaaacgagacaatactaaccaact
ccaggtgggggaaattctgtgagatgctgggtggctgggagcagcccacagctgattcaa

 4.02 PlyA -  42908  42903    6                               1.05
 4.01 Term -  44310  44131  180  0  0   71   49   160 0.774   7.03

Predicted peptide sequence(s):


>SLC22A2_1|GENSCAN_predicted_peptide_1|613_aa
XHRGQDSSGMRVWISLRPTVTQKGSGEVTVVFLGFLAGFGEKRVWFPRAALRKKDSQFYG
LPRGRMMGFVLSWLQPLSASLRARRAASGPPACRIMPTTVDDVLEHGGEFHFFQKQMFFL
LALLSATFAPIYVGIVFLGFTPDHRCRSPGVAELSLRCGWSPAEELNYTVPGPGPAGEAS
PRQCRRYEVDWNQSTFDCVDPLASLDTNRSRLPLGPCRDGWVYETPGSSIVTEFNLVCAN
SWMLDLFQSSVNVGFFIGSMSIGYIADRFGRKLCLLTTVLINAAAGVLMAISPTYTWMLI
FRLIQGLVSKAGWLIGYILITEFVGRRYRRTVGIFYQVAYTVGLLVLAGVAYALPHWRWL
QFTVSLPNFFFLLYYWCIPESPRWLISQNKNAEAMRIIKHIAKKNGKSLPASLQSSIVRK
GRKHDLIPVLALTVGSNVGCVWFTSSVLYQGLIMHMGLAGDNIYLDFFYSALVEFPAAFM
IILTIDRIGRRYPWAASNMVAGAACLASVFIPGDLQWLKIIISCLGRMGITMAYEIVCLV
NAELYPTFIRNLGVHICSSMCDIGGIITPFLVYRLTNIWLELPLMVFERGPNPDPKRGFL
DLTQEGIQGEFTE

>SLC22A2_1|GENSCAN_predicted_peptide_2|384_aa
MMDTELWVTLMVEGQDILTKLSASLTIPGLQPHLIAVLLPNPKPPLRLPLVSPNLNPQIW
NTSTPSLATDHMPITIPLKSNHPYPTQRQYLILQHALKGLKPVITHLLQHGLLKPINSAY
NTPILPVLKPDKPYRLVQDLRLINQIVLPIHPEEAGVIHCKGHQKAPDPIAQDNAYAHKV
AKKAAIKRHQIPSLRTMLMLIRLQATSPSLRTSHFLSIVEIYPQGNHFSVFHLLFHYSPG
ISQAPSLLYTSSSGICPAQDWQIDFTHMPRVRKLKYLLVWVDTFTGWVEAFSTGSEKATM
VISSLLSDIIPRFGLPTSIQSDNGPAFISQITQAVSQALGIQWNLHTPYHPQSSGKLKDF
ASSELAALKSFWEEQRKPANAVAQ

>SLC22A2_1|GENSCAN_predicted_peptide_3|406_aa
MNGTVRRSTRRSAWYLVSSHSPVADVVAAAAVVAAAIVIFLVAVVAAIAVVIVTAAIVFA
VAITVVIAIVLISTVVTVVAAAVIIIAAVPTAATVVAAVVADVAAAVVTAGVRTAVVAAA
VIASVAVIVADAATVAIAVAATITVAIAAATAAGVAATVVATMVMIAAAAAAAVAAITTA
TSVAAAAVIAATSVVATADFAISVAIVAAVVALNCDGVDSVAIVAAIAKIVAAVVAIVGF
VVTVSVAFVIVAAAVAVAAAADAVTVTIAITDTIVAAAVARVATVAIAVTTEDPIVAIAV
AADGITVAISVAATVTVAVAITIVAAVAFVVVTAAVVTDASTVAIAVAAVITVVAAVATI
AATVAAVLAFVVPAAASIAAISTVVVSAAVVVVIVVGRETRGAQLT

>SLC22A2_1|GENSCAN_predicted_peptide_4|59_aa
CDPLYRKPGGDGSEKCHFQVSTVDATEESAGGQAYLLSQEAIGNYASLKCDKNQEGDGI