GENSCAN 1.0	Date run: 17-Jan-104	Time: 04:54:49

Sequence ARX_1 : 13962 bp : 54.85% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    510    596   87  2  0   56   66    97 0.352   4.96
ccagagtttaaaacggagaggagagcggttgggggacgctacaaacaagagagacacagt
cgggctcagggtgaagatagacagcgcaatgccttgggccgcaagagaaaagcgaactcc
cagacggcagatgtttgcttggggcggtcaccctggccccccgacgcccgtccggcctag
AGAGGACGGCAATTGAGCGCCAGGGAGGTCAAGGCGGTGTCCTGGGAGCCCAATATCGGC
TTTGAAAAGTGCCGAGTGAGTCTGAAC
gtgagcctcgaggcctcgcctccagggagcgaggtcctcgggttggtggaaggggcgcac
gccacccagatggcattcacaggcctggcatcccaataagctagaacttcggccaacact
aaagggtcaaaggaggagctgcaggaagaggatagcggacttagaaaattggtaacttaa

 1.02 Intr +    856   1396  541  0  1   55  116   340 0.821  26.58
caggcctggcatcccaataagctagaacttcggccaacactaaagggtcaaaggaggagc
tgcaggaagaggatagcggacttagaaaattggtaacttaaaaaaagaaagaaagaaaag
gaaaattggctctggtgcgtgcccgctgcccccacccccgcctgcccctctggaatccag
TCCGGGCTTTGCGCCGCGCCCACAGGCCGACGCAGCCCGGCCTCTGGCGAGAGCCAATCA
GAGGGCGCCTCTCAGCACGTGGAGGAGAGAGACTCCAGAGCTCAGCGCCCGCTGCTCACT
ACACTTGTTACCGCTTGTCCTGAGCGCGGAGAGGGCGAGCTCGGGCCGCGGGCAGGGCGG
GAGCCGGCAGCCGGCAACCAAGGGAGGCAGAAAGGCACAAAGATCGCAATAATATCCGTT
ATAACCCGCTATCTAACCCCACCCCCAACACACACCCATCCATCCCACCCTCCGGGAGAG
GCAGCCGGCGATCCGCTCTCTGCGCCCTGGGAAAAAGCCCCAGCCATGAGCAATCAGTAC
CAGGAGGAGGGCTGCTCCGAGAGGCCCGAGTGCAAAAGTAAATCTCCAACTTTGCTCTCC
TCCTACTGCATCGACAGCATCCTGGGCCGGAGGAGCCCGTGCAAAATGCGGTTGCTGGGA
GCCGCGCAGAGCTTGCCTGCTCCGCTGACCAGCCGCGCCGACCCGGAAAAGGCCGTGCAA
G
gtaaggatgctcccgtcaggcacttactaagggcattgggccctgatttggatgtttggt
gttcgggggccagtggcctggaattgtcaatttggagaggaaggaaggagagggcataac
tctgaggcctgctgcctcaaagagcgttaaacatctagccagggctgtctctccctccct

 1.03 Intr +   3140   4016  877  0  1  112   85  1844 0.969 177.57
aagaaaaagaaaaagcaagaaaagtggagaaagaggccaaggcgtcgaagtctggtggtg
cgcggtccccgcacgcctgggcctaggcactgggggagcaactgcgggcgggccccggca
gcagccctggctgggactccccggccggccggctggctgatagctctcccttgcccgcag
GCTCCCCTAAGAGCAGCAGCGCCCCGTTCGAGGCCGAGCTGCACCTGCCGCCCAAGCTGC
GGCGCCTGTACGGCCCGGGCGGGGGCCGCCTCCTTCAGGGTGCGGCAGCGGCGGCGGCGG
CGGCGGCGGCGGCGGCGGCAGCGGCCGCCACGGCCACGGCGGGTCCACGCGGGGAGGCCC
CTCCGCCGCCACCGCCAACCGCGCGGCCCGGGGAACGGCCGGACGGCGCAGGGGCCGCCG
CGGCAGCCGCGGCCGCGGCCGCCGCGGCCTGGGACACGCTCAAGATCAGCCAGGCGCCGC
AGGTGAGCATCAGCCGCAGCAAGTCGTACCGCGAGAACGGGGCGCCCTTCGTGCCGCCGC
CGCCCGCGCTGGACGAGCTGGGCGGCCCGGGGGGCGTCACGCACCCGGAGGAGCGCCTCG
GCGTGGCCGGCGGCCCGGGCAGCGCCCCGGCTGCGGGTGGTGGCACCGGCACCGAGGACG
ACGAGGAGGAGCTGCTGGAGGACGAAGAAGATGAGGACGAGGAAGAGGAACTGCTGGAGG
ACGACGAGGAGGAGCTGCTGGAGGACGACGCCCGCGCGCTGCTCAAGGAGCCCCGGCGCT
GTCCTGTGGCCGCCACTGGCGCCGTGGCCGCAGCAGCTGCCGCTGCAGTGGCCACAGAGG
GCGGGGAGCTGTCACCCAAGGAGGAGCTGCTGCTGCACCCGGAAGACGCTGAGGGCAAGG
ACGGCGAGGACAGCGTGTGCCTCTCTGCGGGCAGCGACTCGGAGGAGGGGCTGCTGAAAC
GCAAACAGAGGCGCTACCGCACCACGTTCACCAGCTACCAGCTGGAGGAACTGGAGCGGG
CCTTCCAGAAGACGCACTACCCGGACGTCTTCACCAG
gtatgcgcgtagggtggtcgcggtcgcagcggcagagagcgcacgcgggccccagggagg
gacagcgggctggcaggggccccggcctcggggcggacgcttggctcctggactctgtgg
aagagcgccctcccaacacacccacccttgctcacccaaactacccactcccccgacccc

 1.04 Intr +   6633   6678   46  0  1  111   99    48 0.970   7.10
tcagtcaaatagttggcactccagggggtgatggaatggtaatgcccatatgtttggggg
tggggggcaggcagccaggggagaccctggtggagtaggcctgccatagaggaggaaata
gctgagagggcattgctggggcctgcagtgacctcctgtctgtgtgttgctttcttatag
GGAGGAACTGGCCATGAGGCTGGACTTGACCGAGGCCCGAGTCCAG
gtgagctgcacaacagagggaagagggagggaggagggctgggggccagtgggagagaga
gagatgggttggtggcgggggggtgcggggggtgagatccccttcacaaaaccaagagaa
gcaggatcagaaaagtagacccaagcactctctcacacacaaagatatttaaaacttaag

 1.05 Intr +   9499   9827  329  0  2  128   97   272 0.992  27.15
cactgctcatttggccccagacgcgtccgaaaacaacctgaggccaaagaggggcaggtg
ggcgcgggccgcggtggaaaggaagggggccccgcagcgcgccaagggaagggacgggta
ggggcccggtgcgcccgccgggccgagccgcgccgaccctgggctctctctgccttgcag
GTCTGGTTCCAGAACCGTCGGGCCAAGTGGCGCAAGCGGGAGAAGGCAGGCGCGCAGACC
CACCCCCCTGGGCTGCCCTTCCCGGGGCCGCTCTCCGCCACCCACCCGCTCAGCCCCTAC
CTGGACGCCAGCCCCTTCCCTCCGCACCACCCGGCGCTCGACTCCGCTTGGACTGCCGCT
GCCGCCGCCGCCGCCGCCGCCTTCCCGAGCCTACCTCCGCCTCCGGGCTCGGCCAGCCTG
CCGCCCAGCGGGGCGCCGCTGGGCCTGAGCACTTTCCTCGGAGCGGCAGTGTTCCGACAC
CCAGCTTTCATCAGCCCGGCATTCGGCAG
gtaacgcgcagcctcggaagtctgtctgtctgtctgtctctcatacacacagaggctggg
ggcagggaggaggcaggagtcaaacagggttacacacatctttttctttgacccaacctc
agagactgtagccaaagaaaatactctctaattgagcctcaaaaaaaaaaatgtgggtct

 1.06 Term +  10311  10440  130  0  1   85   39    79 0.520   0.66
tggtaagcgagcaaatatgtacaggtgtagggtacagagcagggacgactttttaagttt
tgcaaaagttcttccttatgccccccaacctggcagcctgaggaacacagccctgatccc
catgggagccctggagccaccaggccctctgggtgcacagaattctttcttctgccccag
ACTCCATCTTCCTTGTCACTCAAGTGACCCGCTGCCAACGACACCCCCATATGTGCAAGC
CCCGATCTACACGGCAGTACAAGACAACGAAATAAAAGAAACAGTTCATCTTATAACTCC
CTTTCTTTAA
tgaacatcgaatgaggaaataaaaagtctgtataatcacacagacaaagaaacaaaggag
ggatgtattgtgttagcgcagtgaaaataaaactggatgcagcttcaatgatcactcttt
gggagaacaaagaaataatagattcatccaactgcgcagctgcttaacaaaaaatcatct

 2.01 Sngl +  12041  12268  228  1  0   93   43   415 0.762  30.92
ggtccagcttccgcccgcgccctctgcccacttccccgccgcggcccatccgggtcccgt
gggtaccgcgcccctcagctgccccggcgacaagcggcgggagacgctgcccgaggcgcc
gcgcacagctcccgaggccatgaccgcgctgtttgctctccctgcaggctcttttccaca
ATGGCCCCCCTGACCAGCGCGTCGACCGCGGCCGCGCTCCTGAGACAGCCCACACCCGCC
GTGGAGGGCGCAGTGGCATCGGGCGCCCTGGCCGACCCGGCCACGGCGGCCGCAGACAGA
CGCGCCTCTAGCATAGCCGCGCTGAGGCTCAAGGCCAAGGAGCACGCGGCGCAGCTCACG
CAGCTCAACATCCTGCCGGGCACCAGCACGGGCAAGGAGGTGTGCTAA
aggctgccctccacacccgcgccccgcgcgcgccccgaaaggtcacctcactcagcacca
ctcaagaccaaatggaaacagaggaccagcacactcccgagacggcactgagagagcgca
gccgccttcacagcagtctggatgcgggcatggcagccctcggcgctccgggacgtggca

 3.02 PlyA -  13095  13090    6                               1.05
 3.01 Sngl -  13730  13482  249  2  0  105   47   168 0.945   7.79

Predicted peptide sequence(s):


>ARX_1|GENSCAN_predicted_peptide_1|669_aa
RGRQLSAREVKAVSWEPNIGFEKCRVSLNSGLCAAPTGRRSPASGESQSEGASQHVEERD
SRAQRPLLTTLVTACPERGEGELGPRAGREPAAGNQGRQKGTKIAIISVITRYLTPPPTH
THPSHPPGEAAGDPLSAPWEKAPAMSNQYQEEGCSERPECKSKSPTLLSSYCIDSILGRR
SPCKMRLLGAAQSLPAPLTSRADPEKAVQGSPKSSSAPFEAELHLPPKLRRLYGPGGGRL
LQGAAAAAAAAAAAAAAAATATAGPRGEAPPPPPPTARPGERPDGAGAAAAAAAAAAAAW
DTLKISQAPQVSISRSKSYRENGAPFVPPPPALDELGGPGGVTHPEERLGVAGGPGSAPA
AGGGTGTEDDEEELLEDEEDEDEEEELLEDDEEELLEDDARALLKEPRRCPVAATGAVAA
AAAAAVATEGGELSPKEELLLHPEDAEGKDGEDSVCLSAGSDSEEGLLKRKQRRYRTTFT
SYQLEELERAFQKTHYPDVFTREELAMRLDLTEARVQVWFQNRRAKWRKREKAGAQTHPP
GLPFPGPLSATHPLSPYLDASPFPPHHPALDSAWTAAAAAAAAAFPSLPPPPGSASLPPS
GAPLGLSTFLGAAVFRHPAFISPAFGRLHLPCHSSDPLPTTPPYVQAPIYTAVQDNEIKE
TVHLITPFL

>ARX_1|GENSCAN_predicted_peptide_2|75_aa
MAPLTSASTAAALLRQPTPAVEGAVASGALADPATAAADRRASSIAALRLKAKEHAAQLT
QLNILPGTSTGKEVC

>ARX_1|GENSCAN_predicted_peptide_3|82_aa
MPWSPPPRKGAPGLLALAITCHLGLRTSLPGYVALSNSSLRLPSQDSRLLQISDSTVNNA
HHPPTHPGTHPTSVARDPQLPL