GENSCAN 1.0	Date run: 16-Jan-104	Time: 05:29:02

Sequence NR1I2_1 : 40401 bp : 46.02% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  12723  12983  261  2  0   27   99   309 0.304  23.58
tgccaagtgtttgagtccatcggcaagtttggactggccttagctgttgcagaaggcatg
gtgaactctgccttatataacatggatgctgggcacagaactgtcatctttgaccgattc
tgtggagtacaggacattgtggtagggtaagggactcactttctcataccatgggtacag
AAACCAATTATCTTTGACCACCGTTCTCAACCACATAATGTGCCAGTCATCACTGGTAGC
AAAGATTTACAGAATGTCAATATCATTCCGTGCATCCTCTTTGGGCCTGTCACTAGCCAG
CTTCCTCGCATCTTCACCAGGATCGGAGAAGACTATGATGAGCGTGTGCTGCCATCCATC
ACTACTGAGATCCTCAAGTCAGTGGTGGCTCGCTTTGATGCTGGAGAACTAATCACCCAG
AGAGAACTGGTCTCCAGGCAG
gtgagcgacgaccttatggagcgagcagccacctttgggctcatcctggatgatgtgtct
ttgacacatctgaccttcaggaaggagttcacagaagcagtggaagccaaacaggtggct
cagcaggatgcagagagggccagaactcactggccactgcaggggacggcctgatggagc

 1.02 Intr +  13064  13266  203  1  2   73   53   167 0.365   9.78
gcgtgtgctgccatccatcactactgagatcctcaagtcagtggtggctcgctttgatgc
tggagaactaatcacccagagagaactggtctccaggcaggtgagcgacgaccttatgga
gcgagcagccacctttgggctcatcctggatgatgtgtctttgacacatctgaccttcag
GAAGGAGTTCACAGAAGCAGTGGAAGCCAAACAGGTGGCTCAGCAGGATGCAGAGAGGGC
CAGAACTCACTGGCCACTGCAGGGGACGGCCTGATGGAGCTGTGCAAGCTGGAAGCTGCA
GAGGACATCACGTACCAGCTCTCTCGCTCTTGGAACATCACCAATCTGCCGGCAGGGCAG
TCCGTGCTCCTCCAGCTGCCCCA
gtgagggcccaccctgcctgcacctccatgggccaactaggccacagccccagtgattct
taacactgccttccttctgcctccactccagaaatcactgtgaaatttcatgattggctt
aaagtgaaggaaatacagataaaatcacttcagatctcaaaaaagaaaaaaaaaatttag

 1.03 Intr +  27946  28164  219  1  0   90   80   197 0.992  16.52
tacagctctctttataacaattccaacccccattccccgctttttttccccatgagaggt
cagctcccgagttcacaggcccaaatgtgagtgatgcatagaagggacagagtgtttcct
ctgaggcctctacacatccctgtccagtcttttcattctctgtggttttctcatttctag
TCCAAGAGGCCCAGAAGCAAACCTGGAGGTGAGACCCAAAGAAAGCTGGAACCATGCTGA
CTTTGTACACTGTGAGGACACAGAGTCTGTTCCTGGAAAGCCCAGTGTCAACGCAGATGA
GGAAGTCGGAGGTCCCCAAATCTGCCGTGTATGTGGGGACAAGGCCACTGGCTATCACTT
CAATGTCATGACATGTGAAGGATGCAAGGGCTTTTTCAG
gtagagttacccatcagccttcacccacgtgccaccactgacccactgggtaacgtctca
gggcctcagcttgacctgtcccccaggttcagagtgtgggctggtggcccacccaaaggc
cttgtaattagtctcaagggagccatttatatcccagaggaatccttcatcttcagtctt

 1.04 Intr +  30778  30911  134  1  2  101   44   237 0.998  20.89
gacctctagggactcccacctacacccttcccataaagcctgacccagctgggacgcaaa
ggctagtgtccccctccccgagtcggtaggggctggggagggaggtggtatggcccggag
ccccaggccgagggcccgggcacccgtgcatccccccttctgctccccattctctcacag
GAGGGCCATGAAACGCAACGCCCGGCTGAGGTGCCCCTTCCGGAAGGGCGCCTGCGAGAT
CACCCGGAAGACCCGGCGACAGTGCCAGGCCTGCCGCCTGCGCAAGTGCCTGGAGAGCGG
CATGAAGAAGGAGA
gtgagcagtgggcgcgcgggcgggccggcgccggggtgcacggctctgagtaaggacgtg
ccgtgggtgtgggcatgcttgtgtggagatgcgcgccgagtgtgcgcgtgaacacacgtg
cacatgtgagctggtgtccgtgtgcaacaggcagccacctgggggagcgcttgcagtcgg

 1.05 Intr +  32256  32443  188  1  2   74   57   217 0.998  16.61
gtttctccagaagagcccacaggcctcttgagtccagacaggggagaattgcttgtcacc
attactttctcttttgcctaacggcttctgctgccttgagagggttacacagtggctctc
cagggggctggaggctcaccaggggcacgtgtgcctgagccagcctcactgtccctgcag
TGATCATGTCCGACGAGGCCGTGGAGGAGAGGCGGGCCTTGATCAAGCGGAAGAAAAGTG
AACGGACAGGGACTCAGCCACTGGGAGTGCAGGGGCTGACAGAGGAGCAGCGGATGATGA
TCAGGGAGCTGATGGACGCTCAGATGAAAACCTTTGACACTACCTTCTCCCATTTCAAGA
ATTTCCGG
gtaggaggaactgcacagtgacccgaggtgtcactgccatcttcattctcacatagaaac
tgaggttccccaaggataagaaacttatacaaggtcacagctaatcagtggtggagggta
gatttggagagctggtcctgcatctgtgctagctcctcaaagccttagtctcattcccaa

 1.06 Intr +  33403  33677  275  0  2   78   91   286 0.996  24.24
aaatgctgtgtgtgtatatgtgtgaggacacacgcatgcatgtgggtgtgaatgcctgca
tttgtgcatcctctcgagctgcaactgtggctgtgcatgtttggctggggcctgagttgg
gacctgtctatgaaagcacatgctgtctctcctctgtccacctcctggcatgtgtcctag
CTGCCAGGGGTGCTTAGCAGTGGCTGCGAGTTGCCAGAGTCTCTGCAGGCCCCATCGAGG
GAAGAAGCTGCCAAGTGGAGCCAGGTCCGGAAAGATCTGTGCTCTTTGAAGGTCTCTCTG
CAGCTGCGGGGGGAGGATGGCAGTGTCTGGAACTACAAACCCCCAGCCGACAGTGGCGGG
AAAGAGATCTTCTCCCTGCTGCCCCACATGGCTGACATGTCAACCTACATGTTCAAAGGC
ATCATCAGCTTTGCCAAAGTCATCTCCTACTTCAG
gtaggacatggagactgggtggttgggtgtggaaaagaactggaagtggccaggaggttc
aaagggcctggggtagatcctgaatttgggggatattggtgtcagaagaccctccttttc
ctgtgccctttccccgggcagccagtgctgctggggagtagagcccttgctgtatggctg

 1.07 Intr +  35696  35838  143  2  2   73   49   221 0.988  16.70
gctgggttgtgaggggagagatgagaggcagccagacagcagccacagtcatcctcaggg
aaaggagccatcctccctcttcctctcgcccccaacttctggattatgggatggctgctg
gtgccggtctgtgggctgcctcccagggagctgtcctcccctccccatccttgctgccag
GGACTTGCCCATCGAGGACCAGATCTCCCTGCTGAAGGGGGCCGCTTTCGAGCTGTGTCA
ACTGAGATTCAACACAGTGTTCAATGCGGAGACTGGAACCTGGGAGTGTGGCCGGCTGTC
CTACTGCTTGGAAGACACTGCAG
gtgcccgagagagcctgcctgccctggcagagggagggaaacactgcagttatgggagga
agggagctacgccaggatatgcaggttctgggatggcagggcaggaagatggaatggtgg
aaaacaagatattggtgagggatgattagatcttggtcagcttgctgagaagctgcccct

 1.08 Intr +  36040  36156  117  2  0  105   94   225 0.999  25.24
ccctggcagagggagggaaacactgcagttatgggaggaagggagctacgccaggatatg
caggttctgggatggcagggcaggaagatggaatggtggaaaacaagatattggtgaggg
atgattagatcttggtcagcttgctgagaagctgcccctccatcctgttaccatccacag
GTGGCTTCCAGCAACTTCTACTGGAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGC
TGCAGCTGCATGAGGAGGAGTATGTGCTGATGCAGGCCATCTCCCTCTTCTCCCCAG
gtgaggatctcccctaggctgcctgacatcccccccagccttatctgccctccccaggga
aggtcccagtctatggccttgctcctcattcactgcgcagccaggatgggggctctcgct
ggtttctcctggggtcagtgggtgatgcccagccctggtcttccttcacttccctgcctg

 1.09 Intr +  36443  36548  106  0  1  111   71   123 0.998  12.17
tgggggctctcgctggtttctcctggggtcagtgggtgatgcccagccctggtcttcctt
cacttccctgcctgggtgactccagctctggagggtggttggcgagcaatgccctgactc
tgggctggactgagcttgtctttgccccatgatcttgcaccacacctccctcccctccag
ACCGCCCAGGTGTGCTGCAGCACCGCGTGGTGGACCAGCTGCAGGAGCAATTCGCCATTA
CTCTGAAGTCCTACATTGAATGCAATCGGCCCCAGCCTGCTCATAG
gtgagcacagcagggggtgaggacccgtgagggtgatgtgagggagccgaggttcaggga
aattgcccaagacttcatggccagagggtggcatctggaggtagccccagccagaccagg
tccaaagctcacacttttgagcactacctaaccacttcccaggaaaaacacaagcaaaca

 1.10 Intr +  37785  37921  137  0  2   58   70   288 0.565  24.21
caggctgttctgcctttctcatctacagggtaaaagagaagcttacggaattcagccaag
ccttgtctcttggctgacctgaaatgtccagagattatgcttgtgcagcctcagagcagc
cctgaggcttgtgggtcagggcgggctgcacccacaatcttttctctggctggcatgcag
GTTCTTGTTCCTGAAGATCATGGCTATGCTCACCGAGCTCCGCAGCATCAATGCTCAGCA
CACCCAGCGGCTGCTGCGCATCCAGGACATACACCCCTTTGCTACGCCCCTCATGCAGGA
GTTGTTCGGCATCACAG
gtagctgagcggctgcccttgggtgacacctccgagaggcagccagacccagagccctct
gagccgccactcccgggccaagacagatggacactgccaagagccgacaatgccctgctg
gcctgtctccctagggaattcctgctatgacagctggctagcattcctcaggaaggacat

Predicted peptide sequence(s):


>NR1I2_1|GENSCAN_predicted_peptide_1|595_aa
KPIIFDHRSQPHNVPVITGSKDLQNVNIIPCILFGPVTSQLPRIFTRIGEDYDERVLPSI
TTEILKSVVARFDAGELITQRELVSRQEGVHRSSGSQTGGSAGCREGQNSLATAGDGLME
LCKLEAAEDITYQLSRSWNITNLPAGQSVLLQLPHPRGPEANLEVRPKESWNHADFVHCE
DTESVPGKPSVNADEEVGGPQICRVCGDKATGYHFNVMTCEGCKGFFRRAMKRNARLRCP
FRKGACEITRKTRRQCQACRLRKCLESGMKKEMIMSDEAVEERRALIKRKKSERTGTQPL
GVQGLTEEQRMMIRELMDAQMKTFDTTFSHFKNFRLPGVLSSGCELPESLQAPSREEAAK
WSQVRKDLCSLKVSLQLRGEDGSVWNYKPPADSGGKEIFSLLPHMADMSTYMFKGIISFA
KVISYFRDLPIEDQISLLKGAAFELCQLRFNTVFNAETGTWECGRLSYCLEDTAGGFQQL
LLEPMLKFHYMLKKLQLHEEEYVLMQAISLFSPDRPGVLQHRVVDQLQEQFAITLKSYIE
CNRPQPAHRFLFLKIMAMLTELRSINAQHTQRLLRIQDIHPFATPLMQELFGITX