GENSCAN 1.0	Date run: 16-Jan-104	Time: 22:58:39

Sequence ARHGEF5_1 : 16679 bp : 47.29% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    284    671  388  1  1   73   99    62 0.531   3.00
aggagcactcggggaggacatatgaactcagggggtcatgccaaaacaagacctgcttgt
caagactggacagtccccctccctgcctctgctggacgcacctcctggcccccggccaca
gctagatcaacagagtctttcacttccaccagcaggagtaagagcgaagtgtcccctggc
ATGGCTTTCAGCAACATGACAAACTTCCTATGCCCCTCTTCCCCTACCACTCCCTGGACT
CCGGAGCTCCAGGGACCCACCTCTAAGGATGAAGCAGGGGTCTCAGAACACCCTGAGGCC
CCTGCGAGAGAACCTTTGAGAAGGACAACCCCTCAGCAAGGAGCCAGTGGCCCAGGGAGG
TCACCTGTGGGCCAAGCAAGGCAGCCAGAAAAACCCAGCCATCTGCACCTGGAGAAGGCG
TCCAGCTGGCCCCACAGGCGGGACTCAGGGAGGCCACCAGGGGACAGCAGTGGACAGGCT
GTGGCTCCTAGTGAGGGGGCCAACAAGCACAAGGGCTGGAGCCGGCAGGGCCTGCGCAGA
CCTTCCATCTTGCCTGAGGGCTCTTCAG
gtgagcaagaaccgggaccacagtgacacatcagaccaagcttttcccagctcttcccac
ctcatcccatcttctgtccctagggttcctaacattctctctgctggcttcgctcacatg
tgtgcctaacctcaacacagagagagccccgaagtgccctccaagctccttgggatgctc

 1.02 Intr +   1203   1260   58  1  1   94  123    39 0.960   5.94
ggctcgggacgtcactggtactgatctctttcacgctctctgctttgccactttcttccc
tcccagtttcccaataaagcccattttcccttctggacccctgcatctcctgctctagat
cttctttggctctctctatccctctgaaatctcagttatcttccccatttccacttttag
ATTCAAGAGGTCCAGCCGTGGAGAAACATCCGGGACCCTCAGACACTGTTGTTTTTCG
gtaagtcaccctctcccctaacagccacactgtactctcttcacttgggatactgagagt
cccctagtgaggatttcagttgataagggttctgagaccccgaatggctcattcctgttt
cttcttggtgtaaggaggagaagagagagatgccgggatgccagcttagggagattgtct

 1.03 Intr +   1694   1758   65  2  2   92   76    36 0.963   1.16
ccacaaaagctccatgttagcccaagtgggtcaagttgcaactctgtgcctctccggggg
atttggtcttgacatcccagcacacatcccagaggctagagactttgtcccccacagtca
cctggggtgctctgtgtctaatggccagtaggtaccccagagcccttctttactccacag
GGAGAAAAAACCAAAGGAGGTGATGGGAGGCTTTTCAAGACGCTGCTCCAAACTCATCAA
CTCCT
gtgagtaccttgaagtggaactatagacccaggtgggcattctgcccagcactggagagg
ggcttgtggatgtcagcagtggggtggggatgtccctagcgagaaggcattgcagaagat
ctgtttccctgcctccgctcaccgtcccttcttcctgcttctcttccttcccatgcttct

 1.04 Intr +   1950   2203  254  1  2  112   69   386 0.956  36.25
gaagtggaactatagacccaggtgggcattctgcccagcactggagaggggcttgtggat
gtcagcagtggggtggggatgtccctagcgagaaggcattgcagaagatctgtttccctg
cctccgctcaccgtcccttcttcctgcttctcttccttcccatgcttctttgccctgcag
CCCAGCTGCTTTACCAGGAGTATAGTGATGTTGTCCTGAATAAGGAGATCCAGAGCCAGC
AGCGGCTGGAGAGCCTGTCCGAGACACCCGGGCCTAGCTCTCCGCGGCAGCCTCGGAAGG
CCCTGGTCTCCTCCGAGTCGTACCTGCAGCGGCTCTCCATGGCCTCCAGCGGCTCCCTCT
GGCAGGAAATCCCCGTGGTGCGCAACAGCACCGTGCTGCTCTCCATGACCCATGAAGACC
AAAAGCTGCAAGAG
gtactgggcaggccacggtggggagggggctacagaaaagatgacaaggtcttgtttgct
agcatccttctctcctgctcaccactgccaccaagagtcggctgtccccaccacacaggc
gcgtgtacatgccacagctgtgggtcatgtccattctgaagaagttgaattgttttctgg

 1.05 Intr +   5868   5963   96  2  0  121   51     4 0.519   0.21
atagtaacagaattaccctagtgaagcaaattaacatatccatctcactttgttactcat
tttttgttcttgtttttgttatttagatttcttgaaagcatatttttcttttataccttt
gaaggcctgtacctgtaccctagattaaatcctactcaaatgctcttcccctccccgcag
ATCAACTTAGGTTCTGGTGTGACTGTGAACCATAAAAGGATCTTGCTCACCGCTCTATCC
CCAAAACATTACACAGTGGCTGGCATATTCCAGGGG
gtaaataaaatccttaattaatcttcctcatctccaacctcctaggtcaaatttgagctg
attgtgtcagaggcctcctacctgcgcagtctaaacatagctgtggatcatttccaactt
tcaacttcactccgggccacactttccaaccaggagcaccaatggctcttctctcgttta

 1.06 Intr +   6009   6169  161  2  2   96   57   147 0.999  11.19
agattaaatcctactcaaatgctcttcccctccccgcagatcaacttaggttctggtgtg
actgtgaaccataaaaggatcttgctcaccgctctatccccaaaacattacacagtggct
ggcatattccagggggtaaataaaatccttaattaatcttcctcatctccaacctcctag
GTCAAATTTGAGCTGATTGTGTCAGAGGCCTCCTACCTGCGCAGTCTAAACATAGCTGTG
GATCATTTCCAACTTTCAACTTCACTCCGGGCCACACTTTCCAACCAGGAGCACCAATGG
CTCTTCTCTCGTTTACAGGATGTGCGAGACGTCAGCGCCAC
gtgagactccccttctcctaaataccactcactcagcctcactgttgttaggctttaaat
gttccattatttttttcccctcttagaaaccctttcattttcacagttatgggaagaaat
tgggcctcagtttcttccatgattcccgatggttataggaagaagaatggcaagaagagg

 1.07 Intr +   6725   6877  153  2  0   47   39   182 0.995   8.39
tcaaactcaggacagttttggagcgaattctagtgcatagcaaggagcacagagcagaga
gtcaatagaagttattggctaatggagtgaagaagtcaccagctcagtgtacaccagggg
ccagtcagctgtttagctgaggtccagaagtctcatgagactcgtttaaatcctgtacag
GTTCCTTTCAGACCTGGAAGAGAACTTTGAGAACAATATCTTCTCCTTCCAAGTATGTGA
CGTAGTCCTGAACCACGCCCCAGACTTCCGCCGGGTCTACCTGCCTTATGTCACCAACCA
GACCTATCAGGAACGCACCTTCCAGAGCCTGAT
gtgagactcatcccccatttaatccccatgtaggccctgaggtgaccatgcaccagtccc
cagcccagagggctatcccaagagcacactttccccattccgccctctgtattggttacc
ccagcatcacatctgagcgccctgtacatcagctcccaccgctctttcccagcccacaga

 1.08 Intr +   7147   7276  130  1  1   91   91   139 0.999  15.20
tttccccattccgccctctgtattggttaccccagcatcacatctgagcgccctgtacat
cagctcccaccgctctttcccagcccacagatactccactgacctcccttccccgcacct
tcactgcatcccccattgctccccatatcccccctcccctaaggctcactctccgtgcag
GAATAGCAACAGCAATTTCCGGGAGGTCTTGGAGAAGCTGGAGAGCGACCCCGTCTGCCA
GCGCCTTTCCCTCAAGTCCTTTCTGATTCTGCCCTTCCAACGCATCACCCGCCTCAAACT
GCTGCTCCAG
gtagggcagatgctaccttgatcctctccccttaactcaaagggatgccctcaggaagac
ccacaacaaaggaccaaccattcttctgccaggtgtccacatcctgtctccctgctgccc
actgcctgcttgcttggaaatatttgctgctaaaatgtggtccctgggcttctccattca

 1.09 Intr +   7531   7605   75  0  0   99   96    65 0.996   8.11
caaccattcttctgccaggtgtccacatcctgtctccctgctgcccactgcctgcttgct
tggaaatatttgctgctaaaatgtggtccctgggcttctccattcaccagcccccaatca
tttcttcctgtttcccatttcttcctcccatctcactcctgcctactgtctgttcaatag
AACATTCTGAAGAGAACACAGCCTGGCTCCTCGGAGGAGGCAGAGGCCACGAAGGCACAC
CACGCCCTGGAGCAG
gtaggcagccaccacctccactctgaccctctgtgtgttcttctcagagaggtctttccc
acctaggcccatgactccagggagcatgggaggtgggaccctgttgggagagctcagcca
cctccctgcatccgccaaacttccaaacatacacaccccacggccacctctcccacgccg

 1.10 Intr +   8043   8132   90  2  0  101   87   106 0.999  11.99
atgttctgccaagtcatacaaagttgcctaggacttgtgcttgatactgcctgcccaatt
ccagcctgggaaaaatgactgaggctgtcataacctatttccagattgcttgcaagggac
tcaaaggtcaaagcctctaacacagtgcccttcccctttcctcactctctgcacccctag
CTGATCCGGGACTGCAATAACAATGTCCAGAGTATGCGACGGACAGAGGAGCTAATCTAC
CTGAGCCAGAAGATTGAGTTTGAGTGCAAA
gtgagtcggtcccatgcaccccatccctgcccatgaactcccttaacatgtcctgcagat
caccccctaacctggaaccaccttgctcacattatccccagcccctcccctcatttctgc
ccaccttctattctgtcctatttctgggaggcctacttgggtctccaagacatactgcta

 1.11 Intr +   9581   9738  158  1  2  121   15   151 0.741   9.91
gggttcagagagaaagagaatctaagtgatgggttatgagccagaaggcagatgtggaag
agatgcttgttcaagtggaacttgcatggaagtggcatggggaggagggtgggactggga
gcaaacctcatgcttctcccatctgtgacactgccttctctctcttcctctgccctgtag
ATATTCCCGCTCATTTCTCAGTCACGCTGGCTGGTGAAAAGTGGGGAGCTGACAGCCTTG
GAGTTCAGTGCTTCCCCAGGGCTACGAAGGAAGCTGAACACGCGTCCAGTCCACCTGCAC
CTCTTCAATGACTGTCTGCTGCTGTCTCGGCCCCGAGA
gtcagtgactggagtggcaggccagggcacaagaggggaaggggatgaggaaagaggggg
gtctgaaagggagagagaagggtcatgttcctagaagagcccttctcaatggcttaaccc
atagagcccaggtcatagcctagagaagagaaaaacaagcccaaagcaaaaaggggatcc

 1.12 Intr +  10363  10524  162  1  0   31   33   168 0.902   4.69
ccagactaggacaaatccctgaggaccctggagctaccatcttgggagcaagttaggacc
atcttatggtttttgtgggaatttgcaggctgtagatgtagggatttcgaacccaaggtt
atgagggtaggtgaagtatggaaactctagaatcaggttgaaaagatttgtattttgcag
GGGTAGCCGATTCCTGGTATTTGACCATGCTCCCTTCTCCTCCATTCGGGGGGAAAAGTG
TGAAATGAAGCTACATGGACCTCACAAAAACCTGTTCCGACTCTTTCTGCGGCAGAACAC
TCAGGGCGCCCAGGCCGAGTTCCTCTTCCGCACGGAGACTCA
gtgagatggggctgggcagaggagctgggggtgggggaagatgggcagccgagaaaagaa
gtgagaccaaggcagaaaatgtgtccagaagacagccacagcctcatttagcccattctg
gactggggaccaccatagagaaattcagactcctaaaactaatggataacttgcaggaga

 1.13 Intr +  11969  12039   71  2  2   86   79    -1 0.821  -2.37
agacacattcacatatagttcaagattgtgtcatgtttccaaccaaaaagagtcctatat
agtgtttgatagaagtcatggaactagataagtccttcccacagtcttttgccatcccca
tccttggccctcctcctacacccccacaattgatgctatgaccctgctttgttttctcag
AAGTGAAAAGCTTCGGTGGATCTCAGCCTTGGCCATGCCAAGAGAGGAGTTGGACCTTCT
GGAGTGTTACA
gtgagtgagggtctaagagggagagaaaagaaagcagggtcagatgtcacctttggatac
aggagtttaaagggctgggtgggaactctaggctttcatttattgatattccgtaaaatg
tcagggtggaagattggcctctagagcttaaaaacctgaaatatatcgcctaaaactgct

 1.14 Intr +  13611  13715  105  1  0  108  115   145 0.999  19.39
catttctggtatgtctatttctttattaaccattttcaaatttcctcttttgtctgtgac
tttcaacagcccaaatctgtctctatctcaagtcctcccacgccacccccctccaagtcc
ctgtctgtgttccaatcccctgcctctcctaacctctcttcacactcttctcttccaaag
ACTCCCCCCAGGTACAGTGCCTTCGAGCCTACAAGCCCCGAGAGAATGATGAATTGGCAC
TGGAGAAAGCCGACGTGGTGATGGTGACTCAGCAGAGCAGTGACG
gtaagcgggagcatgcgtgagcagcaggccaggcactgcaggcagggcagtgctgggagt
gtgttcacttcctgcagctgccatgacaaagtaccatggactgggtggcttatggcaaca
gaaatgcattctctcacagttctggaggcaagaagtcccaaatcaaggtgtgggcagagc

 1.15 Term +  14748  14905  158  1  2  103   41   187 0.999  13.60
cagctgtagacttggccacacccagggccgccatgttaggacacttggaggctgggttta
tgctggaggctccccagttcactcagcctgaggattcagaaaagtctccaggaaggtgat
ccagagaagctatcgtgagcctttcccaggtaattcttcccactcaccctgtgcccacag
GCTGGCTGGAGGGCGTGAGGCTCTCAGACGGGGAGCGAGGCTGGTTTCCTGTGCAGCAGG
TGGAGTTCATTTCCAACCCAGAGGTCCGTGCACAGAACCTGAAGGAAGCTCATCGAGTCA
AGACTGCCAAACTACAGCTGGTGGAACAGCAAGCCTAA
gtcttctctgagaggagtttcgtgagctgaagaacaagctgctcatggcaagggctggcc
ccagaaccctgcaagagaggccttctgtggatggagaactaggccttctcaaagctcaag
gacaaaatccagctaacccagtccctcggcccaggcctcctttcgtgctttgtgcttggt

Predicted peptide sequence(s):


>ARHGEF5_1|GENSCAN_predicted_peptide_1|707_aa
MAFSNMTNFLCPSSPTTPWTPELQGPTSKDEAGVSEHPEAPAREPLRRTTPQQGASGPGR
SPVGQARQPEKPSHLHLEKASSWPHRRDSGRPPGDSSGQAVAPSEGANKHKGWSRQGLRR
PSILPEGSSDSRGPAVEKHPGPSDTVVFREKKPKEVMGGFSRRCSKLINSSQLLYQEYSD
VVLNKEIQSQQRLESLSETPGPSSPRQPRKALVSSESYLQRLSMASSGSLWQEIPVVRNS
TVLLSMTHEDQKLQEINLGSGVTVNHKRILLTALSPKHYTVAGIFQGVKFELIVSEASYL
RSLNIAVDHFQLSTSLRATLSNQEHQWLFSRLQDVRDVSATFLSDLEENFENNIFSFQVC
DVVLNHAPDFRRVYLPYVTNQTYQERTFQSLMNSNSNFREVLEKLESDPVCQRLSLKSFL
ILPFQRITRLKLLLQNILKRTQPGSSEEAEATKAHHALEQLIRDCNNNVQSMRRTEELIY
LSQKIEFECKIFPLISQSRWLVKSGELTALEFSASPGLRRKLNTRPVHLHLFNDCLLLSR
PREGSRFLVFDHAPFSSIRGEKCEMKLHGPHKNLFRLFLRQNTQGAQAEFLFRTETQSEK
LRWISALAMPREELDLLECYNSPQVQCLRAYKPRENDELALEKADVVMVTQQSSDGWLEG
VRLSDGERGWFPVQQVEFISNPEVRAQNLKEAHRVKTAKLQLVEQQA