GENSCAN 1.0 Date run: 16-Jan-104 Time: 03:25:08 Sequence C21orf89_1 : 26778 bp : 49.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 52 47 6 1.05 1.03 Term - 1378 1029 350 2 2 91 37 171 0.167 6.95 1.02 Intr - 3177 3074 104 2 2 62 -14 104 0.407 -2.58 1.01 Init - 4574 4486 89 2 2 70 95 92 0.949 8.21 1.00 Prom - 10736 10697 40 -4.26 2.01 Init + 11761 11833 73 0 1 93 45 18 0.465 -0.56 attggggctttatttgggaagtgtaaacccaggactgtgagagtgaggtgcgagagattg aggcaaggaaggacctccctgcagtgatgcgatgtgagtctcaactgtgctggctgctgt ttcataaggggctgaagtgacctggctcggcagatgcctctgggtcagctgcatgggaag ATGGCTCCAGGGATTCCACAGGAGGGGAACGGAATGTATCTTCTGGCTCTCTTCTGGCGG CTACTTTCCACTG gtcagagctgtccccgcagggaatcaggggccctgggttgcattcccagcctcctcagca gctgctctgggtaccacatccccatgctgcagagtggcatctcatgaggacacagaagtg gcaggtggagcatgtggctggtctgcctggctgcactgagttggtcgaggcaggtcagca 2.02 Intr + 12251 12345 95 0 2 95 94 43 0.678 5.28 cacgagatgtctgctgacacagggtctaagatggcccagttatgcactggcccctgtgca tttctgggccacgacaccttctgaattgagttctcagtgtcattacaagtcacaggtttc tgggtgatgttaagatatcacaaggtggactggctatgtttccaaccctttttcctctag CTCCGTACACCATCTCCATCCCCAGCGCTGGCCCTTGGCAGCTCAGGGTAAAGTCATGTG AAGCTCTGGGTGCAGGTCAAGGGCTAGTGGTGGAG gtaagaggctctgctcctatggggtgggggtgaggggctctgtgcctatggagtgagggt cagggggtgaggggctctgtgcctatggggtgggggtgaggggtggggggctctgtgcct atggggtggggatgaggggtggggggctctgtgcctatggggtgggggtgaggggtgagg 2.03 Intr + 15922 16212 291 0 0 47 101 101 0.402 4.63 ccttaaagtagatgagcctggatcctctgggcgatgtgagtccctgacagcaggagaggg ggcaggaggggaaggcggggactcgaagcatgagggagactggctgctgctgctgctggc atagaaggagctggcttctgccaagaacctgagggagcccagtcgctgctgcctccccag CACCCTCAGGCAAGACCTCAGGCTGGGCATCGCCTTGCCTTTGGCCTGGTGAGGCTCCAA GCAGAGAACCCAGTCATCACCCTCAGAAACTTGTTTCAGTCCTTTCGGTTTGTAACAATC CAAGCACCGCCCCCAGAGTTACATGATAACTTTCATTGTCTCCTTCACTGTGGAGGCCAT GAGGAGCCCCTGCCTGAAAAATCCTCAAGTCCCCATGCTGAGTTGAACCGTACAGTCCGC AGTAGGGCAGGACCTGCCCGCCTGTTCGTCACCCCTGGCGGTGGCTGTGCG gtgagtctcccgtaagcagcacataactcagcttcaaaaacccaatctcataattgtgcc tttgatcagaagttgtatctatttagatttattgtgatttctgatatatttggattgttt atacctctctatattgtgttttctaatttatccttttgttgtttttttctcctttcttac 2.04 Intr + 18249 18378 130 2 1 95 43 21 0.523 -1.43 taaaccttgaacccacttgaggatggcagtggtagaaattctcgagactttttctcttat ttgctcctcttttcagagccgaggctcacgcagcaagcgtccctgccacttccccgtgac tggcgggtttccctggctcactcttgagggcccttcttttgggctcctggcttcctgcag GGCTTCTGCTCTGGGTTCCTGCCTCACACAGGCCCCAGGCTTCATCTCTGTGCTTGCCTG GGGCTCATTAATGTTCAGGATCCTGGACAGCAGGGATCAACAGGACCTGTACAGACGGTG CTGTGCCCAG gtggactcacgcttctccatggcctcaggccttttgggattcaagttaccttctccccaa aaagattttcaaatgtgatgttattgggcattttcgggttttagaggctgtctgttggga ggcctcctcccccatctcttctgctgtattctggaaggggtgttcctggcccacctcctc 2.05 Intr + 20566 21002 437 2 2 93 18 200 0.222 6.80 gtggatctggggatgagtgactcggcttgaatccctcaggaggtgtcgggcgccaaggtc ccgagcagttcctgcttctcgtttttataacctgaggtgtcccaattagctgctgcttca gaacaaatcaccctcaaacttaatggcataaacaaccactcatttaattgtcttccacag TCCTCTGGGTCAAGGTTCCAGGCGGGCTGGGCTCAGGGCTCTACTTGCAAAATGCACTTT ATGGGCCCAGCTTCTCCCTGCCGTGGTGGGGGCCCAAGAAGCATCCCCAGTCTGGTGGAG TCCCCGGTCTGGATGACCCCCCGGTCTGGTGGAGCCCCCCGGTCTGGTGGAGCCCCCCGG TCTGGATGAGCCTCTGGTCTGGTGGAGCCCCCAGTCTGGATGAGCCTCTGGTCTGGTGGA GCCCCCGGTCTGGTGCAGCCCCCGGTCTGGATGAGCCCCCGGTCTGGTGGAGCCCCCCGG TCTGGTGGAGCCCCCGGTCTGGTGGAGCCCCCCGGTCTGGTGGAGCCCCCCGGTCTGGTG GAGCCCCCGGTCTGGATGAGCCTCTGGTCTGGTGGAGCCCCCGGTCTGGTGGCGTCCCCG GTCTGGTGGCGTCCCCG gtctggtagagaacgttgaggtgagtgctcgaggggttgctcgctgctgctgggccccac ggtgcccacctggcccccaggaccccaaccccacacagccctttgttagcctccccaccc catgcaccttcctccctccccagaacatctggtgaagatttgttgtaacctcctgggccc 2.06 Term + 25300 25497 198 0 0 80 49 176 0.684 10.30 cccctcccgcggtcctgggtcatccacctgctggcctcactctgcccacgcggccaggtc ccaccggcccctgagctcaacagaccaaagctggcccgaccccacccccaagaagaatga aacaatttttttttacctcttgcagaaaagtaaaagatcatttattcattctgtttctag ATAGCAAAACTAAGTGTCAAAAGCACCTTCTGCACACAGTCTGCACACACTGGCCGGTGG TCCTGTTCCCGCAAGGTTGAGCTGTGTTCCAGAGACATGGGTCCTCCGGGTGATGAGGAG CCGCTGGAGGGCCCTGAGCTGCACGTGCTAATGATTAACGCCCCGTCCGTGCTGGCCGGT TTCTCAAATGCCTCCTGA cgattgcgcacagccggacatcatttgtactgagagacaaaaggaaatcactgaggcttt ctgaggtgagctgggcggccgcggggggactggactcacacctgctaacggccagtccac aggacctgcccagaggctcggacacacagctgaagtcacatcccacggggagcatcctca Predicted peptide sequence(s): >C21orf89_1|GENSCAN_predicted_peptide_1|180_aa MNKAAINIREQVLCGPKFSASLERNCWTICALKVCPLSGFLGLDGSEKPGFCYDVTDLMV HGVPSPPCARHHLGPPGTSGSYPRVLAVTETPRAGSVSSLGKALGTSGHSQRRTHDLCVA GTRTHPGLNTVKMDAFRFRLPLPPAASACRFCRRFCRRFCLRPRKRTSFCSFGSFGRSWT >C21orf89_1|GENSCAN_predicted_peptide_2|407_aa MAPGIPQEGNGMYLLALFWRLLSTAPYTISIPSAGPWQLRVKSCEALGAGQGLVVEHPQA RPQAGHRLAFGLVRLQAENPVITLRNLFQSFRFVTIQAPPPELHDNFHCLLHCGGHEEPL PEKSSSPHAELNRTVRSRAGPARLFVTPGGGCAGFCSGFLPHTGPRLHLCACLGLINVQD PGQQGSTGPVQTVLCPVLWVKVPGGLGSGLYLQNALYGPSFSLPWWGPKKHPQSGGVPGL DDPPVWWSPPVWWSPPVWMSLWSGGAPSLDEPLVWWSPRSGAAPGLDEPPVWWSPPVWWS PRSGGAPRSGGAPRSGGAPGLDEPLVWWSPRSGGVPGLVASPIAKLSVKSTFCTQSAHTG RWSCSRKVELCSRDMGPPGDEEPLEGPELHVLMINAPSVLAGFSNAS