BLASTX nr result

ID: Paeonia24_contig00002988 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00002988
         (2465 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   769   0.0  
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   766   0.0  
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   655   0.0  
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...   632   e-178
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   625   e-176
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     613   e-173
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   612   e-172
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   610   e-172
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   605   e-170
gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus...   590   e-165
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...   580   e-163
ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun...   578   e-162
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   572   e-160
ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subuni...   551   e-154
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   541   e-151
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...   535   e-149
ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c...   533   e-148
ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobr...   500   e-138
gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise...   499   e-138
ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [A...   452   e-124

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  770 bits (1987), Expect = 0.0
 Identities = 402/663 (60%), Positives = 481/663 (72%), Gaps = 17/663 (2%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  D PI VKDAV+KLQ  LL+GI++ENQLFAAGSLMSRSDYEDVV ER++A +CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            +N+LPSER RKG YR+SLKEHKVYDL ETYMYC S CVVNSR+FA SLQEERCSV +  +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            I+ IL+LFG+ SLES           LSELKI+E  + KAGEV +E+WIGPSNAIEGYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 1751 -KDRNSKPLLLKKRKG---------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 1602
             +DRN KP  +K  K          D+GKN V ++MDF S I   DEYSI+         
Sbjct: 181  QRDRNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDT 240

Query: 1601 XXXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS------ELSIPE 1440
                            D   Q ++LE  A  +QN  E KL+ES   +S      E S  E
Sbjct: 241  TSHAKSKEPKEKASIGD---QLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAE 297

Query: 1439 VPSIPCQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSM 1260
            VPS+P Q+GS++   +GK++  TE  AQ+G +  KSS+K SG KK+ RSVTWAD+K DS 
Sbjct: 298  VPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADEKMDSA 357

Query: 1259 NNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVT 1080
            ++ + C +RELE  KED   LG +++GDDDNALRF               AV SG++ +T
Sbjct: 358  DSRDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMT 417

Query: 1079 DAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDT 903
            DAV+EAGI+I+P P D DEG+SL+ D D+L   P P+KWP KP ++  D F+S +SWYDT
Sbjct: 418  DAVSEAGIIILPHPRDMDEGESLK-DADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDT 476

Query: 902  PPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGR 723
            PPEGF+LTLSPFATMWMALFAW+TSSS+AYIYGRDESFHEEYLSVNGREYP+KIVLTDGR
Sbjct: 477  PPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGR 536

Query: 722  SSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVI 543
            SSEIKQ LAGCL+RALPGLVA+L LP P+S LE+G+G LL+TMSF DALPSFR +QWQVI
Sbjct: 537  SSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVI 596

Query: 542  ALLFMDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQ 363
             LLF+DALS+CRIP LTPHMTSRR    KV + AQV  +EYEVMKD IIPLGR PQFSAQ
Sbjct: 597  VLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQ 656

Query: 362  RGG 354
             GG
Sbjct: 657  SGG 659


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  766 bits (1977), Expect = 0.0
 Identities = 400/663 (60%), Positives = 480/663 (72%), Gaps = 17/663 (2%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  D PI VKDAV+KLQ  LL+GI++ENQLFAAGSLMSRSDYEDVV ER++A +CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            +N+LPSER RKG YR+SLKEHKVYDL ETYMYC S CVVNSR+FA SLQEERCSV +  +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            I+ IL+LFG+ SLES           LSELKI+E  + KAGEV +E+WIGPSNAIEGYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 1751 -KDRNSKPLLLKKRKG---------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 1602
             +DRN KP  +K RK          D+GKN V ++MDF   I   DEYSI+         
Sbjct: 181  QRDRNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDT 240

Query: 1601 XXXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS------ELSIPE 1440
                            D   Q ++LE  A  +QN  E KL+ES   +S      E S  E
Sbjct: 241  TSHAKSKEPKEKASIGD---QLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAE 297

Query: 1439 VPSIPCQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSM 1260
            VPS+P Q+GS++   +GK++  TE  AQ+G + LKS +K SG KK+TRSVTWAD+K DS 
Sbjct: 298  VPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADEKMDSA 357

Query: 1259 NNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVT 1080
            ++ + C +RELE  KED   LG +++GDDDNALRF               AV SG++ +T
Sbjct: 358  DSRDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMT 417

Query: 1079 DAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDT 903
            DAV+EA I+I+P P D DEG+SL+ D D+L   P P+KWP KP ++  D F+S +SWYDT
Sbjct: 418  DAVSEARIIILPHPRDMDEGESLK-DADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDT 476

Query: 902  PPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGR 723
            PPEGF+LTLSPFATMWMALFAW+TSSS+AYIYGRDESFHEEYLSVNGREYP+KIVLTDGR
Sbjct: 477  PPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGR 536

Query: 722  SSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVI 543
            SSEIKQ LAGCLARALPGLVA+L LP P+S LE+G+G LL+TMSF DALPSFR +QWQVI
Sbjct: 537  SSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVI 596

Query: 542  ALLFMDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQ 363
             LLF+DALS+C+IP LTPHM S+R    KV + AQV  +EYEVMKD IIPLGR PQFSAQ
Sbjct: 597  VLLFIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQ 656

Query: 362  RGG 354
             GG
Sbjct: 657  SGG 659


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  655 bits (1690), Expect = 0.0
 Identities = 344/649 (53%), Positives = 441/649 (67%), Gaps = 10/649 (1%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  ++ + VKD VYKLQ SLL+GI +E+QL AAGSLMSRSDYEDVV ERS++ +CGYPLC
Sbjct: 1    MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            NN+LPS+RP KGRYR+SLKEH+VYDLQETYMYC SSC+VNSRAF+ SLQE+RCSV +P K
Sbjct: 61   NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            ++ IL+ F DL+L+S+           S LKIQEKS++  G+V LEEWIGPSNAIEGYVP
Sbjct: 121  LNEILRKFNDLTLDSEGLGRSGDLGL-SNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179

Query: 1751 K-DRNSKPLLLKKRKG--------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXXX 1599
            + DR+  P L   ++G         + ++  F+D DFTS I   DEYSI+          
Sbjct: 180  QGDRDPNPSLKNHKEGLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTSTA 239

Query: 1598 XXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQ 1419
                            N    ++ +  +     K +G+ KE    K +L+  ++PS    
Sbjct: 240  SDIKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVI-KEQLNFQDLPS---- 294

Query: 1418 NGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCA 1239
              S     E +   +    A + +S+LK S+KSSGAK+  RSVTWAD++ D+  + NLC 
Sbjct: 295  --SSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADERVDNAGSRNLCE 352

Query: 1238 IRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAG 1059
            ++E+E T E  E   S   GDD + LRF               AV SG + V  A++EAG
Sbjct: 353  VQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAG 412

Query: 1058 IVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNL 882
            I+++P   D  +G ++E++ DM+    A +KWPTKP +   D F+ ++SWYD PPEGF+L
Sbjct: 413  IIVLPPSQDLGQGGNVEKN-DMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSL 471

Query: 881  TLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQA 702
            TLSPFATMWMALFAWVTSSSLAYIYGRDES HE+YLSVNGREYP+KIVL DGRSSEI+  
Sbjct: 472  TLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLT 531

Query: 701  LAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDA 522
               CLAR  PGLVA L LP P+S LE+G G LLETMSF DALP+FRT+QWQVIALLF++A
Sbjct: 532  AESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIALLFIEA 591

Query: 521  LSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQ 375
            LS+CRIP LT +MTSRR  LH+VL+GA +  +EY++MKDF++PLGR PQ
Sbjct: 592  LSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ 640


>ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
            gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative
            isoform 1 [Theobroma cacao]
          Length = 739

 Score =  632 bits (1631), Expect = e-178
 Identities = 354/690 (51%), Positives = 447/690 (64%), Gaps = 44/690 (6%)
 Frame = -1

Query: 2294 TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 2115
            +M  +  I V +AV+K+Q  LLDGIRDE QL A+GSL+SRSDYEDVV ER+++  CGYPL
Sbjct: 54   SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113

Query: 2114 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 1935
            C N LPSE  RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + A
Sbjct: 114  CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173

Query: 1934 KIDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 1755
            K++ IL LFGDL L+             S L+I+E  + KA +V L    GPSNAIEGYV
Sbjct: 174  KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1754 P-KDRNSKPLLLKKRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 1614
            P ++  SKP   K  K    D+  +          V N++DF   I + DEY I+     
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289

Query: 1613 XXXXXXXXXXXXXXXXXIPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 1476
                             I   +         ++TI +M + S Q+  +  LKE       
Sbjct: 290  FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349

Query: 1475 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1341
                         S+  + + SI E+PS     Q+G D  + E +K+   +K     +++
Sbjct: 350  KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409

Query: 1340 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1164
            LKSS+KS+GAKKL R VTWAD KKAD+  N NLC ++E+E  K DSE  GS E G DDN 
Sbjct: 410  LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469

Query: 1163 LRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 984
            LRF               AV SG S VTDAV E G++I+P   + D+ + + ED DML  
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEP 528

Query: 983  GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 807
              AP+KWP KP +   D F  ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY
Sbjct: 529  ETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 588

Query: 806  GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 627
            GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L
Sbjct: 589  GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 648

Query: 626  ERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSICRIPGLTPHMTSRRTSLHKVLE 447
            E+GMGHL++T+SF +ALP+FR +QWQVI LLF+DALS+CRIP LTPHMT+ R  LHKVL+
Sbjct: 649  EQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHKVLD 708

Query: 446  GAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 357
            GAQ+ ++EYEVMKD IIPLGRAP FSAQ G
Sbjct: 709  GAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  625 bits (1611), Expect = e-176
 Identities = 337/667 (50%), Positives = 436/667 (65%), Gaps = 22/667 (3%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  D PI VKDAV+KLQ +LL+GI+ E+QLFAAGSL+SRSDYEDVV ERS+ +VC YPLC
Sbjct: 1    MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
             N LPSERPRKGRYR+SLKEHKVYDL ETYM+C SSCVVNS+AFA SL+++RC   DP K
Sbjct: 61   CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            ++ IL+LFG+ +LE            LS L+IQ+K+++   EV LE+W+GPSNAIEGYVP
Sbjct: 121  LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTET-VTEVSLEQWVGPSNAIEGYVP 179

Query: 1751 K--DRNSKPLLLKKRKGDTG--------KNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 1602
            K  D  SK      +KG           KNL+ ++ DF S I + DEYS++         
Sbjct: 180  KKRDNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTDA 239

Query: 1601 XXXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPC 1422
                          P   +H+    +     + +     L  S+  K +       ++  
Sbjct: 240  TVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVLK 299

Query: 1421 QNGSDIIATEGKK----DP-------RTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADK 1275
               + + A +       DP       + EKE     +  KSS+KS+G KKL RSVTWADK
Sbjct: 300  GKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWADK 359

Query: 1274 KADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSG 1095
            K D   +T+LCA +E  + K++S+   ++++ DD++ LR                AV SG
Sbjct: 360  KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVASG 419

Query: 1094 QSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQN 918
             S   DAV+EAGI+I+P   +  E +S  +DVD+L      +KWP KP ++D+D F S +
Sbjct: 420  DSDAIDAVSEAGIIILPHTENAVE-ESTVDDVDILETDSVTLKWPRKPGISDFDLFASDD 478

Query: 917  SWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIV 738
            SW+D PPEGF+LTLSPFAT+W A F+W+TSSSLAYIYGRD SF+EE+LSV+GREYP KIV
Sbjct: 479  SWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKIV 538

Query: 737  LTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTR 558
            L+DGRSSEIKQ LA CLARALP +VAEL LP P+S LE+GM  LL+TMSF D LP FR +
Sbjct: 539  LSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDPLPGFRFK 598

Query: 557  QWQVIALLFMDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAP 378
            QWQV+ALLF+DALS+CRIP L  +MT RR   HKVL G+Q+G++EY V+KD I+PLGRAP
Sbjct: 599  QWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLIVPLGRAP 658

Query: 377  QFSAQRG 357
             FS+Q G
Sbjct: 659  HFSSQSG 665


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  613 bits (1582), Expect = e-173
 Identities = 339/698 (48%), Positives = 428/698 (61%), Gaps = 58/698 (8%)
 Frame = -1

Query: 2276 PIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLP 2097
            PI VKD VY+LQ SLL G+  E+QLFAAGS+MSRSDY DVV ERS+A +CGYPLC N LP
Sbjct: 8    PISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLP 67

Query: 2096 SERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRIL 1917
            S+RPRKGRYR+SLKEHKVYDL ETYMYC S CV+NSR FAASL++ERC+V D A+ID +L
Sbjct: 68   SDRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVL 127

Query: 1916 KLFGDLS-LESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV-PKDR 1743
            ++F D S LE +           S+LKI+EK+++  G+V LE+W GPSNAIEGYV  ++R
Sbjct: 128  RMFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRER 187

Query: 1742 NSKPLLLK--KRKGDTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXXXXXXXXXXXXX 1569
              K L  K  KR       ++ NDMDF S I   DEY+++                    
Sbjct: 188  KPKELGSKSPKRGSKANNTVLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDSKVREQEE 247

Query: 1568 XXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQNGSDIIATEG 1389
                    ++F +LE       N            +  L   +V S   + GS + +   
Sbjct: 248  ILAKKAMGNEFAVLETSYAPASN----------VSRVGLVFEDVTS-SLRAGSCLSSARA 296

Query: 1388 KKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIRELEDTKED 1209
            +++   +K  +  ++ +KSS+K S  KKL+R+VTWAD+K DS     LC IRE+ED KED
Sbjct: 297  EEESHDDKAEKCTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIEDMKED 356

Query: 1208 ---------------------------------------------------SESLGSMEI 1182
                                                               ++ L + + 
Sbjct: 357  PSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADMLCNADT 416

Query: 1181 GDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEED 1002
            G++D+  RF               AV S +  V DA++EAGI+I+PRP + DEG+ +EED
Sbjct: 417  GENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPMEED 476

Query: 1001 VDMLGLGP--APIKWPTKPVTDY-DFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVT 831
             D     P  APIKWP KP + + D F+ ++SW+D PPE F+LTLSPFA MW ALF W T
Sbjct: 477  DDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNALFTWTT 536

Query: 830  SSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELG 651
            SS+LAYIYGRDES HEEY  VNGREYP+KIV  DGRSSEIKQ LAG LARALPGLVA+L 
Sbjct: 537  SSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGLVADLR 596

Query: 650  LPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSICRIPGLTPHMTSRR 471
            L TP+S LE+GMG LL+TMSF DALP FR +QWQVI LLF++ALS+ R+P LTPHM  RR
Sbjct: 597  LSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPHMMYRR 656

Query: 470  TSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 357
               HKVL+ AQ+  +EYEVMKD +IPLGR P FSAQ G
Sbjct: 657  VLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  612 bits (1578), Expect = e-172
 Identities = 337/700 (48%), Positives = 434/700 (62%), Gaps = 55/700 (7%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  D    VKD +YKLQ SLLDGI++E+QL AAGS+MS SDYEDVV ER++A +CGYPLC
Sbjct: 1    MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
             N+LPS+RP+KGRYR+SLKEHKVYDL ETYMYC SSCV+NSR F+ SLQEERC V +PAK
Sbjct: 61   GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            ++ +L LF + SL S+           S LKI+EK++   GEV  E+WIGPSNAIEGYVP
Sbjct: 121  LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180

Query: 1751 K----------------------------------------DRNSKPLLLKKRKGDTGKN 1692
            +                                         +  KP      KG  G  
Sbjct: 181  QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240

Query: 1691 L-----------VFNDMDFTSEIFIG-DEYSIAXXXXXXXXXXXXXXXXXXXXXXIPNDN 1548
                          NDM+FTS I I  DEYSI+                          +
Sbjct: 241  AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300

Query: 1547 EHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEV--PSIPCQNGSDIIATEGKKDPR 1374
            E+Q +      +S  ++   + +     K ELS  ++  P   CQ  S  I  E K+   
Sbjct: 301  ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360

Query: 1373 TEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIRELEDTKEDSESLG 1194
            +EK A+  +S LK S+K+SGAK+LTRSVTWAD+K  S  + +LC +R +EDTK   E + 
Sbjct: 361  SEKAAKPVESSLKPSLKTSGAKQLTRSVTWADEKVGSSGSRDLCEVRGMEDTKAGPEIVD 420

Query: 1193 SMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDS 1014
            +++  DD    +F               AV SG +  ++A++EAG+VI+P+PHD D+GD 
Sbjct: 421  NIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQGDP 480

Query: 1013 LEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAW 837
            +E DVD+L    + IKWP KP +   + F+ +NSWYD PPEGF+L LS FAT+WMALFAW
Sbjct: 481  ME-DVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFAW 539

Query: 836  VTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAE 657
            VTSSSLAY+YG+DES HEEYL VNGREYP+KIVL DGRS EI+Q + GCL RA P +VA+
Sbjct: 540  VTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVAD 599

Query: 656  LGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSICRIPGLTPHMTS 477
            L LP P+S LE+G  +LL TMSF DA+P+FR +QWQVIALLF++ALS+CRIP L  +M +
Sbjct: 600  LRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPALISYMDN 659

Query: 476  RRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 357
            RR     V++G ++  +EYEVMKD +IPLGRAPQFS Q G
Sbjct: 660  RR----MVVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  610 bits (1574), Expect = e-172
 Identities = 348/670 (51%), Positives = 440/670 (65%), Gaps = 24/670 (3%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M   + + VKDAV+KLQ  LL+GI+DE+QL AAGSL+SRSDY+DVV ERS+A +CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            +N+LPSER RKG YR+SLKEHKVYDL ETYMYC ++CVVNS AFA SLQ+ER S  +PAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            ++++L LF  L L S            S+LKIQEK   K GEV LEEW+GPSNAIEGYVP
Sbjct: 121  LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180

Query: 1751 -KDRNSKPLLLKK-RKGDTGK--------NLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 1602
             +DR+  P LLK   KG   K        N++ N+ DF+S I   DEYS++         
Sbjct: 181  QRDRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAD 240

Query: 1601 XXXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIP-----EV 1437
                             ++  + IL  Q  ++Q +   + ++S      L +      EV
Sbjct: 241  SNVKFKETQAKTRYKVRDDDVY-ILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEV 299

Query: 1436 PSIPCQ----NGSDIIATEGKKDPRTEKEAQIGD-SMLKSSMKSSGAKKLTRSVTWADKK 1272
             S P Q    N S +I ++  +     K A  G+   LKSS+KSS +KK++RSVTWAD+ 
Sbjct: 300  SSGPSQHDVKNKSVLIMSDDGR-----KYASHGEHDKLKSSLKSSNSKKMSRSVTWADES 354

Query: 1271 ADS---MNNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVG 1101
             D        +   I E E       +   ME  ++D++ RF               AV 
Sbjct: 355  IDGGIGKKTESSSKISEYESQAYGGSASTDME--ENDDSYRFESAEACAAALSQAAEAVA 412

Query: 1100 SGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFES 924
            SG S V DAV++AGIVI+P   + DE   L+E  +ML L  AP+KWP KP + +YD FES
Sbjct: 413  SG-SDVPDAVSKAGIVILPPSQEVDEA-ILQETDEMLDLETAPLKWPRKPGMPNYDVFES 470

Query: 923  QNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQK 744
            ++SWYD+PPEGFN+TLSPF TM+ +LF W++SSSLA+IYG DES +EEYLS+NGREYP+K
Sbjct: 471  EDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRK 530

Query: 743  IVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFR 564
            IVL+DGRS+EIKQ LAGCLARALPGLVA+L LP P+S LE+GM  LL TMSF D LP+FR
Sbjct: 531  IVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFR 590

Query: 563  TRQWQVIALLFMDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGR 384
             +QWQ+I LLF+DALS+CRIP LTP+MT RRTS  KVL+GAQ+   EYE+MKD IIPLGR
Sbjct: 591  MKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLGR 650

Query: 383  APQFSAQRGG 354
             PQFS Q GG
Sbjct: 651  VPQFSMQSGG 660


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  605 bits (1559), Expect = e-170
 Identities = 342/671 (50%), Positives = 439/671 (65%), Gaps = 25/671 (3%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M   + + VKDAV+KLQ  LL+GI+DENQL AAGSL+SRSDY+DVV ERS+A +CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            +N+LPSER RKG YR+SLKEHKVYDL ETYMYC ++CVVNS AFA SLQ+ER S  +PAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAG-EVPLEEWIGPSNAIEGYV 1755
            ++++L LF  L L S            S+LKIQEK   K G EV LEEW+GPSNAIEGYV
Sbjct: 121  LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180

Query: 1754 P-KDRNSKPLLLKK-RKG--------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXX 1605
            P +DR+  P LLK   KG           KN++ N+ DF+S I   DEYS++        
Sbjct: 181  PQRDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNA 240

Query: 1604 XXXXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIP-----E 1440
                               +   +IL  +  ++Q +   + ++S      L +      E
Sbjct: 241  VSSEKFKEAQAKTRY-KVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGE 299

Query: 1439 VPSIPCQNGSD-----IIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADK 1275
            V S P Q+        I++ +G+K        +    +LKSS+KSS +KK+++SVTWAD+
Sbjct: 300  VSSGPSQHDVKNKSVLIMSDDGRK---YASHGEHDKQLLKSSLKSSNSKKMSQSVTWADE 356

Query: 1274 KADS---MNNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAV 1104
              D        +   I E E+      +   ME  +DD++ RF               AV
Sbjct: 357  IIDGGIGKKTESSSKISEYENQAYGGSASTDME--EDDDSYRFESAEACAAALSQAAEAV 414

Query: 1103 GSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFE 927
             SG S V DAV++AGIVI+P   + DE  ++ ++ +ML + PAP+KWP KP + +YD FE
Sbjct: 415  ASG-SDVPDAVSKAGIVILPTSQEVDE--AILQETEMLDIEPAPLKWPRKPGMPNYDVFE 471

Query: 926  SQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQ 747
            S++ WYD PPEGFN+TLSPFATM+ +LF W++SSSLA+IYG DE+ +EEYLS+NGREYP 
Sbjct: 472  SEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPH 531

Query: 746  KIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSF 567
            KIVL+DG S+EIKQ LAGCLARALPGLVA+L LP P+S LE+GM  LL TMSF D LP+F
Sbjct: 532  KIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAF 591

Query: 566  RTRQWQVIALLFMDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLG 387
            R +QWQ+I LLF+DALS+CRIP LTP+MT RRTSL KVL+GAQ+   EYE+MKD IIPLG
Sbjct: 592  RMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLG 651

Query: 386  RAPQFSAQRGG 354
            R PQFS Q GG
Sbjct: 652  RVPQFSMQSGG 662


>gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus guttatus]
          Length = 597

 Score =  590 bits (1521), Expect = e-165
 Identities = 326/652 (50%), Positives = 433/652 (66%), Gaps = 14/652 (2%)
 Frame = -1

Query: 2267 VKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPSER 2088
            VKDAV+KLQ SLL+GI+ E+QL AAGSL+S+SDY+DVV ER++A VCGYPLC N+LPSE 
Sbjct: 9    VKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLCVNSLPSEP 68

Query: 2087 PRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRILKLF 1908
            PRKG YR+SLKEHKVYDL ET+MYC + C++ SRAF ASL+EER S  DPAKI+ +LK+F
Sbjct: 69   PRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAKINSVLKMF 128

Query: 1907 GDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVPK-DRNSK- 1734
              LSL+S           LS LKI+EK  + +GE+ LEEW+GPSNAI+GYVP+ D+NS+ 
Sbjct: 129  DGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRDQNSER 188

Query: 1733 --PLLLKKRKGDTGKNLVFN---DMDFTSEIFIGDEYSIAXXXXXXXXXXXXXXXXXXXX 1569
              P   K        NL      D++FTS I + DEYS++                    
Sbjct: 189  KQPSRKKTESNHAKPNLADTLPFDVNFTSTIIMQDEYSVSK------------------- 229

Query: 1568 XXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS----ELSIPEVPSIPCQNGSDII 1401
                              T+V  + +GK+K     KS    ++S+ +  + P QN +   
Sbjct: 230  ------------------TAVPREAKGKVKGKMIRKSVKAEKISVLDDTAGPSQNDT--- 268

Query: 1400 ATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIRELED 1221
                              ++LKSS+K+  +KK TRSVTWAD+K+D  +  ++   RE+ D
Sbjct: 269  ------------------TLLKSSLKTLDSKKETRSVTWADEKSDG-DGKSISECREIGD 309

Query: 1220 TKED--SESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIV 1047
             K       L   ++GD+  + RF               AV SG++  +DAV+EAG++I+
Sbjct: 310  NKGAVVMPHLTDEDVGDE--SYRFTSAEACARALSQASEAVASGKTDASDAVSEAGVIIL 367

Query: 1046 PRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSP 870
            P PH+ DE    E+  +++ + P  +KWP KP  +  D F+S++SWYD+PPEGFNLTLSP
Sbjct: 368  PPPHEVDEA-KYEQIGEVVDVDPIELKWPPKPGFSSEDLFDSEDSWYDSPPEGFNLTLSP 426

Query: 869  FATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGC 690
            F+TM+M+LFAW++SSSLAYIYG++E FHE+YLS+NGREYP KI++ DGRS+E+K  LAGC
Sbjct: 427  FSTMFMSLFAWISSSSLAYIYGKEERFHEDYLSINGREYPPKIII-DGRSAEVKHTLAGC 485

Query: 689  LARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSIC 510
            LARALPGLV+E+ +PTP+S +E+GMG LL+TMSF DALP FR +QWQVIALLF+DALS+ 
Sbjct: 486  LARALPGLVSEIRIPTPVSTIEQGMGRLLDTMSFTDALPGFRMKQWQVIALLFLDALSVS 545

Query: 509  RIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRGG 354
            RIP L+P+MT RR  L KVLEGAQ+ V+E+E+MKD IIPLGR PQFS Q GG
Sbjct: 546  RIPALSPYMTGRRILLPKVLEGAQINVEEFEIMKDLIIPLGRVPQFSTQSGG 597


>ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
            gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform
            3, partial [Theobroma cacao]
          Length = 703

 Score =  580 bits (1496), Expect = e-163
 Identities = 330/666 (49%), Positives = 419/666 (62%), Gaps = 44/666 (6%)
 Frame = -1

Query: 2294 TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 2115
            +M  +  I V +AV+K+Q  LLDGIRDE QL A+GSL+SRSDYEDVV ER+++  CGYPL
Sbjct: 54   SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113

Query: 2114 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 1935
            C N LPSE  RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + A
Sbjct: 114  CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173

Query: 1934 KIDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 1755
            K++ IL LFGDL L+             S L+I+E  + KA +V L    GPSNAIEGYV
Sbjct: 174  KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1754 P-KDRNSKPLLLKKRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 1614
            P ++  SKP   K  K    D+  +          V N++DF   I + DEY I+     
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289

Query: 1613 XXXXXXXXXXXXXXXXXIPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 1476
                             I   +         ++TI +M + S Q+  +  LKE       
Sbjct: 290  FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349

Query: 1475 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1341
                         S+  + + SI E+PS     Q+G D  + E +K+   +K     +++
Sbjct: 350  KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409

Query: 1340 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1164
            LKSS+KS+GAKKL R VTWAD KKAD+  N NLC ++E+E  K DSE  GS E G DDN 
Sbjct: 410  LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469

Query: 1163 LRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 984
            LRF               AV SG S VTDAV E     V +    ++GD LE +      
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVCE-----VDKEEPMEDGDMLEPET----- 519

Query: 983  GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 807
              AP+KWP KP +   D F  ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY
Sbjct: 520  --APVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 577

Query: 806  GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 627
            GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L
Sbjct: 578  GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 637

Query: 626  ERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSICRIPGLTPHMTSRRTSLHKVLE 447
            E+GMGHL++T+SF +ALP+FR +QWQVI LLF+DALS+CRIP LTPHMT+ R  LHKVL+
Sbjct: 638  EQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHKVLD 697

Query: 446  GAQVGV 429
            GAQ+ +
Sbjct: 698  GAQISM 703


>ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
            gi|462404075|gb|EMJ09632.1| hypothetical protein
            PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  578 bits (1490), Expect = e-162
 Identities = 337/716 (47%), Positives = 435/716 (60%), Gaps = 77/716 (10%)
 Frame = -1

Query: 2273 IPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPS 2094
            I VKD VYKLQ +LL+GI+ ++ L+ AGS++SRSDY DVV ER++A +CGYPLC+N LPS
Sbjct: 13   ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72

Query: 2093 E--RPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRI 1920
            +  RP KG YR+SLKEHKVYDL ETYMYC S CV+ S+AFA SL EERC V D  K++RI
Sbjct: 73   DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132

Query: 1919 LKLFGDLSLES-QXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEW--------------- 1788
            L+ FGD+  +  +          +S+LKI+EK ++  G++ +                  
Sbjct: 133  LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192

Query: 1787 IGPSNAIEGYVP-KDRNSKPLLLKKRK-GDTGKN--------LVFNDMDFTSEIFIGDEY 1638
            +GPSNAIEGYVP K+R SKPL  KK K G  GK+        ++FN+MDF S I   DEY
Sbjct: 193  VGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTIITSDEY 252

Query: 1637 SIAXXXXXXXXXXXXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS 1458
            S++                        N N+           S Q+KG    K  +  K 
Sbjct: 253  SVSKIPPSVGEPDFETKFKKSKGKVGLNKNDSV-------KKSRQSKGG---KNKNVKKD 302

Query: 1457 ELSIPEVPSIP-----CQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRS 1293
            ++ I EVPS         NGS     E K++   EK  Q G+++L+SS+K SG KKL RS
Sbjct: 303  DVCIREVPSTSDASQTVLNGS---TKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNRS 359

Query: 1292 VTWADKKADSMNNTNLCAIRELEDTKE--------------------------------- 1212
            VTWAD+  DS  + NL  +RE+E   E                                 
Sbjct: 360  VTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTKS 419

Query: 1211 ----------DSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEA 1062
                      D++ LGS+++ +++                    AV SG+S V+ AV+ A
Sbjct: 420  KNICEVREVQDADVLGSLDLQENEI---LESAEACAMALNQAAEAVASGESDVSGAVSGA 476

Query: 1061 GIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFN 885
            GI+I+PRP   DE +  E DVDML    AP+ WP KP +   D F+ ++SW+D PPEGF+
Sbjct: 477  GIIILPRPDGLDEEEPTE-DVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFS 534

Query: 884  LTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQ 705
            +TLSPFATMW +LF W+TSS+LAYIYGRDESFHEE+LSVNGREYP KIVL  GRSSEIK+
Sbjct: 535  VTLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKK 594

Query: 704  ALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMD 525
             L    ARALPG+V+EL LPTP+S LE+GMG +L TMSF DA+P+FR +QWQVI LLF++
Sbjct: 595  TLDESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLE 654

Query: 524  ALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 357
             LS+CRIP LTPHMT+RR   +KVLE  Q+  ++YE+MKD IIPLGRAPQFSAQ G
Sbjct: 655  GLSVCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  572 bits (1474), Expect = e-160
 Identities = 324/670 (48%), Positives = 419/670 (62%), Gaps = 29/670 (4%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  +  + +KD VYKLQ +L +GI++ENQLFAAGSLMSRSDYEDVV ERS+A +CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            ++ LPS+  R+GRYR+SLKEHKVYDL+ETY YC S+C++NSRAF+  LQ+ERCSV +P K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            +  ILKLF ++SL+S+           S L+IQEK +S  GEVP+EEW+GPSNAIEGYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 1751 KDRNSKPLLLKKRKGDTGKNL-------------VFNDMDFTSEIFIGDEYSIAXXXXXX 1611
              R+ K + L  + G   K+               F+D   TS I   +EYS++      
Sbjct: 178  H-RDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGL 236

Query: 1610 XXXXXXXXXXXXXXXXIPNDNEHQFTILEMQ------ATSVQNKGEG---KLKESSCGKS 1458
                               ++  QF ILE          SV  K  G   + K S+  +S
Sbjct: 237  KEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKES 296

Query: 1457 ELSIPEVPSIPCQNGSDI-IATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWA 1281
              ++ + PS      ++  + TE   +PR      +  + LKSS+K  G K L RSVTWA
Sbjct: 297  TDNLSDAPSTSKNRSTNFNLMTE---EPRGGFN-DLSGTELKSSLKKPGKKNLCRSVTWA 352

Query: 1280 DKKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA--LRFXXXXXXXXXXXXXXXA 1107
            D+K D  +  NL  + E+  TKE S +  ++   D+DN   LR                A
Sbjct: 353  DEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEA 412

Query: 1106 VGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP----VTDY 939
            + SGQS V+DAV+EAGI+I+P P D +E    E   D +     P  +  K     V   
Sbjct: 413  ITSGQSEVSDAVSEAGIIILPHPSDANE----EASTDPVNASE-PHSFSEKSNKLGVLRS 467

Query: 938  DFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGR 759
            D F+  +SWYD PPEGF+LTLS FATMWMA+FAWVTSSSLAYIYG+D+ FHEE+L ++G+
Sbjct: 468  DLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGK 527

Query: 758  EYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDA 579
            EYP KIV  DGRSSEIKQ LAGCL RA+PGL +EL L TP+S LE GM HLL+TM+F DA
Sbjct: 528  EYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDA 587

Query: 578  LPSFRTRQWQVIALLFMDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFI 399
            LP+FR +QWQVI LLF++ALS+ RIP L  HM+S R   HKVL+ AQ+  DEYE+M+D I
Sbjct: 588  LPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHI 647

Query: 398  IPLGRAPQFS 369
            +PLGR  Q S
Sbjct: 648  LPLGRTAQLS 657


>ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Fragaria vesca subsp. vesca]
          Length = 692

 Score =  551 bits (1419), Expect = e-154
 Identities = 327/717 (45%), Positives = 423/717 (58%), Gaps = 80/717 (11%)
 Frame = -1

Query: 2267 VKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPSE- 2091
            V DAVYKLQ +LLD ++  ++L+ AGS++SRSDY DVV ERS+A +CGYPLC+N LP E 
Sbjct: 13   VNDAVYKLQLALLDSVKTLDRLYLAGSIISRSDYTDVVTERSIADLCGYPLCSNALPPEA 72

Query: 2090 -RPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRILK 1914
             R RKG YR+SLKEHKVYDL+ET +YC S CV++S+AFA  L EERC V D  K++R+L+
Sbjct: 73   SRTRKGHYRISLKEHKVYDLRETKLYCSSKCVIDSKAFAQGLSEERCDVLDLGKVERVLR 132

Query: 1913 LFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVPK-DRNS 1737
             FG+   E            LS LKI+EKS + +G+V   E  GPSNAIEGYVP+ DR S
Sbjct: 133  EFGEEKKE-------IGDLGLSSLKIEEKSGTYSGKV---EEFGPSNAIEGYVPRRDRVS 182

Query: 1736 KPLLLKKRKGDT----------GKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXXXXXXX 1587
            K    KK K  +          GK L+ NDMDF S +   DEYS++              
Sbjct: 183  KASGAKKNKQGSKGKDAKPSGGGKQLILNDMDFMSTLLACDEYSVSKMPPNVADNNVDTE 242

Query: 1586 XXXXXXXXIPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQNGSD 1407
                       D E  F++LE  AT   NK EG +     G S L I             
Sbjct: 243  LKKSKG----KDLESGFSVLETSATP--NKSEGVMDVGDLGMSRLKI------------- 283

Query: 1406 IIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIREL 1227
                E +++ +  K  +  +  L+SS+K SG KKL+RSVTWAD+K+DS    NLC +R++
Sbjct: 284  ----EAEEESQVGKGEKSSEGTLRSSLKHSGTKKLSRSVTWADEKSDSTGRRNLCEVRDM 339

Query: 1226 E-----------------------------------------------DTKEDSESLGSM 1188
            E                                               D KE  E +GS 
Sbjct: 340  EDGLENPGAFDSLYKPSSSSEAGSSFSWVDKTIDSTKCENICEVSGTHDAKEVPEVVGSS 399

Query: 1187 EIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPH--DE----- 1029
             +  ++    F               AV +G+   +DAV++AGI+I+PR    DE     
Sbjct: 400  VVQGNE---WFESAEACAVALSEAAGAVETGEFDTSDAVSKAGIIILPRTDGVDEEEFIV 456

Query: 1028 ---DEGDSLE---------EDVDMLGLGPAPIKWPTKPVTD-YDFFESQNSWYDTPPEGF 888
               DE DS+E         ED+DML    A  KWP KP +  +D F  ++SW+D PP+GF
Sbjct: 457  DGADEEDSIEDSVDEEESTEDIDMLEPEQALSKWPKKPESSQFDLFNPEDSWFDAPPDGF 516

Query: 887  NLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIK 708
            NLTLSPFATMW ALF W TSS+LAYIYG+D+SFHEE+L+VNGR YP KIVL DGRSSEIK
Sbjct: 517  NLTLSPFATMWNALFTWTTSSTLAYIYGKDDSFHEEFLNVNGRSYPHKIVLADGRSSEIK 576

Query: 707  QALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFM 528
              +   L+RALP +VAELGL  P   LE+GMG +L TMSF +ALP+FR +QWQVIALLF+
Sbjct: 577  LTVGASLSRALPEIVAELGLAVP--NLEKGMGFMLNTMSFIEALPAFRMKQWQVIALLFI 634

Query: 527  DALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 357
            + LS+CR+P LTPHMT+RR  + +VL+GA++ V+EYE+MKDF+IPLGRAPQF++Q G
Sbjct: 635  EGLSVCRMPALTPHMTNRRVLIQRVLDGARISVEEYEIMKDFLIPLGRAPQFASQSG 691


>ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 632

 Score =  541 bits (1395), Expect = e-151
 Identities = 309/661 (46%), Positives = 403/661 (60%), Gaps = 20/661 (3%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  +  + +KD VYKLQ +L +GI++ENQLFAAGSLMSRSDYEDVV ERS+A +CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            ++ LPS+  R+GRYR+SLKEHKVYDL+ETY YC S+C++NSRAF+  LQ+ERCSV +P K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            +  ILKLF ++SL+S+           S L+IQEK +S  GEVP+EEW+GPSNAIEGYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 1751 KDRNSKPLLLKKRKGDTGKNL-------------VFNDMDFTSEIFIGDEYSIAXXXXXX 1611
              R+ K + L  + G   K+               F+D  FTS I   +EYS++      
Sbjct: 178  H-RDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGL 236

Query: 1610 XXXXXXXXXXXXXXXXIPNDNEHQFTILEM-QATSVQNKGEGKLKESSCGKSELSIPEVP 1434
                                +  QF ILE   A +      G+    S  ++++S     
Sbjct: 237  KEMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVS----- 291

Query: 1433 SIPCQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNN 1254
                       AT+   D        + D+   S+ +S+    +T      D+K D  + 
Sbjct: 292  -----------ATKESTD-------NLSDAPSTSNNRSTNFNLMTEEPR--DEKTDDASI 331

Query: 1253 TNLCAIRELEDTKEDSESLGSMEIGDDDNA--LRFXXXXXXXXXXXXXXXAVGSGQSGVT 1080
             NL  + E+  TKE S +  ++   D+DN   LR                A+ SGQS V+
Sbjct: 332  MNLPEVGEMGKTKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSEVS 391

Query: 1079 DAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP----VTDYDFFESQNSW 912
            DAV+EAGI+I+P P D +E    E   D +     P  +  K     V   D F+  +SW
Sbjct: 392  DAVSEAGIIILPHPSDANE----EASTDPVNASE-PHSFSEKSNKLGVLRSDLFDPSDSW 446

Query: 911  YDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLT 732
            YD PPEGF+LTLS FATMWMA+FAWVTSSSLAYIYG+D+ FHEE+L ++G+EYP KIV  
Sbjct: 447  YDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSA 506

Query: 731  DGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQW 552
            DGRSSEIKQ LAGCL RA+PGL +EL L TP+S LE GM HLL+TM+F DALP+FR +QW
Sbjct: 507  DGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQW 566

Query: 551  QVIALLFMDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQF 372
            QVI LLF++ALS+ RIP L  HM+S R   HKVL+ AQ+  DEYE+M+D I+PLGR  Q 
Sbjct: 567  QVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQL 626

Query: 371  S 369
            S
Sbjct: 627  S 627


>ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
            gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform
            5 [Theobroma cacao]
          Length = 708

 Score =  535 bits (1377), Expect = e-149
 Identities = 305/627 (48%), Positives = 393/627 (62%), Gaps = 44/627 (7%)
 Frame = -1

Query: 2294 TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 2115
            +M  +  I V +AV+K+Q  LLDGIRDE QL A+GSL+SRSDYEDVV ER+++  CGYPL
Sbjct: 54   SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113

Query: 2114 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 1935
            C N LPSE  RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + A
Sbjct: 114  CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173

Query: 1934 KIDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 1755
            K++ IL LFGDL L+             S L+I+E  + KA +V L    GPSNAIEGYV
Sbjct: 174  KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1754 P-KDRNSKPLLLKKRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 1614
            P ++  SKP   K  K    D+  +          V N++DF   I + DEY I+     
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289

Query: 1613 XXXXXXXXXXXXXXXXXIPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 1476
                             I   +         ++TI +M + S Q+  +  LKE       
Sbjct: 290  FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349

Query: 1475 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1341
                         S+  + + SI E+PS     Q+G D  + E +K+   +K     +++
Sbjct: 350  KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409

Query: 1340 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1164
            LKSS+KS+GAKKL R VTWAD KKAD+  N NLC ++E+E  K DSE  GS E G DDN 
Sbjct: 410  LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469

Query: 1163 LRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 984
            LRF               AV SG S VTDAV E G++I+P   + D+ + + ED DML  
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEP 528

Query: 983  GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 807
              AP+KWP KP +   D F  ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY
Sbjct: 529  ETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 588

Query: 806  GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 627
            GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L
Sbjct: 589  GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 648

Query: 626  ERGMGHLLETMSFFDALPSFRTRQWQV 546
            E+GMGHL++T+SF +ALP+FR +QW++
Sbjct: 649  EQGMGHLIDTISFMEALPAFRMKQWEI 675


>ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
            gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform
            2 [Theobroma cacao]
          Length = 679

 Score =  533 bits (1372), Expect = e-148
 Identities = 305/625 (48%), Positives = 391/625 (62%), Gaps = 44/625 (7%)
 Frame = -1

Query: 2294 TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 2115
            +M  +  I V +AV+K+Q  LLDGIRDE QL A+GSL+SRSDYEDVV ER+++  CGYPL
Sbjct: 54   SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113

Query: 2114 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 1935
            C N LPSE  RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + A
Sbjct: 114  CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173

Query: 1934 KIDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 1755
            K++ IL LFGDL L+             S L+I+E  + KA +V L    GPSNAIEGYV
Sbjct: 174  KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1754 P-KDRNSKPLLLKKRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 1614
            P ++  SKP   K  K    D+  +          V N++DF   I + DEY I+     
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289

Query: 1613 XXXXXXXXXXXXXXXXXIPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 1476
                             I   +         ++TI +M + S Q+  +  LKE       
Sbjct: 290  FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349

Query: 1475 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1341
                         S+  + + SI E+PS     Q+G D  + E +K+   +K     +++
Sbjct: 350  KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409

Query: 1340 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1164
            LKSS+KS+GAKKL R VTWAD KKAD+  N NLC ++E+E  K DSE  GS E G DDN 
Sbjct: 410  LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469

Query: 1163 LRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 984
            LRF               AV SG S VTDAV E G++I+P   + D+ + + ED DML  
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEP 528

Query: 983  GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 807
              AP+KWP KP +   D F  ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY
Sbjct: 529  ETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 588

Query: 806  GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 627
            GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L
Sbjct: 589  GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 648

Query: 626  ERGMGHLLETMSFFDALPSFRTRQW 552
            E+GMGHL++T+SF +ALP+FR +QW
Sbjct: 649  EQGMGHLIDTISFMEALPAFRMKQW 673


>ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao]
            gi|508787292|gb|EOY34548.1| F2P16.20 protein, putative
            isoform 4 [Theobroma cacao]
          Length = 607

 Score =  500 bits (1287), Expect = e-138
 Identities = 292/603 (48%), Positives = 371/603 (61%), Gaps = 44/603 (7%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  +  I V +AV+K+Q  LLDGIRDE QL A+GSL+SRSDYEDVV ER+++  CGYPLC
Sbjct: 1    MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
             N LPSE  RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + AK
Sbjct: 61   ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            ++ IL LFGDL L+             S L+I+E  + KA +V L    GPSNAIEGYVP
Sbjct: 121  LNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 176

Query: 1751 -KDRNSKPLLLKKRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXXX 1611
             ++  SKP   K  K    D+  +          V N++DF   I + DEY I+      
Sbjct: 177  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 236

Query: 1610 XXXXXXXXXXXXXXXXIPNDN-------EHQFTILEMQATSVQNKGEGKLKE-------- 1476
                            I   +         ++TI +M + S Q+  +  LKE        
Sbjct: 237  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 296

Query: 1475 ------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSML 1338
                        S+  + + SI E+PS     Q+G D  + E +K+   +K     +++L
Sbjct: 297  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVL 356

Query: 1337 KSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNAL 1161
            KSS+KS+GAKKL R VTWAD KKAD+  N NLC ++E+E  K DSE  GS E G DDN L
Sbjct: 357  KSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNML 416

Query: 1160 RFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLG 981
            RF               AV SG S VTDAV E G++I+P   + D+ + + ED DML   
Sbjct: 417  RFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEPE 475

Query: 980  PAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYG 804
             AP+KWP KP +   D F  ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIYG
Sbjct: 476  TAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYG 535

Query: 803  RDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILE 624
            RDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S LE
Sbjct: 536  RDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTLE 595

Query: 623  RGM 615
            +GM
Sbjct: 596  QGM 598


>gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea]
          Length = 597

 Score =  499 bits (1284), Expect = e-138
 Identities = 291/648 (44%), Positives = 396/648 (61%), Gaps = 13/648 (2%)
 Frame = -1

Query: 2291 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 2112
            M  D+ + +K+AVY+LQ SLL+G ++ENQL AAGSLMSR DY+D+V ER +AK+CGYPLC
Sbjct: 1    MAKDEILTMKEAVYRLQTSLLEGAKNENQLSAAGSLMSRGDYQDLVTERVIAKICGYPLC 60

Query: 2111 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 1932
            +N L SERP KGRYR+SLKEHKVYD+QETY +C S C++NSRAF+  L +ER S  DP K
Sbjct: 61   SNNLNSERPSKGRYRISLKEHKVYDVQETYSFCSSGCLINSRAFSIGLPDERTSDLDPIK 120

Query: 1931 IDRILKLFGDLSLESQXXXXXXXXXXLSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 1752
            ++ +LK F      S           LS+L+I EK   +AGEV   EWIGPS+AI+GYVP
Sbjct: 121  LNEVLKRFDGFGANSTPNMGRNEDLGLSQLRIMEKENIEAGEVSSNEWIGPSDAIDGYVP 180

Query: 1751 K-DRNSKPLLLKKRKGDTGKNLVF--------NDMDFTSEIFIGDEYSIAXXXXXXXXXX 1599
            + DRNS  L  K++KG++  +L          +DM FTS I   +EYSIA          
Sbjct: 181  RRDRNSNTLSSKQKKGESRYHLSLQVLTSIFPSDMSFTSVIIDQNEYSIAKTTTPSSSKQ 240

Query: 1598 XXXXXXXXXXXXIPNDNEHQFTILEMQATSVQNKG-EGKLKESSCGKSELSIPEVPSIPC 1422
                         P ++       +    +++  G     K +   K +  +        
Sbjct: 241  SGESNEKVI----PEEDVRPKQSPDSSVANIKGSGFRNPSKRNGRAKIDAKLSASEDKAS 296

Query: 1421 QNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLT-RSVTWADKKADSMNNTNL 1245
            +NG +    +G      +K AQ G ++LKSS+K+S +K+ T R+V+WAD KA+  +  NL
Sbjct: 297  ENGGEPKLADG------DKSAQ-GAAVLKSSLKTSYSKETTTRTVSWADVKAE--DGQNL 347

Query: 1244 CAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAE 1065
              + E+ D      S  +                            V S ++  T A  +
Sbjct: 348  ETVCEMNDPHGGGISRETSS--------------------------VESHKTASTKASKD 381

Query: 1064 A-GIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEG 891
            A G  ++    D +EG+   E +         +KWP KP  ++ D  ES ++ YD PP+G
Sbjct: 382  APGKFLLT---DFNEGEIFTEAI---------LKWPPKPGFSEADLVESDDTLYDRPPDG 429

Query: 890  FNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEI 711
            FNL+LSPF T++ +LF+W++SSSLAYIYG+D+SFHEEY++ NGREYP K+V  DGRSSEI
Sbjct: 430  FNLSLSPFCTLFNSLFSWISSSSLAYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEI 489

Query: 710  KQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLF 531
            KQ L+  LARALPG+V+EL LPTP+SILE+GMG LL+TMSF D LPS RT+QWQ I LLF
Sbjct: 490  KQTLSAALARALPGVVSELRLPTPISILEQGMGRLLDTMSFIDPLPSLRTKQWQAIVLLF 549

Query: 530  MDALSICRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLG 387
            ++ALS+ RIP L+ ++  RR S+ KVLEGA +GV+E+EVMKD IIPLG
Sbjct: 550  LNALSVSRIPALSKYLEDRRASIQKVLEGAGIGVEEFEVMKDLIIPLG 597


>ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [Amborella trichopoda]
            gi|548843599|gb|ERN03253.1| hypothetical protein
            AMTR_s00003p00194360 [Amborella trichopoda]
          Length = 591

 Score =  452 bits (1164), Expect = e-124
 Identities = 264/635 (41%), Positives = 369/635 (58%), Gaps = 10/635 (1%)
 Frame = -1

Query: 2267 VKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPSER 2088
            +KDA+YK+Q  LLDGI  ENQL AA +L+SRSDY+DVV ER++  +CGYPLCN  LP +R
Sbjct: 9    LKDAIYKIQTYLLDGISKENQLLAAANLISRSDYDDVVTERTITNLCGYPLCNKYLPCDR 68

Query: 2087 PRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRILKLF 1908
            P+KGRYR+SLKEH VYDL+ET++YC   CV+NS+AF+  L+ ERC   DP KI  IL LF
Sbjct: 69   PKKGRYRISLKEHSVYDLKETWLYCSPECVINSQAFSKLLKPERCEFSDPGKIAEILNLF 128

Query: 1907 GDLSLESQXXXXXXXXXXLS----ELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVPKDRN 1740
               S+E            +S     L I EK     G++   +++GP NAIEGYVP+   
Sbjct: 129  SSPSIEESNAGGAEKNEKISLAFSSLTIHEKEDVSVGDIQSMDFVGPYNAIEGYVPRQDQ 188

Query: 1739 SKPLLLKKRKGD-TGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXXXXXXXXXXXXXXX 1563
              P+   +RKG  +GK+    D  +    F                              
Sbjct: 189  VPPV---QRKGSKSGKSTTKKDPIYPETNFAS---------------------------- 217

Query: 1562 IPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQNGSDIIATEGKK 1383
                     TI+  + +S      G L+++S  K       V         ++  ++ ++
Sbjct: 218  ---------TIIIGEPSS------GNLQKNSSSKFVNDHVHV---------NVEGSKREQ 253

Query: 1382 DPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKK---ADSMNNTNLCAIRELEDTKE 1212
              + + ++   ++ L+S++K+ GAK  TR+V+WAD++    + + N  L   + +E   +
Sbjct: 254  HAQEKSQSHPKETKLRSALKNLGAKASTRTVSWADEQQTIVEGIQNMTLNNCQGIESGSK 313

Query: 1211 DSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXAVGSGQSGVTDAVAEAGIVIVPRPHD 1032
              ES  S+ + D   + R                AV SGQS   DA +EAGI+I P P+ 
Sbjct: 314  CKESSDSLSVEDTMISSRRASAEACASALTEAAAAVASGQSNTLDAASEAGILIFPCPNS 373

Query: 1031 EDEGDSLEEDVDMLGLGPAPIKWPTKPVTDYD--FFESQNSWYDTPPEGFNLTLSPFATM 858
             +E +++++  D L       KW  +P   +   F   ++SWYD PPEGF+LTLS FATM
Sbjct: 374  VEE-ENIQKVADELKPEEGE-KWVKRPSLLHTGAFDTEEDSWYDAPPEGFSLTLSSFATM 431

Query: 857  WMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARA 678
            WMALF WVT+SS+AYIYGR ES  EE++ V+GREYP K VL DG SSEIK+ L+GCLARA
Sbjct: 432  WMALFGWVTASSMAYIYGRAESAEEEFVVVDGREYPHKFVLGDGLSSEIKETLSGCLARA 491

Query: 677  LPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSICRIPG 498
            LPG+VA + LPTP+S LE  +G LL+TM+F +ALP FR +QW VI LLF+DALS+  +P 
Sbjct: 492  LPGVVANIKLPTPISTLEVALGRLLDTMTFTEALPPFRMKQWHVIVLLFLDALSVHIVPA 551

Query: 497  LTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIP 393
            L  H+ SRRT +HK+LE AQV  +EY +M+D  +P
Sbjct: 552  LEQHIASRRTLVHKMLEDAQVSNEEYNIMRDLFLP 586


Top