BLASTX nr result

ID: Rehmannia22_contig00009000 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00009000
         (1141 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   384   e-104
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   381   e-103
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   349   1e-93
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   340   8e-91
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   338   2e-90
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   337   5e-90
ref|XP_006480289.1| PREDICTED: putative RNA polymerase II subuni...   331   3e-88
ref|XP_006428243.1| hypothetical protein CICLE_v10011677mg [Citr...   331   3e-88
gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus...   330   6e-88
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...   328   2e-87
gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma c...   327   5e-87
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   323   1e-85
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   320   5e-85
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...   320   7e-85
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     320   9e-85
gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus pe...   319   1e-84
gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobro...   316   1e-83
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   315   3e-83
gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]      301   4e-79
gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]      301   4e-79

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  384 bits (986), Expect = e-104
 Identities = 223/404 (55%), Positives = 279/404 (69%), Gaps = 25/404 (6%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISK-------TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTN 980
            +M+F STIIT+DEYSISK       T    K+KEPK KAS   +  Q + ++K   P+ N
Sbjct: 215  EMDFVSTIITKDEYSISKSSKGLKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQN 271

Query: 979  IQETR---SKNKSKNVITKDDKLSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXX 821
              E++   SK +   VI KD+  S  E  + PSQ+ S     K  +E     A       
Sbjct: 272  DSESKLRESKGRRSRVIFKDE-FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTK 330

Query: 820  XXXXXXKA-----TRSVTWADEKTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEE-- 665
                   +      RSVTWADEK D  D ++  + REL+ KK     +   D +VG++  
Sbjct: 331  PKSSLKPSGGKKVIRSVTWADEKMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDN 388

Query: 664  SYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMET 494
            + RFASAEACA+AL+QAAE VASG+++ +DAVSEAG+IILP P   DE E+    D++E 
Sbjct: 389  ALRFASAEACAVALSQAAEAVASGETDMTDAVSEAGIIILPHPRDMDEGESLKDADLLEP 448

Query: 493  DPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIY 314
            +P+ LKWP KPG           SWYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIY
Sbjct: 449  EPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIY 508

Query: 313  GKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTL 134
            G++ESFHEEYLSVNGREYP+KIV+ DGRSSEIKQTLAGCL+RALPGLVA+LRLPIPVS L
Sbjct: 509  GRDESFHEEYLSVNGREYPKKIVLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPVSNL 568

Query: 133  EQGMGRLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            EQG+GRLLDTMSF+D LP+FRMKQW  IVLLF+DALSV RIPAL
Sbjct: 569  EQGVGRLLDTMSFVDALPSFRMKQWQVIVLLFIDALSVCRIPAL 612


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  381 bits (979), Expect = e-103
 Identities = 222/404 (54%), Positives = 278/404 (68%), Gaps = 25/404 (6%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISK-------TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTN 980
            +M+F  TIIT+DEYSISK       T    K+KEPK KAS   +  Q + ++K   P+ N
Sbjct: 215  EMDFVRTIITEDEYSISKSSKGLKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQN 271

Query: 979  IQETR---SKNKSKNVITKDDKLSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXX 821
              E++   SK +   VI KD+  S  E  + PSQ+ S     K  +E     A       
Sbjct: 272  DSESKLRESKGRRSRVIFKDE-FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTK 330

Query: 820  XXXXXXKA-----TRSVTWADEKTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEE-- 665
                   +     TRSVTWADEK D  D ++  + REL+ KK     +   D +VG++  
Sbjct: 331  LKSCLKPSGGKKVTRSVTWADEKMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDN 388

Query: 664  SYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMET 494
            + RFASAEACA+AL+QAAE VASG+++ +DAVSEA +IILP P   DE E+    D++E 
Sbjct: 389  ALRFASAEACAIALSQAAEAVASGETDMTDAVSEARIIILPHPRDMDEGESLKDADLLEP 448

Query: 493  DPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIY 314
            +P+ LKWP KPG           SWYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIY
Sbjct: 449  EPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIY 508

Query: 313  GKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTL 134
            G++ESFHEEYLSVNGREYP+KIV+ DGRSSEIKQTLAGCLARALPGLVA+LRLPIPVS L
Sbjct: 509  GRDESFHEEYLSVNGREYPKKIVLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPVSNL 568

Query: 133  EQGMGRLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            EQG+GRLLDTMSF+D LP+FRMKQW  IVLLF+DALSV +IPAL
Sbjct: 569  EQGVGRLLDTMSFVDALPSFRMKQWQVIVLLFIDALSVCQIPAL 612


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  349 bits (896), Expect = 1e-93
 Identities = 210/407 (51%), Positives = 262/407 (64%), Gaps = 28/407 (6%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA-------VKAKEPKGKASSKE-------VNRQSNPVQK 1001
            + +F+STIITQDEYS+SK  PA       VK KE + K   K        + +Q + +Q 
Sbjct: 215  EFDFSSTIITQDEYSVSK-FPAPVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDALQ- 272

Query: 1000 PTAPLTNIQETRSKNKSKNVITKDDKLSLLENIAGPSQND---------STKAVKELQES 848
                L + +ET   +K+   + K DK +  E  +GPSQ+D         S    K     
Sbjct: 273  ----LRSGEETEKSDKNTRFL-KVDKFNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHG 327

Query: 847  TAGAXXXXXXXXXXXKATRSVTWADEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEV 674
                           K +RSVTWADE  DG  G+      ++ + +  A   S S D E 
Sbjct: 328  EHDKLKSSLKSSNSKKMSRSVTWADESIDGGIGKKTESSSKISEYESQAYGGSASTDMEE 387

Query: 673  GEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDE---EENGDV 503
             ++SYRF SAEACA AL+QAAE VASG S+  DAVS+AG++ILPP    DE   +E  ++
Sbjct: 388  NDDSYRFESAEACAAALSQAAEAVASG-SDVPDAVSKAGIVILPPSQEVDEAILQETDEM 446

Query: 502  METDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLA 323
            ++ +   LKWP KPG           SWYDSPPEGFN+TLSPF TMF +LF+W+SSSSLA
Sbjct: 447  LDLETAPLKWPRKPGMPNYDVFESEDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLA 506

Query: 322  YIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPV 143
            +IYG +ES +EEYLS+NGREYP+KIV+ DGRS+EIKQTLAGCLARALPGLVA+LRLP+P+
Sbjct: 507  FIYGHDESNNEEYLSINGREYPRKIVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPI 566

Query: 142  STLEQGMGRLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            STLEQGM  LL+TMSF+DPLPAFRMKQW  IVLLFLDALSV RIP L
Sbjct: 567  STLEQGMVLLLNTMSFVDPLPAFRMKQWQLIVLLFLDALSVCRIPTL 613


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  340 bits (871), Expect = 8e-91
 Identities = 202/408 (49%), Positives = 260/408 (63%), Gaps = 29/408 (7%)
 Frame = -1

Query: 1138 DMNFTSTII-TQDEYSISK-------TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLT 983
            DMNFTSTII TQDEYSISK       T    K ++ K K S K    QS+  +K  +  T
Sbjct: 256  DMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSSENQSSATRKVGSSKT 315

Query: 982  N--IQETRSKNKSKNVITKDDKLSLLENIAGPS---------QNDSTKAVKELQES---- 848
            +  ++E RSK   K+ ++  D  S  ++    S         ++ S KA K ++ S    
Sbjct: 316  SRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSVSEKAAKPVESSLKPS 375

Query: 847  --TAGAXXXXXXXXXXXKATRSVTWADEKTDGDG-QNLNECRELKDKKGAVVTSHSADEE 677
              T+GA             TRSVTWADEK    G ++L E R ++D K       + D+ 
Sbjct: 376  LKTSGAKQL----------TRSVTWADEKVGSSGSRDLCEVRGMEDTKAGPEIVDNIDKR 425

Query: 676  VGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGD 506
                  +F SAEACA AL+QAAE VASG ++AS+A+SEAG++ILP PH  D+    E+ D
Sbjct: 426  DDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQGDPMEDVD 485

Query: 505  VMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSL 326
            V++ +   +KWP KPG           SWYD+PPEGF+L LS F+T++MALF+WV+SSSL
Sbjct: 486  VLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFAWVTSSSL 545

Query: 325  AYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIP 146
            AY+YGK+ES HEEYL VNGREYP+KIV+ DGRS EI+QT+ GCL RA P +VA+LRLPIP
Sbjct: 546  AYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVADLRLPIP 605

Query: 145  VSTLEQGMGRLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            +STLEQG   LL TMSF+D +PAFRMKQW  I LLF++ALSV RIPAL
Sbjct: 606  ISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPAL 653


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  338 bits (868), Expect = 2e-90
 Identities = 202/402 (50%), Positives = 258/402 (64%), Gaps = 23/402 (5%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISK------TVPAVKAKEPKGKASSKEVNRQSNPVQK--PTAPLT 983
            + +F+STIITQDEYS+SK       V + K KE + K   K  +   + + K      L 
Sbjct: 216  EFDFSSTIITQDEYSVSKFPAPVNAVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLR 275

Query: 982  NIQETRSKNKSKNVITKDDKLSLLENIAGPSQND-STKAVKELQ----------ESTAGA 836
            + +ET   +K+   + K DK +  E  +GPSQ+D   K+V  +           E     
Sbjct: 276  SGEETEKSDKNTRFL-KVDKFNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQL 334

Query: 835  XXXXXXXXXXXKATRSVTWADEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEES 662
                       K ++SVTWADE  DG  G+      ++ + +  A   S S D E  ++S
Sbjct: 335  LKSSLKSSNSKKMSQSVTWADEIIDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDS 394

Query: 661  YRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEE--ENGDVMETDP 488
            YRF SAEACA AL+QAAE VASG S+  DAVS+AG++ILP     DE   +  ++++ +P
Sbjct: 395  YRFESAEACAAALSQAAEAVASG-SDVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEP 453

Query: 487  LQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGK 308
              LKWP KPG            WYD PPEGFN+TLSPF+TMF +LF+W+SSSSLA+IYG 
Sbjct: 454  APLKWPRKPGMPNYDVFESEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGH 513

Query: 307  EESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQ 128
            +E+ +EEYLS+NGREYP KIV+ DG S+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQ
Sbjct: 514  DENNNEEYLSINGREYPHKIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQ 573

Query: 127  GMGRLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            GM  LL+TMSF+DPLPAFRMKQW  IVLLFLDALSV RIP L
Sbjct: 574  GMVLLLNTMSFVDPLPAFRMKQWQLIVLLFLDALSVCRIPTL 615


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  337 bits (864), Expect = 5e-90
 Identities = 198/397 (49%), Positives = 257/397 (64%), Gaps = 18/397 (4%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISK-------TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTN 980
            D +FTSTIIT DEYSISK       T   +K +   GK   + +N Q + ++K  +   +
Sbjct: 213  DTDFTSTIITNDEYSISKGPSGLTSTASDIKLQAQTGKGH-EGLNAQLSSLRKQDSIKAS 271

Query: 979  IQETRSKNKSKNVITKDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXK 800
                +SK + K  + K+     L     PS +  T   +++ ++T  A           K
Sbjct: 272  ---RKSKGRRKEKVIKEQ----LNFQDLPSSSYYTAEAEDISQATGAANLNESVLKPSLK 324

Query: 799  AT------RSVTWADEKTDGDG-QNLNECRELKDKKGAVVTSHSADEEVGEESYRFASAE 641
            ++      RSVTWADE+ D  G +NL E +E++    +   S SA++       RF SAE
Sbjct: 325  SSGAKRSNRSVTWADERVDNAGSRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAE 384

Query: 640  ACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPH----GEDEEENGDVMETDPLQLKW 473
            ACA+AL+QAAE VASG ++ + A+SEAG+I+LPP      G + E+N D++E +   LKW
Sbjct: 385  ACAVALSQAAEAVASGDADVNKAMSEAGIIVLPPSQDLGQGGNVEKN-DMIEQESASLKW 443

Query: 472  PPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFH 293
            P KPG           SWYD+PPEGF+LTLSPF+TM+MALF+WV+SSSLAYIYG++ES H
Sbjct: 444  PTKPGIPQSDLFDPEDSWYDAPPEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAH 503

Query: 292  EEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRL 113
            E+YLSVNGREYP+KIV+ DGRSSEI+ T   CLAR  PGLVA LRLPIPVSTLEQG GRL
Sbjct: 504  EDYLSVNGREYPRKIVLRDGRSSEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRL 563

Query: 112  LDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            L+TMSF+D LPAFR KQW  I LLF++ALSV RIPAL
Sbjct: 564  LETMSFVDALPAFRTKQWQVIALLFIEALSVCRIPAL 600


>ref|XP_006480289.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Citrus sinensis]
          Length = 768

 Score =  331 bits (849), Expect = 3e-88
 Identities = 197/399 (49%), Positives = 251/399 (62%), Gaps = 20/399 (5%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISK-------TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTN 980
            +M+FTS I+T DEYSISK       T+   K +E K  A  + +  Q   +      L  
Sbjct: 337  EMDFTSVIMTNDEYSISKPHCGSTKTITKTKFEETKENADGENLEDQCAAL----GSLAL 392

Query: 979  IQETRSKNKSKNVITKDDKLSLLENIA-------GPSQNDSTKAVKELQESTAGAXXXXX 821
            I++  S  KSK V+  +     + + +         S  D+ + ++  +ES +G      
Sbjct: 393  IKDD-SCRKSKTVVKAELSAQKVPSASVLPLTGSNISTVDAEREIQVAKESISGVSMPKS 451

Query: 820  XXXXXXKAT--RSVTWADEKTDGDG-QNLNECRELKDKKGAVVTSHSADEEVGEESYRFA 650
                        SVTWADEK DG G ++L E R++ D           ++   ++  RFA
Sbjct: 452  SLKSSGSKKVGLSVTWADEKIDGCGSRDLFEVRDMGD---------DGNDNNADDMLRFA 502

Query: 649  SAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPP---HGEDEEENGDVMETDPLQL 479
            SA ACAMAL++ AE V SG S+ +DAVSEAGVIILP P   H  +  E+ DV+E +   L
Sbjct: 503  SAGACAMALSRVAEAVMSGDSDVADAVSEAGVIILPSPRDGHEGESMEDPDVLEPEAALL 562

Query: 478  KWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEES 299
            KWP KPG           SWYD PPEGF+LTLSPF+TM+MA+F+W+SSSSLAYIYG++ES
Sbjct: 563  KWPSKPGIPRSELFDPEDSWYDEPPEGFSLTLSPFATMWMAIFAWISSSSLAYIYGRDES 622

Query: 298  FHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMG 119
            FHEEYLSVNGREY QKI+M DG SS IKQTL+GCLAR  P LVA+LRL IPVSTLE+G+ 
Sbjct: 623  FHEEYLSVNGREYSQKIIMGDGHSSAIKQTLSGCLARTFPALVADLRLRIPVSTLEKGLE 682

Query: 118  RLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
             LL+TMSFIDPLPAF++KQW  I +LFLDALSV RIPAL
Sbjct: 683  GLLNTMSFIDPLPAFKVKQWQVITVLFLDALSVCRIPAL 721


>ref|XP_006428243.1| hypothetical protein CICLE_v10011677mg [Citrus clementina]
            gi|557530300|gb|ESR41483.1| hypothetical protein
            CICLE_v10011677mg [Citrus clementina]
          Length = 460

 Score =  331 bits (849), Expect = 3e-88
 Identities = 197/399 (49%), Positives = 251/399 (62%), Gaps = 20/399 (5%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISK-------TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTN 980
            +M+FTS I+T DEYSISK       T+   K +E K  A  + +  Q   +      L  
Sbjct: 29   EMDFTSVIMTNDEYSISKPHCGSTKTITKTKFEETKENADGENLEDQCAAL----GSLAL 84

Query: 979  IQETRSKNKSKNVITKDDKLSLLENIA-------GPSQNDSTKAVKELQESTAGAXXXXX 821
            I++  S  KSK V+  +     + + +         S  D+ + ++  +ES +G      
Sbjct: 85   IKDD-SCRKSKTVVKAELSAQKVPSASVLPLTGSNISTVDAEREIQVAKESISGVSMPKS 143

Query: 820  XXXXXXKAT--RSVTWADEKTDGDG-QNLNECRELKDKKGAVVTSHSADEEVGEESYRFA 650
                        SVTWADEK DG G ++L E R++ D           ++   ++  RFA
Sbjct: 144  SLKSSGSKKVGLSVTWADEKIDGCGSRDLFEVRDMGD---------DGNDNNADDMLRFA 194

Query: 649  SAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPP---HGEDEEENGDVMETDPLQL 479
            SA ACAMAL++ AE V SG S+ +DAVSEAGVIILP P   H  +  E+ DV+E +   L
Sbjct: 195  SAGACAMALSRVAEAVMSGDSDVADAVSEAGVIILPSPRDGHEGESMEDPDVLEPEAALL 254

Query: 478  KWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEES 299
            KWP KPG           SWYD PPEGF+LTLSPF+TM+MA+F+W+SSSSLAYIYG++ES
Sbjct: 255  KWPSKPGIPRSELFDPEDSWYDEPPEGFSLTLSPFATMWMAIFAWISSSSLAYIYGRDES 314

Query: 298  FHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMG 119
            FHEEYLSVNGREY QKI+M DG SS IKQTL+GCLAR  P LVA+LRL IPVSTLE+G+ 
Sbjct: 315  FHEEYLSVNGREYSQKIIMGDGHSSAIKQTLSGCLARTFPALVADLRLRIPVSTLEKGLE 374

Query: 118  RLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
             LL+TMSFIDPLPAF++KQW  I +LFLDALSV RIPAL
Sbjct: 375  GLLNTMSFIDPLPAFKVKQWQVITVLFLDALSVCRIPAL 413


>gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris]
          Length = 706

 Score =  330 bits (846), Expect = 6e-88
 Identities = 210/446 (47%), Positives = 259/446 (58%), Gaps = 67/446 (15%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA-------------------------------------- 1073
            +MNF STII QDEYS+SK  P                                       
Sbjct: 215  EMNFVSTIIMQDEYSVSKASPGQTDTTAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDL 274

Query: 1072 ---------VKAKEPKGKASSK--EVNRQSNP---VQKPTAPLTNIQETR---SKNKS-- 950
                     + A E KGK  SK  EV  +S P   ++K  A   +I E      KN S  
Sbjct: 275  SSSFESGLHLSASE-KGKEVSKSCEVVVKSTPNLAIKKKDAHSVSISERHYDVEKNNSAR 333

Query: 949  KNVITKDDKLSLLENIAGPSQNDSTKAVKE-LQESTAGAXXXXXXXXXXXKA-----TRS 788
            K+V  K +   +  N    + N     VKE  Q    G             A     +R+
Sbjct: 334  KSVQLKGETSRVTVNGDASTSNFDPDNVKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRT 393

Query: 787  VTWADEKTDGDG-QNLNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAA 611
            VTWADEK +G G ++L E +E  D      +  + D    E+  R ASAEACA+AL+QA+
Sbjct: 394  VTWADEKINGAGNKDLCEVKEFGDIIKESESVGNEDVANNEDMLRQASAEACAIALSQAS 453

Query: 610  EEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXX 440
            E VASG S+A+DAVSEAG+IILP PH   EE   E+ D+++ D + LKWP KPG      
Sbjct: 454  EAVASGDSDATDAVSEAGIIILPQPHDAVEEGTMEDADILQNDSVTLKWPRKPGISDIDF 513

Query: 439  XXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREY 260
                 SW+D+PPEGF+LTLSPF+ M+ A+FSW++S SLAYIYG++ESFHEEYLSVNGREY
Sbjct: 514  FESDDSWFDAPPEGFSLTLSPFANMWNAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREY 573

Query: 259  PQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLP 80
            P K+V+ DGRSSEIKQT AGCLARA P LVA LRLPIP+STLEQGM  LL+TMSF+D LP
Sbjct: 574  PCKVVLSDGRSSEIKQTFAGCLARAFPALVAGLRLPIPISTLEQGMACLLETMSFVDALP 633

Query: 79   AFRMKQWYAIVLLFLDALSVSRIPAL 2
            AFR KQW  + LLF+DALSV RIP+L
Sbjct: 634  AFRTKQWQVVALLFVDALSVCRIPSL 659


>ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Glycine max]
          Length = 706

 Score =  328 bits (841), Expect = 2e-87
 Identities = 201/446 (45%), Positives = 257/446 (57%), Gaps = 67/446 (15%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA-------------VKAKEPKGKASSKEVNRQSNPVQ-- 1004
            +M F STII QDEYS+SK  P                 K+P+ K  ++ V +  + +Q  
Sbjct: 215  EMGFVSTIIMQDEYSVSKVPPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDL 273

Query: 1003 ----KPTAPLTNIQETRSKNKSKNVITKDD-----------KLSLLENIAGPSQNDSTKA 869
                K +  L+  ++     KS   + K              +S+ E      QNDS + 
Sbjct: 274  SSSFKSSLILSTSEKEEEVTKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARK 333

Query: 868  VKELQEST---------------------------AGAXXXXXXXXXXXKA-----TRSV 785
              +++  T                           AG             A     +R+V
Sbjct: 334  SVQVKGKTSRVIANDDASTSNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTV 393

Query: 784  TWADEKTDGDG-QNLNECRELKD-KKGAVVTSHSADEEVGEESYRFASAEACAMALTQAA 611
            TWADEK +  G ++L E +E  D KK +    ++ D    E+  R ASAEACA+AL+ A+
Sbjct: 394  TWADEKINSTGSKDLCEFKEFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSAS 453

Query: 610  EEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXX 440
            E VASG S+ SDAVSEAG+ ILPPPH   EE   E+ D+++ D + LKWP K G      
Sbjct: 454  EAVASGDSDVSDAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADF 513

Query: 439  XXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREY 260
                 SW+D+PPEGF+LTLSPF+TM+  LFSW +SSSLAYIYG++ESFHEEYLSVNGREY
Sbjct: 514  FESDDSWFDAPPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREY 573

Query: 259  PQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLP 80
            P K+V+ DGRSSEIKQTLA CLARALP LVA LRLPIPVS +EQGM  LL+TMSF+D LP
Sbjct: 574  PCKVVLADGRSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALP 633

Query: 79   AFRMKQWYAIVLLFLDALSVSRIPAL 2
            AFR KQW  + LLF+DALSV R+PAL
Sbjct: 634  AFRTKQWQVVALLFIDALSVCRLPAL 659


>gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
          Length = 739

 Score =  327 bits (838), Expect = 5e-87
 Identities = 192/397 (48%), Positives = 253/397 (63%), Gaps = 18/397 (4%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA---------VKAKEPKGKASSKE----VNRQSNPVQKP 998
            +M+FTS II  DEY+ISK             +K  E KG     E    ++  S+ +++ 
Sbjct: 309  EMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREK 368

Query: 997  TAPLTNIQETRSKNKSKNVITKDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXX 818
             + +  +  T  KN  ++ +      +  E  A  +   S   +K   +S AGA      
Sbjct: 369  DSSIVELPST--KNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKS-AGAKKL--- 422

Query: 817  XXXXXKATRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEESYRFASA 644
                    R VTWAD+K  D  G  NL E +E++  KG    S SA++   +   RF SA
Sbjct: 423  -------NRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSA 475

Query: 643  EACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE---NGDVMETDPLQLKW 473
            EACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE   +GD++E +   +KW
Sbjct: 476  EACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKW 535

Query: 472  PPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFH 293
            P KPG           SW+D+PPEGF+LTLS F+TM+ ALF W++SSSLAYIYG++ESFH
Sbjct: 536  PKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFH 595

Query: 292  EEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRL 113
            EEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRLPIP+STLEQGMG L
Sbjct: 596  EEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTLEQGMGHL 655

Query: 112  LDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            +DT+SF++ LPAFRMKQW  IVLLF+DALSV RIPAL
Sbjct: 656  IDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPAL 692


>ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Glycine max]
          Length = 706

 Score =  323 bits (827), Expect = 1e-85
 Identities = 199/450 (44%), Positives = 251/450 (55%), Gaps = 71/450 (15%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPAVKAK------------EPKGKASSKEVNRQSNPVQKPT 995
            +M F STII QD YS+SK +P  +              +  GK  +K V +    +Q  +
Sbjct: 215  EMGFVSTIIMQDGYSVSKVLPGQRDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLS 274

Query: 994  AP---------------LTNIQETRSKNKSKNVITKDD--KLSLLENIAGPSQNDSTK-- 872
            +                L    E   K+     I K D   +S+ E      QNDS K  
Sbjct: 275  SSFKSSLILGTSEKEEELAQSCEAALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKS 334

Query: 871  -------------------------AVKELQESTAGAXXXXXXXXXXXKA-----TRSVT 782
                                       ++ Q   AG             A     +R+VT
Sbjct: 335  VQVKGKMSRVTANDDASTSNLDPANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVT 394

Query: 781  WADEKTDGDG-------QNLNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMAL 623
            WAD+K +  G       +N  + R   D  G     +S D    E++ R ASAEAC +AL
Sbjct: 395  WADKKINSTGSKDLCGFKNFGDIRNESDSAG-----NSIDVANDEDTLRRASAEACVIAL 449

Query: 622  TQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXX 452
            + A+E VASG S+ SDAVSEAG+IILPPPH   EE   E+ D+++ D + +KWP KPG  
Sbjct: 450  SSASEAVASGDSDVSDAVSEAGIIILPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGIS 509

Query: 451  XXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVN 272
                     SW+D+ PEGF+LTLSPF+TM+  LFSW++SSSLAYIYG++ESF EEYLSVN
Sbjct: 510  EADFFESDDSWFDAAPEGFSLTLSPFATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVN 569

Query: 271  GREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFI 92
            GREYP K+V+ DGRSSEIKQTLA CLARALP LVA LRLPIPVST+EQGM  LL+TMSF+
Sbjct: 570  GREYPCKVVLADGRSSEIKQTLASCLARALPTLVAVLRLPIPVSTMEQGMACLLETMSFV 629

Query: 91   DPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            D LPAFR KQW  + LLF+DALSV R+PAL
Sbjct: 630  DALPAFRTKQWQVVALLFIDALSVCRLPAL 659


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  320 bits (821), Expect = 5e-85
 Identities = 192/408 (47%), Positives = 249/408 (61%), Gaps = 29/408 (7%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTV-------------PAVKAKEPKGKASSKEVNRQSNPVQKP 998
            + +F STII QDEYS+SK               P    ++PK      E+ R+ + +Q  
Sbjct: 214  EFDFMSTIIMQDEYSVSKVSSGQTDATVDHQIKPTAILEQPK--RVDHELVRKDDDIQDL 271

Query: 997  TAPLTNIQETRSKNKSK-------NVITKDDKLSLLENIAGPSQNDSTKAVKELQ-ESTA 842
            ++   +     +  K K       NV+          + +  S  D +   +++Q E   
Sbjct: 272  SSSFASSLNLSASKKDKEIAKSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEI 331

Query: 841  GAXXXXXXXXXXXKAT----RSVTWADEKTDGDGQ-NLNECRELKDKKGAVVTSHSADEE 677
            G+                  RSVTWAD+K DG G  +L   +E  + K     + + D  
Sbjct: 332  GSCHTKPKSSLKSNGKKKLGRSVTWADKKIDGCGSTDLCAFKEFGNIKKESDVADNVDVV 391

Query: 676  VGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGD 506
              E+  R  SAEACA+AL+QAAE VASG S+A DAVSEAG+IILP      EE   ++ D
Sbjct: 392  DDEDILRSVSAEACAIALSQAAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVD 451

Query: 505  VMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSL 326
            ++ETD + LKWP KPG           SW+D+PPEGF+LTLSPF+T++ A FSW++SSSL
Sbjct: 452  ILETDSVTLKWPRKPGISDFDLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSL 511

Query: 325  AYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIP 146
            AYIYG++ SF+EE+LSV+GREYP KIV+ DGRSSEIKQTLA CLARALP +VAEL+LP+P
Sbjct: 512  AYIYGRDVSFYEEFLSVDGREYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMP 571

Query: 145  VSTLEQGMGRLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            VSTLEQGM  LLDTMSF+DPLP FR KQW  + LLF+DALSV RIPAL
Sbjct: 572  VSTLEQGMVCLLDTMSFVDPLPGFRFKQWQVVALLFVDALSVCRIPAL 619


>ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X2 [Glycine max]
          Length = 716

 Score =  320 bits (820), Expect = 7e-85
 Identities = 201/456 (44%), Positives = 257/456 (56%), Gaps = 77/456 (16%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA-------------VKAKEPKGKASSKEVNRQSNPVQ-- 1004
            +M F STII QDEYS+SK  P                 K+P+ K  ++ V +  + +Q  
Sbjct: 215  EMGFVSTIIMQDEYSVSKVPPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDL 273

Query: 1003 ----KPTAPLTNIQETRSKNKSKNVITKDD-----------KLSLLENIAGPSQNDSTKA 869
                K +  L+  ++     KS   + K              +S+ E      QNDS + 
Sbjct: 274  SSSFKSSLILSTSEKEEEVTKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARK 333

Query: 868  VKELQEST---------------------------AGAXXXXXXXXXXXKA-----TRSV 785
              +++  T                           AG             A     +R+V
Sbjct: 334  SVQVKGKTSRVIANDDASTSNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTV 393

Query: 784  TWADEKTDGDG-QNLNECRELKD-KKGAVVTSHSADEEVGEESYRFASAEACAMALTQAA 611
            TWADEK +  G ++L E +E  D KK +    ++ D    E+  R ASAEACA+AL+ A+
Sbjct: 394  TWADEKINSTGSKDLCEFKEFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSAS 453

Query: 610  EEVASGKSEASDAV----------SEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWP 470
            E VASG S+ SDAV          SEAG+ ILPPPH   EE   E+ D+++ D + LKWP
Sbjct: 454  EAVASGDSDVSDAVFSPMNETCAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWP 513

Query: 469  PKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHE 290
             K G           SW+D+PPEGF+LTLSPF+TM+  LFSW +SSSLAYIYG++ESFHE
Sbjct: 514  RKTGISEADFFESDDSWFDAPPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHE 573

Query: 289  EYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLL 110
            EYLSVNGREYP K+V+ DGRSSEIKQTLA CLARALP LVA LRLPIPVS +EQGM  LL
Sbjct: 574  EYLSVNGREYPCKVVLADGRSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLL 633

Query: 109  DTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            +TMSF+D LPAFR KQW  + LLF+DALSV R+PAL
Sbjct: 634  ETMSFVDALPAFRTKQWQVVALLFIDALSVCRLPAL 669


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  320 bits (819), Expect = 9e-85
 Identities = 197/444 (44%), Positives = 258/444 (58%), Gaps = 65/444 (14%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPLTN 980
            DM+F STIIT+DEY++SKT  ++K        +E +   + K +  +   ++   AP +N
Sbjct: 211  DMDFVSTIITEDEYTVSKTPSSLKKTGLDSKVREQEEILAKKAMGNEFAVLETSYAPASN 270

Query: 979  IQETRSKNKSKNVITKDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXK 800
            +  +R     ++V +     S L +     ++   KA K     T  +           K
Sbjct: 271  V--SRVGLVFEDVTSSLRAGSCLSSARAEEESHDDKAEK----CTEASIKSSLKPSRKKK 324

Query: 799  ATRSVTWADEKTDGDG--------------------QNLN-------------------- 740
             +R+VTWADEKTD  G                    +N N                    
Sbjct: 325  LSRTVTWADEKTDSSGGRKLCEIREIEDMKEDPSVVENKNGVSFTSSGKMKAGQSVIWAD 384

Query: 739  ------------ECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVAS 596
                        E RE++D K A     +AD    ++++RFASAEACA AL +A+E VAS
Sbjct: 385  EKGDSSKSIDVCEVREIEDAKEAADMLCNADTGENDDTFRFASAEACARALDEASEAVAS 444

Query: 595  GKSEASDAVSEAGVIILPPPHGEDE----EENGDVMETDPLQ--LKWPPKPGXXXXXXXX 434
             + E +DA+SEAG+IILP P   DE    EE+ D   ++P Q  +KWP KPG        
Sbjct: 445  EELEVNDAMSEAGIIILPRPENGDEGEPMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFD 504

Query: 433  XXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQ 254
               SW+D+PPE F+LTLSPF+ M+ ALF+W +SS+LAYIYG++ES HEEY  VNGREYP+
Sbjct: 505  PEDSWFDAPPEDFSLTLSPFAKMWNALFTWTTSSTLAYIYGRDESLHEEYAVVNGREYPE 564

Query: 253  KIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAF 74
            KIV  DGRSSEIKQTLAG LARALPGLVA+LRL  P+S+LEQGMGRLLDTMSF+D LP F
Sbjct: 565  KIVFGDGRSSEIKQTLAGSLARALPGLVADLRLSTPISSLEQGMGRLLDTMSFVDALPPF 624

Query: 73   RMKQWYAIVLLFLDALSVSRIPAL 2
            RMKQW  I+LLFL+ALSV R+PAL
Sbjct: 625  RMKQWQVIILLFLEALSVYRLPAL 648


>gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  319 bits (817), Expect = 1e-84
 Identities = 203/436 (46%), Positives = 260/436 (59%), Gaps = 57/436 (13%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPAVKAK--EPKGKASSKEVNRQSNPVQKPTAPLTNIQETR 965
            +M+F STIIT DEYS+SK  P+V     E K K S  +V    N          +++++R
Sbjct: 239  EMDFMSTIITSDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKND---------SVKKSR 289

Query: 964  SKNKSKNVITKDDKLSLLE--NIAGPSQ---NDSTKAVKE------LQESTAGAXXXXXX 818
                 KN   K D + + E  + +  SQ   N STK  KE       ++S          
Sbjct: 290  QSKGGKNKNVKKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLK 349

Query: 817  XXXXXKATRSVTWADEKTDGDG-QNLNECRELK---DKKGAVVTSH--SADEEVG----- 671
                 K  RSVTWADE  D  G +NL E RE++   +   A  + H  S + +VG     
Sbjct: 350  PSGTKKLNRSVTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTW 409

Query: 670  ------------------------------EESYRFASAEACAMALTQAAEEVASGKSEA 581
                                          +E+    SAEACAMAL QAAE VASG+S+ 
Sbjct: 410  FDEKIDSTKSKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDV 469

Query: 580  SDAVSEAGVIILPPPHGEDEEE---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDS 410
            S AVS AG+IILP P G DEEE   + D++E++   L WP KPG           SW+D+
Sbjct: 470  SGAVSGAGIIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDA 528

Query: 409  PPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGR 230
            PPEGF++TLSPF+TM+ +LF+W++SS+LAYIYG++ESFHEE+LSVNGREYP KIV+  GR
Sbjct: 529  PPEGFSVTLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGR 588

Query: 229  SSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWYAI 50
            SSEIK+TL    ARALPG+V+ELRLP P+S+LEQGMGR+L+TMSFID +PAFRMKQW  I
Sbjct: 589  SSEIKKTLDESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVI 648

Query: 49   VLLFLDALSVSRIPAL 2
            VLLFL+ LSV RIPAL
Sbjct: 649  VLLFLEGLSVCRIPAL 664


>gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
          Length = 703

 Score =  316 bits (810), Expect = 1e-83
 Identities = 186/394 (47%), Positives = 246/394 (62%), Gaps = 15/394 (3%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA---------VKAKEPKGKASSKE----VNRQSNPVQKP 998
            +M+FTS II  DEY+ISK             +K  E KG     E    ++  S+ +++ 
Sbjct: 309  EMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREK 368

Query: 997  TAPLTNIQETRSKNKSKNVITKDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXX 818
             + +  +  T  KN  ++ +      +  E  A  +   S   +K   +S AGA      
Sbjct: 369  DSSIVELPST--KNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKS-AGAKKL--- 422

Query: 817  XXXXXKATRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEESYRFASA 644
                    R VTWAD+K  D  G  NL E +E++  KG    S SA++   +   RF SA
Sbjct: 423  -------NRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSA 475

Query: 643  EACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWPPK 464
            EACAMAL++AAE VASG S+ +DAV E           E+  E+GD++E +   +KWP K
Sbjct: 476  EACAMALSKAAEAVASGDSDVTDAVCEVDK--------EEPMEDGDMLEPETAPVKWPKK 527

Query: 463  PGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEY 284
            PG           SW+D+PPEGF+LTLS F+TM+ ALF W++SSSLAYIYG++ESFHEEY
Sbjct: 528  PGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEY 587

Query: 283  LSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDT 104
            LS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRLPIP+STLEQGMG L+DT
Sbjct: 588  LSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDT 647

Query: 103  MSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            +SF++ LPAFRMKQW  IVLLF+DALSV RIPAL
Sbjct: 648  ISFMEALPAFRMKQWQVIVLLFIDALSVCRIPAL 681


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  315 bits (806), Expect = 3e-83
 Identities = 195/408 (47%), Positives = 246/408 (60%), Gaps = 29/408 (7%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-- 986
            D + TSTIIT +EYS+SK    +K       +K   G+   KE N Q   ++ P AP   
Sbjct: 214  DFSITSTIITDEEYSVSKISSGLKEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPP 273

Query: 985  ---TNIQETRSKNKSKNVITKDDKLSLLENIAGPSQNDSTKAVKELQESTAG-------A 836
                  +   SK ++K   TK+   +L  +    S+N ST      +E   G        
Sbjct: 274  KNSVGRKARGSKERTKVSATKESTDNL-SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTE 332

Query: 835  XXXXXXXXXXXKATRSVTWADEKTDGDG-QNLNECREL-KDKKGAVVTSHSAD-EEVGEE 665
                          RSVTWADEKTD     NL E  E+ K K+ +  TS+  + +   E+
Sbjct: 333  LKSSLKKPGKKNLCRSVTWADEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNED 392

Query: 664  SYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEENGDVMETDPL 485
              R  SAEACAMAL+QAAE + SG+SE SDAVSEAG+IILP P   +EE +     TDP+
Sbjct: 393  ILRVESAEACAMALSQAAEAITSGQSEVSDAVSEAGIIILPHPSDANEEAS-----TDPV 447

Query: 484  QLKWPP-------KPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSL 326
                P        K G           SWYD+PPEGF+LTLS F+TM+MA+F+WV+SSSL
Sbjct: 448  NASEPHSFSEKSNKLGVLRSDLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSL 507

Query: 325  AYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIP 146
            AYIYGK++ FHEE+L ++G+EYP KIV  DGRSSEIKQTLAGCL RA+PGL +EL L  P
Sbjct: 508  AYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTP 567

Query: 145  VSTLEQGMGRLLDTMSFIDPLPAFRMKQWYAIVLLFLDALSVSRIPAL 2
            +S LE GM  LLDTM+F+D LPAFRMKQW  IVLLF++ALSVSRIP+L
Sbjct: 568  ISRLENGMAHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVSRIPSL 615


>gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
          Length = 708

 Score =  301 bits (770), Expect = 4e-79
 Identities = 177/378 (46%), Positives = 237/378 (62%), Gaps = 18/378 (4%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA---------VKAKEPKGKASSKE----VNRQSNPVQKP 998
            +M+FTS II  DEY+ISK             +K  E KG     E    ++  S+ +++ 
Sbjct: 309  EMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREK 368

Query: 997  TAPLTNIQETRSKNKSKNVITKDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXX 818
             + +  +  T  KN  ++ +      +  E  A  +   S   +K   +S AGA      
Sbjct: 369  DSSIVELPST--KNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKS-AGAKKL--- 422

Query: 817  XXXXXKATRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEESYRFASA 644
                    R VTWAD+K  D  G  NL E +E++  KG    S SA++   +   RF SA
Sbjct: 423  -------NRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSA 475

Query: 643  EACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE---NGDVMETDPLQLKW 473
            EACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE   +GD++E +   +KW
Sbjct: 476  EACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKW 535

Query: 472  PPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFH 293
            P KPG           SW+D+PPEGF+LTLS F+TM+ ALF W++SSSLAYIYG++ESFH
Sbjct: 536  PKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFH 595

Query: 292  EEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRL 113
            EEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRLPIP+STLEQGMG L
Sbjct: 596  EEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTLEQGMGHL 655

Query: 112  LDTMSFIDPLPAFRMKQW 59
            +DT+SF++ LPAFRMKQW
Sbjct: 656  IDTISFMEALPAFRMKQW 673


>gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
          Length = 679

 Score =  301 bits (770), Expect = 4e-79
 Identities = 177/378 (46%), Positives = 237/378 (62%), Gaps = 18/378 (4%)
 Frame = -1

Query: 1138 DMNFTSTIITQDEYSISKTVPA---------VKAKEPKGKASSKE----VNRQSNPVQKP 998
            +M+FTS II  DEY+ISK             +K  E KG     E    ++  S+ +++ 
Sbjct: 309  EMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREK 368

Query: 997  TAPLTNIQETRSKNKSKNVITKDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXX 818
             + +  +  T  KN  ++ +      +  E  A  +   S   +K   +S AGA      
Sbjct: 369  DSSIVELPST--KNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKS-AGAKKL--- 422

Query: 817  XXXXXKATRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEESYRFASA 644
                    R VTWAD+K  D  G  NL E +E++  KG    S SA++   +   RF SA
Sbjct: 423  -------NRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSA 475

Query: 643  EACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE---NGDVMETDPLQLKW 473
            EACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE   +GD++E +   +KW
Sbjct: 476  EACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKW 535

Query: 472  PPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFH 293
            P KPG           SW+D+PPEGF+LTLS F+TM+ ALF W++SSSLAYIYG++ESFH
Sbjct: 536  PKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFH 595

Query: 292  EEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRL 113
            EEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRLPIP+STLEQGMG L
Sbjct: 596  EEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTLEQGMGHL 655

Query: 112  LDTMSFIDPLPAFRMKQW 59
            +DT+SF++ LPAFRMKQW
Sbjct: 656  IDTISFMEALPAFRMKQW 673

