BLASTX nr result

ID: Rauwolfia21_contig00009903 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00009903
         (2480 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   696   0.0  
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   686   0.0  
emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   677   0.0  
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   669   0.0  
gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma c...   598   e-168
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   597   e-168
gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus...   596   e-167
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   593   e-166
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...   593   e-166
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   591   e-166
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...   585   e-164
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   578   e-162
gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobro...   545   e-152
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     535   e-149
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   528   e-147
gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus pe...   521   e-145
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   516   e-143
gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]      509   e-141
gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]      508   e-141
gb|EOY34548.1| F2P16.20 protein, putative isoform 4 [Theobroma c...   480   e-132

>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  696 bits (1796), Expect = 0.0
 Identities = 379/674 (56%), Positives = 462/674 (68%), Gaps = 2/674 (0%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK EA+ VKDAVHKLQLCL EGI+DE++L AAG+L+S+SDYQDVV ERSIAN+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N+LPSER+RKG YRISLKEHKVYDLHETY YCST+C+VNS AFAGSL++ERS  L PAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L++VL LF G+ L  S D++K+    G S L++QEK D KG EVS+EEW+GPSNAIEGYV
Sbjct: 121  LNQVLNLFKGLHLH-SLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYV 179

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMDQTET 1566
            P+RD +  P   K + KG K K  ++  E++MI NE DF+S II QDEY+VSK       
Sbjct: 180  PQRDRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNA 239

Query: 1565 VSSQKNKEAKRKVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXXXXX 1386
             S+ K KE + K     R D    LG+       +  ++ +KSD+               
Sbjct: 240  DSNVKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDK---------NTRFLK 290

Query: 1385 XXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXXXXSCSV 1206
                    +S   G  Q+ +  KS    S    + AS                   S SV
Sbjct: 291  VDKFNSGEVSS--GPSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRSV 348

Query: 1205 TWADEQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMDDN--SYRFXXXXXXXXXXXXX 1032
            TWADE  DG   K   + ++  +  +S +  GSA  DM++N  SYRF             
Sbjct: 349  TWADESIDGGIGKKTESSSKISEY-ESQAYGGSASTDMEENDDSYRFESAEACAAALSQA 407

Query: 1031 XXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSD 852
                ASG SDV D VS AG++ILPP +E D A  QE   MLD E A +KWP KPG+PN D
Sbjct: 408  AEAVASG-SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYD 466

Query: 851  LFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGRE 672
            +FES+ SWY++PP+GF++TLSPF  MF +LF W SSSSLA+IYG+DES++EEYL +NGRE
Sbjct: 467  VFESEDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGRE 526

Query: 671  YPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPL 492
            YPRK VL+DGRS+EI+Q L+GCLARALPG+VADLRLP P+STLE+ +  LL+TMSF DPL
Sbjct: 527  YPRKIVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPL 586

Query: 491  PALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLII 312
            PA RMKQWQL+V LFLDALSVCRIPTL PYMTGRR S PKVLDGA+IS+ EYEIMKDLII
Sbjct: 587  PAFRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLII 646

Query: 311  PLGRVPQFVMQSGG 270
            PLGRVPQF MQSGG
Sbjct: 647  PLGRVPQFSMQSGG 660


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  686 bits (1771), Expect = 0.0
 Identities = 378/679 (55%), Positives = 462/679 (68%), Gaps = 7/679 (1%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK EA+ VKDAVHKLQLCL EGI+DEN+L AAG+L+S+SDYQDVV ERSIAN+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N+LPSER+RKG YRISLKEHKVYDLHETY YCST+C+VNS AFAGSL++ERS  L PAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGR-EVSVEEWVGPSNAIEGY 1749
            L++VL LF G+ L   +D +K+    G S L++QEK D KG  EVS+EEW+GPSNAIEGY
Sbjct: 121  LNQVLNLFKGLHLHSPED-VKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGY 179

Query: 1748 VPRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMDQTE 1569
            VP+RD +  P   K + KG K K  ++  E++MI NE DF+S II QDEY+VSK      
Sbjct: 180  VPQRDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVN 239

Query: 1568 TVSSQKNKEAKRKVETDERKDKSKNLGESAVCKSSQVHKKCQKSDE-----PLCRNVMGE 1404
             VSS+K KEA+ K     R D    LG+       +  ++ +KSD+      + +   GE
Sbjct: 240  AVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGE 299

Query: 1403 QXXXXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXX 1224
                         S+   +  D    Y    E D +       +SN+             
Sbjct: 300  VSSGPSQHDVKNKSVL--IMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQ-------- 349

Query: 1223 XXSCSVTWADEQTDGHDNKNLCNYNEC-EKNRDSSSQSGSADIDMDDNSYRFXXXXXXXX 1047
                SVTWADE  DG   K   + ++  E    +   S S D++ DD+SYRF        
Sbjct: 350  ----SVTWADEIIDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAA 405

Query: 1046 XXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPG 867
                     ASG SDV D VS AG++ILP  +E D A  QE   MLD E A +KWP KPG
Sbjct: 406  ALSQAAEAVASG-SDVPDAVSKAGIVILPTSQEVDEAILQET-EMLDIEPAPLKWPRKPG 463

Query: 866  LPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLC 687
            +PN D+FES+  WY+ PP+GF++TLSPFA MF +LF W SSSSLA+IYG+DE+++EEYL 
Sbjct: 464  MPNYDVFESEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLS 523

Query: 686  VNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMS 507
            +NGREYP K VL+DG S+EI+Q L+GCLARALPG+VADLRLP P+STLE+ +  LL+TMS
Sbjct: 524  INGREYPHKIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMS 583

Query: 506  FTDPLPALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIM 327
            F DPLPA RMKQWQL+V LFLDALSVCRIPTL PYMTGRR SLPKVLDGA+IS+ EYEIM
Sbjct: 584  FVDPLPAFRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIM 643

Query: 326  KDLIIPLGRVPQFVMQSGG 270
            KDLIIPLGRVPQF MQSGG
Sbjct: 644  KDLIIPLGRVPQFSMQSGG 662


>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  677 bits (1748), Expect = 0.0
 Identities = 350/673 (52%), Positives = 458/673 (68%), Gaps = 1/673 (0%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MA D+ + VKDAVHKLQL L EGIQ+EN+LFAAG+LMS+SDY+DVV ER+IAN+CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N+LPSER RKG YRISLKEHKVYDLHETY YCS+ C+VNSR+FAGSL+EER   L   +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            ++ +LRLF   SLE +K  L K    G+S+L+++E  + K  EVS+E+W+GPSNAIEGYV
Sbjct: 121  INGILRLFGESSLESNKI-LGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYV 179

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMDQT-E 1569
            P+RD   +PK  K  ++G K    +++  ++ + +EMDF S II +DEY++SK+     +
Sbjct: 180  PQRDRNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKD 239

Query: 1568 TVSSQKNKEAKRKVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXXXX 1389
            T S  K+KE K K       D+   L +SA    +    K ++S     R +  ++    
Sbjct: 240  TTSHAKSKEPKEKASIG---DQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTA 296

Query: 1388 XXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXXXXSCS 1209
                    S+    G + N +  K      ++HTE A+                     S
Sbjct: 297  EVP-----SVPSQSGSELNGVKGKE-----EYHTENAAQLGPTKPKSSLKPSGGKKVIRS 346

Query: 1208 VTWADEQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXX 1029
            VTWADE+ D  D+++ C   E E  ++  +  G  D+  DDN+ RF              
Sbjct: 347  VTWADEKMDSADSRDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAA 406

Query: 1028 XXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDL 849
               ASG++D+ D VS+AG+IILP PR+ D  E+ ++ ++L+PE   +KWP KPG+ +SD+
Sbjct: 407  EAVASGETDMTDAVSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDI 466

Query: 848  FESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREY 669
            F+SD SWY+ PP+GFSLTLSPFA M+MALFAW +SSS+AYIYG DES HEEYL VNGREY
Sbjct: 467  FDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREY 526

Query: 668  PRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLP 489
            P+K VLTDGRSSEI+Q L+GCL+RALPG+VADLRLP P+S LE+ + RLLDTMSF D LP
Sbjct: 527  PKKIVLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALP 586

Query: 488  ALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIP 309
            + RMKQWQ++V LF+DALSVCRIP L P+MT RR+  PKV D A++S+EEYE+MKDLIIP
Sbjct: 587  SFRMKQWQVIVLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIP 646

Query: 308  LGRVPQFVMQSGG 270
            LGRVPQF  QSGG
Sbjct: 647  LGRVPQFSAQSGG 659


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  669 bits (1727), Expect = 0.0
 Identities = 346/673 (51%), Positives = 456/673 (67%), Gaps = 1/673 (0%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MA D+ + VKDAVHKLQL L EGIQ+EN+LFAAG+LMS+SDY+DVV ER+IAN+CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N+LPSER RKG YRISLKEHKVYDLHETY YCS+ C+VNSR+FAGSL+EER   L   +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            ++ +LRLF   SLE +K  L K    G+S+L+++E  + K  EVS+E+W+GPSNAIEGYV
Sbjct: 121  INGILRLFGESSLESNKI-LGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYV 179

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMDQT-E 1569
            P+RD   +PK  K  ++G K    +++  ++ + +EMDF   II +DEY++SK+     +
Sbjct: 180  PQRDRNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKD 239

Query: 1568 TVSSQKNKEAKRKVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXXXX 1389
            T S  K+KE K K       D+   L +SA    +    K ++S     R +  ++    
Sbjct: 240  TTSHAKSKEPKEKASIG---DQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTA 296

Query: 1388 XXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXXXXSCS 1209
                    S+    G + N +  K      ++HTE A+                   + S
Sbjct: 297  EVP-----SVPSQSGSELNGVKGKE-----EYHTENAAQLGPTKLKSCLKPSGGKKVTRS 346

Query: 1208 VTWADEQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXX 1029
            VTWADE+ D  D+++ C   E E  ++  +  G  D+  DDN+ RF              
Sbjct: 347  VTWADEKMDSADSRDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAA 406

Query: 1028 XXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDL 849
               ASG++D+ D VS+A +IILP PR+ D  E+ ++ ++L+PE   +KWP KPG+ +SD+
Sbjct: 407  EAVASGETDMTDAVSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDI 466

Query: 848  FESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREY 669
            F+SD SWY+ PP+GFSLTLSPFA M+MALFAW +SSS+AYIYG DES HEEYL VNGREY
Sbjct: 467  FDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREY 526

Query: 668  PRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLP 489
            P+K VLTDGRSSEI+Q L+GCLARALPG+VADLRLP P+S LE+ + RLLDTMSF D LP
Sbjct: 527  PKKIVLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALP 586

Query: 488  ALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIP 309
            + RMKQWQ++V LF+DALSVC+IP L P+M  +R+  PKV D A++S+EEYE+MKDLIIP
Sbjct: 587  SFRMKQWQVIVLLFIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIP 646

Query: 308  LGRVPQFVMQSGG 270
            LGRVPQF  QSGG
Sbjct: 647  LGRVPQFSAQSGG 659


>gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
          Length = 739

 Score =  598 bits (1543), Expect = e-168
 Identities = 332/692 (47%), Positives = 428/692 (61%), Gaps = 21/692 (3%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK+++++V +AVHK+QL L +GI+DE +L A+G+L+S+SDY+DVV ER+I+N CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPSE  RKGRYRISLKEHKVYDL ETY +CST+CL+NSRAFAGSL+EER   L  AK
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+++L LF  + L+D  ++L K    G S+LR++E  + K  +VS+    GPSNAIEGYV
Sbjct: 175  LNDILSLFGDLDLDD--NDLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1745 PRRD----HTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSK--- 1587
            P+R+     TP    + ++      KLG   KE   + NE+DF   IIM DEY +SK   
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGS-KKEEYFVNNELDFAGTIIMNDEYIISKKPG 288

Query: 1586 AMDQTETVSSQKNKE---------AKRKVETDERKDKSKNLGESAVCKSSQV----HKKC 1446
            +  Q +       KE             +  DE        G    C  S +     K  
Sbjct: 289  SFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGI 348

Query: 1445 QKSDEPLCRNVMGEQXXXXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASN 1266
             K  E  C  V+              + +     + Q+ +   S EA+ + H +KA  S+
Sbjct: 349  CKDSEDKC--VISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406

Query: 1265 ACXXXXXXXXXXXXXXSCSVTWAD-EQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMD 1089
                            +  VTWAD ++ D   N NLC   E E  +  S  SGSA+   D
Sbjct: 407  ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGD 466

Query: 1088 DNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNML 909
            DN  RF                 ASGDSDV D V + G+IILP   E D  E  E+G+ML
Sbjct: 467  DNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDML 526

Query: 908  DPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAY 729
            +PE A +KWP KPG+P+SD+F  + SW++ PP+GFSLTLS FA M+ ALF W +SSSLAY
Sbjct: 527  EPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAY 586

Query: 728  IYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLS 549
            IYG DES HEEYL +NGREYPRK  L DGRSSEI++ L+ C++RALP +V DLRLP P+S
Sbjct: 587  IYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPIS 646

Query: 548  TLEREIDRLLDTMSFTDPLPALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKV 369
            TLE+ +  L+DT+SF + LPA RMKQWQ++V LF+DALSVCRIP L P+MT  R+ L KV
Sbjct: 647  TLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHKV 706

Query: 368  LDGAKISSEEYEIMKDLIIPLGRVPQFVMQSG 273
            LDGA+IS EEYE+MKDLIIPLGR P F  QSG
Sbjct: 707  LDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  597 bits (1539), Expect = e-168
 Identities = 323/665 (48%), Positives = 431/665 (64%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK+E+++VKD V+KLQL L EGI++E++L AAG+LMS+SDY+DVV+ERSI+N+CGYPLC
Sbjct: 1    MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N+LPS+R  KGRYRISLKEH+VYDL ETY YCS+SCLVNSRAF+ SL+E+R   L P K
Sbjct: 61   NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+E+LR F  ++L+   + L +    G+S+L++QEK++T   +VS+EEW+GPSNAIEGYV
Sbjct: 121  LNEILRKFNDLTLDS--EGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYV 178

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMDQTET 1566
            P+ D  P P  +   E G K    +   ++   F++ DFTS II  DEY++SK      +
Sbjct: 179  PQGDRDPNPSLKNHKE-GLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTS 237

Query: 1565 VSSQKNKEAKRKVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXXXXX 1386
             +S    +A+     +    +  +L +    K+S+  K  +K        V+ EQ     
Sbjct: 238  TASDIKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKE------KVIKEQLNFQD 291

Query: 1385 XXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXXXXSCSV 1206
                          L  ++ Y  + EA+       A+  N                + SV
Sbjct: 292  --------------LPSSSYY--TAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSV 335

Query: 1205 TWADEQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXXX 1026
            TWADE+ D   ++NLC   E E+  +S   S SA+   D +  RF               
Sbjct: 336  TWADERVDNAGSRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAE 395

Query: 1025 XXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDLF 846
              ASGD+DV   +S+AG+I+LPP ++       E+ +M++ E AS+KWP+KPG+P SDLF
Sbjct: 396  AVASGDADVNKAMSEAGIIVLPPSQDLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLF 455

Query: 845  ESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYP 666
            + + SWY+ PP+GFSLTLSPFA M+MALFAW +SSSLAYIYG DES HE+YL VNGREYP
Sbjct: 456  DPEDSWYDAPPEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYP 515

Query: 665  RKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPA 486
            RK VL DGRSSEIR     CLAR  PG+VA+LRLP P+STLE+   RLL+TMSF D LPA
Sbjct: 516  RKIVLRDGRSSEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPA 575

Query: 485  LRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPL 306
             R KQWQ++  LF++ALSVCRIP L  YMT RR+ L +VLDGA IS+EEY+IMKD ++PL
Sbjct: 576  FRTKQWQVIALLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPL 635

Query: 305  GRVPQ 291
            GR PQ
Sbjct: 636  GRDPQ 640


>gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris]
          Length = 706

 Score =  596 bits (1537), Expect = e-167
 Identities = 332/709 (46%), Positives = 442/709 (62%), Gaps = 38/709 (5%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAKD+A++VKDAV KLQ+ L EGIQ+E++LFAAG+LMS+SDY+D+V ERSI N+CGYPLC
Sbjct: 1    MAKDKAVSVKDAVFKLQMLLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPSER RKG+YRISLKEHKVYDL ETY +CS++C+V+S+AF+G L+ ER   L P K
Sbjct: 61   CNALPSERPRKGKYRISLKEHKVYDLQETYMFCSSNCVVSSKAFSGILQAERCSALDPEK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+ VL LF  ++LE + +N+ K+   G+S+L++QEKT T   EV +E+WVGPSNAIEGYV
Sbjct: 121  LNNVLGLFENLNLEQT-ENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYV 179

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMD-QTE 1569
            P+         +K ++KG K   G+ N ++ +I +EM+F S IIMQDEY+VSKA   QT+
Sbjct: 180  PKPRERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKASPGQTD 239

Query: 1568 TVS-----------SQKNKEAKRKVETDER--KDKSKNLGESAVCKSSQVHKKCQKSDEP 1428
            T +            Q+ K   + V  DE   +D S +        +S+  K+  KS E 
Sbjct: 240  TTAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASEKGKEVSKSCEV 299

Query: 1427 LCRNVMGEQXXXXXXXXXXXLSISDP-VGLDQNTIYKKSTE------------------- 1308
            + ++                +SIS+    +++N   +KS +                   
Sbjct: 300  VVKSTPN---LAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDASTSNF 356

Query: 1307 ----ADSKFHTEKASASNACXXXXXXXXXXXXXXSCSVTWADEQTDGHDNKNLCNYNECE 1140
                   KF  EK                     S +VTWADE+ +G  NK+LC   E  
Sbjct: 357  DPDNVKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADEKINGAGNKDLCEVKEFG 416

Query: 1139 KNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILP 960
                 S   G+ D+  +++  R                  ASGDSD  D VS+AG+IILP
Sbjct: 417  DIIKESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGIIILP 476

Query: 959  PPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFA 780
             P ++    T E+ ++L  +  ++KWP KPG+ + D FESD SW++ PP+GFSLTLSPFA
Sbjct: 477  QPHDAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTLSPFA 536

Query: 779  MMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLA 600
             M+ A+F+W +S SLAYIYG DES HEEYL VNGREYP K VL+DGRSSEI+Q  +GCLA
Sbjct: 537  NMWNAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREYPCKVVLSDGRSSEIKQTFAGCLA 596

Query: 599  RALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPALRMKQWQLLVFLFLDALSVCRI 420
            RA P +VA LRLP P+STLE+ +  LL+TMSF D LPA R KQWQ++  LF+DALSVCRI
Sbjct: 597  RAFPALVAGLRLPIPISTLEQGMACLLETMSFVDALPAFRTKQWQVVALLFVDALSVCRI 656

Query: 419  PTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPLGRVPQFVMQSG 273
            P+L  YMT RR    KVL G++I  EEYEI+KDL++PLGR P   +QSG
Sbjct: 657  PSLISYMTDRRALFHKVLSGSQIGMEEYEILKDLVVPLGRAPHISVQSG 705


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  593 bits (1528), Expect = e-166
 Identities = 327/683 (47%), Positives = 432/683 (63%), Gaps = 12/683 (1%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            M KD+ ++VKDAV KLQL L EGIQ E++LFAAG+L+S+SDY+DVV ERSI  +C YPLC
Sbjct: 1    MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPSER RKGRYRISLKEHKVYDLHETY +CS+SC+VNS+AFAGSL+++R   L P K
Sbjct: 61   CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+ +LRLF   +LE   +N  K+ + G+S LR+Q+KT+T   EVS+E+WVGPSNAIEGYV
Sbjct: 121  LNNILRLFGNSNLEPM-ENSGKDGELGLSSLRIQDKTETV-TEVSLEQWVGPSNAIEGYV 178

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMD-QTE 1569
            P++        QK  +KG K   G+ N  +++I +E DF S IIMQDEY+VSK    QT+
Sbjct: 179  PKKRDNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTD 238

Query: 1568 TVSSQKNK-----EAKRKVE------TDERKDKSKNLGESAVCKSSQVHKKCQKSDEPLC 1422
                 + K     E  ++V+       D+ +D S +   S    +S+  K+  KS    C
Sbjct: 239  ATVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKS----C 294

Query: 1421 RNVMGEQXXXXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASNACXXXXXX 1242
            +NV+  +            S  DP            ++ + K   EK   S         
Sbjct: 295  KNVLKGKTNRVAANDDSSTSNFDP------------SDVEEKIQIEKEIGSCHTKPKSSL 342

Query: 1241 XXXXXXXXSCSVTWADEQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMDDNSYRFXXX 1062
                      SVTWAD++ DG  + +LC + E    +  S  + + D+  D++  R    
Sbjct: 343  KSNGKKKLGRSVTWADKKIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSA 402

Query: 1061 XXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKW 882
                          ASGDSD  D VS+AG+IILP    +    T ++ ++L+ +  ++KW
Sbjct: 403  EACAIALSQAAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKW 462

Query: 881  PSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHH 702
            P KPG+ + DLF SD SW++ PP+GFSLTLSPFA ++ A F+W +SSSLAYIYG D S +
Sbjct: 463  PRKPGISDFDLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFY 522

Query: 701  EEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRL 522
            EE+L V+GREYP K VL+DGRSSEI+Q L+ CLARALP VVA+L+LP P+STLE+ +  L
Sbjct: 523  EEFLSVDGREYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCL 582

Query: 521  LDTMSFTDPLPALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSE 342
            LDTMSF DPLP  R KQWQ++  LF+DALSVCRIP L  YMT RR    KVL G++I  E
Sbjct: 583  LDTMSFVDPLPGFRFKQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGME 642

Query: 341  EYEIMKDLIIPLGRVPQFVMQSG 273
            EY ++KDLI+PLGR P F  QSG
Sbjct: 643  EYNVLKDLIVPLGRAPHFSSQSG 665


>ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Glycine max]
          Length = 706

 Score =  593 bits (1528), Expect = e-166
 Identities = 329/709 (46%), Positives = 443/709 (62%), Gaps = 38/709 (5%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAKD+ ++VKDAV KLQ+ L EGIQ+E++LFAAG+LMS+SDY+D+V ERSI N+CGYPLC
Sbjct: 1    MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPS+R RKGRYRISLKEHKVYDL ETY +CS++CLV+S+ FAGSL+ ER   L   K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+ VL LF  ++LE   + L+K    G+SDL++QEKT+    EVS+E+W GPSNAIEGYV
Sbjct: 121  LNNVLSLFENLNLEPV-ETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYV 179

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKA------ 1584
            P+  +      +K ++KG K   G+   + ++I +EM F S IIMQDEY+VSK       
Sbjct: 180  PKPRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMD 239

Query: 1583 ------MDQTETVSSQKNKEAKR-KVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPL 1425
                  +  T TV   +  +A+  + + D  +D S +   S +  +S+  ++  KS E +
Sbjct: 240  ATANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAV 299

Query: 1424 CRNVMGEQXXXXXXXXXXXLSISD-PVGLDQNTIYKKSTEA------------------- 1305
             +   G             +SIS+    ++QN   +KS +                    
Sbjct: 300  LKFSPG---CAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDASTSNLD 356

Query: 1304 ----DSKFHTEKASASNACXXXXXXXXXXXXXXSCSVTWADEQTDGHDNKNLCNYNEC-E 1140
                + KF  EKA  S                 S +VTWADE+ +   +K+LC + E  +
Sbjct: 357  PANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFKEFGD 416

Query: 1139 KNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILP 960
              ++S S   + D+  D++  R                  ASGDSDV+D VS+AG+ ILP
Sbjct: 417  IKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGITILP 476

Query: 959  PPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFA 780
            PP ++    T E+ ++L  +  ++KWP K G+  +D FESD SW++ PP+GFSLTLSPFA
Sbjct: 477  PPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTLSPFA 536

Query: 779  MMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLA 600
             M+  LF+WT+SSSLAYIYG DES HEEYL VNGREYP K VL DGRSSEI+Q L+ CLA
Sbjct: 537  TMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLASCLA 596

Query: 599  RALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPALRMKQWQLLVFLFLDALSVCRI 420
            RALP +VA LRLP P+S +E+ +  LL+TMSF D LPA R KQWQ++  LF+DALSVCR+
Sbjct: 597  RALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSVCRL 656

Query: 419  PTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPLGRVPQFVMQSG 273
            P L  YMT RR S  +VL G++I  EEYE++KDL++PLGR P    QSG
Sbjct: 657  PALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSG 705


>ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Glycine max]
          Length = 706

 Score =  591 bits (1523), Expect = e-166
 Identities = 334/707 (47%), Positives = 443/707 (62%), Gaps = 36/707 (5%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            M KD+ ++VKDAV KLQ+ L EGIQ+E++LFAAG+LMS+SDY+D+V ERSI N+CGYPLC
Sbjct: 1    MEKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPS+R RKGRYRISLKEHKVYDLHETY +C ++C+V+S+AFAGSL+ ER   L   K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+ +L LF  ++LE + +NL+K +  G+SDL++QEKT+T   EVS+E+W GPSNAIEGYV
Sbjct: 121  LNNILSLFENLNLEPA-ENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYV 179

Query: 1745 PR-RDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAM---- 1581
            P+ RDH  +  R K ++KG K   G+   + ++I +EM F S IIMQD Y+VSK +    
Sbjct: 180  PKPRDHDSKGLR-KNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKVLPGQR 238

Query: 1580 DQTETVSSQKNKEAKRKVETDE---RKDKSKNLGESAVCKSSQVHKKCQKSDE--PLCRN 1416
            D T     +     K+  + D    RKD       S+  KSS +    +K +E    C  
Sbjct: 239  DATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQSCEA 298

Query: 1415 VM-GEQXXXXXXXXXXXLSISD-PVGLDQNTIYKKSTE---------------------- 1308
             +               +SIS+    ++QN   KKS +                      
Sbjct: 299  ALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSNLDPA 358

Query: 1307 -ADSKFHTEKASASNACXXXXXXXXXXXXXXSCSVTWADEQTDGHDNKNLCNYNECEKNR 1131
              + KF  EKA  S                 S +VTWAD++ +   +K+LC +      R
Sbjct: 359  NVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADKKINSTGSKDLCGFKNFGDIR 418

Query: 1130 DSSSQSG-SADIDMDDNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPP 954
            + S  +G S D+  D+++ R                  ASGDSDV+D VS+AG+IILPPP
Sbjct: 419  NESDSAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGIIILPPP 478

Query: 953  RESDGAETQEEGNMLDPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMM 774
             ++    T E+ ++L  +  ++KWP KPG+  +D FESD SW++  P+GFSLTLSPFA M
Sbjct: 479  HDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGFSLTLSPFATM 538

Query: 773  FMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARA 594
            +  LF+W +SSSLAYIYG DES  EEYL VNGREYP K VL DGRSSEI+Q L+ CLARA
Sbjct: 539  WNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIKQTLASCLARA 598

Query: 593  LPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPALRMKQWQLLVFLFLDALSVCRIPT 414
            LP +VA LRLP P+ST+E+ +  LL+TMSF D LPA R KQWQ++  LF+DALSVCR+P 
Sbjct: 599  LPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSVCRLPA 658

Query: 413  LAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPLGRVPQFVMQSG 273
            L  YMT RR S  +VL G++I  EEYE++KDL +PLGR P    QSG
Sbjct: 659  LISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSG 705


>ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X2 [Glycine max]
          Length = 716

 Score =  585 bits (1507), Expect = e-164
 Identities = 329/719 (45%), Positives = 443/719 (61%), Gaps = 48/719 (6%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAKD+ ++VKDAV KLQ+ L EGIQ+E++LFAAG+LMS+SDY+D+V ERSI N+CGYPLC
Sbjct: 1    MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPS+R RKGRYRISLKEHKVYDL ETY +CS++CLV+S+ FAGSL+ ER   L   K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+ VL LF  ++LE   + L+K    G+SDL++QEKT+    EVS+E+W GPSNAIEGYV
Sbjct: 121  LNNVLSLFENLNLEPV-ETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYV 179

Query: 1745 PRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKA------ 1584
            P+  +      +K ++KG K   G+   + ++I +EM F S IIMQDEY+VSK       
Sbjct: 180  PKPRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMD 239

Query: 1583 ------MDQTETVSSQKNKEAKR-KVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPL 1425
                  +  T TV   +  +A+  + + D  +D S +   S +  +S+  ++  KS E +
Sbjct: 240  ATANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAV 299

Query: 1424 CRNVMGEQXXXXXXXXXXXLSISD-PVGLDQNTIYKKSTEA------------------- 1305
             +   G             +SIS+    ++QN   +KS +                    
Sbjct: 300  LKFSPG---CAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDASTSNLD 356

Query: 1304 ----DSKFHTEKASASNACXXXXXXXXXXXXXXSCSVTWADEQTDGHDNKNLCNYNEC-E 1140
                + KF  EKA  S                 S +VTWADE+ +   +K+LC + E  +
Sbjct: 357  PANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFKEFGD 416

Query: 1139 KNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIV--------- 987
              ++S S   + D+  D++  R                  ASGDSDV+D V         
Sbjct: 417  IKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMNETCA 476

Query: 986  -SDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDLFESDGSWYENPPD 810
             S+AG+ ILPPP ++    T E+ ++L  +  ++KWP K G+  +D FESD SW++ PP+
Sbjct: 477  VSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPE 536

Query: 809  GFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYPRKAVLTDGRSSE 630
            GFSLTLSPFA M+  LF+WT+SSSLAYIYG DES HEEYL VNGREYP K VL DGRSSE
Sbjct: 537  GFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSE 596

Query: 629  IRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPALRMKQWQLLVFL 450
            I+Q L+ CLARALP +VA LRLP P+S +E+ +  LL+TMSF D LPA R KQWQ++  L
Sbjct: 597  IKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALL 656

Query: 449  FLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPLGRVPQFVMQSG 273
            F+DALSVCR+P L  YMT RR S  +VL G++I  EEYE++KDL++PLGR P    QSG
Sbjct: 657  FIDALSVCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSG 715


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  578 bits (1489), Expect = e-162
 Identities = 320/713 (44%), Positives = 431/713 (60%), Gaps = 42/713 (5%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAKD++  VKD ++KLQL L +GIQ+E++L AAG++MS SDY+DVV ER+IAN+CGYPLC
Sbjct: 1    MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N+LPS+R +KGRYRISLKEHKVYDLHETY YCS+SC++NSR F+GSL+EER   L PAK
Sbjct: 61   GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+EVL LF   SL  S+ +L K    G S+L+++EKT+    EVS E+W+GPSNAIEGYV
Sbjct: 121  LNEVLMLFDNFSL-GSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYV 179

Query: 1745 PRRDHT-----------------------------------------PQPKRQKELEKGQ 1689
            P+RD                                           P+ K   +  KG 
Sbjct: 180  PQRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGS 239

Query: 1688 KPKLGQVNKERSMIFNEMDFTSAIIM-QDEYNVSKAMDQTETVSSQKNKEAKRKVETDER 1512
            K K  + + ++    N+M+FTS II+ QDEY++SK+       +S K K  K+K +  ++
Sbjct: 240  KAKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTS-KTKIQKQKEKVSQK 298

Query: 1511 KDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXXXXXXXXXXXLSISDPVGLDQN 1332
              ++++     V  S    K  +   +   ++ +  Q             +S P    Q 
Sbjct: 299  SSENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQ------------DLSSPFDSCQT 346

Query: 1331 TIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXXXXSCSVTWADEQTDGHDNKNLCNY 1152
            +    + EA  K  +EKA+                   + SVTWADE+     +++LC  
Sbjct: 347  SSITITAEAKEKSVSEKAAKPVESSLKPSLKTSGAKQLTRSVTWADEKVGSSGSRDLCEV 406

Query: 1151 NECEKNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGV 972
               E  +       + D   D    +F                 ASGD+D ++ +S+AG+
Sbjct: 407  RGMEDTKAGPEIVDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGL 466

Query: 971  IILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTL 792
            +ILP P + D  +  E+ ++LD E ++IKWP KPG+P S+ F+ + SWY+ PP+GFSL L
Sbjct: 467  VILPQPHDLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLEL 526

Query: 791  SPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALS 612
            S FA ++MALFAW +SSSLAY+YG DES HEEYL VNGREYPRK VL DGRS EI+Q + 
Sbjct: 527  SSFATIWMALFAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIE 586

Query: 611  GCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPALRMKQWQLLVFLFLDALS 432
            GCL RA P VVADLRLP P+STLE+    LL TMSF D +PA RMKQWQ++  LF++ALS
Sbjct: 587  GCLGRAFPVVVADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALS 646

Query: 431  VCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPLGRVPQFVMQSG 273
            VCRIP L  YM  RR+    V+DG ++S+EEYE+MKDL+IPLGR PQF  QSG
Sbjct: 647  VCRIPALISYMDNRRM----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695


>gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
          Length = 703

 Score =  545 bits (1404), Expect = e-152
 Identities = 308/667 (46%), Positives = 401/667 (60%), Gaps = 21/667 (3%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK+++++V +AVHK+QL L +GI+DE +L A+G+L+S+SDY+DVV ER+I+N CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPSE  RKGRYRISLKEHKVYDL ETY +CST+CL+NSRAFAGSL+EER   L  AK
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+++L LF  + L+D  ++L K    G S+LR++E  + K  +VS+    GPSNAIEGYV
Sbjct: 175  LNDILSLFGDLDLDD--NDLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1745 PRRD----HTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSK--- 1587
            P+R+     TP    + ++      KLG   KE   + NE+DF   IIM DEY +SK   
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGS-KKEEYFVNNELDFAGTIIMNDEYIISKKPG 288

Query: 1586 AMDQTETVSSQKNKE---------AKRKVETDERKDKSKNLGESAVCKSSQV----HKKC 1446
            +  Q +       KE             +  DE        G    C  S +     K  
Sbjct: 289  SFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGI 348

Query: 1445 QKSDEPLCRNVMGEQXXXXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASN 1266
             K  E  C  V+              + +     + Q+ +   S EA+ + H +KA  S+
Sbjct: 349  CKDSEDKC--VISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406

Query: 1265 ACXXXXXXXXXXXXXXSCSVTWAD-EQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMD 1089
                            +  VTWAD ++ D   N NLC   E E  +  S  SGSA+   D
Sbjct: 407  ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGD 466

Query: 1088 DNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNML 909
            DN  RF                 ASGDSDV D V            E D  E  E+G+ML
Sbjct: 467  DNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVC-----------EVDKEEPMEDGDML 515

Query: 908  DPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAY 729
            +PE A +KWP KPG+P+SD+F  + SW++ PP+GFSLTLS FA M+ ALF W +SSSLAY
Sbjct: 516  EPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAY 575

Query: 728  IYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLS 549
            IYG DES HEEYL +NGREYPRK  L DGRSSEI++ L+ C++RALP +V DLRLP P+S
Sbjct: 576  IYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPIS 635

Query: 548  TLEREIDRLLDTMSFTDPLPALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKV 369
            TLE+ +  L+DT+SF + LPA RMKQWQ++V LF+DALSVCRIP L P+MT  R+ L KV
Sbjct: 636  TLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHKV 695

Query: 368  LDGAKIS 348
            LDGA+IS
Sbjct: 696  LDGAQIS 702


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  535 bits (1379), Expect = e-149
 Identities = 308/717 (42%), Positives = 417/717 (58%), Gaps = 46/717 (6%)
 Frame = -1

Query: 2285 MAKDEA--LTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYP 2112
            MAK++   ++VKD V++LQL L +G+  E++LFAAG++MS+SDY DVV ERSIAN+CGYP
Sbjct: 1    MAKNQPPPISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYP 60

Query: 2111 LCRNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKP 1932
            LC N LPS+R RKGRYRISLKEHKVYDLHETY YCS+ C++NSR FA SL++ER   L  
Sbjct: 61   LCPNPLPSDRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDS 120

Query: 1931 AKLSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEG 1752
            A++  VLR+F   S  + +    K++  G S L+++EKT+    +VS+E+W GPSNAIEG
Sbjct: 121  ARIDAVLRMFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEG 180

Query: 1751 YVPRRDHTPQPKRQKELEKGQKPKLGQVNKER---SMIFNEMDFT--------------S 1623
            YV +R+  P+    K  ++G K     +  +    S I  E ++T              S
Sbjct: 181  YVLQRERKPKELGSKSPKRGSKANNTVLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDS 240

Query: 1622 AIIMQDEYNVSKAMDQTETVSSQKNKEAKR------------------------KVETDE 1515
             +  Q+E    KAM     V       A                          + E + 
Sbjct: 241  KVREQEEILAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRAGSCLSSARAEEES 300

Query: 1514 RKDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXXXXXXXXXXXLSISDPVGLDQ 1335
              DK++   E+++  S +  +K     + L R V                 I +   + +
Sbjct: 301  HDDKAEKCTEASIKSSLKPSRK-----KKLSRTVTWADEKTDSSGGRKLCEIREIEDMKE 355

Query: 1334 NTIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXXXXSCSVTWADEQTDGHDNKNLCN 1155
            +    ++    S   + K  A                    SV WADE+ D   + ++C 
Sbjct: 356  DPSVVENKNGVSFTSSGKMKAGQ------------------SVIWADEKGDSSKSIDVCE 397

Query: 1154 YNECEKNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAG 975
              E E  ++++    +AD   +D+++RF                 AS + +V D +S+AG
Sbjct: 398  VREIEDAKEAADMLCNADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAG 457

Query: 974  VIILPPPRESDGAETQEEGN---MLDPERASIKWPSKPGLPNSDLFESDGSWYENPPDGF 804
            +IILP P   D  E  EE +     +PE+A IKWP KPG  +SDLF+ + SW++ PP+ F
Sbjct: 458  IIILPRPENGDEGEPMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDF 517

Query: 803  SLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIR 624
            SLTLSPFA M+ ALF WT+SS+LAYIYG DES HEEY  VNGREYP K V  DGRSSEI+
Sbjct: 518  SLTLSPFAKMWNALFTWTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIK 577

Query: 623  QALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPALRMKQWQLLVFLFL 444
            Q L+G LARALPG+VADLRL TP+S+LE+ + RLLDTMSF D LP  RMKQWQ+++ LFL
Sbjct: 578  QTLAGSLARALPGLVADLRLSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFL 637

Query: 443  DALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPLGRVPQFVMQSG 273
            +ALSV R+P L P+M  RRV   KVLD A+IS+EEYE+MKDL+IPLGR P F  QSG
Sbjct: 638  EALSVYRLPALTPHMMYRRVLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  528 bits (1359), Expect = e-147
 Identities = 306/681 (44%), Positives = 413/681 (60%), Gaps = 16/681 (2%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK++++ +KD V+KLQL L+EGI++EN+LFAAG+LMS+SDY+DVV ERSIA++CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             + LPS+ TR+GRYRISLKEHKVYDL ETY YCS++CL+NSRAF+G L++ER   + P K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L E+L+LF  MSL DSK+N+     SG   L +QEK ++   EV +EEW+GPSNAIEGYV
Sbjct: 121  LKEILKLFENMSL-DSKENMGNNCDSG---LEIQEKIESNIGEVPIEEWMGPSNAIEGYV 176

Query: 1745 PRRDH---TPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMDQ 1575
            P RDH   T   K  KE + G K K+  +   +   F++   TS II  +EY+VSK    
Sbjct: 177  PHRDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDF-FSDFSITSTIITDEEYSVSKISSG 235

Query: 1574 TETVSSQKNKEAKRKVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXX 1395
             + ++   N     K +T E   K  N  + A+ ++       + S     R        
Sbjct: 236  LKEMALDTNS----KNQTGEFCGKESN-DQFAILETPHAPAPPKNSVGRKARGSKERTKV 290

Query: 1394 XXXXXXXXXLSISDPVGLDQNTIYKKSTE-----------ADSKFHTEKASASNACXXXX 1248
                     LS +     +++T +   TE            + K   +K    N C    
Sbjct: 291  SATKESTDNLSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCR--- 347

Query: 1247 XXXXXXXXXXSCSVTWADEQTDGHDNKNLCNYNECEKNRDSS-SQSGSADIDMDDNSY-R 1074
                        SVTWADE+TD     NL    E  K ++ S + S   + D D+    R
Sbjct: 348  ------------SVTWADEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILR 395

Query: 1073 FXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERA 894
                               SG S+V+D VS+AG+IILP P +++   + +  N  +P   
Sbjct: 396  VESAEACAMALSQAAEAITSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSF 455

Query: 893  SIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGND 714
            S K  +K G+  SDLF+   SWY+ PP+GFSLTLS FA M+MA+FAW +SSSLAYIYG D
Sbjct: 456  SEK-SNKLGVLRSDLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKD 514

Query: 713  ESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLERE 534
            +  HEE+L ++G+EYP K V  DGRSSEI+Q L+GCL RA+PG+ ++L L TP+S LE  
Sbjct: 515  DKFHEEFLYIDGKEYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENG 574

Query: 533  IDRLLDTMSFTDPLPALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAK 354
            +  LLDTM+F D LPA RMKQWQ++V LF++ALSV RIP+LA +M+  R    KVLD A+
Sbjct: 575  MAHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQ 634

Query: 353  ISSEEYEIMKDLIIPLGRVPQ 291
            I S+EYEIM+D I+PLGR  Q
Sbjct: 635  IRSDEYEIMRDHILPLGRTAQ 655


>gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  521 bits (1343), Expect = e-145
 Identities = 309/724 (42%), Positives = 422/724 (58%), Gaps = 59/724 (8%)
 Frame = -1

Query: 2267 LTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLCRNTLPS 2088
            ++VKD V+KLQL L EGI+ ++ L+ AG+++S+SDY DVV ER+IAN+CGYPLC N LPS
Sbjct: 13   ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72

Query: 2087 ERTR--KGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAKLSEV 1914
            + +R  KG YRISLKEHKVYDLHETY YCS+ C++ S+AFA SL EER   L   K+  +
Sbjct: 73   DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132

Query: 1913 LRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEW--------------- 1779
            LR F  +  +  +    +    G+S L+++EK +T   ++ +                  
Sbjct: 133  LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192

Query: 1778 VGPSNAIEGYVPRRDHTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEY 1599
            VGPSNAIEGYVP+++   +P   K+ ++G K K  +++    +IFNEMDF S II  DEY
Sbjct: 193  VGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTIITSDEY 252

Query: 1598 NVSKAMDQT-ETVSSQKNKEAKRKVETDERKD----------KSKNLGESAVC------- 1473
            +VSK      E     K K++K KV  ++             K+KN+ +  VC       
Sbjct: 253  SVSKIPPSVGEPDFETKFKKSKGKVGLNKNDSVKKSRQSKGGKNKNVKKDDVCIREVPST 312

Query: 1472 ---------------KSSQVHKKCQKSDEPLCRNVMGEQXXXXXXXXXXXLS-ISDPVGL 1341
                           K   + +K ++S E L R+ +                 + D  G 
Sbjct: 313  SDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNRSVTWADEMIDSTG- 371

Query: 1340 DQNTIYK--------KSTEADSKFHTEKASASNACXXXXXXXXXXXXXXSCSVTWADEQT 1185
                +Y+        + ++A S  H  K S  N                 CS TW DE+ 
Sbjct: 372  -SRNLYEVREMEQIMEYSDAFSSMH--KPSVENKVG--------------CSNTWFDEKI 414

Query: 1184 DGHDNKNLCNYNECEKNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXXXXXXXASGDS 1005
            D   +KN+C   E +     +   GS  +D+ +N                     ASG+S
Sbjct: 415  DSTKSKNICEVREVQ----DADVLGS--LDLQENEI-LESAEACAMALNQAAEAVASGES 467

Query: 1004 DVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNSDLFESDGSWY 825
            DV+  VS AG+IILP P   D  E  E+ +ML+ E+A + WP KPG+P SDLF+ + SW+
Sbjct: 468  DVSGAVSGAGIIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWF 526

Query: 824  ENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGREYPRKAVLTD 645
            + PP+GFS+TLSPFA M+ +LF W +SS+LAYIYG DES HEE+L VNGREYP K VL  
Sbjct: 527  DAPPEGFSVTLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAG 586

Query: 644  GRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDPLPALRMKQWQ 465
            GRSSEI++ L    ARALPGVV++LRLPTP+S+LE+ + R+L+TMSF D +PA RMKQWQ
Sbjct: 587  GRSSEIKKTLDESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQ 646

Query: 464  LLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLIIPLGRVPQFV 285
            ++V LFL+ LSVCRIP L P+MT RR+   KVL+  +IS+E+YE+MKDLIIPLGR PQF 
Sbjct: 647  VIVLLFLEGLSVCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFS 706

Query: 284  MQSG 273
             QSG
Sbjct: 707  AQSG 710


>ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 632

 Score =  516 bits (1330), Expect = e-143
 Identities = 299/668 (44%), Positives = 406/668 (60%), Gaps = 3/668 (0%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK++++ +KD V+KLQL L+EGI++EN+LFAAG+LMS+SDY+DVV ERSIA++CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             + LPS+ TR+GRYRISLKEHKVYDL ETY YCS++CL+NSRAF+G L++ER   + P K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L E+L+LF  MSL DSK+N+     SG   L +QEK ++   EV +EEW+GPSNAIEGYV
Sbjct: 121  LKEILKLFENMSL-DSKENMGNNCDSG---LEIQEKIESNIGEVPIEEWMGPSNAIEGYV 176

Query: 1745 PRRDH---TPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSKAMDQ 1575
            P RDH   T   K  KE + G K K+  +   +   F++  FTS II  +EY+VSK    
Sbjct: 177  PHRDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDF-FSDFSFTSTIITDEEYSVSKISSG 235

Query: 1574 TETVSSQKNKEAKRKVETDERKDKSKNLGESAVCKSSQVHKKCQKSDEPLCRNVMGEQXX 1395
             + ++   N     K +T E   K  N  + A+ ++       + S     R        
Sbjct: 236  LKEMALDTNS----KNQTGEFCGKKSN-DQFAILETPHAPAPPKNSVGRKARGSKERTKV 290

Query: 1394 XXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASNACXXXXXXXXXXXXXXS 1215
                     LS +     +++T +   TE      T+ AS  N                 
Sbjct: 291  SATKESTDNLSDAPSTSNNRSTNFNLMTEEPRDEKTDDASIMNL-----PEVGEMGKTKE 345

Query: 1214 CSVTWADEQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMDDNSYRFXXXXXXXXXXXX 1035
            CS T ++     +DN++L      E    + SQ+  A                       
Sbjct: 346  CSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKA----------------------- 382

Query: 1034 XXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNMLDPERASIKWPSKPGLPNS 855
                  SG S+V+D VS+AG+IILP P +++   + +  N  +P   S K  +K G+  S
Sbjct: 383  ----ITSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRS 437

Query: 854  DLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAYIYGNDESHHEEYLCVNGR 675
            DLF+   SWY+ PP+GFSLTLS FA M+MA+FAW +SSSLAYIYG D+  HEE+L ++G+
Sbjct: 438  DLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGK 497

Query: 674  EYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLSTLEREIDRLLDTMSFTDP 495
            EYP K V  DGRSSEI+Q L+GCL RA+PG+ ++L L TP+S LE  +  LLDTM+F D 
Sbjct: 498  EYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDA 557

Query: 494  LPALRMKQWQLLVFLFLDALSVCRIPTLAPYMTGRRVSLPKVLDGAKISSEEYEIMKDLI 315
            LPA RMKQWQ++V LF++ALSV RIP+LA +M+  R    KVLD A+I S+EYEIM+D I
Sbjct: 558  LPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHI 617

Query: 314  IPLGRVPQ 291
            +PLGR  Q
Sbjct: 618  LPLGRTAQ 625


>gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
          Length = 708

 Score =  509 bits (1312), Expect = e-141
 Identities = 286/629 (45%), Positives = 377/629 (59%), Gaps = 21/629 (3%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK+++++V +AVHK+QL L +GI+DE +L A+G+L+S+SDY+DVV ER+I+N CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPSE  RKGRYRISLKEHKVYDL ETY +CST+CL+NSRAFAGSL+EER   L  AK
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+++L LF  + L+D  ++L K    G S+LR++E  + K  +VS+    GPSNAIEGYV
Sbjct: 175  LNDILSLFGDLDLDD--NDLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1745 PRRD----HTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSK--- 1587
            P+R+     TP    + ++      KLG   KE   + NE+DF   IIM DEY +SK   
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGS-KKEEYFVNNELDFAGTIIMNDEYIISKKPG 288

Query: 1586 AMDQTETVSSQKNKE---------AKRKVETDERKDKSKNLGESAVCKSSQV----HKKC 1446
            +  Q +       KE             +  DE        G    C  S +     K  
Sbjct: 289  SFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGI 348

Query: 1445 QKSDEPLCRNVMGEQXXXXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASN 1266
             K  E  C  V+              + +     + Q+ +   S EA+ + H +KA  S+
Sbjct: 349  CKDSEDKC--VISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406

Query: 1265 ACXXXXXXXXXXXXXXSCSVTWAD-EQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMD 1089
                            +  VTWAD ++ D   N NLC   E E  +  S  SGSA+   D
Sbjct: 407  ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGD 466

Query: 1088 DNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNML 909
            DN  RF                 ASGDSDV D V + G+IILP   E D  E  E+G+ML
Sbjct: 467  DNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDML 526

Query: 908  DPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAY 729
            +PE A +KWP KPG+P+SD+F  + SW++ PP+GFSLTLS FA M+ ALF W +SSSLAY
Sbjct: 527  EPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAY 586

Query: 728  IYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLS 549
            IYG DES HEEYL +NGREYPRK  L DGRSSEI++ L+ C++RALP +V DLRLP P+S
Sbjct: 587  IYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPIS 646

Query: 548  TLEREIDRLLDTMSFTDPLPALRMKQWQL 462
            TLE+ +  L+DT+SF + LPA RMKQW++
Sbjct: 647  TLEQGMGHLIDTISFMEALPAFRMKQWEI 675


>gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
          Length = 679

 Score =  508 bits (1309), Expect = e-141
 Identities = 287/631 (45%), Positives = 377/631 (59%), Gaps = 21/631 (3%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK+++++V +AVHK+QL L +GI+DE +L A+G+L+S+SDY+DVV ER+I+N CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPSE  RKGRYRISLKEHKVYDL ETY +CST+CL+NSRAFAGSL+EER   L  AK
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+++L LF  + L+D  ++L K    G S+LR++E  + K  +VS+    GPSNAIEGYV
Sbjct: 175  LNDILSLFGDLDLDD--NDLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229

Query: 1745 PRRD----HTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSK--- 1587
            P+R+     TP    + ++      KLG   KE   + NE+DF   IIM DEY +SK   
Sbjct: 230  PQRELISKPTPPKNNKNKVFDSSSSKLGS-KKEEYFVNNELDFAGTIIMNDEYIISKKPG 288

Query: 1586 AMDQTETVSSQKNKE---------AKRKVETDERKDKSKNLGESAVCKSSQV----HKKC 1446
            +  Q +       KE             +  DE        G    C  S +     K  
Sbjct: 289  SFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGI 348

Query: 1445 QKSDEPLCRNVMGEQXXXXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASN 1266
             K  E  C  V+              + +     + Q+ +   S EA+ + H +KA  S+
Sbjct: 349  CKDSEDKC--VISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406

Query: 1265 ACXXXXXXXXXXXXXXSCSVTWAD-EQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMD 1089
                            +  VTWAD ++ D   N NLC   E E  +  S  SGSA+   D
Sbjct: 407  ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGD 466

Query: 1088 DNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNML 909
            DN  RF                 ASGDSDV D V + G+IILP   E D  E  E+G+ML
Sbjct: 467  DNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDML 526

Query: 908  DPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAY 729
            +PE A +KWP KPG+P+SD+F  + SW++ PP+GFSLTLS FA M+ ALF W +SSSLAY
Sbjct: 527  EPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAY 586

Query: 728  IYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLS 549
            IYG DES HEEYL +NGREYPRK  L DGRSSEI++ L+ C++RALP +V DLRLP P+S
Sbjct: 587  IYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPIS 646

Query: 548  TLEREIDRLLDTMSFTDPLPALRMKQWQLLV 456
            TLE+ +  L+DT+SF + LPA RMKQW  +V
Sbjct: 647  TLEQGMGHLIDTISFMEALPAFRMKQWCWMV 677


>gb|EOY34548.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao]
          Length = 607

 Score =  480 bits (1236), Expect = e-132
 Identities = 273/604 (45%), Positives = 358/604 (59%), Gaps = 21/604 (3%)
 Frame = -1

Query: 2285 MAKDEALTVKDAVHKLQLCLFEGIQDENKLFAAGALMSQSDYQDVVLERSIANICGYPLC 2106
            MAK+++++V +AVHK+QL L +GI+DE +L A+G+L+S+SDY+DVV ER+I+N CGYPLC
Sbjct: 1    MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 60

Query: 2105 RNTLPSERTRKGRYRISLKEHKVYDLHETYTYCSTSCLVNSRAFAGSLREERSPDLKPAK 1926
             N LPSE  RKGRYRISLKEHKVYDL ETY +CST+CL+NSRAFAGSL+EER   L  AK
Sbjct: 61   ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 120

Query: 1925 LSEVLRLFAGMSLEDSKDNLKKEKKSGVSDLRVQEKTDTKGREVSVEEWVGPSNAIEGYV 1746
            L+++L LF  + L+D  ++L K    G S+LR++E  + K  +VS+    GPSNAIEGYV
Sbjct: 121  LNDILSLFGDLDLDD--NDLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 175

Query: 1745 PRRD----HTPQPKRQKELEKGQKPKLGQVNKERSMIFNEMDFTSAIIMQDEYNVSK--- 1587
            P+R+     TP    + ++      KLG   KE   + NE+DF   IIM DEY +SK   
Sbjct: 176  PQRELISKPTPPKNNKNKVFDSSSSKLGS-KKEEYFVNNELDFAGTIIMNDEYIISKKPG 234

Query: 1586 AMDQTETVSSQKNKE---------AKRKVETDERKDKSKNLGESAVCKSSQV----HKKC 1446
            +  Q +       KE             +  DE        G    C  S +     K  
Sbjct: 235  SFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGI 294

Query: 1445 QKSDEPLCRNVMGEQXXXXXXXXXXXLSISDPVGLDQNTIYKKSTEADSKFHTEKASASN 1266
             K  E  C  V+              + +     + Q+ +   S EA+ + H +KA  S+
Sbjct: 295  CKDSEDKC--VISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 352

Query: 1265 ACXXXXXXXXXXXXXXSCSVTWAD-EQTDGHDNKNLCNYNECEKNRDSSSQSGSADIDMD 1089
                            +  VTWAD ++ D   N NLC   E E  +  S  SGSA+   D
Sbjct: 353  ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGD 412

Query: 1088 DNSYRFXXXXXXXXXXXXXXXXXASGDSDVADIVSDAGVIILPPPRESDGAETQEEGNML 909
            DN  RF                 ASGDSDV D V + G+IILP   E D  E  E+G+ML
Sbjct: 413  DNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDML 472

Query: 908  DPERASIKWPSKPGLPNSDLFESDGSWYENPPDGFSLTLSPFAMMFMALFAWTSSSSLAY 729
            +PE A +KWP KPG+P+SD+F  + SW++ PP+GFSLTLS FA M+ ALF W +SSSLAY
Sbjct: 473  EPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAY 532

Query: 728  IYGNDESHHEEYLCVNGREYPRKAVLTDGRSSEIRQALSGCLARALPGVVADLRLPTPLS 549
            IYG DES HEEYL +NGREYPRK  L DGRSSEI++ L+ C++RALP +V DLRLP P+S
Sbjct: 533  IYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPIS 592

Query: 548  TLER 537
            TLE+
Sbjct: 593  TLEQ 596

