BLASTX nr result

ID: Rehmannia26_contig00009616

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00009616
         (2326 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   714   0.0  
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   711   0.0  
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   686   0.0  
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   674   0.0  
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   654   0.0  
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   651   0.0  
gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus...   642   0.0  
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...   630   e-178
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   625   e-176
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   623   e-175
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...   622   e-175
gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma c...   619   e-174
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     611   e-172
gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise...   595   e-167
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   590   e-166
gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus pe...   585   e-164
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   574   e-161
gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobro...   569   e-159
gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]      528   e-147
gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]      528   e-147

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  714 bits (1843), Expect = 0.0
 Identities = 390/669 (58%), Positives = 485/669 (72%), Gaps = 25/669 (3%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M  D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             NSL +ER  KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN  +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            +N +L+LF   SL+S+  +GK+GDLGLS LKI+E  +  AG+V++E+WIGPSNAI+GYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 589  RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 756
            +RDR+LK     N+K   +   SK      +  + +  +M+F STIIT+DEYSISK    
Sbjct: 181  QRDRNLKPKNIKNHKEGSKSSNSK----MDSGKNFVIDEMDFVSTIITKDEYSISKSSKG 236

Query: 757  ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 918
               T    K+KEPK KAS   +  Q + ++K   P+ N  E++   SK +   VI KD+ 
Sbjct: 237  LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 292

Query: 919  LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXXA-----TRSVTWADE 1071
             S  E  + PSQ+ S     K  +E     A              +      RSVTWADE
Sbjct: 293  FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE 352

Query: 1072 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEEP--YRFASAEACAMALTQAAEEVA 1242
            K D  D ++  + REL+ KK     +   D +VG++    RFASAEACA+AL+QAAE VA
Sbjct: 353  KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVA 410

Query: 1243 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 1413
            SG+++ +DAVSEAG+IILP P   DE E+    D++E +P+ LKWP KPG          
Sbjct: 411  SGETDMTDAVSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470

Query: 1414 XXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 1593
              WYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI
Sbjct: 471  DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530

Query: 1594 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 1773
            V+ DGRSSEIKQTLAGCL+RALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM
Sbjct: 531  VLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590

Query: 1774 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 1953
            KQW  IVLLF+DALSV RIPALTP+M  RR+L PKV + AQ+SAEE+E+MKDLIIPLGRV
Sbjct: 591  KQWQVIVLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650

Query: 1954 PQFSTQSGG 1980
            PQFS QSGG
Sbjct: 651  PQFSAQSGG 659


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  711 bits (1834), Expect = 0.0
 Identities = 388/669 (57%), Positives = 484/669 (72%), Gaps = 25/669 (3%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M  D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             NSL +ER  KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN  +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            +N +L+LF   SL+S+  +GK+GDLGLS LKI+E  +  AG+V++E+WIGPSNAI+GYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 589  RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 756
            +RDR+LK     N K   +   SK      +  + +  +M+F  TIIT+DEYSISK    
Sbjct: 181  QRDRNLKPKNIKNRKEGSKSSNSK----MDSGKNFVIDEMDFVRTIITEDEYSISKSSKG 236

Query: 757  ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 918
               T    K+KEPK KAS   +  Q + ++K   P+ N  E++   SK +   VI KD+ 
Sbjct: 237  LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 292

Query: 919  LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXXA-----TRSVTWADE 1071
             S  E  + PSQ+ S     K  +E     A              +     TRSVTWADE
Sbjct: 293  FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE 352

Query: 1072 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEEP--YRFASAEACAMALTQAAEEVA 1242
            K D  D ++  + REL+ KK     +   D +VG++    RFASAEACA+AL+QAAE VA
Sbjct: 353  KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVA 410

Query: 1243 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 1413
            SG+++ +DAVSEA +IILP P   DE E+    D++E +P+ LKWP KPG          
Sbjct: 411  SGETDMTDAVSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470

Query: 1414 XXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 1593
              WYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI
Sbjct: 471  DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530

Query: 1594 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 1773
            V+ DGRSSEIKQTLAGCLARALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM
Sbjct: 531  VLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590

Query: 1774 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 1953
            KQW  IVLLF+DALSV +IPALTP+M+ +R+L PKV + AQ+SAEE+E+MKDLIIPLGRV
Sbjct: 591  KQWQVIVLLFIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650

Query: 1954 PQFSTQSGG 1980
            PQFS QSGG
Sbjct: 651  PQFSAQSGG 659


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  686 bits (1769), Expect = 0.0
 Identities = 386/674 (57%), Positives = 469/674 (69%), Gaps = 30/674 (4%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M K E + VKDAVHKLQL LL+GIK E+QL AAGSL+SRSDYQDVVTER+IAN+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             NSL +ER  KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LN+VL LF GL L S  ++ +NGD G S LKIQEK D   G+V+LEEW+GPSNAI+GYVP
Sbjct: 121  LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180

Query: 589  RRDRDLKHPQSNN-NKGERREVGSKHRHVR-PNAADILSYDMNFTSTIITQDEYSISKTV 762
            +RDR +      N NKG      SK++H R  +  +++  + +F+STIITQDEYS+SK  
Sbjct: 181  QRDRSVNPALLKNINKG------SKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSK-F 233

Query: 763  PA-------VKAKEPKGKASSKE-------VNRQSNPVQKPTAPLTNIQETRSKNKSKNV 900
            PA       VK KE + K   K        + +Q + +Q     L + +ET   +K+   
Sbjct: 234  PAPVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDALQ-----LRSGEETEKSDKNTRF 288

Query: 901  ITKDDKLSLLENIAGPSQND---------STKAVKELQESTAGAXXXXXXXXXXXXATRS 1053
            + K DK +  E  +GPSQ+D         S    K                      +RS
Sbjct: 289  L-KVDKFNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRS 347

Query: 1054 VTWADEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEEPYRFASAEACAMALTQA 1227
            VTWADE  DG  G+      ++ + +  A   S S D E  ++ YRF SAEACA AL+QA
Sbjct: 348  VTWADESIDGGIGKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQA 407

Query: 1228 AEEVASGKSEASDAVSEAGVIILPPPHGEDE---EENGDVMETDPLQLKWPPKPGXXXXX 1398
            AE VASG S+  DAVS+AG++ILPP    DE   +E  ++++ +   LKWP KPG     
Sbjct: 408  AEAVASG-SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYD 466

Query: 1399 XXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGRE 1578
                   WYDSPPEGFN+TLSPF TMF +LF+W+SSSSLA+IYG +ES +EEYLS+NGRE
Sbjct: 467  VFESEDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGRE 526

Query: 1579 YPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPL 1758
            YP+KIV+ DGRS+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM  LL+TMSF+DPL
Sbjct: 527  YPRKIVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPL 586

Query: 1759 PAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLII 1938
            PAFRMKQW  IVLLFLDALSV RIP LTPYM  RR   PKV++GAQISA E+EIMKDLII
Sbjct: 587  PAFRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLII 646

Query: 1939 PLGRVPQFSTQSGG 1980
            PLGRVPQFS QSGG
Sbjct: 647  PLGRVPQFSMQSGG 660


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  674 bits (1740), Expect = 0.0
 Identities = 381/669 (56%), Positives = 466/669 (69%), Gaps = 25/669 (3%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M K E + VKDAVHKLQL LL+GIK ENQL AAGSL+SRSDYQDVVTER+IAN+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             NSL +ER  KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAG-QVALEEWIGPSNAIDGYV 585
            LN+VL LF GL L S  ++ +NGDLG S LKIQEK D   G +V+LEEW+GPSNAI+GYV
Sbjct: 121  LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180

Query: 586  PRRDRDLKHPQSNN-NKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK-- 756
            P+RDR +      N NKG +    +KH  ++     IL+ + +F+STIITQDEYS+SK  
Sbjct: 181  PQRDRSVNPALLKNINKGFK----NKHARLQDEKNMILN-EFDFSSTIITQDEYSVSKFP 235

Query: 757  ----TVPAVKAKEPKGKASSKEVNRQSNPVQK--PTAPLTNIQETRSKNKSKNVITKDDK 918
                 V + K KE + K   K  +   + + K      L + +ET   +K+   + K DK
Sbjct: 236  APVNAVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFL-KVDK 294

Query: 919  LSLLENIAGPSQND-STKAVKELQ----------ESTAGAXXXXXXXXXXXXATRSVTWA 1065
             +  E  +GPSQ+D   K+V  +           E                  ++SVTWA
Sbjct: 295  FNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWA 354

Query: 1066 DEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEV 1239
            DE  DG  G+      ++ + +  A   S S D E  ++ YRF SAEACA AL+QAAE V
Sbjct: 355  DEIIDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAV 414

Query: 1240 ASGKSEASDAVSEAGVIILPPPHGEDEE--ENGDVMETDPLQLKWPPKPGXXXXXXXXXX 1413
            ASG S+  DAVS+AG++ILP     DE   +  ++++ +P  LKWP KPG          
Sbjct: 415  ASG-SDVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEPAPLKWPRKPGMPNYDVFESE 473

Query: 1414 XXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 1593
              WYD PPEGFN+TLSPF+TMF +LF+W+SSSSLA+IYG +E+ +EEYLS+NGREYP KI
Sbjct: 474  DCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPHKI 533

Query: 1594 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 1773
            V+ DG S+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM  LL+TMSF+DPLPAFRM
Sbjct: 534  VLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRM 593

Query: 1774 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 1953
            KQW  IVLLFLDALSV RIP LTPYM  RR  LPKV++GAQIS  E+EIMKDLIIPLGRV
Sbjct: 594  KQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLGRV 653

Query: 1954 PQFSTQSGG 1980
            PQFS QSGG
Sbjct: 654  PQFSMQSGG 662


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  654 bits (1686), Expect = 0.0
 Identities = 357/655 (54%), Positives = 464/655 (70%), Gaps = 18/655 (2%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M K+E ++VKD V+KLQLSLL+GI++E+QL AAGSL+SRSDY+DVV ER+I+N+CGYPLC
Sbjct: 1    MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             NSL ++RP+KGRYRISLKEH+VYDLQETYMYCSSSCL+NSRAF+ SLQE+R S LNP K
Sbjct: 61   NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LNE+L+ F+ L+LDS+  +G++GDLGLS LKIQEK++T  G+V+LEEWIGPSNAI+GYVP
Sbjct: 121  LNEILRKFNDLTLDSE-GLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179

Query: 589  RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 756
            + DRD  +P   N+K   + +  K      +  D    D +FTSTIIT DEYSISK    
Sbjct: 180  QGDRD-PNPSLKNHKEGLKAICKKP----VSKQDCFFSDTDFTSTIITNDEYSISKGPSG 234

Query: 757  ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSL 927
               T   +K +   GK   + +N Q + ++K  +   +    +SK + K  + K+     
Sbjct: 235  LTSTASDIKLQAQTGKGH-EGLNAQLSSLRKQDSIKAS---RKSKGRRKEKVIKEQ---- 286

Query: 928  LENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXXAT------RSVTWADEKTDGDG 1089
            L     PS +  T   +++ ++T  A            ++      RSVTWADE+ D  G
Sbjct: 287  LNFQDLPSSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADERVDNAG 346

Query: 1090 -QNLNECRELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASD 1266
             +NL E +E++    +   S SA++       RF SAEACA+AL+QAAE VASG ++ + 
Sbjct: 347  SRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNK 406

Query: 1267 AVSEAGVIILPPPH----GEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSP 1434
            A+SEAG+I+LPP      G + E+N D++E +   LKWP KPG            WYD+P
Sbjct: 407  AMSEAGIIVLPPSQDLGQGGNVEKN-DMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAP 465

Query: 1435 PEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRS 1614
            PEGF+LTLSPF+TM+MALF+WV+SSSLAYIYG++ES HE+YLSVNGREYP+KIV+ DGRS
Sbjct: 466  PEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRS 525

Query: 1615 SEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIV 1794
            SEI+ T   CLAR  PGLVA LRLPIPVSTLEQG GRLL+TMSF+D LPAFR KQW  I 
Sbjct: 526  SEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIA 585

Query: 1795 LLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQ 1959
            LLF++ALSV RIPALT YM  RR++L +V++GA ISAEE++IMKD ++PLGR PQ
Sbjct: 586  LLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ 640


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  651 bits (1680), Expect = 0.0
 Identities = 371/709 (52%), Positives = 469/709 (66%), Gaps = 66/709 (9%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M KD+   VKD ++KLQLSLLDGI++E+QL AAGS++S SDY+DVVTERTIAN+CGYPLC
Sbjct: 1    MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
            GNSL ++RP KGRYRISLKEHKVYDL ETYMYCSSSC+INSR F+ SLQEER   LNPAK
Sbjct: 61   GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LNEVL LFD  SL S+ ++GKNGDLG S LKI+EKT+ V G+V+ E+WIGPSNAI+GYVP
Sbjct: 121  LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180

Query: 589  RRDR--------DL---------------------------KHPQSNNNKGERR-EVGSK 660
            +RDR        D+                           K  Q    KG  +   GSK
Sbjct: 181  QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240

Query: 661  HRHVRPNAA-DILSYDMNFTSTII-TQDEYSISK-------TVPAVKAKEPKGKASSKEV 813
             +  + ++  +    DMNFTSTII TQDEYSISK       T    K ++ K K S K  
Sbjct: 241  AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300

Query: 814  NRQSNPVQKPTAPLTN--IQETRSKNKSKNVITKDDKLSLLENIAGPS---------QND 960
              QS+  +K  +  T+  ++E RSK   K+ ++  D  S  ++    S         ++ 
Sbjct: 301  ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360

Query: 961  STKAVKELQES------TAGAXXXXXXXXXXXXATRSVTWADEKTDGDG-QNLNECRELK 1119
            S KA K ++ S      T+GA             TRSVTWADEK    G ++L E R ++
Sbjct: 361  SEKAAKPVESSLKPSLKTSGAKQL----------TRSVTWADEKVGSSGSRDLCEVRGME 410

Query: 1120 DKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILP 1299
            D K       + D+       +F SAEACA AL+QAAE VASG ++AS+A+SEAG++ILP
Sbjct: 411  DTKAGPEIVDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILP 470

Query: 1300 PPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFS 1470
             PH  D+    E+ DV++ +   +KWP KPG            WYD+PPEGF+L LS F+
Sbjct: 471  QPHDLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFA 530

Query: 1471 TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 1650
            T++MALF+WV+SSSLAY+YGK+ES HEEYL VNGREYP+KIV+ DGRS EI+QT+ GCL 
Sbjct: 531  TIWMALFAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLG 590

Query: 1651 RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 1830
            RA P +VA+LRLPIP+STLEQG   LL TMSF+D +PAFRMKQW  I LLF++ALSV RI
Sbjct: 591  RAFPVVVADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRI 650

Query: 1831 PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977
            PAL  YM +RR+    V++G ++SAEE+E+MKDL+IPLGR PQFS QSG
Sbjct: 651  PALISYMDNRRM----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695


>gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris]
          Length = 706

 Score =  642 bits (1656), Expect = 0.0
 Identities = 368/713 (51%), Positives = 464/713 (65%), Gaps = 70/713 (9%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M KD+ ++VKDAV KLQ+ LL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC
Sbjct: 1    MAKDKAVSVKDAVFKLQMLLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             N+L +ERP KG+YRISLKEHKVYDLQETYM+CSS+C+++S+AF+  LQ ER S L+P K
Sbjct: 61   CNALPSERPRKGKYRISLKEHKVYDLQETYMFCSSNCVVSSKAFSGILQAERCSALDPEK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LN VL LF+ L+L+   N+ K+GDLGLS LKIQEKT T +G+V LE+W+GPSNAI+GYVP
Sbjct: 121  LNNVLGLFENLNLEQTENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYVP 180

Query: 589  RRDRDLKHPQSNNNKGERREV--GSKHRHVRPNA-ADILSYDMNFTSTIITQDEYSISKT 759
            +       P+   +KG R+ V  GSK  H + N   D+++ +MNF STII QDEYS+SK 
Sbjct: 181  K-------PRERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKA 233

Query: 760  VPA-----------------------------------------------VKAKEPKGKA 798
             P                                                + A E KGK 
Sbjct: 234  SPGQTDTTAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASE-KGKE 292

Query: 799  SSK--EVNRQSNP---VQKPTAPLTNIQETR---SKNKS--KNVITKDDKLSLLENIAGP 948
             SK  EV  +S P   ++K  A   +I E      KN S  K+V  K +   +  N    
Sbjct: 293  VSKSCEVVVKSTPNLAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDAS 352

Query: 949  SQNDSTKAVKE-LQESTAGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-QNLNEC 1107
            + N     VKE  Q    G             A     +R+VTWADEK +G G ++L E 
Sbjct: 353  TSNFDPDNVKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADEKINGAGNKDLCEV 412

Query: 1108 RELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1287
            +E  D      +  + D    E+  R ASAEACA+AL+QA+E VASG S+A+DAVSEAG+
Sbjct: 413  KEFGDIIKESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGI 472

Query: 1288 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTL 1458
            IILP PH   EE   E+ D+++ D + LKWP KPG            W+D+PPEGF+LTL
Sbjct: 473  IILPQPHDAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTL 532

Query: 1459 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 1638
            SPF+ M+ A+FSW++S SLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQT A
Sbjct: 533  SPFANMWNAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREYPCKVVLSDGRSSEIKQTFA 592

Query: 1639 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 1818
            GCLARA P LVA LRLPIP+STLEQGM  LL+TMSF+D LPAFR KQW  + LLF+DALS
Sbjct: 593  GCLARAFPALVAGLRLPIPISTLEQGMACLLETMSFVDALPAFRTKQWQVVALLFVDALS 652

Query: 1819 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977
            V RIP+L  YM DRR L  KV+ G+QI  EE+EI+KDL++PLGR P  S QSG
Sbjct: 653  VCRIPSLISYMTDRRALFHKVLSGSQIGMEEYEILKDLVVPLGRAPHISVQSG 705


>ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Glycine max]
          Length = 706

 Score =  630 bits (1626), Expect = e-178
 Identities = 354/713 (49%), Positives = 463/713 (64%), Gaps = 70/713 (9%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC
Sbjct: 1    MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+  K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LN VL LF+ L+L+    + KNGDLGLS LKIQEKT+  +G+V+LE+W GPSNAI+GYVP
Sbjct: 121  LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180

Query: 589  RRDRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 759
            +       P++ ++KG R+ V  GSK  H +  +  ++++ +M F STII QDEYS+SK 
Sbjct: 181  K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233

Query: 760  VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 882
             P                 K+P+ K  ++ V +  + +Q      K +  L+  ++    
Sbjct: 234  PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292

Query: 883  NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 996
             KS   + K              +S+ E      QNDS +   +++  T           
Sbjct: 293  TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352

Query: 997  ----------------AGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-QNLNECR 1110
                            AG             A     +R+VTWADEK +  G ++L E +
Sbjct: 353  SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412

Query: 1111 ELKD-KKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1287
            E  D KK +    ++ D    E+  R ASAEACA+AL+ A+E VASG S+ SDAVSEAG+
Sbjct: 413  EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGI 472

Query: 1288 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTL 1458
             ILPPPH   EE   E+ D+++ D + LKWP K G            W+D+PPEGF+LTL
Sbjct: 473  TILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTL 532

Query: 1459 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 1638
            SPF+TM+  LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQTLA
Sbjct: 533  SPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLA 592

Query: 1639 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 1818
             CLARALP LVA LRLPIPVS +EQGM  LL+TMSF+D LPAFR KQW  + LLF+DALS
Sbjct: 593  SCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALS 652

Query: 1819 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977
            V R+PAL  YM DRR    +V+ G+QI  EE+E++KDL++PLGR P  S+QSG
Sbjct: 653  VCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSG 705


>ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Glycine max]
          Length = 706

 Score =  625 bits (1611), Expect = e-176
 Identities = 352/717 (49%), Positives = 454/717 (63%), Gaps = 74/717 (10%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC
Sbjct: 1    MEKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             N+L ++RP KGRYRISLKEHKVYDL ETYM+C S+C+++S+AFA SLQ ER S L+  K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LN +L LF+ L+L+   N+ KN D GLS LKIQEKT+T +G+V+LE+W GPSNAI+GYVP
Sbjct: 121  LNNILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVP 180

Query: 589  RRDRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 759
            +       P+ +++KG R+ V  GSK  H +P +  +++S +M F STII QD YS+SK 
Sbjct: 181  K-------PRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKV 233

Query: 760  VPAVKAK------------EPKGKASSKEVNRQSNPVQKPTAP---------------LT 858
            +P  +              +  GK  +K V +    +Q  ++                L 
Sbjct: 234  LPGQRDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELA 293

Query: 859  NIQETRSKNKSKNVITKDD--KLSLLENIAGPSQNDSTK--------------------- 969
               E   K+     I K D   +S+ E      QNDS K                     
Sbjct: 294  QSCEAALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTS 353

Query: 970  ------AVKELQESTAGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-------QN 1095
                    ++ Q   AG             A     +R+VTWAD+K +  G       +N
Sbjct: 354  NLDPANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADKKINSTGSKDLCGFKN 413

Query: 1096 LNECRELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVS 1275
              + R   D  G     +S D    E+  R ASAEAC +AL+ A+E VASG S+ SDAVS
Sbjct: 414  FGDIRNESDSAG-----NSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVS 468

Query: 1276 EAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGF 1446
            EAG+IILPPPH   EE   E+ D+++ D + +KWP KPG            W+D+ PEGF
Sbjct: 469  EAGIIILPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGF 528

Query: 1447 NLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIK 1626
            +LTLSPF+TM+  LFSW++SSSLAYIYG++ESF EEYLSVNGREYP K+V+ DGRSSEIK
Sbjct: 529  SLTLSPFATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIK 588

Query: 1627 QTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFL 1806
            QTLA CLARALP LVA LRLPIPVST+EQGM  LL+TMSF+D LPAFR KQW  + LLF+
Sbjct: 589  QTLASCLARALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFI 648

Query: 1807 DALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977
            DALSV R+PAL  YM DRR    +V+ G+QI  EE+E++KDL +PLGR P  S QSG
Sbjct: 649  DALSVCRLPALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSG 705


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  623 bits (1607), Expect = e-175
 Identities = 348/674 (51%), Positives = 451/674 (66%), Gaps = 31/674 (4%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M KD+ ++VKDAV KLQL+LL+GI+ E+QL AAGSLISRSDY+DVVTER+I  VC YPLC
Sbjct: 1    MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             N+L +ERP KGRYRISLKEHKVYDL ETYM+CSSSC++NS+AFA SL+++R   L+P K
Sbjct: 61   CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LN +L+LF   +L+   N GK+G+LGLS L+IQ+KT+TV  +V+LE+W+GPSNAI+GYVP
Sbjct: 121  LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVP 179

Query: 589  R-RDRDLKHPQSNNNKGERREVGSKHRHVRPNAA-DILSYDMNFTSTIITQDEYSISKTV 762
            + RD   K  Q N  KG      SK  H + N   ++++ + +F STII QDEYS+SK  
Sbjct: 180  KKRDNGSKGSQKNTKKG------SKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVS 233

Query: 763  -------------PAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSK--- 894
                         P    ++PK      E+ R+ + +Q  ++   +     +  K K   
Sbjct: 234  SGQTDATVDHQIKPTAILEQPK--RVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIA 291

Query: 895  ----NVITKDDKLSLLENIAGPSQNDSTKAVKELQ-ESTAGAXXXXXXXXXXXXAT---- 1047
                NV+          + +  S  D +   +++Q E   G+                  
Sbjct: 292  KSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLG 351

Query: 1048 RSVTWADEKTDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQ 1224
            RSVTWAD+K DG G  +L   +E  + K     + + D    E+  R  SAEACA+AL+Q
Sbjct: 352  RSVTWADKKIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQ 411

Query: 1225 AAEEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXX 1395
            AAE VASG S+A DAVSEAG+IILP      EE   ++ D++ETD + LKWP KPG    
Sbjct: 412  AAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISDF 471

Query: 1396 XXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGR 1575
                    W+D+PPEGF+LTLSPF+T++ A FSW++SSSLAYIYG++ SF+EE+LSV+GR
Sbjct: 472  DLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGR 531

Query: 1576 EYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDP 1755
            EYP KIV+ DGRSSEIKQTLA CLARALP +VAEL+LP+PVSTLEQGM  LLDTMSF+DP
Sbjct: 532  EYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDP 591

Query: 1756 LPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLI 1935
            LP FR KQW  + LLF+DALSV RIPAL  YM DRR L  KV+ G+QI  EE+ ++KDLI
Sbjct: 592  LPGFRFKQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLI 651

Query: 1936 IPLGRVPQFSTQSG 1977
            +PLGR P FS+QSG
Sbjct: 652  VPLGRAPHFSSQSG 665


>ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X2 [Glycine max]
          Length = 716

 Score =  622 bits (1605), Expect = e-175
 Identities = 354/723 (48%), Positives = 463/723 (64%), Gaps = 80/723 (11%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC
Sbjct: 1    MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+  K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LN VL LF+ L+L+    + KNGDLGLS LKIQEKT+  +G+V+LE+W GPSNAI+GYVP
Sbjct: 121  LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180

Query: 589  RRDRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 759
            +       P++ ++KG R+ V  GSK  H +  +  ++++ +M F STII QDEYS+SK 
Sbjct: 181  K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233

Query: 760  VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 882
             P                 K+P+ K  ++ V +  + +Q      K +  L+  ++    
Sbjct: 234  PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292

Query: 883  NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 996
             KS   + K              +S+ E      QNDS +   +++  T           
Sbjct: 293  TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352

Query: 997  ----------------AGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-QNLNECR 1110
                            AG             A     +R+VTWADEK +  G ++L E +
Sbjct: 353  SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412

Query: 1111 ELKD-KKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAV----- 1272
            E  D KK +    ++ D    E+  R ASAEACA+AL+ A+E VASG S+ SDAV     
Sbjct: 413  EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMN 472

Query: 1273 -----SEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYD 1428
                 SEAG+ ILPPPH   EE   E+ D+++ D + LKWP K G            W+D
Sbjct: 473  ETCAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFD 532

Query: 1429 SPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDG 1608
            +PPEGF+LTLSPF+TM+  LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DG
Sbjct: 533  APPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADG 592

Query: 1609 RSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHA 1788
            RSSEIKQTLA CLARALP LVA LRLPIPVS +EQGM  LL+TMSF+D LPAFR KQW  
Sbjct: 593  RSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQV 652

Query: 1789 IVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFST 1968
            + LLF+DALSV R+PAL  YM DRR    +V+ G+QI  EE+E++KDL++PLGR P  S+
Sbjct: 653  VALLFIDALSVCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISS 712

Query: 1969 QSG 1977
            QSG
Sbjct: 713  QSG 715


>gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
          Length = 739

 Score =  619 bits (1597), Expect = e-174
 Identities = 354/700 (50%), Positives = 454/700 (64%), Gaps = 51/700 (7%)
 Frame = +1

Query: 31   KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 211  CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 391  TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 571  IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735
            I+GYVP+R+   K     NNK +       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 736  DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 874  RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 988  ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518

Query: 1327 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSW 1497
               +GD++E +   +KWP KPG            W+D+PPEGF+LTLS F+TM+ ALF W
Sbjct: 519  PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578

Query: 1498 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 1677
            ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +
Sbjct: 579  ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638

Query: 1678 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMD 1857
            LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW  IVLLF+DALSV RIPALTP+M +
Sbjct: 639  LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTN 698

Query: 1858 RRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977
             R+LL KV++GAQIS EE+E+MKDLIIPLGR P FS QSG
Sbjct: 699  GRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  611 bits (1575), Expect = e-172
 Identities = 347/703 (49%), Positives = 453/703 (64%), Gaps = 66/703 (9%)
 Frame = +1

Query: 67   LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 246
            ++VKD V++LQLSLL G+  E+QL AAGS++SRSDY DVVTER+IAN+CGYPLC N L +
Sbjct: 9    ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68

Query: 247  ERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEVLK 426
            +RP KGRYRISLKEHKVYDL ETYMYCSS C+INSR FAASL++ER + L+ A+++ VL+
Sbjct: 69   DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128

Query: 427  LFDGLS-LDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVPRRDRD 603
            +F+  S L+ ++  GK+ DLG S LKI+EKT+   G V+LE+W GPSNAI+GYV +R+R 
Sbjct: 129  MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188

Query: 604  LKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPAVK--- 774
               P+   +K  +R  GSK  +       +L  DM+F STIIT+DEY++SKT  ++K   
Sbjct: 189  ---PKELGSKSPKR--GSKANNT------VLINDMDFVSTIITEDEYTVSKTPSSLKKTG 237

Query: 775  ----AKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSLLENIA 942
                 +E +   + K +  +   ++   AP +N+  +R     ++V +     S L +  
Sbjct: 238  LDSKVREQEEILAKKAMGNEFAVLETSYAPASNV--SRVGLVFEDVTSSLRAGSCLSSAR 295

Query: 943  GPSQNDSTKAVKELQESTAGAXXXXXXXXXXXXATRSVTWADEKTDGDG----------- 1089
               ++   KA K     T  +             +R+VTWADEKTD  G           
Sbjct: 296  AEEESHDDKAEK----CTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIE 351

Query: 1090 ---------QNLN--------------------------------ECRELKDKKGAVVTS 1146
                     +N N                                E RE++D K A    
Sbjct: 352  DMKEDPSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADML 411

Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDE-- 1320
             +AD    ++ +RFASAEACA AL +A+E VAS + E +DA+SEAG+IILP P   DE  
Sbjct: 412  CNADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGE 471

Query: 1321 --EENGDVMETDPLQ--LKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMAL 1488
              EE+ D   ++P Q  +KWP KPG            W+D+PPE F+LTLSPF+ M+ AL
Sbjct: 472  PMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNAL 531

Query: 1489 FSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGL 1668
            F+W +SS+LAYIYG++ES HEEY  VNGREYP+KIV  DGRSSEIKQTLAG LARALPGL
Sbjct: 532  FTWTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGL 591

Query: 1669 VAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPY 1848
            VA+LRL  P+S+LEQGMGRLLDTMSF+D LP FRMKQW  I+LLFL+ALSV R+PALTP+
Sbjct: 592  VADLRLSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPH 651

Query: 1849 MMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977
            MM RR+L  KV++ AQISAEE+E+MKDL+IPLGR P FS QSG
Sbjct: 652  MMYRRVLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694


>gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea]
          Length = 597

 Score =  595 bits (1534), Expect = e-167
 Identities = 338/639 (52%), Positives = 427/639 (66%), Gaps = 6/639 (0%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M KDE+LT+K+AV++LQ SLL+G K+ENQL+AAGSL+SR DYQD+VTER IA +CGYPLC
Sbjct: 1    MAKDEILTMKEAVYRLQTSLLEGAKNENQLSAAGSLMSRGDYQDLVTERVIAKICGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             N+L++ERP KGRYRISLKEHKVYD+QETY +CSS CLINSRAF+  L +ER+S L+P K
Sbjct: 61   SNNLNSERPSKGRYRISLKEHKVYDVQETYSFCSSGCLINSRAFSIGLPDERTSDLDPIK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            LNEVLK FDG   +S  NMG+N DLGLS L+I EK +  AG+V+  EWIGPS+AIDGYVP
Sbjct: 121  LNEVLKRFDGFGANSTPNMGRNEDLGLSQLRIMEKENIEAGEVSSNEWIGPSDAIDGYVP 180

Query: 589  RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPA 768
            RRDR+     S   KGE     S++         I   DM+FTS II Q+EYSI+KT   
Sbjct: 181  RRDRNSNTLSSKQKKGE-----SRYHLSLQVLTSIFPSDMSFTSVIIDQNEYSIAKTTTP 235

Query: 769  VKAKEPKGKASSKEVNRQS-NPVQKPTAPLTNIQETRSKNKSK-NVITK-DDKLSLLENI 939
              +K+  G+++ K +  +   P Q P + + NI+ +  +N SK N   K D KLS  E+ 
Sbjct: 236  SSSKQ-SGESNEKVIPEEDVRPKQSPDSSVANIKGSGFRNPSKRNGRAKIDAKLSASEDK 294

Query: 940  AGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXX---ATRSVTWADEKTDGDGQNLNECR 1110
            A  S+N     + +  +S  GA                TR+V+WAD K + DGQNL    
Sbjct: 295  A--SENGGEPKLADGDKSAQGAAVLKSSLKTSYSKETTTRTVSWADVKAE-DGQNLETVC 351

Query: 1111 ELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVI 1290
            E+ D  G  ++  ++            S E+   A T+A+++ A GK   +D        
Sbjct: 352  EMNDPHGGGISRETS------------SVESHKTASTKASKD-APGKFLLTDF------- 391

Query: 1291 ILPPPHGEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFS 1470
                         G++  T+ + LKWPPKPG             YD PP+GFNL+LSPF 
Sbjct: 392  -----------NEGEIF-TEAI-LKWPPKPGFSEADLVESDDTLYDRPPDGFNLSLSPFC 438

Query: 1471 TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 1650
            T+F +LFSW+SSSSLAYIYGK++SFHEEY++ NGREYP K+V  DGRSSEIKQTL+  LA
Sbjct: 439  TLFNSLFSWISSSSLAYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEIKQTLSAALA 498

Query: 1651 RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 1830
            RALPG+V+ELRLP P+S LEQGMGRLLDTMSFIDPLP+ R KQW AIVLLFL+ALSVSRI
Sbjct: 499  RALPGVVSELRLPTPISILEQGMGRLLDTMSFIDPLPSLRTKQWQAIVLLFLNALSVSRI 558

Query: 1831 PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLG 1947
            PAL+ Y+ DRR  + KV+EGA I  EEFE+MKDLIIPLG
Sbjct: 559  PALSKYLEDRRASIQKVLEGAGIGVEEFEVMKDLIIPLG 597


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  590 bits (1522), Expect = e-166
 Identities = 341/673 (50%), Positives = 442/673 (65%), Gaps = 31/673 (4%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             ++L ++   +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+  LQ+ER S +NP K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            L E+LKLF+ +SLDS  NMG N D   SGL+IQEK ++  G+V +EEW+GPSNAI+GYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 589  RRDRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 762
             RD  +    S +  G+  + GSK + ++P     D  S D + TSTIIT +EYS+SK  
Sbjct: 178  HRDHKVMTLHSKD--GKESKDGSKAK-IKPLGGGKDFFS-DFSITSTIITDEEYSVSKIS 233

Query: 763  PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 906
              +K       +K   G+   KE N Q   ++ P AP         +   SK ++K   T
Sbjct: 234  SGLKEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293

Query: 907  KDDKLSLLENIAGPSQNDSTKAVKELQESTAG-------AXXXXXXXXXXXXATRSVTWA 1065
            K+   +L  +    S+N ST      +E   G                      RSVTWA
Sbjct: 294  KESTDNL-SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWA 352

Query: 1066 DEKTDGDG-QNLNECREL-KDKKGAVVTSHSAD-EEVGEEPYRFASAEACAMALTQAAEE 1236
            DEKTD     NL E  E+ K K+ +  TS+  + +   E+  R  SAEACAMAL+QAAE 
Sbjct: 353  DEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEA 412

Query: 1237 VASGKSEASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWPP-------KPGXXXX 1395
            + SG+SE SDAVSEAG+IILP P   +EE +     TDP+    P        K G    
Sbjct: 413  ITSGQSEVSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRS 467

Query: 1396 XXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGR 1575
                    WYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+
Sbjct: 468  DLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGK 527

Query: 1576 EYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDP 1755
            EYP KIV  DGRSSEIKQTLAGCL RA+PGL +EL L  P+S LE GM  LLDTM+F+D 
Sbjct: 528  EYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDA 587

Query: 1756 LPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLI 1935
            LPAFRMKQW  IVLLF++ALSVSRIP+L  +M   R L  KV++ AQI ++E+EIM+D I
Sbjct: 588  LPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHI 647

Query: 1936 IPLGRVPQFSTQS 1974
            +PLGR  Q S ++
Sbjct: 648  LPLGRTAQLSDEN 660


>gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  585 bits (1508), Expect = e-164
 Identities = 349/713 (48%), Positives = 448/713 (62%), Gaps = 76/713 (10%)
 Frame = +1

Query: 67   LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 246
            ++VKD V+KLQL+LL+GIK ++ L  AGS+ISRSDY DVVTERTIAN+CGYPLC N+L +
Sbjct: 13   ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72

Query: 247  E--RPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEV 420
            +  RP KG YRISLKEHKVYDL ETYMYCSS C+I S+AFA SL EER   L+  K+  +
Sbjct: 73   DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132

Query: 421  LKLFDGLSLD-SDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEW--------------- 552
            L+ F  +  D  +V  G+ GDLG+S LKI+EK +T  G + +                  
Sbjct: 133  LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192

Query: 553  IGPSNAIDGYVPRRDRDLKHPQSNNNKGERREVGSKHRHVRPNAA-DILSYDMNFTSTII 729
            +GPSNAI+GYVP+++R  K   S  NK      GSK +  + ++  DI+  +M+F STII
Sbjct: 193  VGPSNAIEGYVPQKERISKPLGSKKNK-----EGSKGKDAKMSSGMDIIFNEMDFMSTII 247

Query: 730  TQDEYSISKTVPAVKAK--EPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVI 903
            T DEYS+SK  P+V     E K K S  +V    N          +++++R     KN  
Sbjct: 248  TSDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKN---------DSVKKSRQSKGGKNKN 298

Query: 904  TKDDKLSLLE--NIAGPSQ---NDSTKAVKE------LQESTAGAXXXXXXXXXXXXATR 1050
             K D + + E  + +  SQ   N STK  KE       ++S                  R
Sbjct: 299  VKKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNR 358

Query: 1051 SVTWADEKTDGDG-QNLNECRELK---DKKGAVVTSH--SADEEVG-------------- 1170
            SVTWADE  D  G +NL E RE++   +   A  + H  S + +VG              
Sbjct: 359  SVTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTK 418

Query: 1171 ---------------------EEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1287
                                 +E     SAEACAMAL QAAE VASG+S+ S AVS AG+
Sbjct: 419  SKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGAGI 478

Query: 1288 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTL 1458
            IILP P G DEE   E+ D++E++   L WP KPG            W+D+PPEGF++TL
Sbjct: 479  IILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSVTL 537

Query: 1459 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 1638
            SPF+TM+ +LF+W++SS+LAYIYG++ESFHEE+LSVNGREYP KIV+  GRSSEIK+TL 
Sbjct: 538  SPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKTLD 597

Query: 1639 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 1818
               ARALPG+V+ELRLP P+S+LEQGMGR+L+TMSFID +PAFRMKQW  IVLLFL+ LS
Sbjct: 598  ESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEGLS 657

Query: 1819 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977
            V RIPALTP+M +RR+L  KV+E  QISAE++E+MKDLIIPLGR PQFS QSG
Sbjct: 658  VCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710


>ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 632

 Score =  574 bits (1479), Expect = e-161
 Identities = 333/666 (50%), Positives = 434/666 (65%), Gaps = 24/666 (3%)
 Frame = +1

Query: 49   MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228
            M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 229  GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408
             ++L ++   +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+  LQ+ER S +NP K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 409  LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588
            L E+LKLF+ +SLDS  NMG N D   SGL+IQEK ++  G+V +EEW+GPSNAI+GYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 589  RRDRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 762
             RD  +    S +  G+  + GSK + ++P     D  S D +FTSTIIT +EYS+SK  
Sbjct: 178  HRDHKVMTLHSKD--GKESKDGSKAK-IKPLGGGKDFFS-DFSFTSTIITDEEYSVSKIS 233

Query: 763  PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 906
              +K       +K   G+   K+ N Q   ++ P AP         +   SK ++K   T
Sbjct: 234  SGLKEMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293

Query: 907  KDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXXATRSVTWADEKTDGD 1086
            K +    L +    S N ST      +E                         DEKTD  
Sbjct: 294  K-ESTDNLSDAPSTSNNRSTNFNLMTEEP-----------------------RDEKTDDA 329

Query: 1087 G-QNLNECREL-KDKKGAVVTSHSAD-EEVGEEPYRFASAEACAMALTQAAEEVASGKSE 1257
               NL E  E+ K K+ +  TS+  + +   E+  R  SAEACAMAL+QAA+ + SG+SE
Sbjct: 330  SIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSE 389

Query: 1258 ASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWP-------PKPGXXXXXXXXXXX 1416
             SDAVSEAG+IILP P   +EE +     TDP+    P        K G           
Sbjct: 390  VSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRSDLFDPSD 444

Query: 1417 XWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIV 1596
             WYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+EYP KIV
Sbjct: 445  SWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIV 504

Query: 1597 MPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMK 1776
              DGRSSEIKQTLAGCL RA+PGL +EL L  P+S LE GM  LLDTM+F+D LPAFRMK
Sbjct: 505  SADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMK 564

Query: 1777 QWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVP 1956
            QW  IVLLF++ALSVSRIP+L  +M   R L  KV++ AQI ++E+EIM+D I+PLGR  
Sbjct: 565  QWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTA 624

Query: 1957 QFSTQS 1974
            Q S ++
Sbjct: 625  QLSDEN 630


>gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
          Length = 703

 Score =  569 bits (1467), Expect = e-159
 Identities = 329/672 (48%), Positives = 426/672 (63%), Gaps = 48/672 (7%)
 Frame = +1

Query: 31   KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 211  CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 391  TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 571  IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735
            I+GYVP+R+   K     NNK +       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 736  DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 874  RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 988  ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E           E+  E
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVCEVDK--------EEPME 510

Query: 1327 NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSS 1506
            +GD++E +   +KWP KPG            W+D+PPEGF+LTLS F+TM+ ALF W++S
Sbjct: 511  DGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITS 570

Query: 1507 SSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRL 1686
            SSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRL
Sbjct: 571  SSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRL 630

Query: 1687 PIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRI 1866
            PIP+STLEQGMG L+DT+SF++ LPAFRMKQW  IVLLF+DALSV RIPALTP+M + R+
Sbjct: 631  PIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRM 690

Query: 1867 LLPKVIEGAQIS 1902
            LL KV++GAQIS
Sbjct: 691  LLHKVLDGAQIS 702


>gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
          Length = 708

 Score =  528 bits (1361), Expect = e-147
 Identities = 307/635 (48%), Positives = 399/635 (62%), Gaps = 51/635 (8%)
 Frame = +1

Query: 31   KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 211  CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 391  TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 571  IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735
            I+GYVP+R+   K     NNK +       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 736  DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 874  RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 988  ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518

Query: 1327 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSW 1497
               +GD++E +   +KWP KPG            W+D+PPEGF+LTLS F+TM+ ALF W
Sbjct: 519  PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578

Query: 1498 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 1677
            ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +
Sbjct: 579  ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638

Query: 1678 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 1782
            LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW
Sbjct: 639  LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673


>gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
          Length = 679

 Score =  528 bits (1361), Expect = e-147
 Identities = 307/635 (48%), Positives = 399/635 (62%), Gaps = 51/635 (8%)
 Frame = +1

Query: 31   KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 211  CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 391  TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 571  IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735
            I+GYVP+R+   K     NNK +       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 736  DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 874  RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 988  ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518

Query: 1327 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSW 1497
               +GD++E +   +KWP KPG            W+D+PPEGF+LTLS F+TM+ ALF W
Sbjct: 519  PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578

Query: 1498 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 1677
            ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +
Sbjct: 579  ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638

Query: 1678 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 1782
            LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW
Sbjct: 639  LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673

