BLASTX nr result

ID: Rehmannia25_contig00006343 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00006343
         (2511 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   711   0.0  
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   708   0.0  
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   684   0.0  
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   673   0.0  
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   650   0.0  
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   648   0.0  
gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus...   642   0.0  
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...   631   e-178
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   625   e-176
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...   623   e-175
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   622   e-175
gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma c...   619   e-174
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     610   e-172
gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise...   592   e-166
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   589   e-165
gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus pe...   585   e-164
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   573   e-160
gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobro...   568   e-159
gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]      528   e-147
gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]      528   e-147

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  711 bits (1836), Expect = 0.0
 Identities = 390/669 (58%), Positives = 486/669 (72%), Gaps = 25/669 (3%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M  D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             NSL +ER  KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN  +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            +N +L+LF   SL+S+  +GK+GDLGLS LKI+E  +  AG+V++E+WIGPSNAI+GYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 1533
            +R R+LK     N+K   +   SK      +  + +  +M+F STIIT+DEYSISK    
Sbjct: 181  QRDRNLKPKNIKNHKEGSKSSNSK----MDSGKNFVIDEMDFVSTIITKDEYSISKSSKG 236

Query: 1532 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 1371
               T    K+KEPK KAS   +  Q + ++K   P+ N  E++   SK +   VI KD+ 
Sbjct: 237  LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 292

Query: 1370 LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXKA-----TRSVTWADE 1218
             S  E  + PSQ+ S     K  +E     A              +      RSVTWADE
Sbjct: 293  FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE 352

Query: 1217 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEE--SYRFASAEACAMALTQAAEEVA 1047
            K D  D ++  + REL+ KK     +   D +VG++  + RFASAEACA+AL+QAAE VA
Sbjct: 353  KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVA 410

Query: 1046 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 876
            SG+++ +DAVSEAG+IILP P   DE E+    D++E +P+ LKWP KPG          
Sbjct: 411  SGETDMTDAVSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470

Query: 875  XSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 696
             SWYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI
Sbjct: 471  DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530

Query: 695  VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 516
            V+ DGRSSEIKQTLAGCL+RALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM
Sbjct: 531  VLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590

Query: 515  KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 336
            KQW  IVLLF+DALSV RIPALTP+M  RR+L PKV + AQ+SAEE+E+MKDLIIPLGRV
Sbjct: 591  KQWQVIVLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650

Query: 335  PQFSTQSGG 309
            PQFS QSGG
Sbjct: 651  PQFSAQSGG 659


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  708 bits (1827), Expect = 0.0
 Identities = 388/669 (57%), Positives = 485/669 (72%), Gaps = 25/669 (3%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M  D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             NSL +ER  KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN  +
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            +N +L+LF   SL+S+  +GK+GDLGLS LKI+E  +  AG+V++E+WIGPSNAI+GYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 1533
            +R R+LK     N K   +   SK      +  + +  +M+F  TIIT+DEYSISK    
Sbjct: 181  QRDRNLKPKNIKNRKEGSKSSNSK----MDSGKNFVIDEMDFVRTIITEDEYSISKSSKG 236

Query: 1532 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 1371
               T    K+KEPK KAS   +  Q + ++K   P+ N  E++   SK +   VI KD+ 
Sbjct: 237  LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 292

Query: 1370 LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXKA-----TRSVTWADE 1218
             S  E  + PSQ+ S     K  +E     A              +     TRSVTWADE
Sbjct: 293  FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE 352

Query: 1217 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEE--SYRFASAEACAMALTQAAEEVA 1047
            K D  D ++  + REL+ KK     +   D +VG++  + RFASAEACA+AL+QAAE VA
Sbjct: 353  KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVA 410

Query: 1046 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 876
            SG+++ +DAVSEA +IILP P   DE E+    D++E +P+ LKWP KPG          
Sbjct: 411  SGETDMTDAVSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470

Query: 875  XSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 696
             SWYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI
Sbjct: 471  DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530

Query: 695  VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 516
            V+ DGRSSEIKQTLAGCLARALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM
Sbjct: 531  VLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590

Query: 515  KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 336
            KQW  IVLLF+DALSV +IPALTP+M+ +R+L PKV + AQ+SAEE+E+MKDLIIPLGRV
Sbjct: 591  KQWQVIVLLFIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650

Query: 335  PQFSTQSGG 309
            PQFS QSGG
Sbjct: 651  PQFSAQSGG 659


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  684 bits (1765), Expect = 0.0
 Identities = 388/674 (57%), Positives = 471/674 (69%), Gaps = 30/674 (4%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M K E + VKDAVHKLQL LL+GIK E+QL AAGSL+SRSDYQDVVTER+IAN+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             NSL +ER  KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LN+VL LF GL L S  ++ +NGD G S LKIQEK D   G+V+LEEW+GPSNAI+GYVP
Sbjct: 121  LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180

Query: 1700 RRVRDLKHPQSNN-NKGERREVGSKHRHVR-PNAADILSYDMNFTSTIITQDEYSISKTV 1527
            +R R +      N NKG      SK++H R  +  +++  + +F+STIITQDEYS+SK  
Sbjct: 181  QRDRSVNPALLKNINKG------SKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSK-F 233

Query: 1526 PA-------VKAKEPKGKASSKE-------VNRQSNPVQKPTAPLTNIQETRSKNKSKNV 1389
            PA       VK KE + K   K        + +Q + +Q     L + +ET   +K+   
Sbjct: 234  PAPVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDALQ-----LRSGEETEKSDKNTRF 288

Query: 1388 ITKDDKLSLLENIAGPSQND---------STKAVKELQESTAGAXXXXXXXXXXXKATRS 1236
            + K DK +  E  +GPSQ+D         S    K                    K +RS
Sbjct: 289  L-KVDKFNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRS 347

Query: 1235 VTWADEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEESYRFASAEACAMALTQA 1062
            VTWADE  DG  G+      ++ + +  A   S S D E  ++SYRF SAEACA AL+QA
Sbjct: 348  VTWADESIDGGIGKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQA 407

Query: 1061 AEEVASGKSEASDAVSEAGVIILPPPHGEDE---EENGDVMETDPLQLKWPPKPGXXXXX 891
            AE VASG S+  DAVS+AG++ILPP    DE   +E  ++++ +   LKWP KPG     
Sbjct: 408  AEAVASG-SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYD 466

Query: 890  XXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGRE 711
                  SWYDSPPEGFN+TLSPF TMF +LF+W+SSSSLA+IYG +ES +EEYLS+NGRE
Sbjct: 467  VFESEDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGRE 526

Query: 710  YPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPL 531
            YP+KIV+ DGRS+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM  LL+TMSF+DPL
Sbjct: 527  YPRKIVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPL 586

Query: 530  PAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLII 351
            PAFRMKQW  IVLLFLDALSV RIP LTPYM  RR   PKV++GAQISA E+EIMKDLII
Sbjct: 587  PAFRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLII 646

Query: 350  PLGRVPQFSTQSGG 309
            PLGRVPQFS QSGG
Sbjct: 647  PLGRVPQFSMQSGG 660


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  673 bits (1736), Expect = 0.0
 Identities = 382/669 (57%), Positives = 467/669 (69%), Gaps = 25/669 (3%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M K E + VKDAVHKLQL LL+GIK ENQL AAGSL+SRSDYQDVVTER+IAN+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             NSL +ER  KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAG-QVALEEWIGPSNAIDGYV 1704
            LN+VL LF GL L S  ++ +NGDLG S LKIQEK D   G +V+LEEW+GPSNAI+GYV
Sbjct: 121  LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180

Query: 1703 PRRVRDLKHPQSNN-NKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK-- 1533
            P+R R +      N NKG +    +KH  ++     IL+ + +F+STIITQDEYS+SK  
Sbjct: 181  PQRDRSVNPALLKNINKGFK----NKHARLQDEKNMILN-EFDFSSTIITQDEYSVSKFP 235

Query: 1532 ----TVPAVKAKEPKGKASSKEVNRQSNPVQK--PTAPLTNIQETRSKNKSKNVITKDDK 1371
                 V + K KE + K   K  +   + + K      L + +ET   +K+   + K DK
Sbjct: 236  APVNAVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFL-KVDK 294

Query: 1370 LSLLENIAGPSQND-STKAVKELQ----------ESTAGAXXXXXXXXXXXKATRSVTWA 1224
             +  E  +GPSQ+D   K+V  +           E                K ++SVTWA
Sbjct: 295  FNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWA 354

Query: 1223 DEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEESYRFASAEACAMALTQAAEEV 1050
            DE  DG  G+      ++ + +  A   S S D E  ++SYRF SAEACA AL+QAAE V
Sbjct: 355  DEIIDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAV 414

Query: 1049 ASGKSEASDAVSEAGVIILPPPHGEDEE--ENGDVMETDPLQLKWPPKPGXXXXXXXXXX 876
            ASG S+  DAVS+AG++ILP     DE   +  ++++ +P  LKWP KPG          
Sbjct: 415  ASG-SDVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEPAPLKWPRKPGMPNYDVFESE 473

Query: 875  XSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 696
              WYD PPEGFN+TLSPF+TMF +LF+W+SSSSLA+IYG +E+ +EEYLS+NGREYP KI
Sbjct: 474  DCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPHKI 533

Query: 695  VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 516
            V+ DG S+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM  LL+TMSF+DPLPAFRM
Sbjct: 534  VLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRM 593

Query: 515  KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 336
            KQW  IVLLFLDALSV RIP LTPYM  RR  LPKV++GAQIS  E+EIMKDLIIPLGRV
Sbjct: 594  KQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLGRV 653

Query: 335  PQFSTQSGG 309
            PQFS QSGG
Sbjct: 654  PQFSMQSGG 662


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  650 bits (1678), Expect = 0.0
 Identities = 358/655 (54%), Positives = 465/655 (70%), Gaps = 18/655 (2%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M K+E ++VKD V+KLQLSLL+GI++E+QL AAGSL+SRSDY+DVV ER+I+N+CGYPLC
Sbjct: 1    MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             NSL ++RP+KGRYRISLKEH+VYDLQETYMYCSSSCL+NSRAF+ SLQE+R S LNP K
Sbjct: 61   NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LNE+L+ F+ L+LDS+  +G++GDLGLS LKIQEK++T  G+V+LEEWIGPSNAI+GYVP
Sbjct: 121  LNEILRKFNDLTLDSE-GLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179

Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 1533
            +  RD  +P   N+K   + +  K      +  D    D +FTSTIIT DEYSISK    
Sbjct: 180  QGDRD-PNPSLKNHKEGLKAICKKP----VSKQDCFFSDTDFTSTIITNDEYSISKGPSG 234

Query: 1532 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSL 1362
               T   +K +   GK   + +N Q + ++K  +   +    +SK + K  + K+     
Sbjct: 235  LTSTASDIKLQAQTGKGH-EGLNAQLSSLRKQDSIKAS---RKSKGRRKEKVIKEQ---- 286

Query: 1361 LENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXKAT------RSVTWADEKTDGDG 1200
            L     PS +  T   +++ ++T  A           K++      RSVTWADE+ D  G
Sbjct: 287  LNFQDLPSSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADERVDNAG 346

Query: 1199 -QNLNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASD 1023
             +NL E +E++    +   S SA++       RF SAEACA+AL+QAAE VASG ++ + 
Sbjct: 347  SRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNK 406

Query: 1022 AVSEAGVIILPPPH----GEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSP 855
            A+SEAG+I+LPP      G + E+N D++E +   LKWP KPG           SWYD+P
Sbjct: 407  AMSEAGIIVLPPSQDLGQGGNVEKN-DMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAP 465

Query: 854  PEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRS 675
            PEGF+LTLSPF+TM+MALF+WV+SSSLAYIYG++ES HE+YLSVNGREYP+KIV+ DGRS
Sbjct: 466  PEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRS 525

Query: 674  SEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIV 495
            SEI+ T   CLAR  PGLVA LRLPIPVSTLEQG GRLL+TMSF+D LPAFR KQW  I 
Sbjct: 526  SEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIA 585

Query: 494  LLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQ 330
            LLF++ALSV RIPALT YM  RR++L +V++GA ISAEE++IMKD ++PLGR PQ
Sbjct: 586  LLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ 640


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  648 bits (1671), Expect = 0.0
 Identities = 371/709 (52%), Positives = 469/709 (66%), Gaps = 66/709 (9%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M KD+   VKD ++KLQLSLLDGI++E+QL AAGS++S SDY+DVVTERTIAN+CGYPLC
Sbjct: 1    MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
            GNSL ++RP KGRYRISLKEHKVYDL ETYMYCSSSC+INSR F+ SLQEER   LNPAK
Sbjct: 61   GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LNEVL LFD  SL S+ ++GKNGDLG S LKI+EKT+ V G+V+ E+WIGPSNAI+GYVP
Sbjct: 121  LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180

Query: 1700 RRVR--------DL---------------------------KHPQSNNNKGERR-EVGSK 1629
            +R R        D+                           K  Q    KG  +   GSK
Sbjct: 181  QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240

Query: 1628 HRHVRPNAA-DILSYDMNFTSTII-TQDEYSISK-------TVPAVKAKEPKGKASSKEV 1476
             +  + ++  +    DMNFTSTII TQDEYSISK       T    K ++ K K S K  
Sbjct: 241  AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300

Query: 1475 NRQSNPVQKPTAPLTN--IQETRSKNKSKNVITKDDKLSLLENIAGPS---------QND 1329
              QS+  +K  +  T+  ++E RSK   K+ ++  D  S  ++    S         ++ 
Sbjct: 301  ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360

Query: 1328 STKAVKELQES------TAGAXXXXXXXXXXXKATRSVTWADEKTDGDG-QNLNECRELK 1170
            S KA K ++ S      T+GA             TRSVTWADEK    G ++L E R ++
Sbjct: 361  SEKAAKPVESSLKPSLKTSGAKQL----------TRSVTWADEKVGSSGSRDLCEVRGME 410

Query: 1169 DKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILP 990
            D K       + D+       +F SAEACA AL+QAAE VASG ++AS+A+SEAG++ILP
Sbjct: 411  DTKAGPEIVDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILP 470

Query: 989  PPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFS 819
             PH  D+    E+ DV++ +   +KWP KPG           SWYD+PPEGF+L LS F+
Sbjct: 471  QPHDLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFA 530

Query: 818  TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 639
            T++MALF+WV+SSSLAY+YGK+ES HEEYL VNGREYP+KIV+ DGRS EI+QT+ GCL 
Sbjct: 531  TIWMALFAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLG 590

Query: 638  RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 459
            RA P +VA+LRLPIP+STLEQG   LL TMSF+D +PAFRMKQW  I LLF++ALSV RI
Sbjct: 591  RAFPVVVADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRI 650

Query: 458  PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312
            PAL  YM +RR+    V++G ++SAEE+E+MKDL+IPLGR PQFS QSG
Sbjct: 651  PALISYMDNRRM----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695


>gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris]
          Length = 706

 Score =  642 bits (1657), Expect = 0.0
 Identities = 369/713 (51%), Positives = 465/713 (65%), Gaps = 70/713 (9%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M KD+ ++VKDAV KLQ+ LL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC
Sbjct: 1    MAKDKAVSVKDAVFKLQMLLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             N+L +ERP KG+YRISLKEHKVYDLQETYM+CSS+C+++S+AF+  LQ ER S L+P K
Sbjct: 61   CNALPSERPRKGKYRISLKEHKVYDLQETYMFCSSNCVVSSKAFSGILQAERCSALDPEK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LN VL LF+ L+L+   N+ K+GDLGLS LKIQEKT T +G+V LE+W+GPSNAI+GYVP
Sbjct: 121  LNNVLGLFENLNLEQTENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYVP 180

Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRPNA-ADILSYDMNFTSTIITQDEYSISKT 1530
            +       P+   +KG R+ V  GSK  H + N   D+++ +MNF STII QDEYS+SK 
Sbjct: 181  K-------PRERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKA 233

Query: 1529 VPA-----------------------------------------------VKAKEPKGKA 1491
             P                                                + A E KGK 
Sbjct: 234  SPGQTDTTAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASE-KGKE 292

Query: 1490 SSK--EVNRQSNP---VQKPTAPLTNIQETR---SKNKS--KNVITKDDKLSLLENIAGP 1341
             SK  EV  +S P   ++K  A   +I E      KN S  K+V  K +   +  N    
Sbjct: 293  VSKSCEVVVKSTPNLAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDAS 352

Query: 1340 SQNDSTKAVKE-LQESTAGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-QNLNEC 1182
            + N     VKE  Q    G             A     +R+VTWADEK +G G ++L E 
Sbjct: 353  TSNFDPDNVKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADEKINGAGNKDLCEV 412

Query: 1181 RELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1002
            +E  D      +  + D    E+  R ASAEACA+AL+QA+E VASG S+A+DAVSEAG+
Sbjct: 413  KEFGDIIKESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGI 472

Query: 1001 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTL 831
            IILP PH   EE   E+ D+++ D + LKWP KPG           SW+D+PPEGF+LTL
Sbjct: 473  IILPQPHDAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTL 532

Query: 830  SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 651
            SPF+ M+ A+FSW++S SLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQT A
Sbjct: 533  SPFANMWNAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREYPCKVVLSDGRSSEIKQTFA 592

Query: 650  GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 471
            GCLARA P LVA LRLPIP+STLEQGM  LL+TMSF+D LPAFR KQW  + LLF+DALS
Sbjct: 593  GCLARAFPALVAGLRLPIPISTLEQGMACLLETMSFVDALPAFRTKQWQVVALLFVDALS 652

Query: 470  VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312
            V RIP+L  YM DRR L  KV+ G+QI  EE+EI+KDL++PLGR P  S QSG
Sbjct: 653  VCRIPSLISYMTDRRALFHKVLSGSQIGMEEYEILKDLVVPLGRAPHISVQSG 705


>ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Glycine max]
          Length = 706

 Score =  631 bits (1627), Expect = e-178
 Identities = 355/713 (49%), Positives = 464/713 (65%), Gaps = 70/713 (9%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC
Sbjct: 1    MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+  K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LN VL LF+ L+L+    + KNGDLGLS LKIQEKT+  +G+V+LE+W GPSNAI+GYVP
Sbjct: 121  LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180

Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 1530
            +       P++ ++KG R+ V  GSK  H +  +  ++++ +M F STII QDEYS+SK 
Sbjct: 181  K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233

Query: 1529 VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 1407
             P                 K+P+ K  ++ V +  + +Q      K +  L+  ++    
Sbjct: 234  PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292

Query: 1406 NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 1293
             KS   + K              +S+ E      QNDS +   +++  T           
Sbjct: 293  TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352

Query: 1292 ----------------AGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-QNLNECR 1179
                            AG             A     +R+VTWADEK +  G ++L E +
Sbjct: 353  SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412

Query: 1178 ELKD-KKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1002
            E  D KK +    ++ D    E+  R ASAEACA+AL+ A+E VASG S+ SDAVSEAG+
Sbjct: 413  EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGI 472

Query: 1001 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTL 831
             ILPPPH   EE   E+ D+++ D + LKWP K G           SW+D+PPEGF+LTL
Sbjct: 473  TILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTL 532

Query: 830  SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 651
            SPF+TM+  LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQTLA
Sbjct: 533  SPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLA 592

Query: 650  GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 471
             CLARALP LVA LRLPIPVS +EQGM  LL+TMSF+D LPAFR KQW  + LLF+DALS
Sbjct: 593  SCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALS 652

Query: 470  VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312
            V R+PAL  YM DRR    +V+ G+QI  EE+E++KDL++PLGR P  S+QSG
Sbjct: 653  VCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSG 705


>ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Glycine max]
          Length = 706

 Score =  625 bits (1613), Expect = e-176
 Identities = 353/717 (49%), Positives = 456/717 (63%), Gaps = 74/717 (10%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC
Sbjct: 1    MEKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             N+L ++RP KGRYRISLKEHKVYDL ETYM+C S+C+++S+AFA SLQ ER S L+  K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LN +L LF+ L+L+   N+ KN D GLS LKIQEKT+T +G+V+LE+W GPSNAI+GYVP
Sbjct: 121  LNNILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVP 180

Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 1530
            +       P+ +++KG R+ V  GSK  H +P +  +++S +M F STII QD YS+SK 
Sbjct: 181  K-------PRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKV 233

Query: 1529 VPAVKAK------------EPKGKASSKEVNRQSNPVQKPTAP---------------LT 1431
            +P  +              +  GK  +K V +    +Q  ++                L 
Sbjct: 234  LPGQRDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELA 293

Query: 1430 NIQETRSKNKSKNVITKDD--KLSLLENIAGPSQNDSTK--------------------- 1320
               E   K+     I K D   +S+ E      QNDS K                     
Sbjct: 294  QSCEAALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTS 353

Query: 1319 ------AVKELQESTAGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-------QN 1194
                    ++ Q   AG             A     +R+VTWAD+K +  G       +N
Sbjct: 354  NLDPANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADKKINSTGSKDLCGFKN 413

Query: 1193 LNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVS 1014
              + R   D  G     +S D    E++ R ASAEAC +AL+ A+E VASG S+ SDAVS
Sbjct: 414  FGDIRNESDSAG-----NSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVS 468

Query: 1013 EAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGF 843
            EAG+IILPPPH   EE   E+ D+++ D + +KWP KPG           SW+D+ PEGF
Sbjct: 469  EAGIIILPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGF 528

Query: 842  NLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIK 663
            +LTLSPF+TM+  LFSW++SSSLAYIYG++ESF EEYLSVNGREYP K+V+ DGRSSEIK
Sbjct: 529  SLTLSPFATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIK 588

Query: 662  QTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFL 483
            QTLA CLARALP LVA LRLPIPVST+EQGM  LL+TMSF+D LPAFR KQW  + LLF+
Sbjct: 589  QTLASCLARALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFI 648

Query: 482  DALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312
            DALSV R+PAL  YM DRR    +V+ G+QI  EE+E++KDL +PLGR P  S QSG
Sbjct: 649  DALSVCRLPALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSG 705


>ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X2 [Glycine max]
          Length = 716

 Score =  623 bits (1606), Expect = e-175
 Identities = 355/723 (49%), Positives = 464/723 (64%), Gaps = 80/723 (11%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC
Sbjct: 1    MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+  K
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LN VL LF+ L+L+    + KNGDLGLS LKIQEKT+  +G+V+LE+W GPSNAI+GYVP
Sbjct: 121  LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180

Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 1530
            +       P++ ++KG R+ V  GSK  H +  +  ++++ +M F STII QDEYS+SK 
Sbjct: 181  K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233

Query: 1529 VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 1407
             P                 K+P+ K  ++ V +  + +Q      K +  L+  ++    
Sbjct: 234  PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292

Query: 1406 NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 1293
             KS   + K              +S+ E      QNDS +   +++  T           
Sbjct: 293  TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352

Query: 1292 ----------------AGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-QNLNECR 1179
                            AG             A     +R+VTWADEK +  G ++L E +
Sbjct: 353  SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412

Query: 1178 ELKD-KKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAV----- 1017
            E  D KK +    ++ D    E+  R ASAEACA+AL+ A+E VASG S+ SDAV     
Sbjct: 413  EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMN 472

Query: 1016 -----SEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYD 861
                 SEAG+ ILPPPH   EE   E+ D+++ D + LKWP K G           SW+D
Sbjct: 473  ETCAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFD 532

Query: 860  SPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDG 681
            +PPEGF+LTLSPF+TM+  LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DG
Sbjct: 533  APPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADG 592

Query: 680  RSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHA 501
            RSSEIKQTLA CLARALP LVA LRLPIPVS +EQGM  LL+TMSF+D LPAFR KQW  
Sbjct: 593  RSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQV 652

Query: 500  IVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFST 321
            + LLF+DALSV R+PAL  YM DRR    +V+ G+QI  EE+E++KDL++PLGR P  S+
Sbjct: 653  VALLFIDALSVCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISS 712

Query: 320  QSG 312
            QSG
Sbjct: 713  QSG 715


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  622 bits (1604), Expect = e-175
 Identities = 346/675 (51%), Positives = 454/675 (67%), Gaps = 32/675 (4%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M KD+ ++VKDAV KLQL+LL+GI+ E+QL AAGSLISRSDY+DVVTER+I  VC YPLC
Sbjct: 1    MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             N+L +ERP KGRYRISLKEHKVYDL ETYM+CSSSC++NS+AFA SL+++R   L+P K
Sbjct: 61   CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LN +L+LF   +L+   N GK+G+LGLS L+IQ+KT+TV  +V+LE+W+GPSNAI+GYVP
Sbjct: 121  LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVP 179

Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRPNAA-DILSYDMNFTSTIITQDEYSISKT 1530
            ++       + N +KG ++    GSK  H + N   ++++ + +F STII QDEYS+SK 
Sbjct: 180  KK-------RDNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKV 232

Query: 1529 V-------------PAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSK-- 1395
                          P    ++PK      E+ R+ + +Q  ++   +     +  K K  
Sbjct: 233  SSGQTDATVDHQIKPTAILEQPK--RVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEI 290

Query: 1394 -----NVITKDDKLSLLENIAGPSQNDSTKAVKELQ-ESTAGAXXXXXXXXXXXKAT--- 1242
                 NV+          + +  S  D +   +++Q E   G+                 
Sbjct: 291  AKSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKL 350

Query: 1241 -RSVTWADEKTDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALT 1068
             RSVTWAD+K DG G  +L   +E  + K     + + D    E+  R  SAEACA+AL+
Sbjct: 351  GRSVTWADKKIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALS 410

Query: 1067 QAAEEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXX 897
            QAAE VASG S+A DAVSEAG+IILP      EE   ++ D++ETD + LKWP KPG   
Sbjct: 411  QAAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISD 470

Query: 896  XXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNG 717
                    SW+D+PPEGF+LTLSPF+T++ A FSW++SSSLAYIYG++ SF+EE+LSV+G
Sbjct: 471  FDLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDG 530

Query: 716  REYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFID 537
            REYP KIV+ DGRSSEIKQTLA CLARALP +VAEL+LP+PVSTLEQGM  LLDTMSF+D
Sbjct: 531  REYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVD 590

Query: 536  PLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDL 357
            PLP FR KQW  + LLF+DALSV RIPAL  YM DRR L  KV+ G+QI  EE+ ++KDL
Sbjct: 591  PLPGFRFKQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDL 650

Query: 356  IIPLGRVPQFSTQSG 312
            I+PLGR P FS+QSG
Sbjct: 651  IVPLGRAPHFSSQSG 665


>gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
          Length = 739

 Score =  619 bits (1595), Expect = e-174
 Identities = 355/700 (50%), Positives = 456/700 (65%), Gaps = 51/700 (7%)
 Frame = -2

Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 1718 IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554
            I+GYVP+R  +     P++N NK       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518

Query: 962  ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSW 792
               +GD++E +   +KWP KPG           SW+D+PPEGF+LTLS F+TM+ ALF W
Sbjct: 519  PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578

Query: 791  VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 612
            ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +
Sbjct: 579  ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638

Query: 611  LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMD 432
            LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW  IVLLF+DALSV RIPALTP+M +
Sbjct: 639  LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTN 698

Query: 431  RRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312
             R+LL KV++GAQIS EE+E+MKDLIIPLGR P FS QSG
Sbjct: 699  GRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  610 bits (1573), Expect = e-172
 Identities = 349/703 (49%), Positives = 455/703 (64%), Gaps = 66/703 (9%)
 Frame = -2

Query: 2222 LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 2043
            ++VKD V++LQLSLL G+  E+QL AAGS++SRSDY DVVTER+IAN+CGYPLC N L +
Sbjct: 9    ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68

Query: 2042 ERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEVLK 1863
            +RP KGRYRISLKEHKVYDL ETYMYCSS C+INSR FAASL++ER + L+ A+++ VL+
Sbjct: 69   DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128

Query: 1862 LFDGLS-LDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVPRRVRD 1686
            +F+  S L+ ++  GK+ DLG S LKI+EKT+   G V+LE+W GPSNAI+GYV +R R 
Sbjct: 129  MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188

Query: 1685 LKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPAVK--- 1515
               P+   +K  +R  GSK  +       +L  DM+F STIIT+DEY++SKT  ++K   
Sbjct: 189  ---PKELGSKSPKR--GSKANNT------VLINDMDFVSTIITEDEYTVSKTPSSLKKTG 237

Query: 1514 ----AKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSLLENIA 1347
                 +E +   + K +  +   ++   AP +N+  +R     ++V +     S L +  
Sbjct: 238  LDSKVREQEEILAKKAMGNEFAVLETSYAPASNV--SRVGLVFEDVTSSLRAGSCLSSAR 295

Query: 1346 GPSQNDSTKAVKELQESTAGAXXXXXXXXXXXKATRSVTWADEKTDGDG----------- 1200
               ++   KA K     T  +           K +R+VTWADEKTD  G           
Sbjct: 296  AEEESHDDKAEK----CTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIE 351

Query: 1199 ---------QNLN--------------------------------ECRELKDKKGAVVTS 1143
                     +N N                                E RE++D K A    
Sbjct: 352  DMKEDPSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADML 411

Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDE-- 969
             +AD    ++++RFASAEACA AL +A+E VAS + E +DA+SEAG+IILP P   DE  
Sbjct: 412  CNADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGE 471

Query: 968  --EENGDVMETDPLQ--LKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMAL 801
              EE+ D   ++P Q  +KWP KPG           SW+D+PPE F+LTLSPF+ M+ AL
Sbjct: 472  PMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNAL 531

Query: 800  FSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGL 621
            F+W +SS+LAYIYG++ES HEEY  VNGREYP+KIV  DGRSSEIKQTLAG LARALPGL
Sbjct: 532  FTWTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGL 591

Query: 620  VAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPY 441
            VA+LRL  P+S+LEQGMGRLLDTMSF+D LP FRMKQW  I+LLFL+ALSV R+PALTP+
Sbjct: 592  VADLRLSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPH 651

Query: 440  MMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312
            MM RR+L  KV++ AQISAEE+E+MKDL+IPLGR P FS QSG
Sbjct: 652  MMYRRVLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694


>gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea]
          Length = 597

 Score =  592 bits (1525), Expect = e-166
 Identities = 337/639 (52%), Positives = 427/639 (66%), Gaps = 6/639 (0%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M KDE+LT+K+AV++LQ SLL+G K+ENQL+AAGSL+SR DYQD+VTER IA +CGYPLC
Sbjct: 1    MAKDEILTMKEAVYRLQTSLLEGAKNENQLSAAGSLMSRGDYQDLVTERVIAKICGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             N+L++ERP KGRYRISLKEHKVYD+QETY +CSS CLINSRAF+  L +ER+S L+P K
Sbjct: 61   SNNLNSERPSKGRYRISLKEHKVYDVQETYSFCSSGCLINSRAFSIGLPDERTSDLDPIK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            LNEVLK FDG   +S  NMG+N DLGLS L+I EK +  AG+V+  EWIGPS+AIDGYVP
Sbjct: 121  LNEVLKRFDGFGANSTPNMGRNEDLGLSQLRIMEKENIEAGEVSSNEWIGPSDAIDGYVP 180

Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPA 1521
            RR R+     S   KGE     S++         I   DM+FTS II Q+EYSI+KT   
Sbjct: 181  RRDRNSNTLSSKQKKGE-----SRYHLSLQVLTSIFPSDMSFTSVIIDQNEYSIAKTTTP 235

Query: 1520 VKAKEPKGKASSKEVNRQS-NPVQKPTAPLTNIQETRSKNKSK-NVITK-DDKLSLLENI 1350
              +K+  G+++ K +  +   P Q P + + NI+ +  +N SK N   K D KLS  E+ 
Sbjct: 236  SSSKQ-SGESNEKVIPEEDVRPKQSPDSSVANIKGSGFRNPSKRNGRAKIDAKLSASEDK 294

Query: 1349 AGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXK---ATRSVTWADEKTDGDGQNLNECR 1179
            A  S+N     + +  +S  GA                TR+V+WAD K + DGQNL    
Sbjct: 295  A--SENGGEPKLADGDKSAQGAAVLKSSLKTSYSKETTTRTVSWADVKAE-DGQNLETVC 351

Query: 1178 ELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVI 999
            E+ D  G  ++  ++            S E+   A T+A+++ A GK   +D        
Sbjct: 352  EMNDPHGGGISRETS------------SVESHKTASTKASKD-APGKFLLTDF------- 391

Query: 998  ILPPPHGEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFS 819
                         G++  T+ + LKWPPKPG           + YD PP+GFNL+LSPF 
Sbjct: 392  -----------NEGEIF-TEAI-LKWPPKPGFSEADLVESDDTLYDRPPDGFNLSLSPFC 438

Query: 818  TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 639
            T+F +LFSW+SSSSLAYIYGK++SFHEEY++ NGREYP K+V  DGRSSEIKQTL+  LA
Sbjct: 439  TLFNSLFSWISSSSLAYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEIKQTLSAALA 498

Query: 638  RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 459
            RALPG+V+ELRLP P+S LEQGMGRLLDTMSFIDPLP+ R KQW AIVLLFL+ALSVSRI
Sbjct: 499  RALPGVVSELRLPTPISILEQGMGRLLDTMSFIDPLPSLRTKQWQAIVLLFLNALSVSRI 558

Query: 458  PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLG 342
            PAL+ Y+ DRR  + KV+EGA I  EEFE+MKDLIIPLG
Sbjct: 559  PALSKYLEDRRASIQKVLEGAGIGVEEFEVMKDLIIPLG 597


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  589 bits (1519), Expect = e-165
 Identities = 342/673 (50%), Positives = 443/673 (65%), Gaps = 31/673 (4%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             ++L ++   +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+  LQ+ER S +NP K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            L E+LKLF+ +SLDS  NMG N D   SGL+IQEK ++  G+V +EEW+GPSNAI+GYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 1527
             R  D K    ++  G+  + GSK + ++P     D  S D + TSTIIT +EYS+SK  
Sbjct: 178  HR--DHKVMTLHSKDGKESKDGSKAK-IKPLGGGKDFFS-DFSITSTIITDEEYSVSKIS 233

Query: 1526 PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 1383
              +K       +K   G+   KE N Q   ++ P AP         +   SK ++K   T
Sbjct: 234  SGLKEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293

Query: 1382 KDDKLSLLENIAGPSQNDSTKAVKELQESTAG-------AXXXXXXXXXXXKATRSVTWA 1224
            K+   +L  +    S+N ST      +E   G                      RSVTWA
Sbjct: 294  KESTDNL-SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWA 352

Query: 1223 DEKTDGDG-QNLNECREL-KDKKGAVVTSHSAD-EEVGEESYRFASAEACAMALTQAAEE 1053
            DEKTD     NL E  E+ K K+ +  TS+  + +   E+  R  SAEACAMAL+QAAE 
Sbjct: 353  DEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEA 412

Query: 1052 VASGKSEASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWPP-------KPGXXXX 894
            + SG+SE SDAVSEAG+IILP P   +EE +     TDP+    P        K G    
Sbjct: 413  ITSGQSEVSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRS 467

Query: 893  XXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGR 714
                   SWYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+
Sbjct: 468  DLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGK 527

Query: 713  EYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDP 534
            EYP KIV  DGRSSEIKQTLAGCL RA+PGL +EL L  P+S LE GM  LLDTM+F+D 
Sbjct: 528  EYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDA 587

Query: 533  LPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLI 354
            LPAFRMKQW  IVLLF++ALSVSRIP+L  +M   R L  KV++ AQI ++E+EIM+D I
Sbjct: 588  LPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHI 647

Query: 353  IPLGRVPQFSTQS 315
            +PLGR  Q S ++
Sbjct: 648  LPLGRTAQLSDEN 660


>gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  585 bits (1507), Expect = e-164
 Identities = 351/713 (49%), Positives = 450/713 (63%), Gaps = 76/713 (10%)
 Frame = -2

Query: 2222 LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 2043
            ++VKD V+KLQL+LL+GIK ++ L  AGS+ISRSDY DVVTERTIAN+CGYPLC N+L +
Sbjct: 13   ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72

Query: 2042 E--RPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEV 1869
            +  RP KG YRISLKEHKVYDL ETYMYCSS C+I S+AFA SL EER   L+  K+  +
Sbjct: 73   DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132

Query: 1868 LKLFDGLSLD-SDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEW--------------- 1737
            L+ F  +  D  +V  G+ GDLG+S LKI+EK +T  G + +                  
Sbjct: 133  LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192

Query: 1736 IGPSNAIDGYVPRRVRDLKHPQSNNNKGERREVGSKHRHVRPNAA-DILSYDMNFTSTII 1560
            +GPSNAI+GYVP++ R  K   S  NK      GSK +  + ++  DI+  +M+F STII
Sbjct: 193  VGPSNAIEGYVPQKERISKPLGSKKNK-----EGSKGKDAKMSSGMDIIFNEMDFMSTII 247

Query: 1559 TQDEYSISKTVPAVKAK--EPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVI 1386
            T DEYS+SK  P+V     E K K S  +V    N          +++++R     KN  
Sbjct: 248  TSDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKN---------DSVKKSRQSKGGKNKN 298

Query: 1385 TKDDKLSLLE--NIAGPSQ---NDSTKAVKE------LQESTAGAXXXXXXXXXXXKATR 1239
             K D + + E  + +  SQ   N STK  KE       ++S               K  R
Sbjct: 299  VKKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNR 358

Query: 1238 SVTWADEKTDGDG-QNLNECRELK---DKKGAVVTSH--SADEEVG-------------- 1119
            SVTWADE  D  G +NL E RE++   +   A  + H  S + +VG              
Sbjct: 359  SVTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTK 418

Query: 1118 ---------------------EESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1002
                                 +E+    SAEACAMAL QAAE VASG+S+ S AVS AG+
Sbjct: 419  SKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGAGI 478

Query: 1001 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTL 831
            IILP P G DEE   E+ D++E++   L WP KPG           SW+D+PPEGF++TL
Sbjct: 479  IILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSVTL 537

Query: 830  SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 651
            SPF+TM+ +LF+W++SS+LAYIYG++ESFHEE+LSVNGREYP KIV+  GRSSEIK+TL 
Sbjct: 538  SPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKTLD 597

Query: 650  GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 471
               ARALPG+V+ELRLP P+S+LEQGMGR+L+TMSFID +PAFRMKQW  IVLLFL+ LS
Sbjct: 598  ESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEGLS 657

Query: 470  VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312
            V RIPALTP+M +RR+L  KV+E  QISAE++E+MKDLIIPLGR PQFS QSG
Sbjct: 658  VCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710


>ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 632

 Score =  573 bits (1476), Expect = e-160
 Identities = 334/666 (50%), Positives = 435/666 (65%), Gaps = 24/666 (3%)
 Frame = -2

Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061
            M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881
             ++L ++   +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+  LQ+ER S +NP K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701
            L E+LKLF+ +SLDS  NMG N D   SGL+IQEK ++  G+V +EEW+GPSNAI+GYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 1527
               RD K    ++  G+  + GSK + ++P     D  S D +FTSTIIT +EYS+SK  
Sbjct: 178  H--RDHKVMTLHSKDGKESKDGSKAK-IKPLGGGKDFFS-DFSFTSTIITDEEYSVSKIS 233

Query: 1526 PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 1383
              +K       +K   G+   K+ N Q   ++ P AP         +   SK ++K   T
Sbjct: 234  SGLKEMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293

Query: 1382 KDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXKATRSVTWADEKTDGD 1203
            K +    L +    S N ST      +E                         DEKTD  
Sbjct: 294  K-ESTDNLSDAPSTSNNRSTNFNLMTEEP-----------------------RDEKTDDA 329

Query: 1202 G-QNLNECREL-KDKKGAVVTSHSAD-EEVGEESYRFASAEACAMALTQAAEEVASGKSE 1032
               NL E  E+ K K+ +  TS+  + +   E+  R  SAEACAMAL+QAA+ + SG+SE
Sbjct: 330  SIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSE 389

Query: 1031 ASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWP-------PKPGXXXXXXXXXXX 873
             SDAVSEAG+IILP P   +EE +     TDP+    P        K G           
Sbjct: 390  VSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRSDLFDPSD 444

Query: 872  SWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIV 693
            SWYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+EYP KIV
Sbjct: 445  SWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIV 504

Query: 692  MPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMK 513
              DGRSSEIKQTLAGCL RA+PGL +EL L  P+S LE GM  LLDTM+F+D LPAFRMK
Sbjct: 505  SADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMK 564

Query: 512  QWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVP 333
            QW  IVLLF++ALSVSRIP+L  +M   R L  KV++ AQI ++E+EIM+D I+PLGR  
Sbjct: 565  QWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTA 624

Query: 332  QFSTQS 315
            Q S ++
Sbjct: 625  QLSDEN 630


>gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
          Length = 703

 Score =  568 bits (1465), Expect = e-159
 Identities = 330/672 (49%), Positives = 428/672 (63%), Gaps = 48/672 (7%)
 Frame = -2

Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 1718 IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554
            I+GYVP+R  +     P++N NK       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E           E+  E
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVCEVDK--------EEPME 510

Query: 962  NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSS 783
            +GD++E +   +KWP KPG           SW+D+PPEGF+LTLS F+TM+ ALF W++S
Sbjct: 511  DGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITS 570

Query: 782  SSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRL 603
            SSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRL
Sbjct: 571  SSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRL 630

Query: 602  PIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRI 423
            PIP+STLEQGMG L+DT+SF++ LPAFRMKQW  IVLLF+DALSV RIPALTP+M + R+
Sbjct: 631  PIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRM 690

Query: 422  LLPKVIEGAQIS 387
            LL KV++GAQIS
Sbjct: 691  LLHKVLDGAQIS 702


>gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
          Length = 708

 Score =  528 bits (1359), Expect = e-147
 Identities = 308/635 (48%), Positives = 401/635 (63%), Gaps = 51/635 (8%)
 Frame = -2

Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 1718 IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554
            I+GYVP+R  +     P++N NK       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518

Query: 962  ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSW 792
               +GD++E +   +KWP KPG           SW+D+PPEGF+LTLS F+TM+ ALF W
Sbjct: 519  PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578

Query: 791  VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 612
            ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +
Sbjct: 579  ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638

Query: 611  LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 507
            LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW
Sbjct: 639  LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673


>gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
          Length = 679

 Score =  528 bits (1359), Expect = e-147
 Identities = 308/635 (48%), Positives = 401/635 (63%), Gaps = 51/635 (8%)
 Frame = -2

Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079
            K++S  M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N 
Sbjct: 49   KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108

Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899
            CGYPLC N L +E   KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S
Sbjct: 109  CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719
             LN AKLN++L LF  L LD D ++GKNGDLG S L+I+E  +  A  V+L    GPSNA
Sbjct: 169  VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224

Query: 1718 IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554
            I+GYVP+R  +     P++N NK       ++GSK           ++ +++F  TII  
Sbjct: 225  IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278

Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416
            DEY ISK   + K  +    +S KE              +N +    + P+    +  ++
Sbjct: 279  DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338

Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302
              K   +  I KD  DK  +  + +   + DS+                    +A KE  
Sbjct: 339  NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398

Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143
               A              A      R VTWAD+K  D  G  NL E +E++  KG    S
Sbjct: 399  ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458

Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963
             SA++   +   RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP     D+EE
Sbjct: 459  GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518

Query: 962  ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSW 792
               +GD++E +   +KWP KPG           SW+D+PPEGF+LTLS F+TM+ ALF W
Sbjct: 519  PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578

Query: 791  VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 612
            ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +
Sbjct: 579  ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638

Query: 611  LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 507
            LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW
Sbjct: 639  LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673


Top