BLASTX nr result

ID: Sinomenium21_contig00018599 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00018599
         (3027 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   624   e-176
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   620   e-174
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...   560   e-156
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   542   e-151
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   536   e-149
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   536   e-149
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   535   e-149
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   530   e-147
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     525   e-146
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   517   e-143
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   516   e-143
ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun...   506   e-140
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...   506   e-140
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   491   e-136
gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus...   481   e-133
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...   474   e-130
ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c...   473   e-130
ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subuni...   472   e-130
ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobr...   446   e-122
ref|XP_006290171.1| hypothetical protein CARUB_v10003849mg [Caps...   434   e-119

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  624 bits (1610), Expect = e-176
 Identities = 348/656 (53%), Positives = 436/656 (66%), Gaps = 10/656 (1%)
 Frame = -1

Query: 2019 EQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNNP 1840
            +Q I +KDAVHKLQL L EGI +E QLF A SLMSRSDY+DVVT+R+IAN CGYPLC+N 
Sbjct: 4    DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63

Query: 1839 LLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKINE 1660
            L  ER RK HYR SL++ K  +  E YM+CS GC   SR+FAG LQ +RCS+ NSE+IN 
Sbjct: 64   LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123

Query: 1659 VLRLFEELGLNEKEDLQKKDDL--NELKMEEKVD--VGNVLLEDCI-PLNSIEGYIPEID 1495
            +LRLF E  L   + L K  DL  +ELK+ E V+   G V +ED I P N+IEGY+P+ D
Sbjct: 124  ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183

Query: 1494 CVLKTSPSGHGKGGTESNKATLQKGKGKLLNEMDLTGTIIAGDQFTGPKMS-----CVSK 1330
              LK     + K G++S+ + +  GK  +++EMD   TII  D+++  K S       S 
Sbjct: 184  RNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDTTSH 243

Query: 1329 ENGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPT 1150
                E K      +Q S  E    P +              SR     +F   E+   P+
Sbjct: 244  AKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPS 303

Query: 1149 QNVSDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEKLNNTESGYLC 970
            Q+ S++   + K+    E   Q      KSSLK +G K++ RSVTWADEK+++ +S   C
Sbjct: 304  QSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADEKMDSADSRDFC 363

Query: 969  TFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAVSNA 790
              +E  V ++      + DV D D +L  ASAEAC +ALSQAAEAV+SGE D+TDAVS A
Sbjct: 364  KVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDAVSEA 423

Query: 789  GIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSL 610
            GIIILP   D D GES +D ++LEPE VPLKWP KP +S+S +FD +DSWYD PPEGFSL
Sbjct: 424  GIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSL 483

Query: 609  TLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQS 430
            TLS FATMW ALF WITSSS+AYIYGRD S HEE++ VNG+EYP+KI  +DGRSSEIKQ+
Sbjct: 484  TLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIKQT 543

Query: 429  LAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLFIDA 250
            LAG L+R +P LV +LRL  P+S LE G+  LLDTMSF+D LPS   +QW V+ LLFIDA
Sbjct: 544  LAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLLFIDA 603

Query: 249  LSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSGG 82
            LSVCRIP L  HMTS R+L  KV D+AQ++ EEYE MKD+IIPLGR  QFSAQSGG
Sbjct: 604  LSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659



 Score =  166 bits (419), Expect = 8e-38
 Identities = 125/351 (35%), Positives = 173/351 (49%), Gaps = 23/351 (6%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L+LF E  L+  + LG+  DLG SELKI E    + GEVSME+W+GPSNAIEGYVP  D 
Sbjct: 125  LRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRDR 184

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXVN----EMDFTSTIIVGDQFCIPKFSYGSEQNGTEG 2680
            + KP                     N    EMDF STII  D++ I K S G +   +  
Sbjct: 185  NLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDTTSHA 244

Query: 2679 RPKESKQ-----NQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQTGELEMFYGPTQ 2515
            + KE K+     +Q  + + ++  +QN SE+ L ESK        K +    E+   P+Q
Sbjct: 245  KSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPSQ 304

Query: 2514 NVSD-RGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMSNDN 2338
            + S+  GV+G K+E   E  A    T  KSSLKPS   ++  R VTW+DE K+D+  + +
Sbjct: 305  SGSELNGVKG-KEEYHTENAAQLGPTKPKSSLKPSG-GKKVIRSVTWADE-KMDSADSRD 361

Query: 2337 LCNVHNTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFGESDVTN--- 2167
             C V       +      +  V   D+ L  AS   C +A SQ A+AV  GE+D+T+   
Sbjct: 362  FCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDAVS 421

Query: 2166 ----------DDASKGDSQKNEAVNEPLVVLLEQPRKPDALLSGSFDVKDS 2044
                       D  +G+S K+  + EP  V L+ P KP    S  FD  DS
Sbjct: 422  EAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDS 472


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  620 bits (1599), Expect = e-174
 Identities = 346/656 (52%), Positives = 435/656 (66%), Gaps = 10/656 (1%)
 Frame = -1

Query: 2019 EQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNNP 1840
            +Q I +KDAVHKLQL L EGI +E QLF A SLMSRSDY+DVVT+R+IAN CGYPLC+N 
Sbjct: 4    DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63

Query: 1839 LLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKINE 1660
            L  ER RK HYR SL++ K  +  E YM+CS GC   SR+FAG LQ +RCS+ NSE+IN 
Sbjct: 64   LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123

Query: 1659 VLRLFEELGLNEKEDLQKKDDL--NELKMEEKVD--VGNVLLEDCI-PLNSIEGYIPEID 1495
            +LRLF E  L   + L K  DL  +ELK+ E V+   G V +ED I P N+IEGY+P+ D
Sbjct: 124  ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183

Query: 1494 CVLKTSPSGHGKGGTESNKATLQKGKGKLLNEMDLTGTIIAGDQFTGPKMS-----CVSK 1330
              LK     + K G++S+ + +  GK  +++EMD   TII  D+++  K S       S 
Sbjct: 184  RNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDTTSH 243

Query: 1329 ENGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPT 1150
                E K      +Q S  E    P +              SR     +F   E+   P+
Sbjct: 244  AKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPS 303

Query: 1149 QNVSDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEKLNNTESGYLC 970
            Q+ S++   + K+    E   Q     LKS LK +G K+++RSVTWADEK+++ +S   C
Sbjct: 304  QSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADEKMDSADSRDFC 363

Query: 969  TFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAVSNA 790
              +E  V ++      + DV D D +L  ASAEAC +ALSQAAEAV+SGE D+TDAVS A
Sbjct: 364  KVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDAVSEA 423

Query: 789  GIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSL 610
             IIILP   D D GES +D ++LEPE VPLKWP KP +S+S +FD +DSWYD PPEGFSL
Sbjct: 424  RIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSL 483

Query: 609  TLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQS 430
            TLS FATMW ALF WITSSS+AYIYGRD S HEE++ VNG+EYP+KI  +DGRSSEIKQ+
Sbjct: 484  TLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIKQT 543

Query: 429  LAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLFIDA 250
            LAG LAR +P LV +LRL  P+S LE G+  LLDTMSF+D LPS   +QW V+ LLFIDA
Sbjct: 544  LAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLLFIDA 603

Query: 249  LSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSGG 82
            LSVC+IP L  HM S R+L  KV D+AQ++ EEYE MKD+IIPLGR  QFSAQSGG
Sbjct: 604  LSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659



 Score =  166 bits (420), Expect = 6e-38
 Identities = 124/351 (35%), Positives = 173/351 (49%), Gaps = 23/351 (6%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L+LF E  L+  + LG+  DLG SELKI E    + GEVSME+W+GPSNAIEGYVP  D 
Sbjct: 125  LRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRDR 184

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXVN----EMDFTSTIIVGDQFCIPKFSYGSEQNGTEG 2680
            + KP                     N    EMDF  TII  D++ I K S G +   +  
Sbjct: 185  NLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDTTSHA 244

Query: 2679 RPKESKQ-----NQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQTGELEMFYGPTQ 2515
            + KE K+     +Q  + + ++  +QN SE+ L ESK        K +    E+   P+Q
Sbjct: 245  KSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPSQ 304

Query: 2514 NVSD-RGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMSNDN 2338
            + S+  GV+G K+E   E  A    T LKS LKPS   ++ +R VTW+DE K+D+  + +
Sbjct: 305  SGSELNGVKG-KEEYHTENAAQLGPTKLKSCLKPSG-GKKVTRSVTWADE-KMDSADSRD 361

Query: 2337 LCNVHNTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFGESDVTN--- 2167
             C V       +      +  V   D+ L  AS   C +A SQ A+AV  GE+D+T+   
Sbjct: 362  FCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDAVS 421

Query: 2166 ----------DDASKGDSQKNEAVNEPLVVLLEQPRKPDALLSGSFDVKDS 2044
                       D  +G+S K+  + EP  V L+ P KP    S  FD  DS
Sbjct: 422  EARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDS 472


>ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
            gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative
            isoform 1 [Theobroma cacao]
          Length = 739

 Score =  560 bits (1442), Expect = e-156
 Identities = 326/718 (45%), Positives = 436/718 (60%), Gaps = 50/718 (6%)
 Frame = -1

Query: 2088 KPDALLSGSFDVKDSRMSPASVKEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRS 1909
            KP +L   S +    + S +  KEQSI++ +AVHK+QL L +GI  EKQL  + SL+SRS
Sbjct: 35   KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94

Query: 1908 DYDDVVTKRSIANKCGYPLCNNPLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATY 1729
            DY+DVVT+R+I+N CGYPLC NPL  E  RK  YR SL++ K  + +E YMFCS  C   
Sbjct: 95   DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154

Query: 1728 SRAFAGGLQIKRCSLYNSEKINEVLRLFEELGLNEKEDLQKKDDLN----ELKMEEKVDV 1561
            SRAFAG LQ +RCS+ N  K+N++L LF +L L++  DL K  DL      +K  E+V  
Sbjct: 155  SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKA 213

Query: 1560 GNVLLEDCIPLNSIEGYIPEIDCVLKTSPSGHGKGGT-ESNKATLQKGKGK--------- 1411
             +V L    P N+IEGY+P+ + + K +P  + K    +S+ + L   K +         
Sbjct: 214  EDVSLAG--PSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271

Query: 1410 ----------------------------------LLNEMDLTGTIIAGDQFTGPKMSCVS 1333
                                              ++NEMD T  II  D++T  KM   S
Sbjct: 272  AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331

Query: 1332 KENGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAE-DTQFGKQEMFHG 1156
            K++  +           SN + +   G               S   E D+   +      
Sbjct: 332  KQSCFD-----------SNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 1155 PTQNVSDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEK-LNNTESG 979
              Q+  D +  E+++    +K   S+E  LKSSLK  GAK+L+R VTWAD+K  +N  +G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 978  YLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAV 799
             LC  +E   ++   E S +++    D  L   SAEAC +ALS+AAEAV+SG+ DVTDAV
Sbjct: 441  NLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAV 500

Query: 798  SNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEG 619
               G+IILP L + D  E  ED ++LEPE  P+KWP+KP + +S +F+ EDSW+DAPPEG
Sbjct: 501  YENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEG 560

Query: 618  FSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEI 439
            FSLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++ +NG+EYPRKI   DGRSSEI
Sbjct: 561  FSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEI 620

Query: 438  KQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLF 259
            K++LA  ++R +P +VT+LRL  P+S LE GM  L+DT+SFM+ LP+   +QW V+ LLF
Sbjct: 621  KETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLF 680

Query: 258  IDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
            IDALSVCRIP L  HMT+ R+LL+KVLD AQ++ EEYE MKD+IIPLGR   FSAQSG
Sbjct: 681  IDALSVCRIPALTPHMTNGRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738



 Score =  140 bits (353), Expect = 4e-30
 Identities = 119/383 (31%), Positives = 181/383 (47%), Gaps = 55/383 (14%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L LF +L LD  + LG+  DLGFS L+I+E   V+  +VS+    GPSNAIEGYVP  + 
Sbjct: 179  LSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQREL 234

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXV-------NEMDFTSTIIVGDQFCIPKFSYGSEQNG 2689
             SKP                            NE+DF  TII+ D++ I K   GS + G
Sbjct: 235  ISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISK-KPGSFKQG 293

Query: 2688 TEGRPKESKQNQFIINDV------------TSATMQNVSETTLMESKLEE---------- 2575
               +   SK+  F+IN++            T + M + S+ +  +S L+E          
Sbjct: 294  DRTK-LSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDS 352

Query: 2574 -------GSAASKTQTGELEMFYGPTQNVSDRGVEGS----KQEVSGEKLALSNETTLKS 2428
                   GS+++  +     +    T+NV   G++ S    ++E   +K   S+ET LKS
Sbjct: 353  EDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKS 412

Query: 2427 SLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVHNTGNISDSVKSPINSSVE--GVDHC 2254
            SLK S+ A++ +R VTW+D++K DN  N NLC V     +     S I+ S E  G D+ 
Sbjct: 413  SLK-SAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGD--SEISGSAEDGGDDNM 469

Query: 2253 LFLASEAVCELAPSQVAKAVTFGESDVTND-------------DASKGDSQKNEAVNEPL 2113
            L   S   C +A S+ A+AV  G+SDVT+              +  K +  ++  + EP 
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPE 529

Query: 2112 VVLLEQPRKPDALLSGSFDVKDS 2044
               ++ P+KP    S  F+ +DS
Sbjct: 530  TAPVKWPKKPGIPHSDMFNPEDS 552


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  542 bits (1396), Expect = e-151
 Identities = 314/668 (47%), Positives = 423/668 (63%), Gaps = 21/668 (3%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            K +++ +KDAVHKLQL L EGI  E QL  A SL+SRSDY DVVT+RSIAN CGYPLC+N
Sbjct: 3    KGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSN 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  ER RK HYR SL++ K  +  E YM+CS  C   S AFAG LQ +R S  N  K+N
Sbjct: 63   SLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLN 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDL--NELKMEEKVDV--GNVLLEDCI-PLNSIEGYIPEI 1498
            +VL LF+ L L+  +D+++  D   ++LK++EKVD+  G V LE+ + P N+IEGY+P+ 
Sbjct: 123  QVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVPQR 182

Query: 1497 DCVLKTSPSGHGKGGTESNKATLQKGKGKLLNEMDLTGTIIAGDQFTGPKMSC-VSKENG 1321
            D  +  +   +   G+++  A LQ  K  +LNE D + TII  D+++  K    V+ ++ 
Sbjct: 183  DRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNADSN 242

Query: 1320 AETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFH------ 1159
             + K    K       + +   G+               ++ ++T+F K + F+      
Sbjct: 243  VKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETE-KSDKNTRFLKVDKFNSGEVSS 301

Query: 1158 GPTQNVSDIAVKE----SKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEKLNN 991
            GP+Q+  D+  K     S  G      +    + LKSSLK + +K++SRSVTWADE ++ 
Sbjct: 302  GPSQH--DVKNKSVLIMSDDGRKY--ASHGEHDKLKSSLKSSNSKKMSRSVTWADESIDG 357

Query: 990  -----TESGYLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSS 826
                 TES    +  E+         S ++D+E+ D S    SAEAC  ALSQAAEAV+S
Sbjct: 358  GIGKKTESSSKISEYES----QAYGGSASTDMEENDDSYRFESAEACAAALSQAAEAVAS 413

Query: 825  GEFDVTDAVSNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDRED 646
            G  DV DAVS AGI+ILP   + D    QE D +L+ E  PLKWP KP + N  +F+ ED
Sbjct: 414  GS-DVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFESED 472

Query: 645  SWYDAPPEGFSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIF 466
            SWYD+PPEGF++TLS F TM+ +LF WI+SSSLA+IYG D S++EE++ +NG+EYPRKI 
Sbjct: 473  SWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRKIV 532

Query: 465  SSDGRSSEIKQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTE 286
             SDGRS+EIKQ+LAG LAR +P LV +LRL  P+S LE GM  LL+TMSF+DPLP+   +
Sbjct: 533  LSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRMK 592

Query: 285  QWHVLALLFIDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRL 106
            QW ++ LLF+DALSVCRIP L  +MT  R    KVLD AQ++  EYE MKD+IIPLGR  
Sbjct: 593  QWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLGRVP 652

Query: 105  QFSAQSGG 82
            QFS QSGG
Sbjct: 653  QFSMQSGG 660



 Score =  109 bits (272), Expect = 9e-21
 Identities = 105/359 (29%), Positives = 157/359 (43%), Gaps = 31/359 (8%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L LF+ L L   + +    D G S+LKI+EK +++ GEVS+E WMGPSNAIEGYVP  D 
Sbjct: 125  LNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVPQRDR 184

Query: 2847 SSKPP----TDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKF--------SYG 2704
            S  P      +                 +NE DF+STII  D++ + KF        +  
Sbjct: 185  SVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNADSNVK 244

Query: 2703 SEQNGTEGRPKESKQNQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQTGELEMFYG 2524
             ++   + R K    + +I+     A      E T    K        K  +GE+    G
Sbjct: 245  FKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSS--G 302

Query: 2523 PTQ-NVSDRGV-----EGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERK 2362
            P+Q +V ++ V     +G K    GE         LKSSLK SS++++ SR VTW+DE  
Sbjct: 303  PSQHDVKNKSVLIMSDDGRKYASHGE------HDKLKSSLK-SSNSKKMSRSVTWADESI 355

Query: 2361 IDNMSNDNLCNVHNTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFGE 2182
               +      +   +   S +     ++ +E  D      S   C  A SQ A+AV  G 
Sbjct: 356  DGGIGKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQAAEAVASG- 414

Query: 2181 SDVTNDDASKG------DSQKNEAVNEPLVVLLE-------QPRKPDALLSGSFDVKDS 2044
            SDV +  +  G        + +EA+ +    +L+        PRKP       F+ +DS
Sbjct: 415  SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFESEDS 473


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  536 bits (1381), Expect = e-149
 Identities = 313/666 (46%), Positives = 410/666 (61%), Gaps = 20/666 (3%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            K+Q I++KDAV KLQL+L EGI  E QLF A SL+SRSDY+DVVT+RSI   C YPLC N
Sbjct: 3    KDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLCCN 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  ERPRK  YR SL++ K  +  E YMFCS  C   S+AFAG L+ KRC   + +K+N
Sbjct: 63   ALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQKLN 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDL--NELKMEEKVD-VGNVLLEDCI-PLNSIEGYIPEID 1495
             +LRLF    L   E+  K  +L  + L++++K + V  V LE  + P N+IEGY+P+  
Sbjct: 123  NILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVTEVSLEQWVGPSNAIEGYVPK-- 180

Query: 1494 CVLKTSPSGHGKGGTESNKATLQKGKG--KLLN-EMDLTGTIIAGDQFTGPKMSCVSKEN 1324
                    G  K   + +KA+  K  G   L+N E D   TII  D+++  K+S    + 
Sbjct: 181  -KRDNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTDA 239

Query: 1323 GAETK---RAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGK--QEMFH 1159
              + +    A L+  +  +HE +                   S + +D +  K  + +  
Sbjct: 240  TVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVLK 299

Query: 1158 GPTQNV--------SDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADE 1003
            G T  V        S+    + ++   +EK   S     KSSLK  G K+L RSVTWAD+
Sbjct: 300  GKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWADK 359

Query: 1002 KLNNTESGYLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSG 823
            K++   S  LC F+E G I+   + + N DV D +  L   SAEAC +ALSQAAEAV+SG
Sbjct: 360  KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVASG 419

Query: 822  EFDVTDAVSNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDS 643
            + D  DAVS AGIIILP   +     + +D ++LE + V LKWP KP +S+  LF  +DS
Sbjct: 420  DSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISDFDLFASDDS 479

Query: 642  WYDAPPEGFSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFS 463
            W+DAPPEGFSLTLS FAT+W A F WITSSSLAYIYGRD S +EEF+ V+G+EYP KI  
Sbjct: 480  WFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKIVL 539

Query: 462  SDGRSSEIKQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQ 283
            SDGRSSEIKQ+LA  LAR +P +V EL+L  P+S LE GM CLLDTMSF+DPLP    +Q
Sbjct: 540  SDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDPLPGFRFKQ 599

Query: 282  WHVLALLFIDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQ 103
            W V+ALLF+DALSVCRIP L  +MT  R L +KVL  +Q+  EEY  +KD+I+PLGR   
Sbjct: 600  WQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLIVPLGRAPH 659

Query: 102  FSAQSG 85
            FS+QSG
Sbjct: 660  FSSQSG 665



 Score =  109 bits (272), Expect = 9e-21
 Identities = 102/347 (29%), Positives = 152/347 (43%), Gaps = 32/347 (9%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVP---- 2860
            L+LF    L+  E  G++ +LG S L+I++K    V EVS+E W+GPSNAIEGYVP    
Sbjct: 125  LRLFGNSNLEPMENSGKDGELGLSSLRIQDKTET-VTEVSLEQWVGPSNAIEGYVPKKRD 183

Query: 2859 NMDFSSKPPTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNGTEG 2680
            N    S+  T                   +E DF STII+ D++ + K S G      + 
Sbjct: 184  NGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTDATVDH 243

Query: 2679 RPKES-------KQNQFII---NDVTSATMQNVSETTLMESKLEEGSAAS-----KTQTG 2545
            + K +       + +  ++   +D+   +    S   L  SK ++  A S     K +T 
Sbjct: 244  QIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVLKGKTN 303

Query: 2544 ELEMFYGPTQNVSDRGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDER 2365
             +        + S+      ++++  EK   S  T  KSSLK S+  ++  R VTW+D +
Sbjct: 304  RVAA--NDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLK-SNGKKKLGRSVTWAD-K 359

Query: 2364 KIDNMSNDNLCNVHNTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFG 2185
            KID   + +LC     GNI        N  V   +  L   S   C +A SQ A+AV  G
Sbjct: 360  KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVASG 419

Query: 2184 ESDVTNDDASKGD---SQKNEAVNEPLV----------VLLEQPRKP 2083
            +SD  +  +  G         AV E  V          V L+ PRKP
Sbjct: 420  DSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKP 466


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  536 bits (1381), Expect = e-149
 Identities = 315/660 (47%), Positives = 414/660 (62%), Gaps = 14/660 (2%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            KE+S+++KD V+KLQLSL EGI +E QL  A SLMSRSDY+DVV +RSI+N CGYPLCNN
Sbjct: 3    KEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLCNN 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  +RP K  YR SL++ +  + +E YM+CS  C   SRAF+  LQ KRCS+ N  K+N
Sbjct: 63   SLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIKLN 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDL--NELKMEEK--VDVGNVLLEDCI-PLNSIEGYIPEI 1498
            E+LR F +L L + E L +  DL  + LK++EK   +VG V LE+ I P N+IEGY+P+ 
Sbjct: 123  EILRKFNDLTL-DSEGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVPQG 181

Query: 1497 DCVLKTSPSGHGKGGTESNKATLQKGKGK---LLNEMDLTGTIIAGDQFTGPKMSCVSKE 1327
            D      P+   K   E  KA  +K   K     ++ D T TII  D+++  K       
Sbjct: 182  D----RDPNPSLKNHKEGLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTS 237

Query: 1326 NGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPTQ 1147
              ++ K   L+      HE + A                  +A+  ++  ++E       
Sbjct: 238  TASDIK---LQAQTGKGHEGLNAQ-------LSSLRKQDSIKASRKSKGRRKEKVIKEQL 287

Query: 1146 NVSDIAVKESKQGASVEKTTQS------NENTLKSSLKLTGAKRLSRSVTWADEKLNNTE 985
            N  D+    S   A  E  +Q+      NE+ LK SLK +GAKR +RSVTWADE+++N  
Sbjct: 288  NFQDLP-SSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADERVDNAG 346

Query: 984  SGYLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTD 805
            S  LC  QE     ++ E S +++  D    L   SAEAC +ALSQAAEAV+SG+ DV  
Sbjct: 347  SRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNK 406

Query: 804  AVSNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPP 625
            A+S AGII+LP   D   G + E ++++E E   LKWP KP +  S LFD EDSWYDAPP
Sbjct: 407  AMSEAGIIVLPPSQDLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPP 466

Query: 624  EGFSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSS 445
            EGFSLTLS FATMW ALF W+TSSSLAYIYGRD S+HE+++ VNG+EYPRKI   DGRSS
Sbjct: 467  EGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSS 526

Query: 444  EIKQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLAL 265
            EI+ +    LAR  P LV  LRL  P+S LE G   LL+TMSF+D LP+  T+QW V+AL
Sbjct: 527  EIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIAL 586

Query: 264  LFIDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
            LFI+ALSVCRIP L  +MTS R++L++VLD A ++ EEY+ MKD ++PLGR  Q  A+SG
Sbjct: 587  LFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ--ARSG 644



 Score =  141 bits (355), Expect = 2e-30
 Identities = 113/347 (32%), Positives = 164/347 (47%), Gaps = 19/347 (5%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L+ F +L LD  EGLGR  DLG S LKI+EK+   VG+VS+E W+GPSNAIEGYVP  D 
Sbjct: 125  LRKFNDLTLDS-EGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVPQGDR 183

Query: 2847 SSKPPT---DXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYG--SEQNGTE 2683
               P                        ++ DFTSTII  D++ I K   G  S  +  +
Sbjct: 184  DPNPSLKNHKEGLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTSTASDIK 243

Query: 2682 GRPKESKQNQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQTGELEMFYGPTQNVSD 2503
             + +  K ++ +   ++S   Q+    ++  S+  +G    K    +L     P+ +   
Sbjct: 244  LQAQTGKGHEGLNAQLSSLRKQD----SIKASRKSKGRRKEKVIKEQLNFQDLPSSSYYT 299

Query: 2502 RGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVH 2323
               E   Q      L   NE+ LK SLK SS A+R +R VTW+DER +DN  + NLC V 
Sbjct: 300  AEAEDISQATGAANL---NESVLKPSLK-SSGAKRSNRSVTWADER-VDNAGSRNLCEVQ 354

Query: 2322 NTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFGESDV---------- 2173
                 ++S +   +++     H L   S   C +A SQ A+AV  G++DV          
Sbjct: 355  EMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAGII 414

Query: 2172 ----TNDDASKGDSQKNEAVNEPLVVLLEQPRKPDALLSGSFDVKDS 2044
                + D    G+ +KN+ + E     L+ P KP    S  FD +DS
Sbjct: 415  VLPPSQDLGQGGNVEKNDMI-EQESASLKWPTKPGIPQSDLFDPEDS 460


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  535 bits (1378), Expect = e-149
 Identities = 317/672 (47%), Positives = 424/672 (63%), Gaps = 25/672 (3%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            K +++ +KDAVHKLQL L EGI  E QL  A SL+SRSDY DVVT+RSIAN CGYPLC+N
Sbjct: 3    KGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSN 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  ER RK HYR SL++ K  +  E YM+CS  C   S AFAG LQ +R S  N  K+N
Sbjct: 63   SLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLN 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDL--NELKMEEKVDV---GNVLLEDCI-PLNSIEGYIPE 1501
            +VL LF+ L L+  ED+++  DL  ++LK++EKVDV   G V LE+ + P N+IEGY+P+
Sbjct: 123  QVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYVPQ 182

Query: 1500 IDCVLKTSPSGHGKGGTESNKATLQKGKGKLLNEMDLTGTIIAGDQFTGPK----MSCVS 1333
             D  +  +   +   G ++  A LQ  K  +LNE D + TII  D+++  K    ++ VS
Sbjct: 183  RDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAVS 242

Query: 1332 KENGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFH-- 1159
             E   E + A  +Y    +  +IL                   ++ ++T+F K + F+  
Sbjct: 243  SEKFKEAQ-AKTRYKVRDDDVSILGK---RVDALQLRSGEETEKSDKNTRFLKVDKFNSG 298

Query: 1158 ----GPTQNVSDIAVKE----SKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADE 1003
                GP+Q+  D+  K     S  G       + ++  LKSSLK + +K++S+SVTWADE
Sbjct: 299  EVSSGPSQH--DVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWADE 356

Query: 1002 KLNN-----TESGYLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAE 838
             ++      TES    +  E          S ++D+E+ D S    SAEAC  ALSQAAE
Sbjct: 357  IIDGGIGKKTESSSKISEYEN----QAYGGSASTDMEEDDDSYRFESAEACAAALSQAAE 412

Query: 837  AVSSGEFDVTDAVSNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLF 658
            AV+SG  DV DAVS AGI+ILP   + D    QE + +L+ E  PLKWP KP + N  +F
Sbjct: 413  AVASGS-DVPDAVSKAGIVILPTSQEVDEAILQETE-MLDIEPAPLKWPRKPGMPNYDVF 470

Query: 657  DREDSWYDAPPEGFSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYP 478
            + ED WYD PPEGF++TLS FATM+ +LF WI+SSSLA+IYG D +++EE++ +NG+EYP
Sbjct: 471  ESEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYP 530

Query: 477  RKIFSSDGRSSEIKQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPS 298
             KI  SDG S+EIKQ+LAG LAR +P LV +LRL  P+S LE GM  LL+TMSF+DPLP+
Sbjct: 531  HKIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPA 590

Query: 297  LSTEQWHVLALLFIDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPL 118
               +QW ++ LLF+DALSVCRIP L  +MT  R  L KVLD AQ++  EYE MKD+IIPL
Sbjct: 591  FRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPL 650

Query: 117  GRRLQFSAQSGG 82
            GR  QFS QSGG
Sbjct: 651  GRVPQFSMQSGG 662



 Score =  105 bits (263), Expect = 1e-19
 Identities = 105/346 (30%), Positives = 155/346 (44%), Gaps = 31/346 (8%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVG-EVSMENWMGPSNAIEGYVPNMD 2851
            L LF+ L L   E +    DLG S+LKI+EK +V+ G EVS+E WMGPSNAIEGYVP  D
Sbjct: 125  LNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYVPQRD 184

Query: 2850 FSSKPP----TDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKF--------SY 2707
             S  P      +                 +NE DF+STII  D++ + KF        S 
Sbjct: 185  RSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAVSSE 244

Query: 2706 GSEQNGTEGRPKESKQNQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQTGELEMFY 2527
              ++   + R K    +  I+     A      E T    K        K  +GE+    
Sbjct: 245  KFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSS-- 302

Query: 2526 GPTQ-NVSDRGV-----EGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDER 2365
            GP+Q +V ++ V     +G K    GE     ++  LKSSLK SS++++ S+ VTW+DE 
Sbjct: 303  GPSQHDVKNKSVLIMSDDGRKYASHGE----HDKQLLKSSLK-SSNSKKMSQSVTWADEI 357

Query: 2364 KIDNMSNDNLCNVHNTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFG 2185
                +      +   +   + +     ++ +E  D      S   C  A SQ A+AV  G
Sbjct: 358  IDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAVASG 417

Query: 2184 ESDVTNDDASKG------DSQKNEAVNEPLVVL------LEQPRKP 2083
             SDV +  +  G        + +EA+ +   +L      L+ PRKP
Sbjct: 418  -SDVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEPAPLKWPRKP 462


>ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Glycine max]
          Length = 706

 Score =  530 bits (1365), Expect = e-147
 Identities = 317/711 (44%), Positives = 426/711 (59%), Gaps = 65/711 (9%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            K++ +++KDAV KLQ+SL EGI +E QLF A SLMSRSDY+D+VT+RSI N CGYPLC+N
Sbjct: 3    KDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLCSN 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  +RPRK  YR SL++ K  +  E YMFC   C   S+AFAG LQ +RCS  + EK+N
Sbjct: 63   ALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEKLN 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDD--LNELKMEEKVDV--GNVLLEDCI-PLNSIEGYIPEI 1498
             +L LFE L L   E+LQK +D  L++LK++EK +   G V LE    P N+IEGY+P+ 
Sbjct: 123  NILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVPK- 181

Query: 1497 DCVLKTSPSGHGKGGTESN-KATLQKGKGK-------LLNEMDLTGTIIAGDQFTGPKM- 1345
                   P  H   G   N K   + G GK       + +EM    TII  D ++  K+ 
Sbjct: 182  -------PRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKVL 234

Query: 1344 --------------SCVSKENG-----AETKRAGLKYNQFSNHETILAPGRXXXXXXXXX 1222
                          + + K+ G        K  G   +  S+ ++ L  G          
Sbjct: 235  PGQRDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQ 294

Query: 1221 XXXXXSRAAEDTQFGKQEMF-----------------------HGPTQNV--SDIAVKES 1117
                  +++ D    K++++                        G    V  +D A   +
Sbjct: 295  SCEAALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSN 354

Query: 1116 KQGASVEKTTQSNE-----NTLKSS-LKLTGAKRLSRSVTWADEKLNNTESGYLCTFQET 955
               A+VE+  Q  +     NT   S LK  G K+LSR+VTWAD+K+N+T S  LC F+  
Sbjct: 355  LDPANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADKKINSTGSKDLCGFKNF 414

Query: 954  GVIQDTVENSRNS-DVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAVSNAGIII 778
            G I++  +++ NS DV + + +L  ASAEAC++ALS A+EAV+SG+ DV+DAVS AGIII
Sbjct: 415  GDIRNESDSAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGIII 474

Query: 777  LPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSLTLSS 598
            LP  HD     + ED ++L+ + V +KWP KP +S +  F+ +DSW+DA PEGFSLTLS 
Sbjct: 475  LPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGFSLTLSP 534

Query: 597  FATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQSLAGV 418
            FATMW  LF WITSSSLAYIYGRD S  EE++ VNG+EYP K+  +DGRSSEIKQ+LA  
Sbjct: 535  FATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIKQTLASC 594

Query: 417  LARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLFIDALSVC 238
            LAR +P LV  LRL  P+S +E GM CLL+TMSF+D LP+  T+QW V+ALLFIDALSVC
Sbjct: 595  LARALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSVC 654

Query: 237  RIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
            R+P L  +MT  R   ++VL  +Q+  EEYE +KD+ +PLGR    SAQSG
Sbjct: 655  RLPALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSG 705



 Score =  107 bits (266), Expect = 4e-20
 Identities = 111/400 (27%), Positives = 168/400 (42%), Gaps = 72/400 (18%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPN-MD 2851
            L LFE L L+  E L + ED G S+LKI+EK     GEVS+E W GPSNAIEGYVP   D
Sbjct: 125  LSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVPKPRD 184

Query: 2850 FSSK---PPTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNGTEG 2680
              SK                         +EM F STII+ D + + K   G        
Sbjct: 185  HDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKVLPGQRDATAHH 244

Query: 2679 RPKES--------------KQNQFIINDVTSATMQNV------SETTLMES--------- 2587
            + K +              +++   I D++S+   ++       E  L +S         
Sbjct: 245  QIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQSCEAALKSSP 304

Query: 2586 -------------------KLEEGSAASKTQTGELEMFY------GPTQNVSDRGVEGSK 2482
                                +E+  +A K+   + +M          T N+    VE   
Sbjct: 305  DCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSNLDPANVE--- 361

Query: 2481 QEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVHNTGNISD 2302
            ++   EK   S  T  KSSLK S+  ++ SR VTW+D +KI++  + +LC   N G+I +
Sbjct: 362  EKFQVEKAGGSLNTKPKSSLK-SAGEKKLSRTVTWAD-KKINSTGSKDLCGFKNFGDIRN 419

Query: 2301 SVKSPINS-SVEGVDHCLFLASEAVCELAPSQVAKAVTFGESDVTN-------------D 2164
               S  NS  V   +  L  AS   C +A S  ++AV  G+SDV++              
Sbjct: 420  ESDSAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGIIILPPPH 479

Query: 2163 DASKGDSQKNEAVNEPLVVLLEQPRKPDALLSGSFDVKDS 2044
            DA +  + ++  + +   V ++ PRKP    +  F+  DS
Sbjct: 480  DAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDS 519


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  525 bits (1351), Expect = e-146
 Identities = 317/719 (44%), Positives = 417/719 (57%), Gaps = 77/719 (10%)
 Frame = -1

Query: 2010 ITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNNPLLK 1831
            I++KD V++LQLSL +G+H E QLF A S+MSRSDY+DVVT+RSIAN CGYPLC NPL  
Sbjct: 9    ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68

Query: 1830 ERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKINEVLR 1651
            +RPRK  YR SL++ K  +  E YM+CS  C   SR FA  L+ +RC++ +S +I+ VLR
Sbjct: 69   DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128

Query: 1650 LFEEL-GLNEKEDLQKKDDL--NELKMEEKVD--VGNVLLEDCI-PLNSIEGYIPEIDCV 1489
            +FE+  GL  +    K  DL  ++LK+EEK +  VG+V LE    P N+IEGY+ + +  
Sbjct: 129  MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188

Query: 1488 LKTSPSGHGKGGTESNKATLQKGKGKLLNEMDLTGTIIAGDQFTGPKMSCVSKENGAETK 1309
             K   S   K G+++N   L       +N+MD   TII  D++T  K     K+ G ++K
Sbjct: 189  PKELGSKSPKRGSKANNTVL-------INDMDFVSTIITEDEYTVSKTPSSLKKTGLDSK 241

Query: 1308 RAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPTQNVSDIA 1129
                        E ILA                  + A   +F   E  + P  NVS + 
Sbjct: 242  --------VREQEEILA------------------KKAMGNEFAVLETSYAPASNVSRVG 275

Query: 1128 V-----------------KESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEK 1000
            +                   +++ +  +K  +  E ++KSSLK +  K+LSR+VTWADEK
Sbjct: 276  LVFEDVTSSLRAGSCLSSARAEEESHDDKAEKCTEASIKSSLKPSRKKKLSRTVTWADEK 335

Query: 999  LNNTESGYLCTFQETGVIQD---TVENSR------------------------------- 922
             +++    LC  +E   +++    VEN                                 
Sbjct: 336  TDSSGGRKLCEIREIEDMKEDPSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDV 395

Query: 921  -----------------NSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAVSN 793
                             N+D  + D +   ASAEAC  AL +A+EAV+S E +V DA+S 
Sbjct: 396  CEVREIEDAKEAADMLCNADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSE 455

Query: 792  AGIIILPQLHDYDGGESQE---DDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPE 622
            AGIIILP+  + D GE  E   DD   EPE  P+KWP+KP   +S LFD EDSW+DAPPE
Sbjct: 456  AGIIILPRPENGDEGEPMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPE 515

Query: 621  GFSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSE 442
             FSLTLS FA MW ALF W TSS+LAYIYGRD S HEE+ +VNG+EYP KI   DGRSSE
Sbjct: 516  DFSLTLSPFAKMWNALFTWTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSE 575

Query: 441  IKQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALL 262
            IKQ+LAG LAR +P LV +LRL  P+S+LE GM  LLDTMSF+D LP    +QW V+ LL
Sbjct: 576  IKQTLAGSLARALPGLVADLRLSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILL 635

Query: 261  FIDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
            F++ALSV R+P L  HM   RVL +KVLDSAQ++ EEYE MKD++IPLGR   FSAQSG
Sbjct: 636  FLEALSVYRLPALTPHMMYRRVLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694



 Score =  120 bits (302), Expect = 3e-24
 Identities = 100/316 (31%), Positives = 151/316 (47%), Gaps = 13/316 (4%)
 Frame = -1

Query: 3027 LKLFEEL-GLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMD 2851
            L++FE+  GL+++ G G++ DLGFS+LKIEEK    VG+VS+E W GPSNAIEGYV   +
Sbjct: 127  LRMFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRE 186

Query: 2850 FSSKPPTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNGTEGRPK 2671
               + P +                 +N+MDF STII  D++ + K     ++ G + + +
Sbjct: 187  ---RKPKELGSKSPKRGSKANNTVLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDSKVR 243

Query: 2670 ESKQ--------NQFIINDVTSATMQNVSETTL----MESKLEEGSAASKTQTGELEMFY 2527
            E ++        N+F + + + A   NVS   L    + S L  GS  S  +        
Sbjct: 244  EQEEILAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRAGSCLSSAR-------- 295

Query: 2526 GPTQNVSDRGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMS 2347
                         +++E   +K     E ++KSSLKPS   ++ SR VTW+DE K D+  
Sbjct: 296  -------------AEEESHDDKAEKCTEASIKSSLKPSR-KKKLSRTVTWADE-KTDSSG 340

Query: 2346 NDNLCNVHNTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFGESDVTN 2167
               LC +     I D  + P  S VE  +   F +S  +         KA   G+S +  
Sbjct: 341  GRKLCEIR---EIEDMKEDP--SVVENKNGVSFTSSGKM---------KA---GQSVIWA 383

Query: 2166 DDASKGDSQKNEAVNE 2119
            D+  KGDS K+  V E
Sbjct: 384  DE--KGDSSKSIDVCE 397


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  517 bits (1332), Expect = e-143
 Identities = 310/674 (45%), Positives = 415/674 (61%), Gaps = 29/674 (4%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            K QS+ +KD V+KLQL+L+EGI +E QLF A SLMSRSDY+DVVT+RSIA+ CGYPLC++
Sbjct: 3    KNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLCHS 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  +  R+  YR SL++ K  +  E Y +CS  C   SRAF+G LQ +RCS+ N +K+ 
Sbjct: 63   NLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDKLK 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDLNELKMEEKVD--VGNVLLEDCI-PLNSIEGYIPEIDC 1492
            E+L+LFE + L+ KE++    D + L+++EK++  +G V +E+ + P N+IEGY+P  D 
Sbjct: 123  EILKLFENMSLDSKENMGNNCD-SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRDH 181

Query: 1491 VLKTSPSGHGKGGTESNKATLQK-GKGK-LLNEMDLTGTIIAGDQFTGPKMSCVSKENGA 1318
             + T  S  GK   + +KA ++  G GK   ++  +T TII  ++++  K+S   KE   
Sbjct: 182  KVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEMAL 241

Query: 1317 ETKR-------AGLKYN-QFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMF 1162
            +T          G + N QF+  ET  AP                 +A    +  K    
Sbjct: 242  DTNSKNQTGEFCGKESNDQFAILETPHAPA--------PPKNSVGRKARGSKERTKVSAT 293

Query: 1161 HGPTQNVSDIAVKESKQGASVEKTTQS--------NENTLKSSLKLTGAKRLSRSVTWAD 1006
               T N+SD       +  +    T+         +   LKSSLK  G K L RSVTWAD
Sbjct: 294  KESTDNLSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWAD 353

Query: 1005 EKLNNTESGYLCTFQETGVIQDTVENSR--------NSDVEDVDASLHLASAEACMLALS 850
            EK   T+   +    E G +  T E SR        ++D ED+   L + SAEAC +ALS
Sbjct: 354  EK---TDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDI---LRVESAEACAMALS 407

Query: 849  QAAEAVSSGEFDVTDAVSNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSN 670
            QAAEA++SG+ +V+DAVS AGIIILP   D +   S +  N  EP     K  +  VL  
Sbjct: 408  QAAEAITSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVL-R 466

Query: 669  SKLFDREDSWYDAPPEGFSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNG 490
            S LFD  DSWYDAPPEGFSLTLSSFATMW A+F W+TSSSLAYIYG+D   HEEF+ ++G
Sbjct: 467  SDLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDG 526

Query: 489  KEYPRKIFSSDGRSSEIKQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMD 310
            KEYP KI S+DGRSSEIKQ+LAG L R +P L +EL L  P+S LE+GM  LLDTM+F+D
Sbjct: 527  KEYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLD 586

Query: 309  PLPSLSTEQWHVLALLFIDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDI 130
             LP+   +QW V+ LLFI+ALSV RIP LA HM+S R L +KVLD AQ+  +EYE M+D 
Sbjct: 587  ALPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDH 646

Query: 129  IIPLGRRLQFSAQS 88
            I+PLGR  Q S ++
Sbjct: 647  ILPLGRTAQLSDEN 660



 Score =  105 bits (263), Expect = 1e-19
 Identities = 106/358 (29%), Positives = 151/358 (42%), Gaps = 30/358 (8%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMD- 2851
            LKLFE + LD KE +G   D G   L+I+EK    +GEV +E WMGPSNAIEGYVP+ D 
Sbjct: 125  LKLFENMSLDSKENMGNNCDSG---LEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRDH 181

Query: 2850 -----FSSKPPTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNGT 2686
                  S                        ++   TSTII  +++ + K S G ++   
Sbjct: 182  KVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEMAL 241

Query: 2685 EGRPK--------ESKQNQFIINDV--TSATMQNVSETTLMESKLEEGSAASKTQTGELE 2536
            +   K        +   +QF I +     A  +N        SK     +A+K  T  L 
Sbjct: 242  DTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTDNLS 301

Query: 2535 MFYGPTQNVSDRGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKID 2356
                 ++N S        +E  G    LS  T LKSSLK     +   R VTW+DE K D
Sbjct: 302  DAPSTSKNRS-TNFNLMTEEPRGGFNDLSG-TELKSSLK-KPGKKNLCRSVTWADE-KTD 357

Query: 2355 NMSNDNLCNVHNTGNISDSVKSPIN--SSVEGVDHCLFLASEAVCELAPSQVAKAVTFGE 2182
            + S  NL  V   G   +  ++  N  +     +  L + S   C +A SQ A+A+T G+
Sbjct: 358  DASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEAITSGQ 417

Query: 2181 SDVTNDDASKG-------DSQKNEAVNEPLVV-----LLEQPRKPDALLSGSFDVKDS 2044
            S+V++  +  G            EA  +P+         E+  K   L S  FD  DS
Sbjct: 418  SEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDS 475


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  516 bits (1328), Expect = e-143
 Identities = 305/697 (43%), Positives = 413/697 (59%), Gaps = 51/697 (7%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            K+QS  +KD ++KLQLSL +GI +E QL  A S+MS SDY+DVVT+R+IAN CGYPLC N
Sbjct: 3    KDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLCGN 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  +RP+K  YR SL++ K  +  E YM+CS  C   SR F+G LQ +RC + N  K+N
Sbjct: 63   SLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAKLN 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDL--NELKMEEKVDV--GNVLLEDCI-PLNSIEGYIPEI 1498
            EVL LF+   L  +  L K  DL  + LK+EEK +   G V  E  I P N+IEGY+P+ 
Sbjct: 123  EVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVPQR 182

Query: 1497 DCV----------------------LKTSPSG-------------HGKGGTESNKATLQK 1423
            D +                      +  +PSG               KG  + +K +  K
Sbjct: 183  DRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSKAK 242

Query: 1422 G------KGKLLNEMDLTGTII-AGDQFTGPKMSCVSKENGAETK--RAGLKYNQFSNHE 1270
            G      +   +N+M+ T TII   D+++  K         ++TK  +   K +Q S+  
Sbjct: 243  GTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSSEN 302

Query: 1269 TILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFH--GPTQNVSDIAVKESKQGASVE 1096
               A  +              S+ A   +   Q++       Q  S     E+K+ +  E
Sbjct: 303  QSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSVSE 362

Query: 1095 KTTQSNENTLKSSLKLTGAKRLSRSVTWADEKLNNTESGYLCTFQETGVIQDTVENSRNS 916
            K  +  E++LK SLK +GAK+L+RSVTWADEK+ ++ S  LC  +     +   E   N 
Sbjct: 363  KAAKPVESSLKPSLKTSGAKQLTRSVTWADEKVGSSGSRDLCEVRGMEDTKAGPEIVDNI 422

Query: 915  DVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAVSNAGIIILPQLHDYDGGESQE 736
            D  D        SAEAC  ALSQAAEAV+SG+ D ++A+S AG++ILPQ HD D G+  E
Sbjct: 423  DKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQGDPME 482

Query: 735  DDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSLTLSSFATMWTALFGWITS 556
            D +VL+ E   +KWP KP +  S+ FD E+SWYDAPPEGFSL LSSFAT+W ALF W+TS
Sbjct: 483  DVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFAWVTS 542

Query: 555  SSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQSLAGVLARMVPELVTELRL 376
            SSLAY+YG+D SSHEE+++VNG+EYPRKI   DGRS EI+Q++ G L R  P +V +LRL
Sbjct: 543  SSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVADLRL 602

Query: 375  RAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLFIDALSVCRIPGLALHMTSMRV 196
              P+S LE G   LL TMSF+D +P+   +QW V+ALLFI+ALSVCRIP L  +M + R+
Sbjct: 603  PIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPALISYMDNRRM 662

Query: 195  LLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
                V+D  +++ EEYE MKD++IPLGR  QFS QSG
Sbjct: 663  ----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695



 Score =  112 bits (279), Expect = 1e-21
 Identities = 107/409 (26%), Positives = 162/409 (39%), Gaps = 81/409 (19%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L LF+   L  +  LG+  DLGFS LKIEEK     GEVS E W+GPSNAIEGYVP  D 
Sbjct: 125  LMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVPQRD- 183

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKF--------------- 2713
                                    +++MDFTS+II  D++ I K                
Sbjct: 184  -----------------RLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQK 226

Query: 2712 ---------SYGSEQNGTEGRPKESK--------------QNQFIINDVTSATMQNVSET 2602
                     S GS+  GT+   K+                Q+++ I+   S      S+T
Sbjct: 227  PKAKGSHKGSKGSKAKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKT 286

Query: 2601 TLMESKLEEGSAASKTQTG------------------------------ELEMFYGPTQN 2512
             + + K +    +S+ Q+                               +L   +   Q 
Sbjct: 287  KIQKQKEKVSQKSSENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQT 346

Query: 2511 VSDRGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMSNDNLC 2332
             S      +K++   EK A   E++LK SLK +S A++ +R VTW+DE K+ +  + +LC
Sbjct: 347  SSITITAEAKEKSVSEKAAKPVESSLKPSLK-TSGAKQLTRSVTWADE-KVGSSGSRDLC 404

Query: 2331 NVHNTGNISDSVKSPINSSVEGVDHCLFLASEAVCELAPSQVAKAVTFGESDVTN----- 2167
             V    +     +   N       +     S   C  A SQ A+AV  G++D +N     
Sbjct: 405  EVRGMEDTKAGPEIVDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEA 464

Query: 2166 --------DDASKGDSQKNEAVNEPLVVLLEQPRKPDALLSGSFDVKDS 2044
                     D  +GD  ++  V +     ++ P KP    S  FD ++S
Sbjct: 465  GLVILPQPHDLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENS 513


>ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
            gi|462404075|gb|EMJ09632.1| hypothetical protein
            PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  506 bits (1304), Expect = e-140
 Identities = 314/715 (43%), Positives = 421/715 (58%), Gaps = 69/715 (9%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            ++  I++KD V+KLQL+L EGI  +  L+ A S++SRSDY+DVVT+R+IAN CGYPLC+N
Sbjct: 9    QQPRISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSN 68

Query: 1842 PLLKE--RPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEK 1669
             L  +  RP K HYR SL++ K  +  E YM+CS  C   S+AFA  L  +RC + +  K
Sbjct: 69   ALPSDSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGK 128

Query: 1668 INEVLRLFEELGLNEKE-DLQKKDDL--NELKMEEKVDVG-------NVLLED------- 1540
            +  +LR F ++G ++ E    +  DL  ++LK+EEKV+ G        + +E+       
Sbjct: 129  VERILRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIG 188

Query: 1539 ----CIPLNSIEGYIPEIDCVLKTSPSGHGKGGTESNKATLQKGKGKLLNEMDLTGTIIA 1372
                  P N+IEGY+P+ + + K   S   K G++   A +  G   + NEMD   TII 
Sbjct: 189  DLGAVGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTIIT 248

Query: 1371 GDQFTGPKMSCVSKENGAETK------RAGLKYNQFSNHETILAPGRXXXXXXXXXXXXX 1210
             D+++  K+     E   ETK      + GL  N           G+             
Sbjct: 249  SDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKNDSVKKSRQSKGGK----NKNVKKDDV 304

Query: 1209 XSRAAEDTQFGKQEMFHGPTQNVSDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRL 1030
              R    T    Q + +G T        KE K+   VEK  QS E  L+SSLK +G K+L
Sbjct: 305  CIREVPSTSDASQTVLNGST--------KEEKEEFIVEKAEQSGEALLRSSLKPSGTKKL 356

Query: 1029 SRSVTWADEKLNNTESGYLCTFQETGVIQD-----------TVEN--------------- 928
            +RSVTWADE +++T S  L   +E   I +           +VEN               
Sbjct: 357  NRSVTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDS 416

Query: 927  --SRN----SDVEDVD--ASLH------LASAEACMLALSQAAEAVSSGEFDVTDAVSNA 790
              S+N     +V+D D   SL       L SAEAC +AL+QAAEAV+SGE DV+ AVS A
Sbjct: 417  TKSKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGA 476

Query: 789  GIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSL 610
            GIIILP+    D  E  ED ++LE E  PL WP KP +  S LFD EDSW+DAPPEGFS+
Sbjct: 477  GIIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSV 535

Query: 609  TLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQS 430
            TLS FATMW +LF WITSS+LAYIYGRD S HEEF+ VNG+EYP KI  + GRSSEIK++
Sbjct: 536  TLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKT 595

Query: 429  LAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLFIDA 250
            L    AR +P +V+ELRL  P+S+LE GM  +L+TMSF+D +P+   +QW V+ LLF++ 
Sbjct: 596  LDESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEG 655

Query: 249  LSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
            LSVCRIP L  HMT+ R+L  KVL++ Q++ E+YE MKD+IIPLGR  QFSAQSG
Sbjct: 656  LSVCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710



 Score =  106 bits (264), Expect = 7e-20
 Identities = 115/402 (28%), Positives = 156/402 (38%), Gaps = 74/402 (18%)
 Frame = -1

Query: 3027 LKLFEELGLDKKE-GLGREEDLGFSELKIEEKANVEVGEVSMENW--------------- 2896
            L+ F ++G DK E G G   DLG S+LKIEEK    +G++ +                  
Sbjct: 133  LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192

Query: 2895 MGPSNAIEGYVPNMDFSSKP----PTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQF 2728
            +GPSNAIEGYVP  +  SKP                         NEMDF STII  D++
Sbjct: 193  VGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTIITSDEY 252

Query: 2727 CIPKFSYGSEQNGTEGRPKESKQNQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQT 2548
             + K      +   E + K+SK              +N S     +SK  +     K   
Sbjct: 253  SVSKIPPSVGEPDFETKFKKSKGK--------VGLNKNDSVKKSRQSKGGKNKNVKKDDV 304

Query: 2547 --GELEMFYGPTQNVSDRGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWS 2374
               E+      +Q V +   +  K+E   EK   S E  L+SSLKPS   ++ +R VTW+
Sbjct: 305  CIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSG-TKKLNRSVTWA 363

Query: 2373 DERKIDNMSNDNLCNVHNTGNI---SDSVKSPINSSVEGVDHCL---------------- 2251
            DE  ID+  + NL  V     I   SD+  S    SVE    C                 
Sbjct: 364  DE-MIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTKSKNI 422

Query: 2250 ---------------------FLASEAVCELAPSQVAKAVTFGESDVTNDDASKG----- 2149
                                  L S   C +A +Q A+AV  GESDV+   +  G     
Sbjct: 423  CEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGAGIIILP 482

Query: 2148 --DSQKNEAVNEPLVVLLEQ-----PRKPDALLSGSFDVKDS 2044
              D    E   E + +L  +     PRKP    S  FD +DS
Sbjct: 483  RPDGLDEEEPTEDVDMLESEQAPLWPRKPGIPCSDLFDPEDS 524


>ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
            gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform
            3, partial [Theobroma cacao]
          Length = 703

 Score =  506 bits (1303), Expect = e-140
 Identities = 301/693 (43%), Positives = 409/693 (59%), Gaps = 50/693 (7%)
 Frame = -1

Query: 2088 KPDALLSGSFDVKDSRMSPASVKEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRS 1909
            KP +L   S +    + S +  KEQSI++ +AVHK+QL L +GI  EKQL  + SL+SRS
Sbjct: 35   KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94

Query: 1908 DYDDVVTKRSIANKCGYPLCNNPLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATY 1729
            DY+DVVT+R+I+N CGYPLC NPL  E  RK  YR SL++ K  + +E YMFCS  C   
Sbjct: 95   DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154

Query: 1728 SRAFAGGLQIKRCSLYNSEKINEVLRLFEELGLNEKEDLQKKDDLN----ELKMEEKVDV 1561
            SRAFAG LQ +RCS+ N  K+N++L LF +L L++  DL K  DL      +K  E+V  
Sbjct: 155  SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKA 213

Query: 1560 GNVLLEDCIPLNSIEGYIPEIDCVLKTSPSGHGKGGT-ESNKATLQKGKGK--------- 1411
             +V L    P N+IEGY+P+ + + K +P  + K    +S+ + L   K +         
Sbjct: 214  EDVSLAG--PSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271

Query: 1410 ----------------------------------LLNEMDLTGTIIAGDQFTGPKMSCVS 1333
                                              ++NEMD T  II  D++T  KM   S
Sbjct: 272  AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331

Query: 1332 KENGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAE-DTQFGKQEMFHG 1156
            K++  +           SN + +   G               S   E D+   +      
Sbjct: 332  KQSCFD-----------SNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 1155 PTQNVSDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEK-LNNTESG 979
              Q+  D +  E+++    +K   S+E  LKSSLK  GAK+L+R VTWAD+K  +N  +G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 978  YLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAV 799
             LC  +E   ++   E S +++    D  L   SAEAC +ALS+AAEAV+SG+ DVTDAV
Sbjct: 441  NLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAV 500

Query: 798  SNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEG 619
                        + D  E  ED ++LEPE  P+KWP+KP + +S +F+ EDSW+DAPPEG
Sbjct: 501  C-----------EVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEG 549

Query: 618  FSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEI 439
            FSLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++ +NG+EYPRKI   DGRSSEI
Sbjct: 550  FSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEI 609

Query: 438  KQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLF 259
            K++LA  ++R +P +VT+LRL  P+S LE GM  L+DT+SFM+ LP+   +QW V+ LLF
Sbjct: 610  KETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLF 669

Query: 258  IDALSVCRIPGLALHMTSMRVLLNKVLDSAQLT 160
            IDALSVCRIP L  HMT+ R+LL+KVLD AQ++
Sbjct: 670  IDALSVCRIPALTPHMTNGRMLLHKVLDGAQIS 702



 Score =  144 bits (364), Expect = 2e-31
 Identities = 119/372 (31%), Positives = 181/372 (48%), Gaps = 44/372 (11%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L LF +L LD  + LG+  DLGFS L+I+E   V+  +VS+    GPSNAIEGYVP  + 
Sbjct: 179  LSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQREL 234

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXV-------NEMDFTSTIIVGDQFCIPKFSYGSEQNG 2689
             SKP                            NE+DF  TII+ D++ I K   GS + G
Sbjct: 235  ISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISK-KPGSFKQG 293

Query: 2688 TEGRPKESKQNQFIINDV------------TSATMQNVSETTLMESKLEE---------- 2575
               +   SK+  F+IN++            T + M + S+ +  +S L+E          
Sbjct: 294  DRTK-LSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDS 352

Query: 2574 -------GSAASKTQTGELEMFYGPTQNVSDRGVEGS----KQEVSGEKLALSNETTLKS 2428
                   GS+++  +     +    T+NV   G++ S    ++E   +K   S+ET LKS
Sbjct: 353  EDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKS 412

Query: 2427 SLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVHNTGNISDSVKSPINSSVE--GVDHC 2254
            SLK S+ A++ +R VTW+D++K DN  N NLC V     +     S I+ S E  G D+ 
Sbjct: 413  SLK-SAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGD--SEISGSAEDGGDDNM 469

Query: 2253 LFLASEAVCELAPSQVAKAVTFGESDVTND--DASKGDSQKNEAVNEPLVVLLEQPRKPD 2080
            L   S   C +A S+ A+AV  G+SDVT+   +  K +  ++  + EP    ++ P+KP 
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVCEVDKEEPMEDGDMLEPETAPVKWPKKPG 529

Query: 2079 ALLSGSFDVKDS 2044
               S  F+ +DS
Sbjct: 530  IPHSDMFNPEDS 541


>ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 632

 Score =  491 bits (1265), Expect = e-136
 Identities = 292/660 (44%), Positives = 405/660 (61%), Gaps = 15/660 (2%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            K QS+ +KD V+KLQL+L+EGI +E QLF A SLMSRSDY+DVVT+RSIA+ CGYPLC++
Sbjct: 3    KNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLCHS 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
             L  +  R+  YR SL++ K  +  E Y +CS  C   SRAF+G LQ +RCS+ N +K+ 
Sbjct: 63   NLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDKLK 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDLNELKMEEKVD--VGNVLLEDCI-PLNSIEGYIPEIDC 1492
            E+L+LFE + L+ KE++    D + L+++EK++  +G V +E+ + P N+IEGY+P  D 
Sbjct: 123  EILKLFENMSLDSKENMGNNCD-SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRDH 181

Query: 1491 VLKTSPSGHGKGGTESNKATLQK-GKGK-LLNEMDLTGTIIAGDQFTGPKMSCVSKENGA 1318
             + T  S  GK   + +KA ++  G GK   ++   T TII  ++++  K+S   KE   
Sbjct: 182  KVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMAL 241

Query: 1317 ETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPTQNVS 1138
            +T          S ++T    G+                   + QF   E  H P    +
Sbjct: 242  DTN---------SKNQTGEFCGK-----------------KSNDQFAILETPHAPAPPKN 275

Query: 1137 DIAVKE--SKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEKLNNTESGYLCTF 964
             +  K   SK+   V  T +S +N L  +   +  +  + ++   + +   T+   +   
Sbjct: 276  SVGRKARGSKERTKVSATKESTDN-LSDAPSTSNNRSTNFNLMTEEPRDEKTDDASIMNL 334

Query: 963  QETGVIQDTVENSR--------NSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVT 808
             E G +  T E SR        ++D ED+   L + SAEAC +ALSQAA+A++SG+ +V+
Sbjct: 335  PEVGEMGKTKECSRTTSNLVNFDNDNEDL---LRVESAEACAMALSQAAKAITSGQSEVS 391

Query: 807  DAVSNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAP 628
            DAVS AGIIILP   D +   S +  N  EP     K  +  VL  S LFD  DSWYDAP
Sbjct: 392  DAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVL-RSDLFDPSDSWYDAP 450

Query: 627  PEGFSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRS 448
            PEGFSLTLSSFATMW A+F W+TSSSLAYIYG+D   HEEF+ ++GKEYP KI S+DGRS
Sbjct: 451  PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRS 510

Query: 447  SEIKQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLA 268
            SEIKQ+LAG L R +P L +EL L  P+S LE+GM  LLDTM+F+D LP+   +QW V+ 
Sbjct: 511  SEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQWQVIV 570

Query: 267  LLFIDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQS 88
            LLFI+ALSV RIP LA HM+S R L +KVLD AQ+  +EYE M+D I+PLGR  Q S ++
Sbjct: 571  LLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQLSDEN 630



 Score = 97.4 bits (241), Expect = 3e-17
 Identities = 92/356 (25%), Positives = 144/356 (40%), Gaps = 28/356 (7%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMD- 2851
            LKLFE + LD KE +G   D G   L+I+EK    +GEV +E WMGPSNAIEGYVP+ D 
Sbjct: 125  LKLFENMSLDSKENMGNNCDSG---LEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRDH 181

Query: 2850 -----FSSKPPTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNGT 2686
                  S                        ++  FTSTII  +++ + K S G ++   
Sbjct: 182  KVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMAL 241

Query: 2685 EGRPKESKQNQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQTGELEMFY-----GP 2521
            +                               SK + G    K    +  +        P
Sbjct: 242  D-----------------------------TNSKNQTGEFCGKKSNDQFAILETPHAPAP 272

Query: 2520 TQNVSDRGVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMSND 2341
             +N   R   GSK+     K++ + E+T   S  PS+   R +     ++E + +   + 
Sbjct: 273  PKNSVGRKARGSKERT---KVSATKESTDNLSDAPSTSNNRSTNFNLMTEEPRDEKTDDA 329

Query: 2340 NLCNVHNTGNISDSVK-SPINSSVEGVDH----CLFLASEAVCELAPSQVAKAVTFGESD 2176
            ++ N+   G +  + + S   S++   D+     L + S   C +A SQ AKA+T G+S+
Sbjct: 330  SIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSE 389

Query: 2175 VTNDDASKG-------DSQKNEAVNEPLVV-----LLEQPRKPDALLSGSFDVKDS 2044
            V++  +  G            EA  +P+         E+  K   L S  FD  DS
Sbjct: 390  VSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDS 445


>gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus guttatus]
          Length = 597

 Score =  481 bits (1239), Expect = e-133
 Identities = 289/646 (44%), Positives = 394/646 (60%), Gaps = 5/646 (0%)
 Frame = -1

Query: 2004 LKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNNPLLKER 1825
            +KDAVHKLQLSL EGI HE QL  A SL+S+SDY DVVT+R+IA+ CGYPLC N L  E 
Sbjct: 9    VKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLCVNSLPSEP 68

Query: 1824 PRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKINEVLRLF 1645
            PRK HYR SL++ K  +  E +M+CS  C   SRAF   L+ +R S  +  KIN VL++F
Sbjct: 69   PRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAKINSVLKMF 128

Query: 1644 EELGLNEKEDLQKKDDL--NELKMEEKVDVGN--VLLEDCI-PLNSIEGYIPEIDCVLKT 1480
            + L L+    L K  DL  + LK+ EK+  G+  + LE+ + P N+I+GY+P  D   + 
Sbjct: 129  DGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRDQNSER 188

Query: 1479 SPSGHGKGGTESNKATLQKGKGKLLNEMDLTGTIIAGDQFTGPKMSCVSKENGAETKRAG 1300
                  K  TESN A        L  +++ T TII  D+++  K + V +E   + K   
Sbjct: 189  KQPSRKK--TESNHAKPNLAD-TLPFDVNFTSTIIMQDEYSVSK-TAVPREAKGKVKGKM 244

Query: 1299 LKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPTQNVSDIAVKE 1120
            ++ +  +   ++L                      +DT         GP+QN + +    
Sbjct: 245  IRKSVKAEKISVL----------------------DDTA--------GPSQNDTTL---- 270

Query: 1119 SKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEKLNNTESGYLCTFQETGVIQD 940
                             LKSSLK   +K+ +RSVTWADEK ++ +   +   +E G  + 
Sbjct: 271  -----------------LKSSLKTLDSKKETRSVTWADEK-SDGDGKSISECREIGDNKG 312

Query: 939  TVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAVSNAGIIILPQLHD 760
             V     +D +  D S    SAEAC  ALSQA+EAV+SG+ D +DAVS AG+IILP  H+
Sbjct: 313  AVVMPHLTDEDVGDESYRFTSAEACARALSQASEAVASGKTDASDAVSEAGVIILPPPHE 372

Query: 759  YDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSLTLSSFATMWT 580
             D  + ++   V++ + + LKWP KP  S+  LFD EDSWYD+PPEGF+LTLS F+TM+ 
Sbjct: 373  VDEAKYEQIGEVVDVDPIELKWPPKPGFSSEDLFDSEDSWYDSPPEGFNLTLSPFSTMFM 432

Query: 579  ALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQSLAGVLARMVP 400
            +LF WI+SSSLAYIYG++   HE+++ +NG+EYP KI   DGRS+E+K +LAG LAR +P
Sbjct: 433  SLFAWISSSSLAYIYGKEERFHEDYLSINGREYPPKII-IDGRSAEVKHTLAGCLARALP 491

Query: 399  ELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLFIDALSVCRIPGLA 220
             LV+E+R+  P+S +E GM  LLDTMSF D LP    +QW V+ALLF+DALSV RIP L+
Sbjct: 492  GLVSEIRIPTPVSTIEQGMGRLLDTMSFTDALPGFRMKQWQVIALLFLDALSVSRIPALS 551

Query: 219  LHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSGG 82
             +MT  R+LL KVL+ AQ+  EE+E MKD+IIPLGR  QFS QSGG
Sbjct: 552  PYMTGRRILLPKVLEGAQINVEEFEIMKDLIIPLGRVPQFSTQSGG 597



 Score = 86.3 bits (212), Expect = 8e-14
 Identities = 59/165 (35%), Positives = 89/165 (53%), Gaps = 5/165 (3%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            LK+F+ L LD   GL +  DLG S LKI EK     GE+S+E W+GPSNAI+GYVP  D 
Sbjct: 125  LKMFDGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRDQ 184

Query: 2847 SS--KPPTDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNG-TEGR 2677
            +S  K P+                    +++FTSTII+ D++ + K +   E  G  +G+
Sbjct: 185  NSERKQPSRKKTESNHAKPNLADTLPF-DVNFTSTIIMQDEYSVSKTAVPREAKGKVKGK 243

Query: 2676 --PKESKQNQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQT 2548
               K  K  +  + D T+   QN  +TTL++S L+   +  +T++
Sbjct: 244  MIRKSVKAEKISVLDDTAGPSQN--DTTLLKSSLKTLDSKKETRS 286


>ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
            gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform
            5 [Theobroma cacao]
          Length = 708

 Score =  474 bits (1219), Expect = e-130
 Identities = 280/655 (42%), Positives = 384/655 (58%), Gaps = 50/655 (7%)
 Frame = -1

Query: 2088 KPDALLSGSFDVKDSRMSPASVKEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRS 1909
            KP +L   S +    + S +  KEQSI++ +AVHK+QL L +GI  EKQL  + SL+SRS
Sbjct: 35   KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94

Query: 1908 DYDDVVTKRSIANKCGYPLCNNPLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATY 1729
            DY+DVVT+R+I+N CGYPLC NPL  E  RK  YR SL++ K  + +E YMFCS  C   
Sbjct: 95   DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154

Query: 1728 SRAFAGGLQIKRCSLYNSEKINEVLRLFEELGLNEKEDLQKKDDLN----ELKMEEKVDV 1561
            SRAFAG LQ +RCS+ N  K+N++L LF +L L++  DL K  DL      +K  E+V  
Sbjct: 155  SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKA 213

Query: 1560 GNVLLEDCIPLNSIEGYIPEIDCVLKTSPSGHGKGGT-ESNKATLQKGKGK--------- 1411
             +V L    P N+IEGY+P+ + + K +P  + K    +S+ + L   K +         
Sbjct: 214  EDVSLAG--PSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271

Query: 1410 ----------------------------------LLNEMDLTGTIIAGDQFTGPKMSCVS 1333
                                              ++NEMD T  II  D++T  KM   S
Sbjct: 272  AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331

Query: 1332 KENGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAE-DTQFGKQEMFHG 1156
            K++  +           SN + +   G               S   E D+   +      
Sbjct: 332  KQSCFD-----------SNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 1155 PTQNVSDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEK-LNNTESG 979
              Q+  D +  E+++    +K   S+E  LKSSLK  GAK+L+R VTWAD+K  +N  +G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 978  YLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAV 799
             LC  +E   ++   E S +++    D  L   SAEAC +ALS+AAEAV+SG+ DVTDAV
Sbjct: 441  NLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAV 500

Query: 798  SNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEG 619
               G+IILP L + D  E  ED ++LEPE  P+KWP+KP + +S +F+ EDSW+DAPPEG
Sbjct: 501  YENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEG 560

Query: 618  FSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEI 439
            FSLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++ +NG+EYPRKI   DGRSSEI
Sbjct: 561  FSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEI 620

Query: 438  KQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHV 274
            K++LA  ++R +P +VT+LRL  P+S LE GM  L+DT+SFM+ LP+   +QW +
Sbjct: 621  KETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWEI 675



 Score =  140 bits (353), Expect = 4e-30
 Identities = 119/383 (31%), Positives = 181/383 (47%), Gaps = 55/383 (14%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L LF +L LD  + LG+  DLGFS L+I+E   V+  +VS+    GPSNAIEGYVP  + 
Sbjct: 179  LSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQREL 234

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXV-------NEMDFTSTIIVGDQFCIPKFSYGSEQNG 2689
             SKP                            NE+DF  TII+ D++ I K   GS + G
Sbjct: 235  ISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISK-KPGSFKQG 293

Query: 2688 TEGRPKESKQNQFIINDV------------TSATMQNVSETTLMESKLEE---------- 2575
               +   SK+  F+IN++            T + M + S+ +  +S L+E          
Sbjct: 294  DRTK-LSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDS 352

Query: 2574 -------GSAASKTQTGELEMFYGPTQNVSDRGVEGS----KQEVSGEKLALSNETTLKS 2428
                   GS+++  +     +    T+NV   G++ S    ++E   +K   S+ET LKS
Sbjct: 353  EDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKS 412

Query: 2427 SLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVHNTGNISDSVKSPINSSVE--GVDHC 2254
            SLK S+ A++ +R VTW+D++K DN  N NLC V     +     S I+ S E  G D+ 
Sbjct: 413  SLK-SAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGD--SEISGSAEDGGDDNM 469

Query: 2253 LFLASEAVCELAPSQVAKAVTFGESDVTND-------------DASKGDSQKNEAVNEPL 2113
            L   S   C +A S+ A+AV  G+SDVT+              +  K +  ++  + EP 
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPE 529

Query: 2112 VVLLEQPRKPDALLSGSFDVKDS 2044
               ++ P+KP    S  F+ +DS
Sbjct: 530  TAPVKWPKKPGIPHSDMFNPEDS 552


>ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
            gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform
            2 [Theobroma cacao]
          Length = 679

 Score =  473 bits (1216), Expect = e-130
 Identities = 280/653 (42%), Positives = 383/653 (58%), Gaps = 50/653 (7%)
 Frame = -1

Query: 2088 KPDALLSGSFDVKDSRMSPASVKEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRS 1909
            KP +L   S +    + S +  KEQSI++ +AVHK+QL L +GI  EKQL  + SL+SRS
Sbjct: 35   KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94

Query: 1908 DYDDVVTKRSIANKCGYPLCNNPLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATY 1729
            DY+DVVT+R+I+N CGYPLC NPL  E  RK  YR SL++ K  + +E YMFCS  C   
Sbjct: 95   DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154

Query: 1728 SRAFAGGLQIKRCSLYNSEKINEVLRLFEELGLNEKEDLQKKDDLN----ELKMEEKVDV 1561
            SRAFAG LQ +RCS+ N  K+N++L LF +L L++  DL K  DL      +K  E+V  
Sbjct: 155  SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKA 213

Query: 1560 GNVLLEDCIPLNSIEGYIPEIDCVLKTSPSGHGKGGT-ESNKATLQKGKGK--------- 1411
             +V L    P N+IEGY+P+ + + K +P  + K    +S+ + L   K +         
Sbjct: 214  EDVSLAG--PSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271

Query: 1410 ----------------------------------LLNEMDLTGTIIAGDQFTGPKMSCVS 1333
                                              ++NEMD T  II  D++T  KM   S
Sbjct: 272  AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331

Query: 1332 KENGAETKRAGLKYNQFSNHETILAPGRXXXXXXXXXXXXXXSRAAE-DTQFGKQEMFHG 1156
            K++  +           SN + +   G               S   E D+   +      
Sbjct: 332  KQSCFD-----------SNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 1155 PTQNVSDIAVKESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEK-LNNTESG 979
              Q+  D +  E+++    +K   S+E  LKSSLK  GAK+L+R VTWAD+K  +N  +G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 978  YLCTFQETGVIQDTVENSRNSDVEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAV 799
             LC  +E   ++   E S +++    D  L   SAEAC +ALS+AAEAV+SG+ DVTDAV
Sbjct: 441  NLCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAV 500

Query: 798  SNAGIIILPQLHDYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEG 619
               G+IILP L + D  E  ED ++LEPE  P+KWP+KP + +S +F+ EDSW+DAPPEG
Sbjct: 501  YENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEG 560

Query: 618  FSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEI 439
            FSLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++ +NG+EYPRKI   DGRSSEI
Sbjct: 561  FSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEI 620

Query: 438  KQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQW 280
            K++LA  ++R +P +VT+LRL  P+S LE GM  L+DT+SFM+ LP+   +QW
Sbjct: 621  KETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673



 Score =  140 bits (353), Expect = 4e-30
 Identities = 119/383 (31%), Positives = 181/383 (47%), Gaps = 55/383 (14%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L LF +L LD  + LG+  DLGFS L+I+E   V+  +VS+    GPSNAIEGYVP  + 
Sbjct: 179  LSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQREL 234

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXV-------NEMDFTSTIIVGDQFCIPKFSYGSEQNG 2689
             SKP                            NE+DF  TII+ D++ I K   GS + G
Sbjct: 235  ISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISK-KPGSFKQG 293

Query: 2688 TEGRPKESKQNQFIINDV------------TSATMQNVSETTLMESKLEE---------- 2575
               +   SK+  F+IN++            T + M + S+ +  +S L+E          
Sbjct: 294  DRTK-LSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDS 352

Query: 2574 -------GSAASKTQTGELEMFYGPTQNVSDRGVEGS----KQEVSGEKLALSNETTLKS 2428
                   GS+++  +     +    T+NV   G++ S    ++E   +K   S+ET LKS
Sbjct: 353  EDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKS 412

Query: 2427 SLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVHNTGNISDSVKSPINSSVE--GVDHC 2254
            SLK S+ A++ +R VTW+D++K DN  N NLC V     +     S I+ S E  G D+ 
Sbjct: 413  SLK-SAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGD--SEISGSAEDGGDDNM 469

Query: 2253 LFLASEAVCELAPSQVAKAVTFGESDVTND-------------DASKGDSQKNEAVNEPL 2113
            L   S   C +A S+ A+AV  G+SDVT+              +  K +  ++  + EP 
Sbjct: 470  LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPE 529

Query: 2112 VVLLEQPRKPDALLSGSFDVKDS 2044
               ++ P+KP    S  F+ +DS
Sbjct: 530  TAPVKWPKKPGIPHSDMFNPEDS 552


>ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Fragaria vesca subsp. vesca]
          Length = 692

 Score =  472 bits (1215), Expect = e-130
 Identities = 296/718 (41%), Positives = 409/718 (56%), Gaps = 75/718 (10%)
 Frame = -1

Query: 2013 SITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNNPLL 1834
            S ++ DAV+KLQL+L + +    +L+ A S++SRSDY DVVT+RSIA+ CGYPLC+N L 
Sbjct: 10   SKSVNDAVYKLQLALLDSVKTLDRLYLAGSIISRSDYTDVVTERSIADLCGYPLCSNALP 69

Query: 1833 KE--RPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKINE 1660
             E  R RK HYR SL++ K  + RE  ++CS  C   S+AFA GL  +RC + +  K+  
Sbjct: 70   PEASRTRKGHYRISLKEHKVYDLRETKLYCSSKCVIDSKAFAQGLSEERCDVLDLGKVER 129

Query: 1659 VLRLFEELGLNEKEDLQKKDDLNELKMEEKVDVGNVLLEDCIPLNSIEGYIPEIDCVLKT 1480
            VLR F E    EK+++     L+ LK+EEK    +  +E+  P N+IEGY+P  D V K 
Sbjct: 130  VLREFGE----EKKEIGDLG-LSSLKIEEKSGTYSGKVEEFGPSNAIEGYVPRRDRVSKA 184

Query: 1479 SPSGHGKGGTESNKATLQKG-KGKLLNEMDLTGTIIAGDQFTGPKMSCVSKENGAETKRA 1303
            S +   K G++   A    G K  +LN+MD   T++A D+++  KM     +N  +T+  
Sbjct: 185  SGAKKNKQGSKGKDAKPSGGGKQLILNDMDFMSTLLACDEYSVSKMPPNVADNNVDTELK 244

Query: 1302 GLKYNQ----FSNHETILAPGRXXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPTQNVSD 1135
              K       FS  ET   P +                                  +V D
Sbjct: 245  KSKGKDLESGFSVLETSATPNKSEG-----------------------------VMDVGD 275

Query: 1134 IAVK----ESKQGASVEKTTQSNENTLKSSLKLTGAKRLSRSVTWADEKLNNTESGYLCT 967
            + +     E+++ + V K  +S+E TL+SSLK +G K+LSRSVTWADEK ++T    LC 
Sbjct: 276  LGMSRLKIEAEEESQVGKGEKSSEGTLRSSLKHSGTKKLSRSVTWADEKSDSTGRRNLCE 335

Query: 966  FQ--ETGV------------------------IQDTVENSRNSDVEDVDASLH------- 886
             +  E G+                        +  T+++++  ++ +V  +         
Sbjct: 336  VRDMEDGLENPGAFDSLYKPSSSSEAGSSFSWVDKTIDSTKCENICEVSGTHDAKEVPEV 395

Query: 885  -----------LASAEACMLALSQAAEAVSSGEFDVTDAVSNAGIIILPQLH-------- 763
                         SAEAC +ALS+AA AV +GEFD +DAVS AGIIILP+          
Sbjct: 396  VGSSVVQGNEWFESAEACAVALSEAAGAVETGEFDTSDAVSKAGIIILPRTDGVDEEEFI 455

Query: 762  ------------DYDGGESQEDDNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEG 619
                          D  ES ED ++LEPE    KWP+KP  S   LF+ EDSW+DAPP+G
Sbjct: 456  VDGADEEDSIEDSVDEEESTEDIDMLEPEQALSKWPKKPESSQFDLFNPEDSWFDAPPDG 515

Query: 618  FSLTLSSFATMWTALFGWITSSSLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEI 439
            F+LTLS FATMW ALF W TSS+LAYIYG+D S HEEF+ VNG+ YP KI  +DGRSSEI
Sbjct: 516  FNLTLSPFATMWNALFTWTTSSTLAYIYGKDDSFHEEFLNVNGRSYPHKIVLADGRSSEI 575

Query: 438  KQSLAGVLARMVPELVTELRLRAPLSALEHGMECLLDTMSFMDPLPSLSTEQWHVLALLF 259
            K ++   L+R +PE+V EL L  P   LE GM  +L+TMSF++ LP+   +QW V+ALLF
Sbjct: 576  KLTVGASLSRALPEIVAELGLAVP--NLEKGMGFMLNTMSFIEALPAFRMKQWQVIALLF 633

Query: 258  IDALSVCRIPGLALHMTSMRVLLNKVLDSAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
            I+ LSVCR+P L  HMT+ RVL+ +VLD A+++ EEYE MKD +IPLGR  QF++QSG
Sbjct: 634  IEGLSVCRMPALTPHMTNRRVLIQRVLDGARISVEEYEIMKDFLIPLGRAPQFASQSG 691



 Score = 90.9 bits (224), Expect = 3e-15
 Identities = 106/344 (30%), Positives = 148/344 (43%), Gaps = 17/344 (4%)
 Frame = -1

Query: 3024 KLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDFS 2845
            ++  E G +KKE +G   DLG S LKIEEK+    G+V      GPSNAIEGYVP  D  
Sbjct: 129  RVLREFGEEKKE-IG---DLGLSSLKIEEKSGTYSGKVEE---FGPSNAIEGYVPRRDRV 181

Query: 2844 SKPP-----TDXXXXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNGTEG 2680
            SK                          +N+MDF ST++  D++ + K       N  + 
Sbjct: 182  SKASGAKKNKQGSKGKDAKPSGGGKQLILNDMDFMSTLLACDEYSVSKMPPNVADNNVDT 241

Query: 2679 RPKESKQNQFIINDVTSATMQNVSETTLMESKLEEGSAASKTQTGELEMFYGPTQNVSDR 2500
              K+SK       D+ S    +V ET+   +K E          G+L M     +     
Sbjct: 242  ELKKSKG-----KDLESGF--SVLETSATPNKSE-----GVMDVGDLGMSRLKIE----- 284

Query: 2499 GVEGSKQEVSGEKLALSNETTLKSSLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVHN 2320
              E   Q   GEK   S+E TL+SSLK  S  ++ SR VTW+DE K D+    NLC V +
Sbjct: 285  -AEEESQVGKGEK---SSEGTLRSSLK-HSGTKKLSRSVTWADE-KSDSTGRRNLCEVRD 338

Query: 2319 ------TGNISDSVKSPINSSVEG-----VDHCL-FLASEAVCELAPSQVAKAVTFGESD 2176
                       DS+  P +SS  G     VD  +     E +CE++ +  AK V     +
Sbjct: 339  MEDGLENPGAFDSLYKPSSSSEAGSSFSWVDKTIDSTKCENICEVSGTHDAKEV----PE 394

Query: 2175 VTNDDASKGDSQKNEAVNEPLVVLLEQPRKPDALLSGSFDVKDS 2044
            V      +G+     A  E   V L +     A+ +G FD  D+
Sbjct: 395  VVGSSVVQGNEWFESA--EACAVALSE--AAGAVETGEFDTSDA 434


>ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao]
            gi|508787292|gb|EOY34548.1| F2P16.20 protein, putative
            isoform 4 [Theobroma cacao]
          Length = 607

 Score =  446 bits (1146), Expect = e-122
 Identities = 265/610 (43%), Positives = 359/610 (58%), Gaps = 50/610 (8%)
 Frame = -1

Query: 2022 KEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMSRSDYDDVVTKRSIANKCGYPLCNN 1843
            KEQSI++ +AVHK+QL L +GI  EKQL  + SL+SRSDY+DVVT+R+I+N CGYPLC N
Sbjct: 3    KEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLCAN 62

Query: 1842 PLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCATYSRAFAGGLQIKRCSLYNSEKIN 1663
            PL  E  RK  YR SL++ K  + +E YMFCS  C   SRAFAG LQ +RCS+ N  K+N
Sbjct: 63   PLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAKLN 122

Query: 1662 EVLRLFEELGLNEKEDLQKKDDLN----ELKMEEKVDVGNVLLEDCIPLNSIEGYIPEID 1495
            ++L LF +L L++  DL K  DL      +K  E+V   +V L    P N+IEGY+P+ +
Sbjct: 123  DILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSLAG--PSNAIEGYVPQRE 179

Query: 1494 CVLKTSPSGHGKGGT-ESNKATLQKGKGK------------------------------- 1411
             + K +P  + K    +S+ + L   K +                               
Sbjct: 180  LISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSFKQG 239

Query: 1410 ------------LLNEMDLTGTIIAGDQFTGPKMSCVSKENGAETKRAGLKYNQFSNHET 1267
                        ++NEMD T  II  D++T  KM   SK++  +           SN + 
Sbjct: 240  DRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFD-----------SNLKE 288

Query: 1266 ILAPGRXXXXXXXXXXXXXXSRAAE-DTQFGKQEMFHGPTQNVSDIAVKESKQGASVEKT 1090
            +   G               S   E D+   +        Q+  D +  E+++    +K 
Sbjct: 289  VEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKA 348

Query: 1089 TQSNENTLKSSLKLTGAKRLSRSVTWADEK-LNNTESGYLCTFQETGVIQDTVENSRNSD 913
              S+E  LKSSLK  GAK+L+R VTWAD+K  +N  +G LC  +E   ++   E S +++
Sbjct: 349  VTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAE 408

Query: 912  VEDVDASLHLASAEACMLALSQAAEAVSSGEFDVTDAVSNAGIIILPQLHDYDGGESQED 733
                D  L   SAEAC +ALS+AAEAV+SG+ DVTDAV   G+IILP L + D  E  ED
Sbjct: 409  DGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMED 468

Query: 732  DNVLEPEMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSLTLSSFATMWTALFGWITSS 553
             ++LEPE  P+KWP+KP + +S +F+ EDSW+DAPPEGFSLTLS+FATMW ALF WITSS
Sbjct: 469  GDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSS 528

Query: 552  SLAYIYGRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQSLAGVLARMVPELVTELRLR 373
            SLAYIYGRD S HEE++ +NG+EYPRKI   DGRSSEIK++LA  ++R +P +VT+LRL 
Sbjct: 529  SLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLP 588

Query: 372  APLSALEHGM 343
             P+S LE GM
Sbjct: 589  IPISTLEQGM 598



 Score =  140 bits (353), Expect = 4e-30
 Identities = 119/383 (31%), Positives = 181/383 (47%), Gaps = 55/383 (14%)
 Frame = -1

Query: 3027 LKLFEELGLDKKEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVPNMDF 2848
            L LF +L LD  + LG+  DLGFS L+I+E   V+  +VS+    GPSNAIEGYVP  + 
Sbjct: 125  LSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQREL 180

Query: 2847 SSKPPTDXXXXXXXXXXXXXXXXXV-------NEMDFTSTIIVGDQFCIPKFSYGSEQNG 2689
             SKP                            NE+DF  TII+ D++ I K   GS + G
Sbjct: 181  ISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISK-KPGSFKQG 239

Query: 2688 TEGRPKESKQNQFIINDV------------TSATMQNVSETTLMESKLEE---------- 2575
               +   SK+  F+IN++            T + M + S+ +  +S L+E          
Sbjct: 240  DRTK-LSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDS 298

Query: 2574 -------GSAASKTQTGELEMFYGPTQNVSDRGVEGS----KQEVSGEKLALSNETTLKS 2428
                   GS+++  +     +    T+NV   G++ S    ++E   +K   S+ET LKS
Sbjct: 299  EDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKS 358

Query: 2427 SLKPSSDARRFSRDVTWSDERKIDNMSNDNLCNVHNTGNISDSVKSPINSSVE--GVDHC 2254
            SLK S+ A++ +R VTW+D++K DN  N NLC V     +     S I+ S E  G D+ 
Sbjct: 359  SLK-SAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGD--SEISGSAEDGGDDNM 415

Query: 2253 LFLASEAVCELAPSQVAKAVTFGESDVTND-------------DASKGDSQKNEAVNEPL 2113
            L   S   C +A S+ A+AV  G+SDVT+              +  K +  ++  + EP 
Sbjct: 416  LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPE 475

Query: 2112 VVLLEQPRKPDALLSGSFDVKDS 2044
               ++ P+KP    S  F+ +DS
Sbjct: 476  TAPVKWPKKPGIPHSDMFNPEDS 498


>ref|XP_006290171.1| hypothetical protein CARUB_v10003849mg [Capsella rubella]
            gi|482558877|gb|EOA23069.1| hypothetical protein
            CARUB_v10003849mg [Capsella rubella]
          Length = 743

 Score =  434 bits (1117), Expect = e-119
 Identities = 289/750 (38%), Positives = 398/750 (53%), Gaps = 80/750 (10%)
 Frame = -1

Query: 2094 PRKPDALLSGSFDVKDSRMSPASVKEQSITLKDAVHKLQLSLFEGIHHEKQLFTAQSLMS 1915
            P+    LLSGS  +K S         Q I + DAVHKLQL++ EGI  + QLF A  L+S
Sbjct: 6    PQSSPNLLSGSKPIKSSLHHTLRPNNQVIAINDAVHKLQLAMLEGITDQNQLFAAGKLIS 65

Query: 1914 RSDYDDVVTKRSIANKCGYPLCNNPLLKERPRKDHYRSSLRQQKAKNPREIYMFCSPGCA 1735
            R DY+DVVT+R+IA  CGYPLC   L  +  R+  YR SL++ K  + +E   FCS GC 
Sbjct: 66   RLDYEDVVTERTIAKLCGYPLCQRFLPSDVSRRGKYRISLKEHKVYDLQETRKFCSAGCL 125

Query: 1734 TYSRAFAGGLQIKRCSLYNSEKINEVLRLFEELGLNEKEDLQKKDDLNELKMEEKVDV-- 1561
              S+ F G LQ  R S ++S K+NE+L LF +  +    D+ K  DL++L + E  +V  
Sbjct: 126  IDSKTFLGTLQEARTSEFDSVKLNEILELFGDSEVKGSLDVNKDLDLSKLIIRENFEVRG 185

Query: 1560 ----------------GNVLLE--DCIPLNSIEG-------------------------- 1513
                            G V L+  DC   N  +G                          
Sbjct: 186  GESSLEQWMGPSNAVEGYVPLDQSDCKSRNCKDGDFKATQSNQEKHKDPPFSEMDFTSTV 245

Query: 1512 YIPEIDCVLK-------TSPSG---HGKGGTESNKATL---QKGKGKLLNEMDLTGTIIA 1372
             +P+   V K        SP G    GKG T   + T+    K K +   E +       
Sbjct: 246  IMPDEYSVSKLPLQTKQASPVGVSDDGKGKTVLREQTVVPATKKKSRFRREKEKEKKTFG 305

Query: 1371 GD-----QFTGPKMSCVSKENGAE--------------TKRAGLKYNQFSNHETILAPGR 1249
             D      F   +M CVS   G +              +    L YN     +T+     
Sbjct: 306  TDGIDLASFGFDEMGCVSSGTGKDGYSVEYSVSKQPQCSMEDSLSYNLKGGLQTLDGKNN 365

Query: 1248 XXXXXXXXXXXXXXSRAAEDTQFGKQEMFHGPT-QNVSDIAVKESKQGASVEKTTQSNEN 1072
                          +RA +  +      +H  + ++  +I   ES +   V+    S+E 
Sbjct: 366  LSGSSSGSNAKGSRTRAEKSGKKKISVEYHANSYEDGEEILAAESYERHKVQDVCSSSET 425

Query: 1071 TLKSSLKLTGAKRLSRSVTWADEKLNNTESGYLCTFQETGVIQDTVENSRNS-DVEDVDA 895
              KS LK++G+K+LSRSVTWAD+   N   G LC  +       TV  S +S D +D ++
Sbjct: 426  VTKSCLKISGSKKLSRSVTWADQ---NDGCGDLCEVRNNDF---TVGPSLSSNDTKDGNS 479

Query: 894  SLHLASAEACMLALSQAAEAVSSGEFDVTDAVSNAGIIILPQLHDYDGGESQEDDNVLEP 715
               LA AEAC  ALSQAAEAVS G+ D +DA + AGI++LP  H  D   ++E    +E 
Sbjct: 480  LSRLALAEACASALSQAAEAVSLGDTDASDATAKAGIVLLPSTHQLDEEVTEEH---IEE 536

Query: 714  EMVPLKWPEKPVLSNSKLFDREDSWYDAPPEGFSLTLSSFATMWTALFGWITSSSLAYIY 535
            E   LKWP KP + +S LFDR+ SW+D  PEGF+LTLSSFA MW +LFGW++SSSLAYIY
Sbjct: 537  EPTLLKWPTKPGIPDSDLFDRDQSWFDGAPEGFNLTLSSFAVMWDSLFGWVSSSSLAYIY 596

Query: 534  GRDASSHEEFILVNGKEYPRKIFSSDGRSSEIKQSLAGVLARMVPELVTELRLRAPLSAL 355
            G++ S+HEEF+ VNGKEYPRKI   DG SSEIK+++AG LAR +P + T LRL   +S L
Sbjct: 597  GKEESAHEEFLSVNGKEYPRKIILGDGLSSEIKETMAGCLARALPRVATYLRLPIAISEL 656

Query: 354  EHGMECLLDTMSFMDPLPSLSTEQWHVLALLFIDALSVCRIPGLALHMTSMRVLLNKVLD 175
            E G+  LL+TMS    +PSL  ++W V+ LLF+DALSV RIP +A ++++    +NK+L+
Sbjct: 657  EKGLGSLLETMSLTGAVPSLKMKEWLVIVLLFLDALSVSRIPLIAPYLSN----INKILE 712

Query: 174  SAQLTWEEYESMKDIIIPLGRRLQFSAQSG 85
             + +  +EYE MKDI +PLGR  QF+ +SG
Sbjct: 713  GSGIGNDEYEMMKDIFLPLGRVPQFATRSG 742



 Score = 68.6 bits (166), Expect = 2e-08
 Identities = 41/113 (36%), Positives = 55/113 (48%), Gaps = 2/113 (1%)
 Frame = -1

Query: 2994 KEGLGREEDLGFSELKIEEKANVEVGEVSMENWMGPSNAIEGYVP--NMDFSSKPPTDXX 2821
            K  L   +DL  S+L I E   V  GE S+E WMGPSNA+EGYVP    D  S+   D  
Sbjct: 161  KGSLDVNKDLDLSKLIIRENFEVRGGESSLEQWMGPSNAVEGYVPLDQSDCKSRNCKDGD 220

Query: 2820 XXXXXXXXXXXXXXXVNEMDFTSTIIVGDQFCIPKFSYGSEQNGTEGRPKESK 2662
                            +EMDFTST+I+ D++ + K    ++Q    G   + K
Sbjct: 221  FKATQSNQEKHKDPPFSEMDFTSTVIMPDEYSVSKLPLQTKQASPVGVSDDGK 273

