BLASTX nr result

ID: Sinomenium21_contig00008790 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00008790
         (1652 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI16022.3| unnamed protein product [Vitis vinifera]              340   1e-90
ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra...   302   3e-79
ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr...   302   3e-79
ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prun...   300   1e-78
emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]   279   2e-72
ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [A...   275   4e-71
ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314...   273   2e-70
ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227...   270   1e-69
ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214...   270   1e-69
ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205...   270   1e-69
ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus c...   270   1e-69
ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma...   261   9e-67
ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma...   261   9e-67
ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu...   258   7e-66
ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma...   253   1e-64
ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu...   239   3e-60
ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phas...   219   2e-54
gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis]     214   7e-53
ref|XP_004506322.1| PREDICTED: mediator of RNA polymerase II tra...   210   1e-51
ref|XP_004246977.1| PREDICTED: uncharacterized protein LOC101249...   207   1e-50

>emb|CBI16022.3| unnamed protein product [Vitis vinifera]
          Length = 1669

 Score =  340 bits (871), Expect = 1e-90
 Identities = 218/508 (42%), Positives = 269/508 (52%), Gaps = 61/508 (12%)
 Frame = -1

Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473
            DGRQ DSH+PGS+E   FGQPS +  N M++NGG G        + S+P GLQ+ER+K +
Sbjct: 1203 DGRQSDSHIPGSSERGPFGQPSGVQSNMMRMNGGLGI-------ESSLPVGLQDERFKSL 1255

Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293
            P                         EPGR      +F EDLKQF RS+ LDS+ VPKF 
Sbjct: 1256 P-------------------------EPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFG 1290

Query: 1292 SYFS--RP-----------------DRASHGFNHDVGLKLDGNDNAPRLLPPYQPGS--- 1179
            +YFS  RP                 D+A  GFN+D G K        R  PP  PG    
Sbjct: 1291 NYFSSSRPLDRGSQGFVMDAAQGLLDKAPLGFNYDSGFKSSAGTGTSRFFPPPHPGGDGE 1350

Query: 1178 -LRPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSSRF-- 1014
              R +   +DN+ R  D+A    P+FL S    GR+ +DG   RSP RE+   P   F  
Sbjct: 1351 RSRAVGFHEDNVGRS-DMAR-THPNFLGSVPEYGRHHMDGLNPRSPTREFSGIPHRGFGG 1408

Query: 1013 --------RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS 858
                      L+D DGRE   F E SK+FNLPS+     E+RFP+LPSHLR+GE +G G 
Sbjct: 1409 LSGVPGRQSDLDDIDGRESRRFGEGSKTFNLPSD-----ESRFPVLPSHLRRGELEGPGE 1463

Query: 857  L-----------PARLRGGDLIGSNVPPGRLQSGEPIGHRNLPN--------------HL 753
            L           P  LRGGDLIG ++ P  LQ GE  G RN+P               H 
Sbjct: 1464 LVMADPIASRPAPHHLRGGDLIGQDILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFLGHP 1523

Query: 752  HRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGG-NLPSRARGAESGFSSGFPIHGY 576
              G+++G G F +R    ++              FGG N     R  E GF S + +HGY
Sbjct: 1524 RMGELSGPGNFPSRLSAGES--------------FGGSNKSGHPRIGEPGFRSTYSLHGY 1569

Query: 575  QNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVL 396
             ND GF   GD+ESFD SRKRK  SM WCRIC +DCETV+GLDMHSQTREHQ+MAMD+VL
Sbjct: 1570 PNDHGFRPPGDMESFDNSRKRKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMDIVL 1629

Query: 395  SIKKDNVKKQKVSSDDHKSHEDGSKSSK 312
            SIK+ N KKQK++S DH + ED SKS K
Sbjct: 1630 SIKQQNAKKQKLTSKDHSTPEDSSKSKK 1657


>ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            15-like isoform X1 [Citrus sinensis]
            gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1|
            PREDICTED: mediator of RNA polymerase II transcription
            subunit 15-like isoform X3 [Citrus sinensis]
            gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X4 [Citrus sinensis]
          Length = 1392

 Score =  302 bits (774), Expect = 3e-79
 Identities = 206/480 (42%), Positives = 251/480 (52%), Gaps = 27/480 (5%)
 Frame = -1

Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473
            DGR+ DSH PGS +    G PS    N M++NGGPG  L                     
Sbjct: 985  DGRESDSHFPGSQQRSPLGPPSGTRSNMMRMNGGPGSELR-------------------- 1024

Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293
             DERFK  P+         R  PF ++P R +I R EFEEDLKQF R + LD+E VPK  
Sbjct: 1025 -DERFKSFPD--------GRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLG 1075

Query: 1292 SYF--SRP-DRASHGF-------------NHDVGLKLD--GNDNAPRLLPPYQPGSLRPL 1167
            S+F  SRP DR  HG+             ++D GLKLD  G     R LP Y        
Sbjct: 1076 SHFLPSRPFDRGPHGYGMDMGPRPFERGLSYDPGLKLDPMGASAPSRFLPAYH------- 1128

Query: 1166 DLCDDNMDRRVDIAAGVPPDFLRS--ASGRNRIDGFPLRSPGREY-------PSHPSSRF 1014
                D+   R D ++   PDF R   A GR  + G   RS  RE+        S   SR 
Sbjct: 1129 ----DDAAGRSD-SSHAHPDFPRPGRAYGRRHMGGLSPRSSFREFCGFGGLPGSLGGSRS 1183

Query: 1013 RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPARLRGG 834
             R ED  GRE   F +          GN+FH++RFP+LPSHLR+GE +G G      R G
Sbjct: 1184 VR-EDIGGREFRRFGDPI--------GNSFHDSRFPVLPSHLRRGEFEGPG------RTG 1228

Query: 833  DLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXX 654
            DLIG    P  L+ GEP+G    P++L  G+  G GGF   A++ +              
Sbjct: 1229 DLIGQEFLPSHLRRGEPLG----PHNLRLGETVGLGGFPGPARMEELGGP---------- 1274

Query: 653  SFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKV 474
               GN P    G E GF S F   G+ NDGGF+  GD+ES D SRKRK  SMGWCRICKV
Sbjct: 1275 ---GNFPPPRLG-EPGFRSSFSRQGFPNDGGFYT-GDMESIDNSRKRKPPSMGWCRICKV 1329

Query: 473  DCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294
            DCETV+GLD+HSQTREHQKMAMDMVLSIK+ N KKQK++S D  S +D +KS    F+ R
Sbjct: 1330 DCETVDGLDLHSQTREHQKMAMDMVLSIKQ-NAKKQKLTSGDRCSTDDANKSRNVNFDGR 1388


>ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina]
            gi|557526921|gb|ESR38227.1| hypothetical protein
            CICLE_v10027683mg [Citrus clementina]
          Length = 1392

 Score =  302 bits (774), Expect = 3e-79
 Identities = 206/480 (42%), Positives = 251/480 (52%), Gaps = 27/480 (5%)
 Frame = -1

Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473
            DGR+ DSH PGS +    G PS    N M++NGGPG  L                     
Sbjct: 985  DGRESDSHFPGSQQRSPLGPPSGTRSNMMRMNGGPGSELR-------------------- 1024

Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293
             DERFK  P+         R  PF ++P R +I R EFEEDLKQF R + LD+E VPK  
Sbjct: 1025 -DERFKSFPD--------GRLNPFPVDPARSVIDRGEFEEDLKQFSRPSHLDAEPVPKLG 1075

Query: 1292 SYF--SRP-DRASHGF-------------NHDVGLKLD--GNDNAPRLLPPYQPGSLRPL 1167
            S+F  SRP DR  HG+             ++D GLKLD  G     R LP Y        
Sbjct: 1076 SHFLPSRPFDRGPHGYGMDMGPRPFERGLSYDPGLKLDPMGASAPSRFLPAYH------- 1128

Query: 1166 DLCDDNMDRRVDIAAGVPPDFLRS--ASGRNRIDGFPLRSPGREY-------PSHPSSRF 1014
                D+   R D ++   PDF R   A GR  + G   RS  RE+        S   SR 
Sbjct: 1129 ----DDAAGRSD-SSHAHPDFPRPGRAYGRRHMGGLSPRSSFREFCGFGGLPGSLGGSRS 1183

Query: 1013 RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPARLRGG 834
             R ED  GRE   F +          GN+FH++RFP+LPSHLR+GE +G G      R G
Sbjct: 1184 VR-EDIGGREFRRFGDPI--------GNSFHDSRFPVLPSHLRRGEFEGPG------RTG 1228

Query: 833  DLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXX 654
            DLIG    P  L+ GEP+G    P++L  G+  G GGF   A++ +              
Sbjct: 1229 DLIGQEFLPSHLRRGEPLG----PHNLRLGETVGLGGFPGPARMEELGGP---------- 1274

Query: 653  SFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKV 474
               GN P    G E GF S F   G+ NDGGF+  GD+ES D SRKRK  SMGWCRICKV
Sbjct: 1275 ---GNFPPPRLG-EPGFRSSFSHQGFPNDGGFYT-GDMESIDNSRKRKPPSMGWCRICKV 1329

Query: 473  DCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294
            DCETV+GLD+HSQTREHQKMAMDMVLSIK+ N KKQK++S D  S +D +KS    F+ R
Sbjct: 1330 DCETVDGLDLHSQTREHQKMAMDMVLSIKQ-NAKKQKLTSGDRCSTDDANKSRNVNFDGR 1388


>ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica]
            gi|462400592|gb|EMJ06149.1| hypothetical protein
            PRUPE_ppa000292mg [Prunus persica]
          Length = 1334

 Score =  300 bits (769), Expect = 1e-78
 Identities = 204/467 (43%), Positives = 249/467 (53%), Gaps = 18/467 (3%)
 Frame = -1

Query: 1637 DSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPDERF 1458
            DSH    +     GQPS + PN +++NG PG        D S   G ++ER+K  P    
Sbjct: 931  DSHGGMMSRAAPIGQPSGIHPNMLRMNGTPGL-------DSSSTHGPRDERFKAFP---- 979

Query: 1457 KRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFSR 1278
                        G+R  PF ++P RH+I R EFE+DLKQFPR + LDSE V KF +Y SR
Sbjct: 980  ------------GERLNPFPVDPTRHVIDRVEFEDDLKQFPRPSYLDSEPVAKFGNYSSR 1027

Query: 1277 P-DRASHGFNHDVGLKLDG-NDNAP-RLLPPYQ-PGSLRPLDLCDDNMDRRVDIAAGVPP 1110
            P DRA HGF +D G   D     AP R L PY+  GS+   D  D     R++   G  P
Sbjct: 1028 PFDRAPHGFKYDSGPHTDPLAGTAPSRFLSPYRLGGSVHGNDAGDFG---RMEPTHG-HP 1083

Query: 1109 DFLRSASGRNRIDGFPLRSPGREYPSHPSSRFRRL--EDSDGRELHVFSEQSKSFNLPSE 936
            DF+    GR  +DG   RSP R+YP  P   FR    +D DGRE H F +          
Sbjct: 1084 DFV----GRRLVDGLAPRSPVRDYPGLPPHGFRGFGPDDFDGREFHRFGDPL-------- 1131

Query: 935  GNAFHENRFPILPSHLRKGESDGSGSLP-ARLRGGDLIGSNVPPGRLQSGEPIGHRNL-- 765
            GN FHE RF  LP H R+GE +G G+L     R  D IG +  PG L+ G+ +G  NL  
Sbjct: 1132 GNQFHEGRFSNLPGHFRRGEFEGPGNLRMVDHRRNDFIGQDGHPGHLRRGDHLGPHNLRE 1191

Query: 764  -----PNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFS 600
                   H H G +AG G F                       F GN P+  R  E GF 
Sbjct: 1192 PLGFGSRHSHMGDMAGPGNFE---------------------PFRGNRPNHPRLGEPGFR 1230

Query: 599  SGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQ 420
            S F +  + NDG +   GD+ESFD SRKRK  SMGWCRICKVDCETVEGLD+HSQTREHQ
Sbjct: 1231 SSFSLQRFPNDGTY--TGDLESFDHSRKRKPASMGWCRICKVDCETVEGLDLHSQTREHQ 1288

Query: 419  KMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSS----KAGFESRD 291
            KMAMDMV SIK+ N KKQK++S D    ED +KS     +AG +S D
Sbjct: 1289 KMAMDMVRSIKQ-NAKKQKLTSGDQSLLEDANKSKIPVLRAGEKSID 1334


>emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]
          Length = 1131

 Score =  279 bits (714), Expect = 2e-72
 Identities = 187/451 (41%), Positives = 232/451 (51%), Gaps = 4/451 (0%)
 Frame = -1

Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473
            DGRQ DSH+PGS+E   FGQPS    N M++NGG G        + S+P GLQ+ER+K +
Sbjct: 774  DGRQSDSHIPGSSERGPFGQPSGXQSNMMRMNGGLGI-------ESSLPVGLQDERFKSL 826

Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293
            P                         EPGR      +F EDLKQF RS+ LDS+ VPKF 
Sbjct: 827  P-------------------------EPGRRSSDHGKFAEDLKQFSRSSHLDSDLVPKFG 861

Query: 1292 SYFS--RP-DRASHGFNHDVGLKLDGNDNAPRLLPPYQPGSLRPLDLCDDNMDRRVDIAA 1122
            +YFS  RP DR S GF  D    L   D AP                             
Sbjct: 862  NYFSSSRPLDRGSQGFVMDAAQGL--LDKAPL---------------------------- 891

Query: 1121 GVPPDFLRSASGRNRIDGFPLRSPGREYPSHPSSRFRRLEDSDGRELHVFSEQSKSFNLP 942
                             GF   S  +      +SR   L+D DGRE   F E  ++FNLP
Sbjct: 892  -----------------GFNYDSGFKSSAGTGTSRQSDLDDIDGRESRRFGEGYQTFNLP 934

Query: 941  SEGNAFHENRFPILPSHLRKGESDGSGSLPARLRGGDLIGSNVPPGRLQSGEPIGHRNLP 762
            S+     E+RFP+LPSHLR+        LP+ L+ G+  GS   PG+L+ GEP+    L 
Sbjct: 935  SD-----ESRFPVLPSHLRRD------ILPSHLQRGEHFGSRNIPGQLRFGEPVFDAFL- 982

Query: 761  NHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGG-NLPSRARGAESGFSSGFPI 585
             H   G+++G G F +R    ++              FGG N     R  E GF S + +
Sbjct: 983  GHPRMGELSGPGNFPSRLSAGES--------------FGGSNKSGHPRIGEPGFRSTYSL 1028

Query: 584  HGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMD 405
            HGY ND GF   GD+ESFD SRKRK  SM WCRIC +DCETV+GLDMHSQTREHQ+MAMD
Sbjct: 1029 HGYPNDHGFRPPGDMESFDNSRKRKPLSMAWCRICNIDCETVDGLDMHSQTREHQQMAMD 1088

Query: 404  MVLSIKKDNVKKQKVSSDDHKSHEDGSKSSK 312
            +VLSIK+ N KKQK++S DH + ED SKS K
Sbjct: 1089 IVLSIKQQNAKKQKLTSKDHSTPEDSSKSKK 1119


>ref|XP_006848046.1| hypothetical protein AMTR_s00029p00190880 [Amborella trichopoda]
            gi|548851351|gb|ERN09627.1| hypothetical protein
            AMTR_s00029p00190880 [Amborella trichopoda]
          Length = 1626

 Score =  275 bits (703), Expect = 4e-71
 Identities = 198/490 (40%), Positives = 251/490 (51%), Gaps = 37/490 (7%)
 Frame = -1

Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473
            DGRQPD H            PS+  P  + +NG  GK   S   + + P GL EER+  +
Sbjct: 1182 DGRQPDVHQ---------SLPSDRAPYGL-VNGAAGK--GSNVPESAFPHGLPEERFGPL 1229

Query: 1472 PDERFKRLPEEGFNM-LPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKF 1296
            P++RFK LPE+G    LP D F+P+ ++P R  I RREFEEDLK+FPRS  LD E   ++
Sbjct: 1230 PEDRFKHLPEDGLKKPLPDDHFRPYALDPSRRAIDRREFEEDLKKFPRSGHLDGEPASRY 1289

Query: 1295 ESYFSRPDRASHGFN--HDVGLKLDGNDNAPRL-----LPPYQPGSLRPLDLCD------ 1155
            + YFS  + + H        GL LD    APR      +PPY+      LDL D      
Sbjct: 1290 DGYFSSRNPSGHSPRSLERPGLNLD----APRYPEGMSVPPYRGAGGSSLDLGDRSKPGG 1345

Query: 1154 ---DNMDRRVDIAAGVPPDFLRSAS--GRNRIDGF-PLRSPGREYPSHPSSRFRR----- 1008
               D + R++D   G   D+        R+  DG  P RSP R+Y     S  R      
Sbjct: 1346 FHGDLIGRKLD-TTGARSDYGGPFPEVSRSHRDGLGPPRSPVRDYAGVRVSGVRPDYAGI 1404

Query: 1007 ---LEDSDGRELHVFSEQ-SKSFNLPSEGNAFHENRFPI-LPSHLRKGESDGSGSLPARL 843
               L+   GRE   F EQ +++F  P  G       F   LP   R  ES G G  P  L
Sbjct: 1405 PHPLDGLGGREPLGFGEQRARAFLDPIHGGKIPSGPFESRLPIPSRIAESAGFGDFPGHL 1464

Query: 842  RGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXX 663
            RGGD  G    P   +SGE      LP+HL   ++AG G      ++ +A          
Sbjct: 1465 RGGDPFG----PSHFRSGE------LPSHLRGRELAGSGNLPPHLRIGEAMGP------- 1507

Query: 662  XXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAG-----DVESFDLSRKRKLGSM 498
                 GG+L             GF + GY  DGGF+N G     DV++ + SRKRK GS 
Sbjct: 1508 -----GGHLRE----------PGFGMQGYPKDGGFYNPGSFPPSDVDALEYSRKRKPGST 1552

Query: 497  GWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKV--SSDDHKSHEDGS 324
            GWCRICKVDCETVEGLD+HSQTREHQKMAMDMVLSIK+D+ KKQK+  SS+DH   E+ +
Sbjct: 1553 GWCRICKVDCETVEGLDLHSQTREHQKMAMDMVLSIKQDSAKKQKLYGSSEDHVPQEEPT 1612

Query: 323  KSSKAGFESR 294
            K  +A FESR
Sbjct: 1613 KGRRASFESR 1622


>ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314450 [Fragaria vesca
            subsp. vesca]
          Length = 1316

 Score =  273 bits (697), Expect = 2e-70
 Identities = 190/441 (43%), Positives = 237/441 (53%), Gaps = 6/441 (1%)
 Frame = -1

Query: 1598 GQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPDERFKRLPEEGFNMLPG 1419
            GQPS +  N +++NG PG   +S         GL++ER+K +PD R         N  PG
Sbjct: 939  GQPSGIISNMLRMNGNPGFESSS-------TLGLRDERFKALPDGRL--------NPFPG 983

Query: 1418 DRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFSRP-DRASHGFNHDV 1242
            D        P R +I+R  FE+DLKQFPR + LDSE +PK  +Y SR  DR   G N+D 
Sbjct: 984  D--------PTR-VISRVGFEDDLKQFPRPSFLDSEPLPKLGNYSSRAFDRRPFGVNYDT 1034

Query: 1241 GLKLD-GNDNAPRLLPPYQPGSLRPLDLCDDNMDRRVDIAAGVPPDFLRSASGRNRIDGF 1065
             L +D    +APR L PY    L      +D +           PDF     GR  +DG 
Sbjct: 1035 RLNIDPAAGSAPRFLSPYGHAGLIH---ANDTIGH---------PDF----GGRRLMDGL 1078

Query: 1064 PLRSPGREYPSHPSSRFRRL--EDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSH 891
              RSP R+YP  PS RFR    +D DGRE H F +          G  FH+NRFP    H
Sbjct: 1079 ARRSPIRDYPGIPS-RFRGFGPDDFDGREFHRFGDPL--------GREFHDNRFP--NQH 1127

Query: 890  LRKGESDGSGSLPA--RLRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFH 717
             R+GE +G G++    R+R  DLIG +   G LQ GE +G  NLP HLH  +  GFG   
Sbjct: 1128 FRRGEFEGPGNMRVDDRMRN-DLIGQDGHLGHLQRGEHLGPHNLPGHLHMREHVGFGVHP 1186

Query: 716  NRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVE 537
              A                  SF GN  +  R  E GF S F +  + NDG +  AG++E
Sbjct: 1187 RHA------------GPGSFESFIGNRANHPRLGEPGFRSSFSLKRFPNDGTY--AGELE 1232

Query: 536  SFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVS 357
            SFD SRKRK  SMGWCRICKV+CETVEGLD+HSQTREHQ+MAM+MV  I K N KKQK++
Sbjct: 1233 SFDHSRKRKPASMGWCRICKVNCETVEGLDVHSQTREHQRMAMEMV-QIIKQNAKKQKLT 1291

Query: 356  SDDHKSHEDGSKSSKAGFESR 294
            S D  S ED +KS     ES+
Sbjct: 1292 SGDQSSIEDANKSKITSSESQ 1312


>ref|XP_004169561.1| PREDICTED: uncharacterized protein LOC101227701 [Cucumis sativus]
          Length = 538

 Score =  270 bits (691), Expect = 1e-69
 Identities = 192/464 (41%), Positives = 245/464 (52%), Gaps = 16/464 (3%)
 Frame = -1

Query: 1637 DSHLPGSAEHVLFGQPSNM---PPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPD 1467
            DSHLPG+ EH     P ++   PPN + LNG PG        D S   GL+        D
Sbjct: 128  DSHLPGTMEH----HPPHLTGIPPNVLPLNGAPGP-------DSSSKLGLR--------D 168

Query: 1466 ERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESY 1287
            ERFK L EE  N  P        ++P R  I + + E+ L+QFPR + L+SE   +  +Y
Sbjct: 169  ERFKLLHEEQLNSFP--------LDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY 220

Query: 1286 FSRP-DRASHGFNHDVGLKLDGNDNAPRLLPP-------YQPGSLRPLDLCDDNMDRRVD 1131
              RP DR  HG N D GL +DG   A R+LPP       Y   + RP+   +D+  +  D
Sbjct: 221  SLRPFDRGVHGQNFDTGLTIDGAA-ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQ-AD 278

Query: 1130 IAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHP--SSRFRRLEDSDGREL-HVFSEQS 960
             + G        + GR  +DGF  RSP  EY         F  +E+ DG++  H F +  
Sbjct: 279  RSRGHSDFPAPGSYGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFGDPL 338

Query: 959  KSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGE 786
                      +F E+RFPI  SHL++G+ + SG+  +   LR GDLIG +          
Sbjct: 339  ----------SFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQD---------R 379

Query: 785  PIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESG 606
              G R+LP HL  G++  FG     +++ D                GG+ P+  R  E G
Sbjct: 380  HFGPRSLPGHLRLGELTAFGSHPGHSRIGDLSVLGNFEPFG-----GGHRPNNPRLGEPG 434

Query: 605  FSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTRE 426
            F S F   G  +DG FF AGDVESFD SRKRK  SMGWCRICKVDCETVEGL++HSQTRE
Sbjct: 435  FRSSFSRQGLVDDGRFF-AGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTRE 493

Query: 425  HQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294
            HQKMAMDMV SIK+ N KK KV+ +DH S EDG KS   G ESR
Sbjct: 494  HQKMAMDMVQSIKQ-NAKKHKVTPNDHSS-EDG-KSKNVGLESR 534


>ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214768 [Cucumis sativus]
          Length = 1177

 Score =  270 bits (691), Expect = 1e-69
 Identities = 192/464 (41%), Positives = 245/464 (52%), Gaps = 16/464 (3%)
 Frame = -1

Query: 1637 DSHLPGSAEHVLFGQPSNM---PPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPD 1467
            DSHLPG+ EH     P ++   PPN + LNG PG        D S   GL+        D
Sbjct: 767  DSHLPGTMEH----HPPHLTGIPPNVLPLNGAPGP-------DSSSKLGLR--------D 807

Query: 1466 ERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESY 1287
            ERFK L EE  N  P        ++P R  I + + E+ L+QFPR + L+SE   +  +Y
Sbjct: 808  ERFKLLHEEQLNSFP--------LDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY 859

Query: 1286 FSRP-DRASHGFNHDVGLKLDGNDNAPRLLPP-------YQPGSLRPLDLCDDNMDRRVD 1131
              RP DR  HG N D GL +DG   A R+LPP       Y   + RP+   +D+  +  D
Sbjct: 860  SLRPFDRGVHGQNFDTGLTIDGAA-ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQ-AD 917

Query: 1130 IAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHP--SSRFRRLEDSDGREL-HVFSEQS 960
             + G        + GR  +DGF  RSP  EY         F  +E+ DG++  H F +  
Sbjct: 918  RSRGHSDFPAPGSYGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFGDPL 977

Query: 959  KSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGE 786
                      +F E+RFPI  SHL++G+ + SG+  +   LR GDLIG +          
Sbjct: 978  ----------SFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQD---------R 1018

Query: 785  PIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESG 606
              G R+LP HL  G++  FG     +++ D                GG+ P+  R  E G
Sbjct: 1019 HFGPRSLPGHLRLGELTAFGSHPGHSRIGDLSVLGNFEPFG-----GGHRPNNPRLGEPG 1073

Query: 605  FSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTRE 426
            F S F   G  +DG FF AGDVESFD SRKRK  SMGWCRICKVDCETVEGL++HSQTRE
Sbjct: 1074 FRSSFSRQGLVDDGRFF-AGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTRE 1132

Query: 425  HQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294
            HQKMAMDMV SIK+ N KK KV+ +DH S EDG KS   G ESR
Sbjct: 1133 HQKMAMDMVQSIKQ-NAKKHKVTPNDHSS-EDG-KSKNVGLESR 1173


>ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205914 [Cucumis sativus]
          Length = 1434

 Score =  270 bits (691), Expect = 1e-69
 Identities = 192/464 (41%), Positives = 245/464 (52%), Gaps = 16/464 (3%)
 Frame = -1

Query: 1637 DSHLPGSAEHVLFGQPSNM---PPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPD 1467
            DSHLPG+ EH     P ++   PPN + LNG PG        D S   GL+        D
Sbjct: 1024 DSHLPGTMEH----HPPHLTGIPPNVLPLNGAPGP-------DSSSKLGLR--------D 1064

Query: 1466 ERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESY 1287
            ERFK L EE  N  P        ++P R  I + + E+ L+QFPR + L+SE   +  +Y
Sbjct: 1065 ERFKLLHEEQLNSFP--------LDPARRPINQTDAEDILRQFPRPSHLESELAQRIGNY 1116

Query: 1286 FSRP-DRASHGFNHDVGLKLDGNDNAPRLLPP-------YQPGSLRPLDLCDDNMDRRVD 1131
              RP DR  HG N D GL +DG   A R+LPP       Y   + RP+   +D+  +  D
Sbjct: 1117 SLRPFDRGVHGQNFDTGLTIDGAA-ASRVLPPRHIGGALYPTDAERPIAFYEDSTGQ-AD 1174

Query: 1130 IAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHP--SSRFRRLEDSDGREL-HVFSEQS 960
             + G        + GR  +DGF  RSP  EY         F  +E+ DG++  H F +  
Sbjct: 1175 RSRGHSDFPAPGSYGRRFVDGFGPRSPLHEYHGRGFGGRGFTGVEEIDGQDFPHHFGDPL 1234

Query: 959  KSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGE 786
                      +F E+RFPI  SHL++G+ + SG+  +   LR GDLIG +          
Sbjct: 1235 ----------SFRESRFPIFRSHLQRGDFESSGNFRMSEHLRTGDLIGQD---------R 1275

Query: 785  PIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESG 606
              G R+LP HL  G++  FG     +++ D                GG+ P+  R  E G
Sbjct: 1276 HFGPRSLPGHLRLGELTAFGSHPGHSRIGDLSVLGNFEPFG-----GGHRPNNPRLGEPG 1330

Query: 605  FSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTRE 426
            F S F   G  +DG FF AGDVESFD SRKRK  SMGWCRICKVDCETVEGL++HSQTRE
Sbjct: 1331 FRSSFSRQGLVDDGRFF-AGDVESFDNSRKRKPISMGWCRICKVDCETVEGLELHSQTRE 1389

Query: 425  HQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294
            HQKMAMDMV SIK+ N KK KV+ +DH S EDG KS   G ESR
Sbjct: 1390 HQKMAMDMVQSIKQ-NAKKHKVTPNDHSS-EDG-KSKNVGLESR 1430


>ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus communis]
            gi|223540292|gb|EEF41863.1| hypothetical protein
            RCOM_0731250 [Ricinus communis]
          Length = 1329

 Score =  270 bits (690), Expect = 1e-69
 Identities = 181/462 (39%), Positives = 233/462 (50%), Gaps = 25/462 (5%)
 Frame = -1

Query: 1598 GQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQMPDERFKRLPEEGFNMLPG 1419
            GQ S M  NAM++NG PG        D S   GL+++R++   DE               
Sbjct: 935  GQQSGMHSNAMRMNGAPG-------MDSSSALGLRDDRFRPFSDEYMN------------ 975

Query: 1418 DRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFS--RP------DRAS 1263
                PF  +P + I+ RREFEEDLK F R + LD++   KF + FS  RP      D+  
Sbjct: 976  ----PFPKDPSQRIVDRREFEEDLKHFSRPSDLDTQSTTKFGANFSSSRPLDRGPLDKGL 1031

Query: 1262 HGFNHDVGLKLDGNDNAP--RLLPPYQ-PGSLRPLDLCDDNMDRRVDIAAGVPPDFLRSA 1092
            HG N+D G+KL+     P  R  PPY   G + P D+ + ++    D   G  PD +R+ 
Sbjct: 1032 HGPNYDSGMKLESLGGPPPSRFFPPYHHDGLMHPNDIAERSIGFH-DNTLGRQPDSVRAH 1090

Query: 1091 S---------GRNRIDGFPLRSPGREYPSHPSSRFRR---LEDSDGRELHVFSEQSKSFN 948
                       R   DG   RSPGR+YP   S  F     L+D DGRE   F        
Sbjct: 1091 PEFFGPGRRYDRRHRDGMAPRSPGRDYPGVSSRGFGAIPGLDDIDGRESRRF-------- 1142

Query: 947  LPSEGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGEPIGH 774
                G++FH +RFP+LPSH+R GE +G          R G+ +G +    RL  GEPIG 
Sbjct: 1143 ----GDSFHGSRFPVLPSHMRMGEFEGPSQDGFSNHFRRGEHLGHHNMRNRL--GEPIGF 1196

Query: 773  RNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSG 594
               P     G ++G G F N                              R  E GF S 
Sbjct: 1197 GAFPGPAGMGDLSGTGNFFN-----------------------------PRLGEPGFRSS 1227

Query: 593  FPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKM 414
            F   G+  DGG + AG++ESFD SR+RK  SMGWCRICKVDCETVEGLD+HSQTREHQK 
Sbjct: 1228 FSFKGFPGDGGIY-AGELESFDNSRRRKSSSMGWCRICKVDCETVEGLDLHSQTREHQKR 1286

Query: 413  AMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESRDN 288
            AMDMV++IK+ N KKQK++++DH S +D SKS     E R N
Sbjct: 1287 AMDMVVTIKQ-NAKKQKLANNDHSSVDDASKSKNTSIEGRGN 1327


>ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma cacao]
            gi|508786600|gb|EOY33856.1| Uncharacterized protein
            isoform 7 [Theobroma cacao]
          Length = 975

 Score =  261 bits (666), Expect = 9e-67
 Identities = 175/427 (40%), Positives = 218/427 (51%), Gaps = 22/427 (5%)
 Frame = -1

Query: 1502 GLQEERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQ 1323
            GL       +  ER K + +E  N  P DR          H   R +FEEDLK FPR + 
Sbjct: 600  GLDSTSTFSLRGERLKPVQDECSNQFPLDR---------GHRGDRGQFEEDLKHFPRPSH 650

Query: 1322 LDSEGVPKFESYFS--RP-DRASHGFNHDVGLKLDGND------------NAPRLLPPYQ 1188
            LD+E VPKF SY S  RP DR  HGF  D+G +    +               R LPPY 
Sbjct: 651  LDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYH 710

Query: 1187 PGSL--RPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSS 1020
            P     RP+ L  D + R         PDFL +    GR+R+DGF  RSPGREYP     
Sbjct: 711  PDDTGERPVGLPKDTLGR---------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPH 761

Query: 1019 RF--RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPAR 846
             F     ++ DGRE   FS+                 RFP LP HL +G  + S  +   
Sbjct: 762  GFGGHPGDEIDGRERR-FSD-----------------RFPGLPGHLHRGGFESSDRMEEH 803

Query: 845  LRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXX 666
            LR  D+I  +  P   + GE +GH N+P HL  G+  GFG F +  ++ +          
Sbjct: 804  LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGE---------- 853

Query: 665  XXXXSFGGNLPSR-ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWC 489
                 FGG    R  R  E GF S F +  + NDGG +  G ++SF+  RKRK  SMGWC
Sbjct: 854  -----FGGPGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGG-MDSFENLRKRKPMSMGWC 907

Query: 488  RICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKA 309
            RICK+DCETVEGLD+HSQTREHQKMAMDMV++IK+ N KKQK++S DH    D SKS   
Sbjct: 908  RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQ-NAKKQKLTSSDHSIRNDTSKSKNV 966

Query: 308  GFESRDN 288
             FE R N
Sbjct: 967  KFEGRVN 973


>ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590588563|ref|XP_007016233.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
            gi|590588573|ref|XP_007016234.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786595|gb|EOY33851.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508786596|gb|EOY33852.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 1408

 Score =  261 bits (666), Expect = 9e-67
 Identities = 175/427 (40%), Positives = 218/427 (51%), Gaps = 22/427 (5%)
 Frame = -1

Query: 1502 GLQEERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQ 1323
            GL       +  ER K + +E  N  P DR          H   R +FEEDLK FPR + 
Sbjct: 1033 GLDSTSTFSLRGERLKPVQDECSNQFPLDR---------GHRGDRGQFEEDLKHFPRPSH 1083

Query: 1322 LDSEGVPKFESYFS--RP-DRASHGFNHDVGLKLDGND------------NAPRLLPPYQ 1188
            LD+E VPKF SY S  RP DR  HGF  D+G +    +               R LPPY 
Sbjct: 1084 LDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYH 1143

Query: 1187 PGSL--RPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSS 1020
            P     RP+ L  D + R         PDFL +    GR+R+DGF  RSPGREYP     
Sbjct: 1144 PDDTGERPVGLPKDTLGR---------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPH 1194

Query: 1019 RF--RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPAR 846
             F     ++ DGRE   FS+                 RFP LP HL +G  + S  +   
Sbjct: 1195 GFGGHPGDEIDGRERR-FSD-----------------RFPGLPGHLHRGGFESSDRMEEH 1236

Query: 845  LRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXX 666
            LR  D+I  +  P   + GE +GH N+P HL  G+  GFG F +  ++ +          
Sbjct: 1237 LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGE---------- 1286

Query: 665  XXXXSFGGNLPSR-ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWC 489
                 FGG    R  R  E GF S F +  + NDGG +  G ++SF+  RKRK  SMGWC
Sbjct: 1287 -----FGGPGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGG-MDSFENLRKRKPMSMGWC 1340

Query: 488  RICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKA 309
            RICK+DCETVEGLD+HSQTREHQKMAMDMV++IK+ N KKQK++S DH    D SKS   
Sbjct: 1341 RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQ-NAKKQKLTSSDHSIRNDTSKSKNV 1399

Query: 308  GFESRDN 288
             FE R N
Sbjct: 1400 KFEGRVN 1406


>ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa]
            gi|550331020|gb|ERP56830.1| hypothetical protein
            POPTR_0009s04520g [Populus trichocarpa]
          Length = 1315

 Score =  258 bits (658), Expect = 7e-66
 Identities = 174/417 (41%), Positives = 220/417 (52%), Gaps = 34/417 (8%)
 Frame = -1

Query: 1436 FNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYF--SRP---- 1275
            F+ LP +   PF   P  H + + EFEEDLK FPR + LD+E VPK  S+F  SRP    
Sbjct: 931  FSSLPDEHLNPFPRGPAHHNVHQGEFEEDLKHFPRPSHLDTEPVPKSSSHFPSSRPLDRG 990

Query: 1274 -------------DRASHGFNHDVGLKLD--GNDNAPRLLPPYQPG-SLRPLDL-----C 1158
                         D+ SHGFN+D GL ++  G    PR  PPY    +L P D       
Sbjct: 991  PRGFGVDGAPRPLDKGSHGFNYDSGLNMEPLGGSAPPRFFPPYHHDKALHPSDAEVSLGY 1050

Query: 1157 DDNMDRRVDIAAGVPPDFLRS---ASGRNRIDGFPLRSPGREYPSHPSSRFRRL---EDS 996
             D++  R D A    P FL           +D    RSP R+YP  P+ RF  L   +D 
Sbjct: 1051 HDSLAGRSDFAR-TRPGFLGPPIPGYDHRHMDNLAPRSPVRDYPGMPTRRFGALPGLDDI 1109

Query: 995  DGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLP-ARLRGGDLIGS 819
            DGR+ H F ++  S        +  ++RFP+ PSHLR+GE +G G+L       GDL+G 
Sbjct: 1110 DGRDPHRFGDKFSS--------SLRDSRFPVFPSHLRRGELEGPGNLHMGEHLSGDLMGH 1161

Query: 818  NVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGN 639
            +  P  L+ GE +G RNLP+HL  G+   FG F   A++ +                 GN
Sbjct: 1162 DGRPAHLRRGEHLGPRNLPSHLWVGEPGNFGAFPGHARMGELAGP-------------GN 1208

Query: 638  LPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETV 459
                  G E GF S F        GG + AGD++ FD SRKRK  SMGWCRICKVDCETV
Sbjct: 1209 FYHHQLG-EPGFRSSF--------GGNY-AGDLQFFDNSRKRK-PSMGWCRICKVDCETV 1257

Query: 458  EGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESRDN 288
            E LD+HSQTREHQKMA+DMV++IK+ N KK K +   H S ED SKS  A FE R N
Sbjct: 1258 EALDLHSQTREHQKMALDMVVTIKQ-NAKKHKSTPCHHSSLEDKSKSRNASFEGRGN 1313


>ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma cacao]
            gi|508786601|gb|EOY33857.1| Uncharacterized protein
            isoform 8 [Theobroma cacao]
          Length = 972

 Score =  253 bits (647), Expect = 1e-64
 Identities = 174/427 (40%), Positives = 216/427 (50%), Gaps = 22/427 (5%)
 Frame = -1

Query: 1502 GLQEERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQ 1323
            GL       +  ER K + +E  N  P DR          H   R +FEEDLK FPR + 
Sbjct: 600  GLDSTSTFSLRGERLKPVQDECSNQFPLDR---------GHRGDRGQFEEDLKHFPRPSH 650

Query: 1322 LDSEGVPKFESYFS--RP-DRASHGFNHDVGLKLDGND------------NAPRLLPPYQ 1188
            LD+E VPKF SY S  RP DR  HGF  D+G +    +               R LPPY 
Sbjct: 651  LDNEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYH 710

Query: 1187 PGSL--RPLDLCDDNMDRRVDIAAGVPPDFLRSAS--GRNRIDGFPLRSPGREYPSHPSS 1020
            P     RP+ L  D + R         PDFL +    GR+R+DGF  RSPGREYP     
Sbjct: 711  PDDTGERPVGLPKDTLGR---------PDFLGTVPSYGRHRMDGFVSRSPGREYPGISPH 761

Query: 1019 RF--RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPAR 846
             F     ++ DGRE   FS+                 RFP LP HL +G  + S  +   
Sbjct: 762  GFGGHPGDEIDGRERR-FSD-----------------RFPGLPGHLHRGGFESSDRMEEH 803

Query: 845  LRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXX 666
            LR  D+I  +  P   + GE +GH N+P HL  G+  GFG F +  ++ +          
Sbjct: 804  LRSRDMINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGE---------- 853

Query: 665  XXXXSFGGNLPSR-ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWC 489
                 FGG    R  R  E GF S F +  + NDGG +  G ++SF+  RKRK  SMGWC
Sbjct: 854  -----FGGPGNFRHPRLGEPGFRSSFSLQEFPNDGGIYTGG-MDSFENLRKRKPMSMGWC 907

Query: 488  RICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKA 309
            RICK+DCETVEGLD+HSQTREHQKMAMDMV++IK+ N KKQK+   DH    D SKS   
Sbjct: 908  RICKIDCETVEGLDLHSQTREHQKMAMDMVVTIKQ-NAKKQKL---DHSIRNDTSKSKNV 963

Query: 308  GFESRDN 288
             FE R N
Sbjct: 964  KFEGRVN 970


>ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa]
            gi|222845587|gb|EEE83134.1| hypothetical protein
            POPTR_0001s25430g [Populus trichocarpa]
          Length = 1327

 Score =  239 bits (610), Expect = 3e-60
 Identities = 185/499 (37%), Positives = 236/499 (47%), Gaps = 45/499 (9%)
 Frame = -1

Query: 1649 GRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLAS--------GFQDPSIPFGLQ 1494
            GR P  H+P       +G P     +A       G+R +S        G Q PS P G Q
Sbjct: 872  GRLPPGHMPSH-----YGPPQGPYTHAPT---SQGERTSSYVHETSMFGNQRPSYPGGRQ 923

Query: 1493 EERYKQMPDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDS 1314
                  +     +    + F   P +   PF  +P R    + EFEEDLK F   + LD+
Sbjct: 924  GILSNAVGTNGAQDPNSDRFRSFPDEHLNPFPHDPARRNAHQGEFEEDLKHFTAPSCLDT 983

Query: 1313 EGVPKFESYFS--RP-----------------DRASHGFNHDVGLKLD--GNDNAPRLLP 1197
            + VPK   +FS  RP                 D+ SHG N+D GL ++  G    PR  P
Sbjct: 984  KPVPKSGGHFSSSRPLDRGPHGFGVDGAPKHLDKGSHGLNYDSGLNVEPLGGSAPPRFFP 1043

Query: 1196 PYQ----------PGSLRPLDLCDDNMDRRVDIAAGVPPDFLRSASGRNR--IDGFPLRS 1053
            P             GSL       DN+  R D A   P        G +   +D    RS
Sbjct: 1044 PIHHDRTLHRSEAEGSLG----FHDNLAGRTDFARTRPGLLGPPMPGYDHRDMDNLAPRS 1099

Query: 1052 PGREYPSHPSSRFRRL---EDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRK 882
            PGR+YP     RF  L   +D DGR     S+   S        + H++RFP+ PSHLR+
Sbjct: 1100 PGRDYPGMSMQRFGALPGLDDIDGRAPQRSSDPITS--------SLHDSRFPLFPSHLRR 1151

Query: 881  GESDGSGSLP-ARLRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAK 705
            GE +G G+        GDL+G +  P  L+ GE +G RN P+HL  G+  GFG F   A+
Sbjct: 1152 GELNGPGNFHMGEHLSGDLMGHDGWPAHLRRGERLGPRNPPSHLRLGERGGFGSFPGHAR 1211

Query: 704  LSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDL 525
            + +                 GNL  +  G E GF S F        GG + AGD++  + 
Sbjct: 1212 MGELAGP-------------GNLYHQQLG-EPGFRSSF--------GGSY-AGDLQYSEN 1248

Query: 524  SRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDH 345
            SRKRK  SMGWCRICKVDCET EGLD+HSQTREHQKMAMDMV++IK+ NVKK K +  DH
Sbjct: 1249 SRKRK-SSMGWCRICKVDCETFEGLDLHSQTREHQKMAMDMVVTIKQ-NVKKHKSAPSDH 1306

Query: 344  KSHEDGSKSSKAGFESRDN 288
             S ED SK   A FE R N
Sbjct: 1307 SSLEDTSKLRNASFEGRGN 1325


>ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris]
            gi|561004393|gb|ESW03387.1| hypothetical protein
            PHAVU_011G009900g [Phaseolus vulgaris]
          Length = 1314

 Score =  219 bits (559), Expect = 2e-54
 Identities = 167/489 (34%), Positives = 218/489 (44%), Gaps = 44/489 (8%)
 Frame = -1

Query: 1622 GSAE--HVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSI----PFGLQEERYKQMPDER 1461
            GSA   H       N PP   K        L  GFQ  S+    PF    E   +     
Sbjct: 877  GSAHDPHTGHASAENFPPTMFKQPQDSDITLGRGFQPQSLGPPQPFNQVHEPPFRAGTSN 936

Query: 1460 FKRLPEEGFNM-LPGD----------------------RFKPFLIEPGRHIIARREFEED 1350
            F RL    F   LPGD                      RFKPFL+   +  + RRE+++D
Sbjct: 937  FSRLGGPQFGAPLPGDMHGRMAANLPPHGTEGLGLHDERFKPFLVS-NQQTMDRREYDDD 995

Query: 1349 LKQFPRSAQLDSEGVPKFESYF---SRPDRASHGFNHDVGLKLDGNDNAPRLLPPYQPGS 1179
            LK+F R   +D+E + K+ +Y        + S G + DV +K  G+   P  L P  PG 
Sbjct: 996  LKKFSR-LPMDAESISKYGNYSLSAHESGKRSVGIHDDV-IKKSGSALHPGYLGP-GPGY 1052

Query: 1178 LRPLDLCDDNMDRRVDIAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHPSSRF----- 1014
                                          GR+ +DG   RSP  EY    S R      
Sbjct: 1053 ------------------------------GRHHMDGMTPRSPVGEYAEMSSRRLGPHSG 1082

Query: 1013 -----RRLEDSDGRELHVFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGS--L 855
                   ++D DGR    F            G  F ++RFP LPSHL + E DG G+  +
Sbjct: 1083 SLIGKSGIDDFDGRVPRHF------------GGEFRDSRFPHLPSHLHRDEFDGFGNFRI 1130

Query: 854  PARLRGGDLIGSNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXX 675
                R GD IG +   G  + GEP+G  N P HL  G+  GFG      +  +       
Sbjct: 1131 GEHPRSGDFIGQDEYAGHFRRGEPLGPHNFPRHLQLGEPVGFGAHPGHMRAVEHGSFRSF 1190

Query: 674  XXXXXXXSFGGNLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMG 495
                      G+ P   +  E GF S F + G+ ND GF   GD+ SFD  R+RK+ SMG
Sbjct: 1191 ESFAK-----GSRPGHPQLGEPGFRSSFSLPGFPNDAGFLT-GDIRSFDNLRRRKVSSMG 1244

Query: 494  WCRICKVDCETVEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSS 315
            WCRICK DCETVEGLD+HSQT+EHQKMAMDMV +IK+ N KKQK+   +  + ++G+K+ 
Sbjct: 1245 WCRICKADCETVEGLDLHSQTKEHQKMAMDMVKTIKQ-NAKKQKLIPSEQPTVDEGNKTH 1303

Query: 314  KAGFESRDN 288
              GFE R N
Sbjct: 1304 NTGFEGRGN 1312


>gb|EXB30469.1| hypothetical protein L484_006018 [Morus notabilis]
          Length = 1320

 Score =  214 bits (546), Expect = 7e-53
 Identities = 177/475 (37%), Positives = 220/475 (46%), Gaps = 20/475 (4%)
 Frame = -1

Query: 1652 DGRQPDSHLPGSAEHVLFGQPSNMPPNAMKLNGGPGKRLASGFQDPSIPFGLQEERYKQM 1473
            D R PD H  GS EH    Q   + PN  ++N   G        D     G ++ER    
Sbjct: 924  DSRGPDPHFAGSLEHGAHSQSFGIHPNMTRMNDSHGF-------DSLSTLGPRDER---- 972

Query: 1472 PDERFKRLPEEGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFE 1293
                                F PF   P      R EFE+DLKQFPR             
Sbjct: 973  --------------------FNPFPAGPN----PRAEFEDDLKQFPRPF----------- 997

Query: 1292 SYFSRPDRASHGFNHDVGLKLD-GNDNAP-RLLPPYQPGSLRPLDLCDDNMDR----RVD 1131
                  DR  HG  +  GLK+D G  + P R L PY  G        +D  DR    R D
Sbjct: 998  ------DRGLHGLKYHTGLKMDSGVGSVPSRSLSPYNGGG------ANDGGDRLGWHRGD 1045

Query: 1130 IAAGVPP-----DFLRSASG--RNRIDGFPLRSPGREYPSHPSSRF--RRLEDSDGRELH 978
                + P     DFL    G  R R+D    RSP RE+P      F     +D  GREL 
Sbjct: 1046 AFGRMDPTRGHLDFLGPGLGYDRRRMDSLASRSPIREHPGISLRGFVGPGPDDIHGRELR 1105

Query: 977  VFSEQSKSFNLPSEGNAFHENRFPILPSHLRKGESDGSGSLPA--RLRGGDLIGSNVPPG 804
             F E   S        +FHE+RF +LP HLR+GE +G  ++     LR  DLIG +   G
Sbjct: 1106 RFGEPFDS--------SFHESRFSMLPGHLRRGEFEGPRNMGMGDHLRN-DLIGRDGLSG 1156

Query: 803  RLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFG-GNLPSR 627
             L+ GE +G  +   H H G+  GFG     A++ +               FG G+ PS 
Sbjct: 1157 PLRWGEHMG--DFHGHFHLGEPVGFGAHSRHARIREIGGPGSFDS------FGRGDGPSF 1208

Query: 626  ARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLD 447
                E GF S F  HG+    G F   +  +FD SRKRKL +MGWCRICKVDCETVEGL+
Sbjct: 1209 PHLGEPGFRSRFSSHGFPTGDGIFT--EDLAFDKSRKRKLPTMGWCRICKVDCETVEGLE 1266

Query: 446  MHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFE--SRDN 288
            +HSQTREHQKMAMDMV++IK+ N KKQK++  D  S  D S+   AG E   +DN
Sbjct: 1267 LHSQTREHQKMAMDMVVAIKQ-NAKKQKLTFGDQSSLGDASQPRSAGTEGHGKDN 1320


>ref|XP_004506322.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            12-like isoform X1 [Cicer arietinum]
            gi|502146144|ref|XP_004506323.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 12-like isoform
            X2 [Cicer arietinum] gi|502146146|ref|XP_004506324.1|
            PREDICTED: mediator of RNA polymerase II transcription
            subunit 12-like isoform X3 [Cicer arietinum]
          Length = 1283

 Score =  210 bits (535), Expect = 1e-51
 Identities = 154/399 (38%), Positives = 197/399 (49%), Gaps = 14/399 (3%)
 Frame = -1

Query: 1442 EGFNMLPGDRFKPFLIEPGRHIIARREFEEDLKQFPRSAQLDSEGVPKFESYFSRPDRAS 1263
            EGF +   +RFK F     +H I RREFE DLK+FPR    D+E  PKF +Y   P    
Sbjct: 936  EGFGV-QDERFKSF-----QHNIDRREFENDLKKFPRHP-FDAEPGPKFGNYQLGP---- 984

Query: 1262 HGFNHDVGLKLDG-NDNAPRLLPPYQPGS-LRPLDLCDDNMDRRVDIAAGVPPDFLRSAS 1089
                H+ G +  G +D+A +     +PGS L P  L             G  P +     
Sbjct: 985  ----HETGKRPVGYHDDAIK-----KPGSTLHPGHL-------------GPGPGY----- 1017

Query: 1088 GRNRIDGFPLRSPGREYPSHPSSRFRRL----------EDSDGRELHVFSEQSKSFNLPS 939
            G + +DG   RSPG EY   PS R   L          +D DGR    + +        S
Sbjct: 1018 GIHHMDGIAPRSPGSEYIDMPSRRSGPLSGGLVSKSGIDDFDGRTASRYGD--------S 1069

Query: 938  EGNAFHENRFPILPSHLRKGESDGSGS--LPARLRGGDLIGSNVPPGRLQSGEPIGHRNL 765
             G AF + RFP  PSHL +   DG G+  +    R G+ IG +   G  Q GE +G  N 
Sbjct: 1070 VGIAFRDGRFPHQPSHLHRDAFDGFGNFRMGEHPRRGNFIGRDEFSGHFQRGEHLGPHNF 1129

Query: 764  PNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGGNLPSRARGAESGFSSGFPI 585
            P HL  G+   FG      +  +                 GN P   +  E GF S F +
Sbjct: 1130 PRHLQLGERISFGDHPGHMRAFELGSSRSFESFSK-----GNRPGHPQLGEPGFRSSFSL 1184

Query: 584  HGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCETVEGLDMHSQTREHQKMAMD 405
             G+ ND GF   GD+ SFD  R+RK  SMGWCRICKVDCETVEGL++HSQTREHQKMA+D
Sbjct: 1185 AGFNNDAGFLT-GDIRSFDNLRRRKAASMGWCRICKVDCETVEGLELHSQTREHQKMAVD 1243

Query: 404  MVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESRDN 288
            +V +IK+ N KKQK+   +  S EDG ++   GFE   N
Sbjct: 1244 IVKTIKQ-NAKKQKLIPSEQSSVEDGKQTWGTGFEGHGN 1281


>ref|XP_004246977.1| PREDICTED: uncharacterized protein LOC101249008 [Solanum
            lycopersicum]
          Length = 1353

 Score =  207 bits (527), Expect = 1e-50
 Identities = 148/356 (41%), Positives = 179/356 (50%), Gaps = 26/356 (7%)
 Frame = -1

Query: 1283 SRP-DRASHGFNHDVGLKLDGNDNAP--RLLPPYQP-GSLR---------PLDLCDDNMD 1143
            SRP D+  HG  +D G K + +   P  RLLPP+ P GS+          PL   DD+  
Sbjct: 1009 SRPHDKPPHGLGYDSGSKFEASTGVPPNRLLPPHHPPGSMHFKDSGEREAPLGPHDDDRK 1068

Query: 1142 RRVDIAAGVPPDFLRSASGRNRIDGFPLRSPGREYPSHPSSRFRRLEDSDGRELHVFSEQ 963
            R     +G     L   S RN  DG     P R + SH        +D+ GRE   F E 
Sbjct: 1069 RG---GSGFGVHHLDYLSARNP-DGELFNIPQRGFVSHSG-----FDDTGGREPRQFIEG 1119

Query: 962  SKSFNLPSE--GNAFHENRFPILPSHLRKGESDGSGSLPA-----------RLRGGDLIG 822
               FNLPS   G  +  +RF  LP H    E+DG G L              ++ GDL G
Sbjct: 1120 PGHFNLPSNLAGGLYSNSRFQALPGHPHGVETDGLGDLRGGEHTTFGRPYKHVQSGDLFG 1179

Query: 821  SNVPPGRLQSGEPIGHRNLPNHLHRGKIAGFGGFHNRAKLSDAXXXXXXXXXXXXXSFGG 642
             ++P   L   E +    LP+HL   K  GFG F  RA + +                G 
Sbjct: 1180 KDMP-SHLHHDESLDPPKLPSHLRFDKPGGFGSFAGRAYMGELSGFGDIPGFDESV--GR 1236

Query: 641  NLPSRARGAESGFSSGFPIHGYQNDGGFFNAGDVESFDLSRKRKLGSMGWCRICKVDCET 462
            N P   +  E GF S +P+ GY N G +  AGDV+SFD  RKRK  SMGWCRICKVDCET
Sbjct: 1237 NKPGMPQFGEPGFRSRYPVPGYPNHGLY--AGDVDSFDRPRKRKPTSMGWCRICKVDCET 1294

Query: 461  VEGLDMHSQTREHQKMAMDMVLSIKKDNVKKQKVSSDDHKSHEDGSKSSKAGFESR 294
            VEGLDMHSQTREHQ MAMDMV SIK+ N  KQK  S D  S E+  ++ KA FESR
Sbjct: 1295 VEGLDMHSQTREHQDMAMDMVRSIKEQNRMKQKTFS-DRPSVEEKGRTRKAVFESR 1349

