BLASTX nr result

ID: Akebia25_contig00026140 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00026140
         (1413 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002528710.1| DNA binding protein, putative [Ricinus commu...   298   3e-78
ref|XP_006374055.1| hypothetical protein POPTR_0016s14630g [Popu...   290   1e-75
ref|XP_007033485.1| Uncharacterized protein isoform 3 [Theobroma...   287   7e-75
ref|XP_007033484.1| Uncharacterized protein isoform 2, partial [...   287   7e-75
ref|XP_006481235.1| PREDICTED: putative GPI-anchored protein PB1...   287   9e-75
gb|EXB74412.1| hypothetical protein L484_004233 [Morus notabilis]     286   2e-74
ref|XP_007208445.1| hypothetical protein PRUPE_ppa003389mg [Prun...   284   8e-74
gb|EYU42049.1| hypothetical protein MIMGU_mgv1a003860mg [Mimulus...   278   5e-72
ref|XP_006353835.1| PREDICTED: putative GPI-anchored protein PB1...   278   5e-72
ref|XP_007033483.1| Uncharacterized protein isoform 1 [Theobroma...   276   2e-71
ref|XP_004302521.1| PREDICTED: uncharacterized protein LOC101300...   270   9e-70
ref|XP_002266425.1| PREDICTED: uncharacterized protein LOC100267...   269   2e-69
ref|XP_004241080.1| PREDICTED: uncharacterized protein LOC101244...   267   7e-69
ref|XP_004170022.1| PREDICTED: uncharacterized protein LOC101225...   263   1e-67
ref|XP_004142729.1| PREDICTED: uncharacterized protein LOC101206...   263   1e-67
ref|XP_006429633.1| hypothetical protein CICLE_v100114702mg, par...   260   1e-66
ref|XP_006381258.1| hypothetical protein POPTR_0006s11130g [Popu...   259   3e-66
ref|XP_007033486.1| Uncharacterized protein isoform 4 [Theobroma...   255   3e-65
ref|XP_007140052.1| hypothetical protein PHAVU_008G080400g [Phas...   231   6e-58
ref|XP_006602722.1| PREDICTED: flocculation protein FLO11-like [...   231   8e-58

>ref|XP_002528710.1| DNA binding protein, putative [Ricinus communis]
            gi|223531882|gb|EEF33699.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 580

 Score =  298 bits (764), Expect = 3e-78
 Identities = 193/475 (40%), Positives = 245/475 (51%), Gaps = 6/475 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPLF + D S+SQPT+ APRS S  RSVSTT  SRLSV+Q E+ +         
Sbjct: 110  LLTPPGTPLFPTSDGSDSQPTLVAPRSRSLSRSVSTTKASRLSVSQSESQHSSRPTRSSS 169

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       A V+S +RPS+P                     
Sbjct: 170  VTRSSISNSQYSTYSSNRSSSILNTSSASVSSYTRPSSP--ITRSPSTARPSTPSSRPTA 227

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                              +K R SQ++RPSTP+SR+Q+PAN                   
Sbjct: 228  SRASTPSRVRPAPTSSLVDKNRQSQSSRPSTPSSRAQLPANSNSTSTRSNSRPSTPTQRN 287

Query: 543  XXXDPPLVTSRS---GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRT 713
                    +  S   GR+ +NGR                   QP+VPPDFPLDTPPNLRT
Sbjct: 288  PVSSVSPASGPSISAGRVPSNGRISAPASRPSSPGPRIRPSQQPVVPPDFPLDTPPNLRT 347

Query: 714  TLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANG 893
            TLPDRP+SAGRSRP A+ T++G+ E  G  N+ RRH SSPIV+RGRL E  G+GR H+NG
Sbjct: 348  TLPDRPISAGRSRPGASTTIKGSPETTGATNVPRRH-SSPIVSRGRLAEAPGKGRAHSNG 406

Query: 894  HHGGATENQKIKSFSDPAMRKHAKPA-TATESIGLGRTSSKKSLEI-VNRMDIRHGSSGI 1067
            H    +E +K+   SDP MRK  K + T T++ G GRT SKKSL++ +  MDIR G+   
Sbjct: 407  HAADISEPRKVSHVSDPGMRKPVKSSVTTTDNNGFGRTISKKSLDMAIRHMDIRTGNGST 466

Query: 1068 LPMSGTTRFPQSIRSG-MLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGRSRT 1244
              +S TT FPQSIR+    Q     +  ++                   G+   +GR   
Sbjct: 467  RALSSTTLFPQSIRTATKAQSVRPMNAPESNNNGGILENGHHVSRPVEYGSEVNDGRY-- 524

Query: 1245 EDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
                   SAK+ E+DIYESSRYDA+LLKEDLK+TNWLH I++KSDQG IFD+ FE
Sbjct: 525  -------SAKLSEVDIYESSRYDALLLKEDLKNTNWLHSIDDKSDQGSIFDNGFE 572


>ref|XP_006374055.1| hypothetical protein POPTR_0016s14630g [Populus trichocarpa]
            gi|550321518|gb|ERP51852.1| hypothetical protein
            POPTR_0016s14630g [Populus trichocarpa]
          Length = 596

 Score =  290 bits (741), Expect = 1e-75
 Identities = 199/477 (41%), Positives = 244/477 (51%), Gaps = 7/477 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNP-SRLSVTQPENNYXXXXXXXX 179
            LLTPPGTPLF S + SESQPT+ APRS+S  RS STT   S LSV+Q E+ +        
Sbjct: 117  LLTPPGTPLFPSSEGSESQPTLVAPRSSSLARSASTTKAASTLSVSQSESYHSSRPARSS 176

Query: 180  XXXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXX 359
                                        A V+S +RPS+P                    
Sbjct: 177  SVTRPSISSSQYSTYSSNRSSSILNTSSASVSSYTRPSSPVSRTPSIARPSTPSARPTPS 236

Query: 360  XXXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXX 539
                               +KTR SQN+RPSTP+SR QIPANL                 
Sbjct: 237  RSSTPSRARPAPTSSSI--DKTRPSQNSRPSTPSSRGQIPANLSTAPTRSNSRPSTPTRR 294

Query: 540  XXXXDPPLVTSRS---GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLR 710
                     +S S   GR+++N R                   QP+VPPDFPLDTPPNLR
Sbjct: 295  NPAPSSSTASSPSTSAGRVLSNNR-IPGPTSRPNSPSPRVRPQQPVVPPDFPLDTPPNLR 353

Query: 711  TTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHAN 890
            TTLPDRPLSAGRSRP+   T++GN E  G     RRH SSPIV+RGRL E SG+GR+H+N
Sbjct: 354  TTLPDRPLSAGRSRPNVHATMKGNPETVGSVIAPRRH-SSPIVSRGRLTEPSGKGRVHSN 412

Query: 891  GHHGGATENQKIKSFSDPAMRKHAK-PATATESIGLGRTSSKKSLEI-VNRMDIRHGSSG 1064
            GH   A E +K+   S+  MRK  K  +TA+ES G GRT SKKSL++ +  MD+R+G+  
Sbjct: 413  GHIADAPEPRKVSHVSELGMRKPVKSSSTASESTGFGRTISKKSLDMAIRHMDLRNGTGS 472

Query: 1065 ILPMSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGRSRT 1244
               +S TT FPQSIRS   +   A   S                      +R    R   
Sbjct: 473  TRSLSSTTLFPQSIRSATPKTHSARSRSAPESINNGNLQNGDVLENESYFSRATEIRREA 532

Query: 1245 EDDNCYPSAKVCE-LDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFEP 1412
             D   Y SAK+ E +DI ESSRYDAILLKEDLK+T+WLH I++KSDQG  FD+ FEP
Sbjct: 533  NDGQRY-SAKLSEVVDICESSRYDAILLKEDLKNTDWLHGIDDKSDQGPFFDNGFEP 588


>ref|XP_007033485.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508712514|gb|EOY04411.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 578

 Score =  287 bits (735), Expect = 7e-75
 Identities = 193/480 (40%), Positives = 247/480 (51%), Gaps = 10/480 (2%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPLF S + SESQ T  APRSNS VRSVSTT  SRLSV+Q E+N+         
Sbjct: 101  LLTPPGTPLFPSSEGSESQSTSLAPRSNSKVRSVSTTKTSRLSVSQSESNHSTRPTRSSS 160

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       + V+S +RPS+P                     
Sbjct: 161  VTRPSLSSSYSTYSSNRGPSILNTSSVS-VSSYTRPSSPITRSRPSTPSARSTPSRASTP 219

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                              + +R      PSTP+SR QIPANL                  
Sbjct: 220  SKVRPSSTSSYIDKSRPSQSSR------PSTPSSRPQIPANLNSTAVRSNSRPSTPTRRN 273

Query: 543  XXXDPPL------VTSRSGRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPN 704
                P L       +  +GR ++NGR+                  QP+VPPDFPLDTPPN
Sbjct: 274  PI--PSLSSAAAGASPSAGRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPN 331

Query: 705  LRTTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLH 884
            LRTTLPDRP+SAGRSRP  +V ++ N +     N+ RRH SSPIV RGRL E  GR R+H
Sbjct: 332  LRTTLPDRPVSAGRSRPGVSVGMKANQDTTSSVNMPRRH-SSPIVTRGRLTEPPGRTRVH 390

Query: 885  ANGHHGGATENQKIKSFSDPAMRKHAKPATAT-ESIGLGRTSSKKSLEI-VNRMDIRHGS 1058
            +NGH     E++K    +D AMRK  K +T T +S G GRT SKKSL++ +  MDIR+G+
Sbjct: 391  SNGHASDIHESRKTSHVNDSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMDIRNGT 450

Query: 1059 SGILPMSGTTRFPQSIRSGMLQGQ--LAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNG 1232
              I  +SGTT FPQSIRS   + Q   +   SD++              +    + +   
Sbjct: 451  GSIRSLSGTTLFPQSIRSATTRTQSLRSFSTSDSVNSNGSPGSLQNGDFSENGNSISRPV 510

Query: 1233 RSRTEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFEP 1412
            ++ ++  +   SAK  E+DIYESSRYDAILLKEDLK+TNWLH I++KSD G IF++ FEP
Sbjct: 511  QNGSDSHDGRYSAKFSEVDIYESSRYDAILLKEDLKNTNWLHSIDDKSDPGSIFENGFEP 570


>ref|XP_007033484.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508712513|gb|EOY04410.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 522

 Score =  287 bits (735), Expect = 7e-75
 Identities = 193/480 (40%), Positives = 247/480 (51%), Gaps = 10/480 (2%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPLF S + SESQ T  APRSNS VRSVSTT  SRLSV+Q E+N+         
Sbjct: 45   LLTPPGTPLFPSSEGSESQSTSLAPRSNSKVRSVSTTKTSRLSVSQSESNHSTRPTRSSS 104

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       + V+S +RPS+P                     
Sbjct: 105  VTRPSLSSSYSTYSSNRGPSILNTSSVS-VSSYTRPSSPITRSRPSTPSARSTPSRASTP 163

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                              + +R      PSTP+SR QIPANL                  
Sbjct: 164  SKVRPSSTSSYIDKSRPSQSSR------PSTPSSRPQIPANLNSTAVRSNSRPSTPTRRN 217

Query: 543  XXXDPPL------VTSRSGRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPN 704
                P L       +  +GR ++NGR+                  QP+VPPDFPLDTPPN
Sbjct: 218  PI--PSLSSAAAGASPSAGRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPN 275

Query: 705  LRTTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLH 884
            LRTTLPDRP+SAGRSRP  +V ++ N +     N+ RRH SSPIV RGRL E  GR R+H
Sbjct: 276  LRTTLPDRPVSAGRSRPGVSVGMKANQDTTSSVNMPRRH-SSPIVTRGRLTEPPGRTRVH 334

Query: 885  ANGHHGGATENQKIKSFSDPAMRKHAKPATAT-ESIGLGRTSSKKSLEI-VNRMDIRHGS 1058
            +NGH     E++K    +D AMRK  K +T T +S G GRT SKKSL++ +  MDIR+G+
Sbjct: 335  SNGHASDIHESRKTSHVNDSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMDIRNGT 394

Query: 1059 SGILPMSGTTRFPQSIRSGMLQGQ--LAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNG 1232
              I  +SGTT FPQSIRS   + Q   +   SD++              +    + +   
Sbjct: 395  GSIRSLSGTTLFPQSIRSATTRTQSLRSFSTSDSVNSNGSPGSLQNGDFSENGNSISRPV 454

Query: 1233 RSRTEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFEP 1412
            ++ ++  +   SAK  E+DIYESSRYDAILLKEDLK+TNWLH I++KSD G IF++ FEP
Sbjct: 455  QNGSDSHDGRYSAKFSEVDIYESSRYDAILLKEDLKNTNWLHSIDDKSDPGSIFENGFEP 514


>ref|XP_006481235.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like isoform X1
            [Citrus sinensis] gi|568855282|ref|XP_006481236.1|
            PREDICTED: putative GPI-anchored protein PB15E9.01c-like
            isoform X2 [Citrus sinensis]
          Length = 582

 Score =  287 bits (734), Expect = 9e-75
 Identities = 200/483 (41%), Positives = 244/483 (50%), Gaps = 13/483 (2%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENN---YXXXXXX 173
            LLTPPGTPLF S D SESQ    APR +S  RSVST+  SRLSV+Q E+N   +      
Sbjct: 105  LLTPPGTPLFPSSDGSESQLNPVAPRISSLARSVSTSKASRLSVSQSESNHSVHPLRPAR 164

Query: 174  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXX 353
                                          A V+S +RP++P                  
Sbjct: 165  SSSVTRSSISASQYSTYSSNRSTSILNTSSASVSSYTRPASPSARSVSSARPSTPSARPT 224

Query: 354  XXXXXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANL--XXXXXXXXXXXXX 527
                                  +T  SQ +RPSTP+SR QIPANL               
Sbjct: 225  SSRSSTPSRTRPSLTSSSMDKTRT--SQTSRPSTPSSRPQIPANLNSSTARSSSRPSTPT 282

Query: 528  XXXXXXXXDPPLVTSRS-GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPN 704
                     P + +S S GR+++NGR+                  QPIVPPDFPLDTPPN
Sbjct: 283  RRNPITSTSPAMSSSTSAGRVMSNGRS-QGPASRPSSPSPRVRSQQPIVPPDFPLDTPPN 341

Query: 705  LRTTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLH 884
            LRTTLPDRPLSAGRSRP AA+T++ N EA G  N+ RRH SSP+V RGRL E  GR R  
Sbjct: 342  LRTTLPDRPLSAGRSRPGAALTMKSNPEATGSVNMPRRH-SSPVVTRGRLTEPPGRSRTP 400

Query: 885  ANGHHGGATENQKIKSFSDPAMRKHAKPA-TATESIGLGRTSSKKSLEI-VNRMDIRHGS 1058
            ANGH   A E ++    S+ + R+  K   TA++  G GRT SKKSL++ +  MDIR+G+
Sbjct: 401  ANGHTADAHEYRRTSHISEQSTRRPVKSTNTASDGTGFGRTISKKSLDMAIRHMDIRNGA 460

Query: 1059 SGILPMSGTTRFPQSIRSGMLQGQLAG-----HGSDALXXXXXXXXXXXXXXTTILGNRN 1223
              I  +SGTT FPQSIRS   + + A      H +  L                  GN  
Sbjct: 461  GSIRQLSGTTLFPQSIRSATSKTRSARALESVHTNGILKNRDISEKGNTYSGPAENGNDA 520

Query: 1224 CNGRSRTEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHR 1403
             +GR          SAK+ E DIYESSRYDAILLKEDLK+TNWLH  ++KSDQG IFD  
Sbjct: 521  HDGRY---------SAKLSEADIYESSRYDAILLKEDLKNTNWLHSFDDKSDQGAIFDTG 571

Query: 1404 FEP 1412
            FEP
Sbjct: 572  FEP 574


>gb|EXB74412.1| hypothetical protein L484_004233 [Morus notabilis]
          Length = 574

 Score =  286 bits (732), Expect = 2e-74
 Identities = 194/478 (40%), Positives = 242/478 (50%), Gaps = 9/478 (1%)
 Frame = +3

Query: 3    LLTPPGTP-LFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXX 179
            LLTPPGTP +F S + +E Q T++APRS+S  RS STT  SRLSV+Q E N+        
Sbjct: 94   LLTPPGTPTIFPSSEGNEPQRTIAAPRSSSLARSASTTKASRLSVSQSETNHSSRPTRSS 153

Query: 180  XXXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXX 359
                                        A V+S +RP++P                    
Sbjct: 154  SVTRSSTSTSLHNTYSSNRSSNILNTSSASVSSYTRPASP--ITRSSSTARPSTPSSRPT 211

Query: 360  XXXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXX 539
                               +++R  Q++RPSTP+SR QIPANL                 
Sbjct: 212  LSRPSTPSRAHPSPTSSSADRSRPIQSSRPSTPSSRPQIPANLSSPAARSNSRPSTPTRR 271

Query: 540  XXXXDPPLVTSRS---GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLR 710
                      S S   GR+++NGRN                  QP+VPPDFPLDTPPNLR
Sbjct: 272  SPVSTISPAASPSISNGRVLSNGRNPTSSSRPSSPSPRIRPPPQPVVPPDFPLDTPPNLR 331

Query: 711  TTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHAN 890
            TTLPDRPLSAGRSRP + VT++GNSE     N SRRH SSPIV RGRL E +GRGRL  N
Sbjct: 332  TTLPDRPLSAGRSRPGSTVTMKGNSETTTTANTSRRH-SSPIVTRGRLTEPAGRGRLQGN 390

Query: 891  GHHGGATENQKIKSFSDPAMRKHAKPATAT-ESIGLGRTSSKKSLEI-VNRMDIRHGSSG 1064
            GH+  A E +K     D  MRK  K + A+ ++ G GRT SKKSL++ +  MDIR G   
Sbjct: 391  GHYTDA-EPRKASHAPDLTMRKPVKASIASLDNGGFGRTISKKSLDMAIRHMDIRSGGGN 449

Query: 1065 ILPMSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGRSRT 1244
            + P++GTT FPQSIRS   + Q     S +                 I  N N   R   
Sbjct: 450  VRPLAGTTLFPQSIRSASSKTQ--SIRSSSAPSSIINGGLQTSYNGIISDNGNAIDRPAE 507

Query: 1245 ED---DNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
                 D     AK+ E+DIYESSRYD +LLKEDLK+TNWLH I++K+D G IFD+ FE
Sbjct: 508  NGIGADGGRHFAKLNEVDIYESSRYDTLLLKEDLKNTNWLHSIDDKTDHGPIFDNGFE 565


>ref|XP_007208445.1| hypothetical protein PRUPE_ppa003389mg [Prunus persica]
            gi|462404087|gb|EMJ09644.1| hypothetical protein
            PRUPE_ppa003389mg [Prunus persica]
          Length = 579

 Score =  284 bits (726), Expect = 8e-74
 Identities = 196/479 (40%), Positives = 241/479 (50%), Gaps = 10/479 (2%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPP TPLF S D SESQPT++APR NS  RS S + PSRLSV+Q E+N+         
Sbjct: 96   LLTPPETPLFPSSDGSESQPTLAAPR-NSLSRSGSASKPSRLSVSQSESNHPSRPARSSS 154

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       A V+S +RPS+P                     
Sbjct: 155  VTRSSTSASLYNNYSSNRNSNILNTSSASVSSYTRPSSPITRSPSTARPSTPTSRPSLSR 214

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTS-RSQIPANLXXXXXXXXXXXXXXXXX 539
                              EK R+ Q++RPSTP+S R QIPANL                 
Sbjct: 215  STTPSRPRTTSTSSSI--EKPRSVQSSRPSTPSSTRPQIPANLNSHASRPNSRPSTPTRR 272

Query: 540  XXXXDPPLVTSRS---GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLR 710
                     +S S   GR+++NGR+                  QP+VPPDFPLDTPPNLR
Sbjct: 273  SSLPSLSPASSPSPSAGRVLSNGRSSAPSSRPSSPSPRIRPPPQPVVPPDFPLDTPPNLR 332

Query: 711  TTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHAN 890
            TTLPDRP+SAGRSRP A V+++G  E P    + RR  SSPI +RGRL E  GRGR+H  
Sbjct: 333  TTLPDRPISAGRSRPGAVVSMKGKPEPPAAVVVPRR-QSSPIASRGRLTEPPGRGRVHPT 391

Query: 891  GHHGGATENQKIKSFSDPAMRKHAKPA--TATESIGLGRTSSKKSLEI-VNRMDIRHGSS 1061
            GH     E +K     D  MRK  K +  TATES G GR  SKKSL++ +  MDIR+G+ 
Sbjct: 392  GHLPDVPEPRKATLIPDLGMRKPVKTSTTTATESTGFGRNISKKSLDMAIRHMDIRNGTG 451

Query: 1062 GILPMSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGR-- 1235
                +SG+T FPQSIRS       +  G                    I  N N   R  
Sbjct: 452  NGRQLSGSTLFPQSIRSSSTPKPQSVRGLSVPASARTNGSLQTGSNGVISENGNIMNRPV 511

Query: 1236 -SRTEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
             + +E D+   SAK+ E D+YESSRYDAILLKEDLKSTNWLH +++K DQG IFD+ FE
Sbjct: 512  DNGSEADSGRYSAKLSEADVYESSRYDAILLKEDLKSTNWLHSLDDKLDQGPIFDNGFE 570


>gb|EYU42049.1| hypothetical protein MIMGU_mgv1a003860mg [Mimulus guttatus]
          Length = 559

 Score =  278 bits (710), Expect = 5e-72
 Identities = 192/477 (40%), Positives = 243/477 (50%), Gaps = 8/477 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPL  S + +ESQ  + APRS   VRS+ST   SRLSV+Q ENN+         
Sbjct: 96   LLTPPGTPLVPSSNVNESQTGLMAPRSGPLVRSISTAKASRLSVSQSENNHAAAKPTRSS 155

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       A V+S  RPSTP                     
Sbjct: 156  SVTRPSASSSQYNTYSNKSTSILNTSSASVSSYIRPSTP--TNRSSSISRPSTPSSRPTV 213

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                              ++ R SQN+RPSTPTSR QI +N+                  
Sbjct: 214  SRSTTPARPRPALSTSSTDRPRPSQNSRPSTPTSRPQISSNMTSPAARTTSRPSTPTRRN 273

Query: 543  XXXDPPLVTSRS---GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRT 713
                    +  S   GR +TNGR+                  QPIV  DFPLDTPPNLRT
Sbjct: 274  PTPSLSPTSGPSTPGGRSLTNGRSGASVSRPSSPGPRVRPPPQPIVLHDFPLDTPPNLRT 333

Query: 714  TLPDRPLSAGRSRPSAAVTVRGNSE-APGPPNLSRRHSSSPIVARGRLPELSGRGRLHAN 890
            TLPDRP+SAGRSRP  ++T +GN+E  PG   + RRH SSPIV RGR+ E +GRGR HAN
Sbjct: 334  TLPDRPVSAGRSRPGVSLTSKGNAEPTPGNAAVPRRH-SSPIVTRGRVAEPNGRGRTHAN 392

Query: 891  GHHGGATENQKIKSFSDPAMRKHAKPATATESIGLGRTSSKKSLEI-VNRMDIRHGSSGI 1067
            G    A +++K     +   RK AK   +T+S G GRT SKKSL++ +  MDIR+G++G 
Sbjct: 393  GQLPDAMDSRK-----ELPARKPAK--ISTDSTGFGRTISKKSLDMAIRHMDIRNGNNGF 445

Query: 1068 LPMSGTTRFPQSIRS---GMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGRS 1238
             P++G+  FPQSIRS      QG  + +GS ++                I  N +    S
Sbjct: 446  RPLTGSNLFPQSIRSTNQKTQQGAASPNGSLSINSNG-----------AIAENGHRFSES 494

Query: 1239 RTEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
             +E+D    SAK+  +DIYESSRYD ILLKEDLK+ NWLH I++KSDQG IFD+ FE
Sbjct: 495  GSEEDKYQYSAKLTNIDIYESSRYDMILLKEDLKNANWLHSIDDKSDQGSIFDNGFE 551


>ref|XP_006353835.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Solanum
            tuberosum]
          Length = 565

 Score =  278 bits (710), Expect = 5e-72
 Identities = 189/473 (39%), Positives = 239/473 (50%), Gaps = 4/473 (0%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPL  + D SES+P    PR +S  RS STT  SRLSV+Q E+N          
Sbjct: 94   LLTPPGTPLVPTSDGSESKPASVGPRGSSLGRSASTTKASRLSVSQSESN-TPARPTRSN 152

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       A V+S  RPSTP                     
Sbjct: 153  SVTRPSISSSQYCTYSNKSGSILNTSSASVSSYIRPSTPTSRSSSSARPSTPTSRATVSR 212

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                                +R +Q++RPSTPTSR QI  NL                  
Sbjct: 213  PSTPSKARQAP-------STSRPTQSSRPSTPTSRPQISGNLSTPSRPTSRPSTPTRRTI 265

Query: 543  XXXDPPLVTSRS--GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRTT 716
                 P   S +  GR +TNGR                   QPIVPPDF L+TPPNLRTT
Sbjct: 266  TPSLSPASRSSTPAGRPVTNGRTAASLSRPSSPSPQVRRPSQPIVPPDFSLETPPNLRTT 325

Query: 717  LPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANGH 896
            LPDRPLSAGRSRP+ +VT +GN+EAP   N   R  SSPIV+RGRL E SGRGR+  +G 
Sbjct: 326  LPDRPLSAGRSRPNPSVTTKGNAEAPSVAN--PRRQSSPIVSRGRLTEPSGRGRVLGSGQ 383

Query: 897  HGGATENQKIKSFSDPAMRKHAKPATATESIGLGRTSSKKSLEI-VNRMDIRHGSSGILP 1073
                +++++    S+ + RK  K  TA +++GLGRT SKKSL++ +  MDIR+G +G+ P
Sbjct: 384  LSDISDSRRASHVSELSTRKPVK--TAADNMGLGRTISKKSLDVAIRHMDIRNGINGVRP 441

Query: 1074 MSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGN-RNCNGRSRTED 1250
             SG+T FP SIRS   +GQ   HGS                     GN  N +  + +E+
Sbjct: 442  SSGSTLFPHSIRSTNGKGQ-PSHGSTGASSFNENASYHYNGNLPENGNYLNRSSENGSEE 500

Query: 1251 DNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
                 SAK+ ++DIYESSRYD +LLKEDLK+TNWLH I++KSDQ  IF + FE
Sbjct: 501  AKSQHSAKLTDIDIYESSRYDVLLLKEDLKNTNWLHSIDDKSDQETIFGNGFE 553


>ref|XP_007033483.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508712512|gb|EOY04409.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 651

 Score =  276 bits (706), Expect = 2e-71
 Identities = 193/504 (38%), Positives = 247/504 (49%), Gaps = 34/504 (6%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPLF S + SESQ T  APRSNS VRSVSTT  SRLSV+Q E+N+         
Sbjct: 150  LLTPPGTPLFPSSEGSESQSTSLAPRSNSKVRSVSTTKTSRLSVSQSESNHSTRPTRSSS 209

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       + V+S +RPS+P                     
Sbjct: 210  VTRPSLSSSYSTYSSNRGPSILNTSSVS-VSSYTRPSSPITRSRPSTPSARSTPSRASTP 268

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                              + +R      PSTP+SR QIPANL                  
Sbjct: 269  SKVRPSSTSSYIDKSRPSQSSR------PSTPSSRPQIPANLNSTAVRSNSRPSTPTRRN 322

Query: 543  XXXDPPL------VTSRSGRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPN 704
                P L       +  +GR ++NGR+                  QP+VPPDFPLDTPPN
Sbjct: 323  PI--PSLSSAAAGASPSAGRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPN 380

Query: 705  LRTTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLH 884
            LRTTLPDRP+SAGRSRP  +V ++ N +     N+ RRH SSPIV RGRL E  GR R+H
Sbjct: 381  LRTTLPDRPVSAGRSRPGVSVGMKANQDTTSSVNMPRRH-SSPIVTRGRLTEPPGRTRVH 439

Query: 885  ANGHHGGATENQKIKSFSDPAMRKHAKPATAT-ESIGLGRTSSKKSLEIVNR-------- 1037
            +NGH     E++K    +D AMRK  K +T T +S G GRT SKKSL++  R        
Sbjct: 440  SNGHASDIHESRKTSHVNDSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMSLELQW 499

Query: 1038 -----------------MDIRHGSSGILPMSGTTRFPQSIRSGMLQGQ--LAGHGSDALX 1160
                             +DIR+G+  I  +SGTT FPQSIRS   + Q   +   SD++ 
Sbjct: 500  HDQWLELVVIFYVEAYAVDIRNGTGSIRSLSGTTLFPQSIRSATTRTQSLRSFSTSDSVN 559

Query: 1161 XXXXXXXXXXXXXTTILGNRNCNGRSRTEDDNCYPSAKVCELDIYESSRYDAILLKEDLK 1340
                         +    + +   ++ ++  +   SAK  E+DIYESSRYDAILLKEDLK
Sbjct: 560  SNGSPGSLQNGDFSENGNSISRPVQNGSDSHDGRYSAKFSEVDIYESSRYDAILLKEDLK 619

Query: 1341 STNWLHDINEKSDQGLIFDHRFEP 1412
            +TNWLH I++KSD G IF++ FEP
Sbjct: 620  NTNWLHSIDDKSDPGSIFENGFEP 643


>ref|XP_004302521.1| PREDICTED: uncharacterized protein LOC101300547 [Fragaria vesca
            subsp. vesca]
          Length = 583

 Score =  270 bits (691), Expect = 9e-70
 Identities = 186/477 (38%), Positives = 240/477 (50%), Gaps = 8/477 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPP TPLF S D SESQPT++A R ++ +RS S+  PSRLSV+Q E+N+         
Sbjct: 102  LLTPPETPLFPSSDGSESQPTLAAARGSALIRSTSSAKPSRLSVSQSESNHSSRPARSSS 161

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       A V+S SRPS+P                     
Sbjct: 162  VTRSSISSSQYNNYSSNRNSNFLNTSSASVSSYSRPSSPITRSPSTARPSTPTSRPSLSR 221

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                              E+ R+  ++RPSTP+SR QIPANL                  
Sbjct: 222  PSTPSRARSVPASSSI--ERPRSVASSRPSTPSSRPQIPANLSSPAARTPSRPSTPTRRH 279

Query: 543  XXXD--PPLVTSRSGRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRTT 716
                  P    S S   ++NGRN                  QPIVP DFPLDTPPNLRTT
Sbjct: 280  SLPSLSPASSPSPSAGRLSNGRNPAPTSRPSSPSPRVRPPPQPIVPHDFPLDTPPNLRTT 339

Query: 717  LPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANGH 896
            LPDRP+SAGRSRP AAV V+G  E P    + RR SS P+V+RGRL +  GR R+ +NGH
Sbjct: 340  LPDRPISAGRSRPGAAVVVKGKLETPAAVVVPRRQSS-PVVSRGRLTDPPGRSRVLSNGH 398

Query: 897  HGGATENQKIKSFSDPAMRKHAKPA--TATESIGLGRTSSKKSLEI-VNRMDIRHGSSGI 1067
            H    E +K +   D  MRK  K +  TA E+ G GR  SKKSL++ +  MDI++G+   
Sbjct: 399  HD-VPELRKPQHLPDLGMRKPVKTSSTTAPENTGFGRNISKKSLDMAIRHMDIKNGTGNS 457

Query: 1068 LPMSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNG--RSR 1241
              +SG+T FPQSIRSG  + Q     S +                      N +   ++ 
Sbjct: 458  RQLSGSTLFPQSIRSGTPKTQTVRTLSSSASVNMNGGLQSRGNGFVYENGNNMSKPVQNG 517

Query: 1242 TE-DDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
            TE +     S+K+ ++DIYESSRYDAILLKEDLK+TNWLH +++K D+G IFD+ FE
Sbjct: 518  TEANGGGRYSSKLTDVDIYESSRYDAILLKEDLKNTNWLHSLDDKLDEGPIFDNGFE 574


>ref|XP_002266425.1| PREDICTED: uncharacterized protein LOC100267210 [Vitis vinifera]
            gi|147841364|emb|CAN71240.1| hypothetical protein
            VITISV_034160 [Vitis vinifera]
            gi|296085846|emb|CBI31170.3| unnamed protein product
            [Vitis vinifera]
          Length = 570

 Score =  269 bits (688), Expect = 2e-69
 Identities = 186/478 (38%), Positives = 238/478 (49%), Gaps = 9/478 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPLF S D +ESQPT+ APR NS++   ++T  +         +          
Sbjct: 94   LLTPPGTPLFPSSDGNESQPTMLAPR-NSNLARSASTTKASRLSVAQSESSYSRPTRSSS 152

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       A V+S +RPS+P                     
Sbjct: 153  VTRPSISTSQYSTYSSSRSSSILNTSSASVSSYTRPSSPITRSSSTARPSTPSARPTSSR 212

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                              +K R S N+RP+TP+SR Q+ ANL                  
Sbjct: 213  SSTPSRARPGLTSSSI--DKPRPSPNSRPTTPSSRPQLQANLSSPAARSNSRPSTPTRRT 270

Query: 543  XXXD--PPLVTSRS-GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRT 713
                  P    S S  R ++NGRN                  QPIV PDFPLDTPPNLRT
Sbjct: 271  PAASLSPTAGPSPSTARAMSNGRNPAPASRPSSPSPRVRNPPQPIVLPDFPLDTPPNLRT 330

Query: 714  TLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANG 893
            TLPDRPLSAGRSRP AA+T++GNSE P       R  SSPIV RGR+ E + RGRLH+NG
Sbjct: 331  TLPDRPLSAGRSRPGAAMTMKGNSETP------TRRQSSPIVTRGRVSEPNARGRLHSNG 384

Query: 894  HHGGATENQKIKSFSDPAMRKHAKPATATESIGLGRTSSKKSLEI-VNRMDIRHGSSGIL 1070
            H   + E++K    ++P+ +      T++ES G GRT SKKSL++ +  MDIR+G+  I 
Sbjct: 385  HVADSPESRKASHVTEPSRKPVKTSTTSSESTGFGRTISKKSLDMAIRHMDIRNGTGSIR 444

Query: 1071 PMSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGRSR--- 1241
            P+SGTT FPQSIRS   + Q A   S +                  + + N N  +R   
Sbjct: 445  PLSGTTLFPQSIRSAASKTQSARASSASSAPASVNSNGSLPASNNGVPSENGNYFTRPSE 504

Query: 1242 --TEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
               E+D+   SAK+ + DIYESSRYDAILLKEDLK+TNWLH + +KSDQG IFD+ FE
Sbjct: 505  NGAEEDDGRFSAKLNQTDIYESSRYDAILLKEDLKNTNWLHSV-DKSDQGPIFDNGFE 561


>ref|XP_004241080.1| PREDICTED: uncharacterized protein LOC101244776 [Solanum
            lycopersicum]
          Length = 564

 Score =  267 bits (683), Expect = 7e-69
 Identities = 186/473 (39%), Positives = 236/473 (49%), Gaps = 4/473 (0%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPENNYXXXXXXXXX 182
            LLTPPGTPL  + D SES+P    PR +S  RS STT  SRLSV+  E+N          
Sbjct: 94   LLTPPGTPLVPTSDGSESKPASVGPRGSSLGRSSSTTKASRLSVSHSESN-TPARPTRSN 152

Query: 183  XXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXXX 362
                                       A V+S  RPSTP                     
Sbjct: 153  SVTRPSISSSQYSTYSNKSGSILNTSSASVSSYIRPSTPTRRSSSSARPSTPTSRATVSR 212

Query: 363  XXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXX 542
                                +R +Q++RPSTPTSR QI  NL                  
Sbjct: 213  PSTPSKAGQAP-------STSRPTQSSRPSTPTSRPQISGNLNTPSRPTSRPSTPTRRTI 265

Query: 543  XXXDPPLVTSRS--GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRTT 716
                 P   S +  GR +TNGR                   QPIVPPDF L+TPPNLRTT
Sbjct: 266  TASLSPASRSSTPAGRPVTNGRTAASLSRPSSPSPQVRRPSQPIVPPDFSLETPPNLRTT 325

Query: 717  LPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANGH 896
            LPDRPLSAGRSRP+ +VT +GN+E P   N   R  SSPIV+RGRL E +GRGR   +G 
Sbjct: 326  LPDRPLSAGRSRPNPSVTTKGNAETPSVAN--PRRQSSPIVSRGRLTEPAGRGRALGSGQ 383

Query: 897  HGGATENQKIKSFSDPAMRKHAKPATATESIGLGRTSSKKSLEI-VNRMDIRHGSSGILP 1073
                +++++    SD + RK  K  TA +++GLGRT SKKSL++ +  MDIR+G +G+ P
Sbjct: 384  LSDISDSRRASHVSDLSTRKPVK--TAADNMGLGRTISKKSLDVAIRHMDIRNG-NGVRP 440

Query: 1074 MSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGN-RNCNGRSRTED 1250
             SG+T FP SIRS   +GQ   HGS                     GN  N +  + +E+
Sbjct: 441  TSGSTLFPHSIRSTNGKGQ-PSHGSTGASSFNENASYHYNGNLPENGNYLNRSSENGSEE 499

Query: 1251 DNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
                 SAK+ ++DIYESSRYD +LLKED+K+TNWLH I++KSDQ  IF + FE
Sbjct: 500  AKPQHSAKLTDIDIYESSRYDKLLLKEDMKNTNWLHSIDDKSDQETIFGNGFE 552


>ref|XP_004170022.1| PREDICTED: uncharacterized protein LOC101225804, partial [Cucumis
            sativus]
          Length = 484

 Score =  263 bits (672), Expect = 1e-67
 Identities = 188/478 (39%), Positives = 234/478 (48%), Gaps = 9/478 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPE-NNYXXXXXXXX 179
            LLTPPGTPLF S   SE Q TV+APRS++ VRS STT  SRLSV+Q E NN         
Sbjct: 1    LLTPPGTPLFPSSSESEIQSTVAAPRSSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSS 60

Query: 180  XXXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXX 359
                                        A V+S  RPS+P                    
Sbjct: 61   VSRSSVSTPQYSNYSSNRSASSILNTSSASVSSYIRPSSPSTRSASSARPSTPSSRSTPS 120

Query: 360  XXXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXX 539
                               EK R  Q++RPSTP SR QIPANL                 
Sbjct: 121  RSSTPSRARPSPNSPSI--EKPRPLQSSRPSTPNSRPQIPANLSSPAARSNSRPSTPTRR 178

Query: 540  XXXXDPPLV----TSRSGRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNL 707
                    V    +S S  L TNGR+                  QPIVPPDFPLDTPPNL
Sbjct: 179  NSAPSLSSVVGTPSSTSRVLSTNGRSSTSTSRPSSPSPRVRAAPQPIVPPDFPLDTPPNL 238

Query: 708  RTTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHA 887
            RTTLPDRP+SAGRSRP+ A +VRG+ E      + RR ++SP + RGR+ +  GRGRL+ 
Sbjct: 239  RTTLPDRPISAGRSRPTPASSVRGSPETTSTGTVPRR-AASPTITRGRITDAPGRGRLNT 297

Query: 888  NGHHGGATENQKIKSFSDPAMRKHAKPATAT-ESIGLGRTSSKKSLEI-VNRMDIRHGSS 1061
            NGH   + E +++ S SD + R+  K +T T ES G GR+ SKKSL++ +  MDIR+G  
Sbjct: 298  NGHLSDSPETRRLSSSSDLSGRRPVKASTTTAESNGFGRSISKKSLDMAIRHMDIRNGPG 357

Query: 1062 GILPMSGTTRFPQSIRSGMLQGQ-LAGHGSDALXXXXXXXXXXXXXXTTILGNRNCN-GR 1235
             +   SG T FP SIRS   + Q +A   S+A+                     +   G 
Sbjct: 358  SVRSGSGNTLFPHSIRSATSKTQSIALSNSEAIDTDYQMSSNNNMDRGNHFHRPSATIGT 417

Query: 1236 SRTEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
                 +N   SA +  LDIYESSRYDAILLKEDLK+TNWLH  ++K+D   I D+ FE
Sbjct: 418  EVGGGENGRFSASLNHLDIYESSRYDAILLKEDLKNTNWLHSTDDKTDLASILDNGFE 475


>ref|XP_004142729.1| PREDICTED: uncharacterized protein LOC101206216 [Cucumis sativus]
          Length = 578

 Score =  263 bits (672), Expect = 1e-67
 Identities = 188/478 (39%), Positives = 234/478 (48%), Gaps = 9/478 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNPSRLSVTQPE-NNYXXXXXXXX 179
            LLTPPGTPLF S   SE Q TV+APRS++ VRS STT  SRLSV+Q E NN         
Sbjct: 95   LLTPPGTPLFPSSSESEIQSTVAAPRSSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSS 154

Query: 180  XXXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXX 359
                                        A V+S  RPS+P                    
Sbjct: 155  VSRSSVSTPQYSSYSSNRSASSILNTSSASVSSYIRPSSPSTRSASSARPSTPSSRSTPS 214

Query: 360  XXXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXX 539
                               EK R  Q++RPSTP SR QIPANL                 
Sbjct: 215  RSSTPSRARPSPNSPSI--EKPRPLQSSRPSTPNSRPQIPANLSSPAARSNSRPSTPTRR 272

Query: 540  XXXXDPPLV----TSRSGRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNL 707
                    V    +S S  L TNGR+                  QPIVPPDFPLDTPPNL
Sbjct: 273  NSAPSLSSVVGTPSSTSRVLSTNGRSSTSTSRPSSPSPRVRAAPQPIVPPDFPLDTPPNL 332

Query: 708  RTTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHA 887
            RTTLPDRP+SAGRSRP+ A +VRG+ E      + RR ++SP + RGR+ +  GRGRL+ 
Sbjct: 333  RTTLPDRPISAGRSRPTPASSVRGSPETTSTGTVPRR-AASPTITRGRITDAPGRGRLNT 391

Query: 888  NGHHGGATENQKIKSFSDPAMRKHAKPATAT-ESIGLGRTSSKKSLEI-VNRMDIRHGSS 1061
            NGH   + E +++ S SD + R+  K +T T ES G GR+ SKKSL++ +  MDIR+G  
Sbjct: 392  NGHLSDSPETRRLSSSSDLSGRRPVKASTTTAESNGFGRSISKKSLDMAIRHMDIRNGPG 451

Query: 1062 GILPMSGTTRFPQSIRSGMLQGQ-LAGHGSDALXXXXXXXXXXXXXXTTILGNRNCN-GR 1235
             +   SG T FP SIRS   + Q +A   S+A+                     +   G 
Sbjct: 452  SVRSGSGNTLFPHSIRSATSKTQSIALSNSEAIDTDYQMSSNNNMDRGNHFHRPSATIGT 511

Query: 1236 SRTEDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
                 +N   SA +  LDIYESSRYDAILLKEDLK+TNWLH  ++K+D   I D+ FE
Sbjct: 512  EVGGGENGRFSASLNHLDIYESSRYDAILLKEDLKNTNWLHSTDDKTDLASILDNGFE 569


>ref|XP_006429633.1| hypothetical protein CICLE_v100114702mg, partial [Citrus clementina]
            gi|557531690|gb|ESR42873.1| hypothetical protein
            CICLE_v100114702mg, partial [Citrus clementina]
          Length = 368

 Score =  260 bits (664), Expect = 1e-66
 Identities = 163/342 (47%), Positives = 198/342 (57%), Gaps = 10/342 (2%)
 Frame = +3

Query: 417  EKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXXXXXD--PPLVTSRS-GRL 587
            +KTR SQ +RPSTP+SR QIPANL                        P + +S S GR+
Sbjct: 30   DKTRTSQTSRPSTPSSRPQIPANLNSSTARSSSRPSTPTRRNPITSTSPAMSSSTSAGRV 89

Query: 588  ITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRTTLPDRPLSAGRSRPSAAV 767
            ++NGR+                  QPIVPPDFPLDTPPNLRTTLPDRPLSAGRSRP AA+
Sbjct: 90   MSNGRSQGPASRPSSPSPRVRSQ-QPIVPPDFPLDTPPNLRTTLPDRPLSAGRSRPGAAL 148

Query: 768  TVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANGHHGGATENQKIKSFSDPA 947
            T++ N EA G  N+ RRH SSP+V RGRL E  GR R  ANGH   A E ++    S+ +
Sbjct: 149  TMKSNPEATGSVNMPRRH-SSPVVTRGRLTEPPGRSRTPANGHTADAHEYRRTSHISEQS 207

Query: 948  MRKHAKPA-TATESIGLGRTSSKKSLEI-VNRMDIRHGSSGILPMSGTTRFPQSIRSGML 1121
             R+  K   TA++  G GRT SKKSL++ +  MDIR+G+  I  +SGTT FPQSIRS   
Sbjct: 208  TRRPVKSTNTASDGTGFGRTISKKSLDMAIRHMDIRNGAGSIRQLSGTTLFPQSIRSATS 267

Query: 1122 QGQLAG-----HGSDALXXXXXXXXXXXXXXTTILGNRNCNGRSRTEDDNCYPSAKVCEL 1286
            + + A      H +  L                  GN   +GR          SAK+ E 
Sbjct: 268  KTRSARALESVHTNGILKNRDISEKGNTYSGPAENGNDAHDGRY---------SAKLSEA 318

Query: 1287 DIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFEP 1412
            DIYESSRYDAILLKEDLK+TNWLH  ++KSDQG IFD  FEP
Sbjct: 319  DIYESSRYDAILLKEDLKNTNWLHSFDDKSDQGAIFDTGFEP 360


>ref|XP_006381258.1| hypothetical protein POPTR_0006s11130g [Populus trichocarpa]
            gi|550335959|gb|ERP59055.1| hypothetical protein
            POPTR_0006s11130g [Populus trichocarpa]
          Length = 597

 Score =  259 bits (661), Expect = 3e-66
 Identities = 182/475 (38%), Positives = 230/475 (48%), Gaps = 6/475 (1%)
 Frame = +3

Query: 3    LLTPPGTPLFSSLDASESQPTVSAPRSNSHVRSVSTTNP-SRLSVTQPENNYXXXXXXXX 179
            LLTPPGTPL    + SES+PT  APRS+S  RS STT   SRLSV+Q E+ +        
Sbjct: 120  LLTPPGTPLSPPSEGSESKPTSVAPRSSSLARSTSTTKAVSRLSVSQSESYHSSRPTRSS 179

Query: 180  XXXXXXXXXXXXXXXXXXXXXXXXXXXXALVTSLSRPSTPGMXXXXXXXXXXXXXXXXXX 359
                                        A V+S +RPS+P                    
Sbjct: 180  SVTRPSISSSQYSTYSSNRSSSILNTSSASVSSYTRPSSP--ITRTPPIARPSTPPARPT 237

Query: 360  XXXXXXXXXXXXXXXXXXGEKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXX 539
                               +KT   QN+RPSTP+SR Q PAN                  
Sbjct: 238  PSRSSTPSRVRPAPTSSSVDKTPPFQNSRPSTPSSRGQSPANFSAAPTRSNSRPSTPTRR 297

Query: 540  XXXXDPPLVTSRS---GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLR 710
                     +S S   GR+++NGR                   QP++PPDFPLDTPPNLR
Sbjct: 298  NPAPSSSAASSPSTSAGRVLSNGRIPGPASRPSSPSPRVRPPQQPVIPPDFPLDTPPNLR 357

Query: 711  TTLPDRPLSAGRSRPSAAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHAN 890
            TTL  RPLSAGRSR   +  ++GN E  G  N  RRHSS PIV RGRL E SG+GR+H+N
Sbjct: 358  TTLQGRPLSAGRSRTGVSSAMKGNPETMGSLNAPRRHSS-PIVTRGRLTEPSGKGRVHSN 416

Query: 891  GHHGGATENQKIKSFSDPAMRKHAKPATA-TESIGLGRTSSKKSLEI-VNRMDIRHGSSG 1064
            GH     E +K+   S+  +R+  K ++A ++S G GRT SKKSL++ +  MDIR+G+  
Sbjct: 417  GHVADTPEPRKVSHVSEVGIRRPVKSSSAASDSTGFGRTISKKSLDMAIRHMDIRNGTGS 476

Query: 1065 ILPMSGTTRFPQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGRSRT 1244
               +S TT FPQSIRS   + Q                         I  +R        
Sbjct: 477  ARSLSSTTLFPQSIRSTTPKSQSVRSQRTQESINNGNSQNGDVLDDEIHFSRAAEIGHEA 536

Query: 1245 EDDNCYPSAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
             D     SAK+ ++DIYESSRYDAILL EDLK+TNWLH I++KSDQG  FD+  E
Sbjct: 537  NDGRY--SAKLSDVDIYESSRYDAILL-EDLKNTNWLHSIDDKSDQGPFFDNGSE 588


>ref|XP_007033486.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508712515|gb|EOY04412.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 442

 Score =  255 bits (652), Expect = 3e-65
 Identities = 154/342 (45%), Positives = 203/342 (59%), Gaps = 10/342 (2%)
 Frame = +3

Query: 417  EKTRASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXXXXXDPPLVTSRSG----- 581
            +K+R SQ++RPSTP+SR QIPANL                      P L ++ +G     
Sbjct: 96   DKSRPSQSSRPSTPSSRPQIPANLNSTAVRSNSRPSTPTRRNPI--PSLSSAAAGASPSA 153

Query: 582  -RLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRTTLPDRPLSAGRSRPS 758
             R ++NGR+                  QP+VPPDFPLDTPPNLRTTLPDRP+SAGRSRP 
Sbjct: 154  GRTLSNGRSAAPASRPSSPGPRVRPPQQPVVPPDFPLDTPPNLRTTLPDRPVSAGRSRPG 213

Query: 759  AAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANGHHGGATENQKIKSFS 938
             +V ++ N +     N+ RRHSS PIV RGRL E  GR R+H+NGH     E++K    +
Sbjct: 214  VSVGMKANQDTTSSVNMPRRHSS-PIVTRGRLTEPPGRTRVHSNGHASDIHESRKTSHVN 272

Query: 939  DPAMRKHAKPATAT-ESIGLGRTSSKKSLEI-VNRMDIRHGSSGILPMSGTTRFPQSIRS 1112
            D AMRK  K +T T +S G GRT SKKSL++ +  MDIR+G+  I  +SGTT FPQSIRS
Sbjct: 273  DSAMRKPVKSSTTTADSAGFGRTISKKSLDMAIRHMDIRNGTGSIRSLSGTTLFPQSIRS 332

Query: 1113 GMLQGQL--AGHGSDALXXXXXXXXXXXXXXTTILGNRNCNGRSRTEDDNCYPSAKVCEL 1286
               + Q   +   SD++              +    + +   ++ ++  +   SAK  E+
Sbjct: 333  ATTRTQSLRSFSTSDSVNSNGSPGSLQNGDFSENGNSISRPVQNGSDSHDGRYSAKFSEV 392

Query: 1287 DIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFEP 1412
            DIYESSRYDAILLKEDLK+TNWLH I++KSD G IF++ FEP
Sbjct: 393  DIYESSRYDAILLKEDLKNTNWLHSIDDKSDPGSIFENGFEP 434


>ref|XP_007140052.1| hypothetical protein PHAVU_008G080400g [Phaseolus vulgaris]
            gi|561013185|gb|ESW12046.1| hypothetical protein
            PHAVU_008G080400g [Phaseolus vulgaris]
          Length = 586

 Score =  231 bits (589), Expect = 6e-58
 Identities = 147/343 (42%), Positives = 184/343 (53%), Gaps = 16/343 (4%)
 Frame = +3

Query: 429  ASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXXXXXDPPLVT----------SRS 578
            +SQ +RPSTP+SR  IPANL                       P ++          S S
Sbjct: 236  SSQGSRPSTPSSRPHIPANLHSPSAPSTRSLSRPSTPTRRSSMPSLSPSPSPTPGSLSSS 295

Query: 579  GRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRTTLPDRPLSAGRSRPS 758
             R   NGR+                  QPIVPPDFPLDTPPNLRTTLPDRP+SAGRSRP 
Sbjct: 296  SRASLNGRSSAPASRPSSPSPRIRPPPQPIVPPDFPLDTPPNLRTTLPDRPVSAGRSRPG 355

Query: 759  AAVTVRGNSEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANGHHGGATENQKIKSFS 938
                    SE         R  SSP+V+RGR+ E   + R +ANGHH  A E +K+    
Sbjct: 356  GTTLKANGSETQASSVTVPRRHSSPVVSRGRMTEPLAKSRGYANGHHADAPEPRKVAHTP 415

Query: 939  DPAMRKHAKPA-TATESIGLGRTSSKKSLEI-VNRMDIRHGSSGILPMSGTTRFPQSIRS 1112
            + A RK  K + TAT++ G GRT SKKSL++ +  MDIR+GS  I  +S TT FPQSIR+
Sbjct: 416  ELAARKSVKASTTATDNNGFGRTISKKSLDMAIKHMDIRNGSGNIRSLSSTTLFPQSIRT 475

Query: 1113 GMLQGQLAGHGSDALXXXXXXXXXXXXXXTTI-LGNRNCNG---RSRTEDDNCYPSAKVC 1280
               +       + A               +   +GN   N    R+R  D+  Y SAKV 
Sbjct: 476  STPKSHSHRVCAPASVDMNGSLLSSSNNGSNFDMGNGISNRSMIRAREVDERQY-SAKVS 534

Query: 1281 ELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
            E+DIYESSRYDA+L KEDLK+TNWLH +++K DQG IFD+ FE
Sbjct: 535  EVDIYESSRYDALLFKEDLKNTNWLHGVDDKCDQGPIFDNGFE 577


>ref|XP_006602722.1| PREDICTED: flocculation protein FLO11-like [Glycine max]
          Length = 585

 Score =  231 bits (588), Expect = 8e-58
 Identities = 148/348 (42%), Positives = 194/348 (55%), Gaps = 17/348 (4%)
 Frame = +3

Query: 417  EKTR-ASQNTRPSTPTSRSQIPANLXXXXXXXXXXXXXXXXXXXXXDPPLVT-------- 569
            EK R +SQ +RPSTP+SR  IPANL                       P ++        
Sbjct: 232  EKNRPSSQGSRPSTPSSRPHIPANLHSPSASSTRSLSRPSTPTRRSSMPSLSPSPSPTTG 291

Query: 570  --SRSGRLITNGRNXXXXXXXXXXXXXXXXXXQPIVPPDFPLDTPPNLRTTLPDRPLSAG 743
              + +GR+ +NGR+                  QPIVPPDFPL+TPPNLRTTLPDRP+SAG
Sbjct: 292  SLTSAGRVSSNGRSSAPASRPSSPSPRVRPPPQPIVPPDFPLETPPNLRTTLPDRPVSAG 351

Query: 744  RSRPSAAVTVRGN-SEAPGPPNLSRRHSSSPIVARGRLPELSGRGRLHANGHHGGATENQ 920
            RSRP   VT++ N SE    P    R  SSPIV+RGR+ E + + R ++NGHH  A+E +
Sbjct: 352  RSRP-GGVTMKANVSETQASPVTMPRRHSSPIVSRGRVTEPAAKTRGYSNGHHADASEPR 410

Query: 921  KIKSFSDPAMRKHAKPA-TATESIGLGRTSSKKSLEI-VNRMDIRHGSSGILPMSGTTRF 1094
            K+    + A RK  + + TA ++ G GRT SKKSL++ +  MDIR+ S  I  +S TT F
Sbjct: 411  KVSHAPEVAARKSIRSSTTAPDNTGFGRTISKKSLDMAIKHMDIRNSSGNIRSLSSTTLF 470

Query: 1095 PQSIRSGMLQGQLAGHGSDALXXXXXXXXXXXXXXTTILGN---RNCNGRSRTEDDNCYP 1265
            PQSIR+   +       + A                  +GN   RN   + R  D+  Y 
Sbjct: 471  PQSIRTSTSKSHRVS-SAPASVDMNGSMISSKNGANFDVGNGIDRNMMMKGRDADERQY- 528

Query: 1266 SAKVCELDIYESSRYDAILLKEDLKSTNWLHDINEKSDQGLIFDHRFE 1409
            SAK+ E+DIYESSRYDA+L KEDLK+TNWLH  ++K DQG IFD+ FE
Sbjct: 529  SAKLSEVDIYESSRYDALLFKEDLKNTNWLHGADDKCDQGPIFDNGFE 576

