BLASTX nr result

ID: Akebia22_contig00006233 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00006233
         (3224 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241...   398   e-107
ref|XP_007225552.1| hypothetical protein PRUPE_ppa002972m2g, par...   386   e-104
ref|XP_007018942.1| C-jun-amino-terminal kinase-interacting prot...   377   e-101
gb|EXB60491.1| hypothetical protein L484_014946 [Morus notabilis]     376   e-101
ref|XP_002513834.1| conserved hypothetical protein [Ricinus comm...   374   e-100
ref|XP_004140377.1| PREDICTED: uncharacterized protein LOC101213...   358   7e-96
ref|XP_006472701.1| PREDICTED: cell wall protein AWA1-like [Citr...   352   8e-94
ref|XP_006434106.1| hypothetical protein CICLE_v10000635mg [Citr...   349   4e-93
ref|XP_004300437.1| PREDICTED: uncharacterized protein LOC101294...   346   3e-92
ref|XP_007041567.1| Uncharacterized protein isoform 1 [Theobroma...   341   1e-90
ref|XP_007041568.1| Uncharacterized protein isoform 2 [Theobroma...   340   3e-90
ref|XP_003544160.1| PREDICTED: putative GPI-anchored protein PB1...   335   6e-89
ref|XP_004142686.1| PREDICTED: uncharacterized protein LOC101213...   330   3e-87
ref|XP_003542703.1| PREDICTED: cell wall protein AWA1-like isofo...   327   3e-86
ref|XP_002301016.1| hypothetical protein POPTR_0002s08960g [Popu...   327   3e-86
ref|XP_007141111.1| hypothetical protein PHAVU_008G168300g [Phas...   325   6e-86
ref|XP_007141110.1| hypothetical protein PHAVU_008G168200g [Phas...   325   1e-85
ref|XP_006351189.1| PREDICTED: putative GPI-anchored protein PB1...   324   2e-85
ref|XP_003614856.1| hypothetical protein MTR_5g060420 [Medicago ...   323   4e-85
ref|XP_004163112.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   318   1e-83

>ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera]
          Length = 665

 Score =  398 bits (1022), Expect = e-107
 Identities = 280/681 (41%), Positives = 349/681 (51%), Gaps = 69/681 (10%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTTSRNRSSVGIGDCXXXX 1186
            M+++EP LVPEWLK             HHF  S L SDD        +  V   D     
Sbjct: 1    MDKTEPALVPEWLKSSGSVTGGGSTN-HHFAPSLLQSDDGAALKPARKLMVNSNDHDTGR 59

Query: 1187 XXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDPLDFRENERS 1366
                              NGS         H RS+S+FGR +R+R+W+KD  D+R+ ++S
Sbjct: 60   SSNLERTTSSYFRRSSSSNGS--------GHPRSFSSFGRTNREREWEKDIHDYRDKDKS 111

Query: 1367 VLAS----------------RIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXX 1498
            VL+                 R+E+DMLRRSQSMI+GKRG++ PR+V+AD           
Sbjct: 112  VLSDHRHRDYSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSN 171

Query: 1499 XXXXX--------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMI 1654
                         + KA F+R+FPSLGAE+KQG PDIGRV+SPGLTSAI SLP+G++ +I
Sbjct: 172  GDGQLASGIVTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVI 231

Query: 1655 GGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRAR- 1831
            GG+GWTSALAEVP+ IG+N+                       GLNMAE L Q P+RAR 
Sbjct: 232  GGDGWTSALAEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARA 291

Query: 1832 -TTPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSEKQKPKTASRNEANVANMIGQ 2008
              TPQLSV TQRLEELA+KQSRQLIPMTPSMPKT ++ S   KPK          + IG 
Sbjct: 292  NATPQLSVGTQRLEELALKQSRQLIPMTPSMPKT-LVPSPSDKPK----------SKIGL 340

Query: 2009 QQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPINVGK 2188
            Q      HL+NH+ R GG AR D  K S+ GKL VLKP+RE +    T  DSL P    +
Sbjct: 341  QPL----HLVNHSQR-GGPARSDVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSR 395

Query: 2189 IANDPHAVAPS-TGFTSVRSP-NHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRND 2362
            +AN P AV PS  G  S+RSP N+P L+  ER+ + + T    S+EK+PT  SQAQSRND
Sbjct: 396  VANSPLAVTPSAAGSASLRSPRNNPTLASAERRPSVVLT----SVEKRPT--SQAQSRND 449

Query: 2363 FFNLMRKKTXXXXXXXXXXXXXXXXXXXXXGE----TGIATAPVSPQG----XXXXXXXX 2518
            FFNLMRKK+                           T + TAPV+P+G            
Sbjct: 450  FFNLMRKKSSTNPPSAVPESGPAVSSSVSEKSDELITEVVTAPVTPKGRDILSSDNSGLD 509

Query: 2519 XXXXXXXXVTENGGNIT---------------------------------SNGDVGEESR 2599
                     TENG N                                    NGD  + S+
Sbjct: 510  WSNENRGDKTENGNNEACGVSQNDRDDEIDNVNGDACDVSQRDQGDEVHDGNGDACDVSQ 569

Query: 2600 GFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXXXXXXINSFYKEYIKLKPSS 2779
             F + GEKHSSP+ +LYPDEEEAAFLRSL                IN+FYKE +KLKPSS
Sbjct: 570  KFLDNGEKHSSPDEVLYPDEEEAAFLRSL-GWEENGEDEGLTEEEINAFYKECMKLKPSS 628

Query: 2780 KLCQGVQHQKLQLPLDSHKGN 2842
             L Q +   K+   LDS  G+
Sbjct: 629  NLLQRML-PKISPLLDSQMGS 648


>ref|XP_007225552.1| hypothetical protein PRUPE_ppa002972m2g, partial [Prunus persica]
            gi|462422488|gb|EMJ26751.1| hypothetical protein
            PRUPE_ppa002972m2g, partial [Prunus persica]
          Length = 571

 Score =  386 bits (992), Expect = e-104
 Identities = 264/611 (43%), Positives = 325/611 (53%), Gaps = 28/611 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTTS---RNRSSVGIGDCX 1177
            MERSEPTLVPEWL+             HHF  SS HSD  VT+ +   RNR+S  I D  
Sbjct: 1    MERSEPTLVPEWLRSTGSVTGGGNSA-HHFASSSSHSD--VTSLAHHLRNRTSKSISDFD 57

Query: 1178 XXXXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRD----------- 1324
                                 NGS  H         +YS+F R+HRD+D           
Sbjct: 58   TPRSAFLLDRSSSSNSRRSSSNGSAKH---------AYSSFNRSHRDKDRDKEKERLNYG 108

Query: 1325 --WDKDPLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRV-----SADPXXXXX 1483
              WD+D  D   N   +  SR+EKD LRRSQSM++ K+ E+ PRR      S++      
Sbjct: 109  DHWDRDCSDPLGN---IFTSRVEKDTLRRSQSMVARKQSELLPRRAVIDSKSSNSNHNNG 165

Query: 1484 XXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMIGGN 1663
                      I K  F++DFPSLG EE+  VPDIGRV SPG ++A+ SLP+GSSA+IGG 
Sbjct: 166  NGLLSGVGVSIQKVVFDKDFPSLGTEERPAVPDIGRVPSPGFSTAVQSLPVGSSALIGGE 225

Query: 1664 GWTSALAEVPIK-IGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRARTTP 1840
            GWTSALAEVP   I ++S                       GLNMAE LAQ P+RART P
Sbjct: 226  GWTSALAEVPSTIIASSSSGSFPVQPTVAATSGSGTSTAMAGLNMAEALAQAPARARTAP 285

Query: 1841 QLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLN-SEKQKPKTASR-NEANVANMIGQQQ 2014
            QLS++TQRLEELAIKQSRQLIP+TPSMPK SVLN S+K KPKTA+R  E NV    GQQQ
Sbjct: 286  QLSIKTQRLEELAIKQSRQLIPVTPSMPKASVLNSSDKSKPKTAARTGEMNVPAKGGQQQ 345

Query: 2015 QLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPINVGKIA 2194
            Q +  H  N +LR GG  + D  K SH GK  VLKP  EN  S +  + +    N  ++A
Sbjct: 346  QPSQLHHANQSLR-GGPVKSDPPKTSH-GKFLVLKPVWENGVSSSPKDVTSPTNNASRVA 403

Query: 2195 NDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDFFNL 2374
            N P  VAP+     +RSPN+PKLS VERK  AL     S++EK+P ++SQ QSRNDFFNL
Sbjct: 404  NSPLVVAPAVASAPLRSPNNPKLSPVERKVAALDLKSGSTLEKRP-SLSQVQSRNDFFNL 462

Query: 2375 MRKKT--XXXXXXXXXXXXXXXXXXXXXGE-TG-IATAPVSPQGXXXXXXXXXXXXXXXX 2542
            ++KKT                       GE TG + + P SP                  
Sbjct: 463  LKKKTSMNSSITLPDSGPIISSPTMEKSGELTGEVFSDPASPH----------------- 505

Query: 2543 VTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXX 2722
              ENGG +T NGD  EE + FS+TG     P+  +YPDEEEA FLRSL            
Sbjct: 506  AIENGGEVTVNGDSSEEVQRFSDTG-----PSVAVYPDEEEARFLRSLGWDDNPCDDGGL 560

Query: 2723 XXXXINSFYKE 2755
                I++FY +
Sbjct: 561  TEEEISAFYDQ 571


>ref|XP_007018942.1| C-jun-amino-terminal kinase-interacting protein 3, putative
            [Theobroma cacao] gi|508724270|gb|EOY16167.1|
            C-jun-amino-terminal kinase-interacting protein 3,
            putative [Theobroma cacao]
          Length = 625

 Score =  377 bits (969), Expect = e-101
 Identities = 266/648 (41%), Positives = 339/648 (52%), Gaps = 41/648 (6%)
 Frame = +2

Query: 995  SILVMERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSD-DRVTTTSRNRSSVGIGD 1171
            ++ +MERSEP L PEWL+             HHF  SS HSD   V    RNR+S  + D
Sbjct: 4    NVSLMERSEPALAPEWLRSTGTVTGGGNSA-HHFASSSSHSDVSSVAHHGRNRNSRNLID 62

Query: 1172 CXXXXXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHS---RSYSNFGRNHRDRD------ 1324
                                     S ++ + SS++     +YS+F RNHRD+D      
Sbjct: 63   -------------FDSPHSAFLDRASSLNSRRSSSNGSAKHAYSSFSRNHRDKDRDRDKE 109

Query: 1325 -------WDKDPLDFRENERS--------VLASRIEKDMLRRSQSMISGKRGEVGPRRVS 1459
                   WD+D  D  E+  +        +  SR+E++ LRRS SM+S K+GE   RR++
Sbjct: 110  RSSFGDHWDRDSSDPLESILTSRVEKLGGISISRVERETLRRSYSMVSRKQGEPLSRRIA 169

Query: 1460 ADPXXXXXXXXXXXXXXX--------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTS 1615
             D                        IHKA FE+DFPSLG EEKQGVP+I RVSSPGL+S
Sbjct: 170  VDSRDSGNGNHNNGNGLLSGGTIGSSIHKAVFEKDFPSLGNEEKQGVPEIARVSSPGLSS 229

Query: 1616 AIHSLPMGSSAMIGGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNM 1795
            A  SLP+G+SA+IGG GWTSALAEVP  +G++S                       GLNM
Sbjct: 230  ASQSLPVGNSALIGGEGWTSALAEVPSVVGSSS-TGSLPAPVTVSTSGSGAPSVTAGLNM 288

Query: 1796 AEKLAQTPSRARTTPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLN-SEKQKPKTAS 1972
            AE L Q PSR RT PQLSV+TQR EELAIKQSRQLIP+TPSMPK SVLN S+K K K A 
Sbjct: 289  AEALVQAPSRIRTAPQLSVKTQRREELAIKQSRQLIPVTPSMPKGSVLNSSDKSKAKPAV 348

Query: 1973 R-NEANVANMIGQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPT 2149
            R +E N+A   GQQQ   SPH        GG A+ D  K S  GKL VLKP  EN  S  
Sbjct: 349  RTSEMNIAVKSGQQQ---SPH--------GGHAKSDMPKTS--GKLLVLKPGWENGVSSP 395

Query: 2150 TTNDSLKPI--NVGKIANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEK 2323
            T  D   P   +  + A + HAVAP T  +  R+ N+ KLS  ERK  AL      ++EK
Sbjct: 396  TQKDVASPTTNSNSRAATNQHAVAPVTS-SPARNSNNTKLSAGERKPAALNPIAGFTVEK 454

Query: 2324 KPTTMSQAQSRNDFFNLMRKKTXXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQGXXX 2503
            +P +++Q QSRNDFFNL++KKT                        G++ + +       
Sbjct: 455  RP-SLAQTQSRNDFFNLLKKKTSTNT------------------SAGLSDSDLHNSSCTT 495

Query: 2504 XXXXXXXXXXXXXVT----ENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAA 2671
                          T    ENG    SNGD  +E++ FS+ GEK+ S  A++YPDEEEAA
Sbjct: 496  EKSEVTKEVVCASATAHANENGTASNSNGDACQEAQRFSDDGEKNMSSTAMVYPDEEEAA 555

Query: 2672 FLRSLXXXXXXXXXXXXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQ 2815
            FLRSL                IN+FY+EY+KL+PS KLC+GVQ ++ +
Sbjct: 556  FLRSLGWEENSGEDEGLTEEEINAFYQEYMKLRPSLKLCRGVQPKQAE 603


>gb|EXB60491.1| hypothetical protein L484_014946 [Morus notabilis]
          Length = 609

 Score =  376 bits (965), Expect = e-101
 Identities = 263/648 (40%), Positives = 337/648 (52%), Gaps = 36/648 (5%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTTSRNRSSVGIGDCXXXX 1186
            MERSEPTLVP+WL+             H F  SS HSD  +   +RNR+S  I +     
Sbjct: 1    MERSEPTLVPQWLRSAGSVTGGGNSAPH-FASSSSHSDVSLAPNARNRASKSISEFETPR 59

Query: 1187 XXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSR----------SYSNFGRNHRDRD---- 1324
                                S   D+ SS++SR          +YS+F RNHRD+D    
Sbjct: 60   --------------------SAFLDRSSSSNSRRGSSNGSAKHAYSSFNRNHRDKDREKD 99

Query: 1325 -------WDKDPLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSAD------ 1465
                   WD+D  D   N   +  SR+EKD LRRSQS++S K+GE+  RR + D      
Sbjct: 100  RDRFGDHWDRDSSDPLGN---IFPSRVEKDTLRRSQSLVSRKQGELVSRRANVDLKTSSN 156

Query: 1466 -PXXXXXXXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGS 1642
                             I KA+FE+DFPSLGAEE+QG P+IGRV SPG T+A+ SLP+GS
Sbjct: 157  SNHNNGNGLLSVSIGAGIQKASFEKDFPSLGAEERQGGPEIGRVPSPGFTTAVQSLPVGS 216

Query: 1643 SAMIGGNGWTSALAEVPIKIGNNS-GXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTP 1819
            SA++GG GWTSALAEVP  +G++S G                      GLNMAE LAQ P
Sbjct: 217  SALVGGEGWTSALAEVPSLMGSSSSGSLSSAQQTAAPTSGSATPTAMAGLNMAEALAQAP 276

Query: 1820 SRARTTPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSEKQKPKTASRN-EANVAN 1996
            SRART PQ+SV+TQRLEELAIKQSRQLIP+TPSMPK SVLNSEK KPKT +R+ E NV  
Sbjct: 277  SRARTAPQVSVKTQRLEELAIKQSRQLIPVTPSMPKASVLNSEKSKPKTGARSGEMNVGT 336

Query: 1997 MIGQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPI 2176
               QQQ  +S   +N  LR  G  + D+ K SH GK  VLKP  EN  +P  + D   P 
Sbjct: 337  KTVQQQP-SSLQNVNQYLR-SGNVKSDTPKTSH-GKYLVLKPVWENGVTP-PSKDVTSPT 392

Query: 2177 N--VGKIANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQ 2350
            N    + ++   AVAP       RSPN  K+S ++ K+        S++EK+P ++SQ Q
Sbjct: 393  NSSTSRASSTQLAVAPPVVSAPSRSPNSQKVSSLDLKSG-------STLEKRP-SLSQVQ 444

Query: 2351 SRNDFFNLMRKKT----XXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQGXXXXXXXX 2518
            SRNDFFNL++KKT                         G   + +AP SP          
Sbjct: 445  SRNDFFNLIKKKTSVNPSATLPESGPNISSPTSEKSGEGNREVCSAPASPHPV------- 497

Query: 2519 XXXXXXXXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXX 2698
                        G  +  NG+  +E + FS+ GE    P++ +Y DEEEA FL+SL    
Sbjct: 498  ------------GAEVNGNGENCKEIQRFSDNGEDECPPSSDIYLDEEEAKFLKSLGWDE 545

Query: 2699 XXXXXXXXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLPLDSHKGN 2842
                        IN+FY+E +K KP  KLC+G+Q QKL +   SH  N
Sbjct: 546  NAGEDEGLTEEEINAFYEECMKTKPPLKLCRGLQ-QKLSMLSKSHVTN 592


>ref|XP_002513834.1| conserved hypothetical protein [Ricinus communis]
            gi|223546920|gb|EEF48417.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 596

 Score =  374 bits (959), Expect = e-100
 Identities = 275/643 (42%), Positives = 337/643 (52%), Gaps = 34/643 (5%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTT-SRNRSSVGIGDCXXX 1183
            MERSEPTLVPEWL+             HHF  SS HSD   +   SR+R+S    D    
Sbjct: 1    MERSEPTLVPEWLRSSGSVPGGGSSA-HHFASSSPHSDVSSSVHHSRSRNSKSTSDFDSP 59

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSR----------SYSNFGRNHRDRDW-- 1327
                                 S   D+ SS++SR          +YS+F R+HRD+D   
Sbjct: 60   R--------------------SAFLDRTSSSNSRRSSSNGSAKHAYSSFSRSHRDKDRER 99

Query: 1328 DKDPLDFR---ENERS----VLASRIEKDMLRRSQSMISGKRGEVGPRRVSAD------- 1465
            DK+ L+F    +N+ S     + SR EKD LRRS SM+S K GEV PRR +AD       
Sbjct: 100  DKERLNFGNHWDNDASDPLGSILSRNEKDALRRSHSMVSRKLGEVLPRRFAADLRNGSNS 159

Query: 1466 -PXXXXXXXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGS 1642
                             I KA FE+DFPSLG+EE+QG PDIGRVSSPGL++A+ SLP+ S
Sbjct: 160  NHVNGNGLISGGGVGNSIPKAVFEKDFPSLGSEERQGAPDIGRVSSPGLSTAVQSLPVSS 219

Query: 1643 SAMIGGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPS 1822
            SA+IGG GWTSALAEVP  IGNNS                       GLNMAE L Q P+
Sbjct: 220  SALIGGEGWTSALAEVPAIIGNNSS-GSSSSVQTVATSASGAPSTVAGLNMAEALTQAPT 278

Query: 1823 RARTTPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLN-SEKQKPKTASR-NEANVAN 1996
            R RT PQLSV+TQRLEELAIKQSRQLIP+TPSMPK+SVLN S+K KPKT  R +E N+A 
Sbjct: 279  RTRTAPQLSVQTQRLEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSSEMNMAP 338

Query: 1997 MIGQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPI 2176
                QQQ +S H +  +L  GG  + D+ K SH GKLFVLKP  EN  SP +  D   P 
Sbjct: 339  K-NLQQQPSSLHAVTQSL-AGGHVKSDASKASH-GKLFVLKPGWENGASP-SPKDIANPN 394

Query: 2177 NVGKIANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSR 2356
            N G+ AN   A APS     +RSPN+PKLS  ERK+ +L      ++EK+P  +SQ QSR
Sbjct: 395  NAGRAANSQLAAAPSVPSAPLRSPNNPKLSAGERKSASLNLISGFNVEKRP-LLSQTQSR 453

Query: 2357 NDFFNLMRKKTXXXXXXXXXXXXXXXXXXXXXGETGI----ATAPVSPQGXXXXXXXXXX 2524
            +DFFNL++KKT                         I    A+AP  PQ           
Sbjct: 454  HDFFNLLKKKTLKNSSTALTDSASAISSPTNEKACEINKEAASAPSCPQ----------- 502

Query: 2525 XXXXXXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXX 2704
                    +NG  +T NG   EE                     EEEAAFLRSL      
Sbjct: 503  ------AIKNGSELTGNGGTCEE-------------------VSEEEAAFLRSLGWEENS 537

Query: 2705 XXXXXXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLPLDSH 2833
                      IN+F +E +KLKPS K+C+G+Q QKL   ++SH
Sbjct: 538  GEDEGLTEEEINAFIQECMKLKPSLKVCRGMQ-QKL---IESH 576


>ref|XP_004140377.1| PREDICTED: uncharacterized protein LOC101213347 [Cucumis sativus]
          Length = 615

 Score =  358 bits (920), Expect = 7e-96
 Identities = 264/638 (41%), Positives = 335/638 (52%), Gaps = 30/638 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTTSRNRSSVGIGDCXXXX 1186
            MERSEPTLVPEWL+             HHFP SS HSD    + SRNR S   GD     
Sbjct: 1    MERSEPTLVPEWLRSTGSVAGGGNPN-HHFPSSSSHSDVPSLSQSRNRISKTTGD----- 54

Query: 1187 XXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSR----------SYSNFGRNHRDRDWDK- 1333
                              + S   D+ SS++SR          +YS+F R HRD+D +K 
Sbjct: 55   ---------------FDSSRSSFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKE 99

Query: 1334 -DPLDFREN-ERS-------VLASRIEKDMLRRSQSMISGKRGEVGPRRVSAD---PXXX 1477
             D L+F +N +R        +L++RI+KD LRRS SM+S K+GE+  RRV  +       
Sbjct: 100  KDRLNFGDNWDRDAHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVGTELKSHNSS 159

Query: 1478 XXXXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMI- 1654
                        I KA FE+DFPSLG+EEKQG  +IGRVSSPGL+S + SLP+G+SA+I 
Sbjct: 160  NGILSGTSVGSSIQKAVFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIV 219

Query: 1655 GGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRART 1834
            GG GWTSALAEVP  IG+ +G                      GLNMAE L Q PSRAR 
Sbjct: 220  GGEGWTSALAEVPSMIGSTTG-SSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARA 278

Query: 1835 TPQ---LSVETQRLEELAIKQSRQLIPMTPSMPKTSVL-NSEKQKPKTASRNEANVANMI 2002
             PQ   LSV+TQRLEELAIKQSRQLIP+TPSMPK  VL +S+K KPK ASR     A + 
Sbjct: 279  APQVSELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVLSSSDKSKPKLASRTGELNATIK 338

Query: 2003 GQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPINV 2182
            G Q Q   P L++      G  +PD+ K SH GK  VLKP REN  S    + S    N 
Sbjct: 339  GGQPQ---PLLVHANQSRVGHVKPDAQKSSH-GKFLVLKPVRENGVSLAAKDVSSPTSNA 394

Query: 2183 GKI-ANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRN 2359
              + AN   A+APS     +RSPN+  +S +ERK  +L     +++EK+P ++SQ QSRN
Sbjct: 395  NSMAANSQFALAPSVPHAPLRSPNNINVSSMERKIASLDLKTGTTLEKRP-SLSQVQSRN 453

Query: 2360 DFFNLMRKKTXXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQ-GXXXXXXXXXXXXXX 2536
            DFF L++KKT                       +   ++  SP  G              
Sbjct: 454  DFFKLIKKKTSMNSSAVL---------------SDSCSSVKSPSIGQSNELTSEEMGTAS 498

Query: 2537 XXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXX 2716
              V ENG     NG+  EE +   ++GEK  S  A    DEEEAAFLRSL          
Sbjct: 499  PRVIENGAVENRNGNSSEEVQVSRDSGEKTESHVAAESLDEEEAAFLRSLGWDESCGEDE 558

Query: 2717 XXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLPLDS 2830
                  INSFY+EY+ LKPS K+ + +Q  K+ +P +S
Sbjct: 559  GLTEEEINSFYREYVNLKPSLKIGRCIQ-PKIFVPSES 595


>ref|XP_006472701.1| PREDICTED: cell wall protein AWA1-like [Citrus sinensis]
          Length = 607

 Score =  352 bits (902), Expect = 8e-94
 Identities = 264/626 (42%), Positives = 324/626 (51%), Gaps = 31/626 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVT---TTSRNRSSVGIGDCX 1177
            ME+SEPTLVP+WL+             HHF  SS HSD   +   T +RN  S   G   
Sbjct: 1    MEKSEPTLVPQWLRNAGSVTGGGGST-HHFSSSS-HSDVPSSVHHTRTRNSKS---GSDF 55

Query: 1178 XXXXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRD----------- 1324
                                 NGS  H         +YS+F RNHRD+D           
Sbjct: 56   DAPRSAFLDRSSSSNSRRSSSNGSAKH---------AYSSFNRNHRDKDRERDKERSSYG 106

Query: 1325 --WDKDPLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXX 1498
              WD+D  D      S+L+SR+EKD LRRS SM+S K+ E+ PRRV+ D           
Sbjct: 107  DLWDRDSSD---PLGSILSSRMEKD-LRRSHSMVSRKQNELLPRRVAVDSKINSNSNHIN 162

Query: 1499 XXXXX--------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMI 1654
                         I K  FE+DFPSLG+EEKQGVPDIGRVSSPGL+SA+ SLP+G+S +I
Sbjct: 163  GNDDVTGGSTGSSIKKVVFEKDFPSLGSEEKQGVPDIGRVSSPGLSSAVQSLPVGNSTLI 222

Query: 1655 GGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRART 1834
            GG GWTSALAEVP  IGN+S                       GLNMAE LAQ PSRART
Sbjct: 223  GGEGWTSALAEVPPIIGNSSS-GSLSAQTGSGTTLSGPPSVMAGLNMAEALAQAPSRART 281

Query: 1835 TPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLN-SEKQKPKTASR-NEANVANMIGQ 2008
             PQLSV+TQRL+EL IK+S+QLIP+TPSMPK+SVLN S+K KPKTA R ++ ++A   GQ
Sbjct: 282  APQLSVKTQRLDELTIKKSKQLIPVTPSMPKSSVLNFSDKSKPKTAVRISDMSMAVKNGQ 341

Query: 2009 QQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPINVGK 2188
            QQ  A  H  N +L   G  + D  K SH GKL VLKPA EN  S +  + +    N   
Sbjct: 342  QQP-APLHHANQSLH-VGNVKTDVPKTSH-GKLLVLKPAWENGVSHSPKDGASPTNNANS 398

Query: 2189 IANDPHAVA-PSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDF 2365
             A    ++A PS    + RSPN+PKL   ERK TAL      S E++P ++SQ QSRNDF
Sbjct: 399  RATTSQSIAVPSVASATPRSPNNPKLPSGERKATALNPISGFSAERRP-SLSQTQSRNDF 457

Query: 2366 FNLMRKKT--XXXXXXXXXXXXXXXXXXXXXGET--GIATAPVSPQGXXXXXXXXXXXXX 2533
            FNL++KKT                       GE    + +AP SP               
Sbjct: 458  FNLLKKKTSMNTSGLPADSGTDIPSPAGEKHGEVTKDVISAPSSPH-------------- 503

Query: 2534 XXXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXX 2713
               V ENG  +T NG   +E++ FS  GEK  S  A + PD EEAAFLRSL         
Sbjct: 504  ---VIENGAQVTINGGTHKETQRFSGAGEKTMSRYAAVDPD-EEAAFLRSLGWEENSGED 559

Query: 2714 XXXXXXXINSFYKEYIKLKPSSKLCQ 2791
                   I +FY+E+ K     KL Q
Sbjct: 560  EGLTEEEIKAFYQEFEKRGMQLKLPQ 585


>ref|XP_006434106.1| hypothetical protein CICLE_v10000635mg [Citrus clementina]
            gi|557536228|gb|ESR47346.1| hypothetical protein
            CICLE_v10000635mg [Citrus clementina]
          Length = 607

 Score =  349 bits (896), Expect = 4e-93
 Identities = 262/626 (41%), Positives = 322/626 (51%), Gaps = 31/626 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVT---TTSRNRSSVGIGDCX 1177
            ME+SEPTLVP+WL+             H    SS HSD   +   T +RN  S   G   
Sbjct: 1    MEKSEPTLVPQWLRNAGSVTGGGGSTNHF--SSSSHSDVPSSVHHTRTRNSKS---GSDF 55

Query: 1178 XXXXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRD----------- 1324
                                 NGS  H         +YS+F RNHRD+D           
Sbjct: 56   DAPRSAFLDRSSSSNSRRSSSNGSAKH---------AYSSFNRNHRDKDRERDKERSSYG 106

Query: 1325 --WDKDPLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADP--------XX 1474
              WD+D  D      S+L+SR+EKD LRRS SM+S K+ E+ PRRV+ D           
Sbjct: 107  DLWDRDSSD---PLGSILSSRMEKD-LRRSHSMVSRKQNELLPRRVAVDSKINSNSNHIN 162

Query: 1475 XXXXXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMI 1654
                         I K  FE+DFPSLG+EEKQGVPDIGRVSSPGL+SA+ SLP+G+S +I
Sbjct: 163  GNDDVTGGSTGSSIKKVVFEKDFPSLGSEEKQGVPDIGRVSSPGLSSAVQSLPVGNSTLI 222

Query: 1655 GGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRART 1834
            GG GWTSALAEVP  IGN+S                       GLNMAE LAQ PSRART
Sbjct: 223  GGEGWTSALAEVPPIIGNSSS-GSLSAQTGSGTTLSGPPSVMAGLNMAEALAQAPSRART 281

Query: 1835 TPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLN-SEKQKPKTASR-NEANVANMIGQ 2008
             PQLSV+TQRL+EL IK+S+QLIP+TPSMPK+SVLN S+K KPKTA R ++ ++A   GQ
Sbjct: 282  APQLSVKTQRLDELTIKKSKQLIPVTPSMPKSSVLNFSDKSKPKTAVRISDMSMAVKNGQ 341

Query: 2009 QQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPINVGK 2188
            QQ  A  H  N +L   G  + D  K SH GKL VLKPA EN  S +  + +    N   
Sbjct: 342  QQP-APLHHANQSLH-VGNVKTDVPKTSH-GKLLVLKPAWENGVSHSPKDGASPTNNANS 398

Query: 2189 IANDPHAVA-PSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDF 2365
             A    + A PS    + RSPN+PKL   ERK TAL      S E++P ++SQ QSRNDF
Sbjct: 399  RATTSQSTAVPSVASATPRSPNNPKLPSGERKATALNPISGFSAERRP-SLSQTQSRNDF 457

Query: 2366 FNLMRKKT--XXXXXXXXXXXXXXXXXXXXXGET--GIATAPVSPQGXXXXXXXXXXXXX 2533
            FNL++KKT                       GE    + +AP+SP               
Sbjct: 458  FNLLKKKTSMNTSGLPADSGTDIPSPAGEKHGEVTKDVISAPLSPH-------------- 503

Query: 2534 XXXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXX 2713
               V ENG  +T NG   +E++ FS  GEK  S  A + PD EEAAFLRSL         
Sbjct: 504  ---VIENGAQVTINGGTHKETQRFSGAGEKTMSRYAAVDPD-EEAAFLRSLGWEENSGED 559

Query: 2714 XXXXXXXINSFYKEYIKLKPSSKLCQ 2791
                   I +FY+E+ K     KL Q
Sbjct: 560  EGLTEEEIKAFYQEFEKRGMQLKLPQ 585


>ref|XP_004300437.1| PREDICTED: uncharacterized protein LOC101294372 [Fragaria vesca
            subsp. vesca]
          Length = 611

 Score =  346 bits (888), Expect = 3e-92
 Identities = 256/640 (40%), Positives = 323/640 (50%), Gaps = 28/640 (4%)
 Frame = +2

Query: 1007 MERSEPTL---VPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTTSRNRSSVGIGDCX 1177
            ME+SEP L    P+WL+             HHF  SS   D +    SR+R++    D  
Sbjct: 1    MEKSEPPLGPLAPQWLRNTGGVTGGGSSTHHHFASSS---DVQPAHHSRSRTTKTTSDID 57

Query: 1178 XXXXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDP--LDFR 1351
                                 NGS  H         +YS+F R+HRD+D +K+   L+F 
Sbjct: 58   PTRSSYLERSSSSNPRRSSS-NGSAKH---------AYSSFSRSHRDKDREKEKERLNFG 107

Query: 1352 EN-ERSV---LASRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXXXXXXXIH 1519
            E  +R     L+    KD LRRSQSM S  + E   RR++ D                  
Sbjct: 108  EPWDRDCPDHLSLYSNKDALRRSQSMSSRNKSETLSRRIAIDSKSGSNSIHNNGNGLLSG 167

Query: 1520 ------KATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMIGGNGWTSAL 1681
                   A F++DFPSLG EE+QGVPDIGRV SPG TSA+ SLP+G+SA+IGG  + SAL
Sbjct: 168  GGVGSPNAVFDKDFPSLGTEERQGVPDIGRVPSPGFTSAVQSLPVGNSALIGGEQFKSAL 227

Query: 1682 AEVP-IKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRARTTPQLSVET 1858
            AEVP   IG++S                       GLNMAE L Q P+RART PQLS+ T
Sbjct: 228  AEVPNAIIGSSSSGSFSVQPTVAATSESGASVAMAGLNMAEALVQAPARARTVPQLSIRT 287

Query: 1859 QRLEELAIKQSRQLIPMTPSMPKTSVL-NSEKQKPKTASRNEANVANMIGQQQQLASPHL 2035
            QRLEELA+KQSRQLIP+TPSMPK+S L +S+K KPK A R    +A + G QQQ +  H 
Sbjct: 288  QRLEELALKQSRQLIPVTPSMPKSSALSSSDKLKPKPAVRAGEMIAPVKGGQQQPSQSHH 347

Query: 2036 INHALRGGGQARPDSGKISHGGKLFVLKPARENS-------TSPTTTNDSLKPINVGKIA 2194
             N +L  GG  + D+ K SHG    VLKP  EN        TSPT+   S       + A
Sbjct: 348  ANQSLH-GGPVKSDAPKTSHGKGFLVLKPVWENGISSPKDVTSPTSNASS-------RAA 399

Query: 2195 NDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDFFNL 2374
            N P AVAP       RSPN+PKL  VERK  AL     +++EK+P ++SQ QSRNDFFNL
Sbjct: 400  NSPLAVAPPVVSAPSRSPNNPKLLAVERKVAALDLKSGATLEKRP-SLSQVQSRNDFFNL 458

Query: 2375 MRKKTXXXXXXXXXXXXXXXXXXXXXGE---TG-IATAPVSPQGXXXXXXXXXXXXXXXX 2542
            ++KKT                          TG + + P SP                  
Sbjct: 459  LKKKTSVNSSITLPDSGPNISPPTIEKSGDITGEVFSDPASPH----------------- 501

Query: 2543 VTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXX 2722
              ENGG +T NG   EE + FS TG     P+A +YPDEEEA FLRSL            
Sbjct: 502  -IENGGEVTGNGVSSEEVQRFSGTG-----PSAAVYPDEEEARFLRSLGWEENSGDDGGL 555

Query: 2723 XXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLPLDSHKGN 2842
                IN+FY +Y+KL+PS KL +G+Q +   LP +SH  N
Sbjct: 556  TEEEINAFYDQYMKLRPSLKLNRGMQPKLSTLP-ESHATN 594


>ref|XP_007041567.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508705502|gb|EOX97398.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 625

 Score =  341 bits (875), Expect = 1e-90
 Identities = 250/645 (38%), Positives = 331/645 (51%), Gaps = 33/645 (5%)
 Frame = +2

Query: 1004 VMERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDR-VTTTSRNRSSVGIGDCXX 1180
            VMERSEP+LVPEWLK             H F  SSLHSD+      +RN+ SV  GD   
Sbjct: 5    VMERSEPSLVPEWLKSGGSVTGSGNSN-HQFTSSSLHSDNHSALRPTRNKLSVA-GDHDV 62

Query: 1181 XXXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDPLDFRENE 1360
                                NGS        AH RSYS+F + HRDRDWDKD   + + E
Sbjct: 63   GGTSVLDRTTSAYFRRSSSSNGS--------AHLRSYSSFTKGHRDRDWDKDINGYHDRE 114

Query: 1361 RSVLA----------------SRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXX 1492
            +SV++                S  EKD+L RSQS I+GKR +  P++V++D         
Sbjct: 115  KSVISDHRNRNFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNH 173

Query: 1493 XXXXXXXI-------HKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAM 1651
                           +K+ FER+FP LGAEE+Q   +IGRVSSPGL++A  SLP+G+SA+
Sbjct: 174  SSSNGLLSGVSTTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAI 233

Query: 1652 IGGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRAR 1831
             G +GWTSALA++P  +G++                        GLNMAE L Q PSRAR
Sbjct: 234  SGSDGWTSALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRAR 293

Query: 1832 TTPQLSVETQRLEELAIKQSRQLIPM-TPSMPKTSVLN-SEKQKPKTASRNEANVANMIG 2005
            T P L+V TQRLEELAIKQSRQL+P+ T S PK  V++ SEK KPK            +G
Sbjct: 294  TPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPK------------VG 341

Query: 2006 QQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPIN-V 2182
            QQQ  +    ++     GG +R DS K+S+ G+L +LKP+RE +     T D+L P N  
Sbjct: 342  QQQHAS----LSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGS 397

Query: 2183 GKIANDPHAVAPSTGFTSV--RSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSR 2356
             K+ N P +V PS   ++    S N P  +  ER  T        ++EK+PT  +QAQSR
Sbjct: 398  SKLVNSPLSVTPSASASAPFRSSGNSPSFATAERNQTPFRI----NIEKRPT--AQAQSR 451

Query: 2357 NDFFNLMRKK--TXXXXXXXXXXXXXXXXXXXXXGETGI--ATAPVSPQGXXXXXXXXXX 2524
            NDFFNL++KK  T                      E G   A+  V+ QG          
Sbjct: 452  NDFFNLLKKKSTTNSPSSVADRGPAASPSVSEKSDELGTEDASTSVTLQGGSVPSSEISI 511

Query: 2525 XXXXXXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXX 2704
                   T+N   IT NGD    S+  S+ G++H+ P+A LYPDEEEAAFLRSL      
Sbjct: 512  ADLP---TDNRSEITHNGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENA 568

Query: 2705 XXXXXXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLPLDSHKG 2839
                      I++F++E++KLKPS+KL   +Q     +PL+SH G
Sbjct: 569  GDDEGLTEEEISAFFEEHMKLKPSAKLFHRMQS---IVPLNSHNG 610


>ref|XP_007041568.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508705503|gb|EOX97399.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 620

 Score =  340 bits (871), Expect = 3e-90
 Identities = 249/644 (38%), Positives = 330/644 (51%), Gaps = 33/644 (5%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDR-VTTTSRNRSSVGIGDCXXX 1183
            MERSEP+LVPEWLK             H F  SSLHSD+      +RN+ SV  GD    
Sbjct: 1    MERSEPSLVPEWLKSGGSVTGSGNSN-HQFTSSSLHSDNHSALRPTRNKLSVA-GDHDVG 58

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDPLDFRENER 1363
                               NGS        AH RSYS+F + HRDRDWDKD   + + E+
Sbjct: 59   GTSVLDRTTSAYFRRSSSSNGS--------AHLRSYSSFTKGHRDRDWDKDINGYHDREK 110

Query: 1364 SVLA----------------SRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXX 1495
            SV++                S  EKD+L RSQS I+GKR +  P++V++D          
Sbjct: 111  SVISDHRNRNFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHS 169

Query: 1496 XXXXXXI-------HKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMI 1654
                          +K+ FER+FP LGAEE+Q   +IGRVSSPGL++A  SLP+G+SA+ 
Sbjct: 170  SSNGLLSGVSTTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAIS 229

Query: 1655 GGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRART 1834
            G +GWTSALA++P  +G++                        GLNMAE L Q PSRART
Sbjct: 230  GSDGWTSALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAETLVQGPSRART 289

Query: 1835 TPQLSVETQRLEELAIKQSRQLIPM-TPSMPKTSVLN-SEKQKPKTASRNEANVANMIGQ 2008
             P L+V TQRLEELAIKQSRQL+P+ T S PK  V++ SEK KPK            +GQ
Sbjct: 290  PPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPK------------VGQ 337

Query: 2009 QQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPIN-VG 2185
            QQ  +    ++     GG +R DS K+S+ G+L +LKP+RE +     T D+L P N   
Sbjct: 338  QQHAS----LSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSS 393

Query: 2186 KIANDPHAVAPSTGFTSV--RSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRN 2359
            K+ N P +V PS   ++    S N P  +  ER  T        ++EK+PT  +QAQSRN
Sbjct: 394  KLVNSPLSVTPSASASAPFRSSGNSPSFATAERNQTPFRI----NIEKRPT--AQAQSRN 447

Query: 2360 DFFNLMRKK--TXXXXXXXXXXXXXXXXXXXXXGETGI--ATAPVSPQGXXXXXXXXXXX 2527
            DFFNL++KK  T                      E G   A+  V+ QG           
Sbjct: 448  DFFNLLKKKSTTNSPSSVADRGPAASPSVSEKSDELGTEDASTSVTLQGGSVPSSEISIA 507

Query: 2528 XXXXXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXX 2707
                  T+N   IT NGD    S+  S+ G++H+ P+A LYPDEEEAAFLRSL       
Sbjct: 508  DLP---TDNRSEITHNGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAG 564

Query: 2708 XXXXXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLPLDSHKG 2839
                     I++F++E++KLKPS+KL   +Q     +PL+SH G
Sbjct: 565  DDEGLTEEEISAFFEEHMKLKPSAKLFHRMQS---IVPLNSHNG 605


>ref|XP_003544160.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Glycine
            max]
          Length = 618

 Score =  335 bits (860), Expect = 6e-89
 Identities = 264/657 (40%), Positives = 327/657 (49%), Gaps = 28/657 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTTSRNRSSVGIGDCXXXX 1186
            MERSEP LVPEWL+               F  SS H+D   + + RN+SS   G      
Sbjct: 1    MERSEPALVPEWLRSAGSVAGAGSSA-QQFASSSAHTD---SLSVRNKSSKN-GSDFDSA 55

Query: 1187 XXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDP--------- 1339
                              NGS  H         +YS+F RNHRD+D D++          
Sbjct: 56   RSVFLERTSSSNSRRSSMNGSAKH---------AYSSFNRNHRDKDRDREKDRSSFGDHW 106

Query: 1340 -LDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXXXXXXX- 1513
              D  +   ++   R+E+D LRRS SM+S K+ EV PRRV  D                 
Sbjct: 107  DCDGSDPLANIFPGRMERDTLRRSHSMVSRKQNEVIPRRVVVDTKSGGSHQNNSNGILSG 166

Query: 1514 ------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGL-TSAIHSLPMGSSAMIGGNGWT 1672
                  I KA F++DFPSL  EEKQG+ D+ RVSSP L  +A  SLP+GSSA+IGG GWT
Sbjct: 167  SNVSNSIQKAVFDKDFPSLSTEEKQGIADVVRVSSPALGAAASQSLPVGSSALIGGEGWT 226

Query: 1673 SALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRARTTPQLSV 1852
            SALAEVP  IG++S                       GLNMAE LAQTPSRAR+ PQ+ V
Sbjct: 227  SALAEVPAIIGSSSTGSLSVQQTVNTTSGSVASSTTAGLNMAEALAQTPSRARSAPQVLV 286

Query: 1853 ETQRLEELAIKQSRQLIPMTPSMPKTSVLNSEKQKPKTASRN-EANVANMIGQQQQLASP 2029
            +TQRLEELAIKQSRQLIP+TPSMPK SV NSEK KPKTA RN + NV      QQ  A  
Sbjct: 287  KTQRLEELAIKQSRQLIPVTPSMPKASVHNSEKSKPKTAIRNADMNVVTKSVPQQPPAL- 345

Query: 2030 HLINHALRGGGQARPDSGKISHGGKLFVLKP-ARENSTSPTTTNDSLKPINVGKI-ANDP 2203
            H+ N ++R    ++ D+ K S  GK   LK    EN TSP T+ D   P N       + 
Sbjct: 346  HIANQSVR-SVNSKVDAPKTS--GKFTDLKSVVWENGTSP-TSKDVSNPTNYSNSKPGNQ 401

Query: 2204 HAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDFFNLMRK 2383
            HAVA       +R+PN+ K S  ERK T++     S++EKK  ++SQ QSRNDFFNL++K
Sbjct: 402  HAVALGAASAPLRNPNNLK-SPTERKPTSMDLKLGSNLEKK-HSISQVQSRNDFFNLIKK 459

Query: 2384 KT--XXXXXXXXXXXXXXXXXXXXXGET--GIATAP-VSPQGXXXXXXXXXXXXXXXXVT 2548
            KT                       GE   G+  +P  SPQ                   
Sbjct: 460  KTLMNSSAVLPDSGPMVSSPAMEKSGEVNRGVIVSPSASPQSHG---------------- 503

Query: 2549 ENGGNITSNG-DVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXXX 2725
             NG  +TSNG    EE    S+  EK S+P+  +YPDEEEAAFLRSL             
Sbjct: 504  -NGTELTSNGTHAHEEVHRLSDNEEKESNPSVTIYPDEEEAAFLRSLGWEENSDEDEGLT 562

Query: 2726 XXXINSFYKEYIKLKPSS-KLCQGVQHQKLQLPLDSHKGNXXXXXXXXXXXDTRSIA 2893
               IN+FY+E  KL P++ KLCQG Q  KL    +S+  N           D RS A
Sbjct: 563  EEEINAFYQECKKLDPTTFKLCQGKQ-PKLSKLFESYASNLCESSAELSSSDPRSEA 618


>ref|XP_004142686.1| PREDICTED: uncharacterized protein LOC101213356 [Cucumis sativus]
          Length = 619

 Score =  330 bits (845), Expect = 3e-87
 Identities = 241/617 (39%), Positives = 307/617 (49%), Gaps = 25/617 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSD-DRVTTTSRNRSSVGIGDCXXX 1183
            MERSEPTLVPEWL+               F  SS HSD       SR+R+S  I D    
Sbjct: 1    MERSEPTLVPEWLRSSGSLSGSGIA--QQFASSSSHSDISSQGHYSRSRTSKSISDIDKP 58

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDP-------- 1339
                               + S      + +   +YSNF RNHRDRD +K+         
Sbjct: 59   HFDFLDWSS----------SSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDS 108

Query: 1340 --LDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXXXXXXX 1513
               DF     +V +SR EK+ LRRS SM+S K+G++ P+RV+ D                
Sbjct: 109  WGYDFSSPLVNVFSSRAEKETLRRSHSMVSRKQGDLFPQRVAVDLKSGGYNHKANSNGFH 168

Query: 1514 I--------HKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMIGGNGW 1669
            +         KA F++DFPSLG+EE+QG PD+GRVSSPGLT+ + SLP+GSS +IG  GW
Sbjct: 169  LGSTINGITDKAVFDKDFPSLGSEERQGGPDVGRVSSPGLTTCVQSLPIGSSTLIGREGW 228

Query: 1670 TSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRARTTPQ-- 1843
            TSALAEVP  +  +                           MAE L Q P+R R T Q  
Sbjct: 229  TSALAEVPTTVTGSPAAPSSIQQTANSGLGSPNATTPR--KMAEALTQAPTRGRVTSQST 286

Query: 1844 -LSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNS-EKQKPKTASRN-EANVANMIGQQQ 2014
             LSV+TQRLEELAIKQSRQLIP+TPSMPK SVL++ EK K K ASR  E NV    GQQQ
Sbjct: 287  ELSVKTQRLEELAIKQSRQLIPVTPSMPKVSVLSTFEKSKSKGASRTAEMNVPGKGGQQQ 346

Query: 2015 QLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPINVGKIA 2194
                 H  N     GGQ + DS K +H GK  VLKP  EN      +N  +  +N     
Sbjct: 347  LSMMQH--NSQPLRGGQVKSDSPKTTH-GKFLVLKPVWENGVLKDGSN-PINNVNSRTAN 402

Query: 2195 NDPHAVAPS-TGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDFFN 2371
            + P +VA S T  TS    N    S +ERK  AL     S++E++P + +Q+QSR+DFFN
Sbjct: 403  SQPSSVASSATSNTSRNQNNLTPSSSLERKVAALDLKSGSTLERRPPS-AQSQSRSDFFN 461

Query: 2372 LMRKKTXXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQGXXXXXXXXXXXXXXXXVTE 2551
            L++KKT                      ++GI T+P+  +                 VT+
Sbjct: 462  LIKKKTLVNGSTCLQ-------------DSGICTSPIKEKSGIANGEVVSAAVHPSAVTD 508

Query: 2552 NGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXXXXX 2731
            +   + SNGD  EE + FS    K  SPN  L  DEEEAAFLRSL               
Sbjct: 509  D--EVASNGDTSEEVQRFSEVVNKSLSPNKALCTDEEEAAFLRSLGWEENSGEDEGLTEE 566

Query: 2732 XINSFYKEYIKLKPSSK 2782
             IN+FY++Y+ LKPS K
Sbjct: 567  EINAFYQQYMNLKPSLK 583


>ref|XP_003542703.1| PREDICTED: cell wall protein AWA1-like isoform X1 [Glycine max]
          Length = 621

 Score =  327 bits (837), Expect = 3e-86
 Identities = 256/640 (40%), Positives = 321/640 (50%), Gaps = 28/640 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDD-RVTTTSRNRSSVGIGDCXXX 1183
            MERSEP LVPEWL+               F  SS H+D   V   SRNRSS   G     
Sbjct: 1    MERSEPALVPEWLRSAGSVAGAGSSA-QQFASSSGHTDSLSVAHHSRNRSSKN-GSDFDS 58

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDP-------- 1339
                               NGS  H         +YS+F R+HRD+D D++         
Sbjct: 59   ARSVFLERTSSSNSRRSSINGSAKH---------AYSSFNRSHRDKDRDREKDRSSFGDH 109

Query: 1340 --LDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXXXXXXX 1513
               D  +   ++   R+E+D LRRS SM+S K+ EV PRRV+ D                
Sbjct: 110  WDCDGSDPLANLFPGRMERDTLRRSHSMVSRKQSEVIPRRVAVDTKSGGSHQNNSNGILS 169

Query: 1514 -------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAI-HSLPMGSSAMIGGNGW 1669
                   I KA F++DFPSL  EEKQG+ ++ RVSSPGL +A+  SLP+GSSA+IGG GW
Sbjct: 170  GSNVSSSIQKAVFDKDFPSLSTEEKQGIAEVVRVSSPGLGAAVSQSLPVGSSALIGGEGW 229

Query: 1670 TSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRARTTPQLS 1849
            TSALAEVP  IG++S                       GLNMAE LAQTPSRAR+ PQ+ 
Sbjct: 230  TSALAEVPAIIGSSSTGSLSVQQTVNTTSGSVAPSTTAGLNMAEALAQTPSRARSAPQVL 289

Query: 1850 VETQRLEELAIKQSRQLIPMTPSMPKTSVLNSEKQKPKTASRN-EANVANMIGQQQQLAS 2026
            V+TQRLEELAIKQSRQLIP+TPSMPK SV NSEK KPKTA RN + NV      QQ  A 
Sbjct: 290  VKTQRLEELAIKQSRQLIPVTPSMPKASVHNSEKSKPKTAIRNADMNVVTKTVPQQPSAL 349

Query: 2027 PHLINHALRGGGQARPDSGKISHGGKLFVLKP-ARENSTSPTTTNDSLKPINVGKI-AND 2200
             H+ + ++R    A+ D+ K S  GK   LK    EN  SP T+ D   P N       +
Sbjct: 350  -HIASQSVR-SVNAKVDTPKTS--GKFTDLKSVVWENGASP-TSKDVSNPTNYSNSKPGN 404

Query: 2201 PHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDFFNLMR 2380
             HAVA       +R+PN+ K S  ERK +++     S++EKK  ++SQ QSRNDFFNL++
Sbjct: 405  QHAVASGAASAPLRNPNNLK-SPTERKPSSMDLKLGSNLEKK-HSISQVQSRNDFFNLIK 462

Query: 2381 KKT--XXXXXXXXXXXXXXXXXXXXXGETG--IATAPVSPQGXXXXXXXXXXXXXXXXVT 2548
            KKT                       GE    I     SPQ                   
Sbjct: 463  KKTLMNCSAVLPDSGPMVSSPAMEKSGEVNREIVNPSASPQSLG---------------- 506

Query: 2549 ENGGNITSNGDVGEE-SRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXXX 2725
             NG  +TSNG    E     S+  EK S+P+  +YP+EEEAAFLRSL             
Sbjct: 507  -NGTELTSNGTHAHEVIHRISDNEEKESNPSVTIYPEEEEAAFLRSLGWEENSDEDEGLT 565

Query: 2726 XXXINSFYKEYIKLKPSS-KLCQGVQHQKLQLPLDSHKGN 2842
               IN+FY+E  KL P++ KL QG+Q  KL    +S+  N
Sbjct: 566  EEEINAFYQECKKLDPTAFKLSQGMQ-PKLSKLFESYASN 604


>ref|XP_002301016.1| hypothetical protein POPTR_0002s08960g [Populus trichocarpa]
            gi|222842742|gb|EEE80289.1| hypothetical protein
            POPTR_0002s08960g [Populus trichocarpa]
          Length = 591

 Score =  327 bits (837), Expect = 3e-86
 Identities = 254/646 (39%), Positives = 313/646 (48%), Gaps = 37/646 (5%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSD-DRVTTTSRNRSSVGIGDCXXX 1183
            MERSEP+LVPEWL+             HHF  SS HSD   +   +RNRS   I D    
Sbjct: 1    MERSEPSLVPEWLRSPGSVSGAGNSA-HHFASSSSHSDVSSLGNHTRNRSFKSINDFDSP 59

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRS----------YSNFGRNHRDRD--- 1324
                                 S   D+ SS++SR           YS+F R+HRD+D   
Sbjct: 60   R--------------------SAFLDRQSSSNSRRSSINGSAKHPYSSFSRSHRDKDRER 99

Query: 1325 ----------WDKDPLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRV------ 1456
                      WD+D  D       +L SR EKD LR S SM+S K  EV  RR       
Sbjct: 100  DKERSSFGDHWDRDSSD---PLGGILTSRNEKDTLRHSHSMVSRKHSEVMLRRAASELKN 156

Query: 1457 --SADPXXXXXXXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSL 1630
              S++                  KA FE+DFPSLG E+++GVPDI RVSSPGL+S++ +L
Sbjct: 157  GSSSNLANSNGLVSGGSFGSSSQKAVFEKDFPSLGNEDREGVPDIARVSSPGLSSSVQNL 216

Query: 1631 PMGSSAMIGGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLA 1810
            P+GSSA+IGG GWTSALAEVP  IGN+S                       GLNMAE L 
Sbjct: 217  PVGSSALIGGEGWTSALAEVPTIIGNSS-TSSSSTAQTVAASSSGTSSVMAGLNMAEALT 275

Query: 1811 QTPSRARTTPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNS-EKQKPKTASR-NEA 1984
            Q P R RT PQLSV+TQRLEELAIKQSRQLIP+TPSMPK  VL+S +K KPKT  R  E 
Sbjct: 276  QAPLRTRTAPQLSVQTQRLEELAIKQSRQLIPVTPSMPKNLVLSSSDKSKPKTGIRPGEM 335

Query: 1985 NVANMIGQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDS 2164
            N+A    QQQ  +S H  N +   G   + D+ K S  GKLFVLKP  EN  SP+   D+
Sbjct: 336  NMAAKSSQQQ--SSLHPANQS-SVGVHVKSDATKTS--GKLFVLKPVWENGVSPSP-KDA 389

Query: 2165 LKPINVGKIANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQ 2344
              P    + AN   A APS     +RSPN+PK+S V+RK T+L        EK+      
Sbjct: 390  ASPNTSSRTANSQLA-APSVPSPPLRSPNNPKISSVDRKPTSLNLNSGFGGEKR------ 442

Query: 2345 AQSRNDFFNLMRKKTXXXXXXXXXXXXXXXXXXXXXG---ETGIATAPVSPQGXXXXXXX 2515
             QSRN+FFN ++KKT                            + +AP SPQ        
Sbjct: 443  TQSRNNFFNDLKKKTAMNTSSVADSASVVLSPASEKSCEVIKEVVSAPASPQA------- 495

Query: 2516 XXXXXXXXXVTENGGNITSNGDVGEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXX 2695
                       +NG  +TSNG   EE + FS                EEE +FLRSL   
Sbjct: 496  ----------VQNGAELTSNGGTLEEVQRFS----------------EEEVSFLRSLGWE 529

Query: 2696 XXXXXXXXXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLPLDSH 2833
                         IN+F +EYI  KPS K+C+G+    LQ P + H
Sbjct: 530  ENSGEEEGLTEEEINAFLQEYITKKPSLKVCRGM----LQKPNECH 571


>ref|XP_007141111.1| hypothetical protein PHAVU_008G168300g [Phaseolus vulgaris]
            gi|593488489|ref|XP_007141112.1| hypothetical protein
            PHAVU_008G168300g [Phaseolus vulgaris]
            gi|561014244|gb|ESW13105.1| hypothetical protein
            PHAVU_008G168300g [Phaseolus vulgaris]
            gi|561014245|gb|ESW13106.1| hypothetical protein
            PHAVU_008G168300g [Phaseolus vulgaris]
          Length = 623

 Score =  325 bits (834), Expect = 6e-86
 Identities = 246/636 (38%), Positives = 317/636 (49%), Gaps = 24/636 (3%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDR-VTTTSRNRSSVGIGDCXXX 1183
            MERSEPTLVPEWL+              HFP SS H+D   V   +RN+S    GD    
Sbjct: 1    MERSEPTLVPEWLRSAGSVAGAGSST-QHFPSSSNHTDSSSVAHHTRNKSFKNAGD-FDS 58

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKD--------- 1336
                               NGS  H         +YS+F R+HRD+D D++         
Sbjct: 59   ARSVFLERTSSSNSRRSSINGSAKH---------AYSSFNRSHRDKDRDRERDRSSFGDN 109

Query: 1337 -PLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXXXXXXX 1513
              +D  +   ++ + R+E+D LRRS SMIS K+ E+ PRRV+ D                
Sbjct: 110  WEIDGSDPLTNLFSGRMERDTLRRSHSMISRKQSEIVPRRVAVDTKSGGNSHYNNSNGIL 169

Query: 1514 --------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAI-HSLPMGSSAMIGGNG 1666
                    I KA F++DFPSLG EEKQG  ++ RVSSPGL  A   SLP+GSS +IGG G
Sbjct: 170  SGSNVSSSIQKAVFDKDFPSLGTEEKQGTAEVVRVSSPGLGGAASQSLPVGSSTLIGGEG 229

Query: 1667 WTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRARTTPQL 1846
            WTSALAEVP  IG++S                         NMAE LAQTPSRAR+TPQ+
Sbjct: 230  WTSALAEVPAIIGSSSTGSLSVQHTVNTNSGSVASITTASRNMAEALAQTPSRARSTPQV 289

Query: 1847 SVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSEKQKPKTASRNEANVANMIGQQQQLAS 2026
             V+TQRLEELAIKQSRQLIP+TPS+ K SVL+SEK KPKT+ RN           QQ ++
Sbjct: 290  LVKTQRLEELAIKQSRQLIPVTPSIAKASVLSSEKSKPKTSIRNADMSVVTKTVSQQPSA 349

Query: 2027 PHLINHALRGGGQARPDSGKISHGGKLFVLKP-ARENSTSPTTTNDSLKPINVGKIANDP 2203
             H+ + ++R    A+ ++ K S  GK   LK    EN  SPT+   S           + 
Sbjct: 350  LHIASQSVR-SVNAKVEAPKTS--GKFTDLKSVVWENGASPTSKEVSHPTNYSNSKPGNQ 406

Query: 2204 HAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDFFNLMRK 2383
            HAVA       +R+PN+ K S  ERK+ +      S+++KK  ++SQ QSRNDFFNL++K
Sbjct: 407  HAVASGATSAPLRNPNNLK-SSTERKSASSDLKLGSTLDKK-HSISQVQSRNDFFNLIKK 464

Query: 2384 KTXXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQGXXXXXXXXXXXXXXXXVTENGGN 2563
            KT                         +    VS  G                   NG  
Sbjct: 465  KTLMNASTVLPDSVPMVSSPMMEKSDEVNREIVSESGSPQSLG-------------NGTE 511

Query: 2564 ITSNGDV--GEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXXXXXXI 2737
            +TSNG+    EE +  S+  EK S P A +YPDEEEAAFLRSL                I
Sbjct: 512  LTSNGNAHGHEEFQRLSDKDEKESIPCATIYPDEEEAAFLRSLGWEENSDEDEGLTEEEI 571

Query: 2738 NSFYKEYIKLKPSS-KLCQGVQHQKLQLPLDSHKGN 2842
            N+FY+E   L P++ KLCQG+Q  KL    +S+  N
Sbjct: 572  NAFYQECKNLDPTTLKLCQGMQ-PKLSKLFESYASN 606


>ref|XP_007141110.1| hypothetical protein PHAVU_008G168200g [Phaseolus vulgaris]
            gi|561014243|gb|ESW13104.1| hypothetical protein
            PHAVU_008G168200g [Phaseolus vulgaris]
          Length = 618

 Score =  325 bits (832), Expect = 1e-85
 Identities = 253/640 (39%), Positives = 324/640 (50%), Gaps = 28/640 (4%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDD-RVTTTSRNRSSVGIGDCXXX 1183
            MERSEPTLVPEWL+              HFP SS H+D   V   +R+RSS   G     
Sbjct: 1    MERSEPTLVPEWLRSAGSVAGAGTSA-QHFPSSSTHNDSPSVAHHARSRSSKN-GSDFDN 58

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRSYSNFGRNHRDRDWDKDP-------- 1339
                               NGS  H         +YS+F R+HRD+D D++         
Sbjct: 59   ARSLFLERTSSSNSRRSSVNGSAKH---------AYSSFNRSHRDKDRDREKDRSSFGDI 109

Query: 1340 --LDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADPXXXXXXXXXXXXXXX 1513
               D  +   ++ + R+E+D LRRS SM+S K+ +V PRRV+ D                
Sbjct: 110  WDCDGSDPLANLFSGRMERDTLRRSHSMVSRKQSDVLPRRVAVDTKSGGSSHQSNNNGIL 169

Query: 1514 --------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAI-HSLPMGSSAMIGGNG 1666
                    I KA F++DFPSL  EEKQG P++ RVSSPGL  A   SLP+GSSA+IGG G
Sbjct: 170  SGSNVNSSIQKAVFDKDFPSLSTEEKQGSPEVVRVSSPGLGGATSQSLPVGSSALIGGEG 229

Query: 1667 WTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRARTTPQL 1846
            WTSALAEVP  IG++S                        LNMAE L QTPSRAR+TPQ+
Sbjct: 230  WTSALAEVPTIIGSSSAGSLSVQHTVNTTSGSVASSTTASLNMAEALTQTPSRARSTPQV 289

Query: 1847 SVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSEKQKPKTASRN-EANVANMIGQQQQLA 2023
             V+TQRLEELAIKQSRQLIP+TPSMPK SVLNSEK KPKTA RN E NV        Q +
Sbjct: 290  LVKTQRLEELAIKQSRQLIPVTPSMPKASVLNSEKSKPKTAIRNAEMNVVTK-SVPLQPS 348

Query: 2024 SPHLINHALRGGGQARPDSGKISHGGKLFVLKP-ARENSTSPTTTNDSLKPINV--GKIA 2194
            + H+ + ++R    A+ D+ K S  GK   LK    EN  SP T+ D   P N    K  
Sbjct: 349  ALHMASQSVR-SINAKVDAPKTS--GKFTDLKSVVWENGGSP-TSKDVSHPTNYSNSKPG 404

Query: 2195 NDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRNDFFNL 2374
            N P A AP      +R+PN+ K S  ERK+ +L      +++KK  ++SQ QSRNDFFNL
Sbjct: 405  NHPAAAAP------LRNPNNLK-SSTERKSVSLDLKLGPTLDKK-HSISQVQSRNDFFNL 456

Query: 2375 MRKKTXXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQGXXXXXXXXXXXXXXXXVTEN 2554
            ++KKT                      ++G   +    +                    N
Sbjct: 457  IKKKTLMNSSAVLP-------------DSGPMVSSPMVEKSDEVNGEIVHESSSPQSLGN 503

Query: 2555 GGNITSNGDV---GEESRGFSNTGEKHSSPNAILYPDEEEAAFLRSLXXXXXXXXXXXXX 2725
            G  +TSNG+    GE  R  S+  +K S P + +YPDEEEAAFLRSL             
Sbjct: 504  GTELTSNGNAHAHGEVQR-LSDNEDKESIPCSTIYPDEEEAAFLRSLGWEENSDEDEGLT 562

Query: 2726 XXXINSFYKEYIKLKPSS-KLCQGVQHQKLQLPLDSHKGN 2842
               IN+FY+E   L P++ K+CQG+Q  KL    +S+  N
Sbjct: 563  EEEINAFYQECKNLDPTTFKICQGMQ-PKLSKLFESYASN 601


>ref|XP_006351189.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Solanum
            tuberosum]
          Length = 615

 Score =  324 bits (830), Expect = 2e-85
 Identities = 249/643 (38%), Positives = 326/643 (50%), Gaps = 38/643 (5%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTT-TSRNRSSVGIGDCXXX 1183
            MERSEP LVPEWL+             H F  SSLHSD  ++T +SRNRS   + D    
Sbjct: 1    MERSEPALVPEWLRSTGSVTGGGSSSPH-FATSSLHSDVTLSTLSSRNRSPRSVSDKDSP 59

Query: 1184 XXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSRS----------YSNFGRNHRDRD--- 1324
                                 S+  D+ SS++SR           YS+F RNHRD++   
Sbjct: 60   R--------------------SVFLDRSSSSNSRRSSSGTSSKHPYSSFNRNHRDKNRER 99

Query: 1325 ----------WDKDPLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRV------ 1456
                      WD D  D   N   +LA R++K+ LRRSQS++S K GE  PRR       
Sbjct: 100  EKERPGTVDLWDHDTSDPLGN---ILAGRVDKNSLRRSQSLVSRKPGEFLPRRTEDSKGG 156

Query: 1457 --SADPXXXXXXXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSL 1630
              S                    KA FE+DFPSLG EE+Q    + RVSSPGL+SA+ SL
Sbjct: 157  ISSTHSSGNGIHSGGSSSFNGNQKAAFEKDFPSLGIEERQ----VTRVSSPGLSSAVQSL 212

Query: 1631 PMGSSAMIGGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLA 1810
            P+G+SA++G + WTSALAEVP  IG+                          LNMAE L+
Sbjct: 213  PIGNSALLGADKWTSALAEVPPIIGSIGMGSSASQQSVAVAPTPRALSGTASLNMAEALS 272

Query: 1811 QTPSRARTTPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNS--EKQKPKTASRNEA 1984
            Q P RAR+T Q+  +TQRLEELAIKQSRQLIP+ PSMPK SV +S  + ++PK+ +R   
Sbjct: 273  QAPPRARSTMQIPDKTQRLEELAIKQSRQLIPVIPSMPKVSVSSSADKSKQPKSIARTNE 332

Query: 1985 NVANMIGQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDS 2164
             V      QQ  +S  L N A    GQ R ++   SHG  L VLK  REN  +  +   S
Sbjct: 333  MVGITKSMQQPFSS-QLANQA--RSGQVRAEAPATSHGKTLLVLKSGRENGVTSLSKEAS 389

Query: 2165 LKPINVG-KIANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMS 2341
                N G ++AN P AVAPS    +V SP   +++ +E K  AL+    S+ EK+ +++S
Sbjct: 390  TPANNTGNRLANCPPAVAPSA--PAVTSPT-SRVTSLETKAAALSLKPRSTAEKR-SSLS 445

Query: 2342 QAQSRNDFFNLMRKKTXXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQGXXXXXXXXX 2521
            QAQSR+DFFNLMRKKT                      ++G+A++  S +          
Sbjct: 446  QAQSRSDFFNLMRKKTSNSSTALP--------------DSGMASSN-SREQSCLKTKDED 490

Query: 2522 XXXXXXXVTENGGNITSNGDVGEESRGFS--NTGEKHSSP-NAILYPDEEEAAFLRSLXX 2692
                   V+ENG   TSNGD  E        N  E+++SP N  +YPDE+EAAFLRSL  
Sbjct: 491  SASLSPCVSENGSERTSNGDPHEAQNHVQRHNDVEENNSPINGSVYPDEKEAAFLRSLGW 550

Query: 2693 XXXXXXXXXXXXXXINSFYKEYIKLKPSSKLCQGVQHQKLQLP 2821
                          IN+FY+EY+KLKPS K+ +G Q + L LP
Sbjct: 551  DENAVEEEGLTEEEINAFYQEYMKLKPSLKVYKGAQPKCLMLP 593


>ref|XP_003614856.1| hypothetical protein MTR_5g060420 [Medicago truncatula]
            gi|355516191|gb|AES97814.1| hypothetical protein
            MTR_5g060420 [Medicago truncatula]
          Length = 685

 Score =  323 bits (827), Expect = 4e-85
 Identities = 264/651 (40%), Positives = 326/651 (50%), Gaps = 47/651 (7%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDR---VTTTSRNRSSVGIGDCX 1177
            M+RSEP+LVPEWL+              HF  SS H+D         +RNRSS   GD  
Sbjct: 1    MDRSEPSLVPEWLRSAGSVVGAGNSA-QHFASSSSHADSHSPSAANNNRNRSSKNTGD-- 57

Query: 1178 XXXXXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSR----------SYSNFGRNHRDRD- 1324
                                 + S+  D+ SSA SR          +YS+F RNHRD+D 
Sbjct: 58   ------------------FDSSRSVFLDRTSSASSRRGSINGSAKHAYSSFNRNHRDKDR 99

Query: 1325 ------------WDKDPLDFRENERSVLASRIEKDMLRRSQSMISGKRGEVGPRRVSADP 1468
                        WD+D  D   N   + + RIE+D LRRS SM+S K+GE  PRRV+AD 
Sbjct: 100  DREKDRSNFGDHWDRDGSDPLVN---LFSGRIERDTLRRSHSMVSRKQGETLPRRVAADT 156

Query: 1469 XXXXXXXXXXXXXXX--------IHKATFERDFPSLGAEEKQGVPDIGRVSSPGL-TSAI 1621
                                   I KA F++DFPSLGA+EKQG+ +IGRVSSPGL  +A 
Sbjct: 157  KSGGSSNHNNGNGALSVGSVGSSIQKAVFDKDFPSLGADEKQGIAEIGRVSSPGLGATAS 216

Query: 1622 HSLPMGSSAMIGGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAE 1801
             SLP+GSSA+IGG GWTSALAEVP  IG++S                       GLNMAE
Sbjct: 217  QSLPVGSSALIGGEGWTSALAEVPSVIGSSSAGSSSAQQTIAATSVSVSSSTAAGLNMAE 276

Query: 1802 KLAQTPSRARTTPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLN-SEKQKPKTASRN 1978
             LAQ PSRAR+TPQ+SV+TQRLEELAIKQSRQLIP+TPSMPK   LN SEK KPKTA RN
Sbjct: 277  ALAQAPSRARSTPQVSVKTQRLEELAIKQSRQLIPVTPSMPKALALNSSEKSKPKTAVRN 336

Query: 1979 -EANVANMIGQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKP-ARENSTSPTT 2152
             E NVA     QQ  A  H+ + ++R    A+ D  K S  GK   LK    EN  SP T
Sbjct: 337  AEMNVATKSALQQPSAL-HIASQSVR-IVNAKVDVPKTS--GKFTDLKSVVWENGASP-T 391

Query: 2153 TNDSLKPINV--GKIANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKK 2326
            + D   P N    K AN  H VA +   T VR+P++   S  ERK  +L     S+++KK
Sbjct: 392  SKDVSNPTNYANSKSANQ-HCVASAAAPTPVRNPSNLN-SPRERKPASLDLKLGSALDKK 449

Query: 2327 PTTMSQAQSRNDFFNLMRKKTXXXXXXXXXXXXXXXXXXXXXGETGIATAPVSPQGXXXX 2506
              ++SQ +SRNDFFNL++ KT                         +    V P      
Sbjct: 450  -QSISQVKSRNDFFNLLKNKTATNSSTVFPDSGQMVSSPTLEKSGEVNRESVMPSASPQS 508

Query: 2507 XXXXXXXXXXXXVTENGGNITSNGD----VGEESRGFSNTGEKHSSPNAILYPDEEEAAF 2674
                           N    TSNG+      E     S+  EK+S   A +YPDEEEAAF
Sbjct: 509  -------------VGNAAEPTSNGNAHAHAHEVLSRISDDDEKNS--RATVYPDEEEAAF 553

Query: 2675 LRSLXXXXXXXXXXXXXXXXINSFYKEYI-KLKPSS-KLC-QGVQHQKLQL 2818
            LRSL                IN+FY+E   KL PS+ KLC +G+Q Q  +L
Sbjct: 554  LRSLGWEENSDEDEGLTEEEINAFYQEVCKKLDPSALKLCIEGMQPQLSKL 604


>ref|XP_004163112.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101231906
            [Cucumis sativus]
          Length = 536

 Score =  318 bits (815), Expect = 1e-83
 Identities = 220/490 (44%), Positives = 276/490 (56%), Gaps = 29/490 (5%)
 Frame = +2

Query: 1007 MERSEPTLVPEWLKXXXXXXXXXXXXXHHFPLSSLHSDDRVTTTSRNRSSVGIGDCXXXX 1186
            MERSEPTLVPEWL+             HHFP SS H+D    + SRNR S   GD     
Sbjct: 1    MERSEPTLVPEWLRSTGSVAGGGNPN-HHFPSSSSHADVPSLSQSRNRISKTTGD----- 54

Query: 1187 XXXXXXXXXXXXXXXXXXNGSMVHDKDSSAHSR----------SYSNFGRNHRDRDWDK- 1333
                              + S   D+ SS++SR          +YS+F R HRD+D +K 
Sbjct: 55   ---------------FDSSRSSFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKE 99

Query: 1334 -DPLDFREN-ERS-------VLASRIEKDMLRRSQSMISGKRGEVGPRRVSAD---PXXX 1477
             D L+F +N +R        +L++RI+KD LRRS SM+S K+GE+  RRV  +       
Sbjct: 100  KDRLNFGDNWDRDAHDPLGKILSNRIDKDALRRSHSMVSRKQGELFHRRVGTELKSHNSS 159

Query: 1478 XXXXXXXXXXXXIHKATFERDFPSLGAEEKQGVPDIGRVSSPGLTSAIHSLPMGSSAMI- 1654
                        I KA FE+DFPSLG+EEKQG  +IGRVSSPGL+S + SLP+G+SA+I 
Sbjct: 160  NGILSGTSVGSSIQKAVFEKDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIV 219

Query: 1655 GGNGWTSALAEVPIKIGNNSGXXXXXXXXXXXXXXXXXXXXXXGLNMAEKLAQTPSRART 1834
            GG GWTSALAEVP  IG+ +G                      GLNMAE L Q PSRAR 
Sbjct: 220  GGEGWTSALAEVPSMIGSTTG-SSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARA 278

Query: 1835 TPQ---LSVETQRLEELAIKQSRQLIPMTPSMPKTSVL-NSEKQKPKTASRNEANVANMI 2002
             PQ   LSV+TQRLEELAIKQSRQLIP+TPSMPK  VL +S+K KPK ASR     A + 
Sbjct: 279  APQVSELSVKTQRLEELAIKQSRQLIPVTPSMPKAMVLSSSDKSKPKLASRTGELNATIK 338

Query: 2003 GQQQQLASPHLINHALRGGGQARPDSGKISHGGKLFVLKPARENSTSPTTTNDSLKPINV 2182
            G Q Q   P L++      G  +PD+ K SH GK  VLKP REN  S    + S    N 
Sbjct: 339  GGQPQ---PLLVHANQSRVGHVKPDAQKSSH-GKFLVLKPVRENGVSLAAKDVSSPTSNA 394

Query: 2183 GKI-ANDPHAVAPSTGFTSVRSPNHPKLSLVERKTTALTTTHVSSMEKKPTTMSQAQSRN 2359
              + AN   A+APS     +RSPN+  +S +ERK  +L     +++EK P ++SQ QSRN
Sbjct: 395  NSMAANSQFALAPSVPHAPLRSPNNINVSSMERKIASLDLKTGTTLEKXP-SLSQVQSRN 453

Query: 2360 DFFNLMRKKT 2389
            DFF L++KKT
Sbjct: 454  DFFKLIKKKT 463

