BLASTX nr result

ID: Akebia22_contig00016530 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00016530
         (1820 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN68624.1| hypothetical protein VITISV_010682 [Vitis vinifera]   367   1e-98
emb|CBI21048.3| unnamed protein product [Vitis vinifera]              361   5e-97
ref|XP_002285150.2| PREDICTED: uncharacterized protein LOC100266...   361   7e-97
ref|XP_007011854.1| Uncharacterized protein isoform 3 [Theobroma...   330   2e-87
ref|XP_002282244.1| PREDICTED: uncharacterized protein LOC100250...   313   1e-82
gb|EXB96866.1| hypothetical protein L484_016640 [Morus notabilis]     300   1e-78
ref|XP_007011852.1| Uncharacterized protein isoform 1 [Theobroma...   298   7e-78
ref|XP_004288790.1| PREDICTED: uncharacterized protein LOC101293...   293   1e-76
ref|XP_007011853.1| Uncharacterized protein isoform 2 [Theobroma...   290   1e-75
ref|XP_006450882.1| hypothetical protein CICLE_v10008357mg [Citr...   288   7e-75
ref|XP_002309630.1| hypothetical protein POPTR_0006s27080g [Popu...   288   7e-75
ref|XP_006600747.1| PREDICTED: uncharacterized protein LOC100806...   287   1e-74
ref|XP_007202061.1| hypothetical protein PRUPE_ppa006809mg [Prun...   286   2e-74
ref|XP_004139410.1| PREDICTED: uncharacterized protein LOC101205...   285   5e-74
ref|XP_002324862.2| hypothetical protein POPTR_0018s01770g [Popu...   283   2e-73
ref|XP_007222724.1| hypothetical protein PRUPE_ppa006474mg [Prun...   277   1e-71
ref|XP_002527188.1| conserved hypothetical protein [Ricinus comm...   276   2e-71
ref|XP_002324645.2| hypothetical protein POPTR_0018s12880g [Popu...   270   1e-69
ref|XP_007013493.1| Uncharacterized protein TCM_038116 [Theobrom...   267   1e-68
ref|XP_002515508.1| conserved hypothetical protein [Ricinus comm...   263   2e-67

>emb|CAN68624.1| hypothetical protein VITISV_010682 [Vitis vinifera]
          Length = 526

 Score =  367 bits (942), Expect = 1e-98
 Identities = 216/444 (48%), Positives = 259/444 (58%), Gaps = 8/444 (1%)
 Frame = +2

Query: 341  FELLLKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKL 520
            F    +MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV EIHSSATKKNKEW+EKL
Sbjct: 19   FNFNKRMPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKL 78

Query: 521  PVVVLKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALI 700
            P+VVLKAEEIMYSKANSEAEYMD++TLWDR NDAIN              LQPCIEA+L 
Sbjct: 79   PIVVLKAEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLN 138

Query: 701  LGCTARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGS---SHFLPHHSS-- 865
            LGC  R ASRSQRNNN RCYL+P+TQEP S+ P   + +N  QG     S  +  +++  
Sbjct: 139  LGCPQRRASRSQRNNNPRCYLTPSTQEPISISP--SILENSPQGNHTTISQVMSRYATFI 196

Query: 866  NPTSSPKFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSN 1045
             P+S     P   G E  +   H N+  T                        E +P+SN
Sbjct: 197  KPSSMSVIQP---GLEPHSTAFHNNDCPT----XKFLFSSENCPPSGNKCLQMEVYPASN 249

Query: 1046 LARVYPLYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDE 1216
            L  VYPLY G   Q  E Q GF +   P SN            +EP     +Q+LF    
Sbjct: 250  LCAVYPLYDGNQLQCEESQCGFGVQSHPKSN-----------PMEPAGMGTIQNLF--SY 296

Query: 1217 AVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXX 1396
            A++ +    + D     EN P+I CDLSLRLG +S P  +    WP+E EDVG       
Sbjct: 297  AIDPTKKPSQTDFGHVTENSPKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREG 356

Query: 1397 XXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAG 1576
                DLS R+D +F FFPR N DDPL+SC SKRSSEG+ LN E + RKRK   S  +   
Sbjct: 357  SKFSDLSPRVDKQFPFFPRGNTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDR 416

Query: 1577 QFSQKPKFSQNICFSRMKKPDQ*R 1648
            QF  +PK   N    RM+  D+ R
Sbjct: 417  QFCCQPKLPYNYLPGRMRNADEGR 440


>emb|CBI21048.3| unnamed protein product [Vitis vinifera]
          Length = 451

 Score =  361 bits (927), Expect = 5e-97
 Identities = 210/435 (48%), Positives = 255/435 (58%), Gaps = 8/435 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV EIHSSATKKNKEW+EKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEYMD++TLWDR NDAIN              LQPCIEA+L LGC  R
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLNLGCPQR 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGS---SHFLPHHSS--NPTSSP 883
             ASRSQRNNN RCYL+P+TQEP S+ P   + +N  QG     S  +  +++   P+S  
Sbjct: 121  RASRSQRNNNPRCYLTPSTQEPISISP--SILENSPQGNHTTISQVMSRYATFIKPSSMS 178

Query: 884  KFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYP 1063
               P   G E  +   H N+  T+                       E +P+SN+  VYP
Sbjct: 179  VIQP---GLEPHSTAFHNNDCPTS----KFLFSSENCPPSGNKCLQMEVYPASNVCAVYP 231

Query: 1064 LYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASN 1234
            LY G   Q  E Q GF +   P SN            +EP     +Q+LF    A++ + 
Sbjct: 232  LYDGNQLQCEESQCGFGVQSHPKSN-----------PMEPAGMGTIQNLF--SYAIDPTK 278

Query: 1235 SMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414
               + D     EN P+I CDLSLRLG +S P  +    WP+E EDVG           DL
Sbjct: 279  KPSQTDFGHVTENSPKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDL 338

Query: 1415 SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKP 1594
            S ++D +F FFPR N DDPL+SC SKRSSEG+ LN E + RKRK   S  +   QF  +P
Sbjct: 339  SPQVDKQFPFFPRGNTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDRQFCCQP 398

Query: 1595 KFSQNICFSRMKKPD 1639
            K   N    RM+  +
Sbjct: 399  KLPYNYLPGRMRNAE 413


>ref|XP_002285150.2| PREDICTED: uncharacterized protein LOC100266444 [Vitis vinifera]
          Length = 414

 Score =  361 bits (926), Expect = 7e-97
 Identities = 210/432 (48%), Positives = 254/432 (58%), Gaps = 8/432 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV EIHSSATKKNKEW+EKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEYMD++TLWDR NDAIN              LQPCIEA+L LGC  R
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLNLGCPQR 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGS---SHFLPHHSS--NPTSSP 883
             ASRSQRNNN RCYL+P+TQEP S+ P   + +N  QG     S  +  +++   P+S  
Sbjct: 121  RASRSQRNNNPRCYLTPSTQEPISISP--SILENSPQGNHTTISQVMSRYATFIKPSSMS 178

Query: 884  KFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYP 1063
               P   G E  +   H N+  T+                       E +P+SN+  VYP
Sbjct: 179  VIQP---GLEPHSTAFHNNDCPTS----KFLFSSENCPPSGNKCLQMEVYPASNVCAVYP 231

Query: 1064 LYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASN 1234
            LY G   Q  E Q GF +   P SN            +EP     +Q+LF    A++ + 
Sbjct: 232  LYDGNQLQCEESQCGFGVQSHPKSN-----------PMEPAGMGTIQNLF--SYAIDPTK 278

Query: 1235 SMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414
               + D     EN P+I CDLSLRLG +S P  +    WP+E EDVG           DL
Sbjct: 279  KPSQTDFGHVTENSPKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDL 338

Query: 1415 SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKP 1594
            S ++D +F FFPR N DDPL+SC SKRSSEG+ LN E + RKRK   S  +   QF  +P
Sbjct: 339  SPQVDKQFPFFPRGNTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDRQFCCQP 398

Query: 1595 KFSQNICFSRMK 1630
            K   N    RM+
Sbjct: 399  KLPYNYLPGRMR 410


>ref|XP_007011854.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508782217|gb|EOY29473.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 447

 Score =  330 bits (845), Expect = 2e-87
 Identities = 210/440 (47%), Positives = 244/440 (55%), Gaps = 14/440 (3%)
 Frame = +2

Query: 353  LKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVV 532
            LKMPRPGPRPY C RRAWHSDRHQPMRGSLI+EIFRVV EIHSSATKKNKEW+EKLPVVV
Sbjct: 41   LKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVV 100

Query: 533  LKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCT 712
            LKAEEIMYSKANSEAEYMD+++LWDRTNDAIN             LLQPCIEAAL LGCT
Sbjct: 101  LKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCT 160

Query: 713  ARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM 892
             R   RSQRN N RCYLSP TQE           +N TQ           +N T++P FM
Sbjct: 161  PRRTLRSQRNCNPRCYLSPGTQE----------AENTTQ-----------ANLTTNPNFM 199

Query: 893  PYY-------------LGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAH 1033
              Y             LGSES   I   +N TT                        E +
Sbjct: 200  ASYSGFMKSTIMNVTHLGSESQKHIAQDSNCTT---YKFPFASENGPLPSNSQCLPMEKY 256

Query: 1034 PSSNLARVYPLYYGKLQQTTEPQFGFQI-PHSNAASLGTPHVQFSVEPTEKSFLQDLFPR 1210
            P  NL  VYPLYYG   +  E Q GF I P S         +  +VEP +   + +LF  
Sbjct: 257  PPPNLYSVYPLYYGNHLKFEEMQHGFGIFPKS---------ISNTVEPAKMGVIDNLFSS 307

Query: 1211 DEAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXX 1390
            D  V++SN+M + D+S T  NP E  CDLSLRLG +S P  + G   P+ +ED G     
Sbjct: 308  D--VDSSNNMNQTDVSNTSNNPHENACDLSLRLGPLSIPCLSVGKSRPQVIEDTGSTSLE 365

Query: 1391 XXXXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVV 1570
                  DL+  ID     FPR N DDPL S  ++ S EG+ +N + + RKRK       V
Sbjct: 366  WNRFG-DLTPSIDKMLSSFPRSNRDDPLNSSLNRWSLEGEHVNVDATMRKRKTVYGP-TV 423

Query: 1571 AGQFSQKPKFSQNICFSRMK 1630
              QF   PK   +    RMK
Sbjct: 424  DQQFCLPPKLPYSHLTGRMK 443


>ref|XP_002282244.1| PREDICTED: uncharacterized protein LOC100250879 [Vitis vinifera]
          Length = 424

 Score =  313 bits (803), Expect = 1e-82
 Identities = 187/419 (44%), Positives = 228/419 (54%), Gaps = 6/419 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRVV + HSSATKKN+EW+EKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRVVTDTHSSATKKNREWQEKLPIVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSE EYMD+ TLWDR NDA+N             LL PCIEAAL LGC   
Sbjct: 61   AEEIMYSKANSETEYMDLGTLWDRVNDAVNTIIRRDESTETGELLPPCIEAALNLGCVPV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM-- 892
             ASRSQR+NN R YL+  TQEPTSV P  RV DN          P  + N  +  +    
Sbjct: 121  RASRSQRHNNPRSYLTHRTQEPTSVSP--RVLDNAVNERCPQLQPPSAGNQLTFGRLNMD 178

Query: 893  PYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYY 1072
              +L  +S   +T  N+  TT +                     E +   N   VYPLYY
Sbjct: 179  STHLVLDSDRHVTQNNSLATTRN---FHFPYENFPLGSNQSMTVETNTPLNFGSVYPLYY 235

Query: 1073 GKLQQTTEPQFGFQIP---HSNAASLGTPHVQFSVEPTEKS-FLQDLFPRDEAVNASNSM 1240
            G   Q  E   GFQ+P   ++N   +G P      EP+E    LQ+LF  D   N  N  
Sbjct: 236  GTHFQNEESHLGFQMPETANANTVFVGAPIGTSIAEPSEMGIILQNLFSSDGTENVLNKN 295

Query: 1241 IRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSL 1420
             + +   T    P  +CDLSLRLG  S P          + EDVG            LS 
Sbjct: 296  AQENFRDTCGKEPVAECDLSLRLGLSSDPCMRKEKCSAPDTEDVGSSSSQEGAKVSGLSP 355

Query: 1421 RIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPK 1597
                 FCFFP +  + P  SCS+K +S  +G N + + RKRK P ++ +  GQF   P+
Sbjct: 356  GKSKGFCFFPSETANSPFGSCSNKWNSGDEGQNMDATVRKRKAPFNNDLEGGQFFLSPE 414


>gb|EXB96866.1| hypothetical protein L484_016640 [Morus notabilis]
          Length = 409

 Score =  300 bits (768), Expect = 1e-78
 Identities = 188/425 (44%), Positives = 225/425 (52%), Gaps = 8/425 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRV  E HS+ TKKNKEW+EKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPMRGSVIQQIFRVANEAHSATTKKNKEWQEKLPIVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEY +++TLWDR NDAIN             LL PC+EAAL LGC   
Sbjct: 61   AEEIMYSKANSEAEYTNLDTLWDRVNDAINTIIRREETTETGDLLPPCVEAALNLGCIPV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTS---SPKF 889
             ASRSQR++N R YL+    EP S    +RV D  +       LP HS N  +   +P  
Sbjct: 121  RASRSQRHSNPRTYLTARAHEPFSA--GTRVLDRTSDERRPQLLPLHSGNQLTFARAPIA 178

Query: 890  MPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069
             P    SES T +   NN  T P                      + + S NL  VYPLY
Sbjct: 179  NPANFLSESNTHVNRNNNNLTAPR--SHAFSPENVVSGHSQATTIDTNASLNLGSVYPLY 236

Query: 1070 YGKLQQTTEPQFGFQIP---HSNAASLGTPHV--QFSVEPTEKSFLQDLFPRDEAVNASN 1234
            +G   +T +   G  IP   HS    +GTP V    + EPT    +QD F R +A+    
Sbjct: 237  HGTNYRTEKYHSGSPIPENVHSKTIYVGTPVVTPAATAEPT----MQDCFTRVDAM---- 288

Query: 1235 SMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414
                     T ENP E +CDLSLRL   S P+      +  E EDVG           D+
Sbjct: 289  --------GTQENPQEAECDLSLRLSLFSNPFGRTQKNFATETEDVGSSSSQDAGKVNDV 340

Query: 1415 SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKP 1594
               +  EFCFFP  +  D  ES S   +S G+G N E   RK K   S     GQF  +P
Sbjct: 341  RQSMGREFCFFPGKSACDLSESSSRMWNSGGEGQNLEAFVRKGKETFSSNEKDGQFCWQP 400

Query: 1595 KFSQN 1609
                N
Sbjct: 401  GVPSN 405


>ref|XP_007011852.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782215|gb|EOY29471.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 417

 Score =  298 bits (762), Expect = 7e-78
 Identities = 197/440 (44%), Positives = 230/440 (52%), Gaps = 14/440 (3%)
 Frame = +2

Query: 353  LKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVV 532
            LKMPRPGPRPY C RRAWHSDRHQPMRGSLI+EIFRVV EIHSSATKKNKEW+EKLPVVV
Sbjct: 41   LKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVV 100

Query: 533  LKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCT 712
            LKAEEIMYSKANSEAEYMD+++LWDRTNDAIN             LLQPCIEAAL LGCT
Sbjct: 101  LKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCT 160

Query: 713  ARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM 892
             R   RSQRN N RCYLSP TQE           +N TQ           +N T++P FM
Sbjct: 161  PRRTLRSQRNCNPRCYLSPGTQE----------AENTTQ-----------ANLTTNPNFM 199

Query: 893  PYY-------------LGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAH 1033
              Y             LGSES   I   +N TT                        E +
Sbjct: 200  ASYSGFMKSTIMNVTHLGSESQKHIAQDSNCTT---YKFPFASENGPLPSNSQCLPMEKY 256

Query: 1034 PSSNLARVYPLYYGKLQQTTEPQFGFQI-PHSNAASLGTPHVQFSVEPTEKSFLQDLFPR 1210
            P  NL  VYPLYYG   +  E Q GF I P S         +  +VEP +   + +LF  
Sbjct: 257  PPPNLYSVYPLYYGNHLKFEEMQHGFGIFPKS---------ISNTVEPAKMGVIDNLFSS 307

Query: 1211 DEAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXX 1390
            D  V++SN+M + D+S T  NP E  CDLSLRLG +S P  + G   P+ +ED G     
Sbjct: 308  D--VDSSNNMNQTDVSNTSNNPHENACDLSLRLGPLSIPCLSVGKSRPQVIEDTGSTSLE 365

Query: 1391 XXXXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVV 1570
                                            ++ S EG+ +N + + RKRK       V
Sbjct: 366  W-------------------------------NRWSLEGEHVNVDATMRKRKTVYGP-TV 393

Query: 1571 AGQFSQKPKFSQNICFSRMK 1630
              QF   PK   +    RMK
Sbjct: 394  DQQFCLPPKLPYSHLTGRMK 413


>ref|XP_004288790.1| PREDICTED: uncharacterized protein LOC101293823 [Fragaria vesca
            subsp. vesca]
          Length = 421

 Score =  293 bits (751), Expect = 1e-76
 Identities = 182/423 (43%), Positives = 230/423 (54%), Gaps = 6/423 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQPMRGS+I+++FR V E+HS+ TK NKEW+EKLP+VV K
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQMFRAVNEVHSAKTKNNKEWQEKLPMVVFK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEY++ +TLWDR NDAIN             LL PC+EAAL LGC A 
Sbjct: 61   AEEIMYSKANSEAEYINSDTLWDRANDAINTIIRREEGNETGDLLPPCVEAALNLGCVAV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFL-PHHSSNPTSSPKFMP 895
             ASRSQR++N R YL P  QEP S  PP+RV D  +      F  PHH  N ++  +  P
Sbjct: 121  RASRSQRHSNPRSYLMPRPQEPPS--PPTRVLDRPSDERRPPFSPPHHPGNQSNFAR--P 176

Query: 896  YYLGSESCTP--ITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069
              + S    P  ++HAN  +                           +   NL  VYPLY
Sbjct: 177  STVNSAHLVPESLSHANQSSNLSSPRHYPFSVENVPGGHNQITTISTNNQLNLGSVYPLY 236

Query: 1070 YGKLQQTTEPQFGFQIP---HSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSM 1240
            +G    T  PQ   Q+P   HS    +GTP V    EPT+K     +F    A N S+ +
Sbjct: 237  HGFSYPTEAPQ--LQVPENVHSRTIYVGTP-VTSIQEPTKK----HIFTSQRAENVSHRI 289

Query: 1241 IRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSL 1420
             + D+    E P +   DLSLRLG +S   ++       ++ED+G           + S 
Sbjct: 290  PQVDVMDIQEKPRDEGYDLSLRLGPVSHLCTDRSL--ASQMEDIGSSNSQEGGKLHNYSP 347

Query: 1421 RIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKF 1600
             I  EFCFFP     DP ES S+  +SEG+  + E + RKRK    +    GQF  +P  
Sbjct: 348  SISKEFCFFPTKTAYDPSESTSNMWNSEGEDRSLEATLRKRKATFRNNEEDGQFFSQPPG 407

Query: 1601 SQN 1609
              N
Sbjct: 408  QPN 410


>ref|XP_007011853.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508782216|gb|EOY29472.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 359

 Score =  290 bits (742), Expect = 1e-75
 Identities = 178/349 (51%), Positives = 203/349 (58%), Gaps = 14/349 (4%)
 Frame = +2

Query: 353  LKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVV 532
            LKMPRPGPRPY C RRAWHSDRHQPMRGSLI+EIFRVV EIHSSATKKNKEW+EKLPVVV
Sbjct: 41   LKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVV 100

Query: 533  LKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCT 712
            LKAEEIMYSKANSEAEYMD+++LWDRTNDAIN             LLQPCIEAAL LGCT
Sbjct: 101  LKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCT 160

Query: 713  ARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM 892
             R   RSQRN N RCYLSP TQE           +N TQ           +N T++P FM
Sbjct: 161  PRRTLRSQRNCNPRCYLSPGTQE----------AENTTQ-----------ANLTTNPNFM 199

Query: 893  PYY-------------LGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAH 1033
              Y             LGSES   I   +N TT                        E +
Sbjct: 200  ASYSGFMKSTIMNVTHLGSESQKHIAQDSNCTT---YKFPFASENGPLPSNSQCLPMEKY 256

Query: 1034 PSSNLARVYPLYYGKLQQTTEPQFGFQI-PHSNAASLGTPHVQFSVEPTEKSFLQDLFPR 1210
            P  NL  VYPLYYG   +  E Q GF I P S         +  +VEP +   + +LF  
Sbjct: 257  PPPNLYSVYPLYYGNHLKFEEMQHGFGIFPKS---------ISNTVEPAKMGVIDNLFSS 307

Query: 1211 DEAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPR 1357
            D  V++SN+M + D+S T  NP E  CDLSLRLG +S P  + G   P+
Sbjct: 308  D--VDSSNNMNQTDVSNTSNNPHENACDLSLRLGPLSIPCLSVGKSRPQ 354


>ref|XP_006450882.1| hypothetical protein CICLE_v10008357mg [Citrus clementina]
            gi|568843993|ref|XP_006475881.1| PREDICTED:
            uncharacterized protein LOC102614540 [Citrus sinensis]
            gi|557554108|gb|ESR64122.1| hypothetical protein
            CICLE_v10008357mg [Citrus clementina]
          Length = 430

 Score =  288 bits (736), Expect = 7e-75
 Identities = 183/440 (41%), Positives = 225/440 (51%), Gaps = 14/440 (3%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHS+RHQPMRGS I++IFRV  E HS+ TKKNKEW+EKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSERHQPMRGSTIQQIFRVADEFHSTQTKKNKEWQEKLPIVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEI+YSKANSE EYM+++TL DR NDA+N             LL PC+EAAL LGC   
Sbjct: 61   AEEILYSKANSEDEYMNLDTLRDRVNDAVNTIIRRDESTETGELLPPCVEAALNLGCIPV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNP--------- 871
             ASRSQR+++ R YL+   QEP  +PP  ++ D   +     F PHHS N          
Sbjct: 121  RASRSQRHSHPRTYLNLRPQEPAPLPP--KIVDKTIEDQCPRFSPHHSGNQFNFSRSFKN 178

Query: 872  TSSPKFMPYYLGSESCTPITHAN--NPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSN 1045
             +S   +P      SC  I + N   P++ P                      +A+    
Sbjct: 179  ANSTTLVP----ESSCQVIGNDNLAAPSSYP------SSYENIPSRHSKMMRVDANVQLK 228

Query: 1046 LARVYPLYYGKLQQTTEPQFGFQIP---HSNAASLGTPHVQFSVEPTEKSFLQDLFPRDE 1216
            L  VYPLYYG   QT + + G  I    +S    +G P      EP E   L  L+    
Sbjct: 229  LGSVYPLYYGTRFQTEDTKLGSGISENVNSTTIFVGKPIGTSIPEPGEMGALPRLYSCSG 288

Query: 1217 AVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXX 1396
            +  AS    + D       P    CDLSLRLG    P  +      RE EDVG       
Sbjct: 289  SDTASKPTTKPDFLELQRRPHVTGCDLSLRLGLSGDPCMSLDRSSARETEDVGSIRSQEG 348

Query: 1397 XXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAG 1576
                DLS   + EFCFFP    D P ESCSSKR  E +  N E S RK K   SD +   
Sbjct: 349  NKLKDLSSERNKEFCFFPEKTADKPHESCSSKRFPEHECRNLEASIRKPKALFSDNLEDV 408

Query: 1577 QFSQKPKFSQNICFSRMKKP 1636
            QF  +P    N    +++ P
Sbjct: 409  QFCWQPGPPSNQFTGQIRGP 428


>ref|XP_002309630.1| hypothetical protein POPTR_0006s27080g [Populus trichocarpa]
            gi|222855606|gb|EEE93153.1| hypothetical protein
            POPTR_0006s27080g [Populus trichocarpa]
          Length = 407

 Score =  288 bits (736), Expect = 7e-75
 Identities = 193/428 (45%), Positives = 236/428 (55%), Gaps = 4/428 (0%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFR+V E HSS TKKNKEW+EKLPVVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHSSTTKKNKEWQEKLPVVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEYM+++TLWDRTNDAIN             LLQPCIEAAL LGCT R
Sbjct: 61   AEEIMYSKANSEAEYMELKTLWDRTNDAINTIIRRDESTEIGELLQPCIEAALNLGCTPR 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPS-RVPDNVTQGGSSHFLPHHSS--NPTSSPKF 889
             ASRSQRN N   YLSP+TQEP ++   S        +  +SH LP++SS   P      
Sbjct: 121  RASRSQRNCNPSFYLSPSTQEPNTLSSGSVHSAIQANRTSNSHVLPNYSSMVKPIIMNST 180

Query: 890  MPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069
             P   GSES   +  +N   T+                          PS  L  VYPLY
Sbjct: 181  PP---GSESQDFVGQSNG--TSNRFLFIDDSIPLSNANQCLPLGNYRIPS--LCSVYPLY 233

Query: 1070 YGKLQQTTEPQFGF-QIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMIR 1246
            YG      EPQ G   +P +   ++         EP + + +Q+ FP +E  +       
Sbjct: 234  YG---CCLEPQRGCGALPKTFPGTM---------EPVKVAVMQNFFPCNE--DTPVKTCH 279

Query: 1247 GDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRI 1426
             D   +   P EI CDLSLRLG++ AP  +  T+  ++ +D G           D   ++
Sbjct: 280  ADHKDSPLQPQEIGCDLSLRLGSLPAPMLSVKTKQLKDAKDGGHDCSQEGGKVDDWMPQV 339

Query: 1427 DNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKFSQ 1606
            D E  FF R NV DPL S SSK     + +N + + +KRK    D  V  QF  +PK   
Sbjct: 340  DKELPFFTRVNVADPLVSHSSK---SREHVNIDETKKKRKA-VLDHHVEDQFCWQPKLHC 395

Query: 1607 NICFSRMK 1630
            N    RMK
Sbjct: 396  NQLTCRMK 403


>ref|XP_006600747.1| PREDICTED: uncharacterized protein LOC100806760 [Glycine max]
          Length = 405

 Score =  287 bits (734), Expect = 1e-74
 Identities = 178/424 (41%), Positives = 224/424 (52%), Gaps = 2/424 (0%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHS+RHQP+RGS+I++IFRVV + HS ATKKNKEW+EKLPVVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSERHQPVRGSIIQQIFRVVNDAHSPATKKNKEWQEKLPVVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEY++ +TLWDR NDA+N             LL PC+EAAL LGC A 
Sbjct: 61   AEEIMYSKANSEAEYLNPDTLWDRLNDAVNTIIRRDETTETGGLLPPCVEAALNLGCKAV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898
              SRS R+NN R YLSP  Q+P  +PP     + +             + P  SP     
Sbjct: 121  RTSRSDRHNNPRTYLSPRIQQPPCLPPKPVAGNPLNYA--------KVTTPAVSP----- 167

Query: 899  YLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYYGK 1078
                    P+  + +P +                        EA PS NL  VYPLYYG 
Sbjct: 168  -------IPVPDSIHPNSRLMGSSKYPFSEGIPSGHHQPLTMEARPSLNLGSVYPLYYGY 220

Query: 1079 LQQTTEP-QFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMIRGDL 1255
              +  EP         S+   +G P +  SV       L + F      + +N M +   
Sbjct: 221  EAREPEPWTTATDTTCSDTIFVGRPVI--SVPEPSGIGLSENFSYGTFHHVANRMRKETA 278

Query: 1256 SATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRIDNE 1435
              T E  P+ +CDLSLRLG    P S++ +    EV+DVG            LSL+ + E
Sbjct: 279  VGTQEAAPDRECDLSLRLGQCLHPCSSSKSSSAYEVDDVGLGVSPESCKFSHLSLQRNRE 338

Query: 1436 FCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVP-SSDLVVAGQFSQKPKFSQNI 1612
            FCF+PR+     +ES S K + EG+ LN E + RKRK P   +    GQF + P    N 
Sbjct: 339  FCFYPRETGYGTIESTSGKCNVEGEDLNLEATLRKRKAPLCGNNEEDGQFCRNPGVPSNR 398

Query: 1613 CFSR 1624
              SR
Sbjct: 399  FTSR 402


>ref|XP_007202061.1| hypothetical protein PRUPE_ppa006809mg [Prunus persica]
            gi|462397592|gb|EMJ03260.1| hypothetical protein
            PRUPE_ppa006809mg [Prunus persica]
          Length = 394

 Score =  286 bits (732), Expect = 2e-74
 Identities = 177/427 (41%), Positives = 215/427 (50%), Gaps = 1/427 (0%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRVV E+H S TKKNKEW+EKLP+VV K
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRVVSEVHGSVTKKNKEWQEKLPMVVFK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEYM++ETLWDR NDA+N             LL PC+EAAL LGC   
Sbjct: 61   AEEIMYSKANSEAEYMNLETLWDRVNDAVNTIIRRDEGTETGELLPPCVEAALNLGCIPV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898
             ASRSQR++N R YL+   QEP S   P+R+ D  T      F PH S N  +  K    
Sbjct: 121  RASRSQRHSNPRIYLTSRAQEPPSA--PTRILDRTTDERRPQFPPHRSGNQLNFAKASTG 178

Query: 899  YLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYYGK 1078
                      +  N  T                           + S +L  VYPLYYG 
Sbjct: 179  NSAHSVPESYSRINQNTNLNSRRNDPFSRENLPAGHNQLTTMSTNNSLDLGSVYPLYYG- 237

Query: 1079 LQQTTEPQFGFQIPHSNAASLGTPHVQF-SVEPTEKSFLQDLFPRDEAVNASNSMIRGDL 1255
                                    H QF   +PT+     +LF    A N S+ + + ++
Sbjct: 238  -----------------------AHYQFEECQPTK----HNLFSSQTAENVSHRITQVEV 270

Query: 1256 SATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRIDNE 1435
                + P E +CDLSLRLG +  P          E ED+G           DLS  I  E
Sbjct: 271  ---HDKPLETECDLSLRLGPVLHPCIQRSL--ASETEDIGSSSSQDGGKLNDLSPSISKE 325

Query: 1436 FCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKFSQNIC 1615
             CFFP     D  ES SS  +SEG+G + E + RKRK P       G+F ++P    N  
Sbjct: 326  ICFFPTKTACDRFESTSSMWNSEGEGRSLEATVRKRKAPFCSNEEDGKFCEQPDVLPNRL 385

Query: 1616 FSRMKKP 1636
              R   P
Sbjct: 386  TDRTTGP 392


>ref|XP_004139410.1| PREDICTED: uncharacterized protein LOC101205660 [Cucumis sativus]
          Length = 423

 Score =  285 bits (729), Expect = 5e-74
 Identities = 178/412 (43%), Positives = 218/412 (52%), Gaps = 14/412 (3%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRVV E H+ ATKKNKEW+EKLP+VV +
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRVVNENHTPATKKNKEWQEKLPIVVFR 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSE EYM++ETLWDR NDA+N             LL PC+EAAL LGC   
Sbjct: 61   AEEIMYSKANSEVEYMNLETLWDRLNDAVNTIIRRDESSESGELLPPCVEAALNLGCVPV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM-- 892
             ASRSQR++N R YL+P  QEPTS      +   + +      LP    +P +   F   
Sbjct: 121  RASRSQRHSNPRTYLTPRGQEPTST-----LATTLNKATDERRLPVSPLHPGNQLNFARA 175

Query: 893  ----PYYLGSESCTPITHANNPT--TTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLAR 1054
                  +  SE  + I   NNPT  +TP                      E +   NL  
Sbjct: 176  KSMNSSFFASERSSQIKQHNNPTIPSTP-----AFLIENVPVVHNNYSMTETNTPLNLGS 230

Query: 1055 VYPLYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVN 1225
            VYPLYYG   QT EP    QI    +     LG P +  S EP E S          +  
Sbjct: 231  VYPLYYGIRCQTEEPNLSSQISADANQQTIFLGRPIIS-SAEPAEHSL--------RSYK 281

Query: 1226 ASNSMIRGD---LSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXX 1396
              N+M R     ++A  E  P+ +CDLSLRLG  S P  ++   W  E  DV        
Sbjct: 282  TGNAMSRFPSEFITAREEKLPDTECDLSLRLGVPSLPCVSSRKTWALETGDVAPSSSRER 341

Query: 1397 XXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVP 1552
                D +   + EF FFP     +   SCS+  SS+G G  SE S++KRK P
Sbjct: 342  HQFHDQTTYANEEFSFFPTRTEFERFGSCSNMWSSDGGGQISESSTKKRKEP 393


>ref|XP_002324862.2| hypothetical protein POPTR_0018s01770g [Populus trichocarpa]
            gi|550317816|gb|EEF03427.2| hypothetical protein
            POPTR_0018s01770g [Populus trichocarpa]
          Length = 448

 Score =  283 bits (723), Expect = 2e-73
 Identities = 191/433 (44%), Positives = 238/433 (54%), Gaps = 8/433 (1%)
 Frame = +2

Query: 356  KMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVL 535
            KMPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFR+V E H  ATKKNKEW+EKLPVVVL
Sbjct: 41   KMPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHCPATKKNKEWQEKLPVVVL 100

Query: 536  KAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTA 715
            KAEEIMYSKANSEAEYMD++TLWDR NDAIN             LLQPCIEAAL LGCT 
Sbjct: 101  KAEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESLETGELLQPCIEAALNLGCTP 160

Query: 716  RSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQG--GSSHFLPHHSS--NPT--- 874
            R ASRSQRN N R YLSP+TQE  ++ P + V + +      +SH L  +S+   PT   
Sbjct: 161  RRASRSQRNCNLRFYLSPSTQESNTLSPAA-VHNAIRANHISNSHCLRDYSNLVKPTIMN 219

Query: 875  SSPKFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLAR 1054
            S+P       GSES   +   N+ +                         E +   +L  
Sbjct: 220  SAPS------GSESQDLVGQGNDTSNR----FLFRSDNIPPSNVNRCLPLENYRIPSLCS 269

Query: 1055 VYPLYYGKLQQTTEPQFGF-QIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNAS 1231
            VYPLYYG      EPQ G   +P +   +         +EP +   +Q+ FP +E     
Sbjct: 270  VYPLYYGSC---LEPQRGCGALPKTFPGT---------IEPVKVVAVQNFFPCNEDTPVR 317

Query: 1232 NSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCD 1411
             S + G        P EI+CDLSLRLG+I AP   A T+  ++ +D G           D
Sbjct: 318  TSQV-GHKDCL--QPQEIECDLSLRLGSILAPVPRAKTKQIKDAKDGGHDCSQEGGKFDD 374

Query: 1412 LSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQK 1591
               ++D E  FFP+ +V DP  S SSK     + +  + + +KRK+     V   QF  +
Sbjct: 375  WMPQMDKELSFFPKVDVVDPQVSHSSK---SREHIIVDVTMKKRKLVFDHHVEDQQFLWQ 431

Query: 1592 PKFSQNICFSRMK 1630
            PK   N    RMK
Sbjct: 432  PKLPCNKLTGRMK 444


>ref|XP_007222724.1| hypothetical protein PRUPE_ppa006474mg [Prunus persica]
            gi|462419660|gb|EMJ23923.1| hypothetical protein
            PRUPE_ppa006474mg [Prunus persica]
          Length = 410

 Score =  277 bits (708), Expect = 1e-71
 Identities = 179/416 (43%), Positives = 225/416 (54%), Gaps = 3/416 (0%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPR GPRPYECVRRAWHS+RHQPMRGSLIKEIFRVV EIHSSAT+KNKEW++KLP+VVLK
Sbjct: 1    MPRSGPRPYECVRRAWHSERHQPMRGSLIKEIFRVVNEIHSSATRKNKEWQDKLPIVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEYMD++TLWDRTNDAIN              LQPCIEAAL LGC  R
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRTNDAINTIIRRDEGTETGDFLQPCIEAALNLGCIPR 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQ---GGSSHFLPHHSSNPTSSPKF 889
              SRSQR+ N  CYL P T +   + P   V +N +Q     +S + PH  +     PK 
Sbjct: 121  RTSRSQRHANPSCYLIPITSDVPGISP--SVVENASQRDYTSNSQYRPHCPN--FVKPKS 176

Query: 890  MPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069
            M   LG ES  P+   N+ TT                        E+  +SN +  YPL+
Sbjct: 177  MTTQLGFESRFPVVQNNDCTT---MKFRIASENIPPSGYDQFSPRESMATSNFSS-YPLH 232

Query: 1070 YGKLQQTTEPQFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMIRG 1249
            Y    Q  E + GF I       L  P V   +EP +   + +L    +    SN   + 
Sbjct: 233  YRNFPQFEELKPGFVI-------LPKP-VSDPIEPAKMGVISNLLCNGD---KSNDNTQT 281

Query: 1250 DLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRID 1429
            D     ENP  + CDLSLRLG +S  +S      P EV+DVG               + D
Sbjct: 282  DTRDYTENPCTVGCDLSLRLGPLSTQHSIGENSQPEEVKDVGAQEGTMCSD--QSQPQFD 339

Query: 1430 NEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPK 1597
                F  + N   P +S SS+ + EG+ +N + + RKRK   +      +F ++P+
Sbjct: 340  RRPSFIGKGNEYGPRDSYSSRLNFEGEYMNVQATMRKRKAAFNHPTGDTKFYRQPE 395


>ref|XP_002527188.1| conserved hypothetical protein [Ricinus communis]
            gi|223533453|gb|EEF35201.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 373

 Score =  276 bits (706), Expect = 2e-71
 Identities = 165/377 (43%), Positives = 197/377 (52%), Gaps = 6/377 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQPMRGS+I +IFRVV E HS+ TKKNKEW+EKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPMRGSVIHQIFRVVSETHSAITKKNKEWQEKLPIVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEYM+ +TLWDR NDAIN             LL PCIEAAL LGC   
Sbjct: 61   AEEIMYSKANSEAEYMNPDTLWDRVNDAINTIIRRDESNETGELLPPCIEAALNLGCIPV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898
             ASRSQR++N R YLSP   EP  VP   R+ +           P  SS   S   F   
Sbjct: 121  RASRSQRHSNPRSYLSPRMHEP--VPAALRIVERANDKQCPQLSPPQSS---SQLNFARP 175

Query: 899  YLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXX----EAHPSSNLARVYPL 1066
                 S  P++ +N   T                            E +   NL  VYPL
Sbjct: 176  TTAVNSTLPVSESNCHLTESSNIDASCSYPLLYDNISLGSSQLMSKEINKQLNLGSVYPL 235

Query: 1067 YYGKLQQTTEPQFGFQIP--HSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSM 1240
            YYG      +P    Q+P  +SN   +GTP    + EP E S   D      A  ++  +
Sbjct: 236  YYGNNYHIKQPHLASQVPEKNSNTIFVGTPISTSAAEPAEMSVFHDFLTCPSAEISAKRI 295

Query: 1241 IRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSL 1420
             + DL  T E P  ++CDLSLRLG  +    N      +E EDVG           +  L
Sbjct: 296  SQADLGNTHEKPSGVQCDLSLRLGLFTDLSVNMKRSLVQETEDVGSSNSQDRSKSSNFYL 355

Query: 1421 RIDNEFCFFPRDNVDDP 1471
            + + E  F P  N +DP
Sbjct: 356  QKNKELFFSPSRNTNDP 372


>ref|XP_002324645.2| hypothetical protein POPTR_0018s12880g [Populus trichocarpa]
            gi|550318626|gb|EEF03210.2| hypothetical protein
            POPTR_0018s12880g [Populus trichocarpa]
          Length = 395

 Score =  270 bits (691), Expect = 1e-69
 Identities = 170/416 (40%), Positives = 208/416 (50%), Gaps = 8/416 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRH+P+RGS+I +I R+ Y+ HS+ATK N+EW++KL +VV +
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHKPIRGSMIGQILRMAYDTHSAATKGNREWQDKLLLVVYR 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEY+  +TLWDR NDA+N             LL PCIEAAL LGC   
Sbjct: 61   AEEIMYSKANSEAEYVSQDTLWDRVNDAVNTIIRRDESTETGDLLPPCIEAALNLGCKVE 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNP-----TSSP 883
             ASRSQR+NN R YLSP TQEP SV P  R  D          +P HS NP      ++ 
Sbjct: 121  RASRSQRHNNPRSYLSPRTQEPASVAP--RAVDRTHDEQGPRLMPIHSINPLNFAARATT 178

Query: 884  KFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYP 1063
               P    SES   +  ++N    PH                     + H   N   VYP
Sbjct: 179  IVNPNLPVSESSHRLAESSN-AAPPHSCPILYENIPPGSDQLTTKEADMH--QNFGSVYP 235

Query: 1064 LYYGKLQQTTEPQFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMI 1243
            L+YG                           Q+ +E +      D+         SN+++
Sbjct: 236  LFYGD--------------------------QYQIEAS------DMVSEVSTRMNSNTIL 263

Query: 1244 RG---DLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414
             G   D     E P   +CDLSLRLG  S P  +      +E E VG             
Sbjct: 264  VGKPIDFRNIHEKPTGTQCDLSLRLGPCSDPCISTERNQAQENEIVGSSSSQERDKFSVF 323

Query: 1415 SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQF 1582
            S   + EFCFFP  +  DP ESC  K +SEG   N E + RKRK P SD V  G F
Sbjct: 324  SQHRNKEFCFFPSTSNRDPSESCPDKWASEGDVQNLEANIRKRKAPFSDNVEDGHF 379


>ref|XP_007013493.1| Uncharacterized protein TCM_038116 [Theobroma cacao]
            gi|508783856|gb|EOY31112.1| Uncharacterized protein
            TCM_038116 [Theobroma cacao]
          Length = 407

 Score =  267 bits (683), Expect = 1e-68
 Identities = 169/422 (40%), Positives = 215/422 (50%), Gaps = 5/422 (1%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHS+RHQP+RGS+I++I R+  + HS+ATKKNKEW++K+  V+ K
Sbjct: 1    MPRPGPRPYECVRRAWHSERHQPIRGSIIQQILRLAIDTHSTATKKNKEWQDKILTVIFK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSE+EYM+ ETLWDR NDAIN             LL PC+EAAL LGC   
Sbjct: 61   AEEIMYSKANSESEYMNPETLWDRVNDAINTIIRRDESTETGELLPPCVEAALNLGCHPV 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898
             ASRSQR+   R YL+P  QEP S  P  RV D   +GG               P+  P 
Sbjct: 121  RASRSQRHCIPRTYLTPRAQEPISAAP--RVLD---KGGEER-----------CPQLSPV 164

Query: 899  YLGSESCTPITHANN--PTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYY 1072
            + GS+     T+ N+    +  +                     E +   NL +VYPLYY
Sbjct: 165  HSGSQFTRIATNVNSNISVSQTNRHSYPFLSDNCPPGHDQLTRMETNTRPNLGQVYPLYY 224

Query: 1073 GKLQQTTEPQFGFQIPHSNAAS---LGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMI 1243
            G   Q  E Q G  +  + A+    +G P      +P E   LQ+LF   +       + 
Sbjct: 225  GIHYQNVESQTGSPVQENIASDNIIVGRPIGTSVAQPVEMGSLQNLFSSSDVDVGGKRIG 284

Query: 1244 RGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLR 1423
            + D+  T E     +CDLSLRLG  S P  +       E EDVG           +   +
Sbjct: 285  QQDIRHTNEKSFGTECDLSLRLGLFSDPCMHVEKNSIGETEDVGPSSSQEGGKVNEAFQQ 344

Query: 1424 IDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKFS 1603
               EFCFFP  NV+D  ES S K   + +G N   + RKRK          QF  +P  S
Sbjct: 345  KSKEFCFFPERNVNDHYESFSRKWIIDIEGRNLGATMRKRKATFGGNSEDEQFCWQPGPS 404

Query: 1604 QN 1609
             N
Sbjct: 405  SN 406


>ref|XP_002515508.1| conserved hypothetical protein [Ricinus communis]
            gi|223545452|gb|EEF46957.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 391

 Score =  263 bits (672), Expect = 2e-67
 Identities = 175/412 (42%), Positives = 213/412 (51%), Gaps = 16/412 (3%)
 Frame = +2

Query: 359  MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538
            MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV E+HS ATKKNKEW+EKLPVVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPVRGSLIQEIFRVVNEVHSPATKKNKEWQEKLPVVVLK 60

Query: 539  AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718
            AEEIMYSKANSEAEYMD++TLW+R ND IN             LL PCIEAAL LGCT R
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWERVNDVINTIVRRDESTETGDLLLPCIEAALNLGCTPR 120

Query: 719  SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898
              SRSQRN N RCYL+P TQEP ++PP                +P H+++    P ++  
Sbjct: 121  RTSRSQRNCNPRCYLTPGTQEPNTLPPA---------------MPTHTTSLQCIPNYLDL 165

Query: 899  ---------YLGSE----SCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPS 1039
                     +LGSE     C  I+  +N                           E +P 
Sbjct: 166  IKPAIVNSTHLGSELQNLVCQNISVTSN-------KFLLATDNGCLSNYNQSFPMENYPM 218

Query: 1040 SNLARVYPLYYGKLQQTTEPQFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFP--RD 1213
            S+L  VYPL YG +                        V  ++EP +    Q+LF    D
Sbjct: 219  SSLYSVYPLCYGLIP-----------------------VSSTLEPGKVGVEQNLFSFGDD 255

Query: 1214 EAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTIS-APYSNAGTRWPREVEDVGXXXXX 1390
             AV  +    +  L     N  E  CDLSLRLG++S AP  +   R  ++ ED G     
Sbjct: 256  AAVKFNQPDPQSPL-----NQHETGCDLSLRLGSLSAAPLPSDKNRQLQDFEDNGHGSFQ 310

Query: 1391 XXXXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRK 1546
                        D+E  F  R N D+ L+SCSSK S      +  G  +KRK
Sbjct: 311  EGIKFKTQMQHTDDELHFVTRLNTDNSLDSCSSKLSEHA---SVNGIIKKRK 359


Top