BLASTX nr result

ID: Mentha29_contig00018585 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00018585
         (1707 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   402   e-109
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   400   e-109
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     392   e-106
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   392   e-106
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              387   e-105
ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family prot...   387   e-105
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   387   e-105
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   384   e-104
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   383   e-103
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   379   e-102
ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618...   377   e-102
ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citr...   376   e-101
ref|XP_006448090.1| hypothetical protein CICLE_v10014588mg [Citr...   375   e-101
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   369   3e-99
gb|ABK95394.1| unknown [Populus trichocarpa]                          369   3e-99
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   364   7e-98
ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...   362   4e-97
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   362   4e-97
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...   355   3e-95
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   353   1e-94

>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
            gi|462422058|gb|EMJ26321.1| hypothetical protein
            PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  402 bits (1034), Expect = e-109
 Identities = 213/424 (50%), Positives = 273/424 (64%), Gaps = 24/424 (5%)
 Frame = -3

Query: 1477 EKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQL 1298
            +K NL + PK+F+  E  DGK VNV +GLK+YED   D+++ KL +LV+DLRAAGKR QL
Sbjct: 217  QKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVSKLVSLVNDLRAAGKRRQL 276

Query: 1297 PGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLL 1118
             GQT+V  KRPMKGHGREMIQLGIPIADAPPEDE++AG S+D KIEPIP  LQDVI+RL+
Sbjct: 277  QGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSKDRKIEPIPSLLQDVIDRLV 336

Query: 1117 TKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNY 938
              +V+++KPDS IID++NEGDHSQPH WP WFGRPVC + LT C+++FG+++  D PG+Y
Sbjct: 337  GMHVMTVKPDSCIIDVYNEGDHSQPHTWPSWFGRPVCALYLTECDMTFGRLLLMDHPGDY 396

Query: 937  XXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGEPHRF---- 770
                       S++ MQG+SADFA+HAIPSI+KQRILVTL K Q +K    +  RF    
Sbjct: 397  RGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQRILVTLTKSQPKKSTTSDGQRFPAPA 456

Query: 769  XXXXXXXXXXPSRTPGQIR-PAPAKHFGPLPSTGVXXXXXXXXXXXXPNG---MYVATPV 602
                      PSR+P  IR P   KH+  +P+TGV             NG   ++V  PV
Sbjct: 457  PAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGVLPAPPIRSQLPPQNGIQPLFVPAPV 516

Query: 601  APGIAYPAAVPLPPTSSAWPAALPRHPQPRLPVPGTGVFLPSQGQG--------PGNSTT 446
             P I + AAVP+PP S+ WPAA PRHP PR+P+PGTGVFLP  G G        PG +T 
Sbjct: 517  GPAIPFAAAVPIPPGSAGWPAA-PRHPPPRIPLPGTGVFLPPPGSGNSSAPQQLPGTATE 575

Query: 445  NQP----PSTENCATEDSVGKMNGSRLPPTKVDEEAAQKECNGS----GWGEILKEEEEN 290
              P    PS  +          + S  P  K D +A +++CNGS    G G    +EEE 
Sbjct: 576  MSPTVETPSPRDKDNGSGKSNHSTSASPKGKSDGKAQRQDCNGSAEGTGSGRTAVKEEEQ 635

Query: 289  ESHD 278
            +++D
Sbjct: 636  QTYD 639


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  400 bits (1029), Expect = e-109
 Identities = 231/472 (48%), Positives = 295/472 (62%), Gaps = 33/472 (6%)
 Frame = -3

Query: 1594 KSEENGSATRQRSTQGDVTQADVDAEDTGSCSVDGSGLA-------EKSNLEVSPKSFVA 1436
            KS EN   +R   ++   T+A+ D +D GSC++     A       EK N   SPK+FV 
Sbjct: 221  KSSENSEGSRCGISE---TEAN-DMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVG 276

Query: 1435 TEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLPGQTFVALKRPMKG 1256
            TE +DGK VNV +GLK+YE+LFDDS++ K  +LV+DLRAAGKRGQL GQTFV  KRPMKG
Sbjct: 277  TEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKG 336

Query: 1255 HGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAII 1076
            HGREMIQLG+PIADAP EDE   G S+D + E IP  LQDVI  L+   V+++KPD+ II
Sbjct: 337  HGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACII 396

Query: 1075 DIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNYXXXXXXXXXXXSVI 896
            D +NEGDHSQPHIWP WFGRPVC++ LT C+++FG++I AD PG+Y           S++
Sbjct: 397  DFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLL 456

Query: 895  SMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGEPHRF---XXXXXXXXXXPSRTP 725
             MQG+SADFA+HAIPS++KQRILVT  K Q +K +  +  R              PSR+P
Sbjct: 457  VMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSP 516

Query: 724  GQIR-PAPAKHFGPLPSTGV--XXXXXXXXXXXXPNGM---YVATPVAPGIAYPAAVPLP 563
              +R P   KH+G +P+TGV              PNGM   +V T VAP + +PA VPLP
Sbjct: 517  NHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLP 576

Query: 562  PTSSAWPAALPRHPQPRLPVPGTGVFLPSQGQGPGNSTTNQPPSTENCAT---------- 413
              S  WPAA PRHP PRLPVPGTGVFLP  G   GNS++ Q  STE  +T          
Sbjct: 577  TGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGS--GNSSSPQHISTEATSTSVETAAPTEK 634

Query: 412  EDSVGKMNGSR---LPPTKVDEEAAQKECNGS----GWGEILKEEEENESHD 278
            E+  GK + +     P  K+D +  ++ECNGS    G  E    +EE + +D
Sbjct: 635  ENGSGKSSSNSNTVSPKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQHND 686


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  392 bits (1007), Expect = e-106
 Identities = 219/486 (45%), Positives = 293/486 (60%), Gaps = 32/486 (6%)
 Frame = -3

Query: 1618 REVTEFNEKSEENGSATRQRSTQGDVT--QADVDAEDTGSCSVDGSGLA-------EKSN 1466
            +E  +   KS+E+G+     + +G V+  + +V A D G  S      +       E SN
Sbjct: 193  KEKNDSAAKSQEDGNVKSLGNFEGVVSGSEPEVHAVDDGCTSSSKENDSHSTPKQNENSN 252

Query: 1465 LEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLPGQT 1286
            L   PK+F   E +DGK VNV EGLK+YE+   D+++ KL  LV+DLR+AG+RG    QT
Sbjct: 253  LANVPKTFSGNEMFDGKPVNVVEGLKLYEEFCADTEVSKLVALVNDLRSAGERGHFQSQT 312

Query: 1285 FVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNV 1106
            +V  KRPMKGHGRE IQLG+PIADAP EDE++AG  +D + E IP  LQDV ERL++  V
Sbjct: 313  YVVSKRPMKGHGREKIQLGLPIADAPVEDEISAGTLKDRRTEAIPPLLQDVAERLVSMQV 372

Query: 1105 VSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNYXXXX 926
             ++KPDS IID +NEGDHSQPH+WP WFGRPVCV+ LT C+++FG++ + D PG+Y    
Sbjct: 373  ATVKPDSCIIDFYNEGDHSQPHLWPSWFGRPVCVLFLTECDMTFGRVFAIDHPGDYRGAL 432

Query: 925  XXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGEPHRF----XXXX 758
                   S+++MQG+SADFA+HAIPS+++QRILVT  K Q +K +  +  R         
Sbjct: 433  KLSLKPGSLLAMQGKSADFAKHAIPSLRRQRILVTFTKSQPKKSMPSDGQRMPSPGVAPS 492

Query: 757  XXXXXXPSRTPGQIRPAPAKHFGPLPSTGVXXXXXXXXXXXXPNG---MYVATPVAPGIA 587
                  PSR+P  IR    KH+ P+P+TGV            PNG   ++V  PVAP + 
Sbjct: 493  SHWGPQPSRSPNHIRHPGPKHYAPVPTTGVLQASPVRPQIPPPNGIQPLFVTAPVAPAMP 552

Query: 586  YPAAVPLPPTSSAWPAALPRHPQPRLPVPGTGVFLPSQGQGP---------GNSTTNQPP 434
            +PA VP+PP+SS W AA PRHP PRLPVPGTGVFLP  G G          GN T +   
Sbjct: 553  FPAPVPIPPSSSGWSAAPPRHPPPRLPVPGTGVFLPPPGSGGNSSGSQQVLGNDTNHTVE 612

Query: 433  STENCATEDSVGKMNG--SRLPPTKVDEEAAQKECNGS--GWGEIL---KEEEENESHDV 275
            +      E+  GK+N   +  P  KVD +  ++ECNGS  G G ++   KEE +  S + 
Sbjct: 613  TAAPPEKENGSGKLNHGMTASPKGKVDSKTQKQECNGSLDGSGSVISVTKEERQQSSDNT 672

Query: 274  AISGGA 257
            A S  A
Sbjct: 673  ATSKSA 678


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  392 bits (1006), Expect = e-106
 Identities = 230/509 (45%), Positives = 309/509 (60%), Gaps = 37/509 (7%)
 Frame = -3

Query: 1705 GGEMNGKDLNNGYYKSNQKLTEKMEK--------KEEREVTEFNEKSEENGSATRQRSTQ 1550
            G + +G        + N++ +EK E+        K E + + F E  ++ GS  +  +  
Sbjct: 167  GVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGS--KPHAGD 224

Query: 1549 GDVTQADVDAEDTGS------CSVDGSGLAEKSNLEVSPKSFVATEFYDGKLVNVAEGLK 1388
             +    DV+   T S      CS+      EK NL   PK+FV  E +DGK+VNV +GLK
Sbjct: 225  AESVTEDVNGGCTSSYKENDLCSIQNQN--EKQNLAAGPKTFVGNEMFDGKMVNVVDGLK 282

Query: 1387 VYEDLFDDSDILKLNNLVSDLRAAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADAP 1208
            +YE+LFDD ++L L +LV+DLRAAGKRGQL GQT+VA KRPMKGHGREMIQLG+PIADAP
Sbjct: 283  LYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIADAP 342

Query: 1207 PEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQ 1028
             +DE AAG S+D +IE IP  LQD IERL+   V+++KPDS IID++NEGDHSQP +WP 
Sbjct: 343  LDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMWPP 402

Query: 1027 WFGRPVCVISLTVCEISFGK-IISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIP 851
            WFG+PVC++ LT C+I+FG+ +I AD PG+Y           S++ MQG+SADFA+HA+P
Sbjct: 403  WFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHALP 462

Query: 850  SIQKQRILVTLVKPQSRKIVGGEPHRF----XXXXXXXXXXPSRTPGQIR-PAPAKHFGP 686
            S++KQRILVT  K    K    +  R               PSR+P +IR  A  KH+  
Sbjct: 463  SVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHYAV 522

Query: 685  LPSTGVXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQP 515
            +P+TGV             +G   ++V T VAP I++PA VP+PP S+ WPAA PRHP P
Sbjct: 523  IPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAA-PRHPPP 581

Query: 514  RLPVPGTGVFLPSQGQGPGNSTTNQPPSTE-NCATEDSV--GKMNGS-------RLPPTK 365
            RLPVPGTGVFLP  G G  +S      +TE N   E +    K NGS         P  +
Sbjct: 582  RLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTSPRGR 641

Query: 364  VDEEAAQKECNGS----GWGEILKEEEEN 290
            +D ++ +++CNGS    G G  L +EE++
Sbjct: 642  LDGKSPKQDCNGSVDGAGSGRALMKEEQH 670


>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  387 bits (995), Expect = e-105
 Identities = 227/496 (45%), Positives = 295/496 (59%), Gaps = 17/496 (3%)
 Frame = -3

Query: 1702 GEMNGKDLNNGYYKSNQKLTEKMEKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADVD 1523
            G++  KDL     K     T+ + K      ++ +E SE  GS      T+ +       
Sbjct: 191  GKLEDKDLAAAEEKKAG--TDAVAKPNANSCSKSSENSE--GSRCGISETEANDMDDGGT 246

Query: 1522 AEDTGSCSVDGSGLA-------EKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDD 1364
                GSC++     A       EK N   SPK+FV TE +DGK VNV +GLK+YE+LFDD
Sbjct: 247  LNPKGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDD 306

Query: 1363 SDILKLNNLVSDLRAAGKRGQLP-GQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAA 1187
            S++ K  +LV+DLRAAGKRGQL  GQTFV  KRPMKGHGREMIQLG+PIADAP EDE   
Sbjct: 307  SEVSKFVSLVNDLRAAGKRGQLQAGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVV 366

Query: 1186 GASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVC 1007
            G S+D + E IP  LQDVI  L+   V+++KPD+ IID +NEGDHSQPHIWP WFGRPVC
Sbjct: 367  GTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVC 426

Query: 1006 VISLTVCEISFGKIISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRIL 827
            ++ LT C+++FG++I AD PG+Y           S++ MQG+SADFA+HAIPS++KQRIL
Sbjct: 427  ILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRIL 486

Query: 826  VTLVKPQSRKIVGGEPHRF---XXXXXXXXXXPSRTPGQIR-PAPAKHFGPLPSTGV--X 665
            VT  K Q +K +  +  R              PSR+P  +R P   KH+G +P+TGV   
Sbjct: 487  VTFTKSQPKKTMASDGQRLLPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPA 546

Query: 664  XXXXXXXXXXXPNGM---YVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQPRLPVPGT 494
                       PNGM   +V T VAP + +PA VPLP  S  WPAA PRHP PRLPVPGT
Sbjct: 547  PAPPMRPQLPPPNGMQPLFVTTAVAPAMPFPAPVPLPTGSPGWPAAPPRHPPPRLPVPGT 606

Query: 493  GVFLPSQGQGPGNSTTNQPPSTENCATEDSVGKMNGSRLPPTKVDEEAAQKECNGSGWGE 314
            GVFLP  G   GNS++ Q  STE  +               T V+  A  ++ NGSG   
Sbjct: 607  GVFLPPPGS--GNSSSPQHISTEATS---------------TSVETAAPTEKENGSGKSS 649

Query: 313  ILKEEEENESHDVAIS 266
             + +EE+  + ++ ++
Sbjct: 650  TVTKEEQQHNDELKVA 665


>ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5
            [Theobroma cacao] gi|508709406|gb|EOY01303.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 5 [Theobroma cacao]
          Length = 572

 Score =  387 bits (994), Expect = e-105
 Identities = 230/510 (45%), Positives = 309/510 (60%), Gaps = 38/510 (7%)
 Frame = -3

Query: 1705 GGEMNGKDLNNGYYKSNQKLTEKMEK--------KEEREVTEFNEKSEENGSATRQRSTQ 1550
            G + +G        + N++ +EK E+        K E + + F E  ++ GS  +  +  
Sbjct: 58   GVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGS--KPHAGD 115

Query: 1549 GDVTQADVDAEDTGS------CSVDGSGLAEKSNLEVSPKSFVATEFYDGKLVNVAEGLK 1388
             +    DV+   T S      CS+      EK NL   PK+FV  E +DGK+VNV +GLK
Sbjct: 116  AESVTEDVNGGCTSSYKENDLCSIQNQN--EKQNLAAGPKTFVGNEMFDGKMVNVVDGLK 173

Query: 1387 VYEDLFDDSDILKLNNLVSDLRAAGKRGQLP-GQTFVALKRPMKGHGREMIQLGIPIADA 1211
            +YE+LFDD ++L L +LV+DLRAAGKRGQL  GQT+VA KRPMKGHGREMIQLG+PIADA
Sbjct: 174  LYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIADA 233

Query: 1210 PPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWP 1031
            P +DE AAG S+D +IE IP  LQD IERL+   V+++KPDS IID++NEGDHSQP +WP
Sbjct: 234  PLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMWP 293

Query: 1030 QWFGRPVCVISLTVCEISFGK-IISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAI 854
             WFG+PVC++ LT C+I+FG+ +I AD PG+Y           S++ MQG+SADFA+HA+
Sbjct: 294  PWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHAL 353

Query: 853  PSIQKQRILVTLVKPQSRKIVGGEPHRF----XXXXXXXXXXPSRTPGQIR-PAPAKHFG 689
            PS++KQRILVT  K    K    +  R               PSR+P +IR  A  KH+ 
Sbjct: 354  PSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHYA 413

Query: 688  PLPSTGVXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQ 518
             +P+TGV             +G   ++V T VAP I++PA VP+PP S+ WPAA PRHP 
Sbjct: 414  VIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAA-PRHPP 472

Query: 517  PRLPVPGTGVFLPSQGQGPGNSTTNQPPSTE-NCATEDSV--GKMNGS-------RLPPT 368
            PRLPVPGTGVFLP  G G  +S      +TE N   E +    K NGS         P  
Sbjct: 473  PRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTSPRG 532

Query: 367  KVDEEAAQKECNGS----GWGEILKEEEEN 290
            ++D ++ +++CNGS    G G  L +EE++
Sbjct: 533  RLDGKSPKQDCNGSVDGAGSGRALMKEEQH 562


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  387 bits (994), Expect = e-105
 Identities = 230/510 (45%), Positives = 309/510 (60%), Gaps = 38/510 (7%)
 Frame = -3

Query: 1705 GGEMNGKDLNNGYYKSNQKLTEKMEK--------KEEREVTEFNEKSEENGSATRQRSTQ 1550
            G + +G        + N++ +EK E+        K E + + F E  ++ GS  +  +  
Sbjct: 167  GVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGS--KPHAGD 224

Query: 1549 GDVTQADVDAEDTGS------CSVDGSGLAEKSNLEVSPKSFVATEFYDGKLVNVAEGLK 1388
             +    DV+   T S      CS+      EK NL   PK+FV  E +DGK+VNV +GLK
Sbjct: 225  AESVTEDVNGGCTSSYKENDLCSIQNQN--EKQNLAAGPKTFVGNEMFDGKMVNVVDGLK 282

Query: 1387 VYEDLFDDSDILKLNNLVSDLRAAGKRGQLP-GQTFVALKRPMKGHGREMIQLGIPIADA 1211
            +YE+LFDD ++L L +LV+DLRAAGKRGQL  GQT+VA KRPMKGHGREMIQLG+PIADA
Sbjct: 283  LYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIADA 342

Query: 1210 PPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWP 1031
            P +DE AAG S+D +IE IP  LQD IERL+   V+++KPDS IID++NEGDHSQP +WP
Sbjct: 343  PLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMWP 402

Query: 1030 QWFGRPVCVISLTVCEISFGK-IISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAI 854
             WFG+PVC++ LT C+I+FG+ +I AD PG+Y           S++ MQG+SADFA+HA+
Sbjct: 403  PWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHAL 462

Query: 853  PSIQKQRILVTLVKPQSRKIVGGEPHRF----XXXXXXXXXXPSRTPGQIR-PAPAKHFG 689
            PS++KQRILVT  K    K    +  R               PSR+P +IR  A  KH+ 
Sbjct: 463  PSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHYA 522

Query: 688  PLPSTGVXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQ 518
             +P+TGV             +G   ++V T VAP I++PA VP+PP S+ WPAA PRHP 
Sbjct: 523  VIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPAPVPIPPGSTGWPAA-PRHPP 581

Query: 517  PRLPVPGTGVFLPSQGQGPGNSTTNQPPSTE-NCATEDSV--GKMNGS-------RLPPT 368
            PRLPVPGTGVFLP  G G  +S      +TE N   E +    K NGS         P  
Sbjct: 582  PRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKENGSVKPNHHTTSPRG 641

Query: 367  KVDEEAAQKECNGS----GWGEILKEEEEN 290
            ++D ++ +++CNGS    G G  L +EE++
Sbjct: 642  RLDGKSPKQDCNGSVDGAGSGRALMKEEQH 671


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  384 bits (987), Expect = e-104
 Identities = 216/430 (50%), Positives = 273/430 (63%), Gaps = 30/430 (6%)
 Frame = -3

Query: 1477 EKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQL 1298
            EK N   SPK+FV TE +DGK VNV +GLK+YE+LFDDS++ K  +LV+DLRAAGKRGQL
Sbjct: 272  EKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQL 331

Query: 1297 PGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASR----DPKIEPIPVALQDVI 1130
             GQTFV  KRPMKGHGREMIQLG+PIADAP EDE   G S+    + + E IP  LQDVI
Sbjct: 332  QGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVI 391

Query: 1129 ERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADG 950
             +L+   V+++KPD+ IID +NEGDHSQPHIWP WFGRPVC++ LT C+++FG++I AD 
Sbjct: 392  GQLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADH 451

Query: 949  PGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGEPHRF 770
            PG+Y           S++ MQG+SADFA+HAIPS++KQRILVT  K Q +K    +  R 
Sbjct: 452  PGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTTASDGQRL 511

Query: 769  ---XXXXXXXXXXPSRTPGQIR-PAPAKHFGPLPSTGV--XXXXXXXXXXXXPNGM---Y 617
                         PSR+P  +R P   KH+G +P+TGV              PNGM   +
Sbjct: 512  LPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLF 571

Query: 616  VATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQPRLPVPGTGVFLPSQGQGPGNSTTNQP 437
            V T VAP + +PA  PLP  S  WPAA PRHP PRLPVPGTGVFLP  G   GNS++ Q 
Sbjct: 572  VTTAVAPAMPFPAPXPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGS--GNSSSPQH 629

Query: 436  PSTENCAT----------EDSVGKMNGSR---LPPTKVDEEAAQKECNGS----GWGEIL 308
             STE  +T          E+  GK + +     P  K+D +  ++ECNGS    G  E  
Sbjct: 630  ISTEATSTSVETAAPTEKENGSGKSSSNSNTVSPKGKLDGKVHRQECNGSMDETGVDERA 689

Query: 307  KEEEENESHD 278
              +EE + +D
Sbjct: 690  VTKEEQQHND 699


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  383 bits (983), Expect = e-103
 Identities = 207/468 (44%), Positives = 283/468 (60%), Gaps = 25/468 (5%)
 Frame = -3

Query: 1600 NEKSEENGSATRQRSTQGDVTQADVDAEDTGSCSVDGSGLAEKSNLEVSPKSFVATEFYD 1421
            N +   +G++  + +   +   + +   ++ S  +      EK NL + PK+FV  E +D
Sbjct: 213  NSQGTISGNSESEDAVVNEGCTSSIKENESNSIQIQN----EKQNLSLIPKTFVGNETFD 268

Query: 1420 GKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLPGQTFVALKRPMKGHGREM 1241
            GK VNV +GLK+YE+   D+++ KL +LV+DLR  G+RGQL GQT+V  KRPMKGHGREM
Sbjct: 269  GKTVNVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREM 328

Query: 1240 IQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNE 1061
            IQLGIPIAD P EDE++AG S+D ++E IP  LQDVI+RL+   V++ KPDS IID FNE
Sbjct: 329  IQLGIPIADGPQEDEISAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNE 388

Query: 1060 GDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNYXXXXXXXXXXXSVISMQGR 881
            GDHS PH+WP WFGRPV V+ LT C+++FGK++  D PG+Y           S++ +QG+
Sbjct: 389  GDHSHPHMWPPWFGRPVSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGK 448

Query: 880  SADFARHAIPSIQKQRILVTLVKPQSRKIVGGEPHRFXXXXXXXXXXPS----RTPGQIR 713
            SAD+A+HAIPSI+KQRILVT  K Q RK    +  R            S    R+P  IR
Sbjct: 449  SADYAKHAIPSIRKQRILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPPGRSPNHIR 508

Query: 712  -PAPAKHFGPLPSTGVXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAAVPLPPTSSAW 545
             PA  KH+  +P+TGV             NG   ++VA PV P + +PA V +PP S  W
Sbjct: 509  HPAGPKHYAAVPTTGVLPAPPNRPQLPPANGIQPLFVAAPVGPAMPFPAPVVIPPGSPGW 568

Query: 544  PAALPRHPQPRLPVPGTGVFLPSQGQGPGNSTTNQPPST--------ENCATEDSVGKMN 389
             AA PRHP PR+P+PGTGVFLP  G G  ++   Q PST        E  +TE   G   
Sbjct: 569  VAA-PRHPPPRMPLPGTGVFLPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKDNGTAK 627

Query: 388  GSRL---PPTKVDEEAAQKECN------GSGWGEILKEEEENESHDVA 272
             S     P  K+D +A +++CN      GSG G + +E+++N ++  A
Sbjct: 628  SSHAIASPKAKLDVKAQRQDCNGSVDGTGSGRGTVKQEQQQNSNNAAA 675


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  379 bits (973), Expect = e-102
 Identities = 210/491 (42%), Positives = 290/491 (59%), Gaps = 19/491 (3%)
 Frame = -3

Query: 1693 NGKDLNNGYYKSNQKLTEKMEKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADVDAED 1514
            +G D      KS     +K +   +  V          GS +    T+ +        ++
Sbjct: 199  SGGDSGRLENKSLATAEDKKDAASKPHVDNLKSSGNSEGSLSGNLETEAEAVHEQSSPKE 258

Query: 1513 TGSCSVDGSGLAEKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLV 1334
              S  +    +  K NL  +PK+FV  E  DGK VNV +GLK+YE L DD ++ KL +LV
Sbjct: 259  HDSHFIQNQIV--KLNLTTTPKTFVGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLV 316

Query: 1333 SDLRAAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPI 1154
            +DLRAAG++GQ  GQ +V  KRPMKGHGREMIQLG+PIADAP E+E AAG S+D KIE I
Sbjct: 317  NDLRAAGRKGQFQGQAYVVSKRPMKGHGREMIQLGLPIADAPAEEENAAGTSKDRKIESI 376

Query: 1153 PVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISF 974
            P  LQ+VIER ++  ++++KPDS IIDI+NEGDHSQPH+WP WFG+P+ V+ LT C+++F
Sbjct: 377  PTLLQEVIERFVSMQIMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPISVLFLTECDLTF 436

Query: 973  GKIISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKI 794
            G++I+AD PG+Y           S++ MQG++ DFA+HAIP+I+KQR+L+T  K Q +K 
Sbjct: 437  GRVITADHPGDYRGSLKLPLAPGSLLVMQGKATDFAKHAIPAIRKQRVLLTFTKSQPKKF 496

Query: 793  VGGEPHRF----XXXXXXXXXXPSRTPGQIRPAPAKHFGPLPSTGVXXXXXXXXXXXXPN 626
            V  +  R               PSR+P  IR   +KH+ P+P+TGV            PN
Sbjct: 497  VQSDGQRLTSPAASPSSHWGPPPSRSPNHIRHPVSKHYAPIPTTGVLPAPSIRPQIAPPN 556

Query: 625  G---MYVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQPRL--PVPGTGVFLPSQGQG- 464
            G   ++V  PVA  + +PA VP+PP S+ WPAA PRHP  RL  PVPGTGVFLP  G G 
Sbjct: 557  GVQPLFVTAPVAAPMPFPAPVPMPPVSTGWPAA-PRHPPNRLPVPVPGTGVFLPPPGSGN 615

Query: 463  ------PGNSTTNQPPSTENCA-TEDSVGKMNGSRL--PPTKVDEEAAQKECNGSGWGEI 311
                  P  +  N P  T +    E+ +GK N      P  K++ ++ +++CNG   G+ 
Sbjct: 616  ASSPQIPNATEINFPAETASLQDKENGLGKSNHGTCASPKEKLEAKSQKQDCNGITDGKA 675

Query: 310  LKEEEENESHD 278
              +EE  +S D
Sbjct: 676  GTKEEHQQSVD 686


>ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618872 [Citrus sinensis]
          Length = 627

 Score =  377 bits (968), Expect = e-102
 Identities = 222/519 (42%), Positives = 305/519 (58%), Gaps = 44/519 (8%)
 Frame = -3

Query: 1687 KDLNNGYYKSNQKLTEKMEKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADVDAEDTG 1508
            KD +N    +N          ++++  +   K+ ++GSA    +++  +TQ   DAE   
Sbjct: 115  KDFHNNNNNNNHAFDSNSSAFDDKK--DVVMKAHDDGSAKSLGNSE--ITQVG-DAEPKA 169

Query: 1507 SCSVDGS--GLAE-----------KSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFD 1367
                DG   GL E           K N  ++ KSFV TE  DGK+VNV +GLK+YE++  
Sbjct: 170  EALDDGCTPGLKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSG 229

Query: 1366 DSDILKLNNLVSDLRAAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAA 1187
            +S++ KL +LV+DLR AGKRGQ+ G  +V  KRP++GHGRE+IQLG+PI D PPEDE+AA
Sbjct: 230  NSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAA 289

Query: 1186 GASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVC 1007
            G SRD +IEPIP  LQDVI+RL+   ++++KPDS I+D+FNEGDHSQPHI P WFGRPVC
Sbjct: 290  GTSRDRRIEPIPSLLQDVIDRLVGLQIMTVKPDSCIVDVFNEGDHSQPHISPSWFGRPVC 349

Query: 1006 VISLTVCEISFGKIISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRIL 827
            ++ LT C+++FG++I  D PG+Y           S++ MQG+SAD A+HAI SI+KQRIL
Sbjct: 350  ILFLTECDMTFGRMIGIDHPGDYRGTLRLSVAPGSLLVMQGKSADIAKHAISSIRKQRIL 409

Query: 826  VTLVKPQSRKIVGGEPHRF----XXXXXXXXXXPSRTPGQIR-PAPAKHFGPLPSTGVXX 662
            VT  K Q +K+   +  R               P R P  IR P   KHF P+P+TGV  
Sbjct: 410  VTFTKSQPKKLTPTDGQRLASPGIAPSPHWGLPPGRPPNHIRHPTGPKHFAPIPTTGVLP 469

Query: 661  XXXXXXXXXXPNG---MYVATPVAPGIAYPAAVPLPPTSSAWPAALPRH---PQPRLPVP 500
                       NG   ++V+ PV P + +PA VP+PP S+ W AA PRH   P PRLPVP
Sbjct: 470  APAIRAQIPPTNGVPPIFVSPPVTPAMPFPAPVPIPPGSTGWTAAPPRHTPPPPPRLPVP 529

Query: 499  GTGVFLPSQGQGPGNSTTNQPPSTENCATEDSVGKM-------NGS-------RLPPTKV 362
            GTGVFLP     PG+  ++ P    + ATE  + +M       NGS         P  K+
Sbjct: 530  GTGVFLPP----PGSGGSSSPRQVSSAATEHLIPEMGSQAEKENGSGKSNHETNAPKEKL 585

Query: 361  DEEAAQKECNGS--GWGE---ILKEEEENES-HDVAISG 263
              E   + CNGS  G G    ++KEE +++S  D +++G
Sbjct: 586  VGETQGQGCNGSVDGTGSVKAVMKEENQHQSVEDTSVAG 624


>ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citrus clementina]
            gi|557550702|gb|ESR61331.1| hypothetical protein
            CICLE_v10014588mg [Citrus clementina]
          Length = 635

 Score =  376 bits (966), Expect = e-101
 Identities = 218/513 (42%), Positives = 298/513 (58%), Gaps = 36/513 (7%)
 Frame = -3

Query: 1693 NGKDLNNGYYKSNQKLTEKMEKKEEREVTEFNEKSEENGSATRQRSTQGDVTQAD----V 1526
            N  + NN  + SN    +  +    +   + + KS  N   T+    +      D     
Sbjct: 126  NNNNNNNHAFDSNSSAFDDKKDVVMKAHDDGSAKSLGNSEITQVGDAEPKAEALDDGCTP 185

Query: 1525 DAEDTGSCSVDGSGLAEKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKL 1346
              ++  S SV      EK N  ++ KSFV TE  DGK+VNV +GLK+YE++  +S++ KL
Sbjct: 186  SLKENDSQSVQSQN--EKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKL 243

Query: 1345 NNLVSDLRAAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPK 1166
             +LV+DLR AGKRGQ+ G  +V  KRP++GHGRE+IQLG+PI D PPEDE+AAG SRD +
Sbjct: 244  VSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRR 303

Query: 1165 IEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVC 986
            IEPIP  LQDVI+RL+   ++++KPDS I+D+FNEGDHSQPHI P WFGRPVC++ LT C
Sbjct: 304  IEPIPSLLQDVIDRLVGLQIMTVKPDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTEC 363

Query: 985  EISFGKIISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQ 806
            +++FG++I  D PG+Y           S++ MQG+SAD A+HAI SI+KQRILVT  K Q
Sbjct: 364  DMTFGRMIGIDHPGDYRGTLRLSVAPGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQ 423

Query: 805  SRKIVGGEPHRF----XXXXXXXXXXPSRTPGQIR-PAPAKHFGPLPSTGVXXXXXXXXX 641
             +K+   +  R               P R P  IR P   KHF P+P+TGV         
Sbjct: 424  PKKLTPTDGQRLASPGIAPSPHWGPPPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQ 483

Query: 640  XXXPNG---MYVATPVAPGIAYPAAVPLPPTSSAWPAALPRH----PQPRLPVPGTGVFL 482
                NG   ++V+ PV P + +PA VP+PP S+ W AA PRH    P PRLPVPGTGVFL
Sbjct: 484  IPPTNGVPPIFVSPPVTPAMPFPAPVPIPPGSTGWTAAPPRHTPPPPPPRLPVPGTGVFL 543

Query: 481  PSQGQGPGNSTTNQPPSTENCATEDSVGKM-------NGS-------RLPPTKVDEEAAQ 344
            P     PG+  ++ P    + ATE  + +M       NGS         P  K+  E   
Sbjct: 544  PP----PGSGGSSSPRQVSSAATEHLIPEMGSQAEKENGSGKSNHETNAPKEKLVGETQG 599

Query: 343  KECNGS--GWGE---ILKEEEENES-HDVAISG 263
            + CNGS  G G    ++KEE +++S  D +++G
Sbjct: 600  QGCNGSVDGTGSVKAVMKEENQHQSVEDTSVAG 632


>ref|XP_006448090.1| hypothetical protein CICLE_v10014588mg [Citrus clementina]
            gi|557550701|gb|ESR61330.1| hypothetical protein
            CICLE_v10014588mg [Citrus clementina]
          Length = 486

 Score =  375 bits (962), Expect = e-101
 Identities = 205/437 (46%), Positives = 274/437 (62%), Gaps = 32/437 (7%)
 Frame = -3

Query: 1477 EKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQL 1298
            EK N  ++ KSFV TE  DGK+VNV +GLK+YE++  +S++ KL +LV+DLR AGKRGQ+
Sbjct: 51   EKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVSLVNDLRTAGKRGQI 110

Query: 1297 PGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLL 1118
             G  +V  KRP++GHGRE+IQLG+PI D PPEDE+AAG SRD +IEPIP  LQDVI+RL+
Sbjct: 111  QGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIEPIPSLLQDVIDRLV 170

Query: 1117 TKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNY 938
               ++++KPDS I+D+FNEGDHSQPHI P WFGRPVC++ LT C+++FG++I  D PG+Y
Sbjct: 171  GLQIMTVKPDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDMTFGRMIGIDHPGDY 230

Query: 937  XXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGEPHRF---- 770
                       S++ MQG+SAD A+HAI SI+KQRILVT  K Q +K+   +  R     
Sbjct: 231  RGTLRLSVAPGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPKKLTPTDGQRLASPG 290

Query: 769  XXXXXXXXXXPSRTPGQIR-PAPAKHFGPLPSTGVXXXXXXXXXXXXPNG---MYVATPV 602
                      P R P  IR P   KHF P+P+TGV             NG   ++V+ PV
Sbjct: 291  IAPSPHWGPPPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQIPPTNGVPPIFVSPPV 350

Query: 601  APGIAYPAAVPLPPTSSAWPAALPRH----PQPRLPVPGTGVFLPSQGQGPGNSTTNQPP 434
             P + +PA VP+PP S+ W AA PRH    P PRLPVPGTGVFLP     PG+  ++ P 
Sbjct: 351  TPAMPFPAPVPIPPGSTGWTAAPPRHTPPPPPPRLPVPGTGVFLPP----PGSGGSSSPR 406

Query: 433  STENCATEDSVGKM-------NGS-------RLPPTKVDEEAAQKECNGS--GWGE---I 311
               + ATE  + +M       NGS         P  K+  E   + CNGS  G G    +
Sbjct: 407  QVSSAATEHLIPEMGSQAEKENGSGKSNHETNAPKEKLVGETQGQGCNGSVDGTGSVKAV 466

Query: 310  LKEEEENES-HDVAISG 263
            +KEE +++S  D +++G
Sbjct: 467  MKEENQHQSVEDTSVAG 483


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  369 bits (946), Expect = 3e-99
 Identities = 206/475 (43%), Positives = 281/475 (59%), Gaps = 21/475 (4%)
 Frame = -3

Query: 1633 EKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADVDAEDTGSCSVDGSGLAEKSNLEVS 1454
            +KK+    +  +     +G+A    S   +    D  +    S S   +   EK NL ++
Sbjct: 213  DKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAIT 272

Query: 1453 PKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLPGQTFVAL 1274
            PK+FVA E  DG++VNV +GLK+YE+L D  ++ KL +LV++LRA G+RGQ  GQT++  
Sbjct: 273  PKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILS 332

Query: 1273 KRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIK 1094
            KRPMKGHGREMIQLG+PIADAP EDE A G S++ ++E IP  LQDVIE  +   V+++K
Sbjct: 333  KRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMK 392

Query: 1093 PDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNYXXXXXXXX 914
            PDS IIDI+NEGDHSQPH+WP WFG+PV V+ LT CE++FGK+I     G+Y        
Sbjct: 393  PDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSV 452

Query: 913  XXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGE----PHRFXXXXXXXX 746
               S++ MQG+S+D A+HAIP I+KQR+LVT  K Q +K+   +    P           
Sbjct: 453  APGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWG 512

Query: 745  XXPSRTPGQIRPAPAKHFGPLPSTGVXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAA 575
              PSR+P  +R    KH+  +P+TGV            PNG   +++ TPVA  + +PA 
Sbjct: 513  PPPSRSPNHLRHPVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAP 572

Query: 574  VPLPPTSSAWPAALPRHPQPRLPV--PGTGVFLPSQGQGPGNST---------TNQPPST 428
            VP+PP S+ WP + PRHP  RLPV  PGTGVFLP  G G  +S           N P  T
Sbjct: 573  VPIPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTET 632

Query: 427  ENCATEDSVGKMN--GSRLPPTKVDEEAAQKECNGSGWG-EILKEEEENESHDVA 272
            E    E+  GK N   S  P  K  E+  +++ NG   G  + KEE+++ SH VA
Sbjct: 633  EK-EKENGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDGIAVKKEEQQSVSHTVA 686


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  369 bits (946), Expect = 3e-99
 Identities = 206/475 (43%), Positives = 281/475 (59%), Gaps = 21/475 (4%)
 Frame = -3

Query: 1633 EKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADVDAEDTGSCSVDGSGLAEKSNLEVS 1454
            +KK+    +  +     +G+A    S   +    D  +    S S   +   EK NL ++
Sbjct: 214  DKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAIT 273

Query: 1453 PKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLPGQTFVAL 1274
            PK+FVA E  DG++VNV +GLK+YE+L D  ++ KL +LV++LRA G+RGQ  GQT++  
Sbjct: 274  PKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILS 333

Query: 1273 KRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIK 1094
            KRPMKGHGREMIQLG+PIADAP EDE A G S++ ++E IP  LQDVIE  +   V+++K
Sbjct: 334  KRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMK 393

Query: 1093 PDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNYXXXXXXXX 914
            PDS IIDI+NEGDHSQPH+WP WFG+PV V+ LT CE++FGK+I     G+Y        
Sbjct: 394  PDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSV 453

Query: 913  XXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGE----PHRFXXXXXXXX 746
               S++ MQG+S+D A+HAIP I+KQR+LVT  K Q +K+   +    P           
Sbjct: 454  APGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWG 513

Query: 745  XXPSRTPGQIRPAPAKHFGPLPSTGVXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAA 575
              PSR+P  +R    KH+  +P+TGV            PNG   +++ TPVA  + +PA 
Sbjct: 514  PPPSRSPNHLRHPVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAAPMPFPAP 573

Query: 574  VPLPPTSSAWPAALPRHPQPRLPV--PGTGVFLPSQGQGPGNST---------TNQPPST 428
            VP+PP S+ WP + PRHP  RLPV  PGTGVFLP  G G  +S           N P  T
Sbjct: 574  VPIPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTET 633

Query: 427  ENCATEDSVGKMN--GSRLPPTKVDEEAAQKECNGSGWG-EILKEEEENESHDVA 272
            E    E+  GK N   S  P  K  E+  +++ NG   G  + KEE+++ SH VA
Sbjct: 634  EK-EKENGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDGIAVKKEEQQSVSHTVA 687


>ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum
            lycopersicum]
          Length = 641

 Score =  364 bits (934), Expect = 7e-98
 Identities = 214/482 (44%), Positives = 283/482 (58%), Gaps = 23/482 (4%)
 Frame = -3

Query: 1705 GGEMNGKDLN-NGYYKSNQKLTEKMEKKEERE--VTEFNEKSEENGSA-----TRQRSTQ 1550
            G E  G++ + + + K+N    EK++  EE++    E   K E N S      T    +Q
Sbjct: 157  GKESQGQNFSLDAHSKTNG--VEKIDVVEEKQGDKKELAAKPEANSSVKGSVCTEAGDSQ 214

Query: 1549 GDVTQADV--DAEDTGSCSVDGSGLA-----EKSNLEVSPKSFVATEFYDGKLVNVAEGL 1391
            G+V + D   D+   GS +V+    +     EK N  V PK+FVATE YDGK VNV +G+
Sbjct: 215  GEVDKTDDKRDSNSEGSSNVESESHSFQIPTEKQN--VVPKTFVATEIYDGKPVNVVDGM 272

Query: 1390 KVYEDLFDDSDILKLNNLVSDLRAAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADA 1211
            K+YE+L   S++ KL  LV+DLRAAG+RGQLP Q F+  KRPMKGHGREM+QLG+PI DA
Sbjct: 273  KLYEELLSSSEVSKLVTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDA 332

Query: 1210 PPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWP 1031
            PPE+E A    +D K E IP  LQDVI++L     +S+KPD+ +IDIFNEGDHSQPH+WP
Sbjct: 333  PPEEESAISTYKDRKTEAIPGLLQDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWP 392

Query: 1030 QWFGRPVCVISLTVCEISFGKIISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIP 851
             W+GRP+  + LT CE++FGK+I  D PG+Y           SV+ MQGRS +FA++AIP
Sbjct: 393  YWYGRPISTLFLTDCEMTFGKVIGVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIP 452

Query: 850  SIQKQRILVTLVKPQSRKIVGGEPHRF---XXXXXXXXXXPSRTPGQI-RPAPAKHFGPL 683
            SI+KQR+LVT  K Q R+I  G+  RF             PSR+   I RP   KH+G +
Sbjct: 453  SIRKQRMLVTFTKLQLRRIKSGDSQRFPSSAGGPVSQWVPPSRSSNHIRRPFGPKHYGSM 512

Query: 682  PSTGVXXXXXXXXXXXXPN--GMYVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQPRL 509
            P+TGV             N   ++V   VAP + +PA V LPP S+ W     RHP PRL
Sbjct: 513  PATGVLPIPGVRPQFAPANMQPIFVPATVAPAMPFPAPVALPPASAGWAVPPIRHPPPRL 572

Query: 508  PVPGTGVFLPSQGQGPGNSTTNQPPSTENCATEDSV--GKMNGSRLPPTKVDEEAAQKEC 335
            P+PGTGVFLP    G G S+T+  P+       DS    K+N           E   ++C
Sbjct: 573  PLPGTGVFLP---PGSGTSSTDNIPAENTGPLSDSTVSQKVNSD-------SSEVQTQDC 622

Query: 334  NG 329
            NG
Sbjct: 623  NG 624


>ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum]
          Length = 638

 Score =  362 bits (928), Expect = 4e-97
 Identities = 209/464 (45%), Positives = 273/464 (58%), Gaps = 15/464 (3%)
 Frame = -3

Query: 1675 NGYYKSNQKLTEKMEKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADV--DAEDTGSC 1502
            NG  K +    ++ EKKE     E N  S ++   T    +QG+V + D   D+   GS 
Sbjct: 171  NGVEKIDVVEVKQGEKKELAANPEANS-SVKSSVCTEAGDSQGEVDKTDDKRDSNSEGSS 229

Query: 1501 SVDGSGLA-----EKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNL 1337
            +V+    +     EK N  V PK+FVATE YDGK VNV +G+K+YE+L   S++ KL  L
Sbjct: 230  NVESESHSIQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLLTL 287

Query: 1336 VSDLRAAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEP 1157
            V+DLRAAG+RGQLP Q F+  KRPMKGHGREM+QLG+PI DAPPE+E A    +D K E 
Sbjct: 288  VNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEEAAISTYKDRKTEA 347

Query: 1156 IPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEIS 977
            IP   QDVI++L     +S+KPD+ +IDIFNEGDHSQPH+WP W+GRP+ ++ LT CE++
Sbjct: 348  IPGLFQDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGRPISMLFLTDCEMT 407

Query: 976  FGKIISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRK 797
            FGK+I  D PG+Y           SV+ MQGRS +FA++AIPS +KQRILVT  K Q R+
Sbjct: 408  FGKVIGVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSTRKQRILVTFTKLQLRR 467

Query: 796  IVGGEPHRF---XXXXXXXXXXPSRTPGQI-RPAPAKHFGPLPSTGVXXXXXXXXXXXXP 629
            I   +  RF             PSR+P  I RP   KH+G + +TGV             
Sbjct: 468  IKSADSQRFPSSAGGPVSQWVPPSRSPNHIRRPFGPKHYGSMSTTGVLPIPGVRPQFAPA 527

Query: 628  N--GMYVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQPRLPVPGTGVFLPSQGQGPGN 455
            N   ++V   VAP + +PA V LPP S+ W     RHP PRLP+PGTGVFLP    G G 
Sbjct: 528  NMQPIFVPATVAPAMPFPAPVALPPASAGWAVPPLRHPPPRLPLPGTGVFLP---PGSGT 584

Query: 454  STTNQPPSTENCATEDSV--GKMNGSRLPPTKVDEEAAQKECNG 329
            S+T+  P+ +     DS    K+N           E   +ECNG
Sbjct: 585  SSTDNIPAEKAGPLSDSTVSQKVNSG-------SSEVQTQECNG 621


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  362 bits (928), Expect = 4e-97
 Identities = 210/491 (42%), Positives = 288/491 (58%), Gaps = 21/491 (4%)
 Frame = -3

Query: 1681 LNNGYYKSNQKLTEKMEKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADVDAEDTGSC 1502
            + N  +  N     + EK EE +      KS++  +    +S   +   +  +A+ T S 
Sbjct: 181  VENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKADATAKSHTDNHKNSSGNAQGTFSG 240

Query: 1501 SVDGSGLAEKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLR 1322
            + +     EK NL ++PK+FVA E  DG++VNV +GLK+YE+L D  ++ KL +LV++LR
Sbjct: 241  NSEAVA-NEKQNLAITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELR 299

Query: 1321 AAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVAL 1142
            A G+RGQ  GQT++  KRPMKGHGREMIQLG+PIADAP EDE A G S+   +E IP  L
Sbjct: 300  ATGRRGQCQGQTYILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKGT-VESIPALL 358

Query: 1141 QDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKII 962
            QDVIE  +   V+++KPDS IIDI+NEGDHSQPH+WP WFG+PV V+ LT CE++FGK+I
Sbjct: 359  QDVIEHFVAMQVMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVI 418

Query: 961  SADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGE 782
                 G+Y           S++ MQG+S+D A+HAIP I+KQR+LVT  K Q +K+   +
Sbjct: 419  DTLHHGDYKGSLKLSVAPGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSND 478

Query: 781  ----PHRFXXXXXXXXXXPSRTPGQIRPAPAKHFGPLPSTGVXXXXXXXXXXXXPNG--- 623
                P             PSR+P  +R    KH+  +P+TGV            PNG   
Sbjct: 479  GPRLPSHAVAPSSHWGPPPSRSPNHLRHPVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQP 538

Query: 622  MYVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQPRLPV--PGTGVFLPSQGQGPGNST 449
            +++ TPVA  + +PA VP+PP S+ WP + PRHP  RLPV  PGTGVFLP  G G  +S 
Sbjct: 539  LFMTTPVAAPMPFPAPVPIPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSA 598

Query: 448  ---------TNQPPSTENCATEDSVGKMN--GSRLPPTKVDEEAAQKECNGSGWG-EILK 305
                      N P  TE    E+  GK N   S  P  K  E+  +++ NG   G  + K
Sbjct: 599  LQLSATATEMNFPTETEK-EKENGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDGIAVKK 657

Query: 304  EEEENESHDVA 272
            EE+++ SH VA
Sbjct: 658  EEQQSVSHTVA 668


>ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus]
            gi|449481289|ref|XP_004156139.1| PREDICTED:
            uncharacterized LOC101210274 [Cucumis sativus]
          Length = 684

 Score =  355 bits (912), Expect = 3e-95
 Identities = 209/494 (42%), Positives = 287/494 (58%), Gaps = 18/494 (3%)
 Frame = -3

Query: 1705 GGEMNGKDLNNGYYKSNQKLTEKMEKKEEREVTEFNEKSEENGSATRQRSTQGDVTQADV 1526
            G  ++ KD  +G  +SN K T+  E  E+  + + ++   ++G ++  R  +    Q   
Sbjct: 202  GSAVDNKD-THGKDQSNCK-TKSAENLEDNAINKDSQVEPDDGCSSSHRDKELQSVQ--- 256

Query: 1525 DAEDTGSCSVDGSGLAEKSNLEVSPKSFVATEFYDGKLVNVAEGLKVYEDLFDDSDILKL 1346
                    S +G     K     +P++FVA+E +DGK+VNV +GLK++E+L DD+++ KL
Sbjct: 257  --------SQNG-----KQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSKL 303

Query: 1345 NNLVSDLRAAGKRGQLPGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPK 1166
             +LV+DLRA+GKRGQ  GQT+V  KRPMKGHGREMIQLG PIADAP ED+ + G S+D +
Sbjct: 304  LSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMIQLGFPIADAPHEDDNSLGLSKDRR 363

Query: 1165 IEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVC 986
            IEPIP  LQD+I+RL+   V+++KPDS IID +NEGDHSQPH+WP WFGRPV V+ LT C
Sbjct: 364  IEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEGDHSQPHVWPSWFGRPVGVLLLTEC 423

Query: 985  EISFGKIISADGPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSIQKQRILVTLVKPQ 806
            EI+FG++I  D  GNY           +++ +QG+SADFA+HA+P+I+KQRILVTL K Q
Sbjct: 424  EITFGRVIGTDHSGNYRGAMKLSLTPGNLLVVQGKSADFAKHALPAIRKQRILVTLTKSQ 483

Query: 805  SRKIVGGEPHRF---XXXXXXXXXXPSRTPGQIRPAPAKHFGPLPSTGVXXXXXXXXXXX 635
             ++    +  R               +R+P        K +  +PSTGV           
Sbjct: 484  PKRAAPADGQRTSLNVGTFSGWGPPSARSPNPRLSPGQKPYPTVPSTGVLPVPPIRPQMA 543

Query: 634  XPNGM--YVATPVAPGIAYPAAVPLPPTSSAWPAALPRHPQPRLPVPGTGVFLPSQG--Q 467
             PNG+   +  PVA  + +   VP+P   SAWP A  RHP PRLPVPGTGVFLP  G   
Sbjct: 544  PPNGIPPLIVPPVASPMPF-TPVPIPTGPSAWPTAHTRHPPPRLPVPGTGVFLPPPGSSS 602

Query: 466  GPGNSTTNQPPSTENCATEDSVGKMNG--------SRLPPTKVDEEAAQKECNGS---GW 320
             P  S   Q P   N  T     K NG           P  K D +A ++ECNGS     
Sbjct: 603  APTPSPQQQLP-ISNIETGSLSEKENGLTKSDHSSGTFPGEKPDAKAQRQECNGSIDGSG 661

Query: 319  GEILKEEEENESHD 278
             + +KEEE+ +  +
Sbjct: 662  NDKVKEEEQQQQQE 675


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  353 bits (907), Expect = 1e-94
 Identities = 194/448 (43%), Positives = 266/448 (59%), Gaps = 26/448 (5%)
 Frame = -3

Query: 1594 KSEENGSATRQRSTQGDVTQADVDAEDTGSCSVDGSG--------LAEKSNLEVSPKSFV 1439
            K + +GS    RST+G ++  + +A     C  +  G          +  +L    K+F+
Sbjct: 212  KHQTDGSLKSTRSTEGSLSNLESEAVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFI 271

Query: 1438 ATEFYDGKLVNVAEGLKVYEDLFDDSDILKLNNLVSDLRAAGKRGQLPG-QTFVALKRPM 1262
              E +DGK+VNV +GLK+YEDLFD ++I  L +LV+DLR +GK+GQL G Q ++  +RPM
Sbjct: 272  GNEMFDGKMVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPM 331

Query: 1261 KGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSA 1082
            KGHGREMIQLG+PIADAP E E   GAS+D  +EPIP   QD+IER+++  V+++KPD  
Sbjct: 332  KGHGREMIQLGVPIADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCC 391

Query: 1081 IIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEISFGKIISADGPGNYXXXXXXXXXXXS 902
            I+D +NEGDHSQPH WP W+GRPV ++ LT CE++FG++I+++ PG+Y           S
Sbjct: 392  IVDFYNEGDHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGS 451

Query: 901  VISMQGRSADFARHAIPSIQKQRILVTLVKPQSRKIVGGEPHRF--XXXXXXXXXXPSRT 728
            ++ M+G+S+DFA+HA+PS++KQRILVT  K Q RK +  +  R             PSR+
Sbjct: 452  LLVMEGKSSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQRLASTATSSHWGPLPSRS 511

Query: 727  PGQIR-PAPAKHFGPLPSTGVXXXXXXXXXXXXPNGM---YVATPVAPGIAYPAAVPLPP 560
            P  +R    +KH+  LP+TGV            P GM   +V  PV P + +PA V  PP
Sbjct: 512  PNHVRHHVGSKHYATLPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPP 571

Query: 559  TSSAWPAA-LPRHPQPRLPVPGTGVFLPSQGQG------PGNSTTNQPPSTEN-CATEDS 404
             S+ W  A  PRHP PR+P PGTGVFLP  G G      P  +     PSTE     E  
Sbjct: 572  GSTGWTGAPPPRHPPPRVPAPGTGVFLPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKE 631

Query: 403  VGKMN---GSRLPPTKVDEEAAQKECNG 329
             GK N    S  P  KV ++    ECNG
Sbjct: 632  NGKTNHNSTSASPKGKVQKQ----ECNG 655

