BLASTX nr result

ID: Mentha22_contig00045496 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00045496
         (1443 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38347.1| hypothetical protein MIMGU_mgv1a005691mg [Mimulus...   276   1e-71
ref|XP_007009440.1| Uncharacterized protein isoform 9 [Theobroma...   271   7e-70
ref|XP_007009437.1| Uncharacterized protein isoform 6 [Theobroma...   271   7e-70
ref|XP_007009432.1| Uncharacterized protein isoform 1 [Theobroma...   271   7e-70
ref|XP_007009438.1| Uncharacterized protein isoform 7 [Theobroma...   263   1e-67
ref|XP_007009435.1| Uncharacterized protein isoform 4 [Theobroma...   263   1e-67
ref|XP_007009434.1| Uncharacterized protein isoform 3 [Theobroma...   263   1e-67
ref|XP_007139261.1| hypothetical protein PHAVU_008G014500g [Phas...   253   1e-64
ref|XP_007139260.1| hypothetical protein PHAVU_008G014500g [Phas...   253   1e-64
ref|XP_007139258.1| hypothetical protein PHAVU_008G014500g [Phas...   253   1e-64
ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787...   250   1e-63
gb|EXB65298.1| hypothetical protein L484_025377 [Morus notabilis]     249   2e-63
ref|XP_007009439.1| Uncharacterized protein isoform 8 [Theobroma...   247   8e-63
ref|XP_007009436.1| Uncharacterized protein isoform 5 [Theobroma...   247   8e-63
ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782...   244   5e-62
ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206...   244   7e-62
ref|XP_007218931.1| hypothetical protein PRUPE_ppa003346mg [Prun...   241   8e-61
ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cuc...   223   2e-55
ref|XP_002316304.2| hypothetical protein POPTR_0010s21640g, part...   210   1e-51
ref|XP_002311151.2| hypothetical protein POPTR_0008s05120g [Popu...   209   2e-51

>gb|EYU38347.1| hypothetical protein MIMGU_mgv1a005691mg [Mimulus guttatus]
          Length = 474

 Score =  276 bits (707), Expect = 1e-71
 Identities = 177/426 (41%), Positives = 237/426 (55%), Gaps = 10/426 (2%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MSFQNN IWL N SGS+ANGE+CY++ TRIDQKR + WF   SEQEL  +K+QAV+  + 
Sbjct: 1    MSFQNNGIWLTNSSGSLANGEMCYDSTTRIDQKRSHQWFTGPSEQELFTNKKQAVESTRE 60

Query: 374  TSGPAVMESSLWHDGPHFQLESHA-PTLFNPKAVRSSNVSDNNNPRVSATMNMERKDLGH 550
             + P  ++   W DG + Q E      LF PK VR                         
Sbjct: 61   VTEPVTVDG-FWRDGSNSQSEGQTGDRLFAPKPVR------------------------- 94

Query: 551  QFGNDQSICLTMSHEVNDSLCLNSGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDVMTT 730
                           + D L LN+G RKVKVNEV I +NC PE++G+T            
Sbjct: 95   ---------------LEDPLSLNTGLRKVKVNEVTIPDNCFPEFMGNTM----------- 128

Query: 731  TFQRTSNNMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910
             FQR+ +NM+S PT N+                             D NFML NQYYNGI
Sbjct: 129  -FQRSGSNMYSEPTNNT-----------------------------DANFMLTNQYYNGI 158

Query: 911  DNNVLSIGQAFNRGNYNVDALGDQYEKENS-NLSSVCPTY-NRGQENLFGLESFYSKVNE 1084
            DNN+LSIG  FN GNY+      QYEKE S N  ++ P Y ++G +N F +E F +++NE
Sbjct: 159  DNNLLSIG--FNGGNYS-STHTVQYEKEASCNFVAISPNYGSKGHDNFFAVEPFCNRLNE 215

Query: 1085 TFISAGPGKGESHIAFQGQQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQN 1246
            TF++AG     ++     Q D+ + SLG+L+NKENS +L      + GEE TISFGG ++
Sbjct: 216  TFMTAGSTYNNNNNI---QHDAPIVSLGSLYNKENSGLLSMVANSKNGEEATISFGGVED 272

Query: 1247 NADERDHSGRVISSYEVLLNQSSAQSSGALVQKDSTDQLSANAIPASSSKPEGAP-KSKD 1423
            + +ERD SGR+IS+Y++L NQS+ Q+  AL       QLSAN + A++SK +GA  K+K+
Sbjct: 273  SHEERDLSGRLISNYDLLANQSTGQNESAL-------QLSANVVTAATSKTDGAQIKNKE 325

Query: 1424 QKTKKG 1441
            QKTKKG
Sbjct: 326  QKTKKG 331


>ref|XP_007009440.1| Uncharacterized protein isoform 9 [Theobroma cacao]
            gi|508726353|gb|EOY18250.1| Uncharacterized protein
            isoform 9 [Theobroma cacao]
          Length = 477

 Score =  271 bits (692), Expect = 7e-70
 Identities = 170/440 (38%), Positives = 250/440 (56%), Gaps = 25/440 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MSFQ+   WLP   G + NGE+ Y+ ++R + KR + WFMD++  EL  +K+QA++    
Sbjct: 1    MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60

Query: 374  --TSGPAVMESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERK 538
               SG A +  S WH+   FQ  S   +  LF  + +R+ N+ D N   V S  MNM RK
Sbjct: 61   RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D   Q+ N  S  L+MSH + D S C +  G RKVKVN+VR S N +P  +G T+  G  
Sbjct: 121  DFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVN 180

Query: 713  NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886
            + V M+T + ++ NN  S GPT+ S   N IS+ P ++K D NF S+GH+ +KRDG+F+ 
Sbjct: 181  STVSMSTVYSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFIS 240

Query: 887  PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066
                YN  + ++LS+GQAF + + +  ++G  YEK ++NL S+  +Y +GQEN   +   
Sbjct: 241  VGHNYNKGNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPA 300

Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
            Y K NE+ IS  P   K E  I   G    + D  + ++     K  SSIL      +KG
Sbjct: 301  YGKPNESLISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378
            E  TISFGGF + + E + SG +IS Y++L+ NQ+SAQ+S  L QK+  +     + N  
Sbjct: 361  ESNTISFGGFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNA 419

Query: 1379 PASSSKPEGAPKSKDQKTKK 1438
            P  +S+ +  PK K+ KT K
Sbjct: 420  PKHNSRTDANPKHKEPKTAK 439


>ref|XP_007009437.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508726350|gb|EOY18247.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 561

 Score =  271 bits (692), Expect = 7e-70
 Identities = 170/440 (38%), Positives = 250/440 (56%), Gaps = 25/440 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MSFQ+   WLP   G + NGE+ Y+ ++R + KR + WFMD++  EL  +K+QA++    
Sbjct: 1    MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60

Query: 374  --TSGPAVMESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERK 538
               SG A +  S WH+   FQ  S   +  LF  + +R+ N+ D N   V S  MNM RK
Sbjct: 61   RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D   Q+ N  S  L+MSH + D S C +  G RKVKVN+VR S N +P  +G T+  G  
Sbjct: 121  DFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVN 180

Query: 713  NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886
            + V M+T + ++ NN  S GPT+ S   N IS+ P ++K D NF S+GH+ +KRDG+F+ 
Sbjct: 181  STVSMSTVYSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFIS 240

Query: 887  PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066
                YN  + ++LS+GQAF + + +  ++G  YEK ++NL S+  +Y +GQEN   +   
Sbjct: 241  VGHNYNKGNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPA 300

Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
            Y K NE+ IS  P   K E  I   G    + D  + ++     K  SSIL      +KG
Sbjct: 301  YGKPNESLISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378
            E  TISFGGF + + E + SG +IS Y++L+ NQ+SAQ+S  L QK+  +     + N  
Sbjct: 361  ESNTISFGGFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNA 419

Query: 1379 PASSSKPEGAPKSKDQKTKK 1438
            P  +S+ +  PK K+ KT K
Sbjct: 420  PKHNSRTDANPKHKEPKTAK 439


>ref|XP_007009432.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590563660|ref|XP_007009433.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508726345|gb|EOY18242.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508726346|gb|EOY18243.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 584

 Score =  271 bits (692), Expect = 7e-70
 Identities = 170/440 (38%), Positives = 250/440 (56%), Gaps = 25/440 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MSFQ+   WLP   G + NGE+ Y+ ++R + KR + WFMD++  EL  +K+QA++    
Sbjct: 1    MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60

Query: 374  --TSGPAVMESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERK 538
               SG A +  S WH+   FQ  S   +  LF  + +R+ N+ D N   V S  MNM RK
Sbjct: 61   RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D   Q+ N  S  L+MSH + D S C +  G RKVKVN+VR S N +P  +G T+  G  
Sbjct: 121  DFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVN 180

Query: 713  NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886
            + V M+T + ++ NN  S GPT+ S   N IS+ P ++K D NF S+GH+ +KRDG+F+ 
Sbjct: 181  STVSMSTVYSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFIS 240

Query: 887  PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066
                YN  + ++LS+GQAF + + +  ++G  YEK ++NL S+  +Y +GQEN   +   
Sbjct: 241  VGHNYNKGNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPA 300

Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
            Y K NE+ IS  P   K E  I   G    + D  + ++     K  SSIL      +KG
Sbjct: 301  YGKPNESLISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378
            E  TISFGGF + + E + SG +IS Y++L+ NQ+SAQ+S  L QK+  +     + N  
Sbjct: 361  ESNTISFGGFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNA 419

Query: 1379 PASSSKPEGAPKSKDQKTKK 1438
            P  +S+ +  PK K+ KT K
Sbjct: 420  PKHNSRTDANPKHKEPKTAK 439


>ref|XP_007009438.1| Uncharacterized protein isoform 7 [Theobroma cacao]
            gi|508726351|gb|EOY18248.1| Uncharacterized protein
            isoform 7 [Theobroma cacao]
          Length = 558

 Score =  263 bits (673), Expect = 1e-67
 Identities = 166/432 (38%), Positives = 245/432 (56%), Gaps = 25/432 (5%)
 Frame = +2

Query: 218  WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAV 391
            WLP   G + NGE+ Y+ ++R + KR + WFMD++  EL  +K+QA++       SG A 
Sbjct: 6    WLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIAD 65

Query: 392  MESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGN 562
            +  S WH+   FQ  S   +  LF  + +R+ N+ D N   V S  MNM RKD   Q+ N
Sbjct: 66   VNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVN 125

Query: 563  DQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733
              S  L+MSH + D S C +  G RKVKVN+VR S N +P  +G T+  G  + V M+T 
Sbjct: 126  SSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTV 185

Query: 734  FQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910
            + ++ NN  S GPT+ S   N IS+ P ++K D NF S+GH+ +KRDG+F+     YN  
Sbjct: 186  YSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKG 245

Query: 911  DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090
            + ++LS+GQAF + + +  ++G  YEK ++NL S+  +Y +GQEN   +   Y K NE+ 
Sbjct: 246  NESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESL 305

Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234
            IS  P   K E  I   G    + D  + ++     K  SSIL      +KGE  TISFG
Sbjct: 306  ISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFG 365

Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402
            GF + + E + SG +IS Y++L+ NQ+SAQ+S  L QK+  +     + N  P  +S+ +
Sbjct: 366  GFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD 424

Query: 1403 GAPKSKDQKTKK 1438
              PK K+ KT K
Sbjct: 425  ANPKHKEPKTAK 436


>ref|XP_007009435.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508726348|gb|EOY18245.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 479

 Score =  263 bits (673), Expect = 1e-67
 Identities = 166/432 (38%), Positives = 245/432 (56%), Gaps = 25/432 (5%)
 Frame = +2

Query: 218  WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAV 391
            WLP   G + NGE+ Y+ ++R + KR + WFMD++  EL  +K+QA++       SG A 
Sbjct: 6    WLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIAD 65

Query: 392  MESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGN 562
            +  S WH+   FQ  S   +  LF  + +R+ N+ D N   V S  MNM RKD   Q+ N
Sbjct: 66   VNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVN 125

Query: 563  DQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733
              S  L+MSH + D S C +  G RKVKVN+VR S N +P  +G T+  G  + V M+T 
Sbjct: 126  SSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTV 185

Query: 734  FQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910
            + ++ NN  S GPT+ S   N IS+ P ++K D NF S+GH+ +KRDG+F+     YN  
Sbjct: 186  YSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKG 245

Query: 911  DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090
            + ++LS+GQAF + + +  ++G  YEK ++NL S+  +Y +GQEN   +   Y K NE+ 
Sbjct: 246  NESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESL 305

Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234
            IS  P   K E  I   G    + D  + ++     K  SSIL      +KGE  TISFG
Sbjct: 306  ISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFG 365

Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402
            GF + + E + SG +IS Y++L+ NQ+SAQ+S  L QK+  +     + N  P  +S+ +
Sbjct: 366  GFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD 424

Query: 1403 GAPKSKDQKTKK 1438
              PK K+ KT K
Sbjct: 425  ANPKHKEPKTAK 436


>ref|XP_007009434.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508726347|gb|EOY18244.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 581

 Score =  263 bits (673), Expect = 1e-67
 Identities = 166/432 (38%), Positives = 245/432 (56%), Gaps = 25/432 (5%)
 Frame = +2

Query: 218  WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAV 391
            WLP   G + NGE+ Y+ ++R + KR + WFMD++  EL  +K+QA++       SG A 
Sbjct: 6    WLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIAD 65

Query: 392  MESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGN 562
            +  S WH+   FQ  S   +  LF  + +R+ N+ D N   V S  MNM RKD   Q+ N
Sbjct: 66   VNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVN 125

Query: 563  DQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733
              S  L+MSH + D S C +  G RKVKVN+VR S N +P  +G T+  G  + V M+T 
Sbjct: 126  SSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTV 185

Query: 734  FQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910
            + ++ NN  S GPT+ S   N IS+ P ++K D NF S+GH+ +KRDG+F+     YN  
Sbjct: 186  YSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKG 245

Query: 911  DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090
            + ++LS+GQAF + + +  ++G  YEK ++NL S+  +Y +GQEN   +   Y K NE+ 
Sbjct: 246  NESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESL 305

Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234
            IS  P   K E  I   G    + D  + ++     K  SSIL      +KGE  TISFG
Sbjct: 306  ISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFG 365

Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402
            GF + + E + SG +IS Y++L+ NQ+SAQ+S  L QK+  +     + N  P  +S+ +
Sbjct: 366  GFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD 424

Query: 1403 GAPKSKDQKTKK 1438
              PK K+ KT K
Sbjct: 425  ANPKHKEPKTAK 436


>ref|XP_007139261.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012394|gb|ESW11255.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
          Length = 472

 Score =  253 bits (647), Expect = 1e-64
 Identities = 162/436 (37%), Positives = 243/436 (55%), Gaps = 23/436 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MS+Q+   W+P  +G +A   + YE ++RI+ KR + WFMD+ E E++ +K+QAV+   G
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENVGYENSSRIEPKRSHQWFMDTGEPEIVSNKKQAVEDVSG 60

Query: 374  T--SGPAVMESSLWH--DGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538
               SG + +  S W    G H  +   +  LF     R+ N+ D N P  VS  MNM RK
Sbjct: 61   RPISGVSHVNVSQWDTSSGFHSVMGQFSDRLFGSDLARTVNLVDKNVPSIVSGNMNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D  HQ+GND S+ L++SH + D S CLN  G RKVKVN+VR S+NC+P        S E 
Sbjct: 121  DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPSAAMGHSYSRED 180

Query: 713  NDVMTTT--FQRTSNNMFSGPTFNSVVGNGISVDPAYS-KMDKNFASVGHSSSKRDGNFM 883
            N  ++    + +   N+  GPT+N    N I +    S K D N  SV H+ +K DG FM
Sbjct: 181  NSTISVGAGYNKNDGNISLGPTYNHRNDNTIGMGSRISSKTDDNLLSVAHNFNKGDGGFM 240

Query: 884  LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063
            L    Y   D ++LS+GQ F++G+ N  ++G  YEKE+ NL S+  +Y++G E+   +  
Sbjct: 241  LMGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYSKGHESFISIGP 300

Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
             + K  E FI+  P  KG  H+   G    + DS +AS    +++ +SS L       KG
Sbjct: 301  TFGKSGENFITVAPYDKGTDHLISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLS-ANAIPA 1384
            + +TISFGGF ++  E + SG +IS Y++L+ NQ+SAQ   +      T+  S  N+IP 
Sbjct: 361  QSSTISFGGFHDD-PEANPSGGIISGYDLLIGNQNSAQGLDSQNDLSETNTESLVNSIPK 419

Query: 1385 SSSKPEGAPKSKDQKT 1432
             ++K +   K+K+ KT
Sbjct: 420  LNTKNDTVVKNKEPKT 435


>ref|XP_007139260.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012393|gb|ESW11254.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
          Length = 503

 Score =  253 bits (647), Expect = 1e-64
 Identities = 162/436 (37%), Positives = 243/436 (55%), Gaps = 23/436 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MS+Q+   W+P  +G +A   + YE ++RI+ KR + WFMD+ E E++ +K+QAV+   G
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENVGYENSSRIEPKRSHQWFMDTGEPEIVSNKKQAVEDVSG 60

Query: 374  T--SGPAVMESSLWH--DGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538
               SG + +  S W    G H  +   +  LF     R+ N+ D N P  VS  MNM RK
Sbjct: 61   RPISGVSHVNVSQWDTSSGFHSVMGQFSDRLFGSDLARTVNLVDKNVPSIVSGNMNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D  HQ+GND S+ L++SH + D S CLN  G RKVKVN+VR S+NC+P        S E 
Sbjct: 121  DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPSAAMGHSYSRED 180

Query: 713  NDVMTTT--FQRTSNNMFSGPTFNSVVGNGISVDPAYS-KMDKNFASVGHSSSKRDGNFM 883
            N  ++    + +   N+  GPT+N    N I +    S K D N  SV H+ +K DG FM
Sbjct: 181  NSTISVGAGYNKNDGNISLGPTYNHRNDNTIGMGSRISSKTDDNLLSVAHNFNKGDGGFM 240

Query: 884  LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063
            L    Y   D ++LS+GQ F++G+ N  ++G  YEKE+ NL S+  +Y++G E+   +  
Sbjct: 241  LMGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYSKGHESFISIGP 300

Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
             + K  E FI+  P  KG  H+   G    + DS +AS    +++ +SS L       KG
Sbjct: 301  TFGKSGENFITVAPYDKGTDHLISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLS-ANAIPA 1384
            + +TISFGGF ++  E + SG +IS Y++L+ NQ+SAQ   +      T+  S  N+IP 
Sbjct: 361  QSSTISFGGFHDD-PEANPSGGIISGYDLLIGNQNSAQGLDSQNDLSETNTESLVNSIPK 419

Query: 1385 SSSKPEGAPKSKDQKT 1432
             ++K +   K+K+ KT
Sbjct: 420  LNTKNDTVVKNKEPKT 435


>ref|XP_007139258.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris]
            gi|593331666|ref|XP_007139259.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|593331672|ref|XP_007139262.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012391|gb|ESW11252.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012392|gb|ESW11253.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012395|gb|ESW11256.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
          Length = 583

 Score =  253 bits (647), Expect = 1e-64
 Identities = 162/436 (37%), Positives = 243/436 (55%), Gaps = 23/436 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MS+Q+   W+P  +G +A   + YE ++RI+ KR + WFMD+ E E++ +K+QAV+   G
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENVGYENSSRIEPKRSHQWFMDTGEPEIVSNKKQAVEDVSG 60

Query: 374  T--SGPAVMESSLWH--DGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538
               SG + +  S W    G H  +   +  LF     R+ N+ D N P  VS  MNM RK
Sbjct: 61   RPISGVSHVNVSQWDTSSGFHSVMGQFSDRLFGSDLARTVNLVDKNVPSIVSGNMNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D  HQ+GND S+ L++SH + D S CLN  G RKVKVN+VR S+NC+P        S E 
Sbjct: 121  DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPSAAMGHSYSRED 180

Query: 713  NDVMTTT--FQRTSNNMFSGPTFNSVVGNGISVDPAYS-KMDKNFASVGHSSSKRDGNFM 883
            N  ++    + +   N+  GPT+N    N I +    S K D N  SV H+ +K DG FM
Sbjct: 181  NSTISVGAGYNKNDGNISLGPTYNHRNDNTIGMGSRISSKTDDNLLSVAHNFNKGDGGFM 240

Query: 884  LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063
            L    Y   D ++LS+GQ F++G+ N  ++G  YEKE+ NL S+  +Y++G E+   +  
Sbjct: 241  LMGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYSKGHESFISIGP 300

Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
             + K  E FI+  P  KG  H+   G    + DS +AS    +++ +SS L       KG
Sbjct: 301  TFGKSGENFITVAPYDKGTDHLISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLS-ANAIPA 1384
            + +TISFGGF ++  E + SG +IS Y++L+ NQ+SAQ   +      T+  S  N+IP 
Sbjct: 361  QSSTISFGGFHDD-PEANPSGGIISGYDLLIGNQNSAQGLDSQNDLSETNTESLVNSIPK 419

Query: 1385 SSSKPEGAPKSKDQKT 1432
             ++K +   K+K+ KT
Sbjct: 420  LNTKNDTVVKNKEPKT 435


>ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
          Length = 581

 Score =  250 bits (639), Expect = 1e-63
 Identities = 166/440 (37%), Positives = 243/440 (55%), Gaps = 25/440 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MS+Q+   W+P  +G +A   + YE ++R++ KR + WFMD+ E E+  +K+QAV+   G
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENVGYENSSRVESKRSHKWFMDAGEPEIFSNKKQAVEAVSG 60

Query: 374  --TSGPAVMESSLW--HDGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538
               SG +    S W  + G H      +  LF     R+ N+ D N P  VS  +NM RK
Sbjct: 61   RPVSGVSHANVSQWDNNSGFHSVTSQFSDRLFGSDLARTVNLVDKNVPSIVSGNLNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D  HQ+GND S+ L+MSH + D S CLN  G RKVKVN+VR S+NC+P        S E 
Sbjct: 121  DFEHQYGNDPSVGLSMSHSIADTSSCLNFGGIRKVKVNQVRDSDNCMPAASMGHSYSRED 180

Query: 713  NDVMTTTFQRTSN---NMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFM 883
            N  ++       N   N+  GPT+N+V  N I++    SK D N  S+ H+ +K DG FM
Sbjct: 181  NSTISVGAGYNKNDGGNISLGPTYNNVNDNTIAMGSRMSKTDDNLLSMAHTFNKGDGGFM 240

Query: 884  LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063
            L    Y   D ++LS+GQ F++G+ N  ++G  YEKE+ NL S+  +Y +G EN   +  
Sbjct: 241  LLGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYTKGHENFIPVGP 300

Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
             Y K  E FI+  P  KG  HI   G    + DS +AS    F++ +SS L       KG
Sbjct: 301  TYGKSGENFITVAPYDKGTDHIISLGPTYDKVDSNIASTIPSFDRGDSSSLPVGQNHHKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378
            + ++ISFGGF ++      SG +IS Y++L+ +Q+SAQ  G   Q D T+   +   N+I
Sbjct: 361  QNSSISFGGFHDDPGPNIPSG-IISGYDLLIGSQNSAQ--GMDSQNDLTETNTESLVNSI 417

Query: 1379 PASSSKPEGAPKSKDQKTKK 1438
            P  ++K +   K+K+ KT K
Sbjct: 418  PKPNTKND-IVKNKEPKTTK 436


>gb|EXB65298.1| hypothetical protein L484_025377 [Morus notabilis]
          Length = 574

 Score =  249 bits (637), Expect = 2e-63
 Identities = 165/432 (38%), Positives = 247/432 (57%), Gaps = 26/432 (6%)
 Frame = +2

Query: 221  LPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGT--SGPAVM 394
            +P  +G +A+GE+ Y+ ++R++QKR   WFMD++  +L  +K+QAV+   G   SG   M
Sbjct: 1    MPKDAGCLADGEMGYDNSSRMEQKR-GQWFMDANGPQLF-NKKQAVEAVNGRPISGVPHM 58

Query: 395  ESSLWHDGPHFQLESHAPT--LFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGND 565
              S W +   FQ      T  LF  + VR+SN+ D N   + S  MNM RK    Q+GN 
Sbjct: 59   NVSQWDNTSGFQSVPGQFTDRLFGSEPVRNSNLVDRNVQSIGSGNMNMGRKGFESQYGNT 118

Query: 566  QSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTF 736
             S+ L+MSH + D S CLN  G RKVKVN+VR S+N L   +G+++G  E N + M  ++
Sbjct: 119  PSVGLSMSHTIEDPSSCLNFGGIRKVKVNQVRDSDNILNPSMGNSYGRVENNTISMGNSY 178

Query: 737  QRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGID 913
             ++ NN  S  P +N+   N IS+ P ++K D++F S+GH+ +K DGNF+     Y   D
Sbjct: 179  NKSDNNSISLAPAYNNGEENTISMGPTFTKADESFISIGHTFNKGDGNFISMGHNYGKGD 238

Query: 914  NNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFI 1093
            N +LS+ Q +++G+ N  ++G  YEK +  + S+  +YN+G E    + + Y K N  FI
Sbjct: 239  NGLLSMSQPYDKGDGNFISMGQSYEKGDGGVISLGTSYNKGHEEFISVGTTYGKANNNFI 298

Query: 1094 SAGPG--KGESHIAFQG-----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234
               P   KG   I   G     + DS V  +G  ++K +SS L       K E TTISFG
Sbjct: 299  QMAPSYIKGNDSIISMGPTPTYKADSNVVPMGPNYDKGDSSNLSMGQTYNKAESTTISFG 358

Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402
            GF ++  E + SG +ISSY++L+ NQ+SAQ+     QK+S D     S N+IP +  K +
Sbjct: 359  GF-HDEPETNPSGGIISSYDLLMSNQNSAQTLEVSEQKNSADFNVNPSVNSIPQADLKSD 417

Query: 1403 GAPKSKDQKTKK 1438
              PK+K+ KT K
Sbjct: 418  NIPKNKEPKTVK 429


>ref|XP_007009439.1| Uncharacterized protein isoform 8 [Theobroma cacao]
            gi|508726352|gb|EOY18249.1| Uncharacterized protein
            isoform 8 [Theobroma cacao]
          Length = 540

 Score =  247 bits (631), Expect = 8e-63
 Identities = 159/417 (38%), Positives = 236/417 (56%), Gaps = 25/417 (5%)
 Frame = +2

Query: 263  YETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAVMESSLWHDGPHFQLE 436
            Y+ ++R + KR + WFMD++  EL  +K+QA++       SG A +  S WH+   FQ  
Sbjct: 3    YDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIADVNVSPWHNASSFQSV 62

Query: 437  SH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGNDQSICLTMSHEVND- 604
            S   +  LF  + +R+ N+ D N   V S  MNM RKD   Q+ N  S  L+MSH + D 
Sbjct: 63   SSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVNSSSAGLSMSHTIEDP 122

Query: 605  SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTFQRTSNNMFS-GPTF 775
            S C +  G RKVKVN+VR S N +P  +G T+  G  + V M+T + ++ NN  S GPT+
Sbjct: 123  SSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTVYSKSDNNAISLGPTY 182

Query: 776  NSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGIDNNVLSIGQAFNRGN 955
             S   N IS+ P ++K D NF S+GH+ +KRDG+F+     YN  + ++LS+GQAF + +
Sbjct: 183  GSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKGNESILSVGQAFEKED 242

Query: 956  YNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFISAGP--GKGESHIA 1129
             +  ++G  YEK ++NL S+  +Y +GQEN   +   Y K NE+ IS  P   K E  I 
Sbjct: 243  GSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESLISMAPTFDKEEDTII 302

Query: 1130 FQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQNNADERDHSGRV 1279
              G    + D  + ++     K  SSIL      +KGE  TISFGGF + + E + SG +
Sbjct: 303  PMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFGGFHDES-ETNPSGSI 361

Query: 1280 ISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPEGAPKSKDQKTKK 1438
            IS Y++L+ NQ+SAQ+S  L QK+  +     + N  P  +S+ +  PK K+ KT K
Sbjct: 362  ISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTDANPKHKEPKTAK 418


>ref|XP_007009436.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508726349|gb|EOY18246.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 563

 Score =  247 bits (631), Expect = 8e-63
 Identities = 159/417 (38%), Positives = 236/417 (56%), Gaps = 25/417 (5%)
 Frame = +2

Query: 263  YETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAVMESSLWHDGPHFQLE 436
            Y+ ++R + KR + WFMD++  EL  +K+QA++       SG A +  S WH+   FQ  
Sbjct: 3    YDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIADVNVSPWHNASSFQSV 62

Query: 437  SH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGNDQSICLTMSHEVND- 604
            S   +  LF  + +R+ N+ D N   V S  MNM RKD   Q+ N  S  L+MSH + D 
Sbjct: 63   SSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVNSSSAGLSMSHTIEDP 122

Query: 605  SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTFQRTSNNMFS-GPTF 775
            S C +  G RKVKVN+VR S N +P  +G T+  G  + V M+T + ++ NN  S GPT+
Sbjct: 123  SSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTVYSKSDNNAISLGPTY 182

Query: 776  NSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGIDNNVLSIGQAFNRGN 955
             S   N IS+ P ++K D NF S+GH+ +KRDG+F+     YN  + ++LS+GQAF + +
Sbjct: 183  GSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKGNESILSVGQAFEKED 242

Query: 956  YNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFISAGP--GKGESHIA 1129
             +  ++G  YEK ++NL S+  +Y +GQEN   +   Y K NE+ IS  P   K E  I 
Sbjct: 243  GSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESLISMAPTFDKEEDTII 302

Query: 1130 FQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQNNADERDHSGRV 1279
              G    + D  + ++     K  SSIL      +KGE  TISFGGF + + E + SG +
Sbjct: 303  PMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFGGFHDES-ETNPSGSI 361

Query: 1280 ISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPEGAPKSKDQKTKK 1438
            IS Y++L+ NQ+SAQ+S  L QK+  +     + N  P  +S+ +  PK K+ KT K
Sbjct: 362  ISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTDANPKHKEPKTAK 418


>ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
          Length = 582

 Score =  244 bits (624), Expect = 5e-62
 Identities = 162/440 (36%), Positives = 241/440 (54%), Gaps = 25/440 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MS+Q+   W+P  +G +A     YE ++RI+ KR + WFMD+ E E+  +K+QAV+   G
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENAGYENSSRIEPKRSHQWFMDTGEPEIFSNKKQAVEAVSG 60

Query: 374  T--SGPAVMESSLW--HDGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538
               SG +    S W  + G H      +  LF     R+ N+ D N P  VS  +NM RK
Sbjct: 61   RPISGVSHANVSQWDTNSGFHSVTSQFSDRLFGSDLARTVNLVDKNVPSIVSGNLNMGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D  HQ+GND S+ L++SH + D S CLN  G RKVKVN+VR S+NC+P        S E 
Sbjct: 121  DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPAASMGPSYSRED 180

Query: 713  NDVMTTTFQRTSN---NMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFM 883
            N  ++       N   N+  GPT+N+   N I++    SK D N  S+ H+ SK DG FM
Sbjct: 181  NSTISVGAGYNKNDGDNISLGPTYNNGYDNTIAMGSRISKTDDNLLSMAHTFSKGDGGFM 240

Query: 884  LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063
            L    Y   D +++S+GQ F++G+ N  ++G  YEKE+ NL S+  +Y +  E+   +  
Sbjct: 241  LMGHNYGKGDESIVSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYTKVHESFIPVGP 300

Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
             Y K  E FI+  P  KG +HI   G    + DS +AS    +++ +SS L       KG
Sbjct: 301  TYGKSGENFITVAPYDKGTNHIISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378
            + ++ISFGGF ++  E +  G +IS Y++L+  Q+SAQ  G   Q D T+   +   N+I
Sbjct: 361  QSSSISFGGFHDD-PEPNTPGGIISGYDLLIGGQNSAQ--GLDSQNDLTETNTESLVNSI 417

Query: 1379 PASSSKPEGAPKSKDQKTKK 1438
            P  ++K +   K+K+ KT K
Sbjct: 418  PKPNTKNDIVVKNKEPKTTK 437


>ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
          Length = 582

 Score =  244 bits (623), Expect = 7e-62
 Identities = 158/438 (36%), Positives = 237/438 (54%), Gaps = 23/438 (5%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MSFQ+   W+P  +G + +GE+ Y++++RI+ KR + WFMD S  EL  SK+QA++    
Sbjct: 1    MSFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNS 60

Query: 374  TSGPAV--MESSLWHDGPHFQLESH-APTLFNPKAVRSSNVSDNNNPRVSATMNMERKDL 544
               P V  M  S W +     +  H    LF  + +R+ N+ D      +A M+M RK+ 
Sbjct: 61   RPVPGVPHMNVSPWENSSFQSVPGHFTDRLFGSEPIRTVNLVDRGISVGNANMDMGRKEF 120

Query: 545  GHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKND 718
             + F N+ S+ L+MS  + D S CLN  G RKVKVN+VR  +  +P  +G  +  G+   
Sbjct: 121  ENHFTNNPSVGLSMSQSIEDPSSCLNFGGIRKVKVNQVRDPDVGMPASLGHAYTRGDNCT 180

Query: 719  V-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPN 892
            + M T F +   N  S G T+NS   N ISV PAY K D NF S+GH+ SK DG+F+   
Sbjct: 181  ISMGTGFNKNHENTISLGQTYNSRDENAISVGPAYHKTDDNFISMGHAFSKGDGSFITIG 240

Query: 893  QYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYS 1072
              Y+  DN++LS+ Q F++G+ +  ++G  YEK   N+ S   +YN+GQEN   +   YS
Sbjct: 241  HNYSKGDNSILSMNQPFDKGDDSFISMGQSYEKAEGNIISFA-SYNKGQENFISMGPAYS 299

Query: 1073 KVNETFIS------AGPGKGESHIAFQGQQDSTVASLGALFNKENSSIL------RKGEE 1216
            K  +TFIS       G     S      + +S +  +G  F+K +S  +       KGE 
Sbjct: 300  KAGDTFISMASSFNKGNDDNLSMAPTYDKVNSDIVHVGPKFDKADSGAVSMAHNYHKGES 359

Query: 1217 TTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPA 1384
             TISFGGF +     + SG +ISSY++L+ NQ+SAQ+S     +DS D   +++ N    
Sbjct: 360  NTISFGGFDDENGTDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNINGAIK 419

Query: 1385 SSSKPEGAPKSKDQKTKK 1438
               K +   KSK+ +  K
Sbjct: 420  VDGKIDTNSKSKEPRMSK 437


>ref|XP_007218931.1| hypothetical protein PRUPE_ppa003346mg [Prunus persica]
            gi|462415393|gb|EMJ20130.1| hypothetical protein
            PRUPE_ppa003346mg [Prunus persica]
          Length = 583

 Score =  241 bits (614), Expect = 8e-61
 Identities = 159/442 (35%), Positives = 244/442 (55%), Gaps = 27/442 (6%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MSFQ    W+P  +  + +GE+ Y+ ++RI+ KR N WFMDS+  E   +K+QA++   G
Sbjct: 1    MSFQPKSFWIPRDASCLTDGEMGYDNSSRIESKRGNRWFMDSNGLEFFNNKKQAMEAVNG 60

Query: 374  --TSGPAVMESSLWHDGPHFQLESHAPT--LFNPKAVRSSNVSDNNNPRV-SATMNMERK 538
               SG   +  S W +   FQ      T  LF  + VR+ N+ D N   V S  MN+ RK
Sbjct: 61   RPVSGVPHLAISPWDNTSGFQSVPGQFTDRLFGSEPVRTVNLGDRNIQSVGSENMNLGRK 120

Query: 539  DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
                Q+GND S+ L+MSH + D S CLN  G RKVKVNEVR S++ +   +G ++  G+ 
Sbjct: 121  GFEDQYGNDPSVGLSMSHTIEDPSSCLNFGGIRKVKVNEVRDSDDVVSASMGHSYCKGDS 180

Query: 713  NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886
            N + M  T+ ++ +N  S G  +N+   N IS+ P+++K D NF S+GH+ SK + NF+ 
Sbjct: 181  NTMSMANTYNKSDDNAISLGSAYNTGEENAISIGPSFNKADDNFISMGHTFSKANSNFIS 240

Query: 887  PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066
                YN  DN++LS+GQ F++ + N  ++G  YEK +S+  S+  +Y++G EN   + + 
Sbjct: 241  MAHNYNKGDNSILSMGQPFDKEDGNFISMGQSYEKGDSSFISLGNSYHKGHENFISMGAT 300

Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSI-----LRKGE 1213
            Y K NE FIS  P   K   ++   G    + DS V  +G  ++K  S++       K E
Sbjct: 301  YGKANENFISMAPTYDKQTDNMMSMGPNYDKADSNVVPIGPPYHKGESNVSMSHNYNKNE 360

Query: 1214 ETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLSANAIP--- 1381
             TTISFG F +  D  + SG +ISSY++L+ NQ++A+ S    +    D + +N  P   
Sbjct: 361  STTISFGSFHHETD-TNPSGGIISSYDLLMNNQNTAEQS---EESGLKDPIQSNMDPNVD 416

Query: 1382 ---ASSSKPEGAPKSKDQKTKK 1438
                  SK +   K K+ KT +
Sbjct: 417  DALKLDSKTDTVSKIKEPKTAR 438


>ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
          Length = 561

 Score =  223 bits (568), Expect = 2e-55
 Identities = 149/415 (35%), Positives = 222/415 (53%), Gaps = 23/415 (5%)
 Frame = +2

Query: 263  YETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGTSGPAV--MESSLWHDGPHFQLE 436
            Y++++RI+ KR + WFMD S  EL  SK+QA++       P V  M  S W +     + 
Sbjct: 3    YDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNSRPVPGVPHMNVSPWENSSFQSVP 62

Query: 437  SH-APTLFNPKAVRSSNVSDNNNPRVSATMNMERKDLGHQFGNDQSICLTMSHEVND-SL 610
             H    LF  + +R+ N+ D      +A M+M RK+  + F N+ S+ L+MS  + D S 
Sbjct: 63   GHFTDRLFGSEPIRTVNLVDRGISVGNANMDMGRKEFENHFTNNPSVGLSMSQSIEDPSS 122

Query: 611  CLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTFQRTSNNMFS-GPTFNS 781
            CLN  G RKVKVN+VR  +  +P  +G  +  G+   + M T F +   N  S G T+NS
Sbjct: 123  CLNFGGIRKVKVNQVRDPDVGMPASLGHAYTRGDNCTISMGTGFNKNHENTISLGQTYNS 182

Query: 782  VVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGIDNNVLSIGQAFNRGNYN 961
               N ISV PAY K D NF S+GH+ SK DG+F+     Y+  DN++LS+ Q F++G+ +
Sbjct: 183  RDENAISVGPAYHKTDDNFISMGHAFSKGDGSFITIGHNYSKGDNSILSMNQPFDKGDDS 242

Query: 962  VDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFIS------AGPGKGESH 1123
              ++G  YEK   N+ S   +YN+GQEN   +   YSK  +TFIS       G     S 
Sbjct: 243  FISMGQSYEKAEGNIISFA-SYNKGQENFISMGPAYSKAGDTFISMASSFNKGNDDNLSM 301

Query: 1124 IAFQGQQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQNNADERDHSGRVIS 1285
                 + +S +  +G  F+K +S  +       KGE  TISFGGF +     + SG +IS
Sbjct: 302  APTYDKVNSDIVHVGPKFDKADSGAVSMAHNYHKGESNTISFGGFDDENGTDNPSGGIIS 361

Query: 1286 SYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPEGAPKSKDQKTKK 1438
            SY++L+ NQ+SAQ+S     +DS D   +++ N       K +   KSK+ +  K
Sbjct: 362  SYDLLMANQASAQASEVSTLRDSVDPNVEVNINGAIKVDGKIDTNSKSKEPRMSK 416


>ref|XP_002316304.2| hypothetical protein POPTR_0010s21640g, partial [Populus trichocarpa]
            gi|550330316|gb|EEF02475.2| hypothetical protein
            POPTR_0010s21640g, partial [Populus trichocarpa]
          Length = 644

 Score =  210 bits (535), Expect = 1e-51
 Identities = 143/407 (35%), Positives = 224/407 (55%), Gaps = 23/407 (5%)
 Frame = +2

Query: 197  SFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGT 376
            SFQ    W+    G + +G+I ++ ++R++ KR + W MDS+  EL  +K+QAV+P    
Sbjct: 1    SFQQKSFWMTRDVGCLTDGDIGFDNSSRMEPKRGHQWLMDSTGPELFSNKKQAVEPSSNN 60

Query: 377  S---GPAVMESSLWHDGPHFQLESHA--PTLFNPKAVRSSNVSDNNNPRVS-ATMNMERK 538
                G + M  S W++   FQ  S      LF  + +R +  S +N P  S   MNMERK
Sbjct: 61   RPVMGMSHMNISPWNNTSCFQSVSGQFNDRLFGFEPLRIN--SGSNVPSASNGNMNMERK 118

Query: 539  DLGHQFGNDQSICLTMSHEVNDSLCLNS--GPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
            D    +G++ S+ L+MSH V D     S  G RKV+VN+VR S N +   VG ++  G+ 
Sbjct: 119  DFNDLYGSNCSMGLSMSHNVEDPPASISFGGLRKVRVNQVRDSSNDISSSVGHSYSRGDD 178

Query: 713  NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886
            N + M T + +  +N  S G T+N+   N IS+ P +SK D +F S+GH+ +K D NF+ 
Sbjct: 179  NIISMGTAYNKRESNAISLGSTYNNGDENTISISPTFSKADGSFISMGHAFNKDDDNFIS 238

Query: 887  PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066
              Q YN  D ++LS+GQ F++ + N   +G  Y+KE+++  S+  +YN+G E+   +   
Sbjct: 239  MGQGYNKGDESILSMGQPFDKKDANFITMGPSYDKEDNHFISMALSYNKGHESFISMGPS 298

Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
            Y K +E FI  G    KG  ++   G    + D  +AS+    +K NS IL       KG
Sbjct: 299  YDKTSENFILMGSSFSKGGDNVISNGPIYDKADIDIASMTPAQDKGNSGILSIGHNYNKG 358

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKD 1348
            +  +ISF  F ++  E + SG VI  Y++L+ NQ++AQ+S   VQ +
Sbjct: 359  DNNSISFQSF-HDEPETNMSGNVIRGYDLLVSNQNTAQTSEVPVQNN 404


>ref|XP_002311151.2| hypothetical protein POPTR_0008s05120g [Populus trichocarpa]
            gi|550332456|gb|EEE88518.2| hypothetical protein
            POPTR_0008s05120g [Populus trichocarpa]
          Length = 616

 Score =  209 bits (532), Expect = 2e-51
 Identities = 146/425 (34%), Positives = 231/425 (54%), Gaps = 26/425 (6%)
 Frame = +2

Query: 218  WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGT---SGPA 388
            W+   +G + +G++ ++ ++R++ K  + WFMDS   EL  +K+QAV+        +G +
Sbjct: 6    WITRDAGCLNDGDVGFDNSSRMEAKHSHQWFMDSPGPELFSNKKQAVEHSSNNRPVAGMS 65

Query: 389  VMESSLWHDGPHFQLES--HAPTLFNPKAVRSSNVSDNNNPRVSATMNMERKDLGHQFGN 562
             M  S W++   FQ  S   +  LF  + +R +N S N     +  MNM RKD    +G+
Sbjct: 66   HMNISPWNNTSSFQSVSGHFSDRLFGSEPLRPNNGS-NFLSSGNGNMNMGRKDF--IYGS 122

Query: 563  DQSICLTMSHEVNDSLCLNS--GPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733
            + S+ L+M+H + D     S  G RKVKVN+VR S   +   VG ++  G+ N + M   
Sbjct: 123  NCSMGLSMTHNIEDPSASISFGGIRKVKVNQVRDSN--ISSSVGHSYARGDDNIISMGPA 180

Query: 734  F-QRTSNNMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910
            + +R SN +  G T+N+   N IS+ P +SK D NF S+ H+ SK DGNF+     YN  
Sbjct: 181  YNKRESNTISLGSTYNNGDENTISISPTFSKADGNFISIRHAFSKDDGNFISMGHNYNKG 240

Query: 911  DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090
            D ++LS+GQ F++ + N   +G  Y+KEN++  S+ P+YN+G +N   +   Y K +E F
Sbjct: 241  DESMLSMGQPFDKEDANFITIGPSYDKENNHFISMAPSYNKGHDNFISMGPSYDKTSENF 300

Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234
            I  G    KG  +I   G    + DS + S+    +K NS IL       KG+   ISFG
Sbjct: 301  ILMGSSFSKGGDNIISNGPAYDKADSDITSMAPAQDKGNSGILSMGHNYNKGDNNAISFG 360

Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLSANAIPASSSKP---- 1399
            GF ++  E + SG +I+ YE+L+ NQ +AQ+S         + LS N +P +++ P    
Sbjct: 361  GF-HDEPETNSSGNIITGYELLVSNQDTAQTS---------EVLSQNVLPQANADPQLNT 410

Query: 1400 EGAPK 1414
            + APK
Sbjct: 411  DSAPK 415

