BLASTX nr result
ID: Mentha22_contig00045496
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00045496
         (1443 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU38347.1| hypothetical protein MIMGU_mgv1a005691mg [Mimulus...   276   1e-71
ref|XP_007009440.1| Uncharacterized protein isoform 9 [Theobroma...   271   7e-70
ref|XP_007009437.1| Uncharacterized protein isoform 6 [Theobroma...   271   7e-70
ref|XP_007009432.1| Uncharacterized protein isoform 1 [Theobroma...   271   7e-70
ref|XP_007009438.1| Uncharacterized protein isoform 7 [Theobroma...   263   1e-67
ref|XP_007009435.1| Uncharacterized protein isoform 4 [Theobroma...   263   1e-67
ref|XP_007009434.1| Uncharacterized protein isoform 3 [Theobroma...   263   1e-67
ref|XP_007139261.1| hypothetical protein PHAVU_008G014500g [Phas...   253   1e-64
ref|XP_007139260.1| hypothetical protein PHAVU_008G014500g [Phas...   253   1e-64
ref|XP_007139258.1| hypothetical protein PHAVU_008G014500g [Phas...   253   1e-64
ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787...   250   1e-63
gb|EXB65298.1| hypothetical protein L484_025377 [Morus notabilis]     249   2e-63
ref|XP_007009439.1| Uncharacterized protein isoform 8 [Theobroma...   247   8e-63
ref|XP_007009436.1| Uncharacterized protein isoform 5 [Theobroma...   247   8e-63
ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782...   244   5e-62
ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206...   244   7e-62
ref|XP_007218931.1| hypothetical protein PRUPE_ppa003346mg [Prun...   241   8e-61
ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cuc...   223   2e-55
ref|XP_002316304.2| hypothetical protein POPTR_0010s21640g, part...   210   1e-51
ref|XP_002311151.2| hypothetical protein POPTR_0008s05120g [Popu...   209   2e-51

>gb|EYU38347.1| hypothetical protein MIMGU_mgv1a005691mg [Mimulus guttatus]
          Length = 474

 Score = 276 bits (707), Expect = 1e-71
 Identities = 177/426 (41%), Positives = 237/426 (55%), Gaps = 10/426 (2%)
 Frame = +2

Query: 194  MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373
            MSFQNN IWL N SGS+ANGE+CY++ TRIDQKR + WF SEQEL +K+QAV+ +
Sbjct: 1    MSFQNNGIWLTNSSGSLANGEMCYDSTTRIDQKRSHQWFTGPSEQELFTNKKQAVESTRE 60

Query: 374  TSGPAVMESSLWHDGPHFQLESHA-PTLFNPKAVRSSNVSDNNNPRVSATMNMERKDLGH 550
            + P ++ W DG + Q E LF PK VR
Sbjct: 61   VTEPVTVDG-FWRDGSNSQSEGQTGDRLFAPKPVR------------------------- 94

Query: 551  QFGNDQSICLTMSHEVNDSLCLNSGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDVMTT 730
            + D L LN+G RKVKVNEV I +NC PE++G+T
Sbjct: 95   ---------------LEDPLSLNTGLRKVKVNEVTIPDNCFPEFMGNTM----------- 128

Query: 731  TFQRTSNNMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910
            FQR+ +NM+S PT N+ D NFML NQYYNGI
Sbjct: 129  -FQRSGSNMYSEPTNNT-----------------------------DANFMLTNQYYNGI 158

Query: 911  DNNVLSIGQAFNRGNYNVDALGDQYEKENS-NLSSVCPTY-NRGQENLFGLESFYSKVNE 1084
            DNN+LSIG FN GNY+ QYEKE S N ++ P Y ++G +N F +E F +++NE
Sbjct: 159  DNNLLSIG--FNGGNYS-STHTVQYEKEASCNFVAISPNYGSKGHDNFFAVEPFCNRLNE 215

Query: 1085 TFISAGPGKGESHIAFQGQQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQN 1246
            TF++AG ++ Q D+ + SLG+L+NKENS +L + GEE TISFGG ++
Sbjct: 216  TFMTAGSTYNNNNNI---QHDAPIVSLGSLYNKENSGLLSMVANSKNGEEATISFGGVED 272

Query: 1247 NADERDHSGRVISSYEVLLNQSSAQSSGALVQKDSTDQLSANAIPASSSKPEGAP-KSKD 1423
            + +ERD SGR+IS+Y++L NQS+ Q+ AL QLSAN + A++SK +GA K+K+
Sbjct: 273  SHEERDLSGRLISNYDLLANQSTGQNESAL-------QLSANVVTAATSKTDGAQIKNKE 325

Query: 1424 QKTKKG 1441
            QKTKKG
Sbjct: 326  QKTKKG 331

>ref|XP_007009440.1| Uncharacterized protein isoform 9 [Theobroma cacao]
 gi|508726353|gb|EOY18250.1| Uncharacterized protein isoform 9 [Theobroma cacao]
          Length = 477

 Score = 271 bits (692), Expect = 7e-70
 Identities = 170/440 (38%), Positives = 250/440 (56%), Gaps = 25/440 (5%)
Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MSFQ+ WLP G + NGE+ Y+ ++R + KR + WFMD++ EL +K+QA++ Sbjct: 1 MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60 Query: 374 --TSGPAVMESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERK 538 SG A + S WH+ FQ S + LF + +R+ N+ D N V S MNM RK Sbjct: 61 RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D Q+ N S L+MSH + D S C + G RKVKVN+VR S N +P +G T+ G Sbjct: 121 DFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVN 180 Query: 713 NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886 + V M+T + ++ NN S GPT+ S N IS+ P ++K D NF S+GH+ +KRDG+F+ Sbjct: 181 STVSMSTVYSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFIS 240 Query: 887 PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066 YN + ++LS+GQAF + + + ++G YEK ++NL S+ +Y +GQEN + Sbjct: 241 VGHNYNKGNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPA 300 Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 Y K NE+ IS P K E I G + D + ++ K SSIL +KG Sbjct: 301 YGKPNESLISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKG 360 Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378 E TISFGGF + + E + SG +IS Y++L+ NQ+SAQ+S L QK+ + + N Sbjct: 361 ESNTISFGGFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNA 419 Query: 1379 PASSSKPEGAPKSKDQKTKK 1438 P +S+ + PK K+ KT K Sbjct: 420 PKHNSRTDANPKHKEPKTAK 439 >ref|XP_007009437.1| Uncharacterized protein isoform 6 [Theobroma cacao] gi|508726350|gb|EOY18247.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 561 Score = 271 bits (692), Expect = 7e-70 Identities = 170/440 (38%), Positives = 250/440 (56%), Gaps = 25/440 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MSFQ+ WLP G + NGE+ Y+ ++R + KR + WFMD++ EL +K+QA++ Sbjct: 1 
MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60 Query: 374 --TSGPAVMESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERK 538 SG A + S WH+ FQ S + LF + +R+ N+ D N V S MNM RK Sbjct: 61 RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D Q+ N S L+MSH + D S C + G RKVKVN+VR S N +P +G T+ G Sbjct: 121 DFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVN 180 Query: 713 NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886 + V M+T + ++ NN S GPT+ S N IS+ P ++K D NF S+GH+ +KRDG+F+ Sbjct: 181 STVSMSTVYSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFIS 240 Query: 887 PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066 YN + ++LS+GQAF + + + ++G YEK ++NL S+ +Y +GQEN + Sbjct: 241 VGHNYNKGNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPA 300 Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 Y K NE+ IS P K E I G + D + ++ K SSIL +KG Sbjct: 301 YGKPNESLISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKG 360 Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378 E TISFGGF + + E + SG +IS Y++L+ NQ+SAQ+S L QK+ + + N Sbjct: 361 ESNTISFGGFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNA 419 Query: 1379 PASSSKPEGAPKSKDQKTKK 1438 P +S+ + PK K+ KT K Sbjct: 420 PKHNSRTDANPKHKEPKTAK 439 >ref|XP_007009432.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590563660|ref|XP_007009433.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508726345|gb|EOY18242.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508726346|gb|EOY18243.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 584 Score = 271 bits (692), Expect = 7e-70 Identities = 170/440 (38%), Positives = 250/440 (56%), Gaps = 25/440 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MSFQ+ WLP G + NGE+ Y+ ++R + KR + WFMD++ EL +K+QA++ Sbjct: 1 
MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60 Query: 374 --TSGPAVMESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERK 538 SG A + S WH+ FQ S + LF + +R+ N+ D N V S MNM RK Sbjct: 61 RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D Q+ N S L+MSH + D S C + G RKVKVN+VR S N +P +G T+ G Sbjct: 121 DFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVN 180 Query: 713 NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886 + V M+T + ++ NN S GPT+ S N IS+ P ++K D NF S+GH+ +KRDG+F+ Sbjct: 181 STVSMSTVYSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFIS 240 Query: 887 PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066 YN + ++LS+GQAF + + + ++G YEK ++NL S+ +Y +GQEN + Sbjct: 241 VGHNYNKGNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPA 300 Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 Y K NE+ IS P K E I G + D + ++ K SSIL +KG Sbjct: 301 YGKPNESLISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKG 360 Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378 E TISFGGF + + E + SG +IS Y++L+ NQ+SAQ+S L QK+ + + N Sbjct: 361 ESNTISFGGFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNA 419 Query: 1379 PASSSKPEGAPKSKDQKTKK 1438 P +S+ + PK K+ KT K Sbjct: 420 PKHNSRTDANPKHKEPKTAK 439 >ref|XP_007009438.1| Uncharacterized protein isoform 7 [Theobroma cacao] gi|508726351|gb|EOY18248.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 558 Score = 263 bits (673), Expect = 1e-67 Identities = 166/432 (38%), Positives = 245/432 (56%), Gaps = 25/432 (5%) Frame = +2 Query: 218 WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAV 391 WLP G + NGE+ Y+ ++R + KR + WFMD++ EL +K+QA++ SG A Sbjct: 6 WLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIAD 65 Query: 392 MESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGN 562 + S WH+ FQ S + LF + +R+ N+ D N V S MNM RKD Q+ N Sbjct: 66 
VNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVN 125 Query: 563 DQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733 S L+MSH + D S C + G RKVKVN+VR S N +P +G T+ G + V M+T Sbjct: 126 SSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTV 185 Query: 734 FQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910 + ++ NN S GPT+ S N IS+ P ++K D NF S+GH+ +KRDG+F+ YN Sbjct: 186 YSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKG 245 Query: 911 DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090 + ++LS+GQAF + + + ++G YEK ++NL S+ +Y +GQEN + Y K NE+ Sbjct: 246 NESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESL 305 Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234 IS P K E I G + D + ++ K SSIL +KGE TISFG Sbjct: 306 ISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFG 365 Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402 GF + + E + SG +IS Y++L+ NQ+SAQ+S L QK+ + + N P +S+ + Sbjct: 366 GFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD 424 Query: 1403 GAPKSKDQKTKK 1438 PK K+ KT K Sbjct: 425 ANPKHKEPKTAK 436 >ref|XP_007009435.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508726348|gb|EOY18245.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 479 Score = 263 bits (673), Expect = 1e-67 Identities = 166/432 (38%), Positives = 245/432 (56%), Gaps = 25/432 (5%) Frame = +2 Query: 218 WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAV 391 WLP G + NGE+ Y+ ++R + KR + WFMD++ EL +K+QA++ SG A Sbjct: 6 WLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIAD 65 Query: 392 MESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGN 562 + S WH+ FQ S + LF + +R+ N+ D N V S MNM RKD Q+ N Sbjct: 66 VNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVN 125 Query: 563 DQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733 S L+MSH + D S C + G RKVKVN+VR S N +P +G T+ G + V M+T Sbjct: 126 
SSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTV 185 Query: 734 FQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910 + ++ NN S GPT+ S N IS+ P ++K D NF S+GH+ +KRDG+F+ YN Sbjct: 186 YSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKG 245 Query: 911 DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090 + ++LS+GQAF + + + ++G YEK ++NL S+ +Y +GQEN + Y K NE+ Sbjct: 246 NESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESL 305 Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234 IS P K E I G + D + ++ K SSIL +KGE TISFG Sbjct: 306 ISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFG 365 Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402 GF + + E + SG +IS Y++L+ NQ+SAQ+S L QK+ + + N P +S+ + Sbjct: 366 GFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD 424 Query: 1403 GAPKSKDQKTKK 1438 PK K+ KT K Sbjct: 425 ANPKHKEPKTAK 436 >ref|XP_007009434.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508726347|gb|EOY18244.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 581 Score = 263 bits (673), Expect = 1e-67 Identities = 166/432 (38%), Positives = 245/432 (56%), Gaps = 25/432 (5%) Frame = +2 Query: 218 WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAV 391 WLP G + NGE+ Y+ ++R + KR + WFMD++ EL +K+QA++ SG A Sbjct: 6 WLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIAD 65 Query: 392 MESSLWHDGPHFQLESH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGN 562 + S WH+ FQ S + LF + +R+ N+ D N V S MNM RKD Q+ N Sbjct: 66 VNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVN 125 Query: 563 DQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733 S L+MSH + D S C + G RKVKVN+VR S N +P +G T+ G + V M+T Sbjct: 126 SSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTV 185 Query: 734 FQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910 + ++ NN S GPT+ S N IS+ P ++K D NF S+GH+ +KRDG+F+ YN Sbjct: 186 
YSKSDNNAISLGPTYGSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKG 245 Query: 911 DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090 + ++LS+GQAF + + + ++G YEK ++NL S+ +Y +GQEN + Y K NE+ Sbjct: 246 NESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESL 305 Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234 IS P K E I G + D + ++ K SSIL +KGE TISFG Sbjct: 306 ISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFG 365 Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402 GF + + E + SG +IS Y++L+ NQ+SAQ+S L QK+ + + N P +S+ + Sbjct: 366 GFHDES-ETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD 424 Query: 1403 GAPKSKDQKTKK 1438 PK K+ KT K Sbjct: 425 ANPKHKEPKTAK 436 >ref|XP_007139261.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] gi|561012394|gb|ESW11255.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] Length = 472 Score = 253 bits (647), Expect = 1e-64 Identities = 162/436 (37%), Positives = 243/436 (55%), Gaps = 23/436 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MS+Q+ W+P +G +A + YE ++RI+ KR + WFMD+ E E++ +K+QAV+ G Sbjct: 1 MSYQHKSFWMPRDAGCMAEENVGYENSSRIEPKRSHQWFMDTGEPEIVSNKKQAVEDVSG 60 Query: 374 T--SGPAVMESSLWH--DGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538 SG + + S W G H + + LF R+ N+ D N P VS MNM RK Sbjct: 61 RPISGVSHVNVSQWDTSSGFHSVMGQFSDRLFGSDLARTVNLVDKNVPSIVSGNMNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D HQ+GND S+ L++SH + D S CLN G RKVKVN+VR S+NC+P S E Sbjct: 121 DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPSAAMGHSYSRED 180 Query: 713 NDVMTTT--FQRTSNNMFSGPTFNSVVGNGISVDPAYS-KMDKNFASVGHSSSKRDGNFM 883 N ++ + + N+ GPT+N N I + S K D N SV H+ +K DG FM Sbjct: 181 NSTISVGAGYNKNDGNISLGPTYNHRNDNTIGMGSRISSKTDDNLLSVAHNFNKGDGGFM 240 Query: 884 LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063 L Y D ++LS+GQ F++G+ N ++G YEKE+ NL S+ +Y++G E+ + Sbjct: 241 
LMGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYSKGHESFISIGP 300 Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 + K E FI+ P KG H+ G + DS +AS +++ +SS L KG Sbjct: 301 TFGKSGENFITVAPYDKGTDHLISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360 Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLS-ANAIPA 1384 + +TISFGGF ++ E + SG +IS Y++L+ NQ+SAQ + T+ S N+IP Sbjct: 361 QSSTISFGGFHDD-PEANPSGGIISGYDLLIGNQNSAQGLDSQNDLSETNTESLVNSIPK 419 Query: 1385 SSSKPEGAPKSKDQKT 1432 ++K + K+K+ KT Sbjct: 420 LNTKNDTVVKNKEPKT 435 >ref|XP_007139260.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] gi|561012393|gb|ESW11254.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] Length = 503 Score = 253 bits (647), Expect = 1e-64 Identities = 162/436 (37%), Positives = 243/436 (55%), Gaps = 23/436 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MS+Q+ W+P +G +A + YE ++RI+ KR + WFMD+ E E++ +K+QAV+ G Sbjct: 1 MSYQHKSFWMPRDAGCMAEENVGYENSSRIEPKRSHQWFMDTGEPEIVSNKKQAVEDVSG 60 Query: 374 T--SGPAVMESSLWH--DGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538 SG + + S W G H + + LF R+ N+ D N P VS MNM RK Sbjct: 61 RPISGVSHVNVSQWDTSSGFHSVMGQFSDRLFGSDLARTVNLVDKNVPSIVSGNMNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D HQ+GND S+ L++SH + D S CLN G RKVKVN+VR S+NC+P S E Sbjct: 121 DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPSAAMGHSYSRED 180 Query: 713 NDVMTTT--FQRTSNNMFSGPTFNSVVGNGISVDPAYS-KMDKNFASVGHSSSKRDGNFM 883 N ++ + + N+ GPT+N N I + S K D N SV H+ +K DG FM Sbjct: 181 NSTISVGAGYNKNDGNISLGPTYNHRNDNTIGMGSRISSKTDDNLLSVAHNFNKGDGGFM 240 Query: 884 LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063 L Y D ++LS+GQ F++G+ N ++G YEKE+ NL S+ +Y++G E+ + Sbjct: 241 LMGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYSKGHESFISIGP 300 Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 + K E FI+ P KG H+ G + DS +AS +++ +SS L KG Sbjct: 301 
TFGKSGENFITVAPYDKGTDHLISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360 Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLS-ANAIPA 1384 + +TISFGGF ++ E + SG +IS Y++L+ NQ+SAQ + T+ S N+IP Sbjct: 361 QSSTISFGGFHDD-PEANPSGGIISGYDLLIGNQNSAQGLDSQNDLSETNTESLVNSIPK 419 Query: 1385 SSSKPEGAPKSKDQKT 1432 ++K + K+K+ KT Sbjct: 420 LNTKNDTVVKNKEPKT 435 >ref|XP_007139258.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] gi|593331666|ref|XP_007139259.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] gi|593331672|ref|XP_007139262.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] gi|561012391|gb|ESW11252.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] gi|561012392|gb|ESW11253.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] gi|561012395|gb|ESW11256.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris] Length = 583 Score = 253 bits (647), Expect = 1e-64 Identities = 162/436 (37%), Positives = 243/436 (55%), Gaps = 23/436 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MS+Q+ W+P +G +A + YE ++RI+ KR + WFMD+ E E++ +K+QAV+ G Sbjct: 1 MSYQHKSFWMPRDAGCMAEENVGYENSSRIEPKRSHQWFMDTGEPEIVSNKKQAVEDVSG 60 Query: 374 T--SGPAVMESSLWH--DGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538 SG + + S W G H + + LF R+ N+ D N P VS MNM RK Sbjct: 61 RPISGVSHVNVSQWDTSSGFHSVMGQFSDRLFGSDLARTVNLVDKNVPSIVSGNMNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D HQ+GND S+ L++SH + D S CLN G RKVKVN+VR S+NC+P S E Sbjct: 121 DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPSAAMGHSYSRED 180 Query: 713 NDVMTTT--FQRTSNNMFSGPTFNSVVGNGISVDPAYS-KMDKNFASVGHSSSKRDGNFM 883 N ++ + + N+ GPT+N N I + S K D N SV H+ +K DG FM Sbjct: 181 NSTISVGAGYNKNDGNISLGPTYNHRNDNTIGMGSRISSKTDDNLLSVAHNFNKGDGGFM 240 Query: 884 LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063 L Y D ++LS+GQ F++G+ N ++G YEKE+ NL S+ +Y++G E+ + Sbjct: 241 
LMGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYSKGHESFISIGP 300 Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 + K E FI+ P KG H+ G + DS +AS +++ +SS L KG Sbjct: 301 TFGKSGENFITVAPYDKGTDHLISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360 Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLS-ANAIPA 1384 + +TISFGGF ++ E + SG +IS Y++L+ NQ+SAQ + T+ S N+IP Sbjct: 361 QSSTISFGGFHDD-PEANPSGGIISGYDLLIGNQNSAQGLDSQNDLSETNTESLVNSIPK 419 Query: 1385 SSSKPEGAPKSKDQKT 1432 ++K + K+K+ KT Sbjct: 420 LNTKNDTVVKNKEPKT 435 >ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max] Length = 581 Score = 250 bits (639), Expect = 1e-63 Identities = 166/440 (37%), Positives = 243/440 (55%), Gaps = 25/440 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MS+Q+ W+P +G +A + YE ++R++ KR + WFMD+ E E+ +K+QAV+ G Sbjct: 1 MSYQHKSFWMPRDAGCMAEENVGYENSSRVESKRSHKWFMDAGEPEIFSNKKQAVEAVSG 60 Query: 374 --TSGPAVMESSLW--HDGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538 SG + S W + G H + LF R+ N+ D N P VS +NM RK Sbjct: 61 RPVSGVSHANVSQWDNNSGFHSVTSQFSDRLFGSDLARTVNLVDKNVPSIVSGNLNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D HQ+GND S+ L+MSH + D S CLN G RKVKVN+VR S+NC+P S E Sbjct: 121 DFEHQYGNDPSVGLSMSHSIADTSSCLNFGGIRKVKVNQVRDSDNCMPAASMGHSYSRED 180 Query: 713 NDVMTTTFQRTSN---NMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFM 883 N ++ N N+ GPT+N+V N I++ SK D N S+ H+ +K DG FM Sbjct: 181 NSTISVGAGYNKNDGGNISLGPTYNNVNDNTIAMGSRMSKTDDNLLSMAHTFNKGDGGFM 240 Query: 884 LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063 L Y D ++LS+GQ F++G+ N ++G YEKE+ NL S+ +Y +G EN + Sbjct: 241 LLGHNYGKGDESILSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYTKGHENFIPVGP 300 Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 Y K E FI+ P KG HI G + DS +AS F++ +SS L KG Sbjct: 301 TYGKSGENFITVAPYDKGTDHIISLGPTYDKVDSNIASTIPSFDRGDSSSLPVGQNHHKG 360 Query: 1211 
EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378 + ++ISFGGF ++ SG +IS Y++L+ +Q+SAQ G Q D T+ + N+I Sbjct: 361 QNSSISFGGFHDDPGPNIPSG-IISGYDLLIGSQNSAQ--GMDSQNDLTETNTESLVNSI 417 Query: 1379 PASSSKPEGAPKSKDQKTKK 1438 P ++K + K+K+ KT K Sbjct: 418 PKPNTKND-IVKNKEPKTTK 436 >gb|EXB65298.1| hypothetical protein L484_025377 [Morus notabilis] Length = 574 Score = 249 bits (637), Expect = 2e-63 Identities = 165/432 (38%), Positives = 247/432 (57%), Gaps = 26/432 (6%) Frame = +2 Query: 221 LPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGT--SGPAVM 394 +P +G +A+GE+ Y+ ++R++QKR WFMD++ +L +K+QAV+ G SG M Sbjct: 1 MPKDAGCLADGEMGYDNSSRMEQKR-GQWFMDANGPQLF-NKKQAVEAVNGRPISGVPHM 58 Query: 395 ESSLWHDGPHFQLESHAPT--LFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGND 565 S W + FQ T LF + VR+SN+ D N + S MNM RK Q+GN Sbjct: 59 NVSQWDNTSGFQSVPGQFTDRLFGSEPVRNSNLVDRNVQSIGSGNMNMGRKGFESQYGNT 118 Query: 566 QSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTF 736 S+ L+MSH + D S CLN G RKVKVN+VR S+N L +G+++G E N + M ++ Sbjct: 119 PSVGLSMSHTIEDPSSCLNFGGIRKVKVNQVRDSDNILNPSMGNSYGRVENNTISMGNSY 178 Query: 737 QRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGID 913 ++ NN S P +N+ N IS+ P ++K D++F S+GH+ +K DGNF+ Y D Sbjct: 179 NKSDNNSISLAPAYNNGEENTISMGPTFTKADESFISIGHTFNKGDGNFISMGHNYGKGD 238 Query: 914 NNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFI 1093 N +LS+ Q +++G+ N ++G YEK + + S+ +YN+G E + + Y K N FI Sbjct: 239 NGLLSMSQPYDKGDGNFISMGQSYEKGDGGVISLGTSYNKGHEEFISVGTTYGKANNNFI 298 Query: 1094 SAGPG--KGESHIAFQG-----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234 P KG I G + DS V +G ++K +SS L K E TTISFG Sbjct: 299 QMAPSYIKGNDSIISMGPTPTYKADSNVVPMGPNYDKGDSSNLSMGQTYNKAESTTISFG 358 Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPE 1402 GF ++ E + SG +ISSY++L+ NQ+SAQ+ QK+S D S N+IP + K + Sbjct: 359 GF-HDEPETNPSGGIISSYDLLMSNQNSAQTLEVSEQKNSADFNVNPSVNSIPQADLKSD 417 Query: 1403 GAPKSKDQKTKK 1438 PK+K+ KT K Sbjct: 418 NIPKNKEPKTVK 429 >ref|XP_007009439.1| Uncharacterized 
protein isoform 8 [Theobroma cacao] gi|508726352|gb|EOY18249.1| Uncharacterized protein isoform 8 [Theobroma cacao] Length = 540 Score = 247 bits (631), Expect = 8e-63 Identities = 159/417 (38%), Positives = 236/417 (56%), Gaps = 25/417 (5%) Frame = +2 Query: 263 YETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAVMESSLWHDGPHFQLE 436 Y+ ++R + KR + WFMD++ EL +K+QA++ SG A + S WH+ FQ Sbjct: 3 YDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIADVNVSPWHNASSFQSV 62 Query: 437 SH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGNDQSICLTMSHEVND- 604 S + LF + +R+ N+ D N V S MNM RKD Q+ N S L+MSH + D Sbjct: 63 SSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVNSSSAGLSMSHTIEDP 122 Query: 605 SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTFQRTSNNMFS-GPTF 775 S C + G RKVKVN+VR S N +P +G T+ G + V M+T + ++ NN S GPT+ Sbjct: 123 SSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTVYSKSDNNAISLGPTY 182 Query: 776 NSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGIDNNVLSIGQAFNRGN 955 S N IS+ P ++K D NF S+GH+ +KRDG+F+ YN + ++LS+GQAF + + Sbjct: 183 GSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKGNESILSVGQAFEKED 242 Query: 956 YNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFISAGP--GKGESHIA 1129 + ++G YEK ++NL S+ +Y +GQEN + Y K NE+ IS P K E I Sbjct: 243 GSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESLISMAPTFDKEEDTII 302 Query: 1130 FQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQNNADERDHSGRV 1279 G + D + ++ K SSIL +KGE TISFGGF + + E + SG + Sbjct: 303 PMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFGGFHDES-ETNPSGSI 361 Query: 1280 ISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPEGAPKSKDQKTKK 1438 IS Y++L+ NQ+SAQ+S L QK+ + + N P +S+ + PK K+ KT K Sbjct: 362 ISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTDANPKHKEPKTAK 418 >ref|XP_007009436.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508726349|gb|EOY18246.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 563 Score = 247 bits (631), Expect = 8e-63 Identities = 159/417 (38%), Positives = 236/417 (56%), Gaps = 25/417 (5%) Frame = +2 Query: 263 
YETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG--TSGPAVMESSLWHDGPHFQLE 436 Y+ ++R + KR + WFMD++ EL +K+QA++ SG A + S WH+ FQ Sbjct: 3 YDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIADVNVSPWHNASSFQSV 62 Query: 437 SH--APTLFNPKAVRSSNVSDNNNPRV-SATMNMERKDLGHQFGNDQSICLTMSHEVND- 604 S + LF + +R+ N+ D N V S MNM RKD Q+ N S L+MSH + D Sbjct: 63 SSQLSDRLFGSEPLRTVNLVDRNMSSVDSGNMNMGRKDFDDQYVNSSSAGLSMSHTIEDP 122 Query: 605 SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTFQRTSNNMFS-GPTF 775 S C + G RKVKVN+VR S N +P +G T+ G + V M+T + ++ NN S GPT+ Sbjct: 123 SSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTVSMSTVYSKSDNNAISLGPTY 182 Query: 776 NSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGIDNNVLSIGQAFNRGN 955 S N IS+ P ++K D NF S+GH+ +KRDG+F+ YN + ++LS+GQAF + + Sbjct: 183 GSGDENTISIGPTFTKADGNFISMGHTFNKRDGDFISVGHNYNKGNESILSVGQAFEKED 242 Query: 956 YNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFISAGP--GKGESHIA 1129 + ++G YEK ++NL S+ +Y +GQEN + Y K NE+ IS P K E I Sbjct: 243 GSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESLISMAPTFDKEEDTII 302 Query: 1130 FQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQNNADERDHSGRV 1279 G + D + ++ K SSIL +KGE TISFGGF + + E + SG + Sbjct: 303 PMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFGGFHDES-ETNPSGSI 361 Query: 1280 ISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPEGAPKSKDQKTKK 1438 IS Y++L+ NQ+SAQ+S L QK+ + + N P +S+ + PK K+ KT K Sbjct: 362 ISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTDANPKHKEPKTAK 418 >ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max] Length = 582 Score = 244 bits (624), Expect = 5e-62 Identities = 162/440 (36%), Positives = 241/440 (54%), Gaps = 25/440 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MS+Q+ W+P +G +A YE ++RI+ KR + WFMD+ E E+ +K+QAV+ G Sbjct: 1 MSYQHKSFWMPRDAGCMAEENAGYENSSRIEPKRSHQWFMDTGEPEIFSNKKQAVEAVSG 60 Query: 374 T--SGPAVMESSLW--HDGPHFQLESHAPTLFNPKAVRSSNVSDNNNPR-VSATMNMERK 538 SG + S W + G H + LF R+ N+ D N P VS +NM RK Sbjct: 61 
RPISGVSHANVSQWDTNSGFHSVTSQFSDRLFGSDLARTVNLVDKNVPSIVSGNLNMGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 D HQ+GND S+ L++SH + D S CLN G RKVKVN+VR S+NC+P S E Sbjct: 121 DFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMPAASMGPSYSRED 180 Query: 713 NDVMTTTFQRTSN---NMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFM 883 N ++ N N+ GPT+N+ N I++ SK D N S+ H+ SK DG FM Sbjct: 181 NSTISVGAGYNKNDGDNISLGPTYNNGYDNTIAMGSRISKTDDNLLSMAHTFSKGDGGFM 240 Query: 884 LPNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLES 1063 L Y D +++S+GQ F++G+ N ++G YEKE+ NL S+ +Y + E+ + Sbjct: 241 LMGHNYGKGDESIVSMGQPFDKGDGNFISMGQSYEKEDGNLISLGTSYTKVHESFIPVGP 300 Query: 1064 FYSKVNETFISAGP-GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210 Y K E FI+ P KG +HI G + DS +AS +++ +SS L KG Sbjct: 301 TYGKSGENFITVAPYDKGTNHIISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKG 360 Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAI 1378 + ++ISFGGF ++ E + G +IS Y++L+ Q+SAQ G Q D T+ + N+I Sbjct: 361 QSSSISFGGFHDD-PEPNTPGGIISGYDLLIGGQNSAQ--GLDSQNDLTETNTESLVNSI 417 Query: 1379 PASSSKPEGAPKSKDQKTKK 1438 P ++K + K+K+ KT K Sbjct: 418 PKPNTKNDIVVKNKEPKTTK 437 >ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus] Length = 582 Score = 244 bits (623), Expect = 7e-62 Identities = 158/438 (36%), Positives = 237/438 (54%), Gaps = 23/438 (5%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MSFQ+ W+P +G + +GE+ Y++++RI+ KR + WFMD S EL SK+QA++ Sbjct: 1 MSFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNS 60 Query: 374 TSGPAV--MESSLWHDGPHFQLESH-APTLFNPKAVRSSNVSDNNNPRVSATMNMERKDL 544 P V M S W + + H LF + +R+ N+ D +A M+M RK+ Sbjct: 61 RPVPGVPHMNVSPWENSSFQSVPGHFTDRLFGSEPIRTVNLVDRGISVGNANMDMGRKEF 120 Query: 545 GHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKND 718 + F N+ S+ L+MS + D S CLN G RKVKVN+VR + +P +G + G+ Sbjct: 121 ENHFTNNPSVGLSMSQSIEDPSSCLNFGGIRKVKVNQVRDPDVGMPASLGHAYTRGDNCT 180 Query: 719 
V-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPN 892 + M T F + N S G T+NS N ISV PAY K D NF S+GH+ SK DG+F+ Sbjct: 181 ISMGTGFNKNHENTISLGQTYNSRDENAISVGPAYHKTDDNFISMGHAFSKGDGSFITIG 240 Query: 893 QYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYS 1072 Y+ DN++LS+ Q F++G+ + ++G YEK N+ S +YN+GQEN + YS Sbjct: 241 HNYSKGDNSILSMNQPFDKGDDSFISMGQSYEKAEGNIISFA-SYNKGQENFISMGPAYS 299 Query: 1073 KVNETFIS------AGPGKGESHIAFQGQQDSTVASLGALFNKENSSIL------RKGEE 1216 K +TFIS G S + +S + +G F+K +S + KGE Sbjct: 300 KAGDTFISMASSFNKGNDDNLSMAPTYDKVNSDIVHVGPKFDKADSGAVSMAHNYHKGES 359 Query: 1217 TTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPA 1384 TISFGGF + + SG +ISSY++L+ NQ+SAQ+S +DS D +++ N Sbjct: 360 NTISFGGFDDENGTDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNINGAIK 419 Query: 1385 SSSKPEGAPKSKDQKTKK 1438 K + KSK+ + K Sbjct: 420 VDGKIDTNSKSKEPRMSK 437 >ref|XP_007218931.1| hypothetical protein PRUPE_ppa003346mg [Prunus persica] gi|462415393|gb|EMJ20130.1| hypothetical protein PRUPE_ppa003346mg [Prunus persica] Length = 583 Score = 241 bits (614), Expect = 8e-61 Identities = 159/442 (35%), Positives = 244/442 (55%), Gaps = 27/442 (6%) Frame = +2 Query: 194 MSFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKG 373 MSFQ W+P + + +GE+ Y+ ++RI+ KR N WFMDS+ E +K+QA++ G Sbjct: 1 MSFQPKSFWIPRDASCLTDGEMGYDNSSRIESKRGNRWFMDSNGLEFFNNKKQAMEAVNG 60 Query: 374 --TSGPAVMESSLWHDGPHFQLESHAPT--LFNPKAVRSSNVSDNNNPRV-SATMNMERK 538 SG + S W + FQ T LF + VR+ N+ D N V S MN+ RK Sbjct: 61 RPVSGVPHLAISPWDNTSGFQSVPGQFTDRLFGSEPVRTVNLGDRNIQSVGSENMNLGRK 120 Query: 539 DLGHQFGNDQSICLTMSHEVND-SLCLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEK 712 Q+GND S+ L+MSH + D S CLN G RKVKVNEVR S++ + +G ++ G+ Sbjct: 121 GFEDQYGNDPSVGLSMSHTIEDPSSCLNFGGIRKVKVNEVRDSDDVVSASMGHSYCKGDS 180 Query: 713 NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886 N + M T+ ++ +N S G +N+ N IS+ P+++K D NF S+GH+ SK + NF+ Sbjct: 181 NTMSMANTYNKSDDNAISLGSAYNTGEENAISIGPSFNKADDNFISMGHTFSKANSNFIS 240 Query: 887 
PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066 YN DN++LS+GQ F++ + N ++G YEK +S+ S+ +Y++G EN + + Sbjct: 241 MAHNYNKGDNSILSMGQPFDKEDGNFISMGQSYEKGDSSFISLGNSYHKGHENFISMGAT 300 Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSI-----LRKGE 1213 Y K NE FIS P K ++ G + DS V +G ++K S++ K E Sbjct: 301 YGKANENFISMAPTYDKQTDNMMSMGPNYDKADSNVVPIGPPYHKGESNVSMSHNYNKNE 360 Query: 1214 ETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLSANAIP--- 1381 TTISFG F + D + SG +ISSY++L+ NQ++A+ S + D + +N P Sbjct: 361 STTISFGSFHHETD-TNPSGGIISSYDLLMNNQNTAEQS---EESGLKDPIQSNMDPNVD 416 Query: 1382 ---ASSSKPEGAPKSKDQKTKK 1438 SK + K K+ KT + Sbjct: 417 DALKLDSKTDTVSKIKEPKTAR 438 >ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus] Length = 561 Score = 223 bits (568), Expect = 2e-55 Identities = 149/415 (35%), Positives = 222/415 (53%), Gaps = 23/415 (5%) Frame = +2 Query: 263 YETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGTSGPAV--MESSLWHDGPHFQLE 436 Y++++RI+ KR + WFMD S EL SK+QA++ P V M S W + + Sbjct: 3 YDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNSRPVPGVPHMNVSPWENSSFQSVP 62 Query: 437 SH-APTLFNPKAVRSSNVSDNNNPRVSATMNMERKDLGHQFGNDQSICLTMSHEVND-SL 610 H LF + +R+ N+ D +A M+M RK+ + F N+ S+ L+MS + D S Sbjct: 63 GHFTDRLFGSEPIRTVNLVDRGISVGNANMDMGRKEFENHFTNNPSVGLSMSQSIEDPSS 122 Query: 611 CLN-SGPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTTFQRTSNNMFS-GPTFNS 781 CLN G RKVKVN+VR + +P +G + G+ + M T F + N S G T+NS Sbjct: 123 CLNFGGIRKVKVNQVRDPDVGMPASLGHAYTRGDNCTISMGTGFNKNHENTISLGQTYNS 182 Query: 782 VVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGIDNNVLSIGQAFNRGNYN 961 N ISV PAY K D NF S+GH+ SK DG+F+ Y+ DN++LS+ Q F++G+ + Sbjct: 183 RDENAISVGPAYHKTDDNFISMGHAFSKGDGSFITIGHNYSKGDNSILSMNQPFDKGDDS 242 Query: 962 VDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETFIS------AGPGKGESH 1123 ++G YEK N+ S +YN+GQEN + YSK +TFIS G S Sbjct: 243 FISMGQSYEKAEGNIISFA-SYNKGQENFISMGPAYSKAGDTFISMASSFNKGNDDNLSM 301 Query: 1124 IAFQGQQDSTVASLGALFNKENSSIL------RKGEETTISFGGFQNNADERDHSGRVIS 1285 + +S + +G F+K +S + KGE TISFGGF 
+ + SG +IS
Sbjct: 302  APTYDKVNSDIVHVGPKFDKADSGAVSMAHNYHKGESNTISFGGFDDENGTDNPSGGIIS 361

Query: 1286 SYEVLL-NQSSAQSSGALVQKDSTD---QLSANAIPASSSKPEGAPKSKDQKTKK 1438
SY++L+ NQ+SAQ+S +DS D +++ N K + KSK+ + K
Sbjct: 362  SYDLLMANQASAQASEVSTLRDSVDPNVEVNINGAIKVDGKIDTNSKSKEPRMSK 416

>ref|XP_002316304.2| hypothetical protein POPTR_0010s21640g, partial [Populus trichocarpa]
gi|550330316|gb|EEF02475.2| hypothetical protein POPTR_0010s21640g, partial [Populus trichocarpa]
Length = 644

Score = 210 bits (535), Expect = 1e-51
Identities = 143/407 (35%), Positives = 224/407 (55%), Gaps = 23/407 (5%)
Frame = +2

Query: 197  SFQNNIIWLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGT 376
SFQ W+ G + +G+I ++ ++R++ KR + W MDS+ EL +K+QAV+P
Sbjct: 1    SFQQKSFWMTRDVGCLTDGDIGFDNSSRMEPKRGHQWLMDSTGPELFSNKKQAVEPSSNN 60

Query: 377  S---GPAVMESSLWHDGPHFQLESHA--PTLFNPKAVRSSNVSDNNNPRVS-ATMNMERK 538
G + M S W++ FQ S LF + +R + S +N P S MNMERK
Sbjct: 61   RPVMGMSHMNISPWNNTSCFQSVSGQFNDRLFGFEPLRIN--SGSNVPSASNGNMNMERK 118

Query: 539  DLGHQFGNDQSICLTMSHEVNDSLCLNS--GPRKVKVNEVRISENCLPEYVGSTFGSGEK 712
D +G++ S+ L+MSH V D S G RKV+VN+VR S N + VG ++ G+
Sbjct: 119  DFNDLYGSNCSMGLSMSHNVEDPPASISFGGLRKVRVNQVRDSSNDISSSVGHSYSRGDD 178

Query: 713  NDV-MTTTFQRTSNNMFS-GPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFML 886
N + M T + + +N S G T+N+ N IS+ P +SK D +F S+GH+ +K D NF+
Sbjct: 179  NIISMGTAYNKRESNAISLGSTYNNGDENTISISPTFSKADGSFISMGHAFNKDDDNFIS 238

Query: 887  PNQYYNGIDNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESF 1066
Q YN D ++LS+GQ F++ + N +G Y+KE+++ S+ +YN+G E+ + +
Sbjct: 239  MGQGYNKGDESILSMGQPFDKKDANFITMGPSYDKEDNHFISMALSYNKGHESFISMGPS 298

Query: 1067 YSKVNETFISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKG 1210
Y K +E FI G KG ++ G + D +AS+ +K NS IL KG
Sbjct: 299  YDKTSENFILMGSSFSKGGDNVISNGPIYDKADIDIASMTPAQDKGNSGILSIGHNYNKG 358

Query: 1211 EETTISFGGFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKD 1348
+ +ISF F ++ E + SG VI Y++L+ NQ++AQ+S VQ +
Sbjct: 359  DNNSISFQSF-HDEPETNMSGNVIRGYDLLVSNQNTAQTSEVPVQNN 404

>ref|XP_002311151.2| hypothetical
protein POPTR_0008s05120g [Populus trichocarpa]
gi|550332456|gb|EEE88518.2| hypothetical protein POPTR_0008s05120g [Populus trichocarpa]
Length = 616

Score = 209 bits (532), Expect = 2e-51
Identities = 146/425 (34%), Positives = 231/425 (54%), Gaps = 26/425 (6%)
Frame = +2

Query: 218  WLPNGSGSVANGEICYETATRIDQKRPNPWFMDSSEQELIVSKRQAVDPFKGT---SGPA 388
W+ +G + +G++ ++ ++R++ K + WFMDS EL +K+QAV+ +G +
Sbjct: 6    WITRDAGCLNDGDVGFDNSSRMEAKHSHQWFMDSPGPELFSNKKQAVEHSSNNRPVAGMS 65

Query: 389  VMESSLWHDGPHFQLES--HAPTLFNPKAVRSSNVSDNNNPRVSATMNMERKDLGHQFGN 562
M S W++ FQ S + LF + +R +N S N + MNM RKD +G+
Sbjct: 66   HMNISPWNNTSSFQSVSGHFSDRLFGSEPLRPNNGS-NFLSSGNGNMNMGRKDF--IYGS 122

Query: 563  DQSICLTMSHEVNDSLCLNS--GPRKVKVNEVRISENCLPEYVGSTFGSGEKNDV-MTTT 733
+ S+ L+M+H + D S G RKVKVN+VR S + VG ++ G+ N + M
Sbjct: 123  NCSMGLSMTHNIEDPSASISFGGIRKVKVNQVRDSN--ISSSVGHSYARGDDNIISMGPA 180

Query: 734  F-QRTSNNMFSGPTFNSVVGNGISVDPAYSKMDKNFASVGHSSSKRDGNFMLPNQYYNGI 910
+ +R SN + G T+N+ N IS+ P +SK D NF S+ H+ SK DGNF+ YN
Sbjct: 181  YNKRESNTISLGSTYNNGDENTISISPTFSKADGNFISIRHAFSKDDGNFISMGHNYNKG 240

Query: 911  DNNVLSIGQAFNRGNYNVDALGDQYEKENSNLSSVCPTYNRGQENLFGLESFYSKVNETF 1090
D ++LS+GQ F++ + N +G Y+KEN++ S+ P+YN+G +N + Y K +E F
Sbjct: 241  DESMLSMGQPFDKEDANFITIGPSYDKENNHFISMAPSYNKGHDNFISMGPSYDKTSENF 300

Query: 1091 ISAGP--GKGESHIAFQG----QQDSTVASLGALFNKENSSIL------RKGEETTISFG 1234
I G KG +I G + DS + S+ +K NS IL KG+ ISFG
Sbjct: 301  ILMGSSFSKGGDNIISNGPAYDKADSDITSMAPAQDKGNSGILSMGHNYNKGDNNAISFG 360

Query: 1235 GFQNNADERDHSGRVISSYEVLL-NQSSAQSSGALVQKDSTDQLSANAIPASSSKP---- 1399
GF ++ E + SG +I+ YE+L+ NQ +AQ+S + LS N +P +++ P
Sbjct: 361  GF-HDEPETNSSGNIITGYELLVSNQDTAQTS---------EVLSQNVLPQANADPQLNT 410

Query: 1400 EGAPK 1414
+ APK
Sbjct: 411  DSAPK 415