BLASTX nr result
ID: Mentha26_contig00033333
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00033333
         (816 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU23579.1| hypothetical protein MIMGU_mgv1a000155mg [Mimulus...   322   1e-85
emb|CBI15156.3| unnamed protein product [Vitis vinifera]              245   2e-62
ref|XP_007210488.1| hypothetical protein PRUPE_ppa000133mg [Prun...   236   1e-59
ref|XP_007037486.1| Uncharacterized protein isoform 8 [Theobroma...   229   9e-58
ref|XP_007037485.1| Uncharacterized protein isoform 7 [Theobroma...   229   9e-58
ref|XP_007037484.1| Uncharacterized protein isoform 6, partial [...   229   9e-58
ref|XP_007037483.1| Uncharacterized protein isoform 5 [Theobroma...   229   9e-58
ref|XP_007037482.1| Uncharacterized protein isoform 4 [Theobroma...   229   9e-58
ref|XP_007037481.1| Uncharacterized protein isoform 3 [Theobroma...   229   9e-58
ref|XP_007037480.1| Uncharacterized protein isoform 2 [Theobroma...   229   9e-58
ref|XP_007037479.1| Uncharacterized protein isoform 1 [Theobroma...   229   9e-58
ref|XP_002514697.1| hypothetical protein RCOM_1470550 [Ricinus c...   224   2e-56
ref|XP_006345163.1| PREDICTED: uncharacterized protein LOC102602...   223   5e-56
gb|EXC11028.1| hypothetical protein L484_015248 [Morus notabilis]     223   7e-56
ref|XP_006477617.1| PREDICTED: uncharacterized protein LOC102610...   220   6e-55
ref|XP_006440689.1| hypothetical protein CICLE_v10018469mg [Citr...   219   7e-55
ref|XP_002317968.2| hypothetical protein POPTR_0012s06850g [Popu...   219   1e-54
ref|XP_004236487.1| PREDICTED: uncharacterized protein LOC101264...   217   4e-54
ref|XP_004301126.1| PREDICTED: uncharacterized protein LOC101303...   216   6e-54
tpg|DAA41320.1| TPA: hypothetical protein ZEAMMB73_745179 [Zea m...   207   4e-51

>gb|EYU23579.1| hypothetical protein MIMGU_mgv1a000155mg [Mimulus guttatus]
          Length = 1553

 Score = 322 bits (825), Expect = 1e-85
 Identities = 179/271 (66%), Positives = 201/271 (74%), Gaps = 1/271 (0%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SEIVDGFLWT+AAI+ HVSCN++Q+QMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS
Sbjct: 1053 VSEIVDGFLWTVAAIIGHVSCNDFQIQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 1112

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLESSMDT 365
            PFPSSILLGINLLTVLTSKFR S IDWDSFPND+ QG KIG + + ESS+D
Sbjct: 1113 PFPSSILLGINLLTVLTSKFRESSSIDWDSFPNDVMQGYKIGPSTSADSRFTSSESSLDG 1172

Query: 366  MPSLPTGDLLTELQESTEDGCPNIPITNSQVPTSDSSLDAEHIASNATVQDVMDESPTAL 545
            P LP +G P + Q T S+ EH ASN + DVMDES TA
Sbjct: 1173 RPLLP----------DLPEGSPLEDFLSIQGTTDAHSV--EHTASNNQIVDVMDESLTAP 1220

Query: 546  IEDKHQCSVAE-DSNKCVSNDSELRNRNGAITKQPAAFLLAAMFETGLVCLPSMLTAVLL 722
            ED H SV + D N +S+++E N + +KQPA FLL+AM ETGLVCLPSMLTAVLL
Sbjct: 1221 NEDAHHSSVTQKDRNNSLSSNAESNRGNVSDSKQPAKFLLSAMSETGLVCLPSMLTAVLL 1280

Query: 723  QGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            Q NNRLSAEQSSYVLPSNFEEVATGVLKVLN
Sbjct: 1281 QANNRLSAEQSSYVLPSNFEEVATGVLKVLN 1311

>emb|CBI15156.3| unnamed protein product [Vitis vinifera]
          Length = 1617

 Score = 245 bits (625), Expect = 2e-62
 Identities = 144/271 (53%), Positives = 176/271 (64%), Gaps = 1/271 (0%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            ISE++DGFLWT+ I+ H+S +E QLQMQDGL+ELVIAYQ+IHRLRDLFALYDRPQVEG+
Sbjct: 1169 ISEVLDGFLWTVTTIIGHISSDERQLQMQDGLLELVIAYQVIHRLRDLFALYDRPQVEGA 1228

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLESSMDT 365
            PFPSSILL INLLTVLTS+ R LIDW SFP + G +I E KL+
Sbjct: 1229 PFPSSILLSINLLTVLTSRPRTISLIDWKSFPVETITGNEIQEAKLT------------- 1275

Query: 366  MPSLPTGDLLTELQESTEDGCPNIPITNSQVPTSDSSLDAEHIASNATVQDVMDESPTAL 545
Sbjct: 1276 --------------ESADFG-------HSYKRLADISIELNNVDSNMT--DASDSSQTNL 1312

Query: 546  IEDKHQCSVAEDSNKCVSN-DSELRNRNGAITKQPAAFLLAAMFETGLVCLPSMLTAVLL 722
            ED + + + + N +E + N + KQP AFLL+A+ +TGLV LPS+LTAVLL
Sbjct: 1313 SEDISKSCIPQKGEQNSKNICAEQKTENISSLKQPMAFLLSAISDTGLVSLPSLLTAVLL 1372

Query: 723  QGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            Q NNRLS+EQ SYVLPSNFEEVATGVLKVLN
Sbjct: 1373 QANNRLSSEQGSYVLPSNFEEVATGVLKVLN 1403

>ref|XP_007210488.1| hypothetical protein PRUPE_ppa000133mg [Prunus persica]
 gi|462406223|gb|EMJ11687.1| hypothetical protein PRUPE_ppa000133mg [Prunus persica]
          Length = 1687

 Score = 236 bits (601), Expect = 1e-59
 Identities = 143/287 (49%), Positives = 184/287 (64%), Gaps = 17/287 (5%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            ISE++DG+LWT+ IV H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1164 ISEVLDGYLWTVTTIVSHISSDEQQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1223

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKL---SVTEDIRLESS 356
            PFPSSILL INLL VLTS+ + IDW P + G E K TED+ L S
Sbjct: 1224 PFPSSILLSINLLVVLTSRSEMNCSIDWKYVPIETVVGNGSEEAKFPGGDSTEDLPLTQS 1283

Query: 357  M-DTMP--SLPTGDLLTELQESTEDG-CPNIPITNSQVPTSDSSLDAEHIASNATVQDVM 524
            + D+ P S+ G + L + EDG I N + D+E SN+ V+
Sbjct: 1284 LGDSRPPLSVQNGGTVVHLPDVPEDGPLDESCIINKSTEAVSTGKDSEKEQSNSLVEARN 1343

Query: 525  DESPTALIEDKHQCSVAEDSNKCVSNDSELRN--RNGAITK--------QPAAFLLAAMF 674
            D + + D+ Q +ED+ + ++ + ++ NGA+ K QP AFLL A+
Sbjct: 1344 DNTIKTDLPDETQKFPSEDTLEPFASQKDGKHLVDNGAVQKNEIIVSLEQPVAFLLTAVS 1403

Query: 675  ETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS+EQ+S VLPSNFE+VATGVLKVLN
Sbjct: 1404 ETGLVSLPSLLTSVLLQANNRLSSEQTSDVLPSNFEDVATGVLKVLN 1450

>ref|XP_007037486.1| Uncharacterized protein isoform 8 [Theobroma cacao]
 gi|508774731|gb|EOY21987.1| Uncharacterized protein isoform 8 [Theobroma cacao]
          Length = 1481

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_007037485.1| Uncharacterized protein isoform 7 [Theobroma cacao]
 gi|508774730|gb|EOY21986.1| Uncharacterized protein isoform 7 [Theobroma cacao]
          Length = 1529

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_007037484.1| Uncharacterized protein isoform 6, partial [Theobroma cacao]
 gi|508774729|gb|EOY21985.1| Uncharacterized protein isoform 6, partial [Theobroma cacao]
          Length = 1525

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_007037483.1| Uncharacterized protein isoform 5 [Theobroma cacao]
 gi|508774728|gb|EOY21984.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 1571

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_007037482.1| Uncharacterized protein isoform 4 [Theobroma cacao]
 gi|508774727|gb|EOY21983.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 1540

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_007037481.1| Uncharacterized protein isoform 3 [Theobroma cacao]
 gi|508774726|gb|EOY21982.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 1707

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_007037480.1| Uncharacterized protein isoform 2 [Theobroma cacao]
 gi|508774725|gb|EOY21981.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 1550

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_007037479.1| Uncharacterized protein isoform 1 [Theobroma cacao]
 gi|508774724|gb|EOY21980.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 1684

 Score = 229 bits (584), Expect = 9e-58
 Identities = 141/288 (48%), Positives = 185/288 (64%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT++AI+ H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1172 VSEVLDGFLWTVSAIIGHISSDERQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1231

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRL----ES 353
            PFPSSILL I+LL VLTS G+ I+W+S P ++ G + E K++ T D +
Sbjct: 1232 PFPSSILLSIHLLVVLTSS-PGNSSINWESLPIEMELGNESQETKIAATPDCGCSFVNSN 1290

Query: 354  SMDTMPSLPT--GDLLTELQESTED-----GC-----PNIPITNSQV--PTSDSSLDAEH 491
            + D P L + G ++ L + ED C N+ + V T+D S+ +
Sbjct: 1291 TGDDRPPLSSLNGSVVAPLSDVPEDRPLDESCRINKNDNLVLIGKDVERKTTDGSVQLNN 1350

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + S A + D D SP L+E K + V + E N N + KQP AFLL+ +
Sbjct: 1351 V-STARI-DGTDVSPKNLVEQKEEKLV-------IIPSEEKLNENISSLKQPLAFLLSTI 1401

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LT+VLLQ NNRLS++Q S LPSNFEEVATGVLKVLN
Sbjct: 1402 SETGLVSLPSLLTSVLLQANNRLSSDQVSNALPSNFEEVATGVLKVLN 1449

>ref|XP_002514697.1| hypothetical protein RCOM_1470550 [Ricinus communis]
 gi|223546301|gb|EEF47803.1| hypothetical protein RCOM_1470550 [Ricinus communis]
          Length = 1809

 Score = 224 bits (572), Expect = 2e-56
 Identities = 141/288 (48%), Positives = 175/288 (60%), Gaps = 18/288 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            ISE++D FLW + +V H S E +LQM+DGL+EL+ AYQ++HRLRDLFALYDRPQVEGS
Sbjct: 1275 ISEVLDNFLWIVGTVVGHTSSEERELQMRDGLLELLTAYQVVHRLRDLFALYDRPQVEGS 1334

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLES---- 353
            PFPSSILL I LL VLT + + + IDW+S P + + E KL+ + S
Sbjct: 1335 PFPSSILLSIRLLVVLTYRPKTTSSIDWESSPMETIVEFENQESKLAEISEFGYPSANMT 1394

Query: 354  SMDTMP--SLPTGDLLTELQESTED-----GCPNIPITNSQVPTSD-------SSLDAEH 491
            S D P S+ G L ++ ED C I S D SS + H
Sbjct: 1395 SGDCRPPLSVLNGSTLVSPPDALEDRPLHESCTINKIDESLTALKDGEKKPTYSSEELNH 1454

Query: 492  IASNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAM 671
            + N + +V+DES LIE K D V+ +E +N N TKQP AF L+A+
Sbjct: 1455 ASIN--LGNVLDESQKILIEGK-------DEKHMVNVVAEKKNDNILSTKQPVAFFLSAI 1505

Query: 672  FETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LTAVLLQ NNRLS+EQ SYVLPSNFEEVATGVL+VLN
Sbjct: 1506 AETGLVSLPSLLTAVLLQANNRLSSEQGSYVLPSNFEEVATGVLRVLN 1553

>ref|XP_006345163.1| PREDICTED: uncharacterized protein LOC102602693 [Solanum tuberosum]
          Length = 1631

 Score = 223 bits (569), Expect = 5e-56
 Identities = 134/281 (47%), Positives = 172/281 (61%), Gaps = 11/281 (3%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            ++E++DGFLWT AAI+ H S +E LQ+QDGLIELVIAYQ+IHRLRDLFALYDRP VEGS
Sbjct: 1096 MAEVLDGFLWTAAAIIGHTSTDERSLQLQDGLIELVIAYQVIHRLRDLFALYDRPPVEGS 1155

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLESSM-- 359
            PFPSSILLG+NLL VLT +FR + + + P T + + +L+ D++ S +
Sbjct: 1156 PFPSSILLGVNLLAVLTFRFRNTSSLTCKNIPGASTHRNEKNDIELAEAADLKSSSPLCN 1215

Query: 360  ---DTMPSLP--TGDLLTELQESTE----DGCPNIPITNSQVPTSDSSLDAEHIASNATV 512
            D P G + L + E D P I V + SS + +A++
Sbjct: 1216 SQNDGKLVFPGVNGGVALGLSDVPEDRPLDEFPTIKEHQGTVVNALSSDKVDSVAASIET 1275

Query: 513  QDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAMFETGLVC 692
            DV+ ES + + + Q D K N N ++ K FLL+A+ ETGLVC
Sbjct: 1276 ADVLQESTSNVTYNNLQ----TDEKKSRDNSEGHIGGNESVMKPAVKFLLSAVSETGLVC 1331

Query: 693  LPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            LPSMLTAVLLQ NNR S +Q+SYVLPSNFE+VATGVLKVLN
Sbjct: 1332 LPSMLTAVLLQANNRCSEQQASYVLPSNFEDVATGVLKVLN 1372

>gb|EXC11028.1| hypothetical protein L484_015248 [Morus notabilis]
          Length = 1663

 Score = 223 bits (568), Expect = 7e-56
 Identities = 138/287 (48%), Positives = 175/287 (60%), Gaps = 17/287 (5%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            ISEI++GFLW++ I+ HV+ E Q+QM+DGL+EL+ AYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1126 ISEILEGFLWSVTTIIGHVNSEEQQIQMRDGLLELLTAYQVIHRLRDLFALYDRPQVEGS 1185

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLESSMDT 365
            PFPSSILL I LL VLTS+ + LIDW+ + G + + SV ++ S D
Sbjct: 1186 PFPSSILLSIYLLVVLTSRPETNLLIDWEYLETLVRNGSQASKFAESVDTVYPIDHSTDL 1245

Query: 366  MPSLPT--GDLLTELQESTEDGCPNIPITNS-----QVPTSDSSLDAEHIASNATV---- 512
            P LPT G + +L + ED P+ S V + ++DA+ SN V
Sbjct: 1246 RPPLPTQNGSKVVQLPDVPED----TPLDESYKMDKNVVSESINMDADKEQSNCLVDPNK 1301

Query: 513  -----QDVMDESPTALIEDKHQCSVAEDSNK-CVSNDSELRNRNGAITKQPAAFLLAAMF 674
            D ES IED + + +K V+ E +N N QP AFLL+A+
Sbjct: 1302 ADVAKSDDPKESEKIPIEDILKSFPPQKDDKISVNVGVEEKNENALNLDQPVAFLLSAIS 1361

Query: 675  ETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV + S+LTAVLLQ NNRLS+EQ Y LPSNFEEVATGVLKVLN
Sbjct: 1362 ETGLVSVLSVLTAVLLQANNRLSSEQGLYALPSNFEEVATGVLKVLN 1408

>ref|XP_006477617.1| PREDICTED: uncharacterized protein LOC102610780 [Citrus sinensis]
          Length = 1688

 Score = 220 bits (560), Expect = 6e-55
 Identities = 131/274 (47%), Positives = 172/274 (62%), Gaps = 4/274 (1%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            I+E++DGFLWT+A I H+S +E QLQM+DGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1170 ITEVLDGFLWTVATIFGHISSDEQQLQMRDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 1229

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLESSMDT 365
            PFPSSILL I+LL VLTS I+W+ P + E KL+V+ + S +T
Sbjct: 1230 PFPSSILLSISLLLVLTSSSGIVSSINWEPSPIETVAVNDSPEMKLAVSVETGYGSINNT 1289

Query: 366  MPSLPTGDLLTELQESTEDGCPNIPITNSQVPTSDSSL--DAEHIASNATVQ--DVMDES 533
            +GD++ L + E+ P+ S + D+E +N++V D E
Sbjct: 1290 -----SGDMIVPLADVPEES----PLDESCKVKDSGPIGNDSEKKMNNSSVGLIDTDREK 1340

Query: 534  PTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAMFETGLVCLPSMLTA 713
            + E + + +D + +N KQP AFLL+A+ ETGLV LPS+LT+
Sbjct: 1341 TDGIDESQRTVTQGKDEKHLADMVAVQKNEKMLNLKQPVAFLLSAISETGLVSLPSLLTS 1400

Query: 714  VLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            VLLQ NNRLS+EQ+ YVLPSNFEE ATGVLKVLN
Sbjct: 1401 VLLQANNRLSSEQALYVLPSNFEEAATGVLKVLN 1434

>ref|XP_006440689.1| hypothetical protein CICLE_v10018469mg [Citrus clementina]
 gi|557542951|gb|ESR53929.1| hypothetical protein CICLE_v10018469mg [Citrus clementina]
          Length = 1688

 Score = 219 bits (559), Expect = 7e-55
 Identities = 130/274 (47%), Positives = 173/274 (63%), Gaps = 4/274 (1%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            I+E++DGFLWT+A I H+S +E+QLQM+DGL+EL+I+YQ+IHRLRDLFALYDRPQVEGS
Sbjct: 1170 ITEVLDGFLWTVATIFGHISSDEWQLQMRDGLLELLISYQVIHRLRDLFALYDRPQVEGS 1229

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLESSMDT 365
            PFPSSILL I+LL VLTS I+W+ P + E KL+V+ + S +T
Sbjct: 1230 PFPSSILLSISLLLVLTSSSGIVSSINWEPSPIETVAVNDSPEMKLAVSVESGYGSINNT 1289

Query: 366  MPSLPTGDLLTELQESTEDGCPNIPITNSQVPTSDSSL--DAEHIASNATVQ--DVMDES 533
            +GD++ L + E+ P+ S + D+E +N++V D E
Sbjct: 1290 -----SGDMIVPLADVPEES----PLDESCKVKDSGPIGNDSEKKMNNSSVGLIDTDREK 1340

Query: 534  PTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAMFETGLVCLPSMLTA 713
            + E + + +D + +N KQP AFLL+A+ ETGLV LPS+LT+
Sbjct: 1341 TDGIDESQRTVTQGKDEKHLADMVAVQKNEKMLNLKQPVAFLLSAISETGLVSLPSLLTS 1400

Query: 714  VLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            VLLQ NNRLS+EQ+ YVLPSNFEE ATGVLKVLN
Sbjct: 1401 VLLQANNRLSSEQALYVLPSNFEEAATGVLKVLN 1434

>ref|XP_002317968.2| hypothetical protein POPTR_0012s06850g [Populus trichocarpa]
 gi|550326532|gb|EEE96188.2| hypothetical protein POPTR_0012s06850g [Populus trichocarpa]
          Length = 1427

 Score = 219 bits (557), Expect = 1e-54
 Identities = 135/286 (47%), Positives = 173/286 (60%), Gaps = 16/286 (5%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            ISE++D FLWT+ ++ H S +E Q+QMQDGL+EL+IAYQ+IHRLRDLFALYDRPQVEGS
Sbjct: 890  ISEVLDNFLWTVGTVIGHASSDEQQVQMQDGLLELLIAYQVIHRLRDLFALYDRPQVEGS 949

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLES---- 353
            PFPSSILL I+LL LT + + I+W+S P + E K D + S
Sbjct: 950  PFPSSILLSIHLLVALTYRPGTNSSINWESSPVKTVLRFENQEAKPVENADFQYSSAVVT 1009

Query: 354  SMDTMPSLPTGDLLTELQEST-------EDGCPNIPITNSQVPTSDSSLDAEHIA----- 497
            S D P+L + T + ++ C NI V S H +
Sbjct: 1010 SEDYRPTLFVLNCSTVVSPPNVSDDIHIDESC-NINEIKESVSLSKDGEQKPHSSVELNI 1068

Query: 498  SNATVQDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAMFE 677
            +N +D DE+ LIE+K D + VS+ +E +N K+P AFLL+A+ E
Sbjct: 1069 ANTNTRDGQDEAQKNLIEEK-------DEKQFVSDCAEHKNNVMLNMKEPVAFLLSAISE 1121

Query: 678  TGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            TGLV LPS+LTAVLLQ NNRL++EQ SY+LPSNFEEVATGVLKVLN
Sbjct: 1122 TGLVSLPSLLTAVLLQANNRLTSEQGSYILPSNFEEVATGVLKVLN 1167

>ref|XP_004236487.1| PREDICTED: uncharacterized protein LOC101264110 [Solanum lycopersicum]
          Length = 1631

 Score = 217 bits (553), Expect = 4e-54
 Identities = 133/281 (47%), Positives = 168/281 (59%), Gaps = 11/281 (3%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            +SE++DGFLWT AAI+ H S +E LQ+QDGLIELVIAYQ+IHRLRDLFALYDRP VEGS
Sbjct: 1096 MSEVLDGFLWTAAAIIGHASTDERSLQLQDGLIELVIAYQVIHRLRDLFALYDRPPVEGS 1155

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLSVTEDIRLESSM-- 359
            PFPSSILLG+NLL VLT +FR + ++FP T + + + D++ S +
Sbjct: 1156 PFPSSILLGVNLLAVLTFRFRNMSSLTCENFPGVSTHENEKNDIEFVEAADLKSSSFLCN 1215

Query: 360  -----DTMPSLPTGDLLTELQESTEDG----CPNIPITNSQVPTSDSSLDAEHIASNATV 512
            + S G + L + ED P I V SS + + +A +
Sbjct: 1216 YGTEGKLVFSGVNGGVALGLSDVPEDSPLDEFPKIKEHQGAVVNDLSSDNVDSVAVSLET 1275

Query: 513  QDVMDESPTALIEDKHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLAAMFETGLVC 692
            DV+ ES + + Q K N N ++ K FLL+A+ ETGLVC
Sbjct: 1276 ADVLQESASNGTYNNLQTV----EKKYQDNGKGHIGGNESMMKPAVKFLLSAVSETGLVC 1331

Query: 693  LPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            LPSMLTAVLLQ NNR S +Q+SYVLPSNFE+VATGVLKVLN
Sbjct: 1332 LPSMLTAVLLQANNRCSEQQASYVLPSNFEDVATGVLKVLN 1372

>ref|XP_004301126.1| PREDICTED: uncharacterized protein LOC101303041 [Fragaria vesca subsp. vesca]
          Length = 1675

 Score = 216 bits (551), Expect = 6e-54
 Identities = 131/290 (45%), Positives = 176/290 (60%), Gaps = 20/290 (6%)
 Frame = +3

Query: 6    ISEIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGS 185
            ISE++DG+LWT+ I+ H+S +E QLQM+D L+EL+I+YQ+I RLRDLFALYDRPQVEGS
Sbjct: 1149 ISEVLDGYLWTVTTILSHISSDERQLQMRDSLLELLISYQVIQRLRDLFALYDRPQVEGS 1208

Query: 186  PFPSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERKLS------------V 329
            PFPSSI+L I LL VLTS+ IDW P +I G E K++
Sbjct: 1209 PFPSSIILSIRLLVVLTSRSETDCSIDWKYEPVEILLGNGSEEAKVAECDNSEYLPPTLT 1268

Query: 330  TEDIRLESSMDT------MPSLPTGDLLTELQESTEDGCPNIPITNSQVPTSDSSLDAEH 491
            ED R SS+ +P +P + E+ + E ++ ++ + + + E
Sbjct: 1269 LEDFRPPSSLLNGGKFVHLPDVPKDGPVDEMCKINE----SVESVSAAKGSEERNSLVEA 1324

Query: 492  IASNATVQDVMDESPTALIED--KHQCSVAEDSNKCVSNDSELRNRNGAITKQPAAFLLA 665
            +N DV DE P ++ D + E+ V N +E +N N +QP AFLL+
Sbjct: 1325 NNANKVKTDVPDE-PQKMVNDDIMEPFASVEEEKHLVDNGAEHKNDNCVTLQQPVAFLLS 1383

Query: 666  AMFETGLVCLPSMLTAVLLQGNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            A+ ETGLV LPS+LT+VLLQ NNRLS+EQ+S LPSNFE+VATGVLKVLN
Sbjct: 1384 AVSETGLVSLPSLLTSVLLQANNRLSSEQASDALPSNFEDVATGVLKVLN 1433

>tpg|DAA41320.1| TPA: hypothetical protein ZEAMMB73_745179 [Zea mays]
          Length = 1422

 Score = 207 bits (527), Expect = 4e-51
 Identities = 135/289 (46%), Positives = 170/289 (58%), Gaps = 21/289 (7%)
 Frame = +3

Query: 12   EIVDGFLWTIAAIVDHVSCNEYQLQMQDGLIELVIAYQIIHRLRDLFALYDRPQVEGSPF 191
            E++DGFLWT+A IV HV N QLQMQ GLIEL++AYQIIHRLRDLFALYDRPQVEGSP
Sbjct: 1075 EVLDGFLWTVAMIVGHVHINGEQLQMQGGLIELIVAYQIIHRLRDLFALYDRPQVEGSPL 1134

Query: 192  PSSILLGINLLTVLTSKFRGSYLIDWDSFPNDITQGKKIGERK-LSVTEDIRLES-SMDT 365
            PSSIL G+NLL+VLTSK IDW+S G + E + LS + + +S ++D
Sbjct: 1135 PSSILFGLNLLSVLTSKPGNFSTIDWESCKCRTLGGNIVQEYEYLSSQDSLGCQSMTLDQ 1194

Query: 366  MPSLPTGDLLTELQESTEDGCP----NIPITNSQVPTSDSSLDAEHIASNATVQDVMDES 533
            + + +EL E ++ C +IP+ V + L N M S
Sbjct: 1195 FGDAKSPTIYSELAEDSK-SCKQHDLSIPVDRKLVDEASKDLLVMAAGLN---NSAMQPS 1250

Query: 534  PTALIEDKHQCSVAE-DSNKCVSNDSELRNRNGAIT-------------KQPAAFLLAAM 671
            + +KH + ++ D N V + E R N KQPA LL+A+
Sbjct: 1251 DLGITTEKHSGNPSQGDENNTVDSFLEGRKTNNVCALYSSSGKGNEMNLKQPAMLLLSAL 1310

Query: 672  FETGLVCLPSMLTAVLLQ-GNNRLSAEQSSYVLPSNFEEVATGVLKVLN 815
            ETGLV LPS+LTAVLLQ NNR S+EQ+ +LPSNFEEVATGVLKVLN
Sbjct: 1311 AETGLVTLPSLLTAVLLQANNNRSSSEQTLAILPSNFEEVATGVLKVLN 1359