BLASTX nr result
ID: Mentha29_contig00025674
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00025674
         (1826 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_007014883.1| Uncharacterized protein TCM_040459 [Theobrom...  199   3e-48
ref|XP_007052371.1| Uncharacterized protein TCM_005764 [Theobrom...  181   7e-43
ref|XP_002275053.2| PREDICTED: uncharacterized protein LOC100256...  174   2e-40
ref|XP_006847242.1| hypothetical protein AMTR_s04652p00001680, p...  172   3e-40
ref|XP_004229085.1| PREDICTED: uncharacterized protein LOC101249...  167   1e-38
ref|XP_007043203.1| Uncharacterized protein TCM_007662 [Theobrom...  167   1e-38
ref|XP_003633694.1| PREDICTED: uncharacterized protein LOC100241...  164   9e-38
ref|XP_004246853.1| PREDICTED: uncharacterized protein LOC101246...  162   6e-37
ref|XP_004235419.1| PREDICTED: uncharacterized protein LOC101264...  162   6e-37
ref|XP_006470974.1| PREDICTED: uncharacterized protein LOC102620...  160   2e-36
ref|XP_007032195.1| Uncharacterized protein TCM_017607 [Theobrom...  160   2e-36
ref|XP_006830015.1| hypothetical protein AMTR_s04836p00002630, p...  159   5e-36
ref|XP_004248932.1| PREDICTED: uncharacterized protein LOC101243...  156   3e-35
emb|CAN69231.1| hypothetical protein VITISV_008803 [Vitis vinifera]  156   3e-35
emb|CAN68581.1| hypothetical protein VITISV_011863 [Vitis vinifera]  156   3e-35
ref|XP_004229090.1| PREDICTED: uncharacterized protein LOC101254...  152   5e-34
ref|XP_006368044.1| PREDICTED: uncharacterized protein LOC102605...  152   6e-34
ref|XP_007036749.1| Uncharacterized protein TCM_012672 [Theobrom...
149 4e-33 ref|XP_006848744.1| hypothetical protein AMTR_s04155p00003130 [A... 147 1e-32 gb|EXB36258.1| hypothetical protein L484_013693 [Morus notabilis] 146 3e-32 >ref|XP_007014883.1| Uncharacterized protein TCM_040459 [Theobroma cacao] gi|508785246|gb|EOY32502.1| Uncharacterized protein TCM_040459 [Theobroma cacao] Length = 715 Score = 199 bits (507), Expect = 3e-48 Identities = 142/529 (26%), Positives = 241/529 (45%), Gaps = 14/529 (2%) Frame = +3 Query: 282 KIYVIHGGEWYDGNKYKGGTRELIPIPPDGISLKDLVEKIEMRLMKPRDGYVYEIDALIN 461 ++ + H G+W DG YKGG + + D +S + L++ +E + + E+ ALI+ Sbjct: 10 RLVIRHDGQWVDGI-YKGGESRMRKVKSD-LSYEGLMKLVEDVVGVNSEIDEIELHALIS 67 Query: 462 DGXXXXXXXXXXVVKTKITDDFELSFVMGSAAK-PVIYVTTTPINAELSRLLSHQNI--F 632 + + I DD + + ++ PV+YV I + ++SH+ + Sbjct: 68 T--------PGELSRPIIKDDEDAALILLEQRNVPVVYVN---IKGCQTNVMSHEEVGQH 116 Query: 633 SASKPVASKTTRRSHKDPVADKTIPILDKEPVASKTQTSXXXXXXXXXXXXXXXXXEFFH 812 P++++ T + + L+ + + E + Sbjct: 117 ECVMPLSNENTTLEDNNVRLEGDTATLEDKTAFDEGNEDLFVAGEDRFDDTSDDGLEQWQ 176 Query: 813 EDVAVED-LDGDDFGTHQEDHGSNRH----NVSPLQ-RTQDSVQGSSDVVRQWTIPGSSF 974 +D + +D L D + G ++ +Q D +G++ + R W I G Sbjct: 177 DDSSDDDCLYDSDIPIYNNVEGETESVRGVDIRDVQCDDSDQEKGNAGISRTWVIAG--- 233 Query: 975 HMSAPSEDLESAVVDVVDCTLDTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRER 1154 ++ ++ C D + K +F+SKA + ++ + ++ + K+S + R Sbjct: 234 ---VERFSFQTITIEESTCAEDRLYKGRMFSSKAELKQALNMLVIIEKFAIRVKRSCKAR 290 Query: 1155 YVLVCKHNKDVCPFILRAKSL---GKAWVVDVWC-THTCKKDLRYHADPTVFSKVLASYF 1322 Y + CK C F +RA L G+ W V + HTC D PT +K++ Sbjct: 291 YEVGCKDK--ACKFSVRAMKLPDRGEYWQVRTFHKVHTCTVDGLQRWFPTTSAKMIGELM 348 Query: 1323 VPNLMVEGVVLRPRDIQAQVQRLFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFH 1502 L GV LRP+DI +++ +G + Y A+ A + + ESFQ LPSYF+ Sbjct: 349 SHKLRANGVALRPKDIICEMRVQWGLECLYGKVWQAKEYAERLVFSPLEESFQLLPSYFY 408 Query: 1503 MLEQSNPGSALHLQTAEAGAFEYCFMSLAASIRGFK-ACRPVIVVDGTHLKGKYNGIMFV 1679 MLEQ PG+ + T E F+YCF S A IRGF P + +D THLKG++NG++FV Sbjct: 409 MLEQEIPGTVTVMATDEEERFKYCFWSYGACIRGFSDVMHPTVAIDATHLKGRFNGVLFV 468 
Query: 1680 AATKDANEQIFPLAFGFGAKECDESWIWFLEQCRQTFGCPENLLIVSDQ 1826 KDANE ++P+ FG E ++SW WFL + R GC EN + +S+Q Sbjct: 469 TVCKDANECVYPVGFGIDHVEDEDSWTWFLSKLRDVVGCHENTMFISNQ 517 >ref|XP_007052371.1| Uncharacterized protein TCM_005764 [Theobroma cacao] gi|508704632|gb|EOX96528.1| Uncharacterized protein TCM_005764 [Theobroma cacao] Length = 458 Score = 181 bits (460), Expect = 7e-43 Identities = 101/271 (37%), Positives = 144/271 (53%), Gaps = 5/271 (1%) Frame = +3 Query: 1029 CTLDTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRA 1208 C D + K +F+SKA + ++ + + + K+S + RY + CK C F +RA Sbjct: 132 CAEDRLYKGRMFSSKAELKRALNMLAIKEKFAIRVKRSCKARYEVGCKDK--ACKFSVRA 189 Query: 1209 KSL---GKAWVVDVWC-THTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQA 1376 L G+ V + HTC D PT +K++ + GV LRP+DI Sbjct: 190 MKLLDRGEYLKVQKFHKVHTCTVDGLQGWFPTKSAKMITELMSHKIRANGVALRPKDIIC 249 Query: 1377 QVQRLFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEA 1556 ++ +G + Y A + A + +G ESFQ LPSYF+MLEQ NPG + T E Sbjct: 250 DMRVQWGLECLYGKAWQVKEYAKRLVFGPPEESFQLLPSYFYMLEQENPGIVTAVATNEE 309 Query: 1557 GAFEYCFMSLAASIRGF-KACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFG 1733 F+YC S A IRGF RP I +D THLKG++ G++FVA KDANE ++P+AFG Sbjct: 310 KRFKYCLWSYGACIRGFMDVMRPTIAIDATHLKGRFKGVLFVAVCKDANECVYPIAFGID 369 Query: 1734 AKECDESWIWFLEQCRQTFGCPENLLIVSDQ 1826 E ++SW WFL + R GC EN + + DQ Sbjct: 370 HIEDEDSWTWFLSKLRDAVGCLENTMFIFDQ 400 >ref|XP_002275053.2| PREDICTED: uncharacterized protein LOC100256986 [Vitis vinifera] Length = 1111 Score = 174 bits (440), Expect = 2e-40 Identities = 114/355 (32%), Positives = 178/355 (50%), Gaps = 16/355 (4%) Frame = +3 Query: 810 HEDVAVEDLDGDDFGTHQEDHGSNRHNVSPLQRTQ------DSVQGSSDVVRQWTIPGSS 971 HE V D + ++ ++ED+ N + S Q TQ S Q + + T G Sbjct: 175 HEGYKVHDWNMNETAINEEDYRMNTNPTSDKQVTQIGSFRTGSAQSAEILTMIDTSDGFI 234 Query: 972 FHMSAPSEDLESAVVDVVDCTL------DTIAKDIIFNSKANMIASVGLYHLVHHLEFKT 1133 ED+ + +++ + D + + I++SK + + + L EFKT Sbjct: 235 HDNPTIIEDVANERQNMMQQPIVSGISDDHLEEHQIYSSKKELQRKLYMMALKRKFEFKT 294 Query: 
1134 KKSSRERYVLVCKHNKDVCPFILRAKSLGKA---WVVDVWCTHTCKKDLRYHADPTVFSK 1304 KS+ + ++ C + C + +RA LG + ++ + THTC+ D+ + S Sbjct: 295 TKSTTKLLLVECFDKE--CKWRVRATKLGISNMFQIMKFYSTHTCRLDMMSRDNRHASSW 352 Query: 1305 VLASYFVPNLMVEGVVLRPRDIQAQVQRLFGAKIKYATALSARNQALTMTYGDSSESFQR 1484 ++ G RP+DI A +++ +G I Y A A+ AL G ES+ Sbjct: 353 LIGESIRETYQGIGCEFRPKDIVADIRKQYGIPISYDKAWRAKELALGSIRGSPEESYNT 412 Query: 1485 LPSYFHMLEQSNPGSALHLQTAEAGAFEYCFMSLAASIRGFK-ACRPVIVVDGTHLKGKY 1661 LPSY ++LEQ NPG+ + T F+Y FMS+ AS+ GF + RPV+VVDGT LK KY Sbjct: 413 LPSYCYVLEQKNPGTITDIVTDCDNQFKYFFMSIGASLAGFHTSIRPVVVVDGTFLKAKY 472 Query: 1662 NGIMFVAATKDANEQIFPLAFGFGAKECDESWIWFLEQCRQTFGCPENLLIVSDQ 1826 G +F+AA KD N QI+PLAFG G E D SW WFL++ G ++L ++SD+ Sbjct: 473 LGTLFIAACKDGNNQIYPLAFGIGDSENDASWEWFLQKLHDALGHIDDLFVISDR 527 >ref|XP_006847242.1| hypothetical protein AMTR_s04652p00001680, partial [Amborella trichopoda] gi|548850309|gb|ERN08823.1| hypothetical protein AMTR_s04652p00001680, partial [Amborella trichopoda] Length = 607 Score = 172 bits (437), Expect = 3e-40 Identities = 87/261 (33%), Positives = 149/261 (57%), Gaps = 5/261 (1%) Frame = +3 Query: 1059 IFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLG--KAWV 1232 I+ K + + +G Y + ++ +F+ KKS Y + C K C + L A G K+++ Sbjct: 111 IYKDKETLQSVLGFYAIRNNFQFRVKKSCARTYKICCLDPK--CKWALTASRNGPTKSFI 168 Query: 1233 V---DVWCTHTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQRLFGAK 1403 + D HTC ++R+ +K++ +Y P P+DI+ +++ +G + Sbjct: 169 IRKYDRKVIHTCDLNIRFADKRQATTKLIGNYIKPRFTNIKTTQTPQDIRGEMKHKYGVR 228 Query: 1404 IKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYCFMS 1583 + Y A ++ A G ++ES++ LP + HML+++NPG+ +H++T + +F+Y F++ Sbjct: 229 MNYMKAWRSKEHAQEELRGKANESYRLLPGFLHMLQKTNPGTIVHMETEDDNSFKYLFVA 288 Query: 1584 LAASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDESWIW 1763 L ASI+G+K C+P+IVVDGT LK Y G + A T+DAN IFPLAF E + SW W Sbjct: 289 LDASIKGWKKCKPIIVVDGTFLKSTYGGTLLSACTQDANGHIFPLAFSVVDSENNNSWQW 348 Query: 1764 FLEQCRQTFGCPENLLIVSDQ 1826 F + R+T+G E ++SD+ Sbjct: 349 
FFTKVRETYGIREEQCLISDR 369 >ref|XP_004229085.1| PREDICTED: uncharacterized protein LOC101249572 [Solanum lycopersicum] Length = 813 Score = 167 bits (424), Expect = 1e-38 Identities = 88/262 (33%), Positives = 144/262 (54%), Gaps = 3/262 (1%) Frame = +3 Query: 1050 KDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGKAW 1229 +D ++ +K + + Y + + ++KTK SS Y LVC C +I++A ++ K+ Sbjct: 246 EDQVYKNKRTLKVVMMKYAIDNRFQWKTKISSFVSYTLVCVSEN--CGWIMKASNINKSG 303 Query: 1230 VVDVWC---THTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQRLFGA 1400 + + HTC + ++ S ++ P L+ L P+DIQ V+ G Sbjct: 304 MFRIRKFVDEHTCSLKDKVYSQRQATSSLVGGIITPKLVDHKRKLTPKDIQVDVRLELGV 363 Query: 1401 KIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYCFM 1580 + Y+ A +AR +A+ G S+S+ +LPSY ++L+ + PGS + L+ E F Y F+ Sbjct: 364 NVSYSVAWNAREKAINTLRGKPSDSYSKLPSYLYILDTTYPGSHIRLKKTEENEFLYVFI 423 Query: 1581 SLAASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDESWI 1760 +L I+GF++CRP++VVDG+HL+G YNG A+T D I PLA+G E D SW Sbjct: 424 ALFPFIKGFESCRPIVVVDGSHLRGTYNGTFVSASTLDGAGNILPLAYGLIDSENDASWT 483 Query: 1761 WFLEQCRQTFGCPENLLIVSDQ 1826 WF EQ R+ G + + +VSD+ Sbjct: 484 WFFEQFREAHGLKDKMCVVSDR 505 >ref|XP_007043203.1| Uncharacterized protein TCM_007662 [Theobroma cacao] gi|508707138|gb|EOX99034.1| Uncharacterized protein TCM_007662 [Theobroma cacao] Length = 502 Score = 167 bits (423), Expect = 1e-38 Identities = 100/341 (29%), Positives = 162/341 (47%), Gaps = 6/341 (1%) Frame = +3 Query: 813 EDVAVEDLDGDDFGTHQEDHGSNRHNVSPLQRTQDSVQGS----SDVVRQWTIPGSSFHM 980 ED D +D T ED + + Q DS SD+ + G + + Sbjct: 156 EDNTAFDEGNEDLFTAGEDRFDDTSDDGLEQSQDDSSDDDCLYDSDITICNNVEGKTEPV 215 Query: 981 SAPSE-DLESAVVDVVDCTLDTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERY 1157 + ++ ++ C D + K +F+SKA + ++ + + + K+S + RY Sbjct: 216 GGAEKFSFQTITIEESTCAEDRLYKGRMFSSKAELKRALHMLVIKEKFAVRVKRSCKARY 275 Query: 1158 VLVCKHNKDVCPFILRAKSLGKAWVVDVWCTHTCKKDLRYHADPTVFSKVLASYFVPNLM 1337 + K HTC D PT+ +K++ + Sbjct: 276 EIFHK-------------------------VHTCTVDGLQEWFPTMSTKMIGELISHKIQ 
310 Query: 1338 VEGVVLRPRDIQAQVQRLFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQS 1517 V LRP+D+ +++ +G + + A + A + +G +SFQ LPSYF++LEQ Sbjct: 311 ANAVALRPKDVICEMRVQWGLECLHGKAWQVKEYAERLVFGPPKKSFQLLPSYFYILEQE 370 Query: 1518 NPGSALHLQTAEAGAFEYCFMSLAASIRGFK-ACRPVIVVDGTHLKGKYNGIMFVAATKD 1694 NP + + + T E F+YCF S A IRGF+ RP++ +D THLK ++ GI+FVA KD Sbjct: 371 NPDTVIAVATDEEERFKYCFWSYEACIRGFRDVMRPMVAIDTTHLKDRFKGILFVAVCKD 430 Query: 1695 ANEQIFPLAFGFGAKECDESWIWFLEQCRQTFGCPENLLIV 1817 ANE ++P+AFG G E +SW WFL + R GCPEN +++ Sbjct: 431 ANECVYPVAFGIGHVEDKDSWTWFLSKLRDAVGCPENTMLI 471 >ref|XP_003633694.1| PREDICTED: uncharacterized protein LOC100241533 [Vitis vinifera] Length = 734 Score = 164 bits (416), Expect = 9e-38 Identities = 95/267 (35%), Positives = 145/267 (54%), Gaps = 4/267 (1%) Frame = +3 Query: 1038 DTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSL 1217 D + + I++SK + + + L EFKT KS+ + ++ C + C + +RA L Sbjct: 169 DHLEEHQIYSSKKELQRKLYMMALKRKFEFKTTKSTTKLLLVECFDKE--CKWRVRATKL 226 Query: 1218 GKA---WVVDVWCTHTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQR 1388 G + ++ + THTC+ D+ + S ++ G R +DI A +++ Sbjct: 227 GISNMFQIMKFYSTHTCRLDMMSRDNRHASSWLIGESIRETYQGIGCEFRLKDIVADIRK 286 Query: 1389 LFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFE 1568 +G +I Y A A+ AL G ES+ LPSY ++LEQ NPG+ + T F+ Sbjct: 287 QYGIQISYDKAWRAKELALGSIRGSPEESYNTLPSYCYVLEQKNPGTITDIVTDCDNQFK 346 Query: 1569 YCFMSLAASIRGFK-ACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKEC 1745 Y FMS+ AS+ GF + RPV+ VDGT LK KY G +F+AA KD N QI+PLAFG G E Sbjct: 347 YFFMSIGASLAGFHTSIRPVVAVDGTFLKAKYFGTLFIAACKDGNNQIYPLAFGIGDSEN 406 Query: 1746 DESWIWFLEQCRQTFGCPENLLIVSDQ 1826 D SW WFL++ G ++L ++SD+ Sbjct: 407 DASWEWFLQKLHDAIGHIDDLFVISDR 433 >ref|XP_004246853.1| PREDICTED: uncharacterized protein LOC101246857 [Solanum lycopersicum] Length = 654 Score = 162 bits (409), Expect = 6e-37 Identities = 92/262 (35%), Positives = 137/262 (52%), Gaps = 3/262 (1%) Frame = +3 Query: 1050 
KDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGKAW 1229 KD ++ +K + +S+ + +++H +FKT +SS Y + C C + LRA SL K+ Sbjct: 85 KDQVYKNKYVLTSSLKRHSILNHFQFKTTRSSAISYSIQCLGES--CSWSLRASSLNKSE 142 Query: 1230 VVDVW---CTHTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQRLFGA 1400 + + HTC ++ V+ S V P+DIQ + +G Sbjct: 143 MFKIREFESEHTCLLLHNSLSERLATKSVVGSIIVGKYAEPDANYTPKDIQRDMLAEYGV 202 Query: 1401 KIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYCFM 1580 ++ Y A A+ L + GD +S+ +LPSYFH+LE + PGS + +E F Y F+ Sbjct: 203 RLTYMQAWRAKEATLELIRGDPIQSYAKLPSYFHILEATYPGSHIRFHKSEDDRFLYAFV 262 Query: 1581 SLAASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDESWI 1760 +L SI+G++ CRP++VVDGT LKG Y G + A T DA I PLA+ E D SW Sbjct: 263 ALFTSIKGWEYCRPIVVVDGTFLKGAYKGTLLTANTLDAAGSILPLAYAIVDSENDSSWG 322 Query: 1761 WFLEQCRQTFGCPENLLIVSDQ 1826 WF EQ R FG + IVSD+ Sbjct: 323 WFFEQFRDAFGQRPEMCIVSDR 344 >ref|XP_004235419.1| PREDICTED: uncharacterized protein LOC101264907 [Solanum lycopersicum] Length = 743 Score = 162 bits (409), Expect = 6e-37 Identities = 92/262 (35%), Positives = 138/262 (52%), Gaps = 3/262 (1%) Frame = +3 Query: 1050 KDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGKAW 1229 KD ++ +K + +++ + +++H +FKT +SS Y + C C + LRA SL K+ Sbjct: 174 KDQVYKNKYVLTSALKRHSILNHFQFKTTRSSAISYSIQCLGES--CSWSLRASSLNKSE 231 Query: 1230 VVDVW---CTHTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQRLFGA 1400 + + HTC ++ V+ S V P+DIQ + +G Sbjct: 232 MFKIREFESEHTCLLLHNSLSERLATKSVVGSIIVGKYAEPDANYTPKDIQHDMLAEYGV 291 Query: 1401 KIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYCFM 1580 ++ Y A A+ AL + GD +S+ +LPSYFH+LE + PGS + +E F Y F+ Sbjct: 292 RLTYMQAWRAKEAALELIRGDPIQSYAKLPSYFHILEATYPGSHIRFHKSEDDRFLYAFV 351 Query: 1581 SLAASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDESWI 1760 +L SI+G++ CRP++VVDGT LKG Y G + A T DA I PLA+ E D SW Sbjct: 352 ALFTSIKGWEYCRPIVVVDGTFLKGAYKGTLLTANTLDAAGSILPLAYAIVDSENDSSWG 411 Query: 1761 WFLEQCRQTFGCPENLLIVSDQ 1826 WF EQ R FG + IVSD+ Sbjct: 412 
WFFEQFRDAFGQRPEMCIVSDR 433 >ref|XP_006470974.1| PREDICTED: uncharacterized protein LOC102620129 [Citrus sinensis] Length = 805 Score = 160 bits (404), Expect = 2e-36 Identities = 97/266 (36%), Positives = 138/266 (51%), Gaps = 4/266 (1%) Frame = +3 Query: 1038 DTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSL 1217 D + +F K + +G+ L + E++ +SS++ VL C KD C + LRA L Sbjct: 232 DGVYVKALFKDKEELQMKIGMLALQKNFEYRVTRSSKDILVLKCVA-KD-CKWRLRAAKL 289 Query: 1218 GKAWVVDV---WCTHTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQR 1388 + V + H C D+ SK+++ + +P I+ + + Sbjct: 290 KGSDFFQVRKYYPVHNCSLDISQRNHRQASSKLISKFIQSKCDGVARSYKPGSIREDILK 349 Query: 1389 LFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFE 1568 FG I Y A AR AL G ESF L +Y MLE+ NPG+ H++T F+ Sbjct: 350 QFGVNISYDKAWRAREYALHSVKGSLEESFSLLSAYCEMLEKKNPGTITHIETDLENHFQ 409 Query: 1569 YCFMSLAASIRGFKAC-RPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKEC 1745 FM+L +SIRGF++ RPVI VD LKGKY GIMF+AA KD N+Q +PLAFG G E Sbjct: 410 SFFMALGSSIRGFRSSIRPVIAVDRALLKGKYQGIMFLAACKDGNDQTYPLAFGIGDSES 469 Query: 1746 DESWIWFLEQCRQTFGCPENLLIVSD 1823 D SW WFL + R G ++L+ +SD Sbjct: 470 DSSWDWFLTKLRDLMGEVDDLVFISD 495 >ref|XP_007032195.1| Uncharacterized protein TCM_017607 [Theobroma cacao] gi|508711224|gb|EOY03121.1| Uncharacterized protein TCM_017607 [Theobroma cacao] Length = 680 Score = 160 bits (404), Expect = 2e-36 Identities = 97/287 (33%), Positives = 140/287 (48%), Gaps = 5/287 (1%) Frame = +3 Query: 981 SAPSEDLESAVVDVVDCTLDTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYV 1160 SA ++ + C D + K +F SK + ++ + + + K+S + Y Sbjct: 214 SAEKFSFQTITTEESTCAEDRLYKGRMFLSKGELKRALNMLVIKEKFAIRVKRSCKAHYE 273 Query: 1161 LVCKHNKDVCPFILRAKSL---GKAWVVDVWC-THTCKKDLRYHADPTVFSKVLASYFVP 1328 + CK C F +RA L G W V + HTC D PT +K++ Sbjct: 274 VGCKDK--ACKFSVRATKLLDRGVYWKVRTFHKVHTCTVDGLQGQFPTTSAKMIGELMSH 331 Query: 1329 NLMVEGVVLRPRDIQAQVQRLFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHML 1508 L GV LRP++ V+RL +G ESFQ L SYF+ML Sbjct: 332 
KLRANGVALRPKEY---VERLI--------------------FGPPEESFQLLSSYFYML 368 Query: 1509 EQSNPGSALHLQTAEAGAFEYCFMSLAASIRGFK-ACRPVIVVDGTHLKGKYNGIMFVAA 1685 EQ NPG+ + T E F+Y F S A I+GF+ RP I +D THLKG++ ++FVA Sbjct: 369 EQKNPGTVTAVATDEEERFKYYFWSYRACIQGFRDVMRPTIAIDATHLKGRFKRVLFVAI 428 Query: 1686 TKDANEQIFPLAFGFGAKECDESWIWFLEQCRQTFGCPENLLIVSDQ 1826 KD NE ++P+AFG + ++SW WFL + R GCPEN + +SDQ Sbjct: 429 CKDENECVYPVAFGISHVQDEDSWTWFLSKLRDAVGCPENTMFISDQ 475 >ref|XP_006830015.1| hypothetical protein AMTR_s04836p00002630, partial [Amborella trichopoda] gi|548835784|gb|ERM97431.1| hypothetical protein AMTR_s04836p00002630, partial [Amborella trichopoda] Length = 547 Score = 159 bits (401), Expect = 5e-36 Identities = 91/260 (35%), Positives = 142/260 (54%), Gaps = 4/260 (1%) Frame = +3 Query: 1059 IFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGKAWVVD 1238 ++ +K + VG + L + EF KKS + + C+ + C + +R + + + + Sbjct: 283 LYTNKTELKNVVGRFALKMNFEFMVKKSGTDVFYATCRGSD--CKWRVRGRKRARCDMFE 340 Query: 1239 VWC---THTCKKDLRYHADPTVFSKVLASYFVPN-LMVEGVVLRPRDIQAQVQRLFGAKI 1406 V HTC D R HAD + + + + N +G + +DIQ + + +G K+ Sbjct: 341 VTVFHNEHTCSLDSR-HADNRQAAPWVVGHLIKNKFKSDGTKYKAKDIQRDMFQEYGIKM 399 Query: 1407 KYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYCFMSL 1586 Y A R + L + G + ++ +LP YF++LEQ NPG+ + T E F+YCF SL Sbjct: 400 SYEKAWRCREKGLMYSRGTPAAAYSQLPGYFYVLEQKNPGTITDIIT-EDNRFKYCFWSL 458 Query: 1587 AASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDESWIWF 1766 AA RGFK CRPVI +DGT LK ++ G M VA DAN Q+FP+AF E +SW +F Sbjct: 459 AACRRGFKFCRPVISIDGTFLKTRFGGTMLVAVAYDANNQLFPIAFAIVDSENHDSWKYF 518 Query: 1767 LEQCRQTFGCPENLLIVSDQ 1826 L++ ++ G ENL+ VSD+ Sbjct: 519 LQKLKEAIGEVENLVFVSDR 538 >ref|XP_004248932.1| PREDICTED: uncharacterized protein LOC101243650 [Solanum lycopersicum] Length = 699 Score = 156 bits (394), Expect = 3e-35 Identities = 91/289 (31%), Positives = 147/289 (50%), Gaps = 12/289 (4%) Frame = +3 Query: 996 DLESAVVDV------VDCTLDT------IAKDIIFNSKANMIASVGLYHLVHHLEFKTKK 1139 D ES V+ V V+C + 
T + +D ++ K + A + Y + + ++KT + Sbjct: 128 DTESLVLSVPNNSDNVNCDIITNVKHKVVLEDQVYKDKGTLKAVMTQYAIDNRFQWKTDR 187 Query: 1140 SSRERYVLVCKHNKDVCPFILRAKSLGKAWVVDVWCTHTCKKDLRYHADPTVFSKVLASY 1319 SS+ Y LVC D C ++L++ S+ K+ ++ + Sbjct: 188 SSQTCYTLVCV--SDNCGWVLKSSSINKSGIL------------------------IGGM 221 Query: 1320 FVPNLMVEGVVLRPRDIQAQVQRLFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYF 1499 P L+ L P+DIQ V G + Y+ A +A+ +AL G S S+ +L SY Sbjct: 222 IQPKLVDHKRKLTPKDIQQDVSLALGVNVSYSVAWNAKEKALVSLRGTPSGSYGKLASYL 281 Query: 1500 HMLEQSNPGSALHLQTAEAGAFEYCFMSLAASIRGFKACRPVIVVDGTHLKGKYNGIMFV 1679 ++L+ + PGS + ++ + F Y F+SL I+GF+ C+P++VVDG+HL+G YNG+ Sbjct: 282 YVLDATYPGSHIRMKKTDENQFLYLFISLFPFIKGFEFCKPIVVVDGSHLRGTYNGVFVS 341 Query: 1680 AATKDANEQIFPLAFGFGAKECDESWIWFLEQCRQTFGCPENLLIVSDQ 1826 A+T D I PLA+G E D SW WF EQ R+ G N+ +VSD+ Sbjct: 342 ASTVDGAGNILPLAYGIIDPENDASWTWFFEQFREAHGVKYNMCVVSDR 390 >emb|CAN69231.1| hypothetical protein VITISV_008803 [Vitis vinifera] Length = 751 Score = 156 bits (394), Expect = 3e-35 Identities = 105/289 (36%), Positives = 150/289 (51%), Gaps = 11/289 (3%) Frame = +3 Query: 990 SEDLESAVVDVVDCTL--DTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVL 1163 + D + + + DC D I + I++SK + + + L EF+T KS+ + VL Sbjct: 223 NNDQDMSRIGTSDCGTNDDHIEEKQIYSSKKELQKKLYIIALKEKFEFRTIKSTTKLLVL 282 Query: 1164 VCKHNKDVCPFILRAKSLGKA---WVVDVWCTHTCK-----KDLRYHADPTVFSKVLASY 1319 C N+ C + RA LG + V+ THT + +D R+ + V + +Y Sbjct: 283 QCVDNE--CKWRFRATKLGSSNFFQVMKYHPTHTYRLNMMSRDNRHASSWLVGESMRQTY 340 Query: 1320 FVPNLMVEGVVLRPRDIQAQVQRLFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYF 1499 V G RP+DI ++ +G +I Y A AR A G ES+ LPSY Sbjct: 341 QV------GRQYRPKDIIGDIRNKYGVQISYDKAWRAREFAFNSIRGSPEESYDVLPSYC 394 Query: 1500 HMLEQSNPGSALHLQTAEAGAFEYCFMSLAASIRGFK-ACRPVIVVDGTHLKGKYNGIMF 1676 +MLEQ NPG+ + T F+Y FM+ +A I GF+ + R VI VDGT LK KY G +F Sbjct: 395 YMLEQKNPGTITDIVTDVDNKFKYLFMAFSACISGFRTSIRLVIAVDGTFLKSKYLGTLF 454 Query: 1677 VAATKDANEQIFPLAFGFGAKECDESWIWFLEQCRQTFGCPENLLIVSD 1823 VAA+KD N QI+PLAF G E D SW WFL + G ++L++VSD Sbjct: 455 
VAASKDGNNQIYPLAFEIGDSENDASWEWFLTKLYDVIGHVDDLVVVSD 503 >emb|CAN68581.1| hypothetical protein VITISV_011863 [Vitis vinifera] Length = 276 Score = 156 bits (394), Expect = 3e-35 Identities = 91/266 (34%), Positives = 140/266 (52%), Gaps = 4/266 (1%) Frame = +3 Query: 1038 DTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSL 1217 D + + I++SK + + + L EFKT KS+ + ++ C + C + + A L Sbjct: 13 DHLEEHQIYSSKKELQRKLYMMALKRKFEFKTTKSTTKLLLIECFDKE--CKWRVXATKL 70 Query: 1218 GKA---WVVDVWCTHTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQR 1388 G + ++ + THTC D+ + V S ++ G RP+DI A +++ Sbjct: 71 GISNMFQIMKFYSTHTCXLDMMSRDNRHVSSWLIGESIRETYQEVGCEFRPKDIVADIRK 130 Query: 1389 LFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFE 1568 +G +I Y A AR AL G ES+ LPSY ++LEQ NPG+ + F+ Sbjct: 131 QYGVQISYDKAWRARELALGSIRGSPKESYNTLPSYCYVLEQKNPGTITDIVIDCDNQFK 190 Query: 1569 YCFMSLAASIRGF-KACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKEC 1745 Y FMS+ AS+ GF + R V+ +DGT LK KY G +F+ A KD N QI+PLAFG G E Sbjct: 191 YFFMSIGASLVGFHTSIRLVVAIDGTFLKAKYLGTLFIVACKDGNNQIYPLAFGIGDSEN 250 Query: 1746 DESWIWFLEQCRQTFGCPENLLIVSD 1823 D SW W L++ G ++L ++S+ Sbjct: 251 DASWEWCLQKLHDALGHIDDLFVISB 276 >ref|XP_004229090.1| PREDICTED: uncharacterized protein LOC101254935 [Solanum lycopersicum] Length = 336 Score = 152 bits (384), Expect = 5e-34 Identities = 89/264 (33%), Positives = 138/264 (52%), Gaps = 3/264 (1%) Frame = +3 Query: 1044 IAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGK 1223 +A++ I+N+K L ++ H+ K S R V + N C +++RA S K Sbjct: 26 VAENQIYNNKEI------LKEVMRHVGIVEKFSFR-----VARCNASNCSWMMRASSFNK 74 Query: 1224 AWVVDVW---CTHTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQRLF 1394 + + V HTC R + + V+A + N + V P+D+ + +L Sbjct: 75 SSLFKVRKYIAQHTCCVRERVYVIRQGITDVVAVLIMDNYIDPSKVYTPKDVADDMLKLH 134 Query: 1395 GAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYC 1574 G + Y A A+ +A+ + GD +ES+ RLP YF++LEQ+ PGS L ++ E F Y Sbjct: 135 GVSLTYIQAWRAKEKAVKLVRGDPAESYARLPGYFYILEQTYPGSVLKIKRNEDDTFLYA 194 Query: 1575 
FMSLAASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDES 1754 F +L A I+G++ CRP++VVDG LK Y G M A+T D I PLA+ E D S Sbjct: 195 FFALEACIKGWEYCRPIVVVDGAALKCSYGGTMLTASTLDPGGHILPLAYAIVDSENDAS 254 Query: 1755 WIWFLEQCRQTFGCPENLLIVSDQ 1826 W WF EQ R+ +G +N+ +SD+ Sbjct: 255 WTWFFEQFREAYGVRQNMCFMSDR 278 >ref|XP_006368044.1| PREDICTED: uncharacterized protein LOC102605968 [Solanum tuberosum] Length = 507 Score = 152 bits (383), Expect = 6e-34 Identities = 85/264 (32%), Positives = 135/264 (51%), Gaps = 3/264 (1%) Frame = +3 Query: 1044 IAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGK 1223 I D I+ K+ + + Y + +F ++S+ Y LVC+ + CP++++A S+ K Sbjct: 144 IMTDQIYMDKSTLKDVMEKYSIEKRFKFLVERSNSISYTLVCQSKE--CPWLMKASSINK 201 Query: 1224 AWVVDVWC---THTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQRLF 1394 + + + HTC H+D S ++ P L P DI+ V+ Sbjct: 202 SKMFRIRVFNSEHTCPLKDGVHSDCRATSGLIGGIIAPKLRNHKRKYTPNDIRDDVRLDL 261 Query: 1395 GAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYC 1574 G I Y A A+ +AL G + S+ +LP Y + L+++ PGS + ++ F Y Sbjct: 262 GIDINYMLAWRAKEKALESIMGQPAASYGKLPGYLYTLDKTYPGSHIRMKKTPENEFLYV 321 Query: 1575 FMSLAASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDES 1754 F++L A I+GF CRP++VVD +HLK Y G A+T D I PLA+G E D S Sbjct: 322 FIALHAFIKGFDYCRPIVVVDASHLKSTYTGAFVSASTLDGAGNILPLAYGVIDSENDAS 381 Query: 1755 WIWFLEQCRQTFGCPENLLIVSDQ 1826 W WF EQ ++ +G EN+ +VSD+ Sbjct: 382 WTWFFEQFKEAYGERENMCVVSDR 405 Score = 75.5 bits (184), Expect = 8e-11 Identities = 34/76 (44%), Positives = 46/76 (60%) Frame = +3 Query: 1563 FEYCFMSLAASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKE 1742 F Y F++L A I+GF CRP++VVD +HLK Y G A T D I PLA+G E Sbjct: 9 FLYMFIALHAFIKGFDYCRPIVVVDASHLKSAYTGAFVSANTLDGAGNILPLAYGVIDSE 68 Query: 1743 CDESWIWFLEQCRQTF 1790 D++W WF EQ ++ + Sbjct: 69 NDDAWTWFFEQFKEAY 84 >ref|XP_007036749.1| Uncharacterized protein TCM_012672 [Theobroma cacao] gi|508773994|gb|EOY21250.1| Uncharacterized protein TCM_012672 [Theobroma cacao] Length = 391 Score = 149 bits (376), 
Expect = 4e-33 Identities = 85/240 (35%), Positives = 127/240 (52%), Gaps = 5/240 (2%) Frame = +3 Query: 1029 CTLDTIAKDIIFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRA 1208 C D + K +F+SKA + ++ + + + K+S + Y + CK C F LRA Sbjct: 100 CADDHLYKGRMFSSKAELKRALNMLVIKEKFAIRVKRSCKGCYEVGCKDK--ACKFSLRA 157 Query: 1209 KSL---GKAWVVDVWC-THTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQA 1376 L G+ W V + HTC D PT +K++ L GV LRP++I Sbjct: 158 TKLLERGEYWQVRTFHKVHTCTVDGLQGRFPTTSAKIIGELISHKLRANGVALRPKNIIC 217 Query: 1377 QVQRLFGAKIKYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEA 1556 +++ +G + A A+ + + S ESFQ LPSYF+MLEQ NP + + T EA Sbjct: 218 EMRVQWGLECLNGKAWQAKEYVERLVFDLSEESFQLLPSYFYMLEQENPNTLTAMATNEA 277 Query: 1557 GAFEYCFMSLAASIRGFK-ACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFG 1733 F+YCF S IRGF+ RP + +D THLK ++ G++FVA KD N+ ++P+AFG G Sbjct: 278 ERFKYCFWSYGTCIRGFRDVMRPTVAIDATHLKSRFKGVLFVATCKDENQCVYPVAFGIG 337 >ref|XP_006848744.1| hypothetical protein AMTR_s04155p00003130 [Amborella trichopoda] gi|548852169|gb|ERN10325.1| hypothetical protein AMTR_s04155p00003130 [Amborella trichopoda] Length = 590 Score = 147 bits (372), Expect = 1e-32 Identities = 90/258 (34%), Positives = 129/258 (50%), Gaps = 3/258 (1%) Frame = +3 Query: 1062 FNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGKAWVVDV 1241 + +K + VG + L ++ E+ KKSS + + CK C + LR + + +V Sbjct: 24 YENKEELRQVVGRFALNNNFEWMVKKSSPDVLYVTCKAPD--CKWRLRGRKKMHSDNFEV 81 Query: 1242 WC---THTCKKDLRYHADPTVFSKVLASYFVPNLMVEGVVLRPRDIQAQVQRLFGAKIKY 1412 HTC + R V+ +G + +DIQ + +G + Y Sbjct: 82 TVFHNEHTCNLNARRSDHRQAAPWVVGHLIKGKYTQDGTKYKAKDIQRDMFDNYGISMSY 141 Query: 1413 ATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYCFMSLAA 1592 A R LT G S+ +LP Y +MLEQ NPG+ L E F+YCF+SL A Sbjct: 142 VKAWRCREMGLTYARGTPEFSYMKLPGYLYMLEQKNPGTITDLYL-EDERFKYCFISLGA 200 Query: 1593 SIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDESWIWFLE 1772 RGF CRPV+ +DGT LK KY GIM VA DAN Q+ P+A+G E ++SW +FL+ Sbjct: 201 CRRGFSFCRPVLSIDGTFLKTKYGGIMLVAVAYDANNQLLPVAYGIVDSENNDSWTYFLQ 260 
Query: 1773 QCRQTFGCPENLLIVSDQ 1826 + R G ENL+ VSD+ Sbjct: 261 KLRVAIGVVENLVFVSDR 278 >gb|EXB36258.1| hypothetical protein L484_013693 [Morus notabilis] Length = 821 Score = 146 bits (369), Expect = 3e-32 Identities = 87/260 (33%), Positives = 134/260 (51%), Gaps = 4/260 (1%) Frame = +3 Query: 1059 IFNSKANMIASVGLYHLVHHLEFKTKKSSRERYVLVCKHNKDVCPFILRAKSLGKAWVVD 1238 +F +K + ++ + L +++ KS + R +L C C + +RA + + Sbjct: 263 LFKNKDTLSRTISMIALKEKYQYRVFKSDKTRIILRCVDEN--CKWRVRATKYNETDMFQ 320 Query: 1239 VWC---THTCKKDLRYHADPTVFSKVLASYFVPNLMVEG-VVLRPRDIQAQVQRLFGAKI 1406 V THTC DL S++++ + G V P +I ++ FG + Sbjct: 321 VTKYNETHTCSLDLLQCDHRQASSQIISEHIKKKYEESGRTVYAPNNIIEDLKNEFGIDV 380 Query: 1407 KYATALSARNQALTMTYGDSSESFQRLPSYFHMLEQSNPGSALHLQTAEAGAFEYCFMSL 1586 Y A AR AL GD +S++ L S+ + L ++NPGS + + F+Y FM++ Sbjct: 381 SYEKAWRARKIALEKIGGDVDKSYEELASFLYTLNKTNPGSVADIVLDDENKFKYMFMAV 440 Query: 1587 AASIRGFKACRPVIVVDGTHLKGKYNGIMFVAATKDANEQIFPLAFGFGAKECDESWIWF 1766 AASI G+K CRPVIVVD L+ K+ G + A +DAN QIFPLAFG G E DESW +F Sbjct: 441 AASIHGWKHCRPVIVVDEIFLECKHRGSLLCACAEDANNQIFPLAFGIGESENDESWEYF 500 Query: 1767 LEQCRQTFGCPENLLIVSDQ 1826 ++ + F + + IVSDQ Sbjct: 501 FKRLSEAFSERDGMWIVSDQ 520