BLASTX nr result
ID: Sinomenium22_contig00026077
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00026077
         (1398 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...   410   e-112
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...   401   e-109
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...   399   e-108
ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma...   396   e-107
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...   392   e-106
ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prun...   374   e-101
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...   373   e-101
ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [...   357   5e-96
ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma...   357   7e-96
ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma...   356   1e-95
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...   343   1e-91
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...   339   2e-90
ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211...   330   9e-88
ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244...   329   2e-87
gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis]     320   7e-85
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...   319   2e-84
ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592...   319   2e-84
ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma...
314 5e-83 gb|EXC33904.1| hypothetical protein L484_012794 [Morus notabilis] 313 9e-83 ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part... 310 1e-81 >ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera] gi|298205214|emb|CBI17273.3| unnamed protein product [Vitis vinifera] Length = 425 Score = 410 bits (1054), Expect = e-112 Identities = 220/414 (53%), Positives = 292/414 (70%), Gaps = 2/414 (0%) Frame = -1 Query: 1362 PSSEQLDLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSD 1183 P++ +DL+TIRSR+ L+ + S+ +P +S L +E L++R+ + +S++SD Sbjct: 5 PAAGTMDLDTIRSRMSELNRIHTNYSHISDSNPLDSRSLFQEFSHHLQSRVNQILSQYSD 64 Query: 1182 FSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLK 1003 +L +DLD YL + K+ELNL+E+EN KI NEIE L+ +++EDS +LE DLE L S+ Sbjct: 65 VESLEADDLDAYLGHLKKELNLVESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVD 124 Query: 1002 FIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNKVTLNSLQDLC 826 F+ SQ + E G V +S H DN F++L+L+ Q +KNK+TL SLQDL Sbjct: 125 FVASQGLKRAEAGALVDYSSSVEDQLDSRTAHGDNNFEILDLNYQTQKNKITLKSLQDLD 184 Query: 825 DILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDH 646 KRFEA+ +IED LTGLKVI+ EGNCIRLSL TF+PNLE LL +K+E +P ++H Sbjct: 185 YTFKRFEAIEKIEDALTGLKVIDFEGNCIRLSLSTFIPNLEGLLCEEKIEAVNEPSELNH 244 Query: 645 ELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAK-SRQFCPSLAVLEMGSSLEWLVRKXXX 469 ELLIEV D +MELKNVEIFPNDV++GEI+D+AK SR+ +++LE SSLEW VRK Sbjct: 245 ELLIEVMDQSMELKNVEIFPNDVYLGEIIDAAKSSRKLFSHMSILETRSSLEWFVRKVQD 304 Query: 468 XXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLIS 289 V ANKSRHS EY DRDEI++AHMVGG++A+IK+ Q WP+ ALKL S Sbjct: 305 KIILCALRQSIVKGANKSRHSLEYLDRDEIIVAHMVGGVDAYIKVCQGWPVSNNALKLKS 364 Query: 288 LKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELH 127 LK+SD S+ ISLSFLCKVEE+ANSLDV IR+N+ SFVDAIE ILV+QM+S+LH Sbjct: 365 LKSSDQQSKGISLSFLCKVEEMANSLDVSIRKNISSFVDAIEEILVQQMQSKLH 418 >ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus sinensis] Length = 444 Score = 401 bits 
(1031), Expect = e-109 Identities = 221/420 (52%), Positives = 287/420 (68%), Gaps = 9/420 (2%) Frame = -1 Query: 1359 SSEQLDLETIRSRVQALSEVLRTSKEFSELS-PSESDKLLKECVIGLENRIEECMSEFSD 1183 SS LDL ++RS V+ L E+ R+ E + S+S+ LLKE E++++E ++E++D Sbjct: 19 SSSPLDLHSLRSEVKELMEIHRSGIEDEPNTVSSDSENLLKEYAHDFESKVKEIITEYAD 78 Query: 1182 FSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLK 1003 S LGIEDLD YLE+ KEEL +EAE+ KI NEIE L+ + +EDS LE DLE LNC++ Sbjct: 79 VSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLESDLEELNCAID 138 Query: 1002 FIESQNRHKLEM------GTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNKVTLN 844 I S+N + G D T + HED+ F++LEL++QIEKNK+ LN Sbjct: 139 LIVSENAKEDRQAVCPARGEDQVCPTHTEDQSDLIKIHEDHRFEILELESQIEKNKIILN 198 Query: 843 SLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAID 664 SLQDL +LKRF+AV QIED+LTGLKVI+ +G C RLS++T++P LE K+E I+ Sbjct: 199 SLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTYIPTLEESSFQHKIEDVIE 258 Query: 663 PPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWL 487 P V+HELLIEV DGTME+KNVE+FPNDV I ++VD+AKS RQ L LE SSL+W Sbjct: 259 PSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSFRQSGTQLDSLETSSSLQWF 318 Query: 486 VRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTC 307 +R V ANKSRH FEY +RDE+++AH+VGG++AFIK Q WP+ Sbjct: 319 IRNVQDRIILSTLRRFVVKTANKSRHFFEYFERDEMIVAHLVGGVDAFIKPSQGWPLSNS 378 Query: 306 ALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELH 127 LK+ISLKNSD+HS+ ISLSF C+VEE ANSLDV IRQNL SFVD +E IL+ QMR ELH Sbjct: 379 PLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLSSFVDGVEKILLEQMRVELH 438 >ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus sinensis] Length = 447 Score = 399 bits (1025), Expect = e-108 Identities = 220/423 (52%), Positives = 286/423 (67%), Gaps = 12/423 (2%) Frame = -1 Query: 1359 SSEQLDLETIRSRVQALSEVLRTSKEFSELS-PSESDKLLKECVIGLENRIEECMSEFSD 1183 SS LDL ++RS V+ L E+ R+ E + S+S+ LLKE E++++E ++E++D Sbjct: 19 SSSPLDLHSLRSEVKELMEIHRSGIEDEPNTVSSDSENLLKEYAHDFESKVKEIITEYAD 78 Query: 1182 
FSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLK 1003 S LGIEDLD YLE+ KEEL +EAE+ KI NEIE L+ + +EDS LE DLE LNC++ Sbjct: 79 VSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLESDLEELNCAID 138 Query: 1002 FIESQNRHKLE---------MGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNKV 853 I S+ + G D T + HED+ F++LEL++QIEKNK+ Sbjct: 139 LIVSEGSQNAKEDRQAVCPARGEDQVCPTHTEDQSDLIKIHEDHRFEILELESQIEKNKI 198 Query: 852 TLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEY 673 LNSLQDL +LKRF+AV QIED+LTGLKVI+ +G C RLS++T++P LE K+E Sbjct: 199 ILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTYIPTLEESSFQHKIED 258 Query: 672 AIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSL 496 I+P V+HELLIEV DGTME+KNVE+FPNDV I ++VD+AKS RQ L LE SSL Sbjct: 259 VIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSFRQSGTQLDSLETSSSL 318 Query: 495 EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316 +W +R V ANKSRH FEY +RDE+++AH+VGG++AFIK Q WP+ Sbjct: 319 QWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFERDEMIVAHLVGGVDAFIKPSQGWPL 378 Query: 315 LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136 LK+ISLKNSD+HS+ ISLSF C+VEE ANSLDV IRQNL SFVD +E IL+ QMR Sbjct: 379 SNSPLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLSSFVDGVEKILLEQMRV 438 Query: 135 ELH 127 ELH Sbjct: 439 ELH 441 >ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508713296|gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 430 Score = 396 bits (1018), Expect = e-107 Identities = 216/425 (50%), Positives = 289/425 (68%), Gaps = 4/425 (0%) Frame = -1 Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213 M E +E SSE LDL +IRSR+ LSE+ R +K+ E S+KLLK+C + E++ Sbjct: 1 MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60 Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033 +++ + E+SD LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S LE Sbjct: 61 VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120 Query: 1032 
DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856 +LEGL +L I SQ +E + S+ S E F+++EL++QIEKN Sbjct: 121 NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180 Query: 855 VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676 + L SLQDL + KR + + QIED LTGLKVI +GNCIRLSL+T++P LE LL + +E Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240 Query: 675 YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499 +P ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ +L V + SS Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300 Query: 498 LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319 LEW V K V NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q WP Sbjct: 301 LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWP 360 Query: 318 MLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMR 139 + LKL+S+K+SD+HSR ISLS LCK EE+ANSLD+ IRQNL +FVDA+E +L+ QMR Sbjct: 361 LSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQMR 420 Query: 138 SELHT 124 +L + Sbjct: 421 LDLQS 425 >ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] gi|222847415|gb|EEE84962.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] Length = 429 Score = 392 bits (1006), Expect = e-106 Identities = 211/424 (49%), Positives = 294/424 (69%), Gaps = 7/424 (1%) Frame = -1 Query: 1374 LESVPSS--EQLDLETIRSRVQALSEVLR--TSKEFSELSPSESDKLLKECVIGLENRIE 1207 +E PS+ E L+L TIRSR+ L E+ R + FSE++ S+SD+L+K+ L +++ Sbjct: 1 MEISPSTTQESLNLNTIRSRINELEEIYRDCNADSFSEINSSDSDELMKDSAQQLVSKVS 60 Query: 1206 ECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDL 1027 + ++E+SDFS LGIEDLD YL + KEEL+ EAE+ KI NEIE+L+ + +EDS+ELE DL Sbjct: 61 QTVTEYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDL 120 Query: 1026 EGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSET--FDRTHEDNFKVLELDNQIEKNKV 853 E + CSL I SQ + E G + +G +++ + E+ F++L+LDNQIE++ Sbjct: 121 EWMKCSLDLISSQRDREKEKGDEQMEHFSSGENQSNLINTNEENKFEILKLDNQIEESTR 180 Query: 852 
TLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEY 673 L S+QDL + K ++A+ QIED L+GLKVIE +G CIRLSL+T++P + +L QK+E Sbjct: 181 ILKSMQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPK-QDVLFLQKIEE 239 Query: 672 AIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSL 496 P ++HE LIEV +G+ME+K VE+FPND++IG+IVD+AKS RQ LA++E SSL Sbjct: 240 TNVPYEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALMETSSSL 299 Query: 495 EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316 EW VRK A+ SR S EY DRDEI++AHMVGG++AF+++ Q WP+ Sbjct: 300 EWFVRKAQDRIIQSTLRRLVARSASTSRQSIEYLDRDEIIVAHMVGGVDAFMEVSQGWPI 359 Query: 315 LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136 LKL+SLKNS++H++ ISL FLCKVEE ANSLDV RQNL SFVD++E ILV QM Sbjct: 360 TNSPLKLVSLKNSNHHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKILVEQMHL 419 Query: 135 ELHT 124 ELH+ Sbjct: 420 ELHS 423 >ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] gi|462422632|gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] Length = 416 Score = 374 bits (961), Expect = e-101 Identities = 208/424 (49%), Positives = 284/424 (66%), Gaps = 4/424 (0%) Frame = -1 Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRTSKE--FSELSPSESDKLLKECVIGLENR 1213 MEE + +PSSE LDL TI+ +V+ L E++ + ++ SELSPS+SD L++ C + L++R Sbjct: 1 MEE--DPIPSSEPLDLNTIQRQVRELEEIIESCRQDDASELSPSDSDDLIRNCGLLLQSR 58 Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033 +E+ +SE SD L ++ + Y+ ++ELN +EAE+ K+ N IE L + ED L Sbjct: 59 VEQIVSECSDVGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGT 118 Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFD--RTHEDNFKVLELDNQIEKN 859 DL L CSL F+E ++ K ++G DV + G + D + D F++LEL+NQIEKN Sbjct: 119 DLAQLKCSLDFVEEKDLEKAKLGADVDYH--KCGKDLLDPMNVNADKFELLELENQIEKN 176 Query: 858 KVTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKM 679 + L SLQDL LK + QIED +TGLKVI EGNC+RLSL+T++P LE L +K+ Sbjct: 177 NIILKSLQDLECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLFSPKKV 236 Query: 678 
EYAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLEMGSS 499 A +P V+HELLIE+ +GTM L+NVEIFPNDV+I +I+D+AKS + SS Sbjct: 237 GDATEPSEVNHELLIELLEGTMGLRNVEIFPNDVYINDILDAAKSLR----------KSS 286 Query: 498 LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319 L+W V K V + NKSRHS EY D+DE ++AH+VGG++AFIK+PQ WP Sbjct: 287 LQWFVTKVQDRIVLCTMRRLVVKNENKSRHSLEYLDKDETVVAHVVGGVDAFIKVPQGWP 346 Query: 318 MLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMR 139 +L+ LKLI LK+SD HS+ ISLSFLC V+E+ANSL V+IRQ L SFVDAIE ILV QM Sbjct: 347 LLSSPLKLIYLKSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEKILVEQMC 406 Query: 138 SELH 127 SE+H Sbjct: 407 SEIH 410 >ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis] gi|223542639|gb|EEF44176.1| conserved hypothetical protein [Ricinus communis] Length = 415 Score = 373 bits (958), Expect = e-101 Identities = 206/412 (50%), Positives = 283/412 (68%), Gaps = 4/412 (0%) Frame = -1 Query: 1347 LDLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSALG 1168 LDL +I ++ L E+ +E+ S SD++L++C + LE+++++ MSE SDF+ LG Sbjct: 5 LDLNSIICGIKDLEEIYSGCNGDTEMLSSHSDQVLEDCALHLESKVQQIMSECSDFNFLG 64 Query: 1167 IEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIESQ 988 IEDLD ++E+ KEEL+ +E KI EIE L+ + +ED T LE D+E L CSL FI S+ Sbjct: 65 IEDLDAFVEHLKEELSTTMSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISSK 124 Query: 987 NRHKLEMGTDVAHSVPTGGSETFDRTHED-NFKVLELDNQIEKNKVTLNSLQDLCDILKR 811 + +E +VA ++ H D F++ +LD+QI K+K+ L SLQD + KR Sbjct: 125 D---VEKEKEVACREDLYSTDA----HRDYEFEISKLDDQIAKSKMILKSLQDFDSVFKR 177 Query: 810 FEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIE 631 +AV QIE+ L+GLKVIE +G+CIRLSL+T+LP L+ ++ K E +P V+HELLIE Sbjct: 178 VDAVEQIEEALSGLKVIEFDGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHELLIE 237 Query: 630 VFDGTMELKNVEIFPNDVFIGEIVDSAKS--RQFCPS-LAVLEMGSSLEWLVRKXXXXXX 460 V GTMELKNVEIFPND++I +IVD+AKS ++F S L E SSL WLVRK Sbjct: 238 VVSGTMELKNVEIFPNDIYISDIVDAAKSFRKEFLYSALTESETRSSLGWLVRKVQDRII 297 Query: 459 
XXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKN 280 V +NKSR+SFEY DRDE ++AH+VGG++AFIKL Q WP+ LKLISLK+ Sbjct: 298 QFTLRRLVVKSSNKSRYSFEYLDRDETVVAHLVGGVDAFIKLSQGWPVSRSPLKLISLKS 357 Query: 279 SDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124 S++HS+ ISLSFLC+VEE+ NSLD+Q+R NL+SFV+ IE +LV QMR ELH+ Sbjct: 358 SNHHSKEISLSFLCRVEEVVNSLDIQMRLNLLSFVEVIEKLLVEQMRIELHS 409 >ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] gi|508713299|gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 372 Score = 357 bits (917), Expect = 5e-96 Identities = 190/367 (51%), Positives = 254/367 (69%), Gaps = 2/367 (0%) Frame = -1 Query: 1218 NRIEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTEL 1039 +++++ + E+SD LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S L Sbjct: 1 SKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNIL 60 Query: 1038 ERDLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEK 862 E +LEGL +L I SQ +E + S+ S E F+++EL++QIEK Sbjct: 61 EGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEK 120 Query: 861 NKVTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQK 682 N + L SLQDL + KR + + QIED LTGLKVI +GNCIRLSL+T++P LE LL + Sbjct: 121 NNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKT 180 Query: 681 MEYAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMG 505 +E +P ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ +L V + Sbjct: 181 IEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQ 240 Query: 504 SSLEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQS 325 SSLEW V K V NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q Sbjct: 241 SSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQG 300 Query: 324 WPMLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQ 145 WP+ LKL+S+K+SD+HSR ISLS LCK EE+ANSLD+ IRQNL +FVDA+E +L+ Q Sbjct: 301 WPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQ 360 Query: 144 MRSELHT 124 MR +L + Sbjct: 361 MRLDLQS 367 >ref|XP_007034272.1| 
Uncharacterized protein isoform 6 [Theobroma cacao] gi|508713301|gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 432 Score = 357 bits (916), Expect = 7e-96 Identities = 196/391 (50%), Positives = 261/391 (66%), Gaps = 4/391 (1%) Frame = -1 Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213 M E +E SSE LDL +IRSR+ LSE+ R +K+ E S+KLLK+C + E++ Sbjct: 1 MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60 Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033 +++ + E+SD LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S LE Sbjct: 61 VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120 Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856 +LEGL +L I SQ +E + S+ S E F+++EL++QIEKN Sbjct: 121 NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180 Query: 855 VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676 + L SLQDL + KR + + QIED LTGLKVI +GNCIRLSL+T++P LE LL + +E Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240 Query: 675 YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499 +P ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ +L V + SS Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300 Query: 498 LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319 LEW V K V NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q WP Sbjct: 301 LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWP 360 Query: 318 MLTCALKLISLKNSDNHSRVISLSFLCKVEE 226 + LKL+S+K+SD+HSR ISLS LCK EE Sbjct: 361 LSKSPLKLLSIKSSDHHSRGISLSLLCKAEE 391 >ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508713300|gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 392 Score = 356 bits (914), Expect = 1e-95 Identities = 195/392 (49%), Positives = 261/392 (66%), Gaps = 4/392 (1%) Frame = -1 Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213 M E +E SSE LDL +IRSR+ 
LSE+ R +K+ E S+KLLK+C + E++ Sbjct: 1 MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60 Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033 +++ + E+SD LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S LE Sbjct: 61 VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120 Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856 +LEGL +L I SQ +E + S+ S E F+++EL++QIEKN Sbjct: 121 NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180 Query: 855 VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676 + L SLQDL + KR + + QIED LTGLKVI +GNCIRLSL+T++P LE LL + +E Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240 Query: 675 YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499 +P ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ +L V + SS Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300 Query: 498 LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319 LEW V K V NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q WP Sbjct: 301 LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWP 360 Query: 318 MLTCALKLISLKNSDNHSRVISLSFLCKVEEI 223 + LKL+S+K+SD+HSR ISLS LCK E + Sbjct: 361 LSKSPLKLLSIKSSDHHSRGISLSLLCKAERV 392 >ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] gi|482566470|gb|EOA30659.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] Length = 420 Score = 343 bits (880), Expect = 1e-91 Identities = 188/410 (45%), Positives = 265/410 (64%), Gaps = 2/410 (0%) Frame = -1 Query: 1347 LDLETIRSRVQALSEVLRTSK-EFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSAL 1171 LDL+ IRSRV+ L + R K E E S+S+ L+++ V+ E ++ E + ++SD L Sbjct: 10 LDLQQIRSRVKELESIHRNCKYEPGESCTSDSENLVQDFVLQFETKVNEIVEDYSDVDIL 69 Query: 1170 GIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIES 991 +ED D YLE ++EL+ +EAE+ K+ EIE LS S EDS+ LERDLEGL SL + S Sbjct: 70 DVEDSDAYLEYLRKELHSVEAESAKVSEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSS 129 Query: 990 
QNRHKLEMGTDVAHSVPTGGSETFDRTHEDNFKVLELDNQIEKNKVTLNSLQDLCDILKR 811 Q+ +K + S+ E + +D FK+ EL+NQ+E+ ++ L SL+DL + KR Sbjct: 130 QDVNKSKESPPSCSSM-----EVCEVNDDDKFKMFELENQMEEKRMILKSLEDLDSLRKR 184 Query: 810 FEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIE 631 F+A Q+ED LTGLKV+E +GN IRL L+T++P L+ L K E+ P + HELLI Sbjct: 185 FDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPELDGLPAQHKFEHTTKPSELIHELLIY 244 Query: 630 VFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXX 454 + D T E+ +E+FPNDV+IG+I+++A S RQ AVL+ SS++W+V K Sbjct: 245 LKDKTTEITKLEMFPNDVYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITT 304 Query: 453 XXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSD 274 V + RH+F+Y D+DE ++AH+ GGI+AF+K+ WP+L LKL SLKNSD Sbjct: 305 TLRKYIVTSSKTMRHTFKYYDKDETIVAHIAGGIDAFLKVSDGWPLLNSPLKLASLKNSD 364 Query: 273 NHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124 N S+ ISLS +CKVEE+ANSLD+Q RQNL F+DAIE ILV Q R EL + Sbjct: 365 NQSKGISLSLICKVEELANSLDLQTRQNLSGFIDAIEKILVHQTREELQS 414 >ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. 
lyrata] Length = 421 Score = 339 bits (870), Expect = 2e-90 Identities = 187/408 (45%), Positives = 266/408 (65%), Gaps = 2/408 (0%) Frame = -1 Query: 1347 LDLETIRSRVQALSEVLRTSK-EFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSAL 1171 LDL+ IRSRV+ L + R + E E S+S+ L+++ V+ E +++E + ++SD L Sbjct: 10 LDLQEIRSRVKELEFIHRNCRDEPGESCSSDSETLVQDFVLQFEPKVKEIVEDYSDVDLL 69 Query: 1170 GIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIES 991 +ED D YLE ++EL +EAE+ K+ EIE LS S +DS+ LERDLEGL SL + S Sbjct: 70 DVEDSDAYLEYLRKELQSVEAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSS 129 Query: 990 QNRHKLEMGTDVAHSVPTGGSETFDRTHEDNFKVLELDNQIEKNKVTLNSLQDLCDILKR 811 Q+ K + + S+ E + +D FK+ EL+NQ+E+ + L SL+DL + KR Sbjct: 130 QDVEKSKENQPSSSSM-----EVCEVNDDDKFKMFELENQMEEKRSILKSLEDLDSLRKR 184 Query: 810 FEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIE 631 F+A Q+ED LTGLKV+E +GN IRL L+T++P L+SLL QK E+ +P + HELLI Sbjct: 185 FDAAEQVEDALTGLKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIY 244 Query: 630 VFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXX 454 + D T E+ E+FPNDV+IG+I+++A S RQ AVL+ SS++W+V K Sbjct: 245 LKDKTTEITKFEMFPNDVYIGDIIEAADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISS 304 Query: 453 XXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSD 274 V + RH+FEY ++DE ++ H+ GGI+AF+K+ WP+L LKL SLKNSD Sbjct: 305 TLRKYLVTSSKTIRHTFEYYEKDETIVGHIAGGIDAFLKVSNGWPLLNTPLKLESLKNSD 364 Query: 273 NHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSEL 130 N S+ ISLS +CKVE++ANSLD+Q RQNL F+DAIE ILV+Q R EL Sbjct: 365 NQSKGISLSLICKVEDLANSLDLQTRQNLSGFMDAIEKILVQQTREEL 412 >ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211137 [Cucumis sativus] gi|449527675|ref|XP_004170835.1| PREDICTED: uncharacterized protein LOC101229419 [Cucumis sativus] Length = 414 Score = 330 bits (846), Expect = 9e-88 Identities = 197/422 (46%), Positives = 269/422 (63%), Gaps = 3/422 (0%) Frame = -1 Query: 1386 MEELLESVPS-SEQLDLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRI 1210 M E +E+ PS LDL+ +RS ++ L L ++E S S+KLL+EC + LE+RI Sbjct: 2 
MPESMEATPSVPPSLDLQAVRSELEELQRSLEENEE-STTDSLGSEKLLRECALHLESRI 60 Query: 1209 EECMSEFSDF-SALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033 ++ +SE+S+ S LGI+DLD Y+E+ KEEL +EAE+ KI NEIEVL + IEDS +L+ Sbjct: 61 QQVLSEYSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEVLKRTNIEDSNKLKM 120 Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNK 856 DLE L SL SQ+ E T S+ E N F+VLEL++QIEKNK Sbjct: 121 DLEVLKLSLDRFPSQDP---EEATFNCSSMNGEDPMNVIVNRECNAFEVLELESQIEKNK 177 Query: 855 VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676 L SLQ++ +I K + + Q+E T+ G+KVI++ N IRLSL T +PN+E Q++E Sbjct: 178 KILKSLQEVDEIFKSLDVIEQVEGTIGGMKVIDVADNSIRLSLHTHIPNVEDFSTLQRLE 237 Query: 675 YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLEMGSSL 496 I+ +DHEL+IEV DGTMELKN EIFP DV + +I++++KS SSL Sbjct: 238 GLIEKSELDHELIIEVLDGTMELKNAEIFPADVHLHDIINASKSIS----------NSSL 287 Query: 495 EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316 EW VRK V ANKS HSFEY D+DE+++ M+GGI+A IK+ Q WP+ Sbjct: 288 EWFVRKVQDRIVLCTLRRFAVKSANKSCHSFEYLDQDEMIMCSMIGGIDACIKVSQGWPL 347 Query: 315 LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136 LKLISLK+SD++++ +SLS +CKVE++ANSLD IR+NL SF DA+E IL QM Sbjct: 348 ADSPLKLISLKSSDHYTKGVSLSLICKVEKMANSLDAHIRRNLSSFADAVEKILKEQMHL 407 Query: 135 EL 130 EL Sbjct: 408 EL 409 >ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244321 [Solanum lycopersicum] Length = 415 Score = 329 bits (843), Expect = 2e-87 Identities = 183/403 (45%), Positives = 259/403 (64%), Gaps = 2/403 (0%) Frame = -1 Query: 1344 DLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSALGI 1165 D +++R +Q L ++ R+ +E E E K L++C + E+++E+ + + S+ + Sbjct: 8 DADSLRREIQELRDIQRSVEE-PEAFGLELKKSLEDCTLQFESKVEQLLCDASEVNFSSD 66 Query: 1164 EDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIESQN 985 +DLD + K EL+ EA+N KI +EIE LS ++E ++L ++EGL+C L+ IES Sbjct: 67 QDLDEFWNYLKNELSTEEAKNAKIADEIEGLSREYVEGYSKLVNEVEGLSCLLELIESLG 126 Query: 984 
RHKLEMGTDVAHSVPTGGSETFDRTH-EDNFKVLELDNQIEKNKVTLNSLQDLCDILKRF 808 + T+ S P E NFK+ EL NQ+EK+K+ L SL++L RF Sbjct: 127 IEQGRALTNFPCSTPGEDKGNLSSAPVEHNFKIFELGNQLEKSKLNLESLEELESTFNRF 186 Query: 807 EAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIEV 628 EA+ +IED +GLK+++ EGN IRLSL+TF+PNLE+LL +Q + A +PP +HELLIE+ Sbjct: 187 EAIEKIEDAFSGLKIVQFEGNRIRLSLRTFIPNLENLLHNQTIGVA-EPPEQNHELLIEL 245 Query: 627 FDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXXX 451 DGTMELK+VEIFPNDV I EI D+AKS RQ + VLE SSLEWLV++ Sbjct: 246 VDGTMELKHVEIFPNDVSISEITDTAKSLRQVYFPVGVLENRSSLEWLVKRVQDRIILST 305 Query: 450 XXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSDN 271 V AN SRHSF+Y +R+E ++AHMVGGI+AF+KLPQ WP+ L L+SLK+S Sbjct: 306 LRRFLVKSANSSRHSFDYVEREETIVAHMVGGIDAFVKLPQGWPLTCSGLTLMSLKSSSQ 365 Query: 270 HSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQM 142 +S+ ISL+ LCKV E ANSLD RQ + F D +E IL++QM Sbjct: 366 YSQQISLTLLCKVAEAANSLDTNARQTISGFTDRVEEILMQQM 408 >gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis] Length = 550 Score = 320 bits (821), Expect = 7e-85 Identities = 188/427 (44%), Positives = 263/427 (61%), Gaps = 8/427 (1%) Frame = -1 Query: 1386 MEELLESVPSSEQ---LDLETIRSRVQALSEVLRTSKEF-SELSPSESDKLLKECVIGLE 1219 ME +E VP S + LDL+TIRSR + L E+L + ++ SEL S+ +KL+K+C + + Sbjct: 134 MENAMEIVPPSSEHLDLDLDTIRSRAKELEEMLSSLEDNDSELFHSDLEKLVKDCALKFQ 193 Query: 1218 NRIEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTEL 1039 +R+EE SE+SD S L +D D LE+ EELNL+EAEN ++ EIE+L+ ++ EDS +L Sbjct: 194 SRMEEIGSEWSDVSFLEDKDFDACLEHLGEELNLVEAENSRMSEEIEILTRTYAEDSNQL 253 Query: 1038 ERDLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN----FKVLELDNQ 871 E +LEGL ++ Q+ ++G + + R ED +LEL+N+ Sbjct: 254 EIELEGLKSAMDLTALQDLENAKLGA----------CDDYPRNTEDKQHLVLHLLELENE 303 Query: 870 IEKNKVTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLR 691 I+K + L SL+DL I K F+A+ QIED LT +KVI LE NCIR SL+T++PNLES+L Sbjct: 304 IKKKNIILKSLEDLDGICKWFDAIEQIEDILTSVKVIALEENCIRFSLQTYIPNLESILS 363 Query: 690 
HQKMEYAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLE 511 Q +E P V ELLIE+ + T++ KN EIFPNDV+I I ++AK C Sbjct: 364 QQTIEAVNVPFEVKLELLIELLEWTLDQKNAEIFPNDVYINNISNAAKCFSKC------- 416 Query: 510 MGSSLEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLP 331 SL+W V K V ANKS +S EY D+DE+M+AH+ GG++AFIK+ Sbjct: 417 ---SLQWFVTKVQDRIVSCTMRQLVVKSANKSGYSLEYFDKDEVMVAHLAGGVDAFIKVS 473 Query: 330 QSWPMLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILV 151 Q WP+ LKL SLK+SD++++ I FLCKVEE NSL V I NL SFVDA++ IL Sbjct: 474 QGWPLSNSPLKLTSLKSSDHNTKGIPSIFLCKVEERVNSLAVHICHNLSSFVDAVDKILT 533 Query: 150 RQMRSEL 130 Q + E+ Sbjct: 534 EQKQLEI 540 >ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1| uncharacterized protein AT3G23910 [Arabidopsis thaliana] Length = 421 Score = 319 bits (818), Expect = 2e-84 Identities = 178/420 (42%), Positives = 258/420 (61%), Gaps = 3/420 (0%) Frame = -1 Query: 1374 LESVPSSEQLDLETIRSRVQALSEVLRTSKE--FSELSPSESDKLLKECVIGLENRIEEC 1201 +E LDL+ IR RV+ L R +E S ++++ V+ E +++E Sbjct: 1 MEEETHDGSLDLQEIRRRVKELDFFPRNCREEPVESCSSDYETLVVQDFVLQFEPKVKEI 60 Query: 1200 MSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEG 1021 + E+ D L +ED D YLE + EL +EAE+ K+ EIE LS S +DS+ L+RDLEG Sbjct: 61 VEEYGDVDLLDVEDSDAYLEYLRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEG 120 Query: 1020 LNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDNFKVLELDNQIEKNKVTLNS 841 L SL + SQ+ K + + S+ E + +D FK+ EL+NQ+E+ ++ L S Sbjct: 121 LLLSLDSMSSQDVEKSKENQPSSSSM-----EVCEVIDDDKFKMFELENQMEEKRMILKS 175 Query: 840 LQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDP 661 L+DL + KRF+A Q+ED LTGLKV+E +GN IRL L+T++ L+ L K ++ +P Sbjct: 176 
LEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEP 235 Query: 660 PAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLV 484 + HELLI + D T E+ E+FPND++IG+I+++A S RQ AVL+ SS++W+V Sbjct: 236 SELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVV 295 Query: 483 RKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCA 304 K V + R++FEY D+DE ++AH+ GGI+AF+K+ WP+L Sbjct: 296 AKVQDKIISTTLRKYIVMSSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTP 355 Query: 303 LKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124 LKL SLKNSDN S+ ISLS +CKVEE+ANSLD++ RQNL F+DAIE ILV Q R EL + Sbjct: 356 LKLASLKNSDNQSKGISLSLICKVEELANSLDLETRQNLSGFMDAIEKILVEQTREELQS 415 >ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592291 [Solanum tuberosum] Length = 428 Score = 319 bits (817), Expect = 2e-84 Identities = 184/416 (44%), Positives = 257/416 (61%), Gaps = 15/416 (3%) Frame = -1 Query: 1344 DLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSALGI 1165 D+++ R +Q L ++ R+ +E E E K L++C + E ++E+ + + S+ S Sbjct: 8 DVDSFRREIQELRDIQRSVEE-PEAFGLELKKSLEDCTLQFERKVEQILCDASEISFSSD 66 Query: 1164 EDLDT-------------YLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLE 1024 +DL + + K EL+ EA N KI +EIE LS ++E ++L ++E Sbjct: 67 QDLGRKKAVHIFFFPPYEFWKYLKNELSTEEANNAKIADEIEGLSREYVEGYSKLVNEIE 126 Query: 1023 GLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTH-EDNFKVLELDNQIEKNKVTL 847 GL+C L+ IES + + T+ S P E NFKV EL NQ+EK+K+ L Sbjct: 127 GLSCPLELIESLGLEQGRVLTNFPCSTPGEDKGNVSSAPVEQNFKVFELGNQLEKSKLNL 186 Query: 846 NSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAI 667 SL++L RFEA+ +IED +GLK++E EGN IRLSL+TF+PNLE+LL +Q ++ A Sbjct: 187 KSLEELESTFNRFEAIEKIEDAFSGLKIVEFEGNRIRLSLRTFIPNLENLLHNQTIDVA- 245 Query: 666 DPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEW 490 +PP +HELLIE+ DGTMELK+VEIFPNDV I I D+AKS RQ + VLE SSLEW Sbjct: 246 EPPEQNHELLIELMDGTMELKHVEIFPNDVSISYITDTAKSLRQVYFPVGVLENRSSLEW 305 Query: 489 LVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLT 310 V+ V AN SRHSF+Y 
DR+E ++AHMVGGI+AFIKLPQ WP+ + Sbjct: 306 FVKGVQDRIVLSTLRRFLVKSANSSRHSFDYVDREETIVAHMVGGIDAFIKLPQGWPLTS 365 Query: 309 CALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQM 142 L L+SLK+S +S+ ISL+ LCKV E+AN LD RQ + F D +E IL++QM Sbjct: 366 SGLTLMSLKSSSQYSQQISLTLLCKVAEVANLLDTNERQTISGFTDRVEEILMQQM 421 >ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590656431|ref|XP_007034269.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508713297|gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508713298|gb|EOY05195.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 369 Score = 314 bits (805), Expect = 5e-83 Identities = 175/357 (49%), Positives = 235/357 (65%), Gaps = 4/357 (1%) Frame = -1 Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213 M E +E SSE LDL +IRSR+ LSE+ R +K+ E S+KLLK+C + E++ Sbjct: 1 MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60 Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033 +++ + E+SD LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S LE Sbjct: 61 VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120 Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856 +LEGL +L I SQ +E + S+ S E F+++EL++QIEKN Sbjct: 121 NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180 Query: 855 VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676 + L SLQDL + KR + + QIED LTGLKVI +GNCIRLSL+T++P LE LL + +E Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240 Query: 675 YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499 +P ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ +L V + SS Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300 Query: 498 LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQ 328 LEW V K V NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q Sbjct: 301 LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQ 357 >gb|EXC33904.1| hypothetical protein 
L484_012794 [Morus notabilis] Length = 412 Score = 313 bits (803), Expect = 9e-83 Identities = 187/422 (44%), Positives = 261/422 (61%), Gaps = 3/422 (0%) Frame = -1 Query: 1386 MEELLESVP-SSEQLDLETIRSRVQALSEVLRTSKEF-SELSPSESDKLLKECVIGLENR 1213 ME +E VP SSE LDL+TIRSR + L E+L + ++ SEL S+ +KL+K+C + ++R Sbjct: 1 MENAMEIVPPSSEHLDLDTIRSRAKELEEMLSSLEDNDSELFHSDLEKLVKDCALKFQSR 60 Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033 +EE SE+SD S L + D LE+ EELNL+EAEN + +IEVL+ ++ EDS +LE Sbjct: 61 MEEIGSEWSDVSFLEDKGFDACLEHLGEELNLVEAENSIMSEKIEVLTRTYAEDSNQLEI 120 Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNK 856 +LEGL + Q+ ++G + + R ED +LEL+ +I++ Sbjct: 121 ELEGLKNVMDLTALQDLGNAKLGA----------CDDYPRNTEDKQHSLLELEKEIKQKN 170 Query: 855 VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676 + L SL+DL I K F+A+ QIED LTG+KVI LE NCIR SL+T++PNLES L Q +E Sbjct: 171 IILKSLEDLDGICKWFDAIEQIEDILTGVKVIALEENCIRFSLQTYIPNLESFLLQQTIE 230 Query: 675 YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLEMGSSL 496 P V HELLIE+ + T++ KNVEIFPNDV++ I ++AK C SL Sbjct: 231 AVNVPFEVKHELLIELLEWTLDQKNVEIFPNDVYLNNISNAAKDFSKC----------SL 280 Query: 495 EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316 +W V K V AN S +S EY D+DE+M+AH+ GG++AFIK+ Q WP+ Sbjct: 281 QWFVTKVQDRIVSCTMRQLVVKSANTSGYSLEYFDKDEVMVAHLAGGVDAFIKVSQGWPL 340 Query: 315 LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136 LKL SLK+SD++++ I FL KV+E NSL V I QNL SFVDA++ IL Q + Sbjct: 341 SNSPLKLTSLKSSDHNTKGIPSIFLFKVKERVNSLAVHICQNLSSFVDAVDKILTEQKQL 400 Query: 135 EL 130 E+ Sbjct: 401 EI 402 >ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] gi|557096755|gb|ESQ37263.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] Length = 355 Score = 310 bits (793), Expect = 1e-81 Identities = 168/346 (48%), Positives = 229/346 (66%), Gaps = 2/346 (0%) Frame = -1 Query: 1155 
DTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIESQNRHK 976 D YLE ++EL+ +EAE+ K+ EIE LS S EDS+ L+RDLEGL SL F+ SQ K Sbjct: 5 DAYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQEVQK 64 Query: 975 LEMGTDVAHSVPTGGSETF-DRTHEDNFKVLELDNQIEKNKVTLNSLQDLCDILKRFEAV 799 + S+ + T+ D ++ FK+ EL+NQIE+ + L SL++L + KRF+A Sbjct: 65 SKENPPSTSSMERCDASTWIDVNDDEKFKMFELENQIEEKRRILKSLENLDSVCKRFDAA 124 Query: 798 RQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIEVFDG 619 Q+ED LTGLKV+E +GN IRL L+T++P L+ LL K+ + +P + HELLI++ D Sbjct: 125 EQVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHKLLHNTEPSELIHELLIDLKDK 184 Query: 618 TMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXXXXXX 442 T E+ VE+ PNDV+IG+I D+A S RQ A+L+ SSL+WLV K Sbjct: 185 TTEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTRSSLQWLVAKVQERIITTNLRK 244 Query: 441 XXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSDNHSR 262 V + RH+FEY D+DE ++AH+ GGI+AF+K+ WP+L+ LKL SLKNSDN S Sbjct: 245 HIVKSSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSVGWPLLSTPLKLTSLKNSDNQSN 304 Query: 261 VISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124 ISLS +CKVEE+ANSLD+Q RQNL F+DAIE ILV+Q R ELH+ Sbjct: 305 GISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQQTREELHS 350