BLASTX nr result
ID: Sinomenium21_contig00001464
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00001464
         (4555 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002270971.2| PREDICTED: uncharacterized protein LOC100241...   350   4e-93
ref|XP_007024788.1| Muscle M-line assembly protein unc-89, putat...   313   4e-82
ref|XP_007024787.1| Muscle M-line assembly protein unc-89, putat...   313   4e-82
ref|XP_007024786.1| Muscle M-line assembly protein unc-89, putat...   313   4e-82
ref|XP_002303336.2| hypothetical protein POPTR_0003s07100g [Popu...   302   8e-79
ref|XP_006369073.1| hypothetical protein POPTR_0001s16200g [Popu...   301   1e-78
ref|XP_006342882.1| PREDICTED: uncharacterized protein LOC102583...   298   2e-77
ref|XP_006426813.1| hypothetical protein CICLE_v10025233mg [Citr...   298   2e-77
ref|XP_004235521.1| PREDICTED: uncharacterized protein LOC101243...   297   3e-77
ref|XP_004235520.1| PREDICTED: uncharacterized protein LOC101243...   297   3e-77
ref|XP_002533812.1| conserved hypothetical protein [Ricinus comm...   296   5e-77
ref|XP_006465794.1| PREDICTED: uncharacterized abhydrolase domai...   293   7e-76
ref|XP_004144449.1| PREDICTED: uncharacterized protein LOC101208...   291   1e-75
ref|XP_006842720.1| hypothetical protein AMTR_s00147p00104660 [A...   289   1e-74
gb|EYU21263.1| hypothetical protein MIMGU_mgv1a003152mg [Mimulus...   287   3e-74
gb|AAF02854.1|AC009324_3 Unknown protein [Arabidopsis thaliana]       285   2e-73
gb|EXB64651.1| hypothetical protein L484_017984 [Morus notabilis]     283   5e-73
ref|NP_001031183.1| uncharacterized protein [Arabidopsis thalian...   282   1e-72
ref|NP_564641.2| uncharacterized protein [Arabidopsis thaliana] ...
282 1e-72 gb|AAM70555.1| At1g53800/T18A20_4 [Arabidopsis thaliana] 282 1e-72 >ref|XP_002270971.2| PREDICTED: uncharacterized protein LOC100241217 [Vitis vinifera] gi|297742921|emb|CBI35788.3| unnamed protein product [Vitis vinifera] Length = 586 Score = 350 bits (898), Expect = 4e-93 Identities = 206/421 (48%), Positives = 257/421 (61%), Gaps = 10/421 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEETR KIGVGVRMGWQRR EK+M+QETC+ +WQSLIAEASRRG A EEELQWDSY+IL+ Sbjct: 178 SEETRVKIGVGVRMGWQRRREKRMLQETCYFEWQSLIAEASRRGYAGEEELQWDSYDILD 237 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 +QLE+EWLES+E+RK M RPKGSKRAPK+PEQRRKIS AISAKW+DP YRERVCSALAKY Sbjct: 238 EQLEREWLESVEERKRMPRPKGSKRAPKSPEQRRKISEAISAKWSDPAYRERVCSALAKY 297 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSE--ASNCSSTEKRSQERLRLRKSSSPSYK 3462 HG P G ++P RRRP+GD QS +S K++ + S K ++ RL+KS+SP YK Sbjct: 298 HGIPEGAPRKP-RRRPSGDTQSTRSPANKTTSHILDSAGSETKSQNQKTRLKKSNSPMYK 356 Query: 3463 DPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASL 3642 DPLA+SKLEMIKN RAQR+ ETKK EA+ SPLA ASL Sbjct: 357 DPLANSKLEMIKNIRAQRVAAETKKTEAIERARLLIAEAEKAAKALEVAATRSPLAHASL 416 Query: 3643 LETRKLIAEATRSIEAVETGR---NAYSKNISYTSESDRQESDYGAEADTKNGDYTSDRK 3813 +ET+KLIAEA +SIE++E G+ + S++ S++S + +A + + RK Sbjct: 417 METKKLIAEAIQSIESIEAGQISSHENSRDPSFSSAVPVNHVEKEMDAGIEGLNQADQRK 476 Query: 3814 VNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPS-----QPSDLINGMEDPNP 3978 VNGT ++ +DF K Q LLN + E++ S P DL + + P Sbjct: 477 VNGTKTLVSSKNDNEGFDFGKFTWQDLLN--GDMELLSTSSSGYGLSPLDLDSLIGSTKP 534 Query: 3979 RDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTKS 4158 D ER N + TTKKWV GRLVEV + Sbjct: 535 LDQLPNL-------NVERED--NPLPNGSKLKPRKEAAPANSVTTTKKWVRGRLVEVAEE 585 Query: 4159 D 4161 D Sbjct: 586 D 586 >ref|XP_007024788.1| Muscle M-line assembly protein unc-89, putative isoform 3 [Theobroma cacao] gi|508780154|gb|EOY27410.1| Muscle M-line assembly protein unc-89, putative isoform 3 [Theobroma 
cacao] Length = 425 Score = 313 bits (803), Expect = 4e-82 Identities = 188/420 (44%), Positives = 245/420 (58%), Gaps = 11/420 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 S+ETR KIG+GVRMGW+RR EK MVQE C +W +LIAEASR+G EEELQWDSY+IL Sbjct: 11 SKETREKIGIGVRMGWERRREKLMVQENCHFEWMNLIAEASRKGYLGEEELQWDSYKILA 70 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 QL ++WLES+E+RK M R KGSKRAPK+ EQRRKI+AAI+AKWADP YR+RVCS LAKY Sbjct: 71 AQLTKDWLESVEERKTMPRTKGSKRAPKSLEQRRKIAAAIAAKWADPEYRKRVCSGLAKY 130 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ--ERLRLRKSSSPSYK 3462 HG+ G +++P +R+PTG QS +S K+ + +N SST + ERL LR+ + P YK Sbjct: 131 HGTQAGAERKP-KRKPTGGAQSKQSPSKRKASDTNYSSTSETISPIERLSLRRRNKPLYK 189 Query: 3463 DPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASL 3642 DP+A SKLEMIKN RAQR E++KIEA+ SP+ARASL Sbjct: 190 DPMASSKLEMIKNIRAQRATEESRKIEAVERARLLIAEAEKAAKALEVAAVKSPVARASL 249 Query: 3643 LETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQ-----ESDYGAEADTKNGDYTSD 3807 +ETRKLIAEA +SIE++E G+ +N Y S + E E++ Sbjct: 250 IETRKLIAEAIQSIESIERGQVTSDENGGYISVDSAEPVSQVEKKTQIESENSGLSQAEQ 309 Query: 3808 RKVNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDL----INGMEDPN 3975 ++VNG SL E ++F Q+++N G+N E+ P S L + + Sbjct: 310 KEVNGKQNLSLSKNEE--FNFPNFMFQRIVN-GDNDELTSPSSNNYSLSTLNFESLIKKS 366 Query: 3976 PRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 V E K+ER L N T+KWV G+LVEVT+ Sbjct: 367 DSSKHVDLLETNGIIKHERNPLPNGIKVKLKDGDVPSKPV----TVTRKWVRGKLVEVTE 422 >ref|XP_007024787.1| Muscle M-line assembly protein unc-89, putative isoform 2 [Theobroma cacao] gi|508780153|gb|EOY27409.1| Muscle M-line assembly protein unc-89, putative isoform 2 [Theobroma cacao] Length = 582 Score = 313 bits (803), Expect = 4e-82 Identities = 188/420 (44%), Positives = 245/420 (58%), Gaps = 11/420 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 S+ETR KIG+GVRMGW+RR EK MVQE C +W +LIAEASR+G 
EEELQWDSY+IL Sbjct: 168 SKETREKIGIGVRMGWERRREKLMVQENCHFEWMNLIAEASRKGYLGEEELQWDSYKILA 227 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 QL ++WLES+E+RK M R KGSKRAPK+ EQRRKI+AAI+AKWADP YR+RVCS LAKY Sbjct: 228 AQLTKDWLESVEERKTMPRTKGSKRAPKSLEQRRKIAAAIAAKWADPEYRKRVCSGLAKY 287 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ--ERLRLRKSSSPSYK 3462 HG+ G +++P +R+PTG QS +S K+ + +N SST + ERL LR+ + P YK Sbjct: 288 HGTQAGAERKP-KRKPTGGAQSKQSPSKRKASDTNYSSTSETISPIERLSLRRRNKPLYK 346 Query: 3463 DPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASL 3642 DP+A SKLEMIKN RAQR E++KIEA+ SP+ARASL Sbjct: 347 DPMASSKLEMIKNIRAQRATEESRKIEAVERARLLIAEAEKAAKALEVAAVKSPVARASL 406 Query: 3643 LETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQ-----ESDYGAEADTKNGDYTSD 3807 +ETRKLIAEA +SIE++E G+ +N Y S + E E++ Sbjct: 407 IETRKLIAEAIQSIESIERGQVTSDENGGYISVDSAEPVSQVEKKTQIESENSGLSQAEQ 466 Query: 3808 RKVNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDL----INGMEDPN 3975 ++VNG SL E ++F Q+++N G+N E+ P S L + + Sbjct: 467 KEVNGKQNLSLSKNEE--FNFPNFMFQRIVN-GDNDELTSPSSNNYSLSTLNFESLIKKS 523 Query: 3976 PRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 V E K+ER L N T+KWV G+LVEVT+ Sbjct: 524 DSSKHVDLLETNGIIKHERNPLPNGIKVKLKDGDVPSKPV----TVTRKWVRGKLVEVTE 579 >ref|XP_007024786.1| Muscle M-line assembly protein unc-89, putative isoform 1 [Theobroma cacao] gi|508780152|gb|EOY27408.1| Muscle M-line assembly protein unc-89, putative isoform 1 [Theobroma cacao] Length = 611 Score = 313 bits (803), Expect = 4e-82 Identities = 188/420 (44%), Positives = 245/420 (58%), Gaps = 11/420 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 S+ETR KIG+GVRMGW+RR EK MVQE C +W +LIAEASR+G EEELQWDSY+IL Sbjct: 197 SKETREKIGIGVRMGWERRREKLMVQENCHFEWMNLIAEASRKGYLGEEELQWDSYKILA 256 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 QL ++WLES+E+RK M R KGSKRAPK+ EQRRKI+AAI+AKWADP YR+RVCS LAKY Sbjct: 257 
AQLTKDWLESVEERKTMPRTKGSKRAPKSLEQRRKIAAAIAAKWADPEYRKRVCSGLAKY 316 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ--ERLRLRKSSSPSYK 3462 HG+ G +++P +R+PTG QS +S K+ + +N SST + ERL LR+ + P YK Sbjct: 317 HGTQAGAERKP-KRKPTGGAQSKQSPSKRKASDTNYSSTSETISPIERLSLRRRNKPLYK 375 Query: 3463 DPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASL 3642 DP+A SKLEMIKN RAQR E++KIEA+ SP+ARASL Sbjct: 376 DPMASSKLEMIKNIRAQRATEESRKIEAVERARLLIAEAEKAAKALEVAAVKSPVARASL 435 Query: 3643 LETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQ-----ESDYGAEADTKNGDYTSD 3807 +ETRKLIAEA +SIE++E G+ +N Y S + E E++ Sbjct: 436 IETRKLIAEAIQSIESIERGQVTSDENGGYISVDSAEPVSQVEKKTQIESENSGLSQAEQ 495 Query: 3808 RKVNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDL----INGMEDPN 3975 ++VNG SL E ++F Q+++N G+N E+ P S L + + Sbjct: 496 KEVNGKQNLSLSKNEE--FNFPNFMFQRIVN-GDNDELTSPSSNNYSLSTLNFESLIKKS 552 Query: 3976 PRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 V E K+ER L N T+KWV G+LVEVT+ Sbjct: 553 DSSKHVDLLETNGIIKHERNPLPNGIKVKLKDGDVPSKPV----TVTRKWVRGKLVEVTE 608 >ref|XP_002303336.2| hypothetical protein POPTR_0003s07100g [Populus trichocarpa] gi|550342603|gb|EEE78315.2| hypothetical protein POPTR_0003s07100g [Populus trichocarpa] Length = 600 Score = 302 bits (774), Expect = 8e-79 Identities = 196/427 (45%), Positives = 249/427 (58%), Gaps = 18/427 (4%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 S+ETR KIG GVR+GWQ+R EKQMVQE C+ +WQ+LIAEASRRG EEELQWDSY IL Sbjct: 193 SKETREKIGHGVRLGWQKRREKQMVQEGCYFEWQNLIAEASRRGYTGEEELQWDSYNILR 252 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 +QLE EW+ES++QRK + RPKGSKRAPK+ EQRRKIS AI+AKWADP YRERV S L+KY Sbjct: 253 QQLEDEWVESVQQRKTLPRPKGSKRAPKSLEQRRKISEAIAAKWADPEYRERVYSGLSKY 312 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEK---RSQERLRLRKSSSPSY 3459 HG+ G ++P RR P+G QS ++ S S TEK RS + R+S +PSY Sbjct: 313 HGTLAGAARKP-RRMPSGSSQSA----RRDSSKRRTSDTEKGYARSPIQQLRRRSRTPSY 367 Query: 3460 
KDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARAS 3639 KDPLA SKLEMIKN RAQR+ ETKK EA+ SP+ARAS Sbjct: 368 KDPLASSKLEMIKNIRAQRIATETKKNEAIERARSLIVEAEKAANALEAAAMKSPIARAS 427 Query: 3640 LLETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQES--------DYGAEADTKNG- 3792 L E RKLI+EA +SIE+++ G S +IS + +DR S + E + NG Sbjct: 428 LTEARKLISEAIQSIESLDQGNGVSSDSIS--NVNDRYPSLALTELVTEDEKEINAGNGS 485 Query: 3793 -DYTSDRKVNGTHVASLGSEESITYDFDKAAMQKLLN-EGE----NAEVIFPPSQPSDLI 3954 D R+VNGT + +E + +F A LLN +GE ++ PS D Sbjct: 486 MDQVELRQVNGTMIMETSKDEDL--NFSNLAFHDLLNGQGELLPLSSSAYSLPSSTIDHS 543 Query: 3955 NGMEDPNPRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCG 4134 + + P+ +P G L + E+ +L N + S TKKWV G Sbjct: 544 SSGKQPDQAEPN---GSLTS----EKINLPNGSRVQYVEEETP-----SKSVATKKWVHG 591 Query: 4135 RLVEVTK 4155 RLVE T+ Sbjct: 592 RLVEGTE 598 >ref|XP_006369073.1| hypothetical protein POPTR_0001s16200g [Populus trichocarpa] gi|550347432|gb|ERP65642.1| hypothetical protein POPTR_0001s16200g [Populus trichocarpa] Length = 593 Score = 301 bits (772), Expect = 1e-78 Identities = 192/426 (45%), Positives = 250/426 (58%), Gaps = 14/426 (3%) Frame = +1 Query: 2917 ILFCSEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSY 3096 +L S+ETR KIG GVR+GWQ+R EKQM+QE C+ +WQ+LI EASRRG E ELQWDSY Sbjct: 178 LLSYSKETRVKIGHGVRLGWQKRREKQMMQEGCYFEWQNLITEASRRGYTGEGELQWDSY 237 Query: 3097 EILNKQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSA 3276 IL +QLE EW+ES+E+RK RPKGSKRAPK+ EQRRKIS AI+AKWADP YRERV S Sbjct: 238 NILRQQLEFEWVESVEKRKTTPRPKGSKRAPKSLEQRRKISEAIAAKWADPEYRERVFSG 297 Query: 3277 LAKYHGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQERLRLRKSSSPS 3456 ++KYHG+PVG +++P RRRP+G QS + + + + + T +Q+ LR R+S +PS Sbjct: 298 ISKYHGTPVGAERKP-RRRPSGGSQSARQDSTRRTNDTEKGDTRSPTQQ-LR-RRSKTPS 354 Query: 3457 YKDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARA 3636 YKDPLA SKLEMIKN RA+R ETKK EA+ SP+ARA Sbjct: 355 YKDPLARSKLEMIKNIRAERTATETKKNEAVERARSLITEAEKAANTLEAAAVRSPIARA 414 Query: 3637 
SLLETRKLIAEATRSIEAVETGRNAYSKNISYTSESDR----------QESDYGAEADTK 3786 SL+E RKLIAEA +SIE+V+TG + N S ++E DR Q S+ E + Sbjct: 415 SLIEARKLIAEAIQSIESVDTGYSI--SNDSISNEIDRHPDPSLAPTKQVSEVEKEINAG 472 Query: 3787 NG--DYTSDRKVNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQ--PSDLI 3954 NG + R+VNGT + +E + +F A +LN + + + PS + Sbjct: 473 NGGLGQVALRQVNGTKILETSKDEDL--NFCNLAFNDILNGEKELHHLGTGAYGLPSLSM 530 Query: 3955 NGMEDPNPRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCG 4134 D + Q G E K E+ +L N + +TTKKWV G Sbjct: 531 ASPVDHSSSRKQPGQVEPNGSLKSEKINLPNGSRVQYVKEETP-----SKPDTTKKWVRG 585 Query: 4135 RLVEVT 4152 RLVE T Sbjct: 586 RLVEGT 591 >ref|XP_006342882.1| PREDICTED: uncharacterized protein LOC102583814 isoform X1 [Solanum tuberosum] Length = 616 Score = 298 bits (762), Expect = 2e-77 Identities = 192/448 (42%), Positives = 256/448 (57%), Gaps = 39/448 (8%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEETR KIGV VRMGW+RR +QETC +WQ+LIAEASRRG EEELQWDSYEIL+ Sbjct: 176 SEETRLKIGVAVRMGWERRRGMLRLQETCHYEWQNLIAEASRRGLLGEEELQWDSYEILS 235 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 KQLEQEW++S+++RK R KG+KRAPK+ EQRRKIS AI+AKWADP YR RV SAL+KY Sbjct: 236 KQLEQEWIQSVQERKNKPRLKGNKRAPKSAEQRRKISEAIAAKWADPDYRSRVQSALSKY 295 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSN--LKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSY 3459 HG P G ++RP RR+P D Q+ K + KK++E N E +SQ +R+RLR+ ++P Y Sbjct: 296 HGIPDGVERRP-RRKPASDEQTRKRSPPKKKANELDNLVMPEPKSQVQRVRLRRKNTPMY 354 Query: 3460 KDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARAS 3639 KDPLA SKLEM+KN RAQR ++ KKIEA+ SP+A+AS Sbjct: 355 KDPLASSKLEMLKNIRAQRAGIDQKKIEAVMRAKALIAEAEKAAEALEMAAHNSPVAQAS 414 Query: 3640 LLETRKLIAEATRSIEAVETGRNAYSKNISYTS------ESDRQESDYGAEADTKNGDYT 3801 L+ETRKLI+EA RSIE++E + +++S S +D +S++GA AD Sbjct: 415 LIETRKLISEAIRSIESIEKEVSVTDRDLSPPSTELGSHTADDGDSEFGALAD------P 468 Query: 3802 SDRKVNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDLINGMEDPNPR 3981 +R++NG H + D + A+Q L N G+N + S DL+ ++ Sbjct: 469 
GERRINGWHAVTPMDRGIYHLDDGRHALQGLPN-GKNT-ALLSSSSDYDLLGDRQEVYQM 526 Query: 3982 -------DPQVGYGELCTPSKY------------ERTSLLN-----------XXXXXXXX 4071 + +V + T ++ E LLN Sbjct: 527 ISSNLSLEKEVNITQSTTSTQRFDEDEANGSPGDEHKQLLNRDEANASPGDEQKPLPDGL 586 Query: 4072 XXXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 TT+ TTKKWV GRLVEV++ Sbjct: 587 ISGAKIEAATTTTTTKKWVRGRLVEVSE 614 >ref|XP_006426813.1| hypothetical protein CICLE_v10025233mg [Citrus clementina] gi|557528803|gb|ESR40053.1| hypothetical protein CICLE_v10025233mg [Citrus clementina] Length = 588 Score = 298 bits (762), Expect = 2e-77 Identities = 187/427 (43%), Positives = 244/427 (57%), Gaps = 18/427 (4%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEET+ KIG+GVRMGW++R K MVQE+C+ +WQ+LIAEA+RRG A EEELQW SY IL+ Sbjct: 176 SEETKKKIGIGVRMGWEKRRGKLMVQESCYFEWQNLIAEAARRGLAGEEELQWYSYNILD 235 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 +QL++EWLES+E+RK M R KGSKRAPK EQR+KI+ AI+AKWADP YRERVC+ L+K+ Sbjct: 236 EQLKKEWLESVERRKTMPRTKGSKRAPKPAEQRKKIAEAIAAKWADPEYRERVCAGLSKF 295 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTE---KRSQERLRLRKSSSPSY 3459 HG PVG +R +R+P QS K KK E S + E+ +LR+S+ P Y Sbjct: 296 HGVPVG-VERKAKRKPRAITQSSKQTPKKKKETDTDFSPRNEPNKQIEKFKLRRSNRPLY 354 Query: 3460 KDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARAS 3639 KDP A SKLEMIKN RAQR E+KK EA+ SP+ARAS Sbjct: 355 KDPSAGSKLEMIKNIRAQRSATESKKTEAIERARLLIAEAEKAAKALGVAAVKSPIARAS 414 Query: 3640 LLETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDYGAE---------ADTKNG 3792 L+ETRKLIAEAT++IE++ETG I+ +E+D S AE +T+NG Sbjct: 415 LIETRKLIAEATQTIESIETG------EITSNNENDGFPSAISAELVSQGKKETEETENG 468 Query: 3793 --DYTSDRKVNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDL----I 3954 D +VNG + G +E DF+ A+ + + E++ S L + Sbjct: 469 AVDLPEHVRVNGNQTLACGKDE----DFNFASF-TIPGKMNGEEILCANSNGYSLQTLNL 523 Query: 3955 NGMEDPNPRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCG 4134 + + VGY E S+YE+ N TKKWV G Sbjct: 524 
ESLMMQSDSATHVGYLEPNGTSEYEK----NPQPNGSEVKNMEVEKLSKPETVTKKWVRG 579 Query: 4135 RLVEVTK 4155 RLVEVT+ Sbjct: 580 RLVEVTE 586 >ref|XP_004235521.1| PREDICTED: uncharacterized protein LOC101243687 isoform 2 [Solanum lycopersicum] Length = 617 Score = 297 bits (761), Expect = 3e-77 Identities = 194/447 (43%), Positives = 254/447 (56%), Gaps = 38/447 (8%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEETR KIGV VRMGW+RR +QETC +WQ+LIAEASRRG EEELQWDSYEIL+ Sbjct: 176 SEETRLKIGVAVRMGWERRRGMLRLQETCHYEWQNLIAEASRRGLLGEEELQWDSYEILS 235 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 KQLEQEW++S+++RK R KG+KRAPK+ EQRRKIS AI+AKWADP YR RV SAL+KY Sbjct: 236 KQLEQEWIQSVQERKNRPRLKGNKRAPKSAEQRRKISEAIAAKWADPDYRSRVQSALSKY 295 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSN--LKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSY 3459 HG P G ++RP RR+P D Q+ K + KK++E N E +SQ +R+RLR+ ++P Y Sbjct: 296 HGIPDGVERRP-RRKPASDEQTRKRSPPKKKANELDNPVKPEPKSQVQRVRLRRKNTPMY 354 Query: 3460 KDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARAS 3639 KDPLA SKLEMIKN RAQR ++ KKIEA+ SP+A+AS Sbjct: 355 KDPLASSKLEMIKNIRAQRAGIDQKKIEAVMRAKALIAEAEKAAEALEMAAHNSPVAQAS 414 Query: 3640 LLETRKLIAEATRSIEAVETGRNAYSKNISYTS------ESDRQESDYGAEADTKNGDYT 3801 L+ETRKLI+EA RSIE++E + +++S S +D +S++GA AD Sbjct: 415 LIETRKLISEAIRSIESIEKEVSLSDEDLSPPSTELGSNTADEGDSEFGALAD------P 468 Query: 3802 SDRKVNGTHVASLGSEESITYDFDKAAMQKLLNE---------------GENAEVIFPPS 3936 S+R++NG H A+ + D + A++ L N G+ EV S Sbjct: 469 SERRINGWHSATPMDRDIYHLDDGRHALRGLPNGKSTTLLSSSSDYDLLGDRQEVYQMIS 528 Query: 3937 QPSDL---INGMEDPNPRDPQVGYGELCTPSKYERTSLLN-----------XXXXXXXXX 4074 L +N + N E E+ LLN Sbjct: 529 SSLSLEKEVNVTQSTNSTQRFDEKDEANESPGDEQKQLLNRDEANASPGDEQKPLPNGLI 588 Query: 4075 XXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 TT+ +TKKWV GRLVEV++ Sbjct: 589 SGSKTEATTTTTSTKKWVRGRLVEVSE 615 >ref|XP_004235520.1| PREDICTED: uncharacterized protein LOC101243687 isoform 1 [Solanum lycopersicum] Length = 618 Score = 297 bits (761), Expect = 3e-77 
Identities = 194/447 (43%), Positives = 254/447 (56%), Gaps = 38/447 (8%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEETR KIGV VRMGW+RR +QETC +WQ+LIAEASRRG EEELQWDSYEIL+ Sbjct: 177 SEETRLKIGVAVRMGWERRRGMLRLQETCHYEWQNLIAEASRRGLLGEEELQWDSYEILS 236 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 KQLEQEW++S+++RK R KG+KRAPK+ EQRRKIS AI+AKWADP YR RV SAL+KY Sbjct: 237 KQLEQEWIQSVQERKNRPRLKGNKRAPKSAEQRRKISEAIAAKWADPDYRSRVQSALSKY 296 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSN--LKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSY 3459 HG P G ++RP RR+P D Q+ K + KK++E N E +SQ +R+RLR+ ++P Y Sbjct: 297 HGIPDGVERRP-RRKPASDEQTRKRSPPKKKANELDNPVKPEPKSQVQRVRLRRKNTPMY 355 Query: 3460 KDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARAS 3639 KDPLA SKLEMIKN RAQR ++ KKIEA+ SP+A+AS Sbjct: 356 KDPLASSKLEMIKNIRAQRAGIDQKKIEAVMRAKALIAEAEKAAEALEMAAHNSPVAQAS 415 Query: 3640 LLETRKLIAEATRSIEAVETGRNAYSKNISYTS------ESDRQESDYGAEADTKNGDYT 3801 L+ETRKLI+EA RSIE++E + +++S S +D +S++GA AD Sbjct: 416 LIETRKLISEAIRSIESIEKEVSLSDEDLSPPSTELGSNTADEGDSEFGALAD------P 469 Query: 3802 SDRKVNGTHVASLGSEESITYDFDKAAMQKLLNE---------------GENAEVIFPPS 3936 S+R++NG H A+ + D + A++ L N G+ EV S Sbjct: 470 SERRINGWHSATPMDRDIYHLDDGRHALRGLPNGKSTTLLSSSSDYDLLGDRQEVYQMIS 529 Query: 3937 QPSDL---INGMEDPNPRDPQVGYGELCTPSKYERTSLLN-----------XXXXXXXXX 4074 L +N + N E E+ LLN Sbjct: 530 SSLSLEKEVNVTQSTNSTQRFDEKDEANESPGDEQKQLLNRDEANASPGDEQKPLPNGLI 589 Query: 4075 XXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 TT+ +TKKWV GRLVEV++ Sbjct: 590 SGSKTEATTTTTSTKKWVRGRLVEVSE 616 >ref|XP_002533812.1| conserved hypothetical protein [Ricinus communis] gi|223526249|gb|EEF28565.1| conserved hypothetical protein [Ricinus communis] Length = 595 Score = 296 bits (759), Expect = 5e-77 Identities = 189/421 (44%), Positives = 236/421 (56%), Gaps = 10/421 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 S+ETR KIGVGVRM W++R EK+ VQETC +WQ+LIAEASRRG A EEE+QWDSY+IL 
Sbjct: 197 SKETRTKIGVGVRMRWKKRREKKNVQETCLFEWQNLIAEASRRGYAGEEEMQWDSYKILT 256 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 ++LE EW+ESIEQRK M RPKGSKRAPK+PEQRRKI+ AI+AKWADP YRERVCSAL+KY Sbjct: 257 EKLEVEWVESIEQRKTMPRPKGSKRAPKSPEQRRKIAEAIAAKWADPEYRERVCSALSKY 316 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQERLRLRKSSSPSYKDP 3468 HG+PVG K R + D KS+ + S++ R RLR+S +P YKDP Sbjct: 317 HGTPVGIKPRRRTQPKKQDPAMKKSDTENLSKSDTAGP-----MRRPRLRRSKTPVYKDP 371 Query: 3469 LADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLLE 3648 LA SKLEMIK R QR TKK EA+ SP+A+ASL+E Sbjct: 372 LARSKLEMIKKIREQRAAAGTKKTEAIERARLLIAEAQKAAKALEVAATTSPIAQASLIE 431 Query: 3649 TRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDY-GAEADTKNGDYTSD--RKVN 3819 RKLIAEA SIE+V+ SK+ S S + + EAD NG+ + ++VN Sbjct: 432 ARKLIAEAILSIESVDAEYMTSSKDDIDPSLSPIELAGLIDEEADVNNGNSSQAELKEVN 491 Query: 3820 GTHVASLGSEESITYDFDKAAMQKLLNEGENAEVI-------FPPSQPSDLINGMEDPNP 3978 GT + + S E +F ++ +LN GE + FP +I P P Sbjct: 492 GTKIVA--SSEDKDLNFTNLSLHDILN-GEYELLSTRSNGFNFPSINLESIIEHSSSPKP 548 Query: 3979 RDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTKS 4158 K E++ L N + KKWVCGRLVEVT Sbjct: 549 NGSH----------KSEKSPLPNGSKVQHLKEELPSKPI----TSAKKWVCGRLVEVTDE 594 Query: 4159 D 4161 D Sbjct: 595 D 595 >ref|XP_006465794.1| PREDICTED: uncharacterized abhydrolase domain-containing protein DDB_G0269086-like [Citrus sinensis] Length = 588 Score = 293 bits (749), Expect = 7e-76 Identities = 183/421 (43%), Positives = 241/421 (57%), Gaps = 12/421 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEET+ KIG+GVRMGW++R K MVQE+C+ +WQ+LIAEA+RRG A EEELQW SY IL+ Sbjct: 176 SEETKKKIGIGVRMGWEKRRGKLMVQESCYFEWQNLIAEAARRGLAGEEELQWYSYNILD 235 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 +QL +EWLES+E+RK M R KGS+RAPK+ EQR+KI+ AI+AKWADP YRERVC+ L+K+ Sbjct: 236 EQLMKEWLESVERRKTMPRTKGSRRAPKSAEQRKKIAEAIAAKWADPEYRERVCAGLSKF 295 Query: 3289 
HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTE---KRSQERLRLRKSSSPSY 3459 HG PVG +R +R+P QS K KK E S + E+ +LR+S+ P Y Sbjct: 296 HGVPVG-VERKAKRKPRAVTQSSKQTPKKKKETDTDFSPRNEPNKQIEKFKLRRSNRPLY 354 Query: 3460 KDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARAS 3639 KD A SKLEMIKN RAQR E+KK EA+ SP+ARAS Sbjct: 355 KDSSAGSKLEMIKNIRAQRSATESKKTEAIERARLLIAEAEKAAKALEVAAVKSPIARAS 414 Query: 3640 LLETRKLIAEATRSIEAVETGR-NAYSKNISYTSESDRQESDYG----AEADTKNGDYTS 3804 L+ETRKLIAEAT++IE++ETG + ++N + S + G EA+ D Sbjct: 415 LIETRKLIAEATQTIESIETGEITSNNENDGFPSAISAELVSQGKKETEEAENGAVDLLE 474 Query: 3805 DRKVNGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDL----INGMEDP 3972 +VNG + G +E ++F M +N GE E++ S L + + Sbjct: 475 HVRVNGNQTLACGKDED--FNFASFTMPGKMN-GE--EILCANSNGYSLQTLNLESLMMQ 529 Query: 3973 NPRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVT 4152 + VGY E S+YE+ N TKKWV GRLVEVT Sbjct: 530 SDSATHVGYLEPNGTSEYEK----NPQPNGSEVKNMEVEKLSKPETVTKKWVRGRLVEVT 585 Query: 4153 K 4155 + Sbjct: 586 E 586 >ref|XP_004144449.1| PREDICTED: uncharacterized protein LOC101208479 [Cucumis sativus] gi|449523814|ref|XP_004168918.1| PREDICTED: uncharacterized LOC101208479 [Cucumis sativus] Length = 577 Score = 291 bits (746), Expect = 1e-75 Identities = 185/424 (43%), Positives = 239/424 (56%), Gaps = 15/424 (3%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEETR KIGVGVRMGWQRR EKQ++QETC +WQ+LIAEASR+G EEELQWDSY+ILN Sbjct: 175 SEETRLKIGVGVRMGWQRRREKQVLQETCHFEWQNLIAEASRQGYKGEEELQWDSYQILN 234 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 ++L++EWLES+EQRK R GS+RAPK+ EQR+KIS +ISAKWADP YR+RVCSALAKY Sbjct: 235 EELKKEWLESVEQRKKTPRVVGSRRAPKSAEQRKKISESISAKWADPDYRDRVCSALAKY 294 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEA-SNCSSTEKRSQERLRLRKSSSPSYKD 3465 HG+P G +RP R+R S+ K+ S+ S+ + + +RL+L+KS +P +KD Sbjct: 295 HGTPTGVIRRPRRKRSESTATITTSSKKEKSDVNSSLAGGFRIENQRLKLKKSKAPRFKD 354 Query: 3466 PLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLL 
3645 PLA SKLEMIK+ RAQR + ET+K+EA+ SP+ARASLL Sbjct: 355 PLASSKLEMIKSIRAQRAMAETQKMEAIERARLLIAEAEKAAEALEVAATRSPIARASLL 414 Query: 3646 ETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDYGAEADTKNGDYTS----DRK 3813 ETRKLIAEA +SIE+V + A + T E + S E T N S + + Sbjct: 415 ETRKLIAEAIQSIESVNIEQTASPQ----TEEPNAAASYSCYEVVTPNNKEESLGRKEDQ 470 Query: 3814 VNGTHVASLGSE---ESITYDFD--KAAMQKLLNEGENAEVI-----FPPSQPSDLINGM 3963 + + G++ +I DFD K ++Q LL + V S S L N Sbjct: 471 NRAVQIIANGTQWFPSNIDEDFDCSKFSLQDLLGREKEVPVSTNGYGLSHSSFSSLANQA 530 Query: 3964 EDPNPRDPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLV 4143 P D + +R TKKWV GRLV Sbjct: 531 NGNKPSDHKPSLNGTRLHHLEDRAD-------------------SQVITVTKKWVRGRLV 571 Query: 4144 EVTK 4155 EV + Sbjct: 572 EVAE 575 >ref|XP_006842720.1| hypothetical protein AMTR_s00147p00104660 [Amborella trichopoda] gi|548844821|gb|ERN04395.1| hypothetical protein AMTR_s00147p00104660 [Amborella trichopoda] Length = 509 Score = 289 bits (739), Expect = 1e-74 Identities = 176/412 (42%), Positives = 240/412 (58%), Gaps = 5/412 (1%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 S+ETR KIG GVR+GW+RR E+ +QETC LQWQ+LI EASR+G E+ELQWDSYE L+ Sbjct: 112 SKETRVKIGQGVRIGWERRRERLALQETCCLQWQNLITEASRKGIHGEDELQWDSYETLD 171 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 ++LE+EW ESIE+R+ M RPKG +RAPK+PEQRRKIS AISAKWADP YR+RV S L KY Sbjct: 172 RELEKEWQESIERRRSMPRPKGGRRAPKSPEQRRKISEAISAKWADPEYRDRVFSGLTKY 231 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSN-LKKSSEASNCSSTEKRSQERLRLRKSSSPSYKD 3465 HG+PVG +R RRR D ++KS+ +KK ++ ST K + ++ S+PSY D Sbjct: 232 HGTPVGAVRRSPRRRQMEDANAMKSSPIKKQEMLNSGGSTGKAGP---KSKEISTPSYTD 288 Query: 3466 PLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLL 3645 PLA+SKLEM+K R QR METKK EA +PLARA+L Sbjct: 289 PLANSKLEMLKKIRKQRAAMETKKKEATERARLLIAEAEKAAKALEVAAMSNPLARATLA 348 Query: 3646 ETRKLIAEATRSIEAVETGR-NAYSKNISYTSESDRQESDYGAEADTKNGDYTSDRKVNG 3822 ETRKLIAEATRS+E+++ G+ N+++++ + S T N + +NG Sbjct: 349 
ETRKLIAEATRSLESIDNGQINSHAQDQQVLNTS------------TPNPELIK-TYMNG 395 Query: 3823 THVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDLIN-GMEDPNPRDPQVGY 3999 H + + + FDK A+Q ++N E+ + I + S+ G + + + Sbjct: 396 KHHLTQSDNKFENFGFDKLALQNVMNGTEDPDTINNVRERSENAGLGYLSCSLQSGNATF 455 Query: 4000 GELCTPSKYER--TSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEV 4149 C P+ E+ + + + T KKWVCGRLVEV Sbjct: 456 EHNC-PATQEKIVAEGVRLGAEMGISQFRKTESSASATATRKKWVCGRLVEV 506 >gb|EYU21263.1| hypothetical protein MIMGU_mgv1a003152mg [Mimulus guttatus] Length = 604 Score = 287 bits (735), Expect = 3e-74 Identities = 179/418 (42%), Positives = 240/418 (57%), Gaps = 9/418 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 SEET+ KIGVGVR+GW+RR E+ +QETC QWQ LIA A+R+G EEELQWDSY++L+ Sbjct: 195 SEETKIKIGVGVRLGWERRRERLQLQETCHHQWQDLIAVAARKGFLGEEELQWDSYKVLS 254 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 KQLE+EW++S+EQR+ R KGSKRAPK+ EQ+RKIS AI+AKWADP YR+RV S LAK+ Sbjct: 255 KQLEKEWVQSVEQRRNTPRIKGSKRAPKSAEQKRKISEAIAAKWADPEYRDRVYSGLAKF 314 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSYKD 3465 HG P G +++ RR+ + D QS K K + E N + +E +SQ +R R ++S +PSYKD Sbjct: 315 HGIPEGTERKS-RRKTSIDGQSRKRGPKNTEETDNLAKSESKSQNQRTRTKRSKTPSYKD 373 Query: 3466 PLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLL 3645 PLA SKLEM+KN RAQR + KK EA+ +PLA+ASL+ Sbjct: 374 PLASSKLEMLKNIRAQRSAVLNKKSEAVTRAKLLIAGAEKAAEALEIAARENPLAQASLM 433 Query: 3646 ETRKLIAEATRSIEAVETGRNAYS----KNISYTSESDRQESDYGAEADTKNGDYTSDRK 3813 E+R LIAEA + IE++E S +N S S Q + +T N + RK Sbjct: 434 ESRMLIAEAYQIIESIEYEDEVSSEDDKENNSENSIEPVQNLKLVMDENTLNLANGNPRK 493 Query: 3814 VNGTHVASLGSE--ESITYDFDKAAMQKLLNEGENAEVI--FPPSQPSDLINGMEDPNPR 3981 VNG H S S E+ + FDK +Q L+N +A P + + NG++ P+ + Sbjct: 494 VNGVHSISSASSAVENDNFSFDKFMLQDLMNGNGSASSFNDMPEREENIRSNGLQSPDHK 553 Query: 3982 DPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 G S + LN T KKW+ GRLVEV + Sbjct: 554 
PSPNGI------SVQTQKQSLNGLDFQSDNAEASSKKQV---KTVKKWLRGRLVEVAE 602 >gb|AAF02854.1|AC009324_3 Unknown protein [Arabidopsis thaliana] Length = 603 Score = 285 bits (728), Expect = 2e-73 Identities = 174/422 (41%), Positives = 248/422 (58%), Gaps = 10/422 (2%) Frame = +1 Query: 2923 FCSEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEI 3102 F S+ETR KIG GVRM W RR E++ VQETC +WQ+L+AEA+++G DEEELQWDSY I Sbjct: 194 FYSKETRMKIGEGVRMRWARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNI 253 Query: 3103 LNKQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALA 3282 L++Q + EWLES+EQRK + K ++RAPK+PEQRR+I+ AI+AKWADP YRERVCS LA Sbjct: 254 LDQQNQLEWLESVEQRKAIKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLA 313 Query: 3283 KYHGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSY 3459 KYHG PVG ++R RRRP D + K K S + S E++SQ + +++RK +P+Y Sbjct: 314 KYHGIPVGVERR--RRRPRSDAEPRKKTPTKKS--TRDSEFERQSQVQVVKVRKRKTPAY 369 Query: 3460 KDPLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARAS 3639 KDPLA SKLEMIK+ RA+R+ E+KK++A+ SP+A+AS Sbjct: 370 KDPLASSKLEMIKSIRAKRVAEESKKMDAVERARLLISEAEKAAKVLEIAALKSPVAQAS 429 Query: 3640 LLETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDYGAEADTKN-GDYTSDRKV 3816 LLE++KLIAEAT+ I+++E + A ++ +Y Q +D +E++TK+ D ++ Sbjct: 430 LLESKKLIAEATQLIKSLEMRQIASDEDGTYPFLLSPQPND--SESETKDTNDQERPGEI 487 Query: 3817 NGTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDLINGMEDPNPRDPQVG 3996 NGTH + ES+ + + + EG + + SD+ + D ++G Sbjct: 488 NGTHTLQING-ESLHMNMRSNDLPTFVIEGTTNQFV------SDMESNTSQGGREDIKLG 540 Query: 3997 Y-----GELCTPSKYERTSLL---NXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVT 4152 G P ++ N + N TKKWV GRLVEVT Sbjct: 541 IVGQPNGTRVHPPAESNGAISLAENHPLPNGYHGIDEKAASLESGNVTKKWVRGRLVEVT 600 Query: 4153 KS 4158 ++ Sbjct: 601 EA 602 >gb|EXB64651.1| hypothetical protein L484_017984 [Morus notabilis] Length = 528 Score = 283 bits (724), Expect = 5e-73 Identities = 180/418 (43%), Positives = 238/418 (56%), Gaps = 12/418 (2%) Frame = +1 Query: 2938 TRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILNKQL 3117 TR KIG 
GVRMGWQRR +K ++QETC+ +WQ+LIAEASRRG E++LQW+SYE+LN+QL Sbjct: 118 TRKKIGAGVRMGWQRRRKKLLLQETCYFEWQNLIAEASRRGFDGEDKLQWNSYEVLNEQL 177 Query: 3118 EQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKYHGS 3297 ++ WLES+E+RK M RPKGSKRAPK+ EQ+RKIS AIS KWAD GYRERV SALA+YHG Sbjct: 178 KEAWLESVEKRKSMPRPKGSKRAPKSAEQKRKISEAISRKWADFGYRERVVSALARYHGI 237 Query: 3298 PVGEKKRPLRRRPTGDVQS-VKSNLKKS-SEASNCSSTEKRSQ-ERLRLRKSSSPSYKDP 3468 G +++P RR+P+ QS +S KK ++A+ S +E + Q R ++ + + YKDP Sbjct: 238 EPGTERKP-RRKPSDSSQSPTRSPAKKDLNDANKSSKSEMKIQTPRPKVGRRKALLYKDP 296 Query: 3469 LADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLLE 3648 L SKLEMIKN RAQR ETKKIEA+ SP+ARASL+E Sbjct: 297 LVSSKLEMIKNIRAQRAAAETKKIEAIERARLLIAEAEKAAKALEAAATKSPIARASLME 356 Query: 3649 TRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDYGAEADTKNGDYTSD---RKVN 3819 TRKLIAEA +SIE++E + N S + + + G+ ++ KVN Sbjct: 357 TRKLIAEAVQSIESIEAEQITSQGNGEDPSAVPDELGGHVEKHIVAIGEVPAEAKPSKVN 416 Query: 3820 GTHVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQ------PSDLINGMEDPNPR 3981 GT + +L EE F K +Q +LN GE + S + + DP Sbjct: 417 GTRILALSREED--SHFGKVNLQDILN-GEEGLLSTSTSNYGLSSFSYETLMKQSDPRNE 473 Query: 3982 DPQVGYGELCTPSKYERTSLLNXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTK 4155 + Q+G P+K + TKKWV GRLVEV + Sbjct: 474 NGQLG------PNKESEQQEMPHLNGARAEISNDQQTPAEVVTVTKKWVRGRLVEVAE 525 >ref|NP_001031183.1| uncharacterized protein [Arabidopsis thaliana] gi|222424381|dbj|BAH20146.1| AT1G53800 [Arabidopsis thaliana] gi|332194883|gb|AEE33004.1| uncharacterized protein AT1G53800 [Arabidopsis thaliana] Length = 572 Score = 282 bits (721), Expect = 1e-72 Identities = 172/420 (40%), Positives = 247/420 (58%), Gaps = 10/420 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 ++ETR KIG GVRM W RR E++ VQETC +WQ+L+AEA+++G DEEELQWDSY IL+ Sbjct: 165 NKETRMKIGEGVRMRWARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILD 224 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 +Q + EWLES+EQRK + K ++RAPK+PEQRR+I+ AI+AKWADP YRERVCS LAKY 
Sbjct: 225 QQNQLEWLESVEQRKAIKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKY 284 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSYKD 3465 HG PVG ++R RRRP D + K K S + S E++SQ + +++RK +P+YKD Sbjct: 285 HGIPVGVERR--RRRPRSDAEPRKKTPTKKS--TRDSEFERQSQVQVVKVRKRKTPAYKD 340 Query: 3466 PLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLL 3645 PLA SKLEMIK+ RA+R+ E+KK++A+ SP+A+ASLL Sbjct: 341 PLASSKLEMIKSIRAKRVAEESKKMDAVERARLLISEAEKAAKVLEIAALKSPVAQASLL 400 Query: 3646 ETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDYGAEADTKN-GDYTSDRKVNG 3822 E++KLIAEAT+ I+++E + A ++ +Y Q +D +E++TK+ D ++NG Sbjct: 401 ESKKLIAEATQLIKSLEMRQIASDEDGTYPFLLSPQPND--SESETKDTNDQERPGEING 458 Query: 3823 THVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDLINGMEDPNPRDPQVGY- 3999 TH + ES+ + + + EG + + SD+ + D ++G Sbjct: 459 THTLQING-ESLHMNMRSNDLPTFVIEGTTNQFV------SDMESNTSQGGREDIKLGIV 511 Query: 4000 ----GELCTPSKYERTSLL---NXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTKS 4158 G P ++ N + N TKKWV GRLVEVT++ Sbjct: 512 GQPNGTRVHPPAESNGAISLAENHPLPNGYHGIDEKAASLESGNVTKKWVRGRLVEVTEA 571 >ref|NP_564641.2| uncharacterized protein [Arabidopsis thaliana] gi|332194882|gb|AEE33003.1| uncharacterized protein AT1G53800 [Arabidopsis thaliana] Length = 568 Score = 282 bits (721), Expect = 1e-72 Identities = 172/420 (40%), Positives = 247/420 (58%), Gaps = 10/420 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 ++ETR KIG GVRM W RR E++ VQETC +WQ+L+AEA+++G DEEELQWDSY IL+ Sbjct: 161 NKETRMKIGEGVRMRWARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILD 220 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 +Q + EWLES+EQRK + K ++RAPK+PEQRR+I+ AI+AKWADP YRERVCS LAKY Sbjct: 221 QQNQLEWLESVEQRKAIKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKY 280 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSYKD 3465 HG PVG ++R RRRP D + K K S + S E++SQ + +++RK +P+YKD Sbjct: 281 HGIPVGVERR--RRRPRSDAEPRKKTPTKKS--TRDSEFERQSQVQVVKVRKRKTPAYKD 336 Query: 3466 
PLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLL 3645 PLA SKLEMIK+ RA+R+ E+KK++A+ SP+A+ASLL Sbjct: 337 PLASSKLEMIKSIRAKRVAEESKKMDAVERARLLISEAEKAAKVLEIAALKSPVAQASLL 396 Query: 3646 ETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDYGAEADTKN-GDYTSDRKVNG 3822 E++KLIAEAT+ I+++E + A ++ +Y Q +D +E++TK+ D ++NG Sbjct: 397 ESKKLIAEATQLIKSLEMRQIASDEDGTYPFLLSPQPND--SESETKDTNDQERPGEING 454 Query: 3823 THVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDLINGMEDPNPRDPQVGY- 3999 TH + ES+ + + + EG + + SD+ + D ++G Sbjct: 455 THTLQING-ESLHMNMRSNDLPTFVIEGTTNQFV------SDMESNTSQGGREDIKLGIV 507 Query: 4000 ----GELCTPSKYERTSLL---NXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTKS 4158 G P ++ N + N TKKWV GRLVEVT++ Sbjct: 508 GQPNGTRVHPPAESNGAISLAENHPLPNGYHGIDEKAASLESGNVTKKWVRGRLVEVTEA 567 >gb|AAM70555.1| At1g53800/T18A20_4 [Arabidopsis thaliana] Length = 418 Score = 282 bits (721), Expect = 1e-72 Identities = 172/420 (40%), Positives = 247/420 (58%), Gaps = 10/420 (2%) Frame = +1 Query: 2929 SEETRAKIGVGVRMGWQRRHEKQMVQETCFLQWQSLIAEASRRGNADEEELQWDSYEILN 3108 ++ETR KIG GVRM W RR E++ VQETC +WQ+L+AEA+++G DEEELQWDSY IL+ Sbjct: 11 NKETRMKIGEGVRMRWARRKERRKVQETCHFEWQNLLAEAAKQGYTDEEELQWDSYNILD 70 Query: 3109 KQLEQEWLESIEQRKLMSRPKGSKRAPKTPEQRRKISAAISAKWADPGYRERVCSALAKY 3288 +Q + EWLES+EQRK + K ++RAPK+PEQRR+I+ AI+AKWADP YRERVCS LAKY Sbjct: 71 QQNQLEWLESVEQRKAIKGAKSNRRAPKSPEQRRRIAEAIAAKWADPSYRERVCSGLAKY 130 Query: 3289 HGSPVGEKKRPLRRRPTGDVQSVKSNLKKSSEASNCSSTEKRSQ-ERLRLRKSSSPSYKD 3465 HG PVG ++R RRRP D + K K S + S E++SQ + +++RK +P+YKD Sbjct: 131 HGIPVGVERR--RRRPRSDAEPRKKTPTKKS--TRDSEFERQSQVQVVKVRKRKTPAYKD 186 Query: 3466 PLADSKLEMIKNNRAQRMVMETKKIEAMXXXXXXXXXXXXXXXXXXXXXXXSPLARASLL 3645 PLA SKLEMIK+ RA+R+ E+KK++A+ SP+A+ASLL Sbjct: 187 PLASSKLEMIKSIRAKRVAEESKKMDAVERARLLISEAEKAAKVLEIAALKSPVAQASLL 246 Query: 3646 ETRKLIAEATRSIEAVETGRNAYSKNISYTSESDRQESDYGAEADTKN-GDYTSDRKVNG 3822 E++KLIAEAT+ I+++E + A ++ +Y Q +D +E++TK+ D ++NG Sbjct: 247 ESKKLIAEATQLIKSLEMRQIASDEDGTYPFLLSPQPND--SESETKDTNDQERPGEING 304 Query: 3823 
THVASLGSEESITYDFDKAAMQKLLNEGENAEVIFPPSQPSDLINGMEDPNPRDPQVGY- 3999 TH + ES+ + + + EG + + SD+ + D ++G Sbjct: 305 THTLQING-ESLHMNMRSNDLPTFVIEGTTNQFV------SDMESNTSQGGREDIKLGIV 357 Query: 4000 ----GELCTPSKYERTSLL---NXXXXXXXXXXXXXXXXCTTSNTTKKWVCGRLVEVTKS 4158 G P ++ N + N TKKWV GRLVEVT++ Sbjct: 358 GQPNGTRVHPPAESNGAISLAENHPLPNGYHGIDEKAASLESGNVTKKWVRGRLVEVTEA 417