BLASTX nr result
ID: Mentha26_contig00041545
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00041545
         (1181 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU25506.1| hypothetical protein MIMGU_mgv11b017110mg, partia...   338   2e-90
ref|XP_006362061.1| PREDICTED: QWRF motif-containing protein 2-l...   290   1e-75
ref|XP_006362059.1| PREDICTED: QWRF motif-containing protein 2-l...   285   3e-74
ref|XP_004238098.1| PREDICTED: uncharacterized protein LOC101261...   281   3e-73
ref|XP_007042615.1| Family of Uncharacterized protein function (...   269   2e-69
ref|XP_007018531.1| Family of Uncharacterized protein function, ...   268   3e-69
ref|XP_007018530.1| Family of Uncharacterized protein function, ...   268   3e-69
ref|XP_002283295.1| PREDICTED: uncharacterized protein LOC100242...   268   4e-69
ref|XP_006393204.1| hypothetical protein EUTSA_v10011296mg [Eutr...   260   7e-67
ref|XP_002527498.1| conserved hypothetical protein [Ricinus comm...   259   1e-66
ref|XP_002298769.1| hypothetical protein POPTR_0001s30290g [Popu...   259   2e-66
ref|XP_007225113.1| hypothetical protein PRUPE_ppa002663mg [Prun...   258   3e-66
ref|XP_007042616.1| Family of Uncharacterized protein function, ...   258   3e-66
gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis]     258   5e-66
ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215...   257   8e-66
gb|AAG60167.1|AC074110_5 hypothetical protein [Arabidopsis thali...   256   1e-65
gb|AAG51768.1|AC079674_1 unknown protein; 38618-41990 [Arabidops...   256   1e-65
ref|XP_002891540.1| hypothetical protein ARALYDRAFT_891910 [Arab...   256   1e-65
ref|NP_564558.1| uncharacterized protein [Arabidopsis thaliana] ... 
256 1e-65 ref|XP_006306956.1| hypothetical protein CARUB_v10008528mg [Caps... 256 2e-65 >gb|EYU25506.1| hypothetical protein MIMGU_mgv11b017110mg, partial [Mimulus guttatus] Length = 473 Score = 338 bits (868), Expect = 2e-90 Identities = 187/342 (54%), Positives = 234/342 (68%), Gaps = 19/342 (5%) Frame = +1 Query: 1 PSNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVR 180 PS+AEKMLV S+RSLSVSFQG+SYS+ VGTP RKGTPERRKAGVTP R Sbjct: 113 PSSAEKMLVTSMRSLSVSFQGESYSIPVSKVKPPPAAVGTPSALRKGTPERRKAGVTPTR 172 Query: 181 DRTAK--ETPRPIEH---QQHLWPGRLRSENPSLLSRSLDYGTDRAKWNGSSPALTELRK 345 DR + E RP +H QQH WPGRLR E+ S L+RSLDYG++R K +GS AL E RK Sbjct: 173 DRRERDIENSRPSDHHQLQQHRWPGRLRREDSSFLTRSLDYGSERVKCSGSGAALKEFRK 232 Query: 346 SVNAE---------LKLDNREIEDPAVSD-EGRSRLGGXXXXXXXXXXXXXXXXXXXXQL 495 SV E LKL+ ++E + + E R R G QL Sbjct: 233 SVGEENSSDKVGNDLKLETNDVEVRGIGELENRPRSGSSLNLDVESTATT--------QL 284 Query: 496 RGGPRGVIVAARFYQDASNRVQKVLDPASPL----SNRTTGSPRIMVAKKFQNDNPISSP 663 RGGPR V+V R +Q+ +NRV KV DPASPL SNRT G ++++AKKFQND+P+SSP Sbjct: 285 RGGPRAVVVPQRCWQE-TNRVNKVRDPASPLPISTSNRTIGPSKLVLAKKFQNDSPVSSP 343 Query: 664 REICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCSTPS 843 RE+ +RG+SPLRGG RAASP +AL+S SG++ RGM+SP+RAR+G G+ MN++N CSTPS Sbjct: 344 REVSSSRGISPLRGGVRAASPCKALSSSSGSVSRGMASPTRARSGVGNSMNENNTCSTPS 403 Query: 844 MISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVE 969 ++ F ++ RRG GEN+MADAH LR+LYNRLLQWRLANA+ + Sbjct: 404 VLRFAVDARRGNSGENQMADAHVLRLLYNRLLQWRLANARAD 445 >ref|XP_006362061.1| PREDICTED: QWRF motif-containing protein 2-like isoform X3 [Solanum tuberosum] Length = 665 Score = 290 bits (741), Expect = 1e-75 Identities = 183/432 (42%), Positives = 240/432 (55%), Gaps = 40/432 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGV----- 168 S A K L+ S RSLSVSFQGQS+S+ T G RKGTPERRK Sbjct: 126 SAAAKQLLNSSRSLSVSFQGQSFSIPVSKAKPPPAT-NNIGSVRKGTPERRKVTAEFVTP 184 Query: 169 ----------TPVRDRTAKETPRPIEHQ--------QHLWPGRLRSENPSLLSRSLDYGT 294 TP R + A E P + QH 
WPGR +S N S L+RS+D G+ Sbjct: 185 ERKKATAEFFTPERSKVAAELSTPARDRTENVKTSDQHRWPGRSKSLNSSFLTRSMDCGS 244 Query: 295 -DRAKWNGSSPALTEL--------RKSVNAELK--LDNREIEDPAVSDEGRSR--LGGXX 435 D+ K+ S + + R + A LK DN E++ + S L Sbjct: 245 IDKPKFGSGSVTSSSMKSVIDIYHRARIEARLKPQSDNGEVDMKSAYGSAMSADALASDS 304 Query: 436 XXXXXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSN---RTTG 606 +GGPRG++V ARF+Q+ +NR+++ + S + N +T Sbjct: 305 ESVSSGSTSGVHDGPTVTHGKGGPRGIVVPARFWQETNNRIRRGPELGSSMDNGNLKTVA 364 Query: 607 SPRIMVAKKFQNDNPISSPREICPNRGL-SPLRGGSRAASPSRALTSGSGALLRGMSSPS 783 S + M KKF D+P +SPR + +RGL SPLRGG R ASPS+ALT + LRGM SP+ Sbjct: 365 SSKQMGNKKFLIDSPRTSPRVVPASRGLVSPLRGGLRPASPSKALTPSANTPLRGMPSPT 424 Query: 784 RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963 R +NG+ V +N C PS++SF + RRGK+GENR+ DAH+LR+LYNR LQWR NAQ Sbjct: 425 RTKNGS-MVSISNNSCIMPSILSFAADARRGKVGENRIVDAHELRLLYNRNLQWRFVNAQ 483 Query: 964 VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143 E L Q AER LYNAW+ T KLRHSV+SK I+LQ LR N+KL+S+LK Q LE W Sbjct: 484 AEAALRAQTTTAERTLYNAWLTTLKLRHSVKSKRIQLQLLRKNVKLHSILKGQRPCLENW 543 Query: 1144 GLLDRDYCNSIS 1179 ++D D+CNS+S Sbjct: 544 SMIDGDHCNSLS 555 >ref|XP_006362059.1| PREDICTED: QWRF motif-containing protein 2-like isoform X1 [Solanum tuberosum] gi|565392762|ref|XP_006362060.1| PREDICTED: QWRF motif-containing protein 2-like isoform X2 [Solanum tuberosum] Length = 677 Score = 285 bits (729), Expect = 3e-74 Identities = 183/444 (41%), Positives = 240/444 (54%), Gaps = 52/444 (11%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGV----- 168 S A K L+ S RSLSVSFQGQS+S+ T G RKGTPERRK Sbjct: 126 SAAAKQLLNSSRSLSVSFQGQSFSIPVSKAKPPPAT-NNIGSVRKGTPERRKVTAEFVTP 184 Query: 169 ----------------------TPVRDRTAKETPRPIEHQ--------QHLWPGRLRSEN 258 TP R + A E P + QH WPGR +S N Sbjct: 185 ERRKVTAEFVTPERKKATAEFFTPERSKVAAELSTPARDRTENVKTSDQHRWPGRSKSLN 244 Query: 259 PSLLSRSLDYGT-DRAKWNGSSPALTEL--------RKSVNAELK--LDNREIEDPAVSD 405 S L+RS+D G+ D+ K+ S + + R + A LK DN E++ + 
Sbjct: 245 SSFLTRSMDCGSIDKPKFGSGSVTSSSMKSVIDIYHRARIEARLKPQSDNGEVDMKSAYG 304 Query: 406 EGRSR--LGGXXXXXXXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPA 579 S L +GGPRG++V ARF+Q+ +NR+++ + Sbjct: 305 SAMSADALASDSESVSSGSTSGVHDGPTVTHGKGGPRGIVVPARFWQETNNRIRRGPELG 364 Query: 580 SPLSN---RTTGSPRIMVAKKFQNDNPISSPREICPNRGL-SPLRGGSRAASPSRALTSG 747 S + N +T S + M KKF D+P +SPR + +RGL SPLRGG R ASPS+ALT Sbjct: 365 SSMDNGNLKTVASSKQMGNKKFLIDSPRTSPRVVPASRGLVSPLRGGLRPASPSKALTPS 424 Query: 748 SGALLRGMSSPSRARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLY 927 + LRGM SP+R +NG+ V +N C PS++SF + RRGK+GENR+ DAH+LR+LY Sbjct: 425 ANTPLRGMPSPTRTKNGS-MVSISNNSCIMPSILSFAADARRGKVGENRIVDAHELRLLY 483 Query: 928 NRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYS 1107 NR LQWR NAQ E L Q AER LYNAW+ T KLRHSV+SK I+LQ LR N+KL+S Sbjct: 484 NRNLQWRFVNAQAEAALRAQTTTAERTLYNAWLTTLKLRHSVKSKRIQLQLLRKNVKLHS 543 Query: 1108 VLKEQELHLERWGLLDRDYCNSIS 1179 +LK Q LE W ++D D+CNS+S Sbjct: 544 ILKGQRPCLENWSMIDGDHCNSLS 567 >ref|XP_004238098.1| PREDICTED: uncharacterized protein LOC101261324 [Solanum lycopersicum] Length = 677 Score = 281 bits (720), Expect = 3e-73 Identities = 183/445 (41%), Positives = 240/445 (53%), Gaps = 53/445 (11%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGV----- 168 S A K L+ S RSLSVSFQGQS+S+ T G R+GTPERRK Sbjct: 126 SAAAKQLLNSSRSLSVSFQGQSFSIPVSKAKPPPAT-NNIGNVRRGTPERRKVTADFVTP 184 Query: 169 ----------------------TPVRDRTAKETPRPIEHQ--------QHLWPGRLRSEN 258 TP R + A E P + QH WPGR +S N Sbjct: 185 ERRKVSANFVTPERKKATADFYTPERSKVAAELSTPARDRTENVKTSDQHRWPGRSKSLN 244 Query: 259 PSLLSRSLDYGT-DRAKWNGSSPALTEL--------RKSVNAELK--LDNREIEDPAVSD 405 S L+RS+D G+ D+ K+ S + + R + A+LK DN E++ + Sbjct: 245 SSFLTRSMDCGSIDKPKFGSGSVTSSSMKSVIDIYHRARIEAKLKPQSDNDEVDMKSAYG 304 Query: 406 EGRS--RLGGXXXXXXXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPA 579 S L RGGPRG++V ARF+Q+ +NR+++ + Sbjct: 305 SAMSADTLASDSESVSSGSTSGVHDGPSVIHGRGGPRGIVVPARFWQETNNRIRRGPELG 364 Query: 580 
SPLSN---RTTGSPRIMVAKKFQNDNPISSPREICPNRGL-SPLRGGSRAASPSRALTSG 747 S + N +T S + M KKF D+P +S R + +RGL SPLRGG R ASPS+ LT Sbjct: 365 SSMDNGNLKTVASSKQMGNKKFLTDSPRTSARVVPASRGLGSPLRGGLRPASPSKTLTPS 424 Query: 748 SGALLRGMSSPSRARNGA-GSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRML 924 + LRGM SP+R +NG+ GS N N C PS++SF + RRGK+GENR+ DAH+LR+L Sbjct: 425 ANTPLRGMPSPTRTKNGSMGSTSN--NSCIMPSILSFAADARRGKVGENRIVDAHELRLL 482 Query: 925 YNRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLY 1104 YNR LQWR NAQ E L Q AER LYNAW+ T KLRHSV+SK I+LQ LR N+KL+ Sbjct: 483 YNRNLQWRFVNAQAEAALRAQTTTAERTLYNAWLTTLKLRHSVKSKRIQLQLLRKNVKLH 542 Query: 1105 SVLKEQELHLERWGLLDRDYCNSIS 1179 S+LK Q LE W ++D D+CNS+S Sbjct: 543 SILKGQGPCLENWSMIDGDHCNSLS 567 >ref|XP_007042615.1| Family of Uncharacterized protein function (DUF566), putative isoform 1 [Theobroma cacao] gi|508706550|gb|EOX98446.1| Family of Uncharacterized protein function (DUF566), putative isoform 1 [Theobroma cacao] Length = 684 Score = 269 bits (688), Expect = 2e-69 Identities = 174/428 (40%), Positives = 246/428 (57%), Gaps = 37/428 (8%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL VG+ ++RK TPERR+A TPVRD Sbjct: 161 SAATKMLITSTRSLSVSFQGEAFSLPISKTKAQ---VGS-AMTRKATPERRRA--TPVRD 214 Query: 184 RTAKETPRPIEHQQHLWPGRLRSENPSL--LSRSLDYGTDRAKWNGSSPALTELRKSV-- 351 E +P++ QH WPGR R N LSRSLDY ++R + + L++S+ Sbjct: 215 HG--ENSKPVD--QHRWPGRTRQGNSGTNPLSRSLDYSSERKMFGSGAIVAKSLQQSMML 270 Query: 352 -----------NAELKLD------------------NREIEDPAVSDEGRSRLGGXXXXX 444 ++ L LD N E VS + + Sbjct: 271 DESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSG 330 Query: 445 XXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSP 612 + R GPR ++V+ARF+Q+ ++R++++ DP SPLS +R S Sbjct: 331 STNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRIGASA 390 Query: 613 RIMVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRAR 792 + +K+F +D +SSPR + SP+RGG+R ASPS+ TS + + LRG+S P+R R Sbjct: 391 
KFSQSKRFSSDGVVSSPRTMA-----SPIRGGTRPASPSKLWTSATSSPLRGLS-PARVR 444 Query: 793 NGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVEN 972 N G M N +TPS++SF +++RRGK+GE+R+ DAH LR+LYNR LQWR ANA+ + Sbjct: 445 NAVGGQMM-GNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADA 503 Query: 973 TLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLL 1152 T ++QK +AE+NL+NAWV TS+LRHSV K I+L LR LKL S+LK Q +LE W LL Sbjct: 504 TFMLQKLSAEKNLWNAWVTTSELRHSVTLKRIKLLLLRQKLKLTSILKGQIAYLEEWALL 563 Query: 1153 DRDYCNSI 1176 DRD+ +S+ Sbjct: 564 DRDHSSSL 571 >ref|XP_007018531.1| Family of Uncharacterized protein function, putative isoform 2 [Theobroma cacao] gi|508723859|gb|EOY15756.1| Family of Uncharacterized protein function, putative isoform 2 [Theobroma cacao] Length = 609 Score = 268 bits (686), Expect = 3e-69 Identities = 167/407 (41%), Positives = 239/407 (58%), Gaps = 15/407 (3%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S +K+L S RSLSVSFQG+S+S +P +RKGTPERRK T Sbjct: 135 SAVQKLLFTSTRSLSVSFQGESFSYQFSKAKPAP----SPSAARKGTPERRKP--TAATT 188 Query: 184 RTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRA--KWNGSSPALTELRKSVNA 357 + T + + WP RLR P+ +SRS+D +R K +G+ + L+ S+ Sbjct: 189 TPGRATDQMENSKAERWPARLRQ--PNSMSRSMDCTDERKRLKGSGNGNVVRALQDSM-- 244 Query: 358 ELKLDNREIE-----------DPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLRGG 504 +DNR++ DPAVSD G ++ G Sbjct: 245 ---IDNRDLTVVPAVGSEAQCDPAVSDTESVSSGSTSGALESSCNGNG-------DIKRG 294 Query: 505 PRGVIVAARFYQDASNRVQKVLDPASPLSNRTTGSPRIMVAKKFQNDNPISSPREICPNR 684 PRG++V ARF+Q+ +NR+++ DP SP+S + T +++ +KF D+P+SSP+ + +R Sbjct: 295 PRGIVVPARFWQETNNRLRRS-DPGSPVSKKNTAQSKLIAPEKFGIDSPLSSPKSVVNSR 353 Query: 685 GLS-PLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCSTPSMISFPL 861 G S P+RG R ASPS+ S + + LRGMS PSR RNG GS N+ +TPS++SF Sbjct: 354 GQSSPIRGPVRPASPSKLGVSSTSSPLRGMS-PSRVRNGLGS-----NLVNTPSILSFAG 407 Query: 862 EL-RRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSK 1038 ++ + GK+GEN+++DAH LR+L+NRLLQWR NA+ + L Q+ AE++LYNAW+ TSK Sbjct: 
408 DVVKMGKIGENKVSDAHFLRLLHNRLLQWRFVNAREDAALSSQRSNAEKSLYNAWITTSK 467 Query: 1039 LRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSIS 1179 LR SVR+K ELQ LR NLKL S+LK Q + L+ W +LD DYC+S+S Sbjct: 468 LRESVRTKRTELQLLRQNLKLMSILKGQMIVLDEWAILDHDYCSSLS 514 >ref|XP_007018530.1| Family of Uncharacterized protein function, putative isoform 1 [Theobroma cacao] gi|508723858|gb|EOY15755.1| Family of Uncharacterized protein function, putative isoform 1 [Theobroma cacao] Length = 615 Score = 268 bits (686), Expect = 3e-69 Identities = 167/407 (41%), Positives = 239/407 (58%), Gaps = 15/407 (3%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S +K+L S RSLSVSFQG+S+S +P +RKGTPERRK T Sbjct: 135 SAVQKLLFTSTRSLSVSFQGESFSYQFSKAKPAP----SPSAARKGTPERRKP--TAATT 188 Query: 184 RTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRA--KWNGSSPALTELRKSVNA 357 + T + + WP RLR P+ +SRS+D +R K +G+ + L+ S+ Sbjct: 189 TPGRATDQMENSKAERWPARLRQ--PNSMSRSMDCTDERKRLKGSGNGNVVRALQDSM-- 244 Query: 358 ELKLDNREIE-----------DPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLRGG 504 +DNR++ DPAVSD G ++ G Sbjct: 245 ---IDNRDLTVVPAVGSEAQCDPAVSDTESVSSGSTSGALESSCNGNG-------DIKRG 294 Query: 505 PRGVIVAARFYQDASNRVQKVLDPASPLSNRTTGSPRIMVAKKFQNDNPISSPREICPNR 684 PRG++V ARF+Q+ +NR+++ DP SP+S + T +++ +KF D+P+SSP+ + +R Sbjct: 295 PRGIVVPARFWQETNNRLRRS-DPGSPVSKKNTAQSKLIAPEKFGIDSPLSSPKSVVNSR 353 Query: 685 GLS-PLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCSTPSMISFPL 861 G S P+RG R ASPS+ S + + LRGMS PSR RNG GS N+ +TPS++SF Sbjct: 354 GQSSPIRGPVRPASPSKLGVSSTSSPLRGMS-PSRVRNGLGS-----NLVNTPSILSFAG 407 Query: 862 EL-RRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSK 1038 ++ + GK+GEN+++DAH LR+L+NRLLQWR NA+ + L Q+ AE++LYNAW+ TSK Sbjct: 408 DVVKMGKIGENKVSDAHFLRLLHNRLLQWRFVNAREDAALSSQRSNAEKSLYNAWITTSK 467 Query: 1039 LRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSIS 1179 LR SVR+K ELQ LR NLKL S+LK Q + L+ W +LD DYC+S+S Sbjct: 468 LRESVRTKRTELQLLRQNLKLMSILKGQMIVLDEWAILDHDYCSSLS 514 >ref|XP_002283295.1| PREDICTED: uncharacterized protein 
LOC100242050 [Vitis vinifera] Length = 743 Score = 268 bits (684), Expect = 4e-69 Identities = 179/427 (41%), Positives = 242/427 (56%), Gaps = 35/427 (8%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVR- 180 + A KML+ S RSLSVSFQG+S+SL T P RKGTPERRK TP R Sbjct: 229 TTASKMLITSARSLSVSFQGESFSLRVSK------TKPAPASVRKGTPERRKP--TPTRA 280 Query: 181 DRTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRAKWNGSSPALTELRKSV--- 351 D+T E +P++ QH WPGR R N L+RS+D ++ K GS L++S+ Sbjct: 281 DQT--ENSKPVD--QHRWPGRSRQVNS--LTRSMDCTDEKKKLGGSGIMARSLQQSMIDE 334 Query: 352 ---------------NAELKLDNREIE-----------DPAVSDEGRSRLGGXXXXXXXX 453 NAEL N + DPA SD G Sbjct: 335 RNRTPLDGRLNLDSGNAELGKANELVNANSVVGSTMTSDPAASDTESVSSGSTSGAQESG 394 Query: 454 XXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSN----RTTG-SPRI 618 Q RG PRG++V ARF+Q+ SNR+++ +P+SP S RT P++ Sbjct: 395 GGGGGT------QGRGVPRGIMVPARFWQETSNRLRRTPEPSSPQSKSNGLRTPAVPPKL 448 Query: 619 MVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNG 798 + KK D+P+SSPR I P+RG SPLRG R ASPS+ +T+ + + LRGM SP+R R Sbjct: 449 IAPKKLLTDSPMSSPRGILPSRGQSPLRGPVRPASPSKLVTTSTYSPLRGMPSPTRVRAV 508 Query: 799 AGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTL 978 GS+ + N+ + PS++SF ++RRGK+GENRM DAH LR+L+NR LQWR NA+ + +L Sbjct: 509 VGSL--NGNLSNNPSILSFAADVRRGKVGENRMVDAHLLRLLHNRYLQWRFINARADASL 566 Query: 979 LVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDR 1158 LVQ+ AE++L NA V LR SVR K LQ +R LKL ++LK Q ++L+ WG +DR Sbjct: 567 LVQRMNAEQSLCNARVAIVDLRDSVRDKRKMLQLMRQKLKLTTILKGQIMYLDEWGPMDR 626 Query: 1159 DYCNSIS 1179 D+ NS+S Sbjct: 627 DHSNSLS 633 >ref|XP_006393204.1| hypothetical protein EUTSA_v10011296mg [Eutrema salsugineum] gi|557089782|gb|ESQ30490.1| hypothetical protein EUTSA_v10011296mg [Eutrema salsugineum] Length = 661 Score = 260 bits (665), Expect = 7e-67 Identities = 171/432 (39%), Positives = 245/432 (56%), Gaps = 40/432 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S 
RSLSVSFQG+++SL TP RK TPERR++ TPVRD Sbjct: 133 SAATKMLITSTRSLSVSFQGEAFSLPISKKKEAT----TPVSHRKSTPERRRS--TPVRD 186 Query: 184 RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339 + +E +P++ Q+ WPG R N P+ LSRSLD G+DR K + L Sbjct: 187 Q--RENSKPVDQQR--WPGASRRGNSESVAPNPLSRSLDCGSDRGKLGSGYVGRSMLHSS 242 Query: 340 ------RKSVNAELKLDNREIEDPA-VSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498 R S+N L LD ++ + DE + R Sbjct: 243 MIDESPRVSINGRLSLDMEGRDEYLEIGDESQRRPNNGLTSSVSCDFTASDTDSVSSGST 302 Query: 499 GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609 G PR ++ +ARF+Q+ ++R++++ DP SPLS+ ++ S Sbjct: 303 NGVQECGSGVNGDISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSVS 362 Query: 610 PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783 + ++K+F +D P+SSPR + SP+RG + R+ASPS+ + + + R +SSPS Sbjct: 363 SKFGLSKRFSSDAAPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 417 Query: 784 RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963 RARNG MN N +TPS++SF ++RRGK+GE+R+ DAH +R+LYNR LQWR NA+ Sbjct: 418 RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLVRLLYNRYLQWRFVNAR 477 Query: 964 VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143 ++TL+VQ+ AE+NL+NAWV+ S+LRHSV K I+L LR LKL S+L+ Q +LE W Sbjct: 478 ADSTLMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGYLEEW 537 Query: 1144 GLLDRDYCNSIS 1179 LLDRD+ NS+S Sbjct: 538 SLLDRDHSNSLS 549 >ref|XP_002527498.1| conserved hypothetical protein [Ricinus communis] gi|223533138|gb|EEF34896.1| conserved hypothetical protein [Ricinus communis] Length = 634 Score = 259 bits (663), Expect = 1e-66 Identities = 174/417 (41%), Positives = 239/417 (57%), Gaps = 26/417 (6%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A +ML+ S RSLSVSFQG+++SL +P V+RK TPERRK+ TPVRD Sbjct: 126 SAATRMLITSTRSLSVSFQGEAFSLPISKAKAVS---SSPNVTRKVTPERRKS--TPVRD 180 Query: 184 RTAKETPRPIEHQQHLWPGRLRSENPSL------LSRSLD--YGTDRAKWNGS------- 318 + E RP++ QH WPGR R N +L LSRS D G D + GS Sbjct: 181 
QG--ENSRPLD--QHRWPGRSRGGNLALNERNPSLSRSFDCSVGGDEKRVMGSGFMSVKS 236 Query: 319 ---SPALTELRKSV---NAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXX 477 S + E R S+ NA+ D N + D V+ G Sbjct: 237 LQQSMIVDERRLSLDLGNAKRNPDVNSSVSDSFVT--GDLTASDSDSVSSGSTSGLQDFG 294 Query: 478 XXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSN----RTTGSPRIMVAKKFQND 645 + + GPRG+ V+ARF+Q+ ++R++++ DP SPLS RT+ S + + +K+F +D Sbjct: 295 SGISRAKTGPRGIAVSARFWQETNSRLRRLQDPGSPLSTSPNPRTSISSKTIQSKRFSSD 354 Query: 646 NPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSN 825 P++SPR G SP+RG +R ASPS+ T + + RG+SSPSR R + SN Sbjct: 355 APVASPRTF----GSSPIRGATRPASPSKLWTHSASSPSRGISSPSRGRPMS------SN 404 Query: 826 MCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAER 1005 + S PS++SF ++LRRGK+GE+R+ DAH LR+LYN LQWR NA+ + T VQ+ AE+ Sbjct: 405 LSSMPSILSFAVDLRRGKMGEDRIGDAHMLRLLYNHYLQWRFVNARADATFFVQRVNAEK 464 Query: 1006 NLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSI 1176 NL+NAWV S+LRHSV K ++L LR LKL S+LK Q LE W LLDRD+ S+ Sbjct: 465 NLWNAWVTISELRHSVTLKRVKLLLLRQKLKLTSILKGQITCLEEWSLLDRDHSTSL 521 >ref|XP_002298769.1| hypothetical protein POPTR_0001s30290g [Populus trichocarpa] gi|222846027|gb|EEE83574.1| hypothetical protein POPTR_0001s30290g [Populus trichocarpa] Length = 651 Score = 259 bits (662), Expect = 2e-66 Identities = 169/413 (40%), Positives = 234/413 (56%), Gaps = 22/413 (5%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL T V+RK TPE+R+A TPV D Sbjct: 147 SAATKMLITSTRSLSVSFQGEAFSLPISKAKSV--TPPQNNVARKATPEKRRA--TPVGD 202 Query: 184 RTAKETPRPIEHQQHLWPGRLRS----ENPSLLSRSLDY------GTDRAKWNGSSPALT 333 + E RP++H H WPGR R E LLSRSLD G D+ + Sbjct: 203 QG--ENSRPVDH--HRWPGRSREGNLKERNQLLSRSLDCSVVVGCGGDKRVVGSGLMGVK 258 Query: 334 ELRKSV------NAELKLDNREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXX-- 489 L++S+ L L N ++P S G Sbjct: 259 SLQQSMMVGEGRRLSLDLGNIAKQNPDTISVNESSYTGDLTASDSDSVSSGSTSGVTEIG 318 Query: 490 QLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSPRIMVAKKFQNDNPIS 657 + 
+ G RG+ V+ARF+Q+ ++R++++ DP SPLS +R SP+ + +K+F +D P++ Sbjct: 319 KWKTGARGIAVSARFWQETNSRMRRLQDPGSPLSTSPGSRMGVSPKAIQSKRFSSDGPLA 378 Query: 658 SPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCST 837 SPR + SP+RG +R ASP + TS + RGMSSPSR R + S T Sbjct: 379 SPRMMAA----SPIRGATRPASPGKLWTSSFSSPSRGMSSPSRVRPMSSS---------T 425 Query: 838 PSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYN 1017 PS++SF ++LRRGK+GE+R+ DAH LR+LYNR LQWR NA+ + T +VQ+ +AE+NL+N Sbjct: 426 PSILSFSVDLRRGKMGEDRIVDAHMLRLLYNRYLQWRFVNARADATFMVQRLSAEKNLWN 485 Query: 1018 AWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSI 1176 AWV S+LRHSV + I+L LR LKL S+LK Q HLE W LLDRD+ +S+ Sbjct: 486 AWVTISELRHSVTLRRIKLILLRQKLKLTSILKRQIAHLEEWSLLDRDHSSSL 538 >ref|XP_007225113.1| hypothetical protein PRUPE_ppa002663mg [Prunus persica] gi|462422049|gb|EMJ26312.1| hypothetical protein PRUPE_ppa002663mg [Prunus persica] Length = 647 Score = 258 bits (660), Expect = 3e-66 Identities = 175/414 (42%), Positives = 242/414 (58%), Gaps = 22/414 (5%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVS-RKGTPERRKAGVTPVR 180 S A+K+L S RSLSVSFQG+SYSL TP S RKGTPERRKA TP R Sbjct: 128 SAAQKLLFTSTRSLSVSFQGESYSLQVSKVKP------TPSPSTRKGTPERRKA-TTPFR 180 Query: 181 DRTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRAKWNGSSPALTELRKS---- 348 E +P E Q+ WP RLR P+ ++RSLD +R + +GS + ++ Sbjct: 181 -ADQSENSKPTEQQR--WPARLRQ--PNCMTRSLDCTDERRRMSGSGANVVRALQNSMVD 235 Query: 349 -VNAELKLDN---REIEDPAVSDEGRSRL---------GGXXXXXXXXXXXXXXXXXXXX 489 V+ L+ ++ ++ D+G S Sbjct: 236 DVDGRLRSNSCNLGSVKATETVDDGTSATTQSEPVACSDTDSVSSGSTNSGPHESNGHGG 295 Query: 490 QLRGG-PRGVIVAARFYQDASNRVQKVLDP-ASPLSNRTTGSPRIMVAKKFQNDNPISSP 663 L+G PRG++V ARF+Q+ +NR+++ + A RT GSP+I A + D+P SSP Sbjct: 296 ALQGPRPRGIVVPARFWQETNNRLRRQSESKAIGAGARTMGSPKIAEANRLSIDSPTSSP 355 Query: 664 REICPNRG-LSPLRGGSRAASPSRALTS-GSGALLRGMSSPSRARNGAGSVMNDSNMCST 837 R + +R LSP+RG +R ASPS+ S + + +RG+S PSR RNG + + SN+ +T Sbjct: 356 RGVANSRAQLSPIRGTARPASPSKLSRSLMTSSPMRGVS-PSRVRNGVAATPS-SNLSNT 413 Query: 
838 PSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYN 1017 PS++SF ++RRGK+GENR+ DAH +R+L+NRLLQWR NA+ +L Q+ AER+LYN Sbjct: 414 PSILSFAADVRRGKVGENRIVDAHVVRLLHNRLLQWRFVNARANASLAAQRSNAERSLYN 473 Query: 1018 AWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSIS 1179 AWV +SKLR SVR+K IELQ LR NLKL S+LK Q ++LE L+DRDY NS+S Sbjct: 474 AWVTSSKLRESVRAKRIELQMLRQNLKLTSILKGQMIYLEELSLMDRDYSNSLS 527 >ref|XP_007042616.1| Family of Uncharacterized protein function, putative isoform 2 [Theobroma cacao] gi|508706551|gb|EOX98447.1| Family of Uncharacterized protein function, putative isoform 2 [Theobroma cacao] Length = 571 Score = 258 bits (659), Expect = 3e-66 Identities = 169/419 (40%), Positives = 238/419 (56%), Gaps = 37/419 (8%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL VG+ ++RK TPERR+A TPVRD Sbjct: 161 SAATKMLITSTRSLSVSFQGEAFSLPISKTKAQ---VGS-AMTRKATPERRRA--TPVRD 214 Query: 184 RTAKETPRPIEHQQHLWPGRLRSENPSL--LSRSLDYGTDRAKWNGSSPALTELRKSV-- 351 E +P++ QH WPGR R N LSRSLDY ++R + + L++S+ Sbjct: 215 HG--ENSKPVD--QHRWPGRTRQGNSGTNPLSRSLDYSSERKMFGSGAIVAKSLQQSMML 270 Query: 352 -----------NAELKLD------------------NREIEDPAVSDEGRSRLGGXXXXX 444 ++ L LD N E VS + + Sbjct: 271 DESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSG 330 Query: 445 XXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSP 612 + R GPR ++V+ARF+Q+ ++R++++ DP SPLS +R S Sbjct: 331 STNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRIGASA 390 Query: 613 RIMVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRAR 792 + +K+F +D +SSPR + SP+RGG+R ASPS+ TS + + LRG+S P+R R Sbjct: 391 KFSQSKRFSSDGVVSSPRTMA-----SPIRGGTRPASPSKLWTSATSSPLRGLS-PARVR 444 Query: 793 NGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVEN 972 N G M N +TPS++SF +++RRGK+GE+R+ DAH LR+LYNR LQWR ANA+ + Sbjct: 445 NAVGGQMM-GNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADA 503 Query: 973 TLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGL 1149 T ++QK 
+AE+NL+NAWV TS+LRHSV K I+L LR LKL S+LK Q +LE W L Sbjct: 504 TFMLQKLSAEKNLWNAWVTTSELRHSVTLKRIKLLLLRQKLKLTSILKGQIAYLEEWAL 562 >gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis] Length = 670 Score = 258 bits (658), Expect = 5e-66 Identities = 177/434 (40%), Positives = 243/434 (55%), Gaps = 43/434 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVR- 180 S A K+LV S RSLSVSFQG+++SL TP +RK TPERR+ TP+R Sbjct: 139 SAATKLLVTSTRSLSVSFQGEAFSLPISKTKPT-----TPSGARKATPERRRT--TPLRG 191 Query: 181 -DRTAKETPRPIEHQQHLWPGRLRSENPS------LLSRSLDYGT--DRAKWNG------ 315 +R E +P + QH WP R R N + LLSRS+D+G D K NG Sbjct: 192 GERDQLENSKPGD--QHRWPARTRQGNSNSSNSNPLLSRSVDFGAGGDGRKLNGFRSGTV 249 Query: 316 ----SSPALTELRKSV----------NAEL-KLDNREIEDPAVSDEGRSRLGGXXXXXXX 450 L E R+S +AEL K+++ E A SD S Sbjct: 250 VRALQQSLLDETRRSSFDGRLSLDLGSAELLKVNSSNNESSAPSDLTASDTDSVSSGSTS 309 Query: 451 XXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSPRI 618 G PRG++V+ARF+Q+ ++R++++ DP SPLS +R + Sbjct: 310 GMQDANGVSKART---GTPRGIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRMGAPAKF 366 Query: 619 MVAKKFQND-NPISSPREICPNRGLSPLRGGSRAASPSRALTSGSG-------ALLRGMS 774 + +K++ D NP+SSPR + SP+RG +R ASPS+ TS S + RG++ Sbjct: 367 VQSKRYSGDINPLSSPRTMA-----SPIRGANRPASPSKLWTSSSMPSPSRGMSPSRGIA 421 Query: 775 SPSRARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLA 954 SPSR RNG MN S +TPS++SF +++RRGK+GE+R+ DAH LR+LYNR LQWR Sbjct: 422 SPSRVRNGVAGSMNGSYGGNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFV 481 Query: 955 NAQVENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHL 1134 NA+ + T +VQK AE+NL+NAWV S+LRHSV K I+L LR LKL S++K Q +L Sbjct: 482 NARADATFMVQKLNAEKNLWNAWVTISELRHSVTLKRIKLLLLRQKLKLTSIIKGQITYL 541 Query: 1135 ERWGLLDRDYCNSI 1176 E W LLDRD+ +S+ Sbjct: 542 EDWALLDRDHSSSL 555 >ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215899 [Cucumis sativus] Length = 667 Score = 257 bits (656), Expect = 8e-66 Identities = 173/427 (40%), Positives = 240/427 (56%), Gaps = 36/427 (8%) Frame = +1 
Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVS--RKG-TPERRKAGVTP 174 S A K+LV S RSLSVSFQG+++SL TP +S RKG TPERR+A TP Sbjct: 141 SAAAKLLVTSTRSLSVSFQGEAFSLPISKTK----ATATPSLSNARKGSTPERRRA--TP 194 Query: 175 VRDRTAKETPRPIEHQ----QHLWPGRLRSEN--PSLLSRSLDYGTDRAKWNGSSPALT- 333 +RD++ + +E+ QH WP R R N + LSRS D G ++ K NG + Sbjct: 195 LRDKSDGSGVQ-VENSKLLDQHRWPARNRHANLEGNPLSRSFDCGGEQKKVNGIGSGMVV 253 Query: 334 ----------ELRKSVNAELKLDNREIE-------DPAVSDEGRSRLGGXXXXXXXXXXX 462 R S + L LD E +P S + Sbjct: 254 RALQQTISDDSRRASFDGRLSLDLNSSELIKAVRQNPDADSVNESSVPSDLTTSDTDSVS 313 Query: 463 XXXXXXXXX-----QLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSNRT---TGSP-R 615 + R GPRG++V+ARF+Q+ ++R++++ DP SPLS G+P + Sbjct: 314 SGSTSGVQDCGSVAKGRNGPRGIVVSARFWQETNSRLRRLHDPGSPLSTSPGARVGAPSK 373 Query: 616 IMVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARN 795 +K+F ND P+SSPR + SP+RGG+R SPS+ TS + RG+SSPSR RN Sbjct: 374 FSQSKRFSNDGPLSSPRTMA-----SPIRGGTRPPSPSKLWTSSVSSPSRGISSPSRTRN 428 Query: 796 GAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENT 975 G G + SN STPS++SF +++RRGK+GE+R+ DAH LR+ +NR LQWR NA+ + T Sbjct: 429 GVGGSLV-SNSISTPSILSFSVDIRRGKMGEDRIVDAHVLRLHHNRYLQWRFVNARADAT 487 Query: 976 LLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLD 1155 ++Q+ AERN++NAWV S+LRH+V K I+L LR LKL SVLK Q +LE W LLD Sbjct: 488 FMLQRLNAERNVWNAWVTISELRHTVTLKRIKLLLLRQKLKLTSVLKGQISYLEEWALLD 547 Query: 1156 RDYCNSI 1176 RD+ +S+ Sbjct: 548 RDHSSSM 554 >gb|AAG60167.1|AC074110_5 hypothetical protein [Arabidopsis thaliana] Length = 722 Score = 256 bits (655), Expect = 1e-65 Identities = 172/432 (39%), Positives = 244/432 (56%), Gaps = 40/432 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL T TP RK TPERR++ TPVRD Sbjct: 130 SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184 Query: 184 RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339 + +E +P++ Q+ WPG R N P+ LSRSLD G+DR K + L Sbjct: 185 
Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 240 Query: 340 ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498 R SVN L LD E + D+ + R Sbjct: 241 MIDESPRVSVNGRLSLDLGGRDEYLDIGDDIQRRPNNGLTSSVSCDFTASDTDSVSSGST 300 Query: 499 GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609 G PR ++ +ARF+Q+ ++R++++ DP SPLS+ ++ S Sbjct: 301 NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360 Query: 610 PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783 + ++K+F +D P+SSPR + SP+RG + R+ASPS+ + + + R +SSPS Sbjct: 361 SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415 Query: 784 RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963 RARNG MN N +TPS++SF ++RRGK+GE+R+ DAH LR+LYNR LQWR NA+ Sbjct: 416 RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRDLQWRFVNAR 475 Query: 964 VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143 ++T++VQ+ AE+NL+NAWV+ S+LRHSV K I+L LR LKL S+L+ Q LE W Sbjct: 476 ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535 Query: 1144 GLLDRDYCNSIS 1179 LLDRD+ +S+S Sbjct: 536 SLLDRDHSSSLS 547 >gb|AAG51768.1|AC079674_1 unknown protein; 38618-41990 [Arabidopsis thaliana] Length = 718 Score = 256 bits (655), Expect = 1e-65 Identities = 172/432 (39%), Positives = 244/432 (56%), Gaps = 40/432 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL T TP RK TPERR++ TPVRD Sbjct: 130 SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184 Query: 184 RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339 + +E +P++ Q+ WPG R N P+ LSRSLD G+DR K + L Sbjct: 185 Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 240 Query: 340 ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498 R SVN L LD E + D+ + R Sbjct: 241 MIDESPRVSVNGRLSLDLGGRDEYLDIGDDIQRRPNNGLTSSVSCDFTASDTDSVSSGST 300 Query: 499 GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609 G PR ++ +ARF+Q+ ++R++++ DP 
SPLS+ ++ S Sbjct: 301 NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360 Query: 610 PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783 + ++K+F +D P+SSPR + SP+RG + R+ASPS+ + + + R +SSPS Sbjct: 361 SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415 Query: 784 RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963 RARNG MN N +TPS++SF ++RRGK+GE+R+ DAH LR+LYNR LQWR NA+ Sbjct: 416 RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRDLQWRFVNAR 475 Query: 964 VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143 ++T++VQ+ AE+NL+NAWV+ S+LRHSV K I+L LR LKL S+L+ Q LE W Sbjct: 476 ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535 Query: 1144 GLLDRDYCNSIS 1179 LLDRD+ +S+S Sbjct: 536 SLLDRDHSSSLS 547 >ref|XP_002891540.1| hypothetical protein ARALYDRAFT_891910 [Arabidopsis lyrata subsp. lyrata] gi|297337382|gb|EFH67799.1| hypothetical protein ARALYDRAFT_891910 [Arabidopsis lyrata subsp. lyrata] Length = 660 Score = 256 bits (655), Expect = 1e-65 Identities = 170/432 (39%), Positives = 242/432 (56%), Gaps = 40/432 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL TP RK TPERR++ TPVRD Sbjct: 131 SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---ATTTPVSHRKSTPERRRS--TPVRD 185 Query: 184 RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339 + +E +P++ Q+ WPG R N P+ LSRSLD G+DR K + L Sbjct: 186 Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 241 Query: 340 ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498 R S+N L LD E + DE + R Sbjct: 242 MIDESPRVSINGRLSLDLGGRDEYLEIGDESQRRPNNGLTSSVSCDFTASDTDSVSSGST 301 Query: 499 GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609 G PR ++ +ARF+Q+ ++R++++ DP SPLS+ ++ S Sbjct: 302 NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSVS 361 Query: 610 PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783 + ++K+F +D P+SSPR + SP+RG + R+ASPS+ + + + R +SSPS Sbjct: 362 
SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 416 Query: 784 RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963 R RNG MN N +TPS++SF ++RRGK+GE+R+ DAH LR+LYNR LQWR NA+ Sbjct: 417 RVRNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRYLQWRFVNAR 476 Query: 964 VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143 ++T++VQ+ AE+NL+NAWV+ S+LRHSV K I+L LR LKL S+L+ Q LE W Sbjct: 477 ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 536 Query: 1144 GLLDRDYCNSIS 1179 LLDRD+ +S+S Sbjct: 537 SLLDRDHSSSLS 548 >ref|NP_564558.1| uncharacterized protein [Arabidopsis thaliana] gi|75164975|sp|Q94AI1.1|QWRF2_ARATH RecName: Full=QWRF motif-containing protein 2 gi|15028145|gb|AAK76696.1| unknown protein [Arabidopsis thaliana] gi|24030506|gb|AAN41399.1| unknown protein [Arabidopsis thaliana] gi|332194367|gb|AEE32488.1| uncharacterized protein AT1G49890 [Arabidopsis thaliana] Length = 659 Score = 256 bits (655), Expect = 1e-65 Identities = 172/432 (39%), Positives = 244/432 (56%), Gaps = 40/432 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL T TP RK TPERR++ TPVRD Sbjct: 130 SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184 Query: 184 RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339 + +E +P++ Q+ WPG R N P+ LSRSLD G+DR K + L Sbjct: 185 Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 240 Query: 340 ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498 R SVN L LD E + D+ + R Sbjct: 241 MIDESPRVSVNGRLSLDLGGRDEYLDIGDDIQRRPNNGLTSSVSCDFTASDTDSVSSGST 300 Query: 499 GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609 G PR ++ +ARF+Q+ ++R++++ DP SPLS+ ++ S Sbjct: 301 NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360 Query: 610 PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783 + ++K+F +D P+SSPR + SP+RG + R+ASPS+ + + + R +SSPS Sbjct: 361 
SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415 Query: 784 RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963 RARNG MN N +TPS++SF ++RRGK+GE+R+ DAH LR+LYNR LQWR NA+ Sbjct: 416 RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRDLQWRFVNAR 475 Query: 964 VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143 ++T++VQ+ AE+NL+NAWV+ S+LRHSV K I+L LR LKL S+L+ Q LE W Sbjct: 476 ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535 Query: 1144 GLLDRDYCNSIS 1179 LLDRD+ +S+S Sbjct: 536 SLLDRDHSSSLS 547 >ref|XP_006306956.1| hypothetical protein CARUB_v10008528mg [Capsella rubella] gi|482575667|gb|EOA39854.1| hypothetical protein CARUB_v10008528mg [Capsella rubella] Length = 659 Score = 256 bits (653), Expect = 2e-65 Identities = 170/432 (39%), Positives = 242/432 (56%), Gaps = 40/432 (9%) Frame = +1 Query: 4 SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183 S A KML+ S RSLSVSFQG+++SL T TP RK TPERR++ TPVRD Sbjct: 130 SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184 Query: 184 RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339 + +E +P++ Q+ WPG R N P+ LSRSLD G+DR K + L Sbjct: 185 Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFAGRSMLHNS 240 Query: 340 ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498 R SVN L LD E + D+ + R Sbjct: 241 MIDESPRVSVNGRLSLDLGGRDEYLEIGDDSQRRPSNGLTSSVSCDFTASDTDSVSSGST 300 Query: 499 GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609 G PR ++ +ARF+Q+ ++R++++ DP SPLS+ ++ S Sbjct: 301 NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360 Query: 610 PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783 + ++K+F +D P SSPR + SP+RG + R+ASPS+ + + + R +SSPS Sbjct: 361 SKFGLSKRFSSDAVPSSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415 Query: 784 RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963 R RNG MN N +TPS++SF ++RRGK+GE+R+ DAH LR+LYNR LQWR NA+ Sbjct: 416 
RVRNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRYLQWRFVNAR 475 Query: 964 VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143 ++T++VQ+ AE+NL+NAWV+ S+LRHSV K I+L LR LKL S+L+ Q LE W Sbjct: 476 ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535 Query: 1144 GLLDRDYCNSIS 1179 LLD+D+ +S+S Sbjct: 536 SLLDKDHSSSLS 547