BLASTX nr result
ID: Mentha24_contig00020989
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00020989 (886 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EYU19774.1| hypothetical protein MIMGU_mgv1a001675mg [Mimulus...  255   2e-65
ref|XP_006361469.1| PREDICTED: proline-, glutamic acid- and leuc...  187   5e-45
ref|XP_004249969.1| PREDICTED: uncharacterized protein LOC101268...  175   2e-41
ref|XP_002276178.2| PREDICTED: uncharacterized protein LOC100256...  169   2e-39
emb|CBI35005.3| unnamed protein product [Vitis vinifera]             169   2e-39
ref|XP_006378815.1| hypothetical protein POPTR_0010s24450g [Popu...  156   1e-35
ref|XP_003529901.1| PREDICTED: proline-, glutamic acid- and leuc...  147   5e-33
ref|XP_007010407.1| Uncharacterized protein isoform 2 [Theobroma...  145   2e-32
ref|XP_007010406.1| Uncharacterized protein isoform 1 [Theobroma...  145   2e-32
ref|XP_007221563.1| hypothetical protein PRUPE_ppa014774mg [Prun...  144   3e-32
ref|XP_002521170.1| conserved hypothetical protein [Ricinus comm...  143   8e-32
ref|XP_006598922.1| PREDICTED: uncharacterized protein LOC100803...  142   2e-31
gb|EXB36971.1| hypothetical protein L484_018348 [Morus notabilis]    139   1e-30
ref|XP_004510734.1| PREDICTED: proline-, glutamic acid- and leuc...  131   4e-28
ref|XP_007135214.1| hypothetical protein PHAVU_010G110700g [Phas...  127   7e-27
ref|XP_004301668.1| PREDICTED: uncharacterized protein LOC101297...  125   2e-26
ref|XP_006306777.1| hypothetical protein CARUB_v10008316mg [Caps...  120   5e-25
ref|XP_002893608.1| binding protein [Arabidopsis lyrata subsp. l...  120   7e-25
gb|AAG50563.1|AC073506_5 hypothetical protein [Arabidopsis thali...
116 1e-23 ref|NP_174315.2| uncharacterized protein [Arabidopsis thaliana] ... 116 1e-23 >gb|EYU19774.1| hypothetical protein MIMGU_mgv1a001675mg [Mimulus guttatus] Length = 774 Score = 255 bits (651), Expect = 2e-65 Identities = 146/309 (47%), Positives = 186/309 (60%), Gaps = 19/309 (6%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRT--KNSLQELPS 711 +H+SQDI+SN+ +DLDFL EKD +SGT+ +V +L E KKRK + S Q+ P+ Sbjct: 385 IHISQDIISNVFIDLDFLGGEKDGKNSGTHSEVPTKLLTESRQKKRKHSIIARSSQDEPA 444 Query: 710 HGGVEAEVSDNLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGGW 531 H +E N T ISVKI LT+ G++RSESWRG+VD LLITV T AF+GGW Sbjct: 445 HNSLEVGTPHNSTPISVKIAALEALEALLTLAGAMRSESWRGNVDNLLITVATNAFKGGW 504 Query: 530 SKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGTQ 351 SKEE++ +S D+TP W DFQ SPGR+RPSHLAL L+LFRRG Q TGT+ Sbjct: 505 SKEEKRKFLSDDSTPIWQDFQLAALRALLASLLSPGRVRPSHLALGLQLFRRGMQETGTK 564 Query: 350 LAGYCGHALLALQVLIHPRSLPLSDFDSSADN---------YQALYPSGDRQISNYQPDE 198 L YCGHALLAL+VLIHPR+LPL D S N YQ P G+ P E Sbjct: 565 LGEYCGHALLALEVLIHPRALPLLDLASIGSNEFKDPNHTVYQDGGPRGN------GPAE 618 Query: 197 PESEEDDLIENWLGKDEEMEIQVTERQQ--------NTDAPKKNEVATSGNDPNNKGMTS 42 PESE+DDL ENWL KD++ E+ + + ++ NT+ P ++E+A P S Sbjct: 619 PESEDDDLNENWLSKDDDRELDIKDSKRQRENAHYNNTETPSQDELA-----PVKVSAAS 673 Query: 41 DHDVVENVE 15 V E VE Sbjct: 674 TSRVPERVE 682 >ref|XP_006361469.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1-like [Solanum tuberosum] Length = 852 Score = 187 bits (475), Expect = 5e-45 Identities = 116/297 (39%), Positives = 160/297 (53%), Gaps = 17/297 (5%) Frame = -3 Query: 881 HVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKNS--LQELPSH 708 H++ IV+N L+DLD SS + PE + + HKKRK S L E P Sbjct: 444 HLTDVIVNNSLMDLDERGT-----SSVAQQNIHPETTTKTSHKKRKHASTSSLLDEQPDK 498 Query: 707 GGVEAEVSDNLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGGWS 528 E EVS N+ +SVKI L VGGS RSESWR +VD LL+ V A +GGW+ Sbjct: 499 DVFEVEVSPNMASLSVKIAALEALESLLAVGGSRRSESWRVNVDHLLLDVTRNASKGGWA 558 Query: 527 
KEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGTQL 348 K+ R +VS T W D+Q SPGR RP L+ L+LFRRGT+ GT++ Sbjct: 559 KDGRGSLVSDSTTSIWRDYQIAALRALLASLLSPGRTRPPQLSQGLDLFRRGTREIGTKV 618 Query: 347 AGYCGHALLALQVLIHPRSLPLSDFDSSADNYQA--LYPSGDRQISN------------- 213 A C HA+LAL+VLIHPR+LPL D +S+ +NY+ + SG+ ISN Sbjct: 619 AECCAHAILALEVLIHPRALPLLDLESTDNNYEVGNKWFSGNVHISNRAANNTFHIGTSR 678 Query: 212 YQPDEPESEEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGNDPNNKGMTS 42 PDEP+S DDL +W+ E+++ + ++TD N + DP+++ +TS Sbjct: 679 KAPDEPDSYNDDLYADWMRNGEDLDTVAADPGKDTDT--SNRPPETLRDPSSEKLTS 733 >ref|XP_004249969.1| PREDICTED: uncharacterized protein LOC101268822 [Solanum lycopersicum] Length = 852 Score = 175 bits (444), Expect = 2e-41 Identities = 109/276 (39%), Positives = 150/276 (54%), Gaps = 17/276 (6%) Frame = -3 Query: 881 HVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKR--TKNSLQELPSH 708 H++ IV+N L+DLD SS + P+ + + +KKRK T +SL E Sbjct: 444 HLTDVIVNNSLMDLDERGT-----SSVAQQNIYPDSTTKTSNKKRKHASTSSSLDEQCDK 498 Query: 707 GGVEAEVSDNLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGGWS 528 E EV N+ +SVKI L VGGS RSESWR +VD LL+ V A +GGW+ Sbjct: 499 DVFEVEVCSNMASLSVKIAALEALEALLAVGGSRRSESWRVNVDHLLLDVTRNASKGGWA 558 Query: 527 KEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGTQL 348 K+ R +VS T W D+Q SPGR RP HL+ LELFRRGT+ GT++ Sbjct: 559 KDGRGSLVSKSPTSIWGDYQIAALRALLASLLSPGRTRPPHLSQGLELFRRGTREIGTKV 618 Query: 347 AGYCGHALLALQVLIHPRSLPLSDFDSSADNYQA--LYPSGDRQISN------------- 213 A C HA+LAL+VLIHPR+LPL D +S+ +NY+ + SG+ +SN Sbjct: 619 AECCAHAILALEVLIHPRALPLLDLESTDNNYEVGNKWFSGNVNLSNRAANNTFHIGTSR 678 Query: 212 YQPDEPESEEDDLIENWLGKDEEMEIQVTERQQNTD 105 PDEP+S DDL +W+ E++ + ++TD Sbjct: 679 KAPDEPDSYNDDLYADWMRNGEDVVTVPADPAKDTD 714 >ref|XP_002276178.2| PREDICTED: uncharacterized protein LOC100256091 [Vitis vinifera] Length = 911 Score = 169 bits (427), Expect = 2e-39 Identities = 118/314 (37%), Positives = 157/314 (50%), Gaps = 20/314 (6%) Frame = -3 Query: 884 
VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKN---SLQELP 714 VH+++++++N DL+ + SS N K + H+KRK S +E Sbjct: 471 VHLAEEVINNAFADLNPIDQGTGDVSSSANSKASTGALLQTRHRKRKHATTATGSSEEQL 530 Query: 713 SHGGVEAEVSDNLTQ-ISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRG 537 E EV T I VKI LTVGG++RSE WR VD LLIT+ T A +G Sbjct: 531 DRVNFEKEVPKGYTTFIPVKIAALEALEALLTVGGALRSEHWRLKVDLLLITIATNACKG 590 Query: 536 GWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATG 357 GW+ +ER I + DAT T DFQ SP R+RP +LA LELFRRG Q TG Sbjct: 591 GWADDERVISLPSDATSTQADFQLAALRALLASLLSPARVRPPYLAQGLELFRRGKQETG 650 Query: 356 TQLAGYCGHALLALQVLIHPRSLPLSDFDS----SADN-----YQALYPSGDRQISNYQP 204 T+LA +C HALLAL+VLIHPR+LPL DF + S DN Y SG + ++ Sbjct: 651 TRLAEFCTHALLALEVLIHPRALPLEDFPTVNRKSFDNGANHKYPESMYSGGQDLNTPFS 710 Query: 203 DEP-------ESEEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGNDPNNKGMT 45 P + + DL + WLG D+E++I VT+ P KN N + Sbjct: 711 RGPLGMALGVPNPDYDLYDKWLGSDDEIDIPVTD-------PSKNR----NNVDDASEAF 759 Query: 44 SDHDVVENVEIDGA 3 DH + +DGA Sbjct: 760 RDHQTEKLPSVDGA 773 >emb|CBI35005.3| unnamed protein product [Vitis vinifera] Length = 937 Score = 169 bits (427), Expect = 2e-39 Identities = 118/314 (37%), Positives = 157/314 (50%), Gaps = 20/314 (6%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKN---SLQELP 714 VH+++++++N DL+ + SS N K + H+KRK S +E Sbjct: 445 VHLAEEVINNAFADLNPIDQGTGDVSSSANSKASTGALLQTRHRKRKHATTATGSSEEQL 504 Query: 713 SHGGVEAEVSDNLTQ-ISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRG 537 E EV T I VKI LTVGG++RSE WR VD LLIT+ T A +G Sbjct: 505 DRVNFEKEVPKGYTTFIPVKIAALEALEALLTVGGALRSEHWRLKVDLLLITIATNACKG 564 Query: 536 GWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATG 357 GW+ +ER I + DAT T DFQ SP R+RP +LA LELFRRG Q TG Sbjct: 565 GWADDERVISLPSDATSTQADFQLAALRALLASLLSPARVRPPYLAQGLELFRRGKQETG 624 Query: 356 TQLAGYCGHALLALQVLIHPRSLPLSDFDS----SADN-----YQALYPSGDRQISNYQP 204 T+LA +C HALLAL+VLIHPR+LPL DF + S DN Y SG + ++ Sbjct: 625 TRLAEFCTHALLALEVLIHPRALPLEDFPTVNRKSFDNGANHKYPESMYSGGQDLNTPFS 684 
Query: 203 DEP-------ESEEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGNDPNNKGMT 45 P + + DL + WLG D+E++I VT+ P KN N + Sbjct: 685 RGPLGMALGVPNPDYDLYDKWLGSDDEIDIPVTD-------PSKNR----NNVDDASEAF 733 Query: 44 SDHDVVENVEIDGA 3 DH + +DGA Sbjct: 734 RDHQTEKLPSVDGA 747 >ref|XP_006378815.1| hypothetical protein POPTR_0010s24450g [Populus trichocarpa] gi|550330520|gb|ERP56612.1| hypothetical protein POPTR_0010s24450g [Populus trichocarpa] Length = 837 Score = 156 bits (394), Expect = 1e-35 Identities = 98/287 (34%), Positives = 155/287 (54%), Gaps = 14/287 (4%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELPSHG 705 ++++Q++V+ L DL+ + + +++ + + P FH K++ SL++L Sbjct: 413 IYLAQEVVNCSLHDLNPILDGTSFHANAKSELLLPP----FHRKRKHGVTGSLEQLHDRI 468 Query: 704 GVEAEVSDNL-TQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGGWS 528 G+E E S N T ISVKI LTVGG +RSESWR VD LLIT+ T + + GW Sbjct: 469 GLEVETSKNRPTAISVKIAALGALETLLTVGGGLRSESWRSKVDNLLITIATESCKEGWV 528 Query: 527 KEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGTQL 348 +E + + ++T T +D Q SP +RP HLA ALELFRRG Q GT++ Sbjct: 529 SDESKTFLPNESTLTCSDLQLAALHALLASLLSPSGVRPPHLAPALELFRRGRQEIGTKV 588 Query: 347 AGYCGHALLALQVLIHPRSLPLSDFDSSAD----NY---QALYPSGDRQISNYQPDEPES 189 + +C +ALLAL+VLIHPR+LPL+DF S++ N+ + +Y + + Y ++ Sbjct: 589 SEFCAYALLALEVLIHPRALPLADFPSASSFNEVNHRFPENIYSVAQKHSNPYSSGVQDT 648 Query: 188 ------EEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGND 66 +DDL ++WL +E E V + +T+ P + G + Sbjct: 649 GHGLSDSDDDLYKSWLDSSKETEAPV-GKSMDTERPSETLTVQQGEN 694 >ref|XP_003529901.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1-like [Glycine max] Length = 883 Score = 147 bits (371), Expect = 5e-33 Identities = 99/305 (32%), Positives = 147/305 (48%), Gaps = 14/305 (4%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKD--LNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELPS 711 ++++Q++++N DL + ++ LN S +N L +K T SLQE Sbjct: 445 LYLAQEVINNAFADLSIIEHKNSGILNGSNSNASAGALLLPIHRKRKHSSTTGSLQE-HG 503 Query: 710 
HGGVEAEVSDN--LTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRG 537 GG+ EV N LT +S++I +TV G+++SE WR VD LL+ +F+ Sbjct: 504 EGGLSVEVPKNRPLTPVSLRIAALETLESLITVAGALKSEPWRSKVDSLLLVTAMDSFKE 563 Query: 536 GWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATG 357 G EER + + T T+ Q S R+RP +LA LELFRRG Q TG Sbjct: 564 GSVSEERSVFQQKEPAATTTELQLAALRALLVSLLSFARVRPPYLAQGLELFRRGRQQTG 623 Query: 356 TQLAGYCGHALLALQVLIHPRSLPL--------SDFDSSADNYQALYPSGDRQISNYQPD 201 T+LA +C HALL L+VLIHPR+LP+ S F + N Q Y P Sbjct: 624 TKLAEFCAHALLTLEVLIHPRALPMVDYAYANNSSFGEAHSNLQHGYFGWSHNTPYGLPQ 683 Query: 200 EPESEEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGNDPN--NKGMTSDHDVV 27 P +DDL WL D E+ + + + T P + A +DP ++SD ++ Sbjct: 684 VPPDYDDDLCARWLENDNEVGESLDKNTKYTQEPSE---ACRASDPEVLFVHVSSDTNIQ 740 Query: 26 ENVEI 12 E +E+ Sbjct: 741 ERIEM 745 >ref|XP_007010407.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508727320|gb|EOY19217.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 803 Score = 145 bits (367), Expect = 2e-32 Identities = 93/268 (34%), Positives = 137/268 (51%), Gaps = 19/268 (7%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRK---RTKNSLQELP 714 ++++ D++ N + DL+ +E D+ +S TN+ + ++KRK +T + ++ Sbjct: 370 IYLAPDVIDNAINDLNSFGDE-DVETSPTNIGPSTGALPQPSNRKRKHGTKTGSPEEKQT 428 Query: 713 SHGGVEAEVSDNLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGG 534 VE T I+VKI LTVGG+ +SESWR +D LLI T + + G Sbjct: 429 ISSEVEPLNPHQTTPITVKIAALDTLEVLLTVGGASKSESWRSRIDSLLIKTATNSCKRG 488 Query: 533 WSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGT 354 W EE + ++T W DFQ +P RIRP L+ LELFR+G Q GT Sbjct: 489 WGNEENNNFLPHESTSIWVDFQLSSLRALLASFLAPARIRPPFLSQGLELFRKGKQEAGT 548 Query: 353 QLAGYCGHALLALQVLIHPRSLPLSDFDSS----ADNYQALYP--------SGD----RQ 222 +LAG+C ALLAL+VLIHPR+LPL DF SS D +P GD + Sbjct: 549 KLAGFCASALLALEVLIHPRALPLDDFPSSYQTFTDGASHRFPENMPFYGQKGDTMFSKS 608 Query: 221 ISNYQPDEPESEEDDLIENWLGKDEEME 138 + + +S++DDL + WL + E E Sbjct: 609 MQGAEQSALKSDDDDLYDRWLQNENENE 636 
>ref|XP_007010406.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508727319|gb|EOY19216.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 813 Score = 145 bits (367), Expect = 2e-32 Identities = 93/268 (34%), Positives = 137/268 (51%), Gaps = 19/268 (7%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRK---RTKNSLQELP 714 ++++ D++ N + DL+ +E D+ +S TN+ + ++KRK +T + ++ Sbjct: 380 IYLAPDVIDNAINDLNSFGDE-DVETSPTNIGPSTGALPQPSNRKRKHGTKTGSPEEKQT 438 Query: 713 SHGGVEAEVSDNLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGG 534 VE T I+VKI LTVGG+ +SESWR +D LLI T + + G Sbjct: 439 ISSEVEPLNPHQTTPITVKIAALDTLEVLLTVGGASKSESWRSRIDSLLIKTATNSCKRG 498 Query: 533 WSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGT 354 W EE + ++T W DFQ +P RIRP L+ LELFR+G Q GT Sbjct: 499 WGNEENNNFLPHESTSIWVDFQLSSLRALLASFLAPARIRPPFLSQGLELFRKGKQEAGT 558 Query: 353 QLAGYCGHALLALQVLIHPRSLPLSDFDSS----ADNYQALYP--------SGD----RQ 222 +LAG+C ALLAL+VLIHPR+LPL DF SS D +P GD + Sbjct: 559 KLAGFCASALLALEVLIHPRALPLDDFPSSYQTFTDGASHRFPENMPFYGQKGDTMFSKS 618 Query: 221 ISNYQPDEPESEEDDLIENWLGKDEEME 138 + + +S++DDL + WL + E E Sbjct: 619 MQGAEQSALKSDDDDLYDRWLQNENENE 646 >ref|XP_007221563.1| hypothetical protein PRUPE_ppa014774mg [Prunus persica] gi|462418313|gb|EMJ22762.1| hypothetical protein PRUPE_ppa014774mg [Prunus persica] Length = 822 Score = 144 bits (364), Expect = 3e-32 Identities = 101/292 (34%), Positives = 146/292 (50%), Gaps = 23/292 (7%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPEL---SNEFHHKKRKRTKNSLQ-EL 717 V ++Q++V++ +DL+ +ANE SS N K E + + H+KRK +S E Sbjct: 388 VCLAQEVVNSAFIDLNPIANESGGASSSGNSKPSTEALVQTPQHSHRKRKHGASSGSLEW 447 Query: 716 PSHGGVEAEVSDNLTQ--ISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAF 543 + +E N T I+VKI LTVGG+++SE WR DVD LLI + T + Sbjct: 448 HNTSRLEGGTPKNHTTSPIAVKIAALRALEALLTVGGALKSEGWRSDVDLLLINIATNSL 507 Query: 542 RGGWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQA 363 +G W E I + Q S +RP++LA L+LFRRG Q Sbjct: 508 
KGAWGGENGNIYQLNEPGDIGGGMQLAALRALLASFLSSSCVRPTYLAEGLDLFRRGKQE 567 Query: 362 TGTQLAGYCGHALLALQVLIHPRSLPLSDFDSS---ADNYQALYP--------------S 234 TGT+LA +C HALLAL+VLIHPR+LPL+DF + +D P S Sbjct: 568 TGTKLAEFCAHALLALEVLIHPRALPLADFTDATLLSDRVHYKLPENMYSGSLRPRTPFS 627 Query: 233 GDRQISNYQPDEPESEEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVAT 78 GD I D +S+ DDL ++WL +EME V++ + A + ++ T Sbjct: 628 GD--IQGMMHDAADSDHDDLYDSWLASSKEMEAPVSDLGKTMQAGEPSKTVT 677 >ref|XP_002521170.1| conserved hypothetical protein [Ricinus communis] gi|223539617|gb|EEF41201.1| conserved hypothetical protein [Ricinus communis] Length = 863 Score = 143 bits (361), Expect = 8e-32 Identities = 102/333 (30%), Positives = 163/333 (48%), Gaps = 42/333 (12%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLAN---EKDLNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELP 714 ++++Q++V+N L+DLD + + +QP ++KRK + Sbjct: 446 IYLAQEVVNNSLLDLDPSVGCIFSSAYSKASFGALLQP------CNRKRKHGASEQNYDQ 499 Query: 713 SHGGVEAEVSDNLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGG 534 +EA S + ISVKI LTVGG+++SESWR V+KLLIT+ + +GG Sbjct: 500 LSLEMEAPKSCPASTISVKIAALEALRTLLTVGGALKSESWRSKVEKLLITLAADSCKGG 559 Query: 533 WSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGT 354 WS EER + T+ D Q SP R+RP HLA +LELF RG Q TGT Sbjct: 560 WSSEERTAFLPNGVASTYADLQLAVLRALLASLLSPSRVRPPHLAQSLELFHRGKQETGT 619 Query: 353 QLAGYCGHALLALQVLIHPRSLPLSDFDSSADNYQALY-----------------PSGDR 225 +++ +C +AL AL+VLIHPR+LPL+D S+ +++ Y SG R Sbjct: 620 EISEFCSYALSALEVLIHPRALPLADLPSANSSHEINYGFPETLYSGGQKHNTPISSGMR 679 Query: 224 QISNYQPDEPESEEDDLIENWLGKDEEME--------------IQVTERQQN-------T 108 I + PD +DDL ++WL ++E + ++V + ++N T Sbjct: 680 GIGHGSPD----SDDDLCDSWLDGNKETDTPDKITISNKPSENLKVQQAEKNFLAGPSAT 735 Query: 107 DAPKKNEVATSGNDPN-NKGMTSDHDVVENVEI 12 +P+++E+ + + + G D +V E+ Sbjct: 736 KSPRQSELEPAADSADVETGNLGDEMIVRTEEV 768 >ref|XP_006598922.1| PREDICTED: uncharacterized protein LOC100803198 [Glycine max] Length = 885 Score = 142 bits (358), Expect = 2e-31 Identities = 98/305 (32%), Positives = 144/305 (47%), Gaps = 14/305 
(4%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKD--LNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELPS 711 ++++Q++++N DL + ++ LN S +N L +K T SLQE Sbjct: 445 LYLAQEVINNAFADLSSIEHKNGGILNGSYSNASAGTLLPPSHRKRKHSSTTGSLQE-HG 503 Query: 710 HGGVEAEVSDN--LTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRG 537 GG+ EV N L +S++I +TV G+++SE WR VD LLI +F+ Sbjct: 504 EGGLSVEVPKNRPLIPMSLRIAALETLESLITVAGALKSEPWRSKVDSLLIVTAMDSFKE 563 Query: 536 GWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATG 357 G EER + + T TD Q S R+RP +LA LELFR+G Q TG Sbjct: 564 GSVGEERSVFQQKEPAATTTDLQLAALRALLVSFLSFARVRPPYLAQGLELFRKGRQQTG 623 Query: 356 TQLAGYCGHALLALQVLIHPRSLPL--------SDFDSSADNYQALYPSGDRQISNYQPD 201 T+LA +C HALL L+VLIHPR+LP+ S F + N Q Y P Sbjct: 624 TKLAEFCAHALLTLEVLIHPRALPMVDYAYANNSSFGEAHSNLQHEYFGWSNSTPYGLPQ 683 Query: 200 EPESEEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGNDPNNKGM--TSDHDVV 27 +P +DDL WL E + + + + T P + A +DP M +S ++ Sbjct: 684 DPPDYDDDLCARWLENGNEADESLDKNTKYTQEPSE---ACRASDPEVLSMHVSSGTNIQ 740 Query: 26 ENVEI 12 E E+ Sbjct: 741 ERTEM 745 >gb|EXB36971.1| hypothetical protein L484_018348 [Morus notabilis] Length = 872 Score = 139 bits (350), Expect = 1e-30 Identities = 96/272 (35%), Positives = 139/272 (51%), Gaps = 20/272 (7%) Frame = -3 Query: 878 VSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKR--TKNSLQELPSHG 705 ++QD+V+N VDL+ + + +S N K E + +KRK SL+E HG Sbjct: 446 LAQDVVNNAFVDLNPIGSGTG-GTSSENPKTSSEALQQTSRRKRKHGTPTGSLEE--GHG 502 Query: 704 GVEAEVSDNLTQ----ISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRG 537 G EV Q IS++I LTVGG++RSE WR ++D LLI +V + +G Sbjct: 503 GSSLEVEALKNQPSILISLRIAAVEALEALLTVGGALRSEGWRSNLDLLLINLVKNSLKG 562 Query: 536 GWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATG 357 GW+ EE I T W + Q S R+R ++A LELFRRG Q T Sbjct: 563 GWACEEINIFQHSGPTEIWANMQLAALRALLASFLS-SRVRSPYIAEGLELFRRGKQETS 621 Query: 356 TQLAGYCGHALLALQVLIHPRSLPLSDFDSS------ADNYQ-ALYPSGDRQISNYQ--- 207 T+LA +C HALLAL+VLIHPR+LP+ DF S YQ +Y + I+ + Sbjct: 622 TKLADFCAHALLALEVLIHPRALPVEDFPFSNRISDGVHKYQEKIYSGNPKYITPFSSGA 
681 Query: 206 ----PDEPESEEDDLIENWLGKDEEMEIQVTE 123 ++ +S+ DDL ++WL +E E ++ Sbjct: 682 NGMGQNDLDSDHDDLCDSWLENGKEAEATASD 713 >ref|XP_004510734.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1-like [Cicer arietinum] Length = 876 Score = 131 bits (329), Expect = 4e-28 Identities = 95/299 (31%), Positives = 149/299 (49%), Gaps = 10/299 (3%) Frame = -3 Query: 878 VSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKR--TKNSLQELPSHG 705 +S+++V+N + DL + N+ +G+N V H+KRK T SL E + Sbjct: 447 LSKEVVNNAIADLSTIENKNGGTLNGSNTDVSTVAPQPARHRKRKHNNTTGSLLENDASS 506 Query: 704 GVEAEVSD--NLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGGW 531 G+ EV + T IS+++ +TV G++RSE WR VD LLI + +FR G Sbjct: 507 GLVVEVPKKCHATPISLRVAALEALEALITVAGALRSEQWRPQVDSLLIVIAMDSFREGS 566 Query: 530 SKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGTQ 351 S EE + + D T TD Q S + +L+ LELFRRG Q TGT+ Sbjct: 567 SSEEINVFQNKDPAATATDLQLAAFRALLASFLSVTAPQTPYLSQGLELFRRGKQQTGTK 626 Query: 350 LAGYCGHALLALQVLIHPRSLPLSDFDSSADN--YQALYPSGDRQISNYQP-DEPESE-- 186 LA +C HA+L L+VLIHP++ PL D+ +N +A D S P PE++ Sbjct: 627 LAEFCAHAMLTLEVLIHPKTYPLVDYVRPNNNTYEEAKVSFRDEYFSRNNPFGLPEAKPP 686 Query: 185 -EDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGNDPNNKGMTSDHDVVENVEI 12 D++ + + D+++ + TE ++T+ K +E+ T + S D+ E+ EI Sbjct: 687 VRDEITDYLINDDDDLGVLWTESTKDTN--KSSEMVTP--------LPSSTDIQESSEI 735 >ref|XP_007135214.1| hypothetical protein PHAVU_010G110700g [Phaseolus vulgaris] gi|561008259|gb|ESW07208.1| hypothetical protein PHAVU_010G110700g [Phaseolus vulgaris] Length = 876 Score = 127 bits (318), Expect = 7e-27 Identities = 93/287 (32%), Positives = 134/287 (46%), Gaps = 13/287 (4%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTK--NSLQELPS 711 ++++Q++V+N DL+ + +G+N H+KRK + SLQE Sbjct: 446 LYLAQEVVNNAFTDLNSTEHMDGGILNGSNSNASAGAQQPPSHRKRKHSSATGSLQEHDE 505 Query: 710 HGGVEAEVSDN--LTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRG 537 GG EV N LT IS++I LTV G+++S WR +D LLI + T +F+ Sbjct: 506 
GGGSGVEVPKNRPLTPISLRIAALETLEALLTVAGALKSAPWRSKLDSLLIVIATDSFKE 565 Query: 536 GWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATG 357 G VS + T TD Q S R RP + + LELFRRG Q T Sbjct: 566 G--------TVSEEPAATVTDLQLAALRTLQASFLSFIRERPPYFSQGLELFRRGKQQTA 617 Query: 356 T-QLAGYCGHALLALQVLIHPRSLPLSDFDSSADN--------YQALYPSGDRQISNYQP 204 +LA +C HALL L+VLIHPR+LPL D+ + +N Q Y P Sbjct: 618 VPKLAEFCAHALLTLEVLIHPRALPLVDYAYAVNNSSGEAHGSLQHEYSGRSNSTPFGLP 677 Query: 203 DEPESEEDDLIENWLGKDEEMEIQVTERQQNTDAPKKNEVATSGNDP 63 +P +DDL WL +E ++ + + +N P + A NDP Sbjct: 678 QDPPDSDDDLCARWLETGKEDDVSMGKDAENNQKPSE---ACRDNDP 721 >ref|XP_004301668.1| PREDICTED: uncharacterized protein LOC101297648 [Fragaria vesca subsp. vesca] Length = 832 Score = 125 bits (315), Expect = 2e-26 Identities = 88/269 (32%), Positives = 137/269 (50%), Gaps = 20/269 (7%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQ-PELSNEFHHKKRKRTKNSLQELPSH 708 V ++Q++V++ VDL+ + E + + +Q P+ SN RKR +L L H Sbjct: 449 VSLAQEVVNSTSVDLNPIVMESSASVKPSEALLQTPQSSN------RKRKHGTLTSLEMH 502 Query: 707 GGVEAEV--SDNLTQIS--VKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFR 540 EV + N T+ S V++ LTV G +SE WR +VD LLI + T + + Sbjct: 503 NSSNLEVGTTKNHTRCSMAVQVAALEALEALLTVDGVFKSEGWRSNVDLLLINIATNSLK 562 Query: 539 GGWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQAT 360 GG + E I + T +D Q S R+RP +LA ++LFRRG + Sbjct: 563 GGLAGENASIYQPNEPTDVCSDIQLAALRALLASFLSSSRVRPLYLAQGVDLFRRGKLES 622 Query: 359 GTQLAGYCGHALLALQVLIHPRSLPLSDFDSSADN-------YQALYPSGD-RQISNYQP 204 GT+LA +C HALL L+VLIHPR+LPL+DF +S N YQ + SG+ + ++Y Sbjct: 623 GTKLAEFCAHALLVLEVLIHPRALPLADFSNSTSNDERAHHDYQGNFYSGNLKHGTSYST 682 Query: 203 D-------EPESEEDDLIENWLGKDEEME 138 + P+ D+L +W+ +++E Sbjct: 683 NIHGTADIAPDLYRDELYSSWIETSKKVE 711 >ref|XP_006306777.1| hypothetical protein CARUB_v10008316mg [Capsella rubella] gi|482575488|gb|EOA39675.1| hypothetical protein CARUB_v10008316mg [Capsella rubella] Length = 826 Score = 120 bits (302), Expect = 5e-25 Identities = 91/300 (30%), Positives = 136/300 
(45%), Gaps = 11/300 (3%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELPSHG 705 + ++QD+V+N DLD + E S NM + + KKRK + NS E + Sbjct: 442 MQLAQDVVTNASADLDPRSVEGFDAVSSKNMSLTNGAVPQACSKKRKHSTNSGVEA-DNS 500 Query: 704 GVEAEVSDNLTQ--ISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVTYAFRGGW 531 E V N + I++KI LT+GG+ S+SWR VD LL+T T A G W Sbjct: 501 AFEVRVPHNHSSSPITLKIASLEALETLLTIGGAFGSDSWRERVDNLLMTTATNACEGRW 560 Query: 530 SKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRGTQATGTQ 351 + E +T +FQ SP R+RP+ LA LELFR G G + Sbjct: 561 ANSETYHFFPNKSTTDLVEFQLAALRAFLASLVSPSRVRPAFLAEGLELFRTGKLQAGMK 620 Query: 350 LAGYCGHALLALQVLIHPRSLPLSDFDSSADNYQALYPSGDRQISNYQPDEPE------- 192 +AG+C AL++L+V+IHPR+LPL S ++ + G +++ Q + P Sbjct: 621 VAGFCAQALMSLEVVIHPRALPLDGLPSLSNWF-----PGSNSLASQQHNNPNLNNLNGI 675 Query: 191 -SEEDDLIENWLGKDEEMEIQVTERQQNTDAP-KKNEVATSGNDPNNKGMTSDHDVVENV 18 + DDL WL K + ++ T P ++ + GND S D + V Sbjct: 676 AHDGDDLCNRWLAKADVPSNNAIQKTLETTLPSQETKRLKLGNDLTTVASLSVEDHTDMV 735 >ref|XP_002893608.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297339450|gb|EFH69867.1| binding protein [Arabidopsis lyrata subsp. 
lyrata] Length = 838 Score = 120 bits (301), Expect = 7e-25 Identities = 88/286 (30%), Positives = 136/286 (47%), Gaps = 13/286 (4%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELPSHG 705 + ++Q++V N VDLD + E +S N + + KKRK H Sbjct: 442 MQLAQEVVINASVDLDQTSLEAFDVASSKNPSLTNGALLQACSKKRK-----------HS 490 Query: 704 GVEAEVS---------DNLTQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVT 552 GVEAE S + + IS+KI LT+GG++ S+SWR VD LL+T T Sbjct: 491 GVEAENSVFEVRIPHNHSRSPISLKIASLEALETLLTIGGALGSDSWRESVDNLLLTTAT 550 Query: 551 YAFRGGWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRG 372 A G W+ E + +T +FQ SP R+RP+ LA LELFR G Sbjct: 551 NACEGRWANAETYHCLPNKSTTDLVEFQLAALRAFSASLVSPSRVRPAFLAEGLELFRTG 610 Query: 371 TQATGTQLAGYCGHALLALQVLIHPRSLPLSDFDSSADNYQALYPSGDRQISNYQPDEPE 192 G ++AG+C HAL++L+V+IHPR+LPL + ++ + G ++ + ++ Sbjct: 611 KLQAGMKVAGFCAHALMSLEVVIHPRALPLDGLPTLSNRFPESNSFGSQKHNTPNLNKLN 670 Query: 191 ---SEEDDLIENWLGKDEEMEIQVTERQQNTDAP-KKNEVATSGND 66 + DDL WL K + +R +T P ++++ GND Sbjct: 671 VIAHDGDDLGNRWLAKADVPSNNAIQRTFDTTLPLQESKRLKVGND 716 >gb|AAG50563.1|AC073506_5 hypothetical protein [Arabidopsis thaliana] Length = 873 Score = 116 bits (291), Expect = 1e-23 Identities = 88/286 (30%), Positives = 135/286 (47%), Gaps = 13/286 (4%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELPSHG 705 + ++Q++V N VDLD + E +S N + + KKRK H Sbjct: 488 MQLAQEVVINASVDLDQTSLEAFDVASSKNPSLTNGALLQACSKKRK-----------HS 536 Query: 704 GVEAEVS--------DNL-TQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVT 552 GVEAE S ++L + IS+KI LT+GG++ S+SWR VD LL+T T Sbjct: 537 GVEAENSVFELRIPHNHLRSPISLKIASLEALETLLTIGGALGSDSWRESVDNLLLTTAT 596 Query: 551 YAFRGGWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRG 372 A G W+ E + +T +FQ SP R+RP+ LA LELFR G Sbjct: 597 NACEGRWANAETYHCLPNKSTTDLVEFQLAALRAFSASLVSPSRVRPAFLAEGLELFRTG 656 Query: 371 TQATGTQLAGYCGHALLALQVLIHPRSLPLSDFDSSADNYQALYPSGDRQISNYQPDEPE 192 G ++AG+C HAL++L+V+IHPR+LPL + ++ + G + + ++ Sbjct: 657 
KLQAGMKVAGFCAHALMSLEVVIHPRALPLDGLPTLSNRFPESNSFGSEKHNTPNLNKLN 716 Query: 191 ---SEEDDLIENWLGKDEEMEIQVTERQQNTDAP-KKNEVATSGND 66 + DDL W K + +R +T P +++ GND Sbjct: 717 VIAHDGDDLGNRWQAKADVPSNNAIQRTLDTTLPLQESNRLKVGND 762 >ref|NP_174315.2| uncharacterized protein [Arabidopsis thaliana] gi|332193076|gb|AEE31197.1| uncharacterized protein AT1G30240 [Arabidopsis thaliana] Length = 825 Score = 116 bits (291), Expect = 1e-23 Identities = 88/286 (30%), Positives = 135/286 (47%), Gaps = 13/286 (4%) Frame = -3 Query: 884 VHVSQDIVSNLLVDLDFLANEKDLNSSGTNMKVQPELSNEFHHKKRKRTKNSLQELPSHG 705 + ++Q++V N VDLD + E +S N + + KKRK H Sbjct: 440 MQLAQEVVINASVDLDQTSLEAFDVASSKNPSLTNGALLQACSKKRK-----------HS 488 Query: 704 GVEAEVS--------DNL-TQISVKIXXXXXXXXXLTVGGSIRSESWRGDVDKLLITVVT 552 GVEAE S ++L + IS+KI LT+GG++ S+SWR VD LL+T T Sbjct: 489 GVEAENSVFELRIPHNHLRSPISLKIASLEALETLLTIGGALGSDSWRESVDNLLLTTAT 548 Query: 551 YAFRGGWSKEERQIVVSGDATPTWTDFQXXXXXXXXXXXXSPGRIRPSHLALALELFRRG 372 A G W+ E + +T +FQ SP R+RP+ LA LELFR G Sbjct: 549 NACEGRWANAETYHCLPNKSTTDLVEFQLAALRAFSASLVSPSRVRPAFLAEGLELFRTG 608 Query: 371 TQATGTQLAGYCGHALLALQVLIHPRSLPLSDFDSSADNYQALYPSGDRQISNYQPDEPE 192 G ++AG+C HAL++L+V+IHPR+LPL + ++ + G + + ++ Sbjct: 609 KLQAGMKVAGFCAHALMSLEVVIHPRALPLDGLPTLSNRFPESNSFGSEKHNTPNLNKLN 668 Query: 191 ---SEEDDLIENWLGKDEEMEIQVTERQQNTDAP-KKNEVATSGND 66 + DDL W K + +R +T P +++ GND Sbjct: 669 VIAHDGDDLGNRWQAKADVPSNNAIQRTLDTTLPLQESNRLKVGND 714
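
Each alignment in the report above carries a statistics line of the form "Identities = a/n (p%), Positives = b/n (q%), Gaps = c/n (r%)", where n is the alignment length in columns. The Python sketch below is one way to read such a line and recompute the percentages. It is only an illustration, not part of BLAST: the names STAT_RE, parse_alignment_stats and percent are hypothetical, and the pattern assumes the exact wording printed by this BLASTX 2.2.25 report. The percentages shown in this report look truncated rather than rounded (for instance, 160/297 is listed as 53%), so the sketch prints the unrounded value next to the reported one.

```python
import re

# Matches one field of a BLAST alignment statistics line, e.g.
# "Identities = 146/309 (47%)". Assumes the wording used in this report.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(text):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(reported))
        for name, count, length, reported in STAT_RE.findall(text)
    }


def percent(count, length):
    """Share of alignment columns as an unrounded percentage."""
    return 100.0 * count / length


# Statistics line of the top hit (gb|EYU19774.1|) copied from the report.
line = "Identities = 146/309 (47%), Positives = 186/309 (60%), Gaps = 19/309 (6%)"
stats = parse_alignment_stats(line)

for name, (count, length, reported) in stats.items():
    value = percent(count, length)
    print(f"{name}: {count}/{length} = {value:.2f}% (reported {reported}%)")
```

The same function can be applied to the statistics line of any other hit in the report; fields that are absent from a line are simply missing from the returned dictionary.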