BLASTX nr result
ID: Mentha24_contig00015097
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha24_contig00015097 (550 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus... 113 4e-23 gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea] 99 5e-19 ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu... 96 6e-18 ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247... 94 2e-17 ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm... 94 2e-17 gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot... 83 4e-14 ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein... 83 4e-14 gb|AAM65660.1| Contains similarity to RNA-binding protein from A... 83 4e-14 ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps... 82 9e-14 ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [A... 82 1e-13 ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp.... 81 2e-13 dbj|BAD43943.1| unknown protein [Arabidopsis thaliana] gi|519705... 80 3e-13 ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citr... 80 4e-13 ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutr... 80 4e-13 ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1... 79 7e-13 ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr... 78 1e-12 ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun... 77 4e-12 ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas... 75 1e-11 ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507... 74 2e-11 ref|XP_007014541.1| Hydroxyproline-rich glycoprotein family prot... 72 7e-11 >gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus] Length = 493 Score = 113 bits (282), Expect = 4e-23 Identities = 77/216 (35%), Positives = 106/216 (49%), Gaps = 37/216 (17%) Frame = -1 Query: 538 EAQSGWTESETPPPKDKALPTGILGVLSGAGRGKPTK-PSVPHPEK----TRQTGGREPS 374 E Q+ ESE P ++ L + I+ VLSGAGRGKP K P+ PEK R R P Sbjct: 147 EEQADAAESEVPSAQETLLRSDIVSVLSGAGRGKPGKPPTAAQPEKPQSENRHIRQRPPQ 206 Query: 373 QSP----NKDTAV--REQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY 212 P + D A QLS+EE V+KAKEILSK Sbjct: 207 GKPPVAVSSDGAAPPAVQLSKEEMVKKAKEILSKGDEDGGVSRPEVRDNRDNRDNRGGGR 266 Query: 211 EERGDQSRGRFSGHG--------------------------AADREKLAKRLGPEIMSKV 110 RG++ RGR G G AD EK+A++LGP++M+++ Sbjct: 267 GGRGERGRGRGRGRGRGRGRGRGDDRYEESDDESDALFIGDPADEEKVAQKLGPDVMAQL 326 Query: 109 VEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2 EG++EM+SR +P P +A +DAF+T++ +EC PEY Sbjct: 327 AEGIDEMSSRVLPSPFDDAYMDAFETNLRIECEPEY 362 >gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea] Length = 426 Score = 99.4 bits (246), Expect = 5e-19 Identities = 70/194 (36%), Positives = 96/194 (49%), Gaps = 26/194 (13%) Frame = -1 Query: 505 PPPKDKALPTGILGVLSGAGRGKPTKPSVPHPEKTRQTGG----REPSQSPNKDTAVREQ 338 PPP+D A IL LSG GRG P KP P+ + T R+P P+ + +Q Sbjct: 114 PPPRDTAALDDILTNLSGMGRGTPGKPP---PQTLKPTPINRHIRQPQPRPSTALSPDQQ 170 Query: 337 LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRG-RFSGHGA- 164 LS+EEK++KA EILS+ RG RG RFSG G Sbjct: 171 LSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRG---------RGRGGRGGRFSGRGRG 221 Query: 163 --------------------ADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVD 44 AD +K+A++LG E+M+K+ EG+EEM+SR +P +A VD Sbjct: 222 READAAIESDEELPGMFGDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVD 281 Query: 43 AFQTDIMLECLPEY 2 A+ T+++LEC PEY Sbjct: 282 AYHTNLLLECEPEY 295 >ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum] Length = 480 Score = 95.9 bits (237), Expect = 6e-18 Identities = 67/208 (32%), Positives = 100/208 (48%), Gaps = 30/208 (14%) Frame = -1 Query: 535 AQSGWTESETPPPKDKA-LPTGILGVLSGAGRGKPTKPSVPHPEKTRQTGGR-EPSQSPN 362 A S + S+ P P+D + L + ++ VL+GAGRGKP + + P EK ++ P Q Sbjct: 142 ADSNSSSSDAPTPRDDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKV 201 Query: 361 KDTAVR------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY---E 209 D+ R ++LS+E+ V+KA ILS+ Sbjct: 202 ADSGERASSPPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVR 261 Query: 208 ERGDQSRGRFSGHGA-------------------ADREKLAKRLGPEIMSKVVEGLEEMA 86 RG + RGR G G AD EKLA++LGPE M+ + EG EEM+ Sbjct: 262 GRGGRGRGRGRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMS 321 Query: 85 SRAVPDPHKEALVDAFQTDIMLECLPEY 2 +R +P P +A ++A T++M+EC PEY Sbjct: 322 ARVLPSPMDDAYIEALHTNMMIECEPEY 349 >ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED: uncharacterized protein LOC101247662 isoform 2 [Solanum lycopersicum] Length = 473 Score = 94.0 bits (232), Expect = 2e-17 Identities = 66/203 (32%), Positives = 97/203 (47%), Gaps = 27/203 (13%) Frame = -1 Query: 529 SGWTESETPPPKDKA-LPTGILGVLSGAGRGKPTKPSVPHPEKTRQTGGR-EPSQSPNKD 356 S + S P P+D + LP+ ++ VL+GAGRGKP + + EK ++ P Q D Sbjct: 141 SNSSSSNAPKPRDDSNLPSSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVAD 200 Query: 355 TAVR------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQ 194 + R ++LS+E+ V+KA ILS+ RG + Sbjct: 201 SGERASSPPPQRLSREDAVKKAVGILSR-SDDGDVGGGRGMGGGFRGRGGRGAVRGRGGR 259 Query: 193 SRGRFSGHGA-------------------ADREKLAKRLGPEIMSKVVEGLEEMASRAVP 71 RGR G G AD EKLA +LGPE M+ + EG EEM++R +P Sbjct: 260 GRGRGRGRGRRDEERGDGNLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLP 319 Query: 70 DPHKEALVDAFQTDIMLECLPEY 2 P +A ++A T++M+EC PEY Sbjct: 320 SPMDDAYLEALHTNMMIECEPEY 342 >ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis] gi|223537066|gb|EEF38701.1| conserved hypothetical protein [Ricinus communis] Length = 436 Score = 94.0 bits (232), Expect = 2e-17 Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 12/191 (6%) Frame = -1 Query: 538 EAQSGWTESETPPPKDKALPTGILGVLSGAGRGKPTKPSVPHP---EKTRQTGGREPSQS 368 + + G + T D LP+ I LSG GRG+P KP VP P E+ R R ++ Sbjct: 115 DPEPGPSRQPTESQSDSVLPSTIHSSLSGFGRGEPDKPVVPTPQVKEENRHIRDRSRAKP 174 Query: 367 PNKDTAVREQ--LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQ 194 ++ VR + +S+EE V++A ILS+ E+RG Sbjct: 175 KTEEAEVRAKPKISREEAVKRAVSILSQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRM 234 Query: 193 SRGRFSGHGA-------ADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQ 35 G G+ AD EKLA ++G E M+K+VEG EEM+ R +P P ++A +DA Sbjct: 235 MDDVDEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALH 294 Query: 34 TDIMLECLPEY 2 T+ M+E PEY Sbjct: 295 TNYMIEFEPEY 305 >gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain. ESTs gb|H37317, gb|F14415, gb|AA651290 come from this gene [Arabidopsis thaliana] Length = 829 Score = 83.2 bits (204), Expect = 4e-14 Identities = 65/210 (30%), Positives = 87/210 (41%), Gaps = 39/210 (18%) Frame = -1 Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371 S PPP+ K P I L SGAGRGKP S P E RQ R P Sbjct: 491 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 548 Query: 370 SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233 P KD + QLS EE R+A+ LS+ Sbjct: 549 PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 608 Query: 232 XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92 EE G+Q R +AD EK A+++GPE+M + EG EE Sbjct: 609 RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 668 Query: 91 MASRAVPDPHKEALVDAFQTDIMLECLPEY 2 + +A+P +A++DA+ T++M+EC PEY Sbjct: 669 ICEKALPSTTHDAIIDAYDTNLMIECEPEY 698 >ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown protein; 43598-45751 [Arabidopsis thaliana] gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|110740318|dbj|BAF02054.1| hypothetical protein [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 523 Score = 83.2 bits (204), Expect = 4e-14 Identities = 65/210 (30%), Positives = 87/210 (41%), Gaps = 39/210 (18%) Frame = -1 Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371 S PPP+ K P I L SGAGRGKP S P E RQ R P Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 242 Query: 370 SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233 P KD + QLS EE R+A+ LS+ Sbjct: 243 PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 302 Query: 232 XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92 EE G+Q R +AD EK A+++GPE+M + EG EE Sbjct: 303 RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 362 Query: 91 MASRAVPDPHKEALVDAFQTDIMLECLPEY 2 + +A+P +A++DA+ T++M+EC PEY Sbjct: 363 ICEKALPSTTHDAIIDAYDTNLMIECEPEY 392 >gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain [Arabidopsis thaliana] Length = 523 Score = 83.2 bits (204), Expect = 4e-14 Identities = 65/210 (30%), Positives = 87/210 (41%), Gaps = 39/210 (18%) Frame = -1 Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371 S PPP+ K P I L SGAGRGKP S P E RQ R P Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 242 Query: 370 SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233 P KD + QLS EE R+A+ LS+ Sbjct: 243 PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 302 Query: 232 XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92 EE G+Q R +AD EK A+++GPE+M + EG EE Sbjct: 303 RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 362 Query: 91 MASRAVPDPHKEALVDAFQTDIMLECLPEY 2 + +A+P +A++DA+ T++M+EC PEY Sbjct: 363 ICEKALPSTTHDAIIDAYDTNLMIECEPEY 392 >ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] gi|482575944|gb|EOA40131.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] Length = 525 Score = 82.0 bits (201), Expect = 9e-14 Identities = 60/207 (28%), Positives = 90/207 (43%), Gaps = 36/207 (17%) Frame = -1 Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP--------------HP 410 S P P+ K+ LP + L SGAGRGKP S P P Sbjct: 189 SSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGRGKPLVESAPIQREENRHIRRPPPPP 248 Query: 409 EKTRQTGGREPSQSPNKDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 230 ++ R ++ +Q+P +T R +LS EE R+A+ LS+ Sbjct: 249 QQQRSQPQQKRAQTPRDETP-RPRLSAEEAGRRARSELSRGEAEGSGVRGRGGRGRGRGA 307 Query: 229 XXXXXY-----------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEEMAS 83 EE G+Q +AD EK A ++GPE+M + EG EE+ Sbjct: 308 RGRGRGRGGEGWRDDKKEEEGEQEAMSVFAGDSADGEKFANKMGPELMKTLAEGFEEVCE 367 Query: 82 RAVPDPHKEALVDAFQTDIMLECLPEY 2 +A+P +A++DA+ T++M+EC PEY Sbjct: 368 KALPSTTHDAIIDAYDTNLMIECEPEY 394 >ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda] gi|548839984|gb|ERN00220.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda] Length = 447 Score = 81.6 bits (200), Expect = 1e-13 Identities = 61/196 (31%), Positives = 87/196 (44%), Gaps = 21/196 (10%) Frame = -1 Query: 526 GWTESETPPPKDKALPTGILGV-LSGAGRGKPTKPSVPH---PEKTRQTGGREP-----S 374 G +++ PP + LP I + G GRGKPT P + H E+ R R P Sbjct: 142 GRVQAQNLPPTESPLPRSISPAPIEGFGRGKPTSPLLSHGIEEEENRHIRRRSPPPERAG 201 Query: 373 QSPNKDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY------ 212 Q+ + +LS EE VR AK+ILS+ Sbjct: 202 QASRGRASNERKLSSEEAVRNAKDILSRGEGRGGRGLRGGRGLRGGRGRGGVWAGRGRQG 261 Query: 211 ------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEAL 50 + R D S G + G A D EKL KRLG E ++++ E +EM+ R +P P +EA Sbjct: 262 RGARYQDRREDDSVGLYLGDDA-DGEKLVKRLGEENVNQIFEAFDEMSGRVLPSPMEEAY 320 Query: 49 VDAFQTDIMLECLPEY 2 +DA T+ ++E PEY Sbjct: 321 LDALHTNCLIEFEPEY 336 >ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297340299|gb|EFH70716.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 769 Score = 81.3 bits (199), Expect = 2e-13 Identities = 55/180 (30%), Positives = 82/180 (45%), Gaps = 29/180 (16%) Frame = -1 Query: 454 GAGRGKPT---------------KPSVPHPEKTRQTGGREPSQ--SPN-KDTAVREQLSQ 329 GAGRGKP +P P P + +Q +P Q +P KD A + QLS+ Sbjct: 459 GAGRGKPLVESAPIQQEDNRQIRRPQPPPPPQQQQQQRAQPQQKRAPTVKDEAPKPQLSR 518 Query: 328 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY-----------EERGDQSRGR 182 EE R+A+ LS+ EE G+Q Sbjct: 519 EEAGRRARSELSRGEAEGGGVRGRGGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMS 578 Query: 181 FSGHGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2 +AD EK A+++GPE+M + EG EE+ +A+P +A++DA+ T++M+EC PEY Sbjct: 579 IFAGDSADGEKFAQKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIECEPEY 638 >dbj|BAD43943.1| unknown protein [Arabidopsis thaliana] gi|51970532|dbj|BAD43958.1| unknown protein [Arabidopsis thaliana] Length = 417 Score = 80.5 bits (197), Expect = 3e-13 Identities = 64/209 (30%), Positives = 86/209 (41%), Gaps = 39/209 (18%) Frame = -1 Query: 514 SETPPPKDKA----LPTGILGVL-------SGAGRGKPTKPSVP-HPEKTRQTGGREPSQ 371 S PPP+ K P I L SGAGRGKP S P E RQ R P Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQI--RRPPP 242 Query: 370 SPN--------------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXX 233 P KD + QLS EE R+A+ LS+ Sbjct: 243 PPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRG 302 Query: 232 XXXXXXY-------------EERGDQSRGRFSGHGAADREKLAKRLGPEIMSKVVEGLEE 92 EE G+Q R +AD EK A+++GPE+M + EG EE Sbjct: 303 RGARGRGRGRGGDGWRDDKKEEEGEQEAMRIFAGDSADGEKFAEKMGPELMKTLAEGFEE 362 Query: 91 MASRAVPDPHKEALVDAFQTDIMLECLPE 5 + +A+P +A++DA+ T++M+EC PE Sbjct: 363 ICEKALPSTTHDAIIDAYDTNLMIECEPE 391 >ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citrus clementina] gi|557544515|gb|ESR55493.1| hypothetical protein CICLE_v10019766mg [Citrus clementina] Length = 511 Score = 79.7 bits (195), Expect = 4e-13 Identities = 60/206 (29%), Positives = 92/206 (44%), Gaps = 35/206 (16%) Frame = -1 Query: 514 SETPPPKDKALPTGILGVLSGAGRGKPT-------------KPSVPHPEKTRQTGGR--- 383 +++ P + LP+ I+ L GAGRGK +P P E+ R R Sbjct: 190 TDSTQPSEPNLPSSIISTLPGAGRGKTVVTQQQQQQQHQRQQPGPPPQEENRHIRARLQP 249 Query: 382 --EPSQSPNKDT-AVREQLSQEEKVRKAKEILSK-------------XXXXXXXXXXXXX 251 P ++P +T + + +LS+E+ V+ A +ILS+ Sbjct: 250 QPRPEKAPAAETGSAQPKLSKEDAVKMAMKILSRGEEGEGEGISAGGPGRGRGMGRGGGR 309 Query: 250 XXXXXXXXXXXXYEERGDQSRGRFSG---HGAADREKLAKRLGPEIMSKVVEGLEEMASR 80 +E D GRF G AD EKLA+++G E M+ +VEG EEM+ R Sbjct: 310 GRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGR 369 Query: 79 AVPDPHKEALVDAFQTDIMLECLPEY 2 +P P ++A +DA T+ M+E PEY Sbjct: 370 VLPSPMEDAYIDALHTNCMIEFEPEY 395 >ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum] gi|557089350|gb|ESQ30058.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum] Length = 531 Score = 79.7 bits (195), Expect = 4e-13 Identities = 62/215 (28%), Positives = 90/215 (41%), Gaps = 44/215 (20%) Frame = -1 Query: 514 SETPPPKDKALPTGILGVLSGAGRGKP---------------TKPSVPHPEKTRQTGGRE 380 SE P + +P SGAGRGKP +P P P++ +Q + Sbjct: 204 SEFSQPNQRIVPG------SGAGRGKPFVESAPLQQEENRHIRRPQPPPPQQQQQRSQPQ 257 Query: 379 PSQS-----PNKDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXX 215 P P KD A R +LS EE R+A+ LS+ Sbjct: 258 PQHQQKRVQPPKDEAPRPKLSIEEAGRRARSQLSRGEAEGGGLRGRGGGRG--------- 308 Query: 214 YEERGDQSRGRFSGHG------------------------AADREKLAKRLGPEIMSKVV 107 RG +RGR G G +AD EK A ++GPEIM + Sbjct: 309 ---RGRGARGRGRGRGGEGWRDVKMEEEAEQEAISTFVGDSADGEKFANKMGPEIMKMLA 365 Query: 106 EGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2 +G E++ RA+P +A++DA++T++M+EC PEY Sbjct: 366 DGYEDICERALPSTANDAVLDAYETNLMIECEPEY 400 >ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis] Length = 407 Score = 79.0 bits (193), Expect = 7e-13 Identities = 59/207 (28%), Positives = 92/207 (44%), Gaps = 36/207 (17%) Frame = -1 Query: 514 SETPPPKDKALPTGILGVLSGAGRGKPT--------------KPSVPHPEKTRQTGGR-- 383 +++ P + LP+ I+ L GAGRGK +P P E+ R R Sbjct: 70 TDSTQPSEPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIRARLQ 129 Query: 382 ---EPSQSPNKDT-AVREQLSQEEKVRKAKEILSK-------------XXXXXXXXXXXX 254 P ++P +T + + +LS+E+ V+ A ++LS+ Sbjct: 130 PQPRPEKAPAAETGSAQPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGRGMGRGRG 189 Query: 253 XXXXXXXXXXXXXYEERGDQSRGRFSG---HGAADREKLAKRLGPEIMSKVVEGLEEMAS 83 +E D GRF G AD EKLA+++G E M+ +VEG EEM+ Sbjct: 190 RGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSG 249 Query: 82 RAVPDPHKEALVDAFQTDIMLECLPEY 2 R +P P ++A +DA T+ M+E PEY Sbjct: 250 RVLPSPMEDAYIDALHTNCMIEFEPEY 276 >ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550322664|gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 466 Score = 78.2 bits (191), Expect = 1e-12 Identities = 66/204 (32%), Positives = 86/204 (42%), Gaps = 32/204 (15%) Frame = -1 Query: 517 ESETPPPKDKALPTGILGVLSGAGRGKPTKPSVP-HPEKTRQTGGREPSQ---------- 371 ESE P + LP IL L GAGRGKP K VP P K R SQ Sbjct: 133 ESEPPKKAEANLPPSILSGLGGAGRGKPVKQEVPIEPAKEENRHLRARSQPRSQPRTRQQ 192 Query: 370 -SPNKDTAV--REQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEER- 203 +P+ D AV ++ ++E V+KA E+LS+ R Sbjct: 193 KTPDGDDAVPATTKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARG 252 Query: 202 GDQSRGR-----------------FSGHGAADREKLAKRLGPEIMSKVVEGLEEMASRAV 74 G + RGR GH D EK A+ +G E M+ +VE EEM+ R + Sbjct: 253 GGRGRGRGRRGYGDKEVEYGSGMSLEGH-EEDEEKFAQSVGVETMNTLVEAFEEMSGRVL 311 Query: 73 PDPHKEALVDAFQTDIMLECLPEY 2 P P ++ VDAF T+ E PEY Sbjct: 312 PCPIEDEYVDAFDTNCSFEFEPEY 335 >ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] gi|462409156|gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] Length = 428 Score = 76.6 bits (187), Expect = 4e-12 Identities = 57/165 (34%), Positives = 74/165 (44%), Gaps = 12/165 (7%) Frame = -1 Query: 460 LSGAGRGKP---TKPSVPHPEKTRQTGGREPSQSPNKDTAVREQLSQEEKVRKAKEILSK 290 L G+GRGKP T+P V E+ R R P PN+ + + R + Sbjct: 150 LPGSGRGKPMNFTRPEVQVKEENRHIQAR-PEPDPNQPRTRPRGPNGRGRGRGMR----- 203 Query: 289 XXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRGRFSGHGAA---------DREKLAKR 137 ERGD+ RG+ S A D EKLAK+ Sbjct: 204 -----------GRGRGRGRGRGDFRMSERGDRRRGKDSDGSYASGLYLGDNADGEKLAKK 252 Query: 136 LGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2 LGPEIM+K+VE EEM+S +P P +A VDA T+ M+EC PEY Sbjct: 253 LGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAMHTNFMIECEPEY 297 >ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] gi|561020640|gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] Length = 532 Score = 74.7 bits (182), Expect = 1e-11 Identities = 66/215 (30%), Positives = 91/215 (42%), Gaps = 32/215 (14%) Frame = -1 Query: 550 FRFGEAQSGWTESETPPPKDKA--LPTGILGVLSGAGRGKPTKPSVPHP---EKTRQTGG 386 F+ + S T + P ++A LP I+ VLSG GRGKP K S P E+ R Sbjct: 189 FKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLGRGKPMKQSDPETRVTEENRHLRA 248 Query: 385 REPSQSPNKDTAVREQL--SQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXY 212 + DT Q S+++ VR A+ LS+ Sbjct: 249 PRARGAAASDTLYERQPIPSRDDAVRNARNFLSQGEDDVGGTGRGRGFRERGGLGRGRGR 308 Query: 211 EE-----------RG---DQSRGRFSGHGA-----------ADREKLAKRLGPEIMSKVV 107 RG D+ RGRF A AD EKLAK++GPEIM+++ Sbjct: 309 GRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDIGPYVGDDADGEKLAKKVGPEIMNQLT 368 Query: 106 EGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 2 EG EEMA R +P P ++ +DA + +E PEY Sbjct: 369 EGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPEY 403 >ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum] Length = 504 Score = 74.3 bits (181), Expect = 2e-11 Identities = 57/192 (29%), Positives = 81/192 (42%), Gaps = 28/192 (14%) Frame = -1 Query: 493 DKALPTGILGVLSGAGRGKPTKPSVPHP---EKTRQTGGREPSQSPNKDTAVREQLSQEE 323 D +L VLSGAGRGKP +P+V E+ R R S P + + L+ + Sbjct: 187 DNRFSMSVLKVLSGAGRGKPIEPAVSETQVVEENRHVRNRRASDVPMR----QPMLTGDG 242 Query: 322 KVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRGR--FSGHGAADR-- 155 ++ A++ LSK + RGR F G G DR Sbjct: 243 ALQNARKYLSKFDGDGSGSGRGGEPRERGAFGRGRGRGRGRGRGRGRGGFRGTGGDDRFG 302 Query: 154 ---------------------EKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAF 38 EKLAK++GPE+M++ EG EEM SR +P P ++ V+AF Sbjct: 303 QIQDNARSNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVLPSPLEDEYVEAF 362 Query: 37 QTDIMLECLPEY 2 + +E PEY Sbjct: 363 DINCAIEFEPEY 374 >ref|XP_007014541.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508784904|gb|EOY32160.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 403 Score = 72.4 bits (176), Expect = 7e-11 Identities = 60/183 (32%), Positives = 77/183 (42%), Gaps = 27/183 (14%) Frame = -1 Query: 469 LGVLSGAGRGKPTKPSVPHPEKTRQTGGRE----PSQSPNKDTAVREQLSQEEKVRKAKE 302 + VLSGAGRGKP K P P RQ R QSP+ Q+SQEE +KA Sbjct: 169 VSVLSGAGRGKPVKQ--PEPASRRQEENRHIRVAQQQSPSA------QMSQEEATKKAMG 220 Query: 301 ILSKXXXXXXXXXXXXXXXXXXXXXXXXXYEERGDQSRGRF--------------SGHGA 164 ILS+ + GR SG G+ Sbjct: 221 ILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGS 280 Query: 163 ADR---------EKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECL 11 AD EK A+ +G + M+K+VEG EEM SR +P P +A +DA T+ +E Sbjct: 281 ADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFE 340 Query: 10 PEY 2 PEY Sbjct: 341 PEY 343