BLASTX nr result
ID: Mentha26_contig00024266
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00024266 (1042 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value
gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus...  223  1e-55
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]  192  1e-46
ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citr...  186  2e-44
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...  184  4e-44
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...  183  1e-43
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...  181  3e-43
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot...  181  4e-43
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...  180  1e-42
ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [A...  176  2e-41
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...  174  7e-41
ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas...  171  4e-40
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...  171  4e-40
ref|XP_007146827.1| hypothetical protein PHAVU_006G0732001g, par...  171  5e-40
ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps...  171  6e-40
ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun...  170  8e-40
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...  167  5e-39
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...  167  5e-39
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...  167  5e-39
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr... 
167 9e-39 ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp.... 166 2e-38 >gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus] Length = 493 Score = 223 bits (567), Expect = 1e-55 Identities = 145/341 (42%), Positives = 186/341 (54%), Gaps = 46/341 (13%) Frame = +1 Query: 148 GRGRGVXXXXXXXXXXXXXXXXXXXXESKFDLQPPKPDVKKLFRF---GEAQSGWTESET 318 GRGRGV ES + PPKP+VK F F E Q+ ESE Sbjct: 102 GRGRGVAIPASPTPPPPPPRVS----ESPSEKPPPKPNVKLPFLFVKDEEEQADAAESEV 157 Query: 319 PPPKEKALPTGILGILSGAGRGKP-TIPSAPQPEKTLQTGGREPSQSPNKD--------- 468 P +E L + I+ +LSGAGRGKP P+A QPEK Q+ R Q P + Sbjct: 158 PSAQETLLRSDIVSVLSGAGRGKPGKPPTAAQPEKP-QSENRHIRQRPPQGKPPVAVSSD 216 Query: 469 --TPVREQLSQEEKVRKAKEILS---KXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSRGR 633 P QLS+EE V+KAKEILS + ++ RGR Sbjct: 217 GAAPPAVQLSKEEMVKKAKEILSKGDEDGGVSRPEVRDNRDNRDNRGGGRGGRGERGRGR 276 Query: 634 FSGDGA--------------------------ADREKLAKRLGPEIMSKVVEGLEEMASR 735 G G AD EK+A++LGP++M+++ EG++EM+SR Sbjct: 277 GRGRGRGRGRGRGDDRYEESDDESDALFIGDPADEEKVAQKLGPDVMAQLAEGIDEMSSR 336 Query: 736 AVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYE 915 +P P +A +DAF+T++ +EC PEY MEEFGTNPDIDEK P+PLR ALEKMKPFLM YE Sbjct: 337 VLPSPFDDAYMDAFETNLRIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMVYE 396 Query: 916 GIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRATAKHQ 1032 GI+ QE+WE+++EETMK VPL+K+ V DH GPDR TAK Q Sbjct: 397 GIKDQEEWEKIIEETMKDVPLIKEIV-DHYSGPDRVTAKQQ 436 >gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea] Length = 426 Score = 192 bits (489), Expect = 1e-46 Identities = 114/265 (43%), Positives = 152/265 (57%), Gaps = 24/265 (9%) Frame = +1 Query: 319 PPPKEKALPTGILGILSGAGRGKPTIPSAPQPEKT-LQTGGREPSQSPNKDTPVREQLSQ 495 PPP++ A IL LSG GRG P P + T + R+P P+ +QLS+ Sbjct: 114 PPPRDTAALDDILTNLSGMGRGTPGKPPPQTLKPTPINRHIRQPQPRPSTALSPDQQLSK 173 Query: 496 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSRGRFSGDGA-------- 651 EEK++KA EILS+ GRFSG G Sbjct: 174 EEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRG-----GRFSGRGRGREADAAI 228 Query: 652 
-------------ADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIM 792 AD +K+A++LG E+M+K+ EG+EEM+SR +P +A VDA+ T+++ Sbjct: 229 ESDEELPGMFGDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHTNLL 288 Query: 793 LECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRV 972 LEC PEYFME+FGTNPDID+K P+PLR A EKMKPFLM + GI++QE+WE+++EETM+ V Sbjct: 289 LECEPEYFMEDFGTNPDIDDKPPIPLREAFEKMKPFLMQHIGIETQEEWEQIIEETMESV 348 Query: 973 PLLKKAVQDH--GPDRATAKHQCGE 1041 P KK + DH GPDR TA Q GE Sbjct: 349 PRWKKII-DHYAGPDRVTALQQIGE 372 >ref|XP_006442253.1| hypothetical protein CICLE_v10019766mg [Citrus clementina] gi|557544515|gb|ESR55493.1| hypothetical protein CICLE_v10019766mg [Citrus clementina] Length = 511 Score = 186 bits (471), Expect = 2e-44 Identities = 123/305 (40%), Positives = 160/305 (52%), Gaps = 40/305 (13%) Frame = +1 Query: 238 DLQPPKPDVKKLFRFGEAQSGWTESETPPPKEKALPTGILGILSGAGRGKPTI------- 396 D QP KP + F E+ + T+ P E LP+ I+ L GAGRGK + Sbjct: 174 DAQPAKP---RTFTPNESATDSTQ-----PSEPNLPSSIISTLPGAGRGKTVVTQQQQQQ 225 Query: 397 ------PSAPQPEKTLQTGGR-----EPSQSPNKDT-PVREQLSQEEKVRKAKEILSKXX 540 P P E+ R P ++P +T + +LS+E+ V+ A +ILS+ Sbjct: 226 QHQRQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAMKILSRGE 285 Query: 541 XXXXXXXXXXXXXXXXXXXXXXXXX----------------DQSRGRFSG---DGAADRE 663 D GRF G AD E Sbjct: 286 EGEGEGISAGGPGRGRGMGRGGGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGE 345 Query: 664 KLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPD 843 KLA+++G E M+ +VEG EEM+ R +P P ++A +DA T+ M+E PEY MEEFGTNPD Sbjct: 346 KLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPD 405 Query: 844 IDEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRA 1017 IDEK P+PLR ALEKMKPFLM+YEGIQSQ++WEE + E M+RVPLLK+ V DH GPDR Sbjct: 406 IDEKPPIPLRDALEKMKPFLMAYEGIQSQKEWEEAVNEVMERVPLLKEIV-DHYSGPDRV 464 Query: 1018 TAKHQ 1032 TAK Q Sbjct: 465 TAKQQ 469 >ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis] Length = 407 Score = 184 bits (468), Expect = 4e-44 Identities = 120/306 
(39%), Positives = 157/306 (51%), Gaps = 41/306 (13%) Frame = +1 Query: 238 DLQPPKPDVKKLFRFGEAQSGWTESETPPPKEKALPTGILGILSGAGRGKPTI------- 396 D QP KP + +++ P E LP+ I+ L GAGRGK + Sbjct: 54 DAQPAKPRT--------CTPNESATDSTQPSEPNLPSSIISTLPGAGRGKTAVTQQQQQQ 105 Query: 397 -------PSAPQPEKTLQTGGR-----EPSQSPNKDT-PVREQLSQEEKVRKAKEILSKX 537 P P E+ R P ++P +T + +LS+E+ V+ A ++LS+ Sbjct: 106 QQHQRQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAMKVLSRG 165 Query: 538 XXXXXXXXXXXXXXXXXXXXXXXXXX----------------DQSRGRFSG---DGAADR 660 D GRF G AD Sbjct: 166 EEGEGEGISAGGPGRGRGMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADG 225 Query: 661 EKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNP 840 EKLA+++G E M+ +VEG EEM+ R +P P ++A +DA T+ M+E PEY MEEFGTNP Sbjct: 226 EKLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNP 285 Query: 841 DIDEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDR 1014 DIDEK P+PLR ALEKMKPFLM+YEGIQSQE+WEE + E M+RVPLLK+ V DH GPDR Sbjct: 286 DIDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIV-DHYSGPDR 344 Query: 1015 ATAKHQ 1032 TAK Q Sbjct: 345 VTAKQQ 350 >ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED: uncharacterized protein LOC101247662 isoform 2 [Solanum lycopersicum] Length = 473 Score = 183 bits (464), Expect = 1e-43 Identities = 115/296 (38%), Positives = 161/296 (54%), Gaps = 33/296 (11%) Frame = +1 Query: 244 QPPKPDVKKLFRFGEAQ---SGWTESETPPPKEKA-LPTGILGILSGAGRGKPTIPSAPQ 411 Q +P K +F E + S + S P P++ + LP+ ++ +L+GAGRGKP ++ Sbjct: 121 QQQQPLRKPIFFAKEEETTDSNSSSSNAPKPRDDSNLPSSVISVLTGAGRGKPLQTASSV 180 Query: 412 PEKTLQTGGR-EPSQSPNKD------TPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXX 570 EK + P Q D +P ++LS+E+ V+KA ILS+ Sbjct: 181 SEKPKEENRHLRPRQQKVADSGERASSPPPQRLSREDAVKKAVGILSRSDDGDVGGGRGM 240 Query: 571 XXXXXXXXXXXXXXXDQSRGRFSGDGA---------------------ADREKLAKRLGP 687 RGR G G AD EKLA +LGP Sbjct: 241 GGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERGDGNLESGFYLGDDADGEKLAAKLGP 300 Query: 
688 EIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMP 867 E M+ + EG EEM++R +P P +A ++A T++M+EC PEY M +F +NPDIDE P+P Sbjct: 301 ESMNTLAEGFEEMSARVLPSPMDDAYLEALHTNMMIECEPEYLMGDFESNPDIDETPPIP 360 Query: 868 LRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH-GPDRATAKHQ 1032 LR ALEKMKPFLM+YEGI+ QE+WEEV++ETM+ VPL+K+ V + GPDR TAK Q Sbjct: 361 LRDALEKMKPFLMAYEGIKDQEEWEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQ 416 >ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum] Length = 480 Score = 181 bits (460), Expect = 3e-43 Identities = 112/282 (39%), Positives = 159/282 (56%), Gaps = 34/282 (12%) Frame = +1 Query: 289 AQSGWTESETPPPKEKA-LPTGILGILSGAGRGKPTIPSAPQPEKTLQTGGR-EPSQSPN 462 A S + S+ P P++ + L + ++ +L+GAGRGKP ++P EK + P Q Sbjct: 142 ADSNSSSSDAPTPRDDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKV 201 Query: 463 KDT------PVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXX--- 615 D+ P ++LS+E+ V+KA ILS+ Sbjct: 202 ADSGERASSPPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVR 261 Query: 616 ---------DQSRGRFS---GDGA----------ADREKLAKRLGPEIMSKVVEGLEEMA 729 + RGR GDG+ AD EKLA++LGPE M+ + EG EEM+ Sbjct: 262 GRGGRGRGRGRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMS 321 Query: 730 SRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMS 909 +R +P P +A ++A T++M+EC PEY M +F +NPDIDE P+PLR ALEKMKPFLM+ Sbjct: 322 ARVLPSPMDDAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMA 381 Query: 910 YEGIQSQEDWEEVMEETMKRVPLLKKAVQDH-GPDRATAKHQ 1032 YEGI+ QE+WEEV++ETM+ VPL+K+ V + GPDR TAK Q Sbjct: 382 YEGIKDQEEWEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQ 423 >ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508784903|gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 474 Score = 181 bits (459), Expect = 4e-43 Identities = 126/301 (41%), Positives = 155/301 (51%), Gaps = 39/301 (12%) Frame = +1 Query: 247 PPKPDVKKLFRFGEAQSGWTES------ETPPPKEKALPTGIL--GILSGAGRGKPTIPS 402 PP K+ + 
TES E E P IL +LSGAGRGKP Sbjct: 125 PPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPPNILPVSVLSGAGRGKPV--K 182 Query: 403 APQPEKTLQTGGRE----PSQSPNKDTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXX 570 P+P Q R QSP+ Q+SQEE +KA ILS+ Sbjct: 183 QPEPASRRQEENRHIRVAQQQSPSA------QMSQEEATKKAMGILSRRSESGESGMVGR 236 Query: 571 XXXXXXXXXXXXXXX-------DQSRGRF----------SGDGAADR---------EKLA 672 + RGR SG+G+AD EK A Sbjct: 237 GGRASMGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGSADGLYLGDNADGEKFA 296 Query: 673 KRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDE 852 + +G + M+K+VEG EEM SR +P P +A +DA T+ +E PEY MEEFGTNPDIDE Sbjct: 297 QTIGADNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFEPEYLMEEFGTNPDIDE 356 Query: 853 KAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH-GPDRATAKH 1029 K PMPLR ALEKMKPFLM+YEGIQSQE+WEEV++ETM+RVPLL++ V + GPDR TAK Sbjct: 357 KPPMPLRDALEKMKPFLMAYEGIQSQEEWEEVIKETMERVPLLQEIVDYYSGPDRVTAKK 416 Query: 1030 Q 1032 Q Sbjct: 417 Q 417 >ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis] gi|223537066|gb|EEF38701.1| conserved hypothetical protein [Ricinus communis] Length = 436 Score = 180 bits (456), Expect = 1e-42 Identities = 107/266 (40%), Positives = 146/266 (54%), Gaps = 17/266 (6%) Frame = +1 Query: 286 EAQSGWTESETPPPKEKALPTGILGILSGAGRGKPTIPSAPQPE-----KTLQTGGREPS 450 + + G + T + LP+ I LSG GRG+P P P P+ + ++ R Sbjct: 115 DPEPGPSRQPTESQSDSVLPSTIHSSLSGFGRGEPDKPVVPTPQVKEENRHIRDRSRAKP 174 Query: 451 QSPNKDTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSR- 627 ++ + + ++S+EE V++A ILS+ + R Sbjct: 175 KTEEAEVRAKPKISREEAVKRAVSILSQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRM 234 Query: 628 ----------GRFSGDGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAF 777 G F GD A D EKLA ++G E M+K+VEG EEM+ R +P P ++A +DA Sbjct: 235 MDDVDEGFGSGLFLGDNA-DGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDAL 293 Query: 778 QTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEE 957 T+ M+E PEY M EF NPDIDEK PMPLR LEK+KPF+M+YEGIQSQE+WE +EE Sbjct: 294 HTNYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEE 353 Query: 958 
TMKRVPLLKKAVQDH-GPDRATAKHQ 1032 TMK VPL K+ V + GPDR TAK Q Sbjct: 354 TMKNVPLFKEIVDYYSGPDRITAKKQ 379 >ref|XP_006837366.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda] gi|548839984|gb|ERN00220.1| hypothetical protein AMTR_s00111p00111440 [Amborella trichopoda] Length = 447 Score = 176 bits (445), Expect = 2e-41 Identities = 115/290 (39%), Positives = 151/290 (52%), Gaps = 27/290 (9%) Frame = +1 Query: 244 QPP--KPDVKKLFRFGEAQSGWTESETPPPKEKALPTGILGI-LSGAGRGKPTIPSAP-- 408 +PP KP K G +++ PP E LP I + G GRGKPT P Sbjct: 122 EPPSRKPIFFKRDEIEGTDEGRVQAQNLPPTESPLPRSISPAPIEGFGRGKPTSPLLSHG 181 Query: 409 -QPEKTLQTGGREP-----SQSPNKDTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXX 570 + E+ R P Q+ +LS EE VR AK+ILS+ Sbjct: 182 IEEEENRHIRRRSPPPERAGQASRGRASNERKLSSEEAVRNAKDILSRGEGRGGRGLRGG 241 Query: 571 XXXXXXXXXXXXXXX---------------DQSRGRFSGDGAADREKLAKRLGPEIMSKV 705 D S G + GD A D EKL KRLG E ++++ Sbjct: 242 RGLRGGRGRGGVWAGRGRQGRGARYQDRREDDSVGLYLGDDA-DGEKLVKRLGEENVNQI 300 Query: 706 VEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALE 885 E +EM+ R +P P +EA +DA T+ ++E PEY MEEFGTNPDIDEK P+PL ALE Sbjct: 301 FEAFDEMSGRVLPSPMEEAYLDALHTNCLIEFEPEYHMEEFGTNPDIDEKPPIPLCDALE 360 Query: 886 KMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH-GPDRATAKHQ 1032 K+KPF+M+YEGIQ+QE+WEEV++ETM +VP LK+ V + GPDR TA+ Q Sbjct: 361 KIKPFIMTYEGIQNQEEWEEVVKETMDKVPYLKELVDIYSGPDRVTARQQ 410 >ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum] Length = 504 Score = 174 bits (440), Expect = 7e-41 Identities = 118/298 (39%), Positives = 155/298 (52%), Gaps = 31/298 (10%) Frame = +1 Query: 232 KFDLQPPKPDVKKLFRFGEAQSGWTESETPPPKEKALPTGILGILSGAGRGKPTIPSAPQ 411 K D+ PPK K +F E S + + + +L +LSGAGRGKP P+ + Sbjct: 158 KDDVSPPK---KPVFTRREDFSP-IDLSSDQESDNRFSMSVLKVLSGAGRGKPIEPAVSE 213 Query: 412 PEKTLQTGGREPSQSPNKDTPVRE-QLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXX 588 + + R D P+R+ L+ + ++ A++ LSK Sbjct: 214 TQVVEEN--RHVRNRRASDVPMRQPMLTGDGALQNARKYLSKFDGDGSGSGRGGEPRERG 271 Query: 589 
XXXXXXXXX-----DQSRGRFSGDGAADR-----------------------EKLAKRLG 684 + RG F G G DR EKLAK++G Sbjct: 272 AFGRGRGRGRGRGRGRGRGGFRGTGGDDRFGQIQDNARSNASGLFLGDDVDGEKLAKKVG 331 Query: 685 PEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPM 864 PE+M++ EG EEM SR +P P ++ V+AF + +E PEY ME F +NPDIDEK P+ Sbjct: 332 PEVMNQFTEGFEEMISRVLPSPLEDEYVEAFDINCAIEFEPEYIME-FDSNPDIDEKEPI 390 Query: 865 PLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRATAKHQ 1032 PLR ALEKMKPFLM+YEGIQSQE+WE +MEETM+RVPLLKK V DH GPDR TAK Q Sbjct: 391 PLRDALEKMKPFLMNYEGIQSQEEWEAIMEETMERVPLLKKIV-DHYSGPDRVTAKKQ 447 >ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] gi|561020640|gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] Length = 532 Score = 171 bits (434), Expect = 4e-40 Identities = 122/304 (40%), Positives = 156/304 (51%), Gaps = 39/304 (12%) Frame = +1 Query: 238 DLQPPKPDVKK--LFRFGEAQSGWTESETPPPKEKA--LPTGILGILSGAGRGKPTIPSA 405 DL PP KK F+ + S T + P E+A LP I+ +LSG GRGKP S Sbjct: 175 DLGPPDSGPKKPIFFKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLGRGKPMKQSD 234 Query: 406 PQPEKTLQTGGREPSQSPN---KDTPVREQL--SQEEKVRKAKEILS------------- 531 P+ T + ++ DT Q S+++ VR A+ LS Sbjct: 235 PETRVTEENRHLRAPRARGAAASDTLYERQPIPSRDDAVRNARNFLSQGEDDVGGTGRGR 294 Query: 532 ----KXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSRGRFSGDGA-----------ADREK 666 + D+ RGRF A AD EK Sbjct: 295 GFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDIGPYVGDDADGEK 354 Query: 667 LAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDI 846 LAK++GPEIM+++ EG EEMA R +P P ++ +DA + +E PEY +E NPDI Sbjct: 355 LAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPEYLVEF--DNPDI 412 Query: 847 DEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRAT 1020 DEK P+PLR ALEKMKPFLM+YEGIQSQE+WEE+MEETM +VPLLK+ V DH GPDR T Sbjct: 413 DEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQVPLLKEIV-DHYSGPDRVT 471 Query: 1021 AKHQ 1032 AK Q Sbjct: 472 AKKQ 475 >ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max] 
gi|571476117|ref|XP_006586864.1| PREDICTED: la-related protein 1 isoform X2 [Glycine max] Length = 481 Score = 171 bits (434), Expect = 4e-40 Identities = 126/315 (40%), Positives = 159/315 (50%), Gaps = 50/315 (15%) Frame = +1 Query: 238 DLQPPKPDVKK--LFRFGEAQSGWTESETPPPK-------EKALPTGILGILSGAGRGKP 390 DLQPP KK F+ ++ S ++ PPK + LP I G+LSG GRGK Sbjct: 121 DLQPPDSGPKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDNKLPGSIPGVLSGLGRGK- 179 Query: 391 TIPSAPQPE------------KTLQTGGREPSQSPNKDTPVREQLSQEEKVRKAKEIL-- 528 S QP+ +T Q G S++ K +P+ SQE+ R A +IL Sbjct: 180 ---SMKQPDLETQVTEENRHLRTRQAPGAASSETVPKRSPIP---SQEDATRNALKILSH 233 Query: 529 -----SKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQSRGRF------------------- 636 S RGRF Sbjct: 234 GKDDGSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDVDEKVMDTDDYATGL 293 Query: 637 -SGDGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEY 813 +GD A D EKLA+++GPEIM+++ EG EEM SR +P P ++ +DA + +E PEY Sbjct: 294 YAGDDA-DGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDALDINYAIEFEPEY 352 Query: 814 FMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAV 993 +E NPDIDEK P+ LR ALEK KPFLMSYEGIQSQE+WEE+MEETM RVPLLKK + Sbjct: 353 LVEF--DNPDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIMEETMARVPLLKKII 410 Query: 994 QDH--GPDRATAKHQ 1032 DH GPDR TAK Q Sbjct: 411 -DHYSGPDRVTAKKQ 424 >ref|XP_007146827.1| hypothetical protein PHAVU_006G0732001g, partial [Phaseolus vulgaris] gi|561020050|gb|ESW18821.1| hypothetical protein PHAVU_006G0732001g, partial [Phaseolus vulgaris] Length = 471 Score = 171 bits (433), Expect = 5e-40 Identities = 121/303 (39%), Positives = 153/303 (50%), Gaps = 38/303 (12%) Frame = +1 Query: 238 DLQPPKPDVKKLFRFGEAQSGWTESETPPPKEKA--LPTGILGILSGAGRGKPTIPSAPQ 411 DL PP P FR ++ S P E LP I G+LSG GRGKP P+ Sbjct: 144 DLGPPGPKKPIFFRRKDSVSPTVTDGFPIDVEHVNKLPGTIPGVLSGLGRGKPMKQPEPE 203 Query: 412 PEKTLQTGGREPSQSPN---KDT-PVREQLSQ-EEKVRKAKEILSKXXXXXXXXXXXXXX 576 T + P ++P DT P R+ + + ++ VR A+ LS+ Sbjct: 204 TRVTEENRHLRPPRAPGAAASDTLPERQPMPRRDDAVRNARNFLSQGEDDGSGTGRGRGF 263 Query: 577 
XXXXXXXXXXXXXDQSRGRFSGDGA-----------------------------ADREKL 669 + RGR G G AD E+L Sbjct: 264 RGRGGLGRGRG---RGRGRGIGRGGFRGRDINERLGRFMDADDSDVAGLYVGDDADGERL 320 Query: 670 AKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDID 849 AK+ GPEIM+++ EG EE+A R +P P ++ +DA + +E PEY +E NPDID Sbjct: 321 AKKFGPEIMNQLTEGFEEVAGRVLPSPLEDEYLDALDINYAIEFEPEYLVEF--DNPDID 378 Query: 850 EKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRATA 1023 EK P+PLR ALEKMKPFLM+YEGIQSQE+WEE+MEETM RVPLLKK V DH GPDR TA Sbjct: 379 EKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMARVPLLKKIV-DHYSGPDRVTA 437 Query: 1024 KHQ 1032 K Q Sbjct: 438 KKQ 440 >ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] gi|482575944|gb|EOA40131.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] Length = 525 Score = 171 bits (432), Expect = 6e-40 Identities = 105/283 (37%), Positives = 150/283 (53%), Gaps = 42/283 (14%) Frame = +1 Query: 310 SETPPPKEKA----LPTGILGIL-------SGAGRGKPTIPSAP--------------QP 414 S P P+ K+ LP + L SGAGRGKP + SAP P Sbjct: 189 SSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGRGKPLVESAPIQREENRHIRRPPPPP 248 Query: 415 EKTLQTGGREPSQSPNKDTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXX 594 ++ ++ +Q+P +TP R +LS EE R+A+ LS+ Sbjct: 249 QQQRSQPQQKRAQTPRDETP-RPRLSAEEAGRRARSELSRGEAEGSGVRGRGGRGRGRGA 307 Query: 595 XXXXXXX---------------DQSRGRFSGDGAADREKLAKRLGPEIMSKVVEGLEEMA 729 ++ F+GD +AD EK A ++GPE+M + EG EE+ Sbjct: 308 RGRGRGRGGEGWRDDKKEEEGEQEAMSVFAGD-SADGEKFANKMGPELMKTLAEGFEEVC 366 Query: 730 SRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMS 909 +A+P +A++DA+ T++M+EC PEY M +FG+NPDIDEK PM LR LEK+KPF+++ Sbjct: 367 EKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVA 426 Query: 910 YEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRATAKHQ 1032 YEGI+ QE+WEE + E M + PL+K+ V DH GPDR TAK Q Sbjct: 427 YEGIKDQEEWEEAINEAMAQAPLMKEIV-DHYSGPDRVTAKKQ 468 >ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] gi|462409156|gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg 
[Prunus persica] Length = 428 Score = 170 bits (431), Expect = 8e-40 Identities = 89/137 (64%), Positives = 105/137 (76%), Gaps = 2/137 (1%) Frame = +1 Query: 628 GRFSGDGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLP 807 G + GD A D EKLAK+LGPEIM+K+VE EEM+S +P P +A VDA T+ M+EC P Sbjct: 237 GLYLGDNA-DGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAMHTNFMIECEP 295 Query: 808 EYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKK 987 EY M EF NPDIDEK P+ LR ALEKMKPFLM+YE I+SQE+WEEV+ ETM+RVPLLK+ Sbjct: 296 EYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNETMERVPLLKE 355 Query: 988 AVQDH--GPDRATAKHQ 1032 V DH GPDR TAK Q Sbjct: 356 IV-DHYSGPDRVTAKKQ 371 >gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain. ESTs gb|H37317, gb|F14415, gb|AA651290 come from this gene [Arabidopsis thaliana] Length = 829 Score = 167 bits (424), Expect = 5e-39 Identities = 111/288 (38%), Positives = 150/288 (52%), Gaps = 47/288 (16%) Frame = +1 Query: 310 SETPPPKEKA----LPTGILGIL-------SGAGRGKPTIPSAP----------QPEKTL 426 S PPP+ K P I L SGAGRGKP + SAP +P Sbjct: 491 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPP 550 Query: 427 QTGGREPSQ--SPN-KDTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXX 597 Q +P Q +P KD + QLS EE R+A+ LS+ Sbjct: 551 QQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRG 610 Query: 598 XXXXXXDQSRGR---------------------FSGDGAADREKLAKRLGPEIMSKVVEG 714 + RGR F+GD +AD EK A+++GPE+M + EG Sbjct: 611 ARG----RGRGRGGDGWRDDKKEEEGEQEAMRIFAGD-SADGEKFAEKMGPELMKTLAEG 665 Query: 715 LEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMK 894 EE+ +A+P +A++DA+ T++M+EC PEY M +FG+NPDIDEK PM LR LEK+K Sbjct: 666 FEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVK 725 Query: 895 PFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRATAKHQ 1032 PF+++YEGI+ QE+WEE + E M + PL+K+ V DH GPDR TAK Q Sbjct: 726 PFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIV-DHYSGPDRVTAKKQ 772 >ref|NP_564639.1| 
hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown protein; 43598-45751 [Arabidopsis thaliana] gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|110740318|dbj|BAF02054.1| hypothetical protein [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 523 Score = 167 bits (424), Expect = 5e-39 Identities = 111/288 (38%), Positives = 150/288 (52%), Gaps = 47/288 (16%) Frame = +1 Query: 310 SETPPPKEKA----LPTGILGIL-------SGAGRGKPTIPSAP----------QPEKTL 426 S PPP+ K P I L SGAGRGKP + SAP +P Sbjct: 185 SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPP 244 Query: 427 QTGGREPSQ--SPN-KDTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXX 597 Q +P Q +P KD + QLS EE R+A+ LS+ Sbjct: 245 QQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRG 304 Query: 598 XXXXXXDQSRGR---------------------FSGDGAADREKLAKRLGPEIMSKVVEG 714 + RGR F+GD +AD EK A+++GPE+M + EG Sbjct: 305 ARG----RGRGRGGDGWRDDKKEEEGEQEAMRIFAGD-SADGEKFAEKMGPELMKTLAEG 359 Query: 715 LEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMK 894 EE+ +A+P +A++DA+ T++M+EC PEY M +FG+NPDIDEK PM LR LEK+K Sbjct: 360 FEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVK 419 Query: 895 PFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRATAKHQ 1032 PF+++YEGI+ QE+WEE + E M + PL+K+ V DH GPDR TAK Q Sbjct: 420 PFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIV-DHYSGPDRVTAKKQ 466 >gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain [Arabidopsis thaliana] Length = 523 Score = 167 bits (424), Expect = 5e-39 Identities = 111/288 (38%), Positives = 150/288 (52%), Gaps = 47/288 (16%) Frame = +1 Query: 310 SETPPPKEKA----LPTGILGIL-------SGAGRGKPTIPSAP----------QPEKTL 426 S PPP+ K P I L SGAGRGKP + SAP +P Sbjct: 185 
SSPPPPESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESAPIRQEDNRQIRRPPPPP 244 Query: 427 QTGGREPSQ--SPN-KDTPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXX 597 Q +P Q +P KD + QLS EE R+A+ LS+ Sbjct: 245 QQQRVQPQQKRAPTVKDGTPKPQLSAEEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRG 304 Query: 598 XXXXXXDQSRGR---------------------FSGDGAADREKLAKRLGPEIMSKVVEG 714 + RGR F+GD +AD EK A+++GPE+M + EG Sbjct: 305 ARG----RGRGRGGDGWRDDKKEEEGEQEAMRIFAGD-SADGEKFAEKMGPELMKTLAEG 359 Query: 715 LEEMASRAVPDPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMK 894 EE+ +A+P +A++DA+ T++M+EC PEY M +FG+NPDIDEK PM LR LEK+K Sbjct: 360 FEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEKVK 419 Query: 895 PFLMSYEGIQSQEDWEEVMEETMKRVPLLKKAVQDH--GPDRATAKHQ 1032 PF+++YEGI+ QE+WEE + E M + PL+K+ V DH GPDR TAK Q Sbjct: 420 PFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIV-DHYSGPDRVTAKKQ 466 >ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550322664|gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 466 Score = 167 bits (422), Expect = 9e-39 Identities = 106/277 (38%), Positives = 138/277 (49%), Gaps = 35/277 (12%) Frame = +1 Query: 307 ESETPPPKEKALPTGILGILSGAGRGKPT---IPSAPQPEKTLQTGGREPSQS------- 456 ESE P E LP IL L GAGRGKP +P P E+ R +S Sbjct: 133 ESEPPKKAEANLPPSILSGLGGAGRGKPVKQEVPIEPAKEENRHLRARSQPRSQPRTRQQ 192 Query: 457 --PNKD--TPVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXXDQS 624 P+ D P ++ ++E V+KA E+LS+ + Sbjct: 193 KTPDGDDAVPATTKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARG 252 Query: 625 RGRFSGDGAA--------------------DREKLAKRLGPEIMSKVVEGLEEMASRAVP 744 GR G G D EK A+ +G E M+ +VE EEM+ R +P Sbjct: 253 GGRGRGRGRRGYGDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLP 312 Query: 745 DPHKEALVDAFQTDIMLECLPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQ 924 P ++ VDAF T+ E PEY M EF NPDIDEK PMPLR ALEK+KPF+M+Y GI+ Sbjct: 313 CPIEDEYVDAFDTNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIK 372 Query: 925 SQEDWEEVMEETMKRVPLLKKAVQDH-GPDRATAKHQ 1032 + E+WEE++EETMK PL+KK V + GPDR + K Q Sbjct: 373 
THEEWEEIVEETMKDAPLMKKIVDSYSGPDRVSGKKQ 409 >ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297340299|gb|EFH70716.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 769 Score = 166 bits (419), Expect = 2e-38 Identities = 99/256 (38%), Positives = 137/256 (53%), Gaps = 35/256 (13%) Frame = +1 Query: 370 GAGRGKPTIPSAP-QPEKTLQTGGREPSQSPN-----------------KDTPVREQLSQ 495 GAGRGKP + SAP Q E Q +P P KD + QLS+ Sbjct: 459 GAGRGKPLVESAPIQQEDNRQIRRPQPPPPPQQQQQQRAQPQQKRAPTVKDEAPKPQLSR 518 Query: 496 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXXXXX---------------DQSRG 630 EE R+A+ LS+ ++ Sbjct: 519 EEAGRRARSELSRGEAEGGGVRGRGGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMS 578 Query: 631 RFSGDGAADREKLAKRLGPEIMSKVVEGLEEMASRAVPDPHKEALVDAFQTDIMLECLPE 810 F+GD +AD EK A+++GPE+M + EG EE+ +A+P +A++DA+ T++M+EC PE Sbjct: 579 IFAGD-SADGEKFAQKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIECEPE 637 Query: 811 YFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSQEDWEEVMEETMKRVPLLKKA 990 Y M +FG+NPDIDEK PM LR LEK+KPF+++YEGI+ QE+WEE + E M + PL+K+ Sbjct: 638 YIMADFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAVNEAMAQAPLMKEI 697 Query: 991 VQDH--GPDRATAKHQ 1032 V DH GPDR TAK Q Sbjct: 698 V-DHYSGPDRVTAKKQ 712