BLASTX nr result
ID: Rehmannia25_contig00006865
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00006865
         (1877 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   387   e-105
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   384   e-104
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...   357   1e-95
gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, ...   352   2e-94
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]       348   5e-93
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   342   3e-91
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...   327   1e-86
gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus...   319   2e-84
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...   318   5e-84
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   318   7e-84
gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus pe...   317   1e-83
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...   315   3e-83
emb|CBI17195.3| unnamed protein product [Vitis vinifera]              308   5e-81
ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps...   306   2e-80
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...   303   1e-79
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...   303   2e-79
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...   303   2e-79
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...   303   2e-79
ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutr...
301 6e-79 ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300... 301 6e-79 >ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum] Length = 480 Score = 387 bits (995), Expect = e-105 Identities = 227/438 (51%), Positives = 273/438 (62%), Gaps = 10/438 (2%) Frame = -1 Query: 1679 DKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXXX 1500 D KP++S +P GHGRGRG ++N + PP GRGRG I Sbjct: 58 DSKPESSTP--TTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPN--PPAGRGRGGIGPFS 113 Query: 1499 XXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDE-AQYNAAESEIPAIQEKP-LPNDVINVI 1326 Q + +KP+ F K++E A N++ S+ P ++ L + VI+V+ Sbjct: 114 PPPQPQQQQ-----QQQQQPLRKPIFFAKEEETADSNSSSSDAPTPRDDSNLSSSVISVL 168 Query: 1325 SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKV 1146 +GAGRGKP+++ +P SEKPK ENRH+R RQQ ++LS+E+ V Sbjct: 169 TGAGRGKPLQTASPVSEKPKEENRHLRPRQQKVADSGERASSPPP------QRLSREDAV 222 Query: 1145 KKAKEILSRGEP------VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESD 984 KKA ILSR + V R EE Sbjct: 223 KKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERG 282 Query: 983 DEA--SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLM 810 D + SG YLGD AD EK+AQKLGPE MN LAEGFEEMS+RVLPSP+DDAY++A HTN+M Sbjct: 283 DGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLPSPMDDAYIEALHTNMM 342 Query: 809 IECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKV 630 IECEPEYLM +F +NPDIDE PPIPLRDALEKMKPFLMAYEGI+ Q ETM+ V Sbjct: 343 IECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMETV 402 Query: 629 PLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFD 450 PL+KEIVD+YSGPDRVTAKQQQ+ELERVAKTLP SAP+ VKRFTERAVLSLQSNPGWGFD Sbjct: 403 PLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFD 462 Query: 449 KKCQFMDKLVMEVSQQYK 396 KKCQFMDK+VME SQ YK Sbjct: 463 KKCQFMDKVVMEASQHYK 480 >ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED: uncharacterized protein LOC101247662 isoform 2 [Solanum lycopersicum] Length = 473 
Score = 384 bits (987), Expect = e-104 Identities = 227/434 (52%), Positives = 270/434 (62%), Gaps = 6/434 (1%) Frame = -1 Query: 1679 DKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXXX 1500 D KP++S A+P GHGRGRG ++N + P GRGRG I Sbjct: 58 DSKPESSTP--ATPSGTGHGRGRGKPLPSSPIVPSFHSFVDNPNT--PAGRGRGGIGPFS 113 Query: 1499 XXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEA-QYNAAESEIPAIQEKP-LPNDVINVI 1326 Q + +KP+ F K++E N++ S P ++ LP+ VI+V+ Sbjct: 114 PPPQPQ--------QQQQQPLRKPIFFAKEEETTDSNSSSSNAPKPRDDSNLPSSVISVL 165 Query: 1325 SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKV 1146 +GAGRGKP+++ + SEKPK ENRH+R RQQ ++LS+E+ V Sbjct: 166 TGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSGERASSPPP------QRLSREDAV 219 Query: 1145 KKAKEILSRGEP--VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDE-- 978 KKA ILSR + V R EE D Sbjct: 220 KKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERGDGNL 279 Query: 977 ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECE 798 SG YLGD AD EK+A KLGPE MN LAEGFEEMS+RVLPSP+DDAYL+A HTN+MIECE Sbjct: 280 ESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYLEALHTNMMIECE 339 Query: 797 PEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIK 618 PEYLM +F +NPDIDE PPIPLRDALEKMKPFLMAYEGI+ Q ETM+ VPL+K Sbjct: 340 PEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMETVPLMK 399 Query: 617 EIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQ 438 EIVD+YSGPDRVTAKQQQ+ELERVAKTLP SAP+ VKRFTERAVLSLQSNPGWGFDKKCQ Sbjct: 400 EIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFDKKCQ 459 Query: 437 FMDKLVMEVSQQYK 396 FMDK+VMEVSQ YK Sbjct: 460 FMDKVVMEVSQHYK 473 >ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera] Length = 482 Score = 357 bits (916), Expect = 1e-95 Identities = 221/461 (47%), Positives = 268/461 (58%), Gaps = 21/461 (4%) Frame = -1 Query: 1715 ATPFQFTADSPSDKKP--DNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKA 1542 A+PF F + +P +P D +++ SP P G G GRG + + Sbjct: 42 ASPFDFASGAPEKTEPTADPNSESSESPFPLGLGHGRGKPPSQPSAPTLPSF---SSFAS 98 Query: 
1541 PPLGRGRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAA-ESEIPAI 1365 +GRGRG + P KKP+ F K+D A +S++ Sbjct: 99 TGIGRGRGRLTAHPTDSVPQ--------QSPDFAPKKPIFFSKEDAADSAPKPQSQLGTT 150 Query: 1364 --QEKPLPNDVINVISG-AGRGKPMK-SPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXX 1197 +E LP +++ +SG AGRG+P+K +PAP PK ENRH+RQ +QP Sbjct: 151 PPEENNLPVSILSALSGGAGRGQPLKQTPAP----PKEENRHLRQPRQP-----VFRSPQ 201 Query: 1196 XXXXXXPREQLSQEEKVKKAKEILSRG----------EPVXXXXXXXXXXXXXXXXXXXX 1047 P+ +LS+EE VKKA ILSRG E Sbjct: 202 QPVAGPPQPRLSREEAVKKAVGILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWM 261 Query: 1046 XXXXXXXXXXXXGDDRY----EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEE 879 DR + DD +GLYLGD AD EK++ K+G E M+KL E FEE Sbjct: 262 GRGRGRGRGRGRMGDRRGRGGDAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEE 321 Query: 878 MSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFL 699 MS RVLPSP++DAYLDA HTN +IE EPEYLMEEFGTNPDIDE PPIPLRDALEKMKPFL Sbjct: 322 MSGRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFL 381 Query: 698 MAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAP 519 M YEGIQSQ ETM+ VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP Sbjct: 382 MQYEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAP 441 Query: 518 DPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396 + VKRFT+RA+LSLQSNPGWGFDKKCQFMDKLV EVSQ YK Sbjct: 442 NSVKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHYK 482 >gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 474 Score = 352 bits (904), Expect = 2e-94 Identities = 220/447 (49%), Positives = 251/447 (56%), Gaps = 17/447 (3%) Frame = -1 Query: 1685 PSDKKPDNSNDDGASPLPRG--HGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFI 1512 P +SN D A P G HGRGRG S G GRG + Sbjct: 62 PGKSGSGDSNRDSAESPPAGVGHGRGRGGPLSSDPIPHPF------SSFVSQTGSGRGRV 115 Query: 1511 XXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVK---DDEAQYNAAESEIPAIQEKPL--P 1347 P P K +F+K +DE + +A + P +P+ P Sbjct: 116 TSESVPPPP-----------PPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPP 164 Query: 1346 NDV-INVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPRE 
1170 N + ++V+SGAGRGKP+K P P S + + ENRHIR QQ Sbjct: 165 NILPVSVLSGAGRGKPVKQPEPASRRQE-ENRHIRVAQQQSPSA---------------- 207 Query: 1169 QLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEE 990 Q+SQEE KKA ILSR G R + Sbjct: 208 QMSQEEATKKAMGILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQG 267 Query: 989 SDDE---------ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAY 837 D A GLYLGD AD EK AQ +G + MNKL EGFEEM SRVLPSP+DDAY Sbjct: 268 EDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAY 327 Query: 836 LDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXX 657 LDA HTN IE EPEYLMEEFGTNPDIDEKPP+PLRDALEKMKPFLMAYEGIQSQ Sbjct: 328 LDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMPLRDALEKMKPFLMAYEGIQSQEEWEE 387 Query: 656 XXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSL 477 ETM++VPL++EIVD+YSGPDRVTAK+QQEELERVAKT+P AP VK+F RAVLSL Sbjct: 388 VIKETMERVPLLQEIVDYYSGPDRVTAKKQQEELERVAKTIPERAPSSVKQFANRAVLSL 447 Query: 476 QSNPGWGFDKKCQFMDKLVMEVSQQYK 396 QSNPGWGFDKKCQFMDKLV EVSQQYK Sbjct: 448 QSNPGWGFDKKCQFMDKLVWEVSQQYK 474 >gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea] Length = 426 Score = 348 bits (893), Expect = 5e-93 Identities = 211/441 (47%), Positives = 252/441 (57%), Gaps = 3/441 (0%) Frame = -1 Query: 1709 PFQFTADSPSDKKPDNSNDDGASPL---PRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAP 1539 P F ++ PS ++ SP P G GRGR ++NDS AP Sbjct: 38 PNTFASNKPSGSVELGNSKIDDSPTTAPPYGRGRGRIQPLPSSPLLPSFASIVSNDSGAP 97 Query: 1538 PLGRGRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQE 1359 P+G GRG I P P D L Sbjct: 98 PIGGGRGKIPTRPPLP-------------PPPRDTAAL---------------------- 122 Query: 1358 KPLPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXX 1179 +D++ +SG GRG P K P PQ+ KP NRHIRQ QP+ Sbjct: 123 ----DDILTNLSGMGRGTPGKPP-PQTLKPTPINRHIRQ-PQPRPSTALSPD-------- 168 Query: 1178 PREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDR 999 +QLS+EEK+KKA EILSRG+P D Sbjct: 169 --QQLSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRGGRFSGRGRGREADA 226 Query: 998 
YEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHT 819 ESD+E G++ GDPADE+K+A+KLG EVMNK+ EG EEMSSRVLPS +DDAY+DAYHT Sbjct: 227 AIESDEELPGMF-GDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHT 285 Query: 818 NLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETM 639 NL++ECEPEY ME+FGTNPDID+KPPIPLR+A EKMKPFLM + GI++Q ETM Sbjct: 286 NLLLECEPEYFMEDFGTNPDIDDKPPIPLREAFEKMKPFLMQHIGIETQEEWEQIIEETM 345 Query: 638 KKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGW 459 + VP K+I+DHY+GPDRVTA QQ ELERVA TLPA+AP VKRFTERAVLSL+SNPGW Sbjct: 346 ESVPRWKKIIDHYAGPDRVTALQQIGELERVAGTLPATAPASVKRFTERAVLSLKSNPGW 405 Query: 458 GFDKKCQFMDKLVMEVSQQYK 396 GF KKCQFMDK+VMEVSQQYK Sbjct: 406 GFKKKCQFMDKVVMEVSQQYK 426 >ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis] Length = 407 Score = 342 bits (877), Expect = 3e-91 Identities = 191/345 (55%), Positives = 223/345 (64%), Gaps = 12/345 (3%) Frame = -1 Query: 1394 NAAESEIPAIQEKPLPNDVINVISGAGRGKPMKSPAPQSEK----------PKAENRHIR 1245 +A +S P+ E LP+ +I+ + GAGRGK + Q ++ P+ ENRHIR Sbjct: 68 SATDSTQPS--EPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIR 125 Query: 1244 QRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXX 1065 R QP+ + +LS+E+ VK A ++LSRGE Sbjct: 126 ARLQPQPRPEKAPAAETGSA---QPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGR 182 Query: 1064 XXXXXXXXXXXXXXXXXXGDDRYEESDDEA--SGLYLGDPADEEKMAQKLGPEVMNKLAE 891 + E D++ GLYLGD AD EK+A+K+G E MN L E Sbjct: 183 GMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVE 242 Query: 890 GFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 711 GFEEMS RVLPSP++DAY+DA HTN MIE EPEYLMEEFGTNPDIDEKPPIPLRDALEKM Sbjct: 243 GFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 302 Query: 710 KPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLP 531 KPFLMAYEGIQSQ E M++VPL+KEIVDHYSGPDRVTAKQQ EELERVAKT+P Sbjct: 303 KPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIP 362 Query: 530 ASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396 SAP +KRF 
RAVLSLQSNPGWGFDKKCQFMDKL EVSQQYK Sbjct: 363 ESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKLAWEVSQQYK 407 >ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max] gi|571476117|ref|XP_006586864.1| PREDICTED: la-related protein 1 isoform X2 [Glycine max] Length = 481 Score = 327 bits (837), Expect = 1e-86 Identities = 194/439 (44%), Positives = 238/439 (54%), Gaps = 15/439 (3%) Frame = -1 Query: 1667 DNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXXXXXXX 1488 ++ +D P+P G G G G + PP GRGRG Sbjct: 64 ESKSDTTEPPIPPGSGLGHGRGKPMPPSGLPSFSSFISSINQPPAGRGRG---------- 113 Query: 1487 XXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPA------IQEKPLPNDVINVI 1326 P KKP+ F ++D A+ +P + LP + V+ Sbjct: 114 -TAPHPQHDLQPPDSGPKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDNKLPGSIPGVL 172 Query: 1325 SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKV 1146 SG GRGK MK P +++ + ENRH+R RQ P SQE+ Sbjct: 173 SGLGRGKSMKQPDLETQVTE-ENRHLRTRQAPGAASSETVPKRSPIP-------SQEDAT 224 Query: 1145 KKAKEILSRGEP---------VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYE 993 + A +ILS G+ D++ Sbjct: 225 RNALKILSHGKDDGSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDVDEKVM 284 Query: 992 ESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNL 813 ++DD A+GLY GD AD EK+A+K+GPE+MN+L EGFEEM+SRVLPSPL+D +LDA N Sbjct: 285 DTDDYATGLYAGDDADGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDALDINY 344 Query: 812 MIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKK 633 IE EPEYL+E NPDIDEK PI LRDALEK KPFLM+YEGIQSQ ETM + Sbjct: 345 AIEFEPEYLVEF--DNPDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIMEETMAR 402 Query: 632 VPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGF 453 VPL+K+I+DHYSGPDRVTAK+QQEELERVAKTLP S P VK+FT RAV+SLQSNPGWGF Sbjct: 403 VPLLKKIIDHYSGPDRVTAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSNPGWGF 462 Query: 452 DKKCQFMDKLVMEVSQQYK 396 DKKC FMDKLV EVSQ YK Sbjct: 463 DKKCHFMDKLVWEVSQHYK 481 >gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] Length = 532 Score = 319 bits (818), Expect = 2e-84 Identities = 216/498 (43%), 
Positives = 262/498 (52%), Gaps = 60/498 (12%) Frame = -1 Query: 1709 PFQFTADSPSDKKPDNS---NDDGASPLP----RGHGRGRGTXXXXXXXXXXXXXXLN-- 1557 PF F +P KP++S +D SP+P GHGRG+ +N Sbjct: 49 PFNFNERAPG--KPNSSEPKSDTTESPIPPGSGHGHGRGKPMPPSGLPSFSSFLSSINQP 106 Query: 1556 -------------NDSKAP-----------------PLGRGRGFIXXXXXXXXXXXXXXX 1467 ND ++P P GRGR + Sbjct: 107 PAGRGRPTVPHHQNDLQSPAGRGRPTVPHHQNDLQSPAGRGRPTVPRHQNDLQSPAGRGR 166 Query: 1466 XXPNQPKPND--------KKPLLFVKDDEAQYNAAES-EIPAIQEKPLPNDVINVISGAG 1314 QP PND KKP+ F ++D A + I Q LP ++I V+SG G Sbjct: 167 ATVPQP-PNDLGPPDSGPKKPIFFKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLG 225 Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQL-SQEEKVKKA 1137 RGKPMK P++ + ENRH+R + R+ + S+++ V+ A Sbjct: 226 RGKPMKQSDPETRVTE-ENRHLRAPRA--------RGAAASDTLYERQPIPSRDDAVRNA 276 Query: 1136 KEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDD-------- 981 + LS+GE G R + D+ Sbjct: 277 RNFLSQGEDDVGGTGRGRGFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDA 336 Query: 980 EAS---GLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLM 810 EAS G Y+GD AD EK+A+K+GPE+MN+L EGFEEM+ RVLPSPL+D YLDA N Sbjct: 337 EASDDIGPYVGDDADGEKLAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYA 396 Query: 809 IECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKV 630 IE EPEYL+E NPDIDEK PIPLRDALEKMKPFLMAYEGIQSQ ETM +V Sbjct: 397 IEFEPEYLVEF--DNPDIDEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQV 454 Query: 629 PLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFD 450 PL+KEIVDHYSGPDRVTAK+QQEELERVAKTLP SAP VK+FT RAV+SLQSNPGWGFD Sbjct: 455 PLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFD 514 Query: 449 KKCQFMDKLVMEVSQQYK 396 KKC FMDKLV EVSQ YK Sbjct: 515 KKCHFMDKLVWEVSQHYK 532 >ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum] Length = 504 Score = 318 bits (815), Expect = 5e-84 Identities = 204/455 (44%), Positives = 247/455 (54%), Gaps = 26/455 (5%) Frame = -1 Query: 1682 SDKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXX 1503 S++ + D SP G G GRG L 
+ K P +GRGRGF Sbjct: 65 SNESKSEATDSPFSPPGAGRGHGRGGSVPPPTGFPSFSSFLTS-IKQPSIGRGRGF---- 119 Query: 1502 XXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQEKPL--------P 1347 QP KKP+LF +D + ++ +KP+ P Sbjct: 120 -GPSPFQPENDTQQLQQPDSVPKKPVLFRSEDSVSQTGGKDDVSP-PKKPVFTRREDFSP 177 Query: 1346 ND--------------VINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXX 1209 D V+ V+SGAGRGKP++ PA + ENRH+R R+ Sbjct: 178 IDLSSDQESDNRFSMSVLKVLSGAGRGKPIE-PAVSETQVVEENRHVRNRRASDVPMRQP 236 Query: 1208 XXXXXXXXXXPREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXX 1029 R+ LS+ + GEP Sbjct: 237 MLTGDGALQNARKYLSKFDGDGSGSG--RGGEP----RERGAFGRGRGRGRGRGRGRGRG 290 Query: 1028 XXXXXXGDDRYEESDDEA----SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVL 861 GDDR+ + D A SGL+LGD D EK+A+K+GPEVMN+ EGFEEM SRVL Sbjct: 291 GFRGTGGDDRFGQIQDNARSNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVL 350 Query: 860 PSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGI 681 PSPL+D Y++A+ N IE EPEY+ME F +NPDIDEK PIPLRDALEKMKPFLM YEGI Sbjct: 351 PSPLEDEYVEAFDINCAIEFEPEYIME-FDSNPDIDEKEPIPLRDALEKMKPFLMNYEGI 409 Query: 680 QSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRF 501 QSQ ETM++VPL+K+IVDHYSGPDRVTAK+QQEELERVAKTLPASAP V +F Sbjct: 410 QSQEEWEAIMEETMERVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPASAPSSVVQF 469 Query: 500 TERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396 T RAV+SLQSNPGWGFDKKCQFMDKLV EVSQ +K Sbjct: 470 TNRAVMSLQSNPGWGFDKKCQFMDKLVFEVSQHHK 504 >ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis] gi|223537066|gb|EEF38701.1| conserved hypothetical protein [Ricinus communis] Length = 436 Score = 318 bits (814), Expect = 7e-84 Identities = 178/322 (55%), Positives = 210/322 (65%), Gaps = 4/322 (1%) Frame = -1 Query: 1352 LPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPR 1173 LP+ + + +SG GRG+P K P + + K ENRHIR R + K + Sbjct: 133 LPSTIHSSLSGFGRGEPDKPVVP-TPQVKEENRHIRDRSRAKPKTEEAEVRA-------K 184 Query: 1172 EQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYE 993 ++S+EE VK+A ILS+G+ + R Sbjct: 185 
PKISREEAVKRAVSILSQGDT-----------GEGMGRGRGGGRGRGRGRGRGRLEQRGR 233 Query: 992 ESDDE----ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 825 DD SGL+LGD AD EK+A K+G E MNKL EG+EEMS RVLPSP++DAYLDA Sbjct: 234 MMDDVDEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDAL 293 Query: 824 HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXE 645 HTN MIE EPEYLM EF NPDIDEKPP+PLRD LEK+KPF+MAYEGIQSQ E Sbjct: 294 HTNYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEE 353 Query: 644 TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 465 TMK VPL KEIVD+YSGPDR+TAK+Q+EELERVA T+PASAP VKRF +RAVLSLQSNP Sbjct: 354 TMKNVPLFKEIVDYYSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNP 413 Query: 464 GWGFDKKCQFMDKLVMEVSQQY 399 GWGFDKKCQFMDKLV EV+Q Y Sbjct: 414 GWGFDKKCQFMDKLVREVNQCY 435 >gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] Length = 428 Score = 317 bits (812), Expect = 1e-83 Identities = 159/202 (78%), Positives = 173/202 (85%), Gaps = 1/202 (0%) Frame = -1 Query: 1001 RYEESDDE-ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 825 R ++SD ASGLYLGD AD EK+A+KLGPE+MNKL E FEEMSS VLPSPLDDAY+DA Sbjct: 226 RGKDSDGSYASGLYLGDNADGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAM 285 Query: 824 HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXE 645 HTN MIECEPEYLM EF NPDIDEKPPI LRDALEKMKPFLMAYE I+SQ E Sbjct: 286 HTNFMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNE 345 Query: 644 TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 465 TM++VPL+KEIVDHYSGPDRVTAK+QQEELERVAKTLPA PD VKRFT+RAVLSLQSNP Sbjct: 346 TMERVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNP 405 Query: 464 GWGFDKKCQFMDKLVMEVSQQY 399 GWGFD+KCQFMDKLV +VSQ Y Sbjct: 406 GWGFDRKCQFMDKLVAKVSQHY 427 >ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus] gi|449502143|ref|XP_004161555.1| PREDICTED: uncharacterized protein LOC101224016 [Cucumis sativus] Length = 478 Score = 315 bits (808), Expect = 3e-83 Identities = 195/449 (43%), Positives = 
242/449 (53%), Gaps = 11/449 (2%) Frame = -1 Query: 1709 PFQFTADSPSDKKPDNSNDDGASPLPR---GHGRGRGTXXXXXXXXXXXXXXLNNDSKAP 1539 PF FT P+ + + S + P GHGRG+ T S Sbjct: 50 PFDFTPPVPNQEHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPSS-- 107 Query: 1538 PLGRGRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQ- 1362 +GRGRG P +P KKP+ F K++ +AA + + + Sbjct: 108 -VGRGRG-----------DASPSIRSPPEPDSEPKKPVFFSKNNAGD-SAASTSLGGLHR 154 Query: 1361 ---EKPLPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXX 1191 E+ LP + + SG GRGKPMK P P+ ++PK ENRH+R RQ+ Sbjct: 155 VSGERNLPESLHSEFSGVGRGKPMKQPVPE-DQPKQENRHLRPRQE----GDGPGAGERG 209 Query: 1190 XXXXPREQLSQEEKVKKAKEILSR----GEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1023 ++ + E + ++S+ GE Sbjct: 210 RGRGFEPRIGRGEPWRNTNRMVSKDGPDGEVGGGRGTSGYRGRGARGPYRRGARGSFRTG 269 Query: 1022 XXXXGDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDD 843 +++ D A+GLYLG+ D E++A+++G E MNKL EGFEEMS RVLPSPL D Sbjct: 270 ERRERRSGHDKEDGYAAGLYLGNNEDGERLAKRIGTENMNKLVEGFEEMSGRVLPSPLVD 329 Query: 842 AYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXX 663 YLD TN MIECEPEYLM +F NPDIDE PPIPLRDALEKMKPFLMAYE IQS Sbjct: 330 QYLDGMDTNFMIECEPEYLMGDFENNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEW 389 Query: 662 XXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVL 483 ETM+ VPL+KEIVD Y GPDRVTAK+QQ ELERVAKTLP SAP+ VK+FT R VL Sbjct: 390 EEIVEETMQSVPLLKEIVDAYGGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRVVL 449 Query: 482 SLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396 SLQSNPGWGFDKK Q MDKLV S++YK Sbjct: 450 SLQSNPGWGFDKKWQLMDKLVEGFSKRYK 478 >emb|CBI17195.3| unnamed protein product [Vitis vinifera] Length = 209 Score = 308 bits (789), Expect = 5e-81 Identities = 151/200 (75%), Positives = 169/200 (84%) Frame = -1 Query: 995 EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTN 816 + DD +GLYLGD AD EK++ K+G E M+KL E FEEMS RVLPSP++DAYLDA HTN Sbjct: 10 DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 69 Query: 815 LMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMK 636 +IE EPEYLMEEFGTNPDIDE 
PPIPLRDALEKMKPFLM YEGIQSQ ETM+ Sbjct: 70 CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 129 Query: 635 KVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWG 456 VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP+ VKRFT+RA+LSLQSNPGWG Sbjct: 130 NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 189 Query: 455 FDKKCQFMDKLVMEVSQQYK 396 FDKKCQFMDKLV EVSQ YK Sbjct: 190 FDKKCQFMDKLVWEVSQHYK 209 >ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] gi|482575944|gb|EOA40131.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] Length = 525 Score = 306 bits (785), Expect = 2e-80 Identities = 178/373 (47%), Positives = 223/373 (59%), Gaps = 19/373 (5%) Frame = -1 Query: 1457 NQPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQ--EKPLPNDVINVI-------SGAGR 1311 +QP+PND+ +FVK E + + P + + LP++V N + SGAGR Sbjct: 164 SQPQPNDESQGSPVFVKLQEMKDVTSSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGR 223 Query: 1310 GKPMKSPAPQSEKPKAENRHIR--------QRQQPKXXXXXXXXXXXXXXXXPREQLSQE 1155 GKP+ AP + ENRHIR QR QP+ R +LS E Sbjct: 224 GKPLVESAPIQRE---ENRHIRRPPPPPQQQRSQPQQKRAQTPRDETP-----RPRLSAE 275 Query: 1154 EKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEA 975 E ++A+ LSRGE D + EE + EA Sbjct: 276 EAGRRARSELSRGEA---EGSGVRGRGGRGRGRGARGRGRGRGGEGWRDDKKEEEGEQEA 332 Query: 974 SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEP 795 ++ GD AD EK A K+GPE+M LAEGFEE+ + LPS DA +DAY TNLMIECEP Sbjct: 333 MSVFAGDSADGEKFANKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIECEP 392 Query: 794 EYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKE 615 EY+M +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q E M + PL+KE Sbjct: 393 EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMAQAPLMKE 452 Query: 614 IVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQF 435 IVDHYSGPDRVTAK+Q EEL+R+A TLP SAPD VKRF +RA L+L+SNPGWGFDKK QF Sbjct: 453 IVDHYSGPDRVTAKKQNEELDRIATTLPKSAPDSVKRFADRAALTLKSNPGWGFDKKYQF 512 Query: 434 MDKLVMEVSQQYK 396 MDKLV+EVSQ YK Sbjct: 513 MDKLVLEVSQSYK 525 >ref|XP_002321880.2| 
hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550322664|gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 466 Score = 303 bits (777), Expect = 1e-79 Identities = 189/442 (42%), Positives = 234/442 (52%), Gaps = 6/442 (1%) Frame = -1 Query: 1703 QFTADSPSDKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPP-LGR 1527 ++ A +P D S + + P G G GRG +++ + P GR Sbjct: 56 EYGAAAPGKPDLDESKTESSESQPSGLGHGRGKPVGTGPILPAFSTFISSVKNSQPGAGR 115 Query: 1526 GRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQEKPLP 1347 GRG +P P+ + + ESE P E LP Sbjct: 116 GRG-------------------TTEPGPS-----------RSTESRPESEPPKKAEANLP 145 Query: 1346 NDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXP--R 1173 +++ + GAGRGKP+K P E K ENRH+R R QP+ Sbjct: 146 PSILSGLGGAGRGKPVKQEVP-IEPAKEENRHLRARSQPRSQPRTRQQKTPDGDDAVPAT 204 Query: 1172 EQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDR-Y 996 ++ ++E VKKA E+LSRG R Y Sbjct: 205 TKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARGGGRGRGRGRRGY 264 Query: 995 EESDDE-ASGLYL-GDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYH 822 + + E SG+ L G DEEK AQ +G E MN L E FEEMS RVLP P++D Y+DA+ Sbjct: 265 GDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLPCPIEDEYVDAFD 324 Query: 821 TNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXET 642 TN E EPEYLM EF NPDIDEKPP+PLRDALEK+KPF+MAY GI++ ET Sbjct: 325 TNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIKTHEEWEEIVEET 384 Query: 641 MKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPG 462 MK PL+K+IVD YSGPDRV+ K+Q+EELERVAKT+PASAPD VK F +RAVLSLQSNPG Sbjct: 385 MKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFADRAVLSLQSNPG 444 Query: 461 WGFDKKCQFMDKLVMEVSQQYK 396 WGFDKKC FMDKL EVSQ YK Sbjct: 445 WGFDKKCMFMDKLAKEVSQHYK 466 >gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain. 
ESTs gb|H37317, gb|F14415, gb|AA651290 come from this gene [Arabidopsis thaliana] Length = 829 Score = 303 bits (775), Expect = 2e-79 Identities = 177/369 (47%), Positives = 223/369 (60%), Gaps = 16/369 (4%) Frame = -1 Query: 1454 QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 1314 Q +PND+ +FVK E Q A S P + KP P+++ N + SGAG Sbjct: 467 QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 524 Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXPREQLSQEEKVK 1143 RGKP+ AP ++ +NR IR+ P + P+ QLS EE + Sbjct: 525 RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 581 Query: 1142 KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLY 963 +A+ LSRGE D + EE + EA ++ Sbjct: 582 RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 640 Query: 962 LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 783 GD AD EK A+K+GPE+M LAEGFEE+ + LPS DA +DAY TNLMIECEPEY+M Sbjct: 641 AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 700 Query: 782 EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDH 603 +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q E M + PL+KEIVDH Sbjct: 701 PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 760 Query: 602 YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 423 YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL Sbjct: 761 YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 820 Query: 422 VMEVSQQYK 396 V+EVSQ YK Sbjct: 821 VLEVSQSYK 829 >ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown protein; 43598-45751 [Arabidopsis thaliana] gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|110740318|dbj|BAF02054.1| hypothetical protein [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 523 Score = 303 bits (775), 
Expect = 2e-79 Identities = 177/369 (47%), Positives = 223/369 (60%), Gaps = 16/369 (4%) Frame = -1 Query: 1454 QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 1314 Q +PND+ +FVK E Q A S P + KP P+++ N + SGAG Sbjct: 161 QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 218 Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXPREQLSQEEKVK 1143 RGKP+ AP ++ +NR IR+ P + P+ QLS EE + Sbjct: 219 RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 275 Query: 1142 KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLY 963 +A+ LSRGE D + EE + EA ++ Sbjct: 276 RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 334 Query: 962 LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 783 GD AD EK A+K+GPE+M LAEGFEE+ + LPS DA +DAY TNLMIECEPEY+M Sbjct: 335 AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 394 Query: 782 EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDH 603 +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q E M + PL+KEIVDH Sbjct: 395 PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 454 Query: 602 YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 423 YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL Sbjct: 455 YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 514 Query: 422 VMEVSQQYK 396 V+EVSQ YK Sbjct: 515 VLEVSQSYK 523 >gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain [Arabidopsis thaliana] Length = 523 Score = 303 bits (775), Expect = 2e-79 Identities = 177/369 (47%), Positives = 223/369 (60%), Gaps = 16/369 (4%) Frame = -1 Query: 1454 QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 1314 Q +PND+ +FVK E Q A S P + KP P+++ N + SGAG Sbjct: 161 QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 218 Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXPREQLSQEEKVK 1143 RGKP+ AP ++ +NR IR+ P + P+ QLS EE + Sbjct: 219 
RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 275 Query: 1142 KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLY 963 +A+ LSRGE D + EE + EA ++ Sbjct: 276 RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 334 Query: 962 LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 783 GD AD EK A+K+GPE+M LAEGFEE+ + LPS DA +DAY TNLMIECEPEY+M Sbjct: 335 AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 394 Query: 782 EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDH 603 +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q E M + PL+KEIVDH Sbjct: 395 PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 454 Query: 602 YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 423 YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL Sbjct: 455 YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 514 Query: 422 VMEVSQQYK 396 V+EVSQ YK Sbjct: 515 VLEVSQSYK 523 >ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum] gi|557089350|gb|ESQ30058.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum] Length = 531 Score = 301 bits (771), Expect = 6e-79 Identities = 192/469 (40%), Positives = 240/469 (51%), Gaps = 41/469 (8%) Frame = -1 Query: 1679 DKKPDNSND-------DGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGR 1521 +++P+ +N+ S P G+G GRG + DS P +GRGR Sbjct: 70 NREPERANEAAGHGRGSSESQSPGGYGHGRGRPIQSDPISPAFSSFVRPDS--PSVGRGR 127 Query: 1520 GFIXXXXXXXXXXXXXXXXXPNQPKPN------DKKPLLFVKDDEAQYNAAESEIPAIQE 1359 G + +P + P +F K E + + P + Sbjct: 128 GSVGSDPVSPFAAPSPPPPRDQSHRPQLSSEEQPQSPPVFAKLQEMKDATSSPPPPPTES 187 Query: 1358 K-----PL-------------PNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQ 1233 K PL PN I SGAGRGKP AP ++ ENRHIR+ Q Sbjct: 188 KSGQTAPLNNIFNGLGSEFSQPNQRIVPGSGAGRGKPFVESAPLQQE---ENRHIRRPQP 244 Query: 1232 PKXXXXXXXXXXXXXXXXPREQ----------LSQEEKVKKAKEILSRGEPVXXXXXXXX 1083 P R Q LS EE ++A+ LSRGE Sbjct: 245 PPPQQQQQRSQPQPQHQQKRVQPPKDEAPRPKLSIEEAGRRARSQLSRGEAEGGGLRGRG 304 Query: 1082 
XXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMN 903 + EE++ EA ++GD AD EK A K+GPE+M Sbjct: 305 GGRGRGRGARGRGRGRGGEGWRDVKME--EEAEQEAISTFVGDSADGEKFANKMGPEIMK 362 Query: 902 KLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDA 723 LA+G+E++ R LPS +DA LDAY TNLMIECEPEYLM FG+NPDIDEKPP+ LR+ Sbjct: 363 MLADGYEDICERALPSTANDAVLDAYETNLMIECEPEYLMPAFGSNPDIDEKPPMSLREC 422 Query: 722 LEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVA 543 LEK+KPF++AYEGI+ Q E M + PLIKEIVDHYSGPDRVTAK+Q EEL+R+A Sbjct: 423 LEKVKPFIVAYEGIKDQEEWEEAIDEVMAQAPLIKEIVDHYSGPDRVTAKKQNEELDRIA 482 Query: 542 KTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396 T+P SAPD VKRF +RA LSL+SNPGWGFDKK QFMDKLV EVSQ YK Sbjct: 483 TTVPKSAPDSVKRFADRAALSLKSNPGWGFDKKYQFMDKLVAEVSQSYK 531 >ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca subsp. vesca] Length = 464 Score = 301 bits (771), Expect = 6e-79 Identities = 150/206 (72%), Positives = 171/206 (83%), Gaps = 3/206 (1%) Frame = -1 Query: 1004 DRYEESDDE---ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYL 834 DR D++ ASGLYLGD AD EK+A+KLGPEVMN+L E FE+MS+ VLPSPLDDAY+ Sbjct: 259 DRRRRGDEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYV 318 Query: 833 DAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXX 654 DA TN IE EPEYLM EF NPDIDE+PPIPLRDALEKMKPFLMAYEGIQSQ Sbjct: 319 DALDTNCKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEA 378 Query: 653 XXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQ 474 ETM++VPL+K+IVDHYSGPDRVTAK+Q+EELERVAKTLPA+ PD VK+FT+RAVLSLQ Sbjct: 379 IKETMERVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQ 438 Query: 473 SNPGWGFDKKCQFMDKLVMEVSQQYK 396 NPGWGF +KCQFMDKL +VS+ YK Sbjct: 439 GNPGWGFHRKCQFMDKLTQKVSKHYK 464