BLASTX nr result

ID: Rehmannia24_contig00011893 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00011893
         (1887 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   387   e-105
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   384   e-104
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...   357   1e-95
gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, ...   352   2e-94
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]       348   5e-93
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   342   3e-91
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...   327   1e-86
gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus...   319   2e-84
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...   318   5e-84
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   318   7e-84
gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus pe...   317   1e-83
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...   315   3e-83
emb|CBI17195.3| unnamed protein product [Vitis vinifera]              308   5e-81
ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps...   306   2e-80
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...   303   1e-79
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...   303   2e-79
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...   303   2e-79
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...   303   2e-79
ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutr...   301   6e-79
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300...   301   6e-79

>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  387 bits (995), Expect = e-105
 Identities = 226/438 (51%), Positives = 271/438 (61%), Gaps = 10/438 (2%)
 Frame = +3

Query: 192  DKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXXNNDSKAPPLGRGRGFIXXXX 371
            D KP++S     +P   GHGRGRG                +N +  PP GRGRG I    
Sbjct: 58   DSKPESSTP--TTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPN--PPAGRGRGGIGPFS 113

Query: 372  XXXXXXXXXXXXXXNQPKPNDKKPLLFVKDDE-AQYNAAESEIPAIQEKP-LPNDVINVI 545
                           Q +   +KP+ F K++E A  N++ S+ P  ++   L + VI+V+
Sbjct: 114  PPPQPQQQQ-----QQQQQPLRKPIFFAKEEETADSNSSSSDAPTPRDDSNLSSSVISVL 168

Query: 546  SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXXREQLSQEEKV 725
            +GAGRGKP+++ +P SEKPK ENRH+R RQQ                    ++LS+E+ V
Sbjct: 169  TGAGRGKPLQTASPVSEKPKEENRHLRPRQQKVADSGERASSPPP------QRLSREDAV 222

Query: 726  KKAKEILSRGEP------VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESD 887
            KKA  ILSR +       V                                   R EE  
Sbjct: 223  KKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERG 282

Query: 888  DEA--SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLM 1061
            D +  SG YLGD AD EK+AQKLGPE MN LAEGFEEMS+RVLPSP+DDAY++A HTN+M
Sbjct: 283  DGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLPSPMDDAYIEALHTNMM 342

Query: 1062 IECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKKV 1241
            IECEPEYLM +F +NPDIDE PPIPLRDALEKMKPFLMAYEGI+ Q         TM+ V
Sbjct: 343  IECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMETV 402

Query: 1242 PLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFD 1421
            PL+KEIVD+YSGPDRVTAKQQQ+ELERVAKTLP SAP+ VKRFTERAVLSLQSNPGWGFD
Sbjct: 403  PLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFD 462

Query: 1422 KKCQFMDKLVMEVSQQYK 1475
            KKCQFMDK+VME SQ YK
Sbjct: 463  KKCQFMDKVVMEASQHYK 480


>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
            lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED:
            uncharacterized protein LOC101247662 isoform 2 [Solanum
            lycopersicum]
          Length = 473

 Score =  384 bits (987), Expect = e-104
 Identities = 226/434 (52%), Positives = 268/434 (61%), Gaps = 6/434 (1%)
 Frame = +3

Query: 192  DKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXXNNDSKAPPLGRGRGFIXXXX 371
            D KP++S    A+P   GHGRGRG                +N +   P GRGRG I    
Sbjct: 58   DSKPESSTP--ATPSGTGHGRGRGKPLPSSPIVPSFHSFVDNPNT--PAGRGRGGIGPFS 113

Query: 372  XXXXXXXXXXXXXXNQPKPNDKKPLLFVKDDEA-QYNAAESEIPAIQEKP-LPNDVINVI 545
                           Q +   +KP+ F K++E    N++ S  P  ++   LP+ VI+V+
Sbjct: 114  PPPQPQ--------QQQQQPLRKPIFFAKEEETTDSNSSSSNAPKPRDDSNLPSSVISVL 165

Query: 546  SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXXREQLSQEEKV 725
            +GAGRGKP+++ +  SEKPK ENRH+R RQQ                    ++LS+E+ V
Sbjct: 166  TGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSGERASSPPP------QRLSREDAV 219

Query: 726  KKAKEILSRGEP--VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESDDE-- 893
            KKA  ILSR +   V                                   R EE  D   
Sbjct: 220  KKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERGDGNL 279

Query: 894  ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECE 1073
             SG YLGD AD EK+A KLGPE MN LAEGFEEMS+RVLPSP+DDAYL+A HTN+MIECE
Sbjct: 280  ESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYLEALHTNMMIECE 339

Query: 1074 PEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKKVPLIK 1253
            PEYLM +F +NPDIDE PPIPLRDALEKMKPFLMAYEGI+ Q         TM+ VPL+K
Sbjct: 340  PEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMETVPLMK 399

Query: 1254 EIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQ 1433
            EIVD+YSGPDRVTAKQQQ+ELERVAKTLP SAP+ VKRFTERAVLSLQSNPGWGFDKKCQ
Sbjct: 400  EIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFDKKCQ 459

Query: 1434 FMDKLVMEVSQQYK 1475
            FMDK+VMEVSQ YK
Sbjct: 460  FMDKVVMEVSQHYK 473


>ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera]
          Length = 482

 Score =  357 bits (916), Expect = 1e-95
 Identities = 219/461 (47%), Positives = 266/461 (57%), Gaps = 21/461 (4%)
 Frame = +3

Query: 156  ATPFQFTADSPSDKKP--DNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXXNNDSKA 329
            A+PF F + +P   +P  D +++   SP P G G GRG                 +   +
Sbjct: 42   ASPFDFASGAPEKTEPTADPNSESSESPFPLGLGHGRGKPPSQPSAPTLPSF---SSFAS 98

Query: 330  PPLGRGRGFIXXXXXXXXXXXXXXXXXXNQPKPNDKKPLLFVKDDEAQYNAA-ESEIPAI 506
              +GRGRG +                    P    KKP+ F K+D A      +S++   
Sbjct: 99   TGIGRGRGRLTAHPTDSVPQ--------QSPDFAPKKPIFFSKEDAADSAPKPQSQLGTT 150

Query: 507  --QEKPLPNDVINVISG-AGRGKPMK-SPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXX 674
              +E  LP  +++ +SG AGRG+P+K +PAP    PK ENRH+RQ +QP           
Sbjct: 151  PPEENNLPVSILSALSGGAGRGQPLKQTPAP----PKEENRHLRQPRQP-----VFRSPQ 201

Query: 675  XXXXXXXREQLSQEEKVKKAKEILSRG----------EPVXXXXXXXXXXXXXXXXXXXX 824
                   + +LS+EE VKKA  ILSRG          E                      
Sbjct: 202  QPVAGPPQPRLSREEAVKKAVGILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWM 261

Query: 825  XXXXXXXXXXXXXDDRY----EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEE 992
                          DR     +  DD  +GLYLGD AD EK++ K+G E M+KL E FEE
Sbjct: 262  GRGRGRGRGRGRMGDRRGRGGDAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEE 321

Query: 993  MSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFL 1172
            MS RVLPSP++DAYLDA HTN +IE EPEYLMEEFGTNPDIDE PPIPLRDALEKMKPFL
Sbjct: 322  MSGRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFL 381

Query: 1173 MAYEGIQSQXXXXXXXXXTMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAP 1352
            M YEGIQSQ         TM+ VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP
Sbjct: 382  MQYEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAP 441

Query: 1353 DPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 1475
            + VKRFT+RA+LSLQSNPGWGFDKKCQFMDKLV EVSQ YK
Sbjct: 442  NSVKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHYK 482


>gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao]
          Length = 474

 Score =  352 bits (904), Expect = 2e-94
 Identities = 218/447 (48%), Positives = 249/447 (55%), Gaps = 17/447 (3%)
 Frame = +3

Query: 186  PSDKKPDNSNDDGASPLPRG--HGRGRGTXXXXXXXXXXXXXXXNNDSKAPPLGRGRGFI 359
            P      +SN D A   P G  HGRGRG                   S     G GRG +
Sbjct: 62   PGKSGSGDSNRDSAESPPAGVGHGRGRGGPLSSDPIPHPF------SSFVSQTGSGRGRV 115

Query: 360  XXXXXXXXXXXXXXXXXXNQPKPNDKKPLLFVK---DDEAQYNAAESEIPAIQEKPL--P 524
                                P P   K  +F+K   +DE + +A  +  P    +P+  P
Sbjct: 116  TSESVPPPP-----------PPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPP 164

Query: 525  NDV-INVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXXRE 701
            N + ++V+SGAGRGKP+K P P S + + ENRHIR  QQ                     
Sbjct: 165  NILPVSVLSGAGRGKPVKQPEPASRRQE-ENRHIRVAQQQSPSA---------------- 207

Query: 702  QLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEE 881
            Q+SQEE  KKA  ILSR                                       R + 
Sbjct: 208  QMSQEEATKKAMGILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQG 267

Query: 882  SDDE---------ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAY 1034
             D           A GLYLGD AD EK AQ +G + MNKL EGFEEM SRVLPSP+DDAY
Sbjct: 268  EDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAY 327

Query: 1035 LDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXX 1214
            LDA HTN  IE EPEYLMEEFGTNPDIDEKPP+PLRDALEKMKPFLMAYEGIQSQ     
Sbjct: 328  LDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMPLRDALEKMKPFLMAYEGIQSQEEWEE 387

Query: 1215 XXXXTMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSL 1394
                TM++VPL++EIVD+YSGPDRVTAK+QQEELERVAKT+P  AP  VK+F  RAVLSL
Sbjct: 388  VIKETMERVPLLQEIVDYYSGPDRVTAKKQQEELERVAKTIPERAPSSVKQFANRAVLSL 447

Query: 1395 QSNPGWGFDKKCQFMDKLVMEVSQQYK 1475
            QSNPGWGFDKKCQFMDKLV EVSQQYK
Sbjct: 448  QSNPGWGFDKKCQFMDKLVWEVSQQYK 474


>gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]
          Length = 426

 Score =  348 bits (893), Expect = 5e-93
 Identities = 210/441 (47%), Positives = 250/441 (56%), Gaps = 3/441 (0%)
 Frame = +3

Query: 162  PFQFTADSPSDKKPDNSNDDGASPL---PRGHGRGRGTXXXXXXXXXXXXXXXNNDSKAP 332
            P  F ++ PS      ++    SP    P G GRGR                 +NDS AP
Sbjct: 38   PNTFASNKPSGSVELGNSKIDDSPTTAPPYGRGRGRIQPLPSSPLLPSFASIVSNDSGAP 97

Query: 333  PLGRGRGFIXXXXXXXXXXXXXXXXXXNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQE 512
            P+G GRG I                    P P D   L                      
Sbjct: 98   PIGGGRGKIPTRPPLP-------------PPPRDTAAL---------------------- 122

Query: 513  KPLPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXX 692
                +D++  +SG GRG P K P PQ+ KP   NRHIRQ  QP+                
Sbjct: 123  ----DDILTNLSGMGRGTPGKPP-PQTLKPTPINRHIRQ-PQPRPSTALSPD-------- 168

Query: 693  XREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDR 872
              +QLS+EEK+KKA EILSRG+P                                   D 
Sbjct: 169  --QQLSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRGGRFSGRGRGREADA 226

Query: 873  YEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHT 1052
              ESD+E  G++ GDPADE+K+A+KLG EVMNK+ EG EEMSSRVLPS +DDAY+DAYHT
Sbjct: 227  AIESDEELPGMF-GDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHT 285

Query: 1053 NLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTM 1232
            NL++ECEPEY ME+FGTNPDID+KPPIPLR+A EKMKPFLM + GI++Q         TM
Sbjct: 286  NLLLECEPEYFMEDFGTNPDIDDKPPIPLREAFEKMKPFLMQHIGIETQEEWEQIIEETM 345

Query: 1233 KKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGW 1412
            + VP  K+I+DHY+GPDRVTA QQ  ELERVA TLPA+AP  VKRFTERAVLSL+SNPGW
Sbjct: 346  ESVPRWKKIIDHYAGPDRVTALQQIGELERVAGTLPATAPASVKRFTERAVLSLKSNPGW 405

Query: 1413 GFDKKCQFMDKLVMEVSQQYK 1475
            GF KKCQFMDK+VMEVSQQYK
Sbjct: 406  GFKKKCQFMDKVVMEVSQQYK 426


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  342 bits (877), Expect = 3e-91
 Identities = 190/345 (55%), Positives = 222/345 (64%), Gaps = 12/345 (3%)
 Frame = +3

Query: 477  NAAESEIPAIQEKPLPNDVINVISGAGRGKPMKSPAPQSEK----------PKAENRHIR 626
            +A +S  P+  E  LP+ +I+ + GAGRGK   +   Q ++          P+ ENRHIR
Sbjct: 68   SATDSTQPS--EPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIR 125

Query: 627  QRQQPKXXXXXXXXXXXXXXXXXREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXX 806
             R QP+                 + +LS+E+ VK A ++LSRGE                
Sbjct: 126  ARLQPQPRPEKAPAAETGSA---QPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGR 182

Query: 807  XXXXXXXXXXXXXXXXXXXDDRYEESDDEA--SGLYLGDPADEEKMAQKLGPEVMNKLAE 980
                                 +  E D++    GLYLGD AD EK+A+K+G E MN L E
Sbjct: 183  GMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVE 242

Query: 981  GFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 1160
            GFEEMS RVLPSP++DAY+DA HTN MIE EPEYLMEEFGTNPDIDEKPPIPLRDALEKM
Sbjct: 243  GFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 302

Query: 1161 KPFLMAYEGIQSQXXXXXXXXXTMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLP 1340
            KPFLMAYEGIQSQ          M++VPL+KEIVDHYSGPDRVTAKQQ EELERVAKT+P
Sbjct: 303  KPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIP 362

Query: 1341 ASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 1475
             SAP  +KRF  RAVLSLQSNPGWGFDKKCQFMDKL  EVSQQYK
Sbjct: 363  ESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKLAWEVSQQYK 407


>ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max]
            gi|571476117|ref|XP_006586864.1| PREDICTED: la-related
            protein 1 isoform X2 [Glycine max]
          Length = 481

 Score =  327 bits (837), Expect = 1e-86
 Identities = 193/439 (43%), Positives = 237/439 (53%), Gaps = 15/439 (3%)
 Frame = +3

Query: 204  DNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXXNNDSKAPPLGRGRGFIXXXXXXXX 383
            ++ +D    P+P G G G G                 +    PP GRGRG          
Sbjct: 64   ESKSDTTEPPIPPGSGLGHGRGKPMPPSGLPSFSSFISSINQPPAGRGRG---------- 113

Query: 384  XXXXXXXXXXNQPKPNDKKPLLFVKDDEAQYNAAESEIPA------IQEKPLPNDVINVI 545
                        P    KKP+ F ++D     A+   +P         +  LP  +  V+
Sbjct: 114  -TAPHPQHDLQPPDSGPKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDNKLPGSIPGVL 172

Query: 546  SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXXREQLSQEEKV 725
            SG GRGK MK P  +++  + ENRH+R RQ P                      SQE+  
Sbjct: 173  SGLGRGKSMKQPDLETQVTE-ENRHLRTRQAPGAASSETVPKRSPIP-------SQEDAT 224

Query: 726  KKAKEILSRGEP---------VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYE 878
            + A +ILS G+                                            D++  
Sbjct: 225  RNALKILSHGKDDGSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDVDEKVM 284

Query: 879  ESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNL 1058
            ++DD A+GLY GD AD EK+A+K+GPE+MN+L EGFEEM+SRVLPSPL+D +LDA   N 
Sbjct: 285  DTDDYATGLYAGDDADGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDALDINY 344

Query: 1059 MIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKK 1238
             IE EPEYL+E    NPDIDEK PI LRDALEK KPFLM+YEGIQSQ         TM +
Sbjct: 345  AIEFEPEYLVEF--DNPDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIMEETMAR 402

Query: 1239 VPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGF 1418
            VPL+K+I+DHYSGPDRVTAK+QQEELERVAKTLP S P  VK+FT RAV+SLQSNPGWGF
Sbjct: 403  VPLLKKIIDHYSGPDRVTAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSNPGWGF 462

Query: 1419 DKKCQFMDKLVMEVSQQYK 1475
            DKKC FMDKLV EVSQ YK
Sbjct: 463  DKKCHFMDKLVWEVSQHYK 481


>gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score =  319 bits (818), Expect = 2e-84
 Identities = 214/498 (42%), Positives = 259/498 (52%), Gaps = 60/498 (12%)
 Frame = +3

Query: 162  PFQFTADSPSDKKPDNS---NDDGASPLP----RGHGRGRGTXXXXXXXXXXXXXXXN-- 314
            PF F   +P   KP++S   +D   SP+P     GHGRG+                 N  
Sbjct: 49   PFNFNERAPG--KPNSSEPKSDTTESPIPPGSGHGHGRGKPMPPSGLPSFSSFLSSINQP 106

Query: 315  -------------NDSKAP-----------------PLGRGRGFIXXXXXXXXXXXXXXX 404
                         ND ++P                 P GRGR  +               
Sbjct: 107  PAGRGRPTVPHHQNDLQSPAGRGRPTVPHHQNDLQSPAGRGRPTVPRHQNDLQSPAGRGR 166

Query: 405  XXXNQPKPND--------KKPLLFVKDDEAQYNAAES-EIPAIQEKPLPNDVINVISGAG 557
                QP PND        KKP+ F ++D A     +   I   Q   LP ++I V+SG G
Sbjct: 167  ATVPQP-PNDLGPPDSGPKKPIFFKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLG 225

Query: 558  RGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXXREQL-SQEEKVKKA 734
            RGKPMK   P++   + ENRH+R  +                    R+ + S+++ V+ A
Sbjct: 226  RGKPMKQSDPETRVTE-ENRHLRAPRA--------RGAAASDTLYERQPIPSRDDAVRNA 276

Query: 735  KEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESDD-------- 890
            +  LS+GE                                     R  + D+        
Sbjct: 277  RNFLSQGEDDVGGTGRGRGFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDA 336

Query: 891  EAS---GLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLM 1061
            EAS   G Y+GD AD EK+A+K+GPE+MN+L EGFEEM+ RVLPSPL+D YLDA   N  
Sbjct: 337  EASDDIGPYVGDDADGEKLAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYA 396

Query: 1062 IECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKKV 1241
            IE EPEYL+E    NPDIDEK PIPLRDALEKMKPFLMAYEGIQSQ         TM +V
Sbjct: 397  IEFEPEYLVEF--DNPDIDEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQV 454

Query: 1242 PLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFD 1421
            PL+KEIVDHYSGPDRVTAK+QQEELERVAKTLP SAP  VK+FT RAV+SLQSNPGWGFD
Sbjct: 455  PLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFD 514

Query: 1422 KKCQFMDKLVMEVSQQYK 1475
            KKC FMDKLV EVSQ YK
Sbjct: 515  KKCHFMDKLVWEVSQHYK 532


>ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum]
          Length = 504

 Score =  318 bits (815), Expect = 5e-84
 Identities = 201/455 (44%), Positives = 244/455 (53%), Gaps = 26/455 (5%)
 Frame = +3

Query: 189  SDKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXXNNDSKAPPLGRGRGFIXXX 368
            S++    + D   SP   G G GRG                 +  K P +GRGRGF    
Sbjct: 65   SNESKSEATDSPFSPPGAGRGHGRGGSVPPPTGFPSFSSFLTS-IKQPSIGRGRGF---- 119

Query: 369  XXXXXXXXXXXXXXXNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQEKPL--------P 524
                            QP    KKP+LF  +D       + ++    +KP+        P
Sbjct: 120  -GPSPFQPENDTQQLQQPDSVPKKPVLFRSEDSVSQTGGKDDVSP-PKKPVFTRREDFSP 177

Query: 525  ND--------------VINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXX 662
             D              V+ V+SGAGRGKP++ PA    +   ENRH+R R+         
Sbjct: 178  IDLSSDQESDNRFSMSVLKVLSGAGRGKPIE-PAVSETQVVEENRHVRNRRASDVPMRQP 236

Query: 663  XXXXXXXXXXXREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXX 842
                       R+ LS+ +           GEP                           
Sbjct: 237  MLTGDGALQNARKYLSKFDGDGSGSG--RGGEP----RERGAFGRGRGRGRGRGRGRGRG 290

Query: 843  XXXXXXXDDRYEESDDEA----SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVL 1010
                   DDR+ +  D A    SGL+LGD  D EK+A+K+GPEVMN+  EGFEEM SRVL
Sbjct: 291  GFRGTGGDDRFGQIQDNARSNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVL 350

Query: 1011 PSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGI 1190
            PSPL+D Y++A+  N  IE EPEY+ME F +NPDIDEK PIPLRDALEKMKPFLM YEGI
Sbjct: 351  PSPLEDEYVEAFDINCAIEFEPEYIME-FDSNPDIDEKEPIPLRDALEKMKPFLMNYEGI 409

Query: 1191 QSQXXXXXXXXXTMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRF 1370
            QSQ         TM++VPL+K+IVDHYSGPDRVTAK+QQEELERVAKTLPASAP  V +F
Sbjct: 410  QSQEEWEAIMEETMERVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPASAPSSVVQF 469

Query: 1371 TERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 1475
            T RAV+SLQSNPGWGFDKKCQFMDKLV EVSQ +K
Sbjct: 470  TNRAVMSLQSNPGWGFDKKCQFMDKLVFEVSQHHK 504


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
            gi|223537066|gb|EEF38701.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 436

 Score =  318 bits (814), Expect = 7e-84
 Identities = 177/322 (54%), Positives = 209/322 (64%), Gaps = 4/322 (1%)
 Frame = +3

Query: 519  LPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXXR 698
            LP+ + + +SG GRG+P K   P + + K ENRHIR R + K                 +
Sbjct: 133  LPSTIHSSLSGFGRGEPDKPVVP-TPQVKEENRHIRDRSRAKPKTEEAEVRA-------K 184

Query: 699  EQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYE 878
             ++S+EE VK+A  ILS+G+                                   + R  
Sbjct: 185  PKISREEAVKRAVSILSQGDT-----------GEGMGRGRGGGRGRGRGRGRGRLEQRGR 233

Query: 879  ESDDE----ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 1046
              DD      SGL+LGD AD EK+A K+G E MNKL EG+EEMS RVLPSP++DAYLDA 
Sbjct: 234  MMDDVDEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDAL 293

Query: 1047 HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXX 1226
            HTN MIE EPEYLM EF  NPDIDEKPP+PLRD LEK+KPF+MAYEGIQSQ         
Sbjct: 294  HTNYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEE 353

Query: 1227 TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 1406
            TMK VPL KEIVD+YSGPDR+TAK+Q+EELERVA T+PASAP  VKRF +RAVLSLQSNP
Sbjct: 354  TMKNVPLFKEIVDYYSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNP 413

Query: 1407 GWGFDKKCQFMDKLVMEVSQQY 1472
            GWGFDKKCQFMDKLV EV+Q Y
Sbjct: 414  GWGFDKKCQFMDKLVREVNQCY 435


>gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score =  317 bits (812), Expect = 1e-83
 Identities = 158/202 (78%), Positives = 172/202 (85%), Gaps = 1/202 (0%)
 Frame = +3

Query: 870  RYEESDDE-ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 1046
            R ++SD   ASGLYLGD AD EK+A+KLGPE+MNKL E FEEMSS VLPSPLDDAY+DA 
Sbjct: 226  RGKDSDGSYASGLYLGDNADGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAM 285

Query: 1047 HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXX 1226
            HTN MIECEPEYLM EF  NPDIDEKPPI LRDALEKMKPFLMAYE I+SQ         
Sbjct: 286  HTNFMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNE 345

Query: 1227 TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 1406
            TM++VPL+KEIVDHYSGPDRVTAK+QQEELERVAKTLPA  PD VKRFT+RAVLSLQSNP
Sbjct: 346  TMERVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNP 405

Query: 1407 GWGFDKKCQFMDKLVMEVSQQY 1472
            GWGFD+KCQFMDKLV +VSQ Y
Sbjct: 406  GWGFDRKCQFMDKLVAKVSQHY 427


>ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus]
            gi|449502143|ref|XP_004161555.1| PREDICTED:
            uncharacterized protein LOC101224016 [Cucumis sativus]
          Length = 478

 Score =  315 bits (808), Expect = 3e-83
 Identities = 193/449 (42%), Positives = 240/449 (53%), Gaps = 11/449 (2%)
 Frame = +3

Query: 162  PFQFTADSPSDKKPDNSNDDGASPLPR---GHGRGRGTXXXXXXXXXXXXXXXNNDSKAP 332
            PF FT   P+ +  + S  +     P    GHGRG+ T                  S   
Sbjct: 50   PFDFTPPVPNQEHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPSS-- 107

Query: 333  PLGRGRGFIXXXXXXXXXXXXXXXXXXNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQ- 509
             +GRGRG                     +P    KKP+ F K++    +AA + +  +  
Sbjct: 108  -VGRGRG-----------DASPSIRSPPEPDSEPKKPVFFSKNNAGD-SAASTSLGGLHR 154

Query: 510  ---EKPLPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXX 680
               E+ LP  + +  SG GRGKPMK P P+ ++PK ENRH+R RQ+              
Sbjct: 155  VSGERNLPESLHSEFSGVGRGKPMKQPVPE-DQPKQENRHLRPRQE----GDGPGAGERG 209

Query: 681  XXXXXREQLSQEEKVKKAKEILSR----GEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXX 848
                   ++ + E  +    ++S+    GE                              
Sbjct: 210  RGRGFEPRIGRGEPWRNTNRMVSKDGPDGEVGGGRGTSGYRGRGARGPYRRGARGSFRTG 269

Query: 849  XXXXXDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDD 1028
                    +++ D  A+GLYLG+  D E++A+++G E MNKL EGFEEMS RVLPSPL D
Sbjct: 270  ERRERRSGHDKEDGYAAGLYLGNNEDGERLAKRIGTENMNKLVEGFEEMSGRVLPSPLVD 329

Query: 1029 AYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXX 1208
             YLD   TN MIECEPEYLM +F  NPDIDE PPIPLRDALEKMKPFLMAYE IQS    
Sbjct: 330  QYLDGMDTNFMIECEPEYLMGDFENNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEW 389

Query: 1209 XXXXXXTMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVL 1388
                  TM+ VPL+KEIVD Y GPDRVTAK+QQ ELERVAKTLP SAP+ VK+FT R VL
Sbjct: 390  EEIVEETMQSVPLLKEIVDAYGGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRVVL 449

Query: 1389 SLQSNPGWGFDKKCQFMDKLVMEVSQQYK 1475
            SLQSNPGWGFDKK Q MDKLV   S++YK
Sbjct: 450  SLQSNPGWGFDKKWQLMDKLVEGFSKRYK 478


>emb|CBI17195.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score =  308 bits (789), Expect = 5e-81
 Identities = 150/200 (75%), Positives = 168/200 (84%)
 Frame = +3

Query: 876  EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTN 1055
            +  DD  +GLYLGD AD EK++ K+G E M+KL E FEEMS RVLPSP++DAYLDA HTN
Sbjct: 10   DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 69

Query: 1056 LMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMK 1235
             +IE EPEYLMEEFGTNPDIDE PPIPLRDALEKMKPFLM YEGIQSQ         TM+
Sbjct: 70   CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 129

Query: 1236 KVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWG 1415
             VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP+ VKRFT+RA+LSLQSNPGWG
Sbjct: 130  NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 189

Query: 1416 FDKKCQFMDKLVMEVSQQYK 1475
            FDKKCQFMDKLV EVSQ YK
Sbjct: 190  FDKKCQFMDKLVWEVSQHYK 209


>ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella]
            gi|482575944|gb|EOA40131.1| hypothetical protein
            CARUB_v10008838mg [Capsella rubella]
          Length = 525

 Score =  306 bits (785), Expect = 2e-80
 Identities = 177/373 (47%), Positives = 222/373 (59%), Gaps = 19/373 (5%)
 Frame = +3

Query: 414  NQPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQ--EKPLPNDVINVI-------SGAGR 560
            +QP+PND+     +FVK  E +   +    P  +  +  LP++V N +       SGAGR
Sbjct: 164  SQPQPNDESQGSPVFVKLQEMKDVTSSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGR 223

Query: 561  GKPMKSPAPQSEKPKAENRHIR--------QRQQPKXXXXXXXXXXXXXXXXXREQLSQE 716
            GKP+   AP   +   ENRHIR        QR QP+                 R +LS E
Sbjct: 224  GKPLVESAPIQRE---ENRHIRRPPPPPQQQRSQPQQKRAQTPRDETP-----RPRLSAE 275

Query: 717  EKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESDDEA 896
            E  ++A+  LSRGE                                   D + EE + EA
Sbjct: 276  EAGRRARSELSRGEA---EGSGVRGRGGRGRGRGARGRGRGRGGEGWRDDKKEEEGEQEA 332

Query: 897  SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEP 1076
              ++ GD AD EK A K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEP
Sbjct: 333  MSVFAGDSADGEKFANKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIECEP 392

Query: 1077 EYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKKVPLIKE 1256
            EY+M +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q          M + PL+KE
Sbjct: 393  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMAQAPLMKE 452

Query: 1257 IVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQF 1436
            IVDHYSGPDRVTAK+Q EEL+R+A TLP SAPD VKRF +RA L+L+SNPGWGFDKK QF
Sbjct: 453  IVDHYSGPDRVTAKKQNEELDRIATTLPKSAPDSVKRFADRAALTLKSNPGWGFDKKYQF 512

Query: 1437 MDKLVMEVSQQYK 1475
            MDKLV+EVSQ YK
Sbjct: 513  MDKLVLEVSQSYK 525


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550322664|gb|EEF06007.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 466

 Score =  303 bits (777), Expect = 1e-79
 Identities = 188/442 (42%), Positives = 232/442 (52%), Gaps = 6/442 (1%)
 Frame = +3

Query: 168  QFTADSPSDKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXXNNDSKAPP-LGR 344
            ++ A +P     D S  + +   P G G GRG                ++   + P  GR
Sbjct: 56   EYGAAAPGKPDLDESKTESSESQPSGLGHGRGKPVGTGPILPAFSTFISSVKNSQPGAGR 115

Query: 345  GRGFIXXXXXXXXXXXXXXXXXXNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQEKPLP 524
            GRG                     +P P+            +  +  ESE P   E  LP
Sbjct: 116  GRG-------------------TTEPGPS-----------RSTESRPESEPPKKAEANLP 145

Query: 525  NDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXX--R 698
              +++ + GAGRGKP+K   P  E  K ENRH+R R QP+                    
Sbjct: 146  PSILSGLGGAGRGKPVKQEVP-IEPAKEENRHLRARSQPRSQPRTRQQKTPDGDDAVPAT 204

Query: 699  EQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDR-Y 875
             ++ ++E VKKA E+LSRG                                      R Y
Sbjct: 205  TKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARGGGRGRGRGRRGY 264

Query: 876  EESDDE-ASGLYL-GDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYH 1049
             + + E  SG+ L G   DEEK AQ +G E MN L E FEEMS RVLP P++D Y+DA+ 
Sbjct: 265  GDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLPCPIEDEYVDAFD 324

Query: 1050 TNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXT 1229
            TN   E EPEYLM EF  NPDIDEKPP+PLRDALEK+KPF+MAY GI++          T
Sbjct: 325  TNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIKTHEEWEEIVEET 384

Query: 1230 MKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPG 1409
            MK  PL+K+IVD YSGPDRV+ K+Q+EELERVAKT+PASAPD VK F +RAVLSLQSNPG
Sbjct: 385  MKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFADRAVLSLQSNPG 444

Query: 1410 WGFDKKCQFMDKLVMEVSQQYK 1475
            WGFDKKC FMDKL  EVSQ YK
Sbjct: 445  WGFDKKCMFMDKLAKEVSQHYK 466


>gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain.
            ESTs gb|H37317, gb|F14415, gb|AA651290 come from this
            gene [Arabidopsis thaliana]
          Length = 829

 Score =  303 bits (775), Expect = 2e-79
 Identities = 175/369 (47%), Positives = 221/369 (59%), Gaps = 16/369 (4%)
 Frame = +3

Query: 417  QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 557
            Q +PND+     +FVK  E Q   A S  P  + KP     P+++ N +       SGAG
Sbjct: 467  QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 524

Query: 558  RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXXREQLSQEEKVK 728
            RGKP+   AP  ++   +NR IR+   P   +                 + QLS EE  +
Sbjct: 525  RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 581

Query: 729  KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESDDEASGLY 908
            +A+  LSRGE                                   D + EE + EA  ++
Sbjct: 582  RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 640

Query: 909  LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 1088
             GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEPEY+M
Sbjct: 641  AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 700

Query: 1089 EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKKVPLIKEIVDH 1268
             +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q          M + PL+KEIVDH
Sbjct: 701  PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 760

Query: 1269 YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 1448
            YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL
Sbjct: 761  YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 820

Query: 1449 VMEVSQQYK 1475
            V+EVSQ YK
Sbjct: 821  VLEVSQSYK 829


>ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown
            protein; 43598-45751 [Arabidopsis thaliana]
            gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
            [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1|
            At1g53640/F22G10.8 [Arabidopsis thaliana]
            gi|110740318|dbj|BAF02054.1| hypothetical protein
            [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis thaliana]
          Length = 523

 Score =  303 bits (775), Expect = 2e-79
 Identities = 175/369 (47%), Positives = 221/369 (59%), Gaps = 16/369 (4%)
 Frame = +3

Query: 417  QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 557
            Q +PND+     +FVK  E Q   A S  P  + KP     P+++ N +       SGAG
Sbjct: 161  QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 218

Query: 558  RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXXREQLSQEEKVK 728
            RGKP+   AP  ++   +NR IR+   P   +                 + QLS EE  +
Sbjct: 219  RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 275

Query: 729  KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESDDEASGLY 908
            +A+  LSRGE                                   D + EE + EA  ++
Sbjct: 276  RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 334

Query: 909  LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 1088
             GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEPEY+M
Sbjct: 335  AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 394

Query: 1089 EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKKVPLIKEIVDH 1268
             +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q          M + PL+KEIVDH
Sbjct: 395  PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 454

Query: 1269 YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 1448
            YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL
Sbjct: 455  YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 514

Query: 1449 VMEVSQQYK 1475
            V+EVSQ YK
Sbjct: 515  VLEVSQSYK 523


>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain
            [Arabidopsis thaliana]
          Length = 523

 Score =  303 bits (775), Expect = 2e-79
 Identities = 175/369 (47%), Positives = 221/369 (59%), Gaps = 16/369 (4%)
 Frame = +3

Query: 417  QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 557
            Q +PND+     +FVK  E Q   A S  P  + KP     P+++ N +       SGAG
Sbjct: 161  QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 218

Query: 558  RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXXREQLSQEEKVK 728
            RGKP+   AP  ++   +NR IR+   P   +                 + QLS EE  +
Sbjct: 219  RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 275

Query: 729  KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESDDEASGLY 908
            +A+  LSRGE                                   D + EE + EA  ++
Sbjct: 276  RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 334

Query: 909  LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 1088
             GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEPEY+M
Sbjct: 335  AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 394

Query: 1089 EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXXTMKKVPLIKEIVDH 1268
             +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q          M + PL+KEIVDH
Sbjct: 395  PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 454

Query: 1269 YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 1448
            YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL
Sbjct: 455  YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 514

Query: 1449 VMEVSQQYK 1475
            V+EVSQ YK
Sbjct: 515  VLEVSQSYK 523


>ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum]
            gi|557089350|gb|ESQ30058.1| hypothetical protein
            EUTSA_v10011382mg [Eutrema salsugineum]
          Length = 531

 Score =  301 bits (771), Expect = 6e-79
 Identities = 191/469 (40%), Positives = 238/469 (50%), Gaps = 41/469 (8%)
 Frame = +3

Query: 192  DKKPDNSND-------DGASPLPRGHGRGRGTXXXXXXXXXXXXXXXNNDSKAPPLGRGR 350
            +++P+ +N+          S  P G+G GRG                  DS  P +GRGR
Sbjct: 70   NREPERANEAAGHGRGSSESQSPGGYGHGRGRPIQSDPISPAFSSFVRPDS--PSVGRGR 127

Query: 351  GFIXXXXXXXXXXXXXXXXXXNQPKPN------DKKPLLFVKDDEAQYNAAESEIPAIQE 512
            G +                     +P        + P +F K  E +   +    P  + 
Sbjct: 128  GSVGSDPVSPFAAPSPPPPRDQSHRPQLSSEEQPQSPPVFAKLQEMKDATSSPPPPPTES 187

Query: 513  K-----PL-------------PNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQ 638
            K     PL             PN  I   SGAGRGKP    AP  ++   ENRHIR+ Q 
Sbjct: 188  KSGQTAPLNNIFNGLGSEFSQPNQRIVPGSGAGRGKPFVESAPLQQE---ENRHIRRPQP 244

Query: 639  PKXXXXXXXXXXXXXXXXXREQ----------LSQEEKVKKAKEILSRGEPVXXXXXXXX 788
            P                  R Q          LS EE  ++A+  LSRGE          
Sbjct: 245  PPPQQQQQRSQPQPQHQQKRVQPPKDEAPRPKLSIEEAGRRARSQLSRGEAEGGGLRGRG 304

Query: 789  XXXXXXXXXXXXXXXXXXXXXXXXXDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMN 968
                                      +  EE++ EA   ++GD AD EK A K+GPE+M 
Sbjct: 305  GGRGRGRGARGRGRGRGGEGWRDVKME--EEAEQEAISTFVGDSADGEKFANKMGPEIMK 362

Query: 969  KLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDA 1148
             LA+G+E++  R LPS  +DA LDAY TNLMIECEPEYLM  FG+NPDIDEKPP+ LR+ 
Sbjct: 363  MLADGYEDICERALPSTANDAVLDAYETNLMIECEPEYLMPAFGSNPDIDEKPPMSLREC 422

Query: 1149 LEKMKPFLMAYEGIQSQXXXXXXXXXTMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVA 1328
            LEK+KPF++AYEGI+ Q          M + PLIKEIVDHYSGPDRVTAK+Q EEL+R+A
Sbjct: 423  LEKVKPFIVAYEGIKDQEEWEEAIDEVMAQAPLIKEIVDHYSGPDRVTAKKQNEELDRIA 482

Query: 1329 KTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 1475
             T+P SAPD VKRF +RA LSL+SNPGWGFDKK QFMDKLV EVSQ YK
Sbjct: 483  TTVPKSAPDSVKRFADRAALSLKSNPGWGFDKKYQFMDKLVAEVSQSYK 531


>ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca
            subsp. vesca]
          Length = 464

 Score =  301 bits (771), Expect = 6e-79
 Identities = 149/206 (72%), Positives = 170/206 (82%), Gaps = 3/206 (1%)
 Frame = +3

Query: 867  DRYEESDDE---ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYL 1037
            DR    D++   ASGLYLGD AD EK+A+KLGPEVMN+L E FE+MS+ VLPSPLDDAY+
Sbjct: 259  DRRRRGDEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYV 318

Query: 1038 DAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXX 1217
            DA  TN  IE EPEYLM EF  NPDIDE+PPIPLRDALEKMKPFLMAYEGIQSQ      
Sbjct: 319  DALDTNCKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEA 378

Query: 1218 XXXTMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQ 1397
               TM++VPL+K+IVDHYSGPDRVTAK+Q+EELERVAKTLPA+ PD VK+FT+RAVLSLQ
Sbjct: 379  IKETMERVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQ 438

Query: 1398 SNPGWGFDKKCQFMDKLVMEVSQQYK 1475
             NPGWGF +KCQFMDKL  +VS+ YK
Sbjct: 439  GNPGWGFHRKCQFMDKLTQKVSKHYK 464


Top