BLASTX nr result

ID: Rehmannia25_contig00006865 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00006865
         (1877 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   387   e-105
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   384   e-104
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...   357   1e-95
gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, ...   352   2e-94
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]       348   5e-93
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   342   3e-91
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...   327   1e-86
gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus...   319   2e-84
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...   318   5e-84
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   318   7e-84
gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus pe...   317   1e-83
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...   315   3e-83
emb|CBI17195.3| unnamed protein product [Vitis vinifera]              308   5e-81
ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps...   306   2e-80
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...   303   1e-79
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...   303   2e-79
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...   303   2e-79
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...   303   2e-79
ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutr...   301   6e-79
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300...   301   6e-79

>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  387 bits (995), Expect = e-105
 Identities = 227/438 (51%), Positives = 273/438 (62%), Gaps = 10/438 (2%)
 Frame = -1

Query: 1679 DKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXXX 1500
            D KP++S     +P   GHGRGRG               ++N +  PP GRGRG I    
Sbjct: 58   DSKPESSTP--TTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPN--PPAGRGRGGIGPFS 113

Query: 1499 XXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDE-AQYNAAESEIPAIQEKP-LPNDVINVI 1326
                           Q +   +KP+ F K++E A  N++ S+ P  ++   L + VI+V+
Sbjct: 114  PPPQPQQQQ-----QQQQQPLRKPIFFAKEEETADSNSSSSDAPTPRDDSNLSSSVISVL 168

Query: 1325 SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKV 1146
            +GAGRGKP+++ +P SEKPK ENRH+R RQQ                    ++LS+E+ V
Sbjct: 169  TGAGRGKPLQTASPVSEKPKEENRHLRPRQQKVADSGERASSPPP------QRLSREDAV 222

Query: 1145 KKAKEILSRGEP------VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESD 984
            KKA  ILSR +       V                                   R EE  
Sbjct: 223  KKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERG 282

Query: 983  DEA--SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLM 810
            D +  SG YLGD AD EK+AQKLGPE MN LAEGFEEMS+RVLPSP+DDAY++A HTN+M
Sbjct: 283  DGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLPSPMDDAYIEALHTNMM 342

Query: 809  IECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKV 630
            IECEPEYLM +F +NPDIDE PPIPLRDALEKMKPFLMAYEGI+ Q        ETM+ V
Sbjct: 343  IECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMETV 402

Query: 629  PLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFD 450
            PL+KEIVD+YSGPDRVTAKQQQ+ELERVAKTLP SAP+ VKRFTERAVLSLQSNPGWGFD
Sbjct: 403  PLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFD 462

Query: 449  KKCQFMDKLVMEVSQQYK 396
            KKCQFMDK+VME SQ YK
Sbjct: 463  KKCQFMDKVVMEASQHYK 480


>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
            lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED:
            uncharacterized protein LOC101247662 isoform 2 [Solanum
            lycopersicum]
          Length = 473

 Score =  384 bits (987), Expect = e-104
 Identities = 227/434 (52%), Positives = 270/434 (62%), Gaps = 6/434 (1%)
 Frame = -1

Query: 1679 DKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXXX 1500
            D KP++S    A+P   GHGRGRG               ++N +   P GRGRG I    
Sbjct: 58   DSKPESSTP--ATPSGTGHGRGRGKPLPSSPIVPSFHSFVDNPNT--PAGRGRGGIGPFS 113

Query: 1499 XXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEA-QYNAAESEIPAIQEKP-LPNDVINVI 1326
                           Q +   +KP+ F K++E    N++ S  P  ++   LP+ VI+V+
Sbjct: 114  PPPQPQ--------QQQQQPLRKPIFFAKEEETTDSNSSSSNAPKPRDDSNLPSSVISVL 165

Query: 1325 SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKV 1146
            +GAGRGKP+++ +  SEKPK ENRH+R RQQ                    ++LS+E+ V
Sbjct: 166  TGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSGERASSPPP------QRLSREDAV 219

Query: 1145 KKAKEILSRGEP--VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDE-- 978
            KKA  ILSR +   V                                   R EE  D   
Sbjct: 220  KKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGRGRRDEERGDGNL 279

Query: 977  ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECE 798
             SG YLGD AD EK+A KLGPE MN LAEGFEEMS+RVLPSP+DDAYL+A HTN+MIECE
Sbjct: 280  ESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYLEALHTNMMIECE 339

Query: 797  PEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIK 618
            PEYLM +F +NPDIDE PPIPLRDALEKMKPFLMAYEGI+ Q        ETM+ VPL+K
Sbjct: 340  PEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEVIKETMETVPLMK 399

Query: 617  EIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQ 438
            EIVD+YSGPDRVTAKQQQ+ELERVAKTLP SAP+ VKRFTERAVLSLQSNPGWGFDKKCQ
Sbjct: 400  EIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFDKKCQ 459

Query: 437  FMDKLVMEVSQQYK 396
            FMDK+VMEVSQ YK
Sbjct: 460  FMDKVVMEVSQHYK 473


>ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera]
          Length = 482

 Score =  357 bits (916), Expect = 1e-95
 Identities = 221/461 (47%), Positives = 268/461 (58%), Gaps = 21/461 (4%)
 Frame = -1

Query: 1715 ATPFQFTADSPSDKKP--DNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKA 1542
            A+PF F + +P   +P  D +++   SP P G G GRG                 +   +
Sbjct: 42   ASPFDFASGAPEKTEPTADPNSESSESPFPLGLGHGRGKPPSQPSAPTLPSF---SSFAS 98

Query: 1541 PPLGRGRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAA-ESEIPAI 1365
              +GRGRG +                    P    KKP+ F K+D A      +S++   
Sbjct: 99   TGIGRGRGRLTAHPTDSVPQ--------QSPDFAPKKPIFFSKEDAADSAPKPQSQLGTT 150

Query: 1364 --QEKPLPNDVINVISG-AGRGKPMK-SPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXX 1197
              +E  LP  +++ +SG AGRG+P+K +PAP    PK ENRH+RQ +QP           
Sbjct: 151  PPEENNLPVSILSALSGGAGRGQPLKQTPAP----PKEENRHLRQPRQP-----VFRSPQ 201

Query: 1196 XXXXXXPREQLSQEEKVKKAKEILSRG----------EPVXXXXXXXXXXXXXXXXXXXX 1047
                  P+ +LS+EE VKKA  ILSRG          E                      
Sbjct: 202  QPVAGPPQPRLSREEAVKKAVGILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWM 261

Query: 1046 XXXXXXXXXXXXGDDRY----EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEE 879
                          DR     +  DD  +GLYLGD AD EK++ K+G E M+KL E FEE
Sbjct: 262  GRGRGRGRGRGRMGDRRGRGGDAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEE 321

Query: 878  MSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFL 699
            MS RVLPSP++DAYLDA HTN +IE EPEYLMEEFGTNPDIDE PPIPLRDALEKMKPFL
Sbjct: 322  MSGRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFL 381

Query: 698  MAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAP 519
            M YEGIQSQ        ETM+ VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP
Sbjct: 382  MQYEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAP 441

Query: 518  DPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396
            + VKRFT+RA+LSLQSNPGWGFDKKCQFMDKLV EVSQ YK
Sbjct: 442  NSVKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHYK 482


>gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao]
          Length = 474

 Score =  352 bits (904), Expect = 2e-94
 Identities = 220/447 (49%), Positives = 251/447 (56%), Gaps = 17/447 (3%)
 Frame = -1

Query: 1685 PSDKKPDNSNDDGASPLPRG--HGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFI 1512
            P      +SN D A   P G  HGRGRG                   S     G GRG +
Sbjct: 62   PGKSGSGDSNRDSAESPPAGVGHGRGRGGPLSSDPIPHPF------SSFVSQTGSGRGRV 115

Query: 1511 XXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVK---DDEAQYNAAESEIPAIQEKPL--P 1347
                                P P   K  +F+K   +DE + +A  +  P    +P+  P
Sbjct: 116  TSESVPPPP-----------PPPAQAKQPIFIKKKDEDETESSAKAAAEPIQSSEPIFPP 164

Query: 1346 NDV-INVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPRE 1170
            N + ++V+SGAGRGKP+K P P S + + ENRHIR  QQ                     
Sbjct: 165  NILPVSVLSGAGRGKPVKQPEPASRRQE-ENRHIRVAQQQSPSA---------------- 207

Query: 1169 QLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEE 990
            Q+SQEE  KKA  ILSR                                    G  R + 
Sbjct: 208  QMSQEEATKKAMGILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQG 267

Query: 989  SDDE---------ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAY 837
             D           A GLYLGD AD EK AQ +G + MNKL EGFEEM SRVLPSP+DDAY
Sbjct: 268  EDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAY 327

Query: 836  LDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXX 657
            LDA HTN  IE EPEYLMEEFGTNPDIDEKPP+PLRDALEKMKPFLMAYEGIQSQ     
Sbjct: 328  LDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMPLRDALEKMKPFLMAYEGIQSQEEWEE 387

Query: 656  XXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSL 477
               ETM++VPL++EIVD+YSGPDRVTAK+QQEELERVAKT+P  AP  VK+F  RAVLSL
Sbjct: 388  VIKETMERVPLLQEIVDYYSGPDRVTAKKQQEELERVAKTIPERAPSSVKQFANRAVLSL 447

Query: 476  QSNPGWGFDKKCQFMDKLVMEVSQQYK 396
            QSNPGWGFDKKCQFMDKLV EVSQQYK
Sbjct: 448  QSNPGWGFDKKCQFMDKLVWEVSQQYK 474


>gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]
          Length = 426

 Score =  348 bits (893), Expect = 5e-93
 Identities = 211/441 (47%), Positives = 252/441 (57%), Gaps = 3/441 (0%)
 Frame = -1

Query: 1709 PFQFTADSPSDKKPDNSNDDGASPL---PRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAP 1539
            P  F ++ PS      ++    SP    P G GRGR                ++NDS AP
Sbjct: 38   PNTFASNKPSGSVELGNSKIDDSPTTAPPYGRGRGRIQPLPSSPLLPSFASIVSNDSGAP 97

Query: 1538 PLGRGRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQE 1359
            P+G GRG I                    P P D   L                      
Sbjct: 98   PIGGGRGKIPTRPPLP-------------PPPRDTAAL---------------------- 122

Query: 1358 KPLPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXX 1179
                +D++  +SG GRG P K P PQ+ KP   NRHIRQ  QP+                
Sbjct: 123  ----DDILTNLSGMGRGTPGKPP-PQTLKPTPINRHIRQ-PQPRPSTALSPD-------- 168

Query: 1178 PREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDR 999
              +QLS+EEK+KKA EILSRG+P                                   D 
Sbjct: 169  --QQLSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRGGRFSGRGRGREADA 226

Query: 998  YEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHT 819
              ESD+E  G++ GDPADE+K+A+KLG EVMNK+ EG EEMSSRVLPS +DDAY+DAYHT
Sbjct: 227  AIESDEELPGMF-GDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHT 285

Query: 818  NLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETM 639
            NL++ECEPEY ME+FGTNPDID+KPPIPLR+A EKMKPFLM + GI++Q        ETM
Sbjct: 286  NLLLECEPEYFMEDFGTNPDIDDKPPIPLREAFEKMKPFLMQHIGIETQEEWEQIIEETM 345

Query: 638  KKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGW 459
            + VP  K+I+DHY+GPDRVTA QQ  ELERVA TLPA+AP  VKRFTERAVLSL+SNPGW
Sbjct: 346  ESVPRWKKIIDHYAGPDRVTALQQIGELERVAGTLPATAPASVKRFTERAVLSLKSNPGW 405

Query: 458  GFDKKCQFMDKLVMEVSQQYK 396
            GF KKCQFMDK+VMEVSQQYK
Sbjct: 406  GFKKKCQFMDKVVMEVSQQYK 426


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  342 bits (877), Expect = 3e-91
 Identities = 191/345 (55%), Positives = 223/345 (64%), Gaps = 12/345 (3%)
 Frame = -1

Query: 1394 NAAESEIPAIQEKPLPNDVINVISGAGRGKPMKSPAPQSEK----------PKAENRHIR 1245
            +A +S  P+  E  LP+ +I+ + GAGRGK   +   Q ++          P+ ENRHIR
Sbjct: 68   SATDSTQPS--EPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIR 125

Query: 1244 QRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXX 1065
             R QP+                 + +LS+E+ VK A ++LSRGE                
Sbjct: 126  ARLQPQPRPEKAPAAETGSA---QPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGR 182

Query: 1064 XXXXXXXXXXXXXXXXXXGDDRYEESDDEA--SGLYLGDPADEEKMAQKLGPEVMNKLAE 891
                                 +  E D++    GLYLGD AD EK+A+K+G E MN L E
Sbjct: 183  GMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVE 242

Query: 890  GFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 711
            GFEEMS RVLPSP++DAY+DA HTN MIE EPEYLMEEFGTNPDIDEKPPIPLRDALEKM
Sbjct: 243  GFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKM 302

Query: 710  KPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLP 531
            KPFLMAYEGIQSQ        E M++VPL+KEIVDHYSGPDRVTAKQQ EELERVAKT+P
Sbjct: 303  KPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIP 362

Query: 530  ASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396
             SAP  +KRF  RAVLSLQSNPGWGFDKKCQFMDKL  EVSQQYK
Sbjct: 363  ESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKLAWEVSQQYK 407


>ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max]
            gi|571476117|ref|XP_006586864.1| PREDICTED: la-related
            protein 1 isoform X2 [Glycine max]
          Length = 481

 Score =  327 bits (837), Expect = 1e-86
 Identities = 194/439 (44%), Positives = 238/439 (54%), Gaps = 15/439 (3%)
 Frame = -1

Query: 1667 DNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXXXXXXX 1488
            ++ +D    P+P G G G G                 +    PP GRGRG          
Sbjct: 64   ESKSDTTEPPIPPGSGLGHGRGKPMPPSGLPSFSSFISSINQPPAGRGRG---------- 113

Query: 1487 XXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPA------IQEKPLPNDVINVI 1326
                        P    KKP+ F ++D     A+   +P         +  LP  +  V+
Sbjct: 114  -TAPHPQHDLQPPDSGPKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDNKLPGSIPGVL 172

Query: 1325 SGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQLSQEEKV 1146
            SG GRGK MK P  +++  + ENRH+R RQ P                      SQE+  
Sbjct: 173  SGLGRGKSMKQPDLETQVTE-ENRHLRTRQAPGAASSETVPKRSPIP-------SQEDAT 224

Query: 1145 KKAKEILSRGEP---------VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYE 993
            + A +ILS G+                                            D++  
Sbjct: 225  RNALKILSHGKDDGSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDVDEKVM 284

Query: 992  ESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNL 813
            ++DD A+GLY GD AD EK+A+K+GPE+MN+L EGFEEM+SRVLPSPL+D +LDA   N 
Sbjct: 285  DTDDYATGLYAGDDADGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDALDINY 344

Query: 812  MIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKK 633
             IE EPEYL+E    NPDIDEK PI LRDALEK KPFLM+YEGIQSQ        ETM +
Sbjct: 345  AIEFEPEYLVEF--DNPDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIMEETMAR 402

Query: 632  VPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGF 453
            VPL+K+I+DHYSGPDRVTAK+QQEELERVAKTLP S P  VK+FT RAV+SLQSNPGWGF
Sbjct: 403  VPLLKKIIDHYSGPDRVTAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSNPGWGF 462

Query: 452  DKKCQFMDKLVMEVSQQYK 396
            DKKC FMDKLV EVSQ YK
Sbjct: 463  DKKCHFMDKLVWEVSQHYK 481


>gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score =  319 bits (818), Expect = 2e-84
 Identities = 216/498 (43%), Positives = 262/498 (52%), Gaps = 60/498 (12%)
 Frame = -1

Query: 1709 PFQFTADSPSDKKPDNS---NDDGASPLP----RGHGRGRGTXXXXXXXXXXXXXXLN-- 1557
            PF F   +P   KP++S   +D   SP+P     GHGRG+                +N  
Sbjct: 49   PFNFNERAPG--KPNSSEPKSDTTESPIPPGSGHGHGRGKPMPPSGLPSFSSFLSSINQP 106

Query: 1556 -------------NDSKAP-----------------PLGRGRGFIXXXXXXXXXXXXXXX 1467
                         ND ++P                 P GRGR  +               
Sbjct: 107  PAGRGRPTVPHHQNDLQSPAGRGRPTVPHHQNDLQSPAGRGRPTVPRHQNDLQSPAGRGR 166

Query: 1466 XXPNQPKPND--------KKPLLFVKDDEAQYNAAES-EIPAIQEKPLPNDVINVISGAG 1314
                QP PND        KKP+ F ++D A     +   I   Q   LP ++I V+SG G
Sbjct: 167  ATVPQP-PNDLGPPDSGPKKPIFFKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLG 225

Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPREQL-SQEEKVKKA 1137
            RGKPMK   P++   + ENRH+R  +                    R+ + S+++ V+ A
Sbjct: 226  RGKPMKQSDPETRVTE-ENRHLRAPRA--------RGAAASDTLYERQPIPSRDDAVRNA 276

Query: 1136 KEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDD-------- 981
            +  LS+GE                                  G  R  + D+        
Sbjct: 277  RNFLSQGEDDVGGTGRGRGFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDA 336

Query: 980  EAS---GLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLM 810
            EAS   G Y+GD AD EK+A+K+GPE+MN+L EGFEEM+ RVLPSPL+D YLDA   N  
Sbjct: 337  EASDDIGPYVGDDADGEKLAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYA 396

Query: 809  IECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKV 630
            IE EPEYL+E    NPDIDEK PIPLRDALEKMKPFLMAYEGIQSQ        ETM +V
Sbjct: 397  IEFEPEYLVEF--DNPDIDEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQV 454

Query: 629  PLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFD 450
            PL+KEIVDHYSGPDRVTAK+QQEELERVAKTLP SAP  VK+FT RAV+SLQSNPGWGFD
Sbjct: 455  PLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFD 514

Query: 449  KKCQFMDKLVMEVSQQYK 396
            KKC FMDKLV EVSQ YK
Sbjct: 515  KKCHFMDKLVWEVSQHYK 532


>ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum]
          Length = 504

 Score =  318 bits (815), Expect = 5e-84
 Identities = 204/455 (44%), Positives = 247/455 (54%), Gaps = 26/455 (5%)
 Frame = -1

Query: 1682 SDKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGRGFIXXX 1503
            S++    + D   SP   G G GRG               L +  K P +GRGRGF    
Sbjct: 65   SNESKSEATDSPFSPPGAGRGHGRGGSVPPPTGFPSFSSFLTS-IKQPSIGRGRGF---- 119

Query: 1502 XXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQEKPL--------P 1347
                            QP    KKP+LF  +D       + ++    +KP+        P
Sbjct: 120  -GPSPFQPENDTQQLQQPDSVPKKPVLFRSEDSVSQTGGKDDVSP-PKKPVFTRREDFSP 177

Query: 1346 ND--------------VINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXX 1209
             D              V+ V+SGAGRGKP++ PA    +   ENRH+R R+         
Sbjct: 178  IDLSSDQESDNRFSMSVLKVLSGAGRGKPIE-PAVSETQVVEENRHVRNRRASDVPMRQP 236

Query: 1208 XXXXXXXXXXPREQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXX 1029
                       R+ LS+ +           GEP                           
Sbjct: 237  MLTGDGALQNARKYLSKFDGDGSGSG--RGGEP----RERGAFGRGRGRGRGRGRGRGRG 290

Query: 1028 XXXXXXGDDRYEESDDEA----SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVL 861
                  GDDR+ +  D A    SGL+LGD  D EK+A+K+GPEVMN+  EGFEEM SRVL
Sbjct: 291  GFRGTGGDDRFGQIQDNARSNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVL 350

Query: 860  PSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGI 681
            PSPL+D Y++A+  N  IE EPEY+ME F +NPDIDEK PIPLRDALEKMKPFLM YEGI
Sbjct: 351  PSPLEDEYVEAFDINCAIEFEPEYIME-FDSNPDIDEKEPIPLRDALEKMKPFLMNYEGI 409

Query: 680  QSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRF 501
            QSQ        ETM++VPL+K+IVDHYSGPDRVTAK+QQEELERVAKTLPASAP  V +F
Sbjct: 410  QSQEEWEAIMEETMERVPLLKKIVDHYSGPDRVTAKKQQEELERVAKTLPASAPSSVVQF 469

Query: 500  TERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396
            T RAV+SLQSNPGWGFDKKCQFMDKLV EVSQ +K
Sbjct: 470  TNRAVMSLQSNPGWGFDKKCQFMDKLVFEVSQHHK 504


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
            gi|223537066|gb|EEF38701.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 436

 Score =  318 bits (814), Expect = 7e-84
 Identities = 178/322 (55%), Positives = 210/322 (65%), Gaps = 4/322 (1%)
 Frame = -1

Query: 1352 LPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXPR 1173
            LP+ + + +SG GRG+P K   P + + K ENRHIR R + K                 +
Sbjct: 133  LPSTIHSSLSGFGRGEPDKPVVP-TPQVKEENRHIRDRSRAKPKTEEAEVRA-------K 184

Query: 1172 EQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYE 993
             ++S+EE VK+A  ILS+G+                                   + R  
Sbjct: 185  PKISREEAVKRAVSILSQGDT-----------GEGMGRGRGGGRGRGRGRGRGRLEQRGR 233

Query: 992  ESDDE----ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 825
              DD      SGL+LGD AD EK+A K+G E MNKL EG+EEMS RVLPSP++DAYLDA 
Sbjct: 234  MMDDVDEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDAL 293

Query: 824  HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXE 645
            HTN MIE EPEYLM EF  NPDIDEKPP+PLRD LEK+KPF+MAYEGIQSQ        E
Sbjct: 294  HTNYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEE 353

Query: 644  TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 465
            TMK VPL KEIVD+YSGPDR+TAK+Q+EELERVA T+PASAP  VKRF +RAVLSLQSNP
Sbjct: 354  TMKNVPLFKEIVDYYSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNP 413

Query: 464  GWGFDKKCQFMDKLVMEVSQQY 399
            GWGFDKKCQFMDKLV EV+Q Y
Sbjct: 414  GWGFDKKCQFMDKLVREVNQCY 435


>gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score =  317 bits (812), Expect = 1e-83
 Identities = 159/202 (78%), Positives = 173/202 (85%), Gaps = 1/202 (0%)
 Frame = -1

Query: 1001 RYEESDDE-ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAY 825
            R ++SD   ASGLYLGD AD EK+A+KLGPE+MNKL E FEEMSS VLPSPLDDAY+DA 
Sbjct: 226  RGKDSDGSYASGLYLGDNADGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAM 285

Query: 824  HTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXE 645
            HTN MIECEPEYLM EF  NPDIDEKPPI LRDALEKMKPFLMAYE I+SQ        E
Sbjct: 286  HTNFMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNE 345

Query: 644  TMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNP 465
            TM++VPL+KEIVDHYSGPDRVTAK+QQEELERVAKTLPA  PD VKRFT+RAVLSLQSNP
Sbjct: 346  TMERVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNP 405

Query: 464  GWGFDKKCQFMDKLVMEVSQQY 399
            GWGFD+KCQFMDKLV +VSQ Y
Sbjct: 406  GWGFDRKCQFMDKLVAKVSQHY 427


>ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus]
            gi|449502143|ref|XP_004161555.1| PREDICTED:
            uncharacterized protein LOC101224016 [Cucumis sativus]
          Length = 478

 Score =  315 bits (808), Expect = 3e-83
 Identities = 195/449 (43%), Positives = 242/449 (53%), Gaps = 11/449 (2%)
 Frame = -1

Query: 1709 PFQFTADSPSDKKPDNSNDDGASPLPR---GHGRGRGTXXXXXXXXXXXXXXLNNDSKAP 1539
            PF FT   P+ +  + S  +     P    GHGRG+ T                  S   
Sbjct: 50   PFDFTPPVPNQEHSNASKQEPIDSRPTPGLGHGRGKPTPSSPLRPSFSSFSPSVRPSS-- 107

Query: 1538 PLGRGRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQ- 1362
             +GRGRG                   P +P    KKP+ F K++    +AA + +  +  
Sbjct: 108  -VGRGRG-----------DASPSIRSPPEPDSEPKKPVFFSKNNAGD-SAASTSLGGLHR 154

Query: 1361 ---EKPLPNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXX 1191
               E+ LP  + +  SG GRGKPMK P P+ ++PK ENRH+R RQ+              
Sbjct: 155  VSGERNLPESLHSEFSGVGRGKPMKQPVPE-DQPKQENRHLRPRQE----GDGPGAGERG 209

Query: 1190 XXXXPREQLSQEEKVKKAKEILSR----GEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1023
                   ++ + E  +    ++S+    GE                              
Sbjct: 210  RGRGFEPRIGRGEPWRNTNRMVSKDGPDGEVGGGRGTSGYRGRGARGPYRRGARGSFRTG 269

Query: 1022 XXXXGDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDD 843
                    +++ D  A+GLYLG+  D E++A+++G E MNKL EGFEEMS RVLPSPL D
Sbjct: 270  ERRERRSGHDKEDGYAAGLYLGNNEDGERLAKRIGTENMNKLVEGFEEMSGRVLPSPLVD 329

Query: 842  AYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXX 663
             YLD   TN MIECEPEYLM +F  NPDIDE PPIPLRDALEKMKPFLMAYE IQS    
Sbjct: 330  QYLDGMDTNFMIECEPEYLMGDFENNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEW 389

Query: 662  XXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVL 483
                 ETM+ VPL+KEIVD Y GPDRVTAK+QQ ELERVAKTLP SAP+ VK+FT R VL
Sbjct: 390  EEIVEETMQSVPLLKEIVDAYGGPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRVVL 449

Query: 482  SLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396
            SLQSNPGWGFDKK Q MDKLV   S++YK
Sbjct: 450  SLQSNPGWGFDKKWQLMDKLVEGFSKRYK 478


>emb|CBI17195.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score =  308 bits (789), Expect = 5e-81
 Identities = 151/200 (75%), Positives = 169/200 (84%)
 Frame = -1

Query: 995 EESDDEASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTN 816
           +  DD  +GLYLGD AD EK++ K+G E M+KL E FEEMS RVLPSP++DAYLDA HTN
Sbjct: 10  DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 69

Query: 815 LMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMK 636
            +IE EPEYLMEEFGTNPDIDE PPIPLRDALEKMKPFLM YEGIQSQ        ETM+
Sbjct: 70  CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 129

Query: 635 KVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWG 456
            VP +KE+VD+YSGPDRVTAK+QQEELERVAKTLP +AP+ VKRFT+RA+LSLQSNPGWG
Sbjct: 130 NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 189

Query: 455 FDKKCQFMDKLVMEVSQQYK 396
           FDKKCQFMDKLV EVSQ YK
Sbjct: 190 FDKKCQFMDKLVWEVSQHYK 209


>ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella]
            gi|482575944|gb|EOA40131.1| hypothetical protein
            CARUB_v10008838mg [Capsella rubella]
          Length = 525

 Score =  306 bits (785), Expect = 2e-80
 Identities = 178/373 (47%), Positives = 223/373 (59%), Gaps = 19/373 (5%)
 Frame = -1

Query: 1457 NQPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQ--EKPLPNDVINVI-------SGAGR 1311
            +QP+PND+     +FVK  E +   +    P  +  +  LP++V N +       SGAGR
Sbjct: 164  SQPQPNDESQGSPVFVKLQEMKDVTSSPPAPESKSGQTDLPDNVFNALGSEIPHSSGAGR 223

Query: 1310 GKPMKSPAPQSEKPKAENRHIR--------QRQQPKXXXXXXXXXXXXXXXXPREQLSQE 1155
            GKP+   AP   +   ENRHIR        QR QP+                 R +LS E
Sbjct: 224  GKPLVESAPIQRE---ENRHIRRPPPPPQQQRSQPQQKRAQTPRDETP-----RPRLSAE 275

Query: 1154 EKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEA 975
            E  ++A+  LSRGE                                   D + EE + EA
Sbjct: 276  EAGRRARSELSRGEA---EGSGVRGRGGRGRGRGARGRGRGRGGEGWRDDKKEEEGEQEA 332

Query: 974  SGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEP 795
              ++ GD AD EK A K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEP
Sbjct: 333  MSVFAGDSADGEKFANKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIECEP 392

Query: 794  EYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKE 615
            EY+M +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q        E M + PL+KE
Sbjct: 393  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMAQAPLMKE 452

Query: 614  IVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQF 435
            IVDHYSGPDRVTAK+Q EEL+R+A TLP SAPD VKRF +RA L+L+SNPGWGFDKK QF
Sbjct: 453  IVDHYSGPDRVTAKKQNEELDRIATTLPKSAPDSVKRFADRAALTLKSNPGWGFDKKYQF 512

Query: 434  MDKLVMEVSQQYK 396
            MDKLV+EVSQ YK
Sbjct: 513  MDKLVLEVSQSYK 525


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550322664|gb|EEF06007.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 466

 Score =  303 bits (777), Expect = 1e-79
 Identities = 189/442 (42%), Positives = 234/442 (52%), Gaps = 6/442 (1%)
 Frame = -1

Query: 1703 QFTADSPSDKKPDNSNDDGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPP-LGR 1527
            ++ A +P     D S  + +   P G G GRG               +++   + P  GR
Sbjct: 56   EYGAAAPGKPDLDESKTESSESQPSGLGHGRGKPVGTGPILPAFSTFISSVKNSQPGAGR 115

Query: 1526 GRGFIXXXXXXXXXXXXXXXXXPNQPKPNDKKPLLFVKDDEAQYNAAESEIPAIQEKPLP 1347
            GRG                     +P P+            +  +  ESE P   E  LP
Sbjct: 116  GRG-------------------TTEPGPS-----------RSTESRPESEPPKKAEANLP 145

Query: 1346 NDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQPKXXXXXXXXXXXXXXXXP--R 1173
              +++ + GAGRGKP+K   P  E  K ENRH+R R QP+                    
Sbjct: 146  PSILSGLGGAGRGKPVKQEVP-IEPAKEENRHLRARSQPRSQPRTRQQKTPDGDDAVPAT 204

Query: 1172 EQLSQEEKVKKAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDR-Y 996
             ++ ++E VKKA E+LSRG                                      R Y
Sbjct: 205  TKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARGGGRGRGRGRRGY 264

Query: 995  EESDDE-ASGLYL-GDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYH 822
             + + E  SG+ L G   DEEK AQ +G E MN L E FEEMS RVLP P++D Y+DA+ 
Sbjct: 265  GDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLPCPIEDEYVDAFD 324

Query: 821  TNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXET 642
            TN   E EPEYLM EF  NPDIDEKPP+PLRDALEK+KPF+MAY GI++         ET
Sbjct: 325  TNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIKTHEEWEEIVEET 384

Query: 641  MKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPG 462
            MK  PL+K+IVD YSGPDRV+ K+Q+EELERVAKT+PASAPD VK F +RAVLSLQSNPG
Sbjct: 385  MKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFADRAVLSLQSNPG 444

Query: 461  WGFDKKCQFMDKLVMEVSQQYK 396
            WGFDKKC FMDKL  EVSQ YK
Sbjct: 445  WGFDKKCMFMDKLAKEVSQHYK 466


>gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain.
            ESTs gb|H37317, gb|F14415, gb|AA651290 come from this
            gene [Arabidopsis thaliana]
          Length = 829

 Score =  303 bits (775), Expect = 2e-79
 Identities = 177/369 (47%), Positives = 223/369 (60%), Gaps = 16/369 (4%)
 Frame = -1

Query: 1454 QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 1314
            Q +PND+     +FVK  E Q   A S  P  + KP     P+++ N +       SGAG
Sbjct: 467  QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 524

Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXPREQLSQEEKVK 1143
            RGKP+   AP  ++   +NR IR+   P   +                P+ QLS EE  +
Sbjct: 525  RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 581

Query: 1142 KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLY 963
            +A+  LSRGE                                   D + EE + EA  ++
Sbjct: 582  RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 640

Query: 962  LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 783
             GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEPEY+M
Sbjct: 641  AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 700

Query: 782  EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDH 603
             +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q        E M + PL+KEIVDH
Sbjct: 701  PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 760

Query: 602  YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 423
            YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL
Sbjct: 761  YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 820

Query: 422  VMEVSQQYK 396
            V+EVSQ YK
Sbjct: 821  VLEVSQSYK 829


>ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown
            protein; 43598-45751 [Arabidopsis thaliana]
            gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
            [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1|
            At1g53640/F22G10.8 [Arabidopsis thaliana]
            gi|110740318|dbj|BAF02054.1| hypothetical protein
            [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis thaliana]
          Length = 523

 Score =  303 bits (775), Expect = 2e-79
 Identities = 177/369 (47%), Positives = 223/369 (60%), Gaps = 16/369 (4%)
 Frame = -1

Query: 1454 QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 1314
            Q +PND+     +FVK  E Q   A S  P  + KP     P+++ N +       SGAG
Sbjct: 161  QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 218

Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXPREQLSQEEKVK 1143
            RGKP+   AP  ++   +NR IR+   P   +                P+ QLS EE  +
Sbjct: 219  RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 275

Query: 1142 KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLY 963
            +A+  LSRGE                                   D + EE + EA  ++
Sbjct: 276  RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 334

Query: 962  LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 783
             GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEPEY+M
Sbjct: 335  AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 394

Query: 782  EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDH 603
             +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q        E M + PL+KEIVDH
Sbjct: 395  PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 454

Query: 602  YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 423
            YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL
Sbjct: 455  YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 514

Query: 422  VMEVSQQYK 396
            V+EVSQ YK
Sbjct: 515  VLEVSQSYK 523


>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain
            [Arabidopsis thaliana]
          Length = 523

 Score =  303 bits (775), Expect = 2e-79
 Identities = 177/369 (47%), Positives = 223/369 (60%), Gaps = 16/369 (4%)
 Frame = -1

Query: 1454 QPKPNDKKP--LLFVKDDEAQYNAAESEIPAIQEKP----LPNDVINVI-------SGAG 1314
            Q +PND+     +FVK  E Q   A S  P  + KP     P+++ N +       SGAG
Sbjct: 161  QQQPNDESQGSPVFVKLQEMQ--DATSSPPPPESKPGQADPPDNIFNALGNEFSHPSGAG 218

Query: 1313 RGKPMKSPAPQSEKPKAENRHIRQRQQP---KXXXXXXXXXXXXXXXXPREQLSQEEKVK 1143
            RGKP+   AP  ++   +NR IR+   P   +                P+ QLS EE  +
Sbjct: 219  RGKPLVESAPIRQE---DNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSAEEAGR 275

Query: 1142 KAKEILSRGEPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLY 963
            +A+  LSRGE                                   D + EE + EA  ++
Sbjct: 276  RARSELSRGE-AEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEAMRIF 334

Query: 962  LGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLM 783
             GD AD EK A+K+GPE+M  LAEGFEE+  + LPS   DA +DAY TNLMIECEPEY+M
Sbjct: 335  AGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIM 394

Query: 782  EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDH 603
             +FG+NPDIDEKPP+ LR+ LEK+KPF++AYEGI+ Q        E M + PL+KEIVDH
Sbjct: 395  PDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDH 454

Query: 602  YSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKL 423
            YSGPDRVTAK+Q EEL+R+A TLPASAPD VKRF +RA L+L+SNPGWGFDKK QFMDKL
Sbjct: 455  YSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKL 514

Query: 422  VMEVSQQYK 396
            V+EVSQ YK
Sbjct: 515  VLEVSQSYK 523


>ref|XP_006392772.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum]
            gi|557089350|gb|ESQ30058.1| hypothetical protein
            EUTSA_v10011382mg [Eutrema salsugineum]
          Length = 531

 Score =  301 bits (771), Expect = 6e-79
 Identities = 192/469 (40%), Positives = 240/469 (51%), Gaps = 41/469 (8%)
 Frame = -1

Query: 1679 DKKPDNSND-------DGASPLPRGHGRGRGTXXXXXXXXXXXXXXLNNDSKAPPLGRGR 1521
            +++P+ +N+          S  P G+G GRG               +  DS  P +GRGR
Sbjct: 70   NREPERANEAAGHGRGSSESQSPGGYGHGRGRPIQSDPISPAFSSFVRPDS--PSVGRGR 127

Query: 1520 GFIXXXXXXXXXXXXXXXXXPNQPKPN------DKKPLLFVKDDEAQYNAAESEIPAIQE 1359
            G +                     +P        + P +F K  E +   +    P  + 
Sbjct: 128  GSVGSDPVSPFAAPSPPPPRDQSHRPQLSSEEQPQSPPVFAKLQEMKDATSSPPPPPTES 187

Query: 1358 K-----PL-------------PNDVINVISGAGRGKPMKSPAPQSEKPKAENRHIRQRQQ 1233
            K     PL             PN  I   SGAGRGKP    AP  ++   ENRHIR+ Q 
Sbjct: 188  KSGQTAPLNNIFNGLGSEFSQPNQRIVPGSGAGRGKPFVESAPLQQE---ENRHIRRPQP 244

Query: 1232 PKXXXXXXXXXXXXXXXXPREQ----------LSQEEKVKKAKEILSRGEPVXXXXXXXX 1083
            P                  R Q          LS EE  ++A+  LSRGE          
Sbjct: 245  PPPQQQQQRSQPQPQHQQKRVQPPKDEAPRPKLSIEEAGRRARSQLSRGEAEGGGLRGRG 304

Query: 1082 XXXXXXXXXXXXXXXXXXXXXXXXGDDRYEESDDEASGLYLGDPADEEKMAQKLGPEVMN 903
                                      +  EE++ EA   ++GD AD EK A K+GPE+M 
Sbjct: 305  GGRGRGRGARGRGRGRGGEGWRDVKME--EEAEQEAISTFVGDSADGEKFANKMGPEIMK 362

Query: 902  KLAEGFEEMSSRVLPSPLDDAYLDAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDA 723
             LA+G+E++  R LPS  +DA LDAY TNLMIECEPEYLM  FG+NPDIDEKPP+ LR+ 
Sbjct: 363  MLADGYEDICERALPSTANDAVLDAYETNLMIECEPEYLMPAFGSNPDIDEKPPMSLREC 422

Query: 722  LEKMKPFLMAYEGIQSQXXXXXXXXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVA 543
            LEK+KPF++AYEGI+ Q        E M + PLIKEIVDHYSGPDRVTAK+Q EEL+R+A
Sbjct: 423  LEKVKPFIVAYEGIKDQEEWEEAIDEVMAQAPLIKEIVDHYSGPDRVTAKKQNEELDRIA 482

Query: 542  KTLPASAPDPVKRFTERAVLSLQSNPGWGFDKKCQFMDKLVMEVSQQYK 396
             T+P SAPD VKRF +RA LSL+SNPGWGFDKK QFMDKLV EVSQ YK
Sbjct: 483  TTVPKSAPDSVKRFADRAALSLKSNPGWGFDKKYQFMDKLVAEVSQSYK 531


>ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca
            subsp. vesca]
          Length = 464

 Score =  301 bits (771), Expect = 6e-79
 Identities = 150/206 (72%), Positives = 171/206 (83%), Gaps = 3/206 (1%)
 Frame = -1

Query: 1004 DRYEESDDE---ASGLYLGDPADEEKMAQKLGPEVMNKLAEGFEEMSSRVLPSPLDDAYL 834
            DR    D++   ASGLYLGD AD EK+A+KLGPEVMN+L E FE+MS+ VLPSPLDDAY+
Sbjct: 259  DRRRRGDEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYV 318

Query: 833  DAYHTNLMIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQXXXXXX 654
            DA  TN  IE EPEYLM EF  NPDIDE+PPIPLRDALEKMKPFLMAYEGIQSQ      
Sbjct: 319  DALDTNCKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEA 378

Query: 653  XXETMKKVPLIKEIVDHYSGPDRVTAKQQQEELERVAKTLPASAPDPVKRFTERAVLSLQ 474
              ETM++VPL+K+IVDHYSGPDRVTAK+Q+EELERVAKTLPA+ PD VK+FT+RAVLSLQ
Sbjct: 379  IKETMERVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQ 438

Query: 473  SNPGWGFDKKCQFMDKLVMEVSQQYK 396
             NPGWGF +KCQFMDKL  +VS+ YK
Sbjct: 439  GNPGWGFHRKCQFMDKLTQKVSKHYK 464


Top