BLASTX nr result

ID: Mentha24_contig00015096 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00015096
         (1216 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus...   325   3e-86
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   278   4e-72
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]       278   4e-72
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   276   1e-71
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   274   6e-71
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...   267   6e-69
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot...   267   8e-69
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   262   2e-67
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...   262   3e-67
ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas...   261   3e-67
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...   261   3e-67
ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun...   254   7e-65
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...   253   1e-64
emb|CBI17195.3| unnamed protein product [Vitis vinifera]              248   5e-63
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300...   244   7e-62
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...   244   7e-62
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...   238   5e-60
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...   238   5e-60
ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp....   238   5e-60
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...   238   5e-60

>gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus]
          Length = 493

 Score =  325 bits (832), Expect = 3e-86
 Identities = 190/373 (50%), Positives = 233/373 (62%), Gaps = 41/373 (10%)
 Frame = +1

Query: 34   NESKFDLQPPKPDVKMPFRF---GEAQLGWSESETPPPKEKALLTGILGVISGAGRGKPT 204
            +ES  +  PPKP+VK+PF F    E Q   +ESE P  +E  L + I+ V+SGAGRGKP 
Sbjct: 123  SESPSEKPPPKPNVKLPFLFVKDEEEQADAAESEVPSAQETLLRSDIVSVLSGAGRGKPG 182

Query: 205  KP-YAPHPEKTRQTDGREPSQSP---------NKDTAVRE--QLSQEEKVRKAKEILSKX 348
            KP  A  PEK  Q++ R   Q P         + D A     QLS+EE V+KAKEILSK 
Sbjct: 183  KPPTAAQPEKP-QSENRHIRQRPPQGKPPVAVSSDGAAPPAVQLSKEEMVKKAKEILSKG 241

Query: 349  XXXXXXXXXXXXXXXXXXXXXX----------------------ETKDYNRYEGRDDQS- 459
                                                          +  +RYE  DD+S 
Sbjct: 242  DEDGGVSRPEVRDNRDNRDNRGGGRGGRGERGRGRGRGRGRGRGRGRGDDRYEESDDESD 301

Query: 460  ---IGDGAADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEP 630
               IGD  AD EK+A++LGP++M ++ EG++EM+S+VLP P  +A ++A+E N+ +ECEP
Sbjct: 302  ALFIGD-PADEEKVAQKLGPDVMAQLAEGIDEMSSRVLPSPFDDAYMDAFETNLRIECEP 360

Query: 631  EYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKK 810
            EY MEEFGTNPDIDEK PIPLRDALEKMKPFLM YEGI+           TMK VPL+K+
Sbjct: 361  EYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMVYEGIKDQEEWEKIIEETMKDVPLIKE 420

Query: 811  IIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHF 990
            I+D   GPDR TAKQQ  ELERVAKTLPASAPASVKRFT+RA+LSLQSNPGWGFDKKC F
Sbjct: 421  IVDHYSGPDRVTAKQQNEELERVAKTLPASAPASVKRFTERALLSLQSNPGWGFDKKCQF 480

Query: 991  MDKLVTEVEQHYK 1029
            MDK++ EV Q+YK
Sbjct: 481  MDKVIMEVSQNYK 493


>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  278 bits (710), Expect = 4e-72
 Identities = 158/334 (47%), Positives = 202/334 (60%), Gaps = 29/334 (8%)
 Frame = +1

Query: 115  SESETPPPKEKA-LLTGILGVISGAGRGKPTKPYAPHPEKTRQTDGR-EPSQSPNKDTAV 288
            S S+ P P++ + L + ++ V++GAGRGKP +  +P  EK ++ +    P Q    D+  
Sbjct: 147  SSSDAPTPRDDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKVADSGE 206

Query: 289  R------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXX-----------E 417
            R      ++LS+E+ V+KA  ILS+                                   
Sbjct: 207  RASSPPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGR 266

Query: 418  TKDYNRYEGRDDQSIGDGA----------ADREKLAKRLGPEIMEKIVEGLEEMASKVLP 567
             +   R  GR D+  GDG+          AD EKLA++LGPE M  + EG EEM+++VLP
Sbjct: 267  GRGRGRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLP 326

Query: 568  DPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQ 747
             P  +A +EA   N+ +ECEPEY M +F +NPDIDE  PIPLRDALEKMKPFLM+YEGI+
Sbjct: 327  SPMDDAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIK 386

Query: 748  SHXXXXXXXXXTMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFT 927
                       TM+ VPL+K+I+D   GPDR TAKQQ  ELERVAKTLP SAP SVKRFT
Sbjct: 387  DQEEWEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFT 446

Query: 928  DRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 1029
            +RAVLSLQSNPGWGFDKKC FMDK+V E  QHYK
Sbjct: 447  ERAVLSLQSNPGWGFDKKCQFMDKVVMEASQHYK 480


>gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]
          Length = 426

 Score =  278 bits (710), Expect = 4e-72
 Identities = 158/318 (49%), Positives = 195/318 (61%), Gaps = 18/318 (5%)
 Frame = +1

Query: 130  PPPKEKALLTGILGVISGAGRGKPTKPYAPHPEKTRQTDG----REPSQSPNKDTAVREQ 297
            PPP++ A L  IL  +SG GRG P KP    P+  + T      R+P   P+   +  +Q
Sbjct: 114  PPPRDTAALDDILTNLSGMGRGTPGKP---PPQTLKPTPINRHIRQPQPRPSTALSPDQQ 170

Query: 298  LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYNRYEGRDDQSIGDGA- 474
            LS+EEK++KA EILS+                             R+ GR      D A 
Sbjct: 171  LSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRG--GRFSGRGRGREADAAI 228

Query: 475  -------------ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNIT 615
                         AD +K+A++LG E+M KI EG+EEM+S+VLP    +A ++AY  N+ 
Sbjct: 229  ESDEELPGMFGDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHTNLL 288

Query: 616  LECEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRV 795
            LECEPEYFME+FGTNPDID+K PIPLR+A EKMKPFLM + GI++          TM+ V
Sbjct: 289  LECEPEYFMEDFGTNPDIDDKPPIPLREAFEKMKPFLMQHIGIETQEEWEQIIEETMESV 348

Query: 796  PLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFD 975
            P  KKIID   GPDR TA QQ GELERVA TLPA+APASVKRFT+RAVLSL+SNPGWGF 
Sbjct: 349  PRWKKIIDHYAGPDRVTALQQIGELERVAGTLPATAPASVKRFTERAVLSLKSNPGWGFK 408

Query: 976  KKCHFMDKLVTEVEQHYK 1029
            KKC FMDK+V EV Q YK
Sbjct: 409  KKCQFMDKVVMEVSQQYK 426


>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
            lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED:
            uncharacterized protein LOC101247662 isoform 2 [Solanum
            lycopersicum]
          Length = 473

 Score =  276 bits (707), Expect = 1e-71
 Identities = 159/330 (48%), Positives = 199/330 (60%), Gaps = 25/330 (7%)
 Frame = +1

Query: 115  SESETPPPKEKALL-TGILGVISGAGRGKPTKPYAPHPEKTRQTDGR-EPSQSPNKDTAV 288
            S S  P P++ + L + ++ V++GAGRGKP +  +   EK ++ +    P Q    D+  
Sbjct: 144  SSSNAPKPRDDSNLPSSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSGE 203

Query: 289  R------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYN------ 432
            R      ++LS+E+ V+KA  ILS+                         +         
Sbjct: 204  RASSPPPQRLSREDAVKKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGR 263

Query: 433  -RYEGRDDQSIGDGA----------ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHK 579
             R  GR D+  GDG           AD EKLA +LGPE M  + EG EEM+++VLP P  
Sbjct: 264  GRGRGRRDEERGDGNLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMD 323

Query: 580  EALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXX 759
            +A LEA   N+ +ECEPEY M +F +NPDIDE  PIPLRDALEKMKPFLM+YEGI+    
Sbjct: 324  DAYLEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEE 383

Query: 760  XXXXXXXTMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAV 939
                   TM+ VPL+K+I+D   GPDR TAKQQ  ELERVAKTLP SAP SVKRFT+RAV
Sbjct: 384  WEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAV 443

Query: 940  LSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 1029
            LSLQSNPGWGFDKKC FMDK+V EV QHYK
Sbjct: 444  LSLQSNPGWGFDKKCQFMDKVVMEVSQHYK 473


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  274 bits (700), Expect = 6e-71
 Identities = 167/369 (45%), Positives = 210/369 (56%), Gaps = 37/369 (10%)
 Frame = +1

Query: 34   NES-KFDLQPPKPDVKMPFRFGEAQLGWSESETPPPKEKALLTGILGVISGAGRGKPT-- 204
            NES + D QP KP    P          S +++  P E  L + I+  + GAGRGK    
Sbjct: 48   NESPRPDAQPAKPRTCTPNE--------SATDSTQPSEPNLPSSIISTLPGAGRGKTAVT 99

Query: 205  ------------KPYAPHPEKTRQTDGR-----EPSQSPNKDT-AVREQLSQEEKVRKAK 330
                        +P  P  E+ R    R      P ++P  +T + + +LS+E+ V+ A 
Sbjct: 100  QQQQQQQQHQRQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAM 159

Query: 331  EILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYNRYEGR---------DDQS-------I 462
            ++LS+                         +   R +GR         DD+        +
Sbjct: 160  KVLSRGEEGEGEGISAGGPGRGRGMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYL 219

Query: 463  GDGAADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFM 642
            GD A D EKLA+++G E M  +VEG EEM+ +VLP P ++A ++A   N  +E EPEY M
Sbjct: 220  GDNA-DGEKLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLM 278

Query: 643  EEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKKIIDD 822
            EEFGTNPDIDEK PIPLRDALEKMKPFLM+YEGIQS           M+RVPLLK+I+D 
Sbjct: 279  EEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDH 338

Query: 823  RGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKL 1002
              GPDR TAKQQ  ELERVAKT+P SAPAS+KRF +RAVLSLQSNPGWGFDKKC FMDKL
Sbjct: 339  YSGPDRVTAKQQGEELERVAKTIPESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKL 398

Query: 1003 VTEVEQHYK 1029
              EV Q YK
Sbjct: 399  AWEVSQQYK 407


>ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera]
          Length = 482

 Score =  267 bits (683), Expect = 6e-69
 Identities = 166/339 (48%), Positives = 194/339 (57%), Gaps = 34/339 (10%)
 Frame = +1

Query: 115  SESETPPPKEKALLTGILGVISG-AGRGKPTKPYAPHPEKTRQTDGREPSQ----SPNKD 279
            S+  T PP+E  L   IL  +SG AGRG+P K   P P K      R+P Q    SP + 
Sbjct: 145  SQLGTTPPEENNLPVSILSALSGGAGRGQPLKQ-TPAPPKEENRHLRQPRQPVFRSPQQP 203

Query: 280  TAVREQ--LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYN------- 432
             A   Q  LS+EE V+KA  ILS+                         +          
Sbjct: 204  VAGPPQPRLSREEAVKKAVGILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWMGR 263

Query: 433  -RYEGRDDQSIGD------------GA-------ADREKLAKRLGPEIMEKIVEGLEEMA 552
             R  GR    +GD            GA       AD EKL+ ++G E M K+ E  EEM+
Sbjct: 264  GRGRGRGRGRMGDRRGRGGDAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMS 323

Query: 553  SKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMS 732
             +VLP P ++A L+A   N  +E EPEY MEEFGTNPDIDE  PIPLRDALEKMKPFLM 
Sbjct: 324  GRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQ 383

Query: 733  YEGIQSHXXXXXXXXXTMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPAS 912
            YEGIQS          TM+ VP LK+++D   GPDR TAK+Q  ELERVAKTLP +AP S
Sbjct: 384  YEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNS 443

Query: 913  VKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 1029
            VKRFTDRA+LSLQSNPGWGFDKKC FMDKLV EV QHYK
Sbjct: 444  VKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHYK 482


>ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|508784903|gb|EOY32159.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 474

 Score =  267 bits (682), Expect = 8e-69
 Identities = 155/314 (49%), Positives = 186/314 (59%), Gaps = 26/314 (8%)
 Frame = +1

Query: 166  LGVISGAGRGKPTKPYAPHPEKTRQTDGRE----PSQSPNKDTAVREQLSQEEKVRKAKE 333
            + V+SGAGRGKP K   P P   RQ + R       QSP+       Q+SQEE  +KA  
Sbjct: 169  VSVLSGAGRGKPVKQ--PEPASRRQEENRHIRVAQQQSPSA------QMSQEEATKKAMG 220

Query: 334  ILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYNR----------YEGRDDQSIGDGA--- 474
            ILS+                         +   R           +G D + + D     
Sbjct: 221  ILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGS 280

Query: 475  ---------ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECE 627
                     AD EK A+ +G + M K+VEG EEM S+VLP P  +A L+A   N ++E E
Sbjct: 281  ADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFE 340

Query: 628  PEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLK 807
            PEY MEEFGTNPDIDEK P+PLRDALEKMKPFLM+YEGIQS          TM+RVPLL+
Sbjct: 341  PEYLMEEFGTNPDIDEKPPMPLRDALEKMKPFLMAYEGIQSQEEWEEVIKETMERVPLLQ 400

Query: 808  KIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCH 987
            +I+D   GPDR TAK+Q  ELERVAKT+P  AP+SVK+F +RAVLSLQSNPGWGFDKKC 
Sbjct: 401  EIVDYYSGPDRVTAKKQQEELERVAKTIPERAPSSVKQFANRAVLSLQSNPGWGFDKKCQ 460

Query: 988  FMDKLVTEVEQHYK 1029
            FMDKLV EV Q YK
Sbjct: 461  FMDKLVWEVSQQYK 474


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
            gi|223537066|gb|EEF38701.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 436

 Score =  262 bits (670), Expect = 2e-67
 Identities = 153/317 (48%), Positives = 187/317 (58%), Gaps = 11/317 (3%)
 Frame = +1

Query: 109  GWSESETPPPKEKALLTGILGVISGAGRGKPTKPYAPHP---EKTRQTDGREPSQSPNKD 279
            G S   T    +  L + I   +SG GRG+P KP  P P   E+ R    R  ++   ++
Sbjct: 119  GPSRQPTESQSDSVLPSTIHSSLSGFGRGEPDKPVVPTPQVKEENRHIRDRSRAKPKTEE 178

Query: 280  TAVREQ--LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXX-ETKDYNRYEGRD 450
              VR +  +S+EE V++A  ILS+                          +   R     
Sbjct: 179  AEVRAKPKISREEAVKRAVSILSQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRMMDDV 238

Query: 451  DQSIGDGA-----ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNIT 615
            D+  G G      AD EKLA ++G E M K+VEG EEM+ +VLP P ++A L+A   N  
Sbjct: 239  DEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALHTNYM 298

Query: 616  LECEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRV 795
            +E EPEY M EF  NPDIDEK P+PLRD LEK+KPF+M+YEGIQS          TMK V
Sbjct: 299  IEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEETMKNV 358

Query: 796  PLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFD 975
            PL K+I+D   GPDR TAK+Q  ELERVA T+PASAPASVKRF DRAVLSLQSNPGWGFD
Sbjct: 359  PLFKEIVDYYSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNPGWGFD 418

Query: 976  KKCHFMDKLVTEVEQHY 1026
            KKC FMDKLV EV Q Y
Sbjct: 419  KKCQFMDKLVREVNQCY 435


>ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max]
            gi|571476117|ref|XP_006586864.1| PREDICTED: la-related
            protein 1 isoform X2 [Glycine max]
          Length = 481

 Score =  262 bits (669), Expect = 3e-67
 Identities = 168/366 (45%), Positives = 203/366 (55%), Gaps = 39/366 (10%)
 Frame = +1

Query: 49   DLQPPKPDVKMP--FRFGEAQLGWSESETPPPK-------EKALLTGILGVISGAGRGKP 201
            DLQPP    K P  F+  ++    + ++  PPK       +  L   I GV+SG GRGK 
Sbjct: 121  DLQPPDSGPKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDNKLPGSIPGVLSGLGRGKS 180

Query: 202  TKPYAPHPE--------KTRQTDGREPSQSPNKDTAVREQLSQEEKVRKAKEILSKXXXX 357
             K      +        +TRQ  G   S++  K + +    SQE+  R A +ILS     
Sbjct: 181  MKQPDLETQVTEENRHLRTRQAPGAASSETVPKRSPIP---SQEDATRNALKILSHGKDD 237

Query: 358  XXXXXXXXXXXXXXXXXXXETKDYNRYEGR-------------------DDQSIGDGA-- 474
                                 +   R  GR                   DD + G  A  
Sbjct: 238  GSDTGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDVDEKVMDTDDYATGLYAGD 297

Query: 475  -ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEF 651
             AD EKLA+++GPEIM ++ EG EEM S+VLP P ++  L+A + N  +E EPEY +E  
Sbjct: 298  DADGEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDALDINYAIEFEPEYLVEF- 356

Query: 652  GTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKKIIDDRGG 831
              NPDIDEK PI LRDALEK KPFLMSYEGIQS          TM RVPLLKKIID   G
Sbjct: 357  -DNPDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIMEETMARVPLLKKIIDHYSG 415

Query: 832  PDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTE 1011
            PDR TAK+Q  ELERVAKTLP S P+SVK+FT+RAV+SLQSNPGWGFDKKCHFMDKLV E
Sbjct: 416  PDRVTAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSNPGWGFDKKCHFMDKLVWE 475

Query: 1012 VEQHYK 1029
            V QHYK
Sbjct: 476  VSQHYK 481


>ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
            gi|561020640|gb|ESW19411.1| hypothetical protein
            PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score =  261 bits (668), Expect = 3e-67
 Identities = 166/364 (45%), Positives = 204/364 (56%), Gaps = 37/364 (10%)
 Frame = +1

Query: 49   DLQPPKPDVKMPFRFGEAQLG--WSESETPPPKEKA--LLTGILGVISGAGRGKPTKPYA 216
            DL PP    K P  F    +    +  + P   E+A  L   I+ V+SG GRGKP K   
Sbjct: 175  DLGPPDSGPKKPIFFKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLGRGKPMKQSD 234

Query: 217  PHPEKTRQTDGREPSQSPN------KDTAVREQL--SQEEKVRKAKEILSKXXXXXXXXX 372
            P   +TR T+     ++P        DT    Q   S+++ VR A+  LS+         
Sbjct: 235  P---ETRVTEENRHLRAPRARGAAASDTLYERQPIPSRDDAVRNARNFLSQGEDDVGGTG 291

Query: 373  XXXXXXXXXXXXXXETKDYNR--------YEGRDDQS-----------------IGDGAA 477
                            +   R        + GRD                    +GD A 
Sbjct: 292  RGRGFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDIGPYVGDDA- 350

Query: 478  DREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGT 657
            D EKLAK++GPEIM ++ EG EEMA +VLP P ++  L+A + N  +E EPEY +E    
Sbjct: 351  DGEKLAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPEYLVEF--D 408

Query: 658  NPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKKIIDDRGGPD 837
            NPDIDEK PIPLRDALEKMKPFLM+YEGIQS          TM +VPLLK+I+D   GPD
Sbjct: 409  NPDIDEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQVPLLKEIVDHYSGPD 468

Query: 838  RATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVE 1017
            R TAK+Q  ELERVAKTLP SAP+SVK+FT+RAV+SLQSNPGWGFDKKCHFMDKLV EV 
Sbjct: 469  RVTAKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFDKKCHFMDKLVWEVS 528

Query: 1018 QHYK 1029
            QHYK
Sbjct: 529  QHYK 532


>ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum]
          Length = 504

 Score =  261 bits (668), Expect = 3e-67
 Identities = 164/357 (45%), Positives = 203/357 (56%), Gaps = 28/357 (7%)
 Frame = +1

Query: 43   KFDLQPPK-PDVKMPFRFGEAQLGWSESETPPPKEKALLTGILGVISGAGRGKPTKPYAP 219
            K D+ PPK P       F    L  S+ E+    +      +L V+SGAGRGKP +P   
Sbjct: 158  KDDVSPPKKPVFTRREDFSPIDLS-SDQES----DNRFSMSVLKVLSGAGRGKPIEPAVS 212

Query: 220  HP---EKTRQTDGREPSQSPNKDTAVREQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXX 390
                 E+ R    R  S  P +    +  L+ +  ++ A++ LSK               
Sbjct: 213  ETQVVEENRHVRNRRASDVPMR----QPMLTGDGALQNARKYLSKFDGDGSGSGRGGEPR 268

Query: 391  XXXXXXXXETKDYNRYEGR----------DDQ--SIGDGA------------ADREKLAK 498
                      +   R  GR          DD+   I D A             D EKLAK
Sbjct: 269  ERGAFGRGRGRGRGRGRGRGRGGFRGTGGDDRFGQIQDNARSNASGLFLGDDVDGEKLAK 328

Query: 499  RLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEK 678
            ++GPE+M +  EG EEM S+VLP P ++  +EA++ N  +E EPEY ME F +NPDIDEK
Sbjct: 329  KVGPEVMNQFTEGFEEMISRVLPSPLEDEYVEAFDINCAIEFEPEYIME-FDSNPDIDEK 387

Query: 679  APIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKKIIDDRGGPDRATAKQQ 858
             PIPLRDALEKMKPFLM+YEGIQS          TM+RVPLLKKI+D   GPDR TAK+Q
Sbjct: 388  EPIPLRDALEKMKPFLMNYEGIQSQEEWEAIMEETMERVPLLKKIVDHYSGPDRVTAKKQ 447

Query: 859  CGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 1029
              ELERVAKTLPASAP+SV +FT+RAV+SLQSNPGWGFDKKC FMDKLV EV QH+K
Sbjct: 448  QEELERVAKTLPASAPSSVVQFTNRAVMSLQSNPGWGFDKKCQFMDKLVFEVSQHHK 504


>ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
            gi|462409156|gb|EMJ14490.1| hypothetical protein
            PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score =  254 bits (648), Expect = 7e-65
 Identities = 128/189 (67%), Positives = 145/189 (76%)
 Frame = +1

Query: 460  IGDGAADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYF 639
            +GD A D EKLAK+LGPEIM K+VE  EEM+S+VLP P  +A ++A   N  +ECEPEY 
Sbjct: 240  LGDNA-DGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAMHTNFMIECEPEYL 298

Query: 640  MEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKKIID 819
            M EF  NPDIDEK PI LRDALEKMKPFLM+YE I+S          TM+RVPLLK+I+D
Sbjct: 299  MGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNETMERVPLLKEIVD 358

Query: 820  DRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDK 999
               GPDR TAK+Q  ELERVAKTLPA  P SVKRFTDRAVLSLQSNPGWGFD+KC FMDK
Sbjct: 359  HYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNPGWGFDRKCQFMDK 418

Query: 1000 LVTEVEQHY 1026
            LV +V QHY
Sbjct: 419  LVAKVSQHY 427


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550322664|gb|EEF06007.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 466

 Score =  253 bits (646), Expect = 1e-64
 Identities = 150/334 (44%), Positives = 186/334 (55%), Gaps = 30/334 (8%)
 Frame = +1

Query: 118  ESETPPPKEKALLTGILGVISGAGRGKPTK---PYAPHPEKTRQTDGREPSQS------- 267
            ESE P   E  L   IL  + GAGRGKP K   P  P  E+ R    R   +S       
Sbjct: 133  ESEPPKKAEANLPPSILSGLGGAGRGKPVKQEVPIEPAKEENRHLRARSQPRSQPRTRQQ 192

Query: 268  --PNKDTAV--REQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXE---TKD 426
              P+ D AV    ++ ++E V+KA E+LS+                            + 
Sbjct: 193  KTPDGDDAVPATTKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARG 252

Query: 427  YNRYEGRDDQSIGDGAA-------------DREKLAKRLGPEIMEKIVEGLEEMASKVLP 567
              R  GR  +  GD                D EK A+ +G E M  +VE  EEM+ +VLP
Sbjct: 253  GGRGRGRGRRGYGDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLP 312

Query: 568  DPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQ 747
             P ++  ++A++ N + E EPEY M EF  NPDIDEK P+PLRDALEK+KPF+M+Y GI+
Sbjct: 313  CPIEDEYVDAFDTNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIK 372

Query: 748  SHXXXXXXXXXTMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFT 927
            +H         TMK  PL+KKI+D   GPDR + K+Q  ELERVAKT+PASAP SVK F 
Sbjct: 373  THEEWEEIVEETMKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFA 432

Query: 928  DRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 1029
            DRAVLSLQSNPGWGFDKKC FMDKL  EV QHYK
Sbjct: 433  DRAVLSLQSNPGWGFDKKCMFMDKLAKEVSQHYK 466


>emb|CBI17195.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score =  248 bits (632), Expect = 5e-63
 Identities = 125/190 (65%), Positives = 143/190 (75%)
 Frame = +1

Query: 460  IGDGAADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYF 639
            +GD A D EKL+ ++G E M K+ E  EEM+ +VLP P ++A L+A   N  +E EPEY 
Sbjct: 21   LGDNA-DAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTNCLIEFEPEYL 79

Query: 640  MEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKKIID 819
            MEEFGTNPDIDE  PIPLRDALEKMKPFLM YEGIQS          TM+ VP LK+++D
Sbjct: 80   MEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETMENVPYLKELVD 139

Query: 820  DRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDK 999
               GPDR TAK+Q  ELERVAKTLP +AP SVKRFTDRA+LSLQSNPGWGFDKKC FMDK
Sbjct: 140  YYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWGFDKKCQFMDK 199

Query: 1000 LVTEVEQHYK 1029
            LV EV QHYK
Sbjct: 200  LVWEVSQHYK 209


>ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca
            subsp. vesca]
          Length = 464

 Score =  244 bits (622), Expect = 7e-62
 Identities = 123/204 (60%), Positives = 147/204 (72%), Gaps = 5/204 (2%)
 Frame = +1

Query: 433  RYEGRDDQSIGDGA-----ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEA 597
            R  G +D  I  G      AD EKLA++LGPE+M ++ E  E+M++ VLP P  +A ++A
Sbjct: 261  RRRGDEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYVDA 320

Query: 598  YEHNITLECEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXX 777
             + N  +E EPEY M EF  NPDIDE+ PIPLRDALEKMKPFLM+YEGIQS         
Sbjct: 321  LDTNCKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAIK 380

Query: 778  XTMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSN 957
             TM+RVPLLKKI+D   GPDR TAK+Q  ELERVAKTLPA+ P SVK+FTDRAVLSLQ N
Sbjct: 381  ETMERVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQGN 440

Query: 958  PGWGFDKKCHFMDKLVTEVEQHYK 1029
            PGWGF +KC FMDKL  +V +HYK
Sbjct: 441  PGWGFHRKCQFMDKLTQKVSKHYK 464


>ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus]
            gi|449502143|ref|XP_004161555.1| PREDICTED:
            uncharacterized protein LOC101224016 [Cucumis sativus]
          Length = 478

 Score =  244 bits (622), Expect = 7e-62
 Identities = 153/366 (41%), Positives = 188/366 (51%), Gaps = 33/366 (9%)
 Frame = +1

Query: 31   GNESKFDLQPPKPDV--KMPFRFGEAQLGWSESETP------PPKEKALLTGILGVISGA 186
            G+ S     PP+PD   K P  F +   G S + T          E+ L   +    SG 
Sbjct: 113  GDASPSIRSPPEPDSEPKKPVFFSKNNAGDSAASTSLGGLHRVSGERNLPESLHSEFSGV 172

Query: 187  GRGKPTKPYAPHPEKTRQTDGREPSQSPN--------KDTAVREQLSQEEKVRKAKEILS 342
            GRGKP K   P  +  ++     P Q  +        +      ++ + E  R    ++S
Sbjct: 173  GRGKPMKQPVPEDQPKQENRHLRPRQEGDGPGAGERGRGRGFEPRIGRGEPWRNTNRMVS 232

Query: 343  KXXXXXXXXXXXXXXXXXXXXXXX--------ETKDYNRYEGRDDQSIGDGAA------- 477
            K                                 +   R E R      DG A       
Sbjct: 233  KDGPDGEVGGGRGTSGYRGRGARGPYRRGARGSFRTGERRERRSGHDKEDGYAAGLYLGN 292

Query: 478  --DREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEF 651
              D E+LAKR+G E M K+VEG EEM+ +VLP P  +  L+  + N  +ECEPEY M +F
Sbjct: 293  NEDGERLAKRIGTENMNKLVEGFEEMSGRVLPSPLVDQYLDGMDTNFMIECEPEYLMGDF 352

Query: 652  GTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKKIIDDRGG 831
              NPDIDE  PIPLRDALEKMKPFLM+YE IQSH         TM+ VPLLK+I+D  GG
Sbjct: 353  ENNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLLKEIVDAYGG 412

Query: 832  PDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTE 1011
            PDR TAK+Q GELERVAKTLP SAP SVK+FT+R VLSLQSNPGWGFDKK   MDKLV  
Sbjct: 413  PDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRVVLSLQSNPGWGFDKKWQLMDKLVEG 472

Query: 1012 VEQHYK 1029
              + YK
Sbjct: 473  FSKRYK 478


>gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain.
            ESTs gb|H37317, gb|F14415, gb|AA651290 come from this
            gene [Arabidopsis thaliana]
          Length = 829

 Score =  238 bits (606), Expect = 5e-60
 Identities = 136/313 (43%), Positives = 178/313 (56%), Gaps = 29/313 (9%)
 Frame = +1

Query: 178  SGAGRGKPTKPYAPHPEKTRQTDGREPSQSPN-----------------KDTAVREQLSQ 306
            SGAGRGKP    AP     RQ D R+  + P                  KD   + QLS 
Sbjct: 521  SGAGRGKPLVESAP----IRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSA 576

Query: 307  EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYNRYEG-RDDQSIGDG---- 471
            EE  R+A+  LS+                               +G RDD+   +G    
Sbjct: 577  EEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEA 636

Query: 472  -------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEP 630
                   +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +ECEP
Sbjct: 637  MRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEP 696

Query: 631  EYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKK 810
            EY M +FG+NPDIDEK P+ LR+ LEK+KPF+++YEGI+            M + PL+K+
Sbjct: 697  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKE 756

Query: 811  IIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHF 990
            I+D   GPDR TAK+Q  EL+R+A TLPASAP SVKRF DRA L+L+SNPGWGFDKK  F
Sbjct: 757  IVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQF 816

Query: 991  MDKLVTEVEQHYK 1029
            MDKLV EV Q YK
Sbjct: 817  MDKLVLEVSQSYK 829


>ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown
            protein; 43598-45751 [Arabidopsis thaliana]
            gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
            [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1|
            At1g53640/F22G10.8 [Arabidopsis thaliana]
            gi|110740318|dbj|BAF02054.1| hypothetical protein
            [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis thaliana]
          Length = 523

 Score =  238 bits (606), Expect = 5e-60
 Identities = 136/313 (43%), Positives = 178/313 (56%), Gaps = 29/313 (9%)
 Frame = +1

Query: 178  SGAGRGKPTKPYAPHPEKTRQTDGREPSQSPN-----------------KDTAVREQLSQ 306
            SGAGRGKP    AP     RQ D R+  + P                  KD   + QLS 
Sbjct: 215  SGAGRGKPLVESAP----IRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSA 270

Query: 307  EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYNRYEG-RDDQSIGDG---- 471
            EE  R+A+  LS+                               +G RDD+   +G    
Sbjct: 271  EEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEA 330

Query: 472  -------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEP 630
                   +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +ECEP
Sbjct: 331  MRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEP 390

Query: 631  EYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKK 810
            EY M +FG+NPDIDEK P+ LR+ LEK+KPF+++YEGI+            M + PL+K+
Sbjct: 391  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKE 450

Query: 811  IIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHF 990
            I+D   GPDR TAK+Q  EL+R+A TLPASAP SVKRF DRA L+L+SNPGWGFDKK  F
Sbjct: 451  IVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQF 510

Query: 991  MDKLVTEVEQHYK 1029
            MDKLV EV Q YK
Sbjct: 511  MDKLVLEVSQSYK 523


>ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297340299|gb|EFH70716.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 769

 Score =  238 bits (606), Expect = 5e-60
 Identities = 135/316 (42%), Positives = 182/316 (57%), Gaps = 33/316 (10%)
 Frame = +1

Query: 181  GAGRGKPT---------------KPYAPHPEKTRQTDGREPSQ--SPN-KDTAVREQLSQ 306
            GAGRGKP                +P  P P + +Q    +P Q  +P  KD A + QLS+
Sbjct: 459  GAGRGKPLVESAPIQQEDNRQIRRPQPPPPPQQQQQQRAQPQQKRAPTVKDEAPKPQLSR 518

Query: 307  EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYNRYEG----RDDQSIGDG- 471
            EE  R+A+  LS+                         +   R  G    RDD+   +G 
Sbjct: 519  EEAGRRARSELSRGEAEGGGVRGRGGRGRGRG-----ARGRGRGRGGDGWRDDKKEEEGE 573

Query: 472  ----------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLE 621
                      +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +E
Sbjct: 574  QEAMSIFAGDSADGEKFAQKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIE 633

Query: 622  CEPEYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPL 801
            CEPEY M +FG+NPDIDEK P+ LR+ LEK+KPF+++YEGI+            M + PL
Sbjct: 634  CEPEYIMADFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAVNEAMAQAPL 693

Query: 802  LKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKK 981
            +K+I+D   GPDR TAK+Q  EL+ +A T+PASAP SVKRF DRA L+L+SNPGWGFDKK
Sbjct: 694  MKEIVDHYSGPDRVTAKKQNEELDSIATTIPASAPDSVKRFADRAALTLKSNPGWGFDKK 753

Query: 982  CHFMDKLVTEVEQHYK 1029
              FMDKLV EV Q YK
Sbjct: 754  YQFMDKLVLEVSQSYK 769


>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain
            [Arabidopsis thaliana]
          Length = 523

 Score =  238 bits (606), Expect = 5e-60
 Identities = 136/313 (43%), Positives = 178/313 (56%), Gaps = 29/313 (9%)
 Frame = +1

Query: 178  SGAGRGKPTKPYAPHPEKTRQTDGREPSQSPN-----------------KDTAVREQLSQ 306
            SGAGRGKP    AP     RQ D R+  + P                  KD   + QLS 
Sbjct: 215  SGAGRGKPLVESAP----IRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSA 270

Query: 307  EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXXETKDYNRYEG-RDDQSIGDG---- 471
            EE  R+A+  LS+                               +G RDD+   +G    
Sbjct: 271  EEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEA 330

Query: 472  -------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEP 630
                   +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +ECEP
Sbjct: 331  MRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEP 390

Query: 631  EYFMEEFGTNPDIDEKAPIPLRDALEKMKPFLMSYEGIQSHXXXXXXXXXTMKRVPLLKK 810
            EY M +FG+NPDIDEK P+ LR+ LEK+KPF+++YEGI+            M + PL+K+
Sbjct: 391  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKE 450

Query: 811  IIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHF 990
            I+D   GPDR TAK+Q  EL+R+A TLPASAP SVKRF DRA L+L+SNPGWGFDKK  F
Sbjct: 451  IVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQF 510

Query: 991  MDKLVTEVEQHYK 1029
            MDKLV EV Q YK
Sbjct: 511  MDKLVLEVSQSYK 523

