BLASTX nr result

ID: Mentha22_contig00011093

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00011093
         (1373 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus...   318   5e-84
gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]       275   3e-71
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   273   1e-70
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...   272   3e-70
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   268   5e-69
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot...   265   4e-68
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...   263   2e-67
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   260   9e-67
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...   256   1e-65
ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phas...   254   8e-65
ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [...   251   4e-64
ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus tr...   251   5e-64
ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prun...   249   2e-63
emb|CBI17195.3| unnamed protein product [Vitis vinifera]              243   1e-61
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300...   239   2e-60
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...   238   4e-60
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...   238   6e-60
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...   238   6e-60
ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp....   238   6e-60
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...   238   6e-60

>gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Mimulus guttatus]
          Length = 493

 Score =  318 bits (814), Expect = 5e-84
 Identities = 187/365 (51%), Positives = 229/365 (62%), Gaps = 41/365 (11%)
 Frame = -3

Query: 1371 PPKPDVKMPFRF---GEAQLGWSESETPPPKEKALLTGILGVISGAGRGKPTKP-YAPHP 1204
            PPKP+VK+PF F    E Q   +ESE P  +E  L + I+ V+SGAGRGKP KP  A  P
Sbjct: 131  PPKPNVKLPFLFVKDEEEQADAAESEVPSAQETLLRSDIVSVLSGAGRGKPGKPPTAAQP 190

Query: 1203 EKTRQTDGREPSQSP---------NKDTAVRE--QLSQEEKVRKAKEILSKXXXXXXXXX 1057
            EK  Q++ R   Q P         + D A     QLS+EE V+KAKEILSK         
Sbjct: 191  EKP-QSENRHIRQRPPQGKPPVAVSSDGAAPPAVQLSKEEMVKKAKEILSKGDEDGGVSR 249

Query: 1056 XXXXXXXXXXXXXR----------------------ETKDYNRYEGRDDQS----IGDGA 955
                                                  +  +RYE  DD+S    IGD  
Sbjct: 250  PEVRDNRDNRDNRGGGRGGRGERGRGRGRGRGRGRGRGRGDDRYEESDDESDALFIGD-P 308

Query: 954  ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEFG 775
            AD EK+A++LGP++M ++ EG++EM+S+VLP P  +A ++A+E N+ +ECEPEY MEEFG
Sbjct: 309  ADEEKVAQKLGPDVMAQLAEGIDEMSSRVLPSPFDDAYMDAFETNLRIECEPEYLMEEFG 368

Query: 774  TNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKKIIDDRGGP 595
            TNPDIDEK P+PLR ALEKMKPFLM YEGI+          ETMK VPL+K+I+D   GP
Sbjct: 369  TNPDIDEKPPIPLRDALEKMKPFLMVYEGIKDQEEWEKIIEETMKDVPLIKEIVDHYSGP 428

Query: 594  DRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEV 415
            DR TAKQQ  ELERVAKTLPASAPASVKRFT+RA+LSLQSNPGWGFDKKC FMDK++ EV
Sbjct: 429  DRVTAKQQNEELERVAKTLPASAPASVKRFTERALLSLQSNPGWGFDKKCQFMDKVIMEV 488

Query: 414  EQHYK 400
             Q+YK
Sbjct: 489  SQNYK 493


>gb|EPS65553.1| hypothetical protein M569_09226 [Genlisea aurea]
          Length = 426

 Score =  275 bits (703), Expect = 3e-71
 Identities = 158/318 (49%), Positives = 195/318 (61%), Gaps = 18/318 (5%)
 Frame = -3

Query: 1299 PPPKEKALLTGILGVISGAGRGKPTKPYAPHPEKTRQTDG----REPSQSPNKDTAVREQ 1132
            PPP++ A L  IL  +SG GRG P KP    P+  + T      R+P   P+   +  +Q
Sbjct: 114  PPPRDTAALDDILTNLSGMGRGTPGKP---PPQTLKPTPINRHIRQPQPRPSTALSPDQQ 170

Query: 1131 LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYNRYEGRDDQSIGDGA- 955
            LS+EEK++KA EILS+                             R+ GR      D A 
Sbjct: 171  LSKEEKLKKAVEILSRGDPDRGPIRSPTGRGRGRGRGRGGRG--GRFSGRGRGREADAAI 228

Query: 954  -------------ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNIT 814
                         AD +K+A++LG E+M KI EG+EEM+S+VLP    +A ++AY  N+ 
Sbjct: 229  ESDEELPGMFGDPADEQKVAEKLGVEVMNKITEGMEEMSSRVLPSLIDDAYVDAYHTNLL 288

Query: 813  LECEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRV 634
            LECEPEYFME+FGTNPDID+K P+PLR A EKMKPFLM + GI++         ETM+ V
Sbjct: 289  LECEPEYFMEDFGTNPDIDDKPPIPLREAFEKMKPFLMQHIGIETQEEWEQIIEETMESV 348

Query: 633  PLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFD 454
            P  KKIID   GPDR TA QQ GELERVA TLPA+APASVKRFT+RAVLSL+SNPGWGF 
Sbjct: 349  PRWKKIIDHYAGPDRVTALQQIGELERVAGTLPATAPASVKRFTERAVLSLKSNPGWGFK 408

Query: 453  KKCHFMDKLVTEVEQHYK 400
            KKC FMDK+V EV Q YK
Sbjct: 409  KKCQFMDKVVMEVSQQYK 426


>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  273 bits (698), Expect = 1e-70
 Identities = 158/334 (47%), Positives = 203/334 (60%), Gaps = 29/334 (8%)
 Frame = -3

Query: 1314 SESETPPPKEKA-LLTGILGVISGAGRGKPTKPYAPHPEKTRQTDGR-EPSQSPNKDTAV 1141
            S S+ P P++ + L + ++ V++GAGRGKP +  +P  EK ++ +    P Q    D+  
Sbjct: 147  SSSDAPTPRDDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKVADSGE 206

Query: 1140 R------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXR-----------E 1012
            R      ++LS+E+ V+KA  ILS+                      R            
Sbjct: 207  RASSPPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGR 266

Query: 1011 TKDYNRYEGRDDQSIGDGA----------ADREKLAKRLGPEIMEKIVEGLEEMASKVLP 862
             +   R  GR D+  GDG+          AD EKLA++LGPE M  + EG EEM+++VLP
Sbjct: 267  GRGRGRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLP 326

Query: 861  DPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQ 682
             P  +A +EA   N+ +ECEPEY M +F +NPDIDE  P+PLR ALEKMKPFLM+YEGI+
Sbjct: 327  SPMDDAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIK 386

Query: 681  SHXXXXXXXXETMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFT 502
                      ETM+ VPL+K+I+D   GPDR TAKQQ  ELERVAKTLP SAP SVKRFT
Sbjct: 387  DQEEWEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFT 446

Query: 501  DRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 400
            +RAVLSLQSNPGWGFDKKC FMDK+V E  QHYK
Sbjct: 447  ERAVLSLQSNPGWGFDKKCQFMDKVVMEASQHYK 480


>ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum
            lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED:
            uncharacterized protein LOC101247662 isoform 2 [Solanum
            lycopersicum]
          Length = 473

 Score =  272 bits (695), Expect = 3e-70
 Identities = 158/330 (47%), Positives = 199/330 (60%), Gaps = 25/330 (7%)
 Frame = -3

Query: 1314 SESETPPPKEKALL-TGILGVISGAGRGKPTKPYAPHPEKTRQTDGR-EPSQSPNKDTAV 1141
            S S  P P++ + L + ++ V++GAGRGKP +  +   EK ++ +    P Q    D+  
Sbjct: 144  SSSNAPKPRDDSNLPSSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKVADSGE 203

Query: 1140 R------EQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYN------ 997
            R      ++LS+E+ V+KA  ILS+                         +         
Sbjct: 204  RASSPPPQRLSREDAVKKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGR 263

Query: 996  -RYEGRDDQSIGDGA----------ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHK 850
             R  GR D+  GDG           AD EKLA +LGPE M  + EG EEM+++VLP P  
Sbjct: 264  GRGRGRRDEERGDGNLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMD 323

Query: 849  EALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXX 670
            +A LEA   N+ +ECEPEY M +F +NPDIDE  P+PLR ALEKMKPFLM+YEGI+    
Sbjct: 324  DAYLEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEE 383

Query: 669  XXXXXXETMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAV 490
                  ETM+ VPL+K+I+D   GPDR TAKQQ  ELERVAKTLP SAP SVKRFT+RAV
Sbjct: 384  WEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAV 443

Query: 489  LSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 400
            LSLQSNPGWGFDKKC FMDK+V EV QHYK
Sbjct: 444  LSLQSNPGWGFDKKCQFMDKVVMEVSQHYK 473


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  268 bits (684), Expect = 5e-69
 Identities = 161/359 (44%), Positives = 207/359 (57%), Gaps = 36/359 (10%)
 Frame = -3

Query: 1368 PKPDVKMPFRFGEAQLGWSESETPPPKEKALLTGILGVISGAGRGKPT------------ 1225
            P+PD + P +        S +++  P E  L + I+  + GAGRGK              
Sbjct: 51   PRPDAQ-PAKPRTCTPNESATDSTQPSEPNLPSSIISTLPGAGRGKTAVTQQQQQQQQHQ 109

Query: 1224 --KPYAPHPEKTRQTDGR-----EPSQSPNKDT-AVREQLSQEEKVRKAKEILSKXXXXX 1069
              +P  P  E+ R    R      P ++P  +T + + +LS+E+ V+ A ++LS+     
Sbjct: 110  RQQPGPPPQEENRHIRARLQPQPRPEKAPAAETGSAQPKLSKEDAVKMAMKVLSRGEEGE 169

Query: 1068 XXXXXXXXXXXXXXXXXRETKDYNRYEGR---------DDQS-------IGDGAADREKL 937
                                +   R +GR         DD+        +GD A D EKL
Sbjct: 170  GEGISAGGPGRGRGMGRGRGRGRGRGQGRGRMRRQEMEDDEDGRFGGLYLGDNA-DGEKL 228

Query: 936  AKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGTNPDID 757
            A+++G E M  +VEG EEM+ +VLP P ++A ++A   N  +E EPEY MEEFGTNPDID
Sbjct: 229  AEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDID 288

Query: 756  EKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKKIIDDRGGPDRATAK 577
            EK P+PLR ALEKMKPFLM+YEGIQS         E M+RVPLLK+I+D   GPDR TAK
Sbjct: 289  EKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVTAK 348

Query: 576  QQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 400
            QQ  ELERVAKT+P SAPAS+KRF +RAVLSLQSNPGWGFDKKC FMDKL  EV Q YK
Sbjct: 349  QQGEELERVAKTIPESAPASIKRFANRAVLSLQSNPGWGFDKKCQFMDKLAWEVSQQYK 407


>ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|508784903|gb|EOY32159.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 474

 Score =  265 bits (677), Expect = 4e-68
 Identities = 156/314 (49%), Positives = 186/314 (59%), Gaps = 26/314 (8%)
 Frame = -3

Query: 1263 LGVISGAGRGKPTKPYAPHPEKTRQTDGRE----PSQSPNKDTAVREQLSQEEKVRKAKE 1096
            + V+SGAGRGKP K   P P   RQ + R       QSP+       Q+SQEE  +KA  
Sbjct: 169  VSVLSGAGRGKPVKQ--PEPASRRQEENRHIRVAQQQSPSA------QMSQEEATKKAMG 220

Query: 1095 ILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYNR----------YEGRDDQSIGDGA--- 955
            ILS+                         +   R           +G D + + D     
Sbjct: 221  ILSRRSESGESGMVGRGGRASMGMGGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGS 280

Query: 954  ---------ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECE 802
                     AD EK A+ +G + M K+VEG EEM S+VLP P  +A L+A   N ++E E
Sbjct: 281  ADGLYLGDNADGEKFAQTIGADNMNKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFE 340

Query: 801  PEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLK 622
            PEY MEEFGTNPDIDEK PMPLR ALEKMKPFLM+YEGIQS         ETM+RVPLL+
Sbjct: 341  PEYLMEEFGTNPDIDEKPPMPLRDALEKMKPFLMAYEGIQSQEEWEEVIKETMERVPLLQ 400

Query: 621  KIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCH 442
            +I+D   GPDR TAK+Q  ELERVAKT+P  AP+SVK+F +RAVLSLQSNPGWGFDKKC 
Sbjct: 401  EIVDYYSGPDRVTAKKQQEELERVAKTIPERAPSSVKQFANRAVLSLQSNPGWGFDKKCQ 460

Query: 441  FMDKLVTEVEQHYK 400
            FMDKLV EV Q YK
Sbjct: 461  FMDKLVWEVSQQYK 474


>ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera]
          Length = 482

 Score =  263 bits (671), Expect = 2e-67
 Identities = 165/339 (48%), Positives = 194/339 (57%), Gaps = 34/339 (10%)
 Frame = -3

Query: 1314 SESETPPPKEKALLTGILGVISG-AGRGKPTKPYAPHPEKTRQTDGREPSQ----SPNKD 1150
            S+  T PP+E  L   IL  +SG AGRG+P K   P P K      R+P Q    SP + 
Sbjct: 145  SQLGTTPPEENNLPVSILSALSGGAGRGQPLKQ-TPAPPKEENRHLRQPRQPVFRSPQQP 203

Query: 1149 TAVREQ--LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYN------- 997
             A   Q  LS+EE V+KA  ILS+                         +          
Sbjct: 204  VAGPPQPRLSREEAVKKAVGILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWMGR 263

Query: 996  -RYEGRDDQSIGD------------GA-------ADREKLAKRLGPEIMEKIVEGLEEMA 877
             R  GR    +GD            GA       AD EKL+ ++G E M K+ E  EEM+
Sbjct: 264  GRGRGRGRGRMGDRRGRGGDAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMS 323

Query: 876  SKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMS 697
             +VLP P ++A L+A   N  +E EPEY MEEFGTNPDIDE  P+PLR ALEKMKPFLM 
Sbjct: 324  GRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQ 383

Query: 696  YEGIQSHXXXXXXXXETMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPAS 517
            YEGIQS         ETM+ VP LK+++D   GPDR TAK+Q  ELERVAKTLP +AP S
Sbjct: 384  YEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNS 443

Query: 516  VKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 400
            VKRFTDRA+LSLQSNPGWGFDKKC FMDKLV EV QHYK
Sbjct: 444  VKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHYK 482


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
            gi|223537066|gb|EEF38701.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 436

 Score =  260 bits (665), Expect = 9e-67
 Identities = 155/317 (48%), Positives = 188/317 (59%), Gaps = 11/317 (3%)
 Frame = -3

Query: 1320 GWSESETPPPKEKALLTGILGVISGAGRGKPTKPYAPHP---EKTRQTDGREPSQSPNKD 1150
            G S   T    +  L + I   +SG GRG+P KP  P P   E+ R    R  ++   ++
Sbjct: 119  GPSRQPTESQSDSVLPSTIHSSLSGFGRGEPDKPVVPTPQVKEENRHIRDRSRAKPKTEE 178

Query: 1149 TAVREQ--LSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXR-ETKDYNRYEGRD 979
              VR +  +S+EE V++A  ILS+                      R   +   R     
Sbjct: 179  AEVRAKPKISREEAVKRAVSILSQGDTGEGMGRGRGGGRGRGRGRGRGRLEQRGRMMDDV 238

Query: 978  DQSIGDGA-----ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNIT 814
            D+  G G      AD EKLA ++G E M K+VEG EEM+ +VLP P ++A L+A   N  
Sbjct: 239  DEGFGSGLFLGDNADGEKLAGKIGVENMNKLVEGYEEMSGRVLPSPMEDAYLDALHTNYM 298

Query: 813  LECEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRV 634
            +E EPEY M EF  NPDIDEK PMPLR  LEK+KPF+M+YEGIQS         ETMK V
Sbjct: 299  IEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEKVKPFIMAYEGIQSQEEWEAAVEETMKNV 358

Query: 633  PLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFD 454
            PL K+I+D   GPDR TAK+Q  ELERVA T+PASAPASVKRF DRAVLSLQSNPGWGFD
Sbjct: 359  PLFKEIVDYYSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNPGWGFD 418

Query: 453  KKCHFMDKLVTEVEQHY 403
            KKC FMDKLV EV Q Y
Sbjct: 419  KKCQFMDKLVREVNQCY 435


>ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum]
          Length = 504

 Score =  256 bits (655), Expect = 1e-65
 Identities = 153/316 (48%), Positives = 189/316 (59%), Gaps = 27/316 (8%)
 Frame = -3

Query: 1266 ILGVISGAGRGKPTKPYAPHP---EKTRQTDGREPSQSPNKDTAVREQLSQEEKVRKAKE 1096
            +L V+SGAGRGKP +P        E+ R    R  S  P +    +  L+ +  ++ A++
Sbjct: 194  VLKVLSGAGRGKPIEPAVSETQVVEENRHVRNRRASDVPMR----QPMLTGDGALQNARK 249

Query: 1095 ILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYNRYEGR----------DDQ--SIGDGA- 955
             LSK                         +   R  GR          DD+   I D A 
Sbjct: 250  YLSKFDGDGSGSGRGGEPRERGAFGRGRGRGRGRGRGRGRGGFRGTGGDDRFGQIQDNAR 309

Query: 954  -----------ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLE 808
                        D EKLAK++GPE+M +  EG EEM S+VLP P ++  +EA++ N  +E
Sbjct: 310  SNASGLFLGDDVDGEKLAKKVGPEVMNQFTEGFEEMISRVLPSPLEDEYVEAFDINCAIE 369

Query: 807  CEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPL 628
             EPEY ME F +NPDIDEK P+PLR ALEKMKPFLM+YEGIQS         ETM+RVPL
Sbjct: 370  FEPEYIME-FDSNPDIDEKEPIPLRDALEKMKPFLMNYEGIQSQEEWEAIMEETMERVPL 428

Query: 627  LKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKK 448
            LKKI+D   GPDR TAK+Q  ELERVAKTLPASAP+SV +FT+RAV+SLQSNPGWGFDKK
Sbjct: 429  LKKIVDHYSGPDRVTAKKQQEELERVAKTLPASAPSSVVQFTNRAVMSLQSNPGWGFDKK 488

Query: 447  CHFMDKLVTEVEQHYK 400
            C FMDKLV EV QH+K
Sbjct: 489  CQFMDKLVFEVSQHHK 504


>ref|XP_007147417.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris]
            gi|561020640|gb|ESW19411.1| hypothetical protein
            PHAVU_006G122700g [Phaseolus vulgaris]
          Length = 532

 Score =  254 bits (648), Expect = 8e-65
 Identities = 163/361 (45%), Positives = 202/361 (55%), Gaps = 37/361 (10%)
 Frame = -3

Query: 1371 PPKPDVKMPFRFGEAQLG--WSESETPPPKEKA--LLTGILGVISGAGRGKPTKPYAPHP 1204
            PP    K P  F    +    +  + P   E+A  L   I+ V+SG GRGKP K   P  
Sbjct: 178  PPDSGPKKPIFFKREDIASPTTRDDFPIDVEQANKLPGNIIEVLSGLGRGKPMKQSDP-- 235

Query: 1203 EKTRQTDGREPSQSPN------KDTAVREQL--SQEEKVRKAKEILSKXXXXXXXXXXXX 1048
             +TR T+     ++P        DT    Q   S+++ VR A+  LS+            
Sbjct: 236  -ETRVTEENRHLRAPRARGAAASDTLYERQPIPSRDDAVRNARNFLSQGEDDVGGTGRGR 294

Query: 1047 XXXXXXXXXXRETKDYNR--------YEGRDDQS-----------------IGDGAADRE 943
                         +   R        + GRD                    +GD A D E
Sbjct: 295  GFRERGGLGRGRGRGRGRGRGTGRGGFRGRDMDERRGRFMDAEASDDIGPYVGDDA-DGE 353

Query: 942  KLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGTNPD 763
            KLAK++GPEIM ++ EG EEMA +VLP P ++  L+A + N  +E EPEY +E    NPD
Sbjct: 354  KLAKKVGPEIMNQLTEGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPEYLVEF--DNPD 411

Query: 762  IDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKKIIDDRGGPDRAT 583
            IDEK P+PLR ALEKMKPFLM+YEGIQS         ETM +VPLLK+I+D   GPDR T
Sbjct: 412  IDEKEPIPLRDALEKMKPFLMAYEGIQSQEEWEEIMEETMAQVPLLKEIVDHYSGPDRVT 471

Query: 582  AKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHY 403
            AK+Q  ELERVAKTLP SAP+SVK+FT+RAV+SLQSNPGWGFDKKCHFMDKLV EV QHY
Sbjct: 472  AKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFDKKCHFMDKLVWEVSQHY 531

Query: 402  K 400
            K
Sbjct: 532  K 532


>ref|XP_006586863.1| PREDICTED: la-related protein 1 isoform X1 [Glycine max]
            gi|571476117|ref|XP_006586864.1| PREDICTED: la-related
            protein 1 isoform X2 [Glycine max]
          Length = 481

 Score =  251 bits (642), Expect = 4e-64
 Identities = 164/363 (45%), Positives = 200/363 (55%), Gaps = 39/363 (10%)
 Frame = -3

Query: 1371 PPKPDVKMP--FRFGEAQLGWSESETPPPK-------EKALLTGILGVISGAGRGKPTKP 1219
            PP    K P  F+  ++    + ++  PPK       +  L   I GV+SG GRGK  K 
Sbjct: 124  PPDSGPKKPIFFKREDSVSPTASNDFLPPKRSVDHAHDNKLPGSIPGVLSGLGRGKSMKQ 183

Query: 1218 YAPHPE--------KTRQTDGREPSQSPNKDTAVREQLSQEEKVRKAKEILSKXXXXXXX 1063
                 +        +TRQ  G   S++  K + +    SQE+  R A +ILS        
Sbjct: 184  PDLETQVTEENRHLRTRQAPGAASSETVPKRSPIP---SQEDATRNALKILSHGKDDGSD 240

Query: 1062 XXXXXXXXXXXXXXXRETKDYNRYEGR-------------------DDQSIGDGA---AD 949
                              +   R  GR                   DD + G  A   AD
Sbjct: 241  TGRGREYGGRGGLDRGRGRGRGRGRGRGMGRGRFVERDVDEKVMDTDDYATGLYAGDDAD 300

Query: 948  REKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGTN 769
             EKLA+++GPEIM ++ EG EEM S+VLP P ++  L+A + N  +E EPEY +E    N
Sbjct: 301  GEKLARKVGPEIMNQLTEGFEEMTSRVLPSPLEDEFLDALDINYAIEFEPEYLVEF--DN 358

Query: 768  PDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKKIIDDRGGPDR 589
            PDIDEK P+ LR ALEK KPFLMSYEGIQS         ETM RVPLLKKIID   GPDR
Sbjct: 359  PDIDEKEPISLRDALEKAKPFLMSYEGIQSQEEWEEIMEETMARVPLLKKIIDHYSGPDR 418

Query: 588  ATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQ 409
             TAK+Q  ELERVAKTLP S P+SVK+FT+RAV+SLQSNPGWGFDKKCHFMDKLV EV Q
Sbjct: 419  VTAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSNPGWGFDKKCHFMDKLVWEVSQ 478

Query: 408  HYK 400
            HYK
Sbjct: 479  HYK 481


>ref|XP_002321880.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550322664|gb|EEF06007.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 466

 Score =  251 bits (641), Expect = 5e-64
 Identities = 151/334 (45%), Positives = 186/334 (55%), Gaps = 30/334 (8%)
 Frame = -3

Query: 1311 ESETPPPKEKALLTGILGVISGAGRGKPTK---PYAPHPEKTRQTDGREPSQS------- 1162
            ESE P   E  L   IL  + GAGRGKP K   P  P  E+ R    R   +S       
Sbjct: 133  ESEPPKKAEANLPPSILSGLGGAGRGKPVKQEVPIEPAKEENRHLRARSQPRSQPRTRQQ 192

Query: 1161 --PNKDTAV--REQLSQEEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRE---TKD 1003
              P+ D AV    ++ ++E V+KA E+LS+                            + 
Sbjct: 193  KTPDGDDAVPATTKMGRQEAVKKAMELLSRGGGEGEVGGRGGGRGSFVPGRGGGRGGARG 252

Query: 1002 YNRYEGRDDQSIGDGAA-------------DREKLAKRLGPEIMEKIVEGLEEMASKVLP 862
              R  GR  +  GD                D EK A+ +G E M  +VE  EEM+ +VLP
Sbjct: 253  GGRGRGRGRRGYGDKEVEYGSGMSLEGHEEDEEKFAQSVGVETMNTLVEAFEEMSGRVLP 312

Query: 861  DPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQ 682
             P ++  ++A++ N + E EPEY M EF  NPDIDEK PMPLR ALEK+KPF+M+Y GI+
Sbjct: 313  CPIEDEYVDAFDTNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEKVKPFMMAYMGIK 372

Query: 681  SHXXXXXXXXETMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFT 502
            +H        ETMK  PL+KKI+D   GPDR + K+Q  ELERVAKT+PASAP SVK F 
Sbjct: 373  THEEWEEIVEETMKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFA 432

Query: 501  DRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 400
            DRAVLSLQSNPGWGFDKKC FMDKL  EV QHYK
Sbjct: 433  DRAVLSLQSNPGWGFDKKCMFMDKLAKEVSQHYK 466


>ref|XP_007213291.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica]
           gi|462409156|gb|EMJ14490.1| hypothetical protein
           PRUPE_ppa006080mg [Prunus persica]
          Length = 428

 Score =  249 bits (636), Expect = 2e-63
 Identities = 127/189 (67%), Positives = 145/189 (76%)
 Frame = -3

Query: 969 IGDGAADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYF 790
           +GD A D EKLAK+LGPEIM K+VE  EEM+S+VLP P  +A ++A   N  +ECEPEY 
Sbjct: 240 LGDNA-DGEKLAKKLGPEIMNKLVERFEEMSSEVLPSPLDDAYVDAMHTNFMIECEPEYL 298

Query: 789 MEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKKIID 610
           M EF  NPDIDEK P+ LR ALEKMKPFLM+YE I+S         ETM+RVPLLK+I+D
Sbjct: 299 MGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIESQEEWEEVVNETMERVPLLKEIVD 358

Query: 609 DRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDK 430
              GPDR TAK+Q  ELERVAKTLPA  P SVKRFTDRAVLSLQSNPGWGFD+KC FMDK
Sbjct: 359 HYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNPGWGFDRKCQFMDK 418

Query: 429 LVTEVEQHY 403
           LV +V QHY
Sbjct: 419 LVAKVSQHY 427


>emb|CBI17195.3| unnamed protein product [Vitis vinifera]
          Length = 209

 Score =  243 bits (620), Expect = 1e-61
 Identities = 124/190 (65%), Positives = 143/190 (75%)
 Frame = -3

Query: 969 IGDGAADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYF 790
           +GD A D EKL+ ++G E M K+ E  EEM+ +VLP P ++A L+A   N  +E EPEY 
Sbjct: 21  LGDNA-DAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTNCLIEFEPEYL 79

Query: 789 MEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKKIID 610
           MEEFGTNPDIDE  P+PLR ALEKMKPFLM YEGIQS         ETM+ VP LK+++D
Sbjct: 80  MEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETMENVPYLKELVD 139

Query: 609 DRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDK 430
              GPDR TAK+Q  ELERVAKTLP +AP SVKRFTDRA+LSLQSNPGWGFDKKC FMDK
Sbjct: 140 YYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWGFDKKCQFMDK 199

Query: 429 LVTEVEQHYK 400
           LV EV QHYK
Sbjct: 200 LVWEVSQHYK 209


>ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca
           subsp. vesca]
          Length = 464

 Score =  239 bits (610), Expect = 2e-60
 Identities = 122/204 (59%), Positives = 147/204 (72%), Gaps = 5/204 (2%)
 Frame = -3

Query: 996 RYEGRDDQSIGDGA-----ADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEA 832
           R  G +D  I  G      AD EKLA++LGPE+M ++ E  E+M++ VLP P  +A ++A
Sbjct: 261 RRRGDEDGGIASGLYLGDNADGEKLAEKLGPEVMNQLTEAFEDMSTHVLPSPLDDAYVDA 320

Query: 831 YEHNITLECEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXX 652
            + N  +E EPEY M EF  NPDIDE+ P+PLR ALEKMKPFLM+YEGIQS         
Sbjct: 321 LDTNCKIEFEPEYLMGEFNQNPDIDEEPPIPLRDALEKMKPFLMAYEGIQSQEEWEEAIK 380

Query: 651 ETMKRVPLLKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSN 472
           ETM+RVPLLKKI+D   GPDR TAK+Q  ELERVAKTLPA+ P SVK+FTDRAVLSLQ N
Sbjct: 381 ETMERVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQGN 440

Query: 471 PGWGFDKKCHFMDKLVTEVEQHYK 400
           PGWGF +KC FMDKL  +V +HYK
Sbjct: 441 PGWGFHRKCQFMDKLTQKVSKHYK 464


>ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus]
            gi|449502143|ref|XP_004161555.1| PREDICTED:
            uncharacterized protein LOC101224016 [Cucumis sativus]
          Length = 478

 Score =  238 bits (608), Expect = 4e-60
 Identities = 150/357 (42%), Positives = 185/357 (51%), Gaps = 33/357 (9%)
 Frame = -3

Query: 1371 PPKPDV--KMPFRFGEAQLGWSESETP------PPKEKALLTGILGVISGAGRGKPTKPY 1216
            PP+PD   K P  F +   G S + T          E+ L   +    SG GRGKP K  
Sbjct: 122  PPEPDSEPKKPVFFSKNNAGDSAASTSLGGLHRVSGERNLPESLHSEFSGVGRGKPMKQP 181

Query: 1215 APHPEKTRQTDGREPSQSPN--------KDTAVREQLSQEEKVRKAKEILSKXXXXXXXX 1060
             P  +  ++     P Q  +        +      ++ + E  R    ++SK        
Sbjct: 182  VPEDQPKQENRHLRPRQEGDGPGAGERGRGRGFEPRIGRGEPWRNTNRMVSKDGPDGEVG 241

Query: 1059 XXXXXXXXXXXXXXR--------ETKDYNRYEGRDDQSIGDGAA---------DREKLAK 931
                                     +   R E R      DG A         D E+LAK
Sbjct: 242  GGRGTSGYRGRGARGPYRRGARGSFRTGERRERRSGHDKEDGYAAGLYLGNNEDGERLAK 301

Query: 930  RLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEPEYFMEEFGTNPDIDEK 751
            R+G E M K+VEG EEM+ +VLP P  +  L+  + N  +ECEPEY M +F  NPDIDE 
Sbjct: 302  RIGTENMNKLVEGFEEMSGRVLPSPLVDQYLDGMDTNFMIECEPEYLMGDFENNPDIDEN 361

Query: 750  APMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKKIIDDRGGPDRATAKQQ 571
             P+PLR ALEKMKPFLM+YE IQSH        ETM+ VPLLK+I+D  GGPDR TAK+Q
Sbjct: 362  PPIPLRDALEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLLKEIVDAYGGPDRVTAKEQ 421

Query: 570  CGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHFMDKLVTEVEQHYK 400
             GELERVAKTLP SAP SVK+FT+R VLSLQSNPGWGFDKK   MDKLV    + YK
Sbjct: 422  QGELERVAKTLPQSAPNSVKQFTNRVVLSLQSNPGWGFDKKWQLMDKLVEGFSKRYK 478


>gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain.
            ESTs gb|H37317, gb|F14415, gb|AA651290 come from this
            gene [Arabidopsis thaliana]
          Length = 829

 Score =  238 bits (606), Expect = 6e-60
 Identities = 139/313 (44%), Positives = 179/313 (57%), Gaps = 29/313 (9%)
 Frame = -3

Query: 1251 SGAGRGKPTKPYAPHPEKTRQTDGREPSQSPN-----------------KDTAVREQLSQ 1123
            SGAGRGKP    AP     RQ D R+  + P                  KD   + QLS 
Sbjct: 521  SGAGRGKPLVESAP----IRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSA 576

Query: 1122 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYNRYEG-RDDQSIGDG---- 958
            EE  R+A+  LS+                      R        +G RDD+   +G    
Sbjct: 577  EEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEA 636

Query: 957  -------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEP 799
                   +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +ECEP
Sbjct: 637  MRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEP 696

Query: 798  EYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKK 619
            EY M +FG+NPDIDEK PM LR  LEK+KPF+++YEGI+          E M + PL+K+
Sbjct: 697  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKE 756

Query: 618  IIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHF 439
            I+D   GPDR TAK+Q  EL+R+A TLPASAP SVKRF DRA L+L+SNPGWGFDKK  F
Sbjct: 757  IVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQF 816

Query: 438  MDKLVTEVEQHYK 400
            MDKLV EV Q YK
Sbjct: 817  MDKLVLEVSQSYK 829


>ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown
            protein; 43598-45751 [Arabidopsis thaliana]
            gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8
            [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1|
            At1g53640/F22G10.8 [Arabidopsis thaliana]
            gi|110740318|dbj|BAF02054.1| hypothetical protein
            [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis thaliana]
          Length = 523

 Score =  238 bits (606), Expect = 6e-60
 Identities = 139/313 (44%), Positives = 179/313 (57%), Gaps = 29/313 (9%)
 Frame = -3

Query: 1251 SGAGRGKPTKPYAPHPEKTRQTDGREPSQSPN-----------------KDTAVREQLSQ 1123
            SGAGRGKP    AP     RQ D R+  + P                  KD   + QLS 
Sbjct: 215  SGAGRGKPLVESAP----IRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSA 270

Query: 1122 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYNRYEG-RDDQSIGDG---- 958
            EE  R+A+  LS+                      R        +G RDD+   +G    
Sbjct: 271  EEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEA 330

Query: 957  -------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEP 799
                   +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +ECEP
Sbjct: 331  MRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEP 390

Query: 798  EYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKK 619
            EY M +FG+NPDIDEK PM LR  LEK+KPF+++YEGI+          E M + PL+K+
Sbjct: 391  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKE 450

Query: 618  IIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHF 439
            I+D   GPDR TAK+Q  EL+R+A TLPASAP SVKRF DRA L+L+SNPGWGFDKK  F
Sbjct: 451  IVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQF 510

Query: 438  MDKLVTEVEQHYK 400
            MDKLV EV Q YK
Sbjct: 511  MDKLVLEVSQSYK 523


>ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297340299|gb|EFH70716.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 769

 Score =  238 bits (606), Expect = 6e-60
 Identities = 137/316 (43%), Positives = 182/316 (57%), Gaps = 33/316 (10%)
 Frame = -3

Query: 1248 GAGRGKPT---------------KPYAPHPEKTRQTDGREPSQ--SPN-KDTAVREQLSQ 1123
            GAGRGKP                +P  P P + +Q    +P Q  +P  KD A + QLS+
Sbjct: 459  GAGRGKPLVESAPIQQEDNRQIRRPQPPPPPQQQQQQRAQPQQKRAPTVKDEAPKPQLSR 518

Query: 1122 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYNRYEG----RDDQSIGDG- 958
            EE  R+A+  LS+                         +   R  G    RDD+   +G 
Sbjct: 519  EEAGRRARSELSRGEAEGGGVRGRGGRGRGRG-----ARGRGRGRGGDGWRDDKKEEEGE 573

Query: 957  ----------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLE 808
                      +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +E
Sbjct: 574  QEAMSIFAGDSADGEKFAQKMGPELMKTLAEGFEEVCEKALPSTTHDAIIDAYDTNLMIE 633

Query: 807  CEPEYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPL 628
            CEPEY M +FG+NPDIDEK PM LR  LEK+KPF+++YEGI+          E M + PL
Sbjct: 634  CEPEYIMADFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAVNEAMAQAPL 693

Query: 627  LKKIIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKK 448
            +K+I+D   GPDR TAK+Q  EL+ +A T+PASAP SVKRF DRA L+L+SNPGWGFDKK
Sbjct: 694  MKEIVDHYSGPDRVTAKKQNEELDSIATTIPASAPDSVKRFADRAALTLKSNPGWGFDKK 753

Query: 447  CHFMDKLVTEVEQHYK 400
              FMDKLV EV Q YK
Sbjct: 754  YQFMDKLVLEVSQSYK 769


>gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana
            gi|2129727 and contains RNA recognition PF|00076 domain
            [Arabidopsis thaliana]
          Length = 523

 Score =  238 bits (606), Expect = 6e-60
 Identities = 139/313 (44%), Positives = 179/313 (57%), Gaps = 29/313 (9%)
 Frame = -3

Query: 1251 SGAGRGKPTKPYAPHPEKTRQTDGREPSQSPN-----------------KDTAVREQLSQ 1123
            SGAGRGKP    AP     RQ D R+  + P                  KD   + QLS 
Sbjct: 215  SGAGRGKPLVESAP----IRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKPQLSA 270

Query: 1122 EEKVRKAKEILSKXXXXXXXXXXXXXXXXXXXXXXRETKDYNRYEG-RDDQSIGDG---- 958
            EE  R+A+  LS+                      R        +G RDD+   +G    
Sbjct: 271  EEAGRRARSELSRGEAEGSSVGGRGGRGRGRGRGARGRGRGRGGDGWRDDKKEEEGEQEA 330

Query: 957  -------AADREKLAKRLGPEIMEKIVEGLEEMASKVLPDPHKEALLEAYEHNITLECEP 799
                   +AD EK A+++GPE+M+ + EG EE+  K LP    +A+++AY+ N+ +ECEP
Sbjct: 331  MRIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEP 390

Query: 798  EYFMEEFGTNPDIDEKAPMPLRVALEKMKPFLMSYEGIQSHXXXXXXXXETMKRVPLLKK 619
            EY M +FG+NPDIDEK PM LR  LEK+KPF+++YEGI+          E M + PL+K+
Sbjct: 391  EYIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKE 450

Query: 618  IIDDRGGPDRATAKQQCGELERVAKTLPASAPASVKRFTDRAVLSLQSNPGWGFDKKCHF 439
            I+D   GPDR TAK+Q  EL+R+A TLPASAP SVKRF DRA L+L+SNPGWGFDKK  F
Sbjct: 451  IVDHYSGPDRVTAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQF 510

Query: 438  MDKLVTEVEQHYK 400
            MDKLV EV Q YK
Sbjct: 511  MDKLVLEVSQSYK 523

