BLASTX nr result

ID: Paeonia25_contig00000004 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00000004
         (1374 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241...   253   2e-64
ref|XP_007200556.1| hypothetical protein PRUPE_ppa009673mg [Prun...   250   9e-64
ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313...   242   2e-61
ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citr...   240   1e-60
ref|XP_007050920.1| LYR motif-containing protein 7 isoform 2 [Th...   231   7e-58
ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206...   230   1e-57
ref|XP_007050919.1| LYR motif-containing protein 7 isoform 1 [Th...   229   3e-57
gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis]     227   8e-57
ref|XP_002523082.1| conserved hypothetical protein [Ricinus comm...   223   1e-55
ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585...   211   5e-52
ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258...   211   6e-52
ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493...   202   4e-49
ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798...   198   4e-48
ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795...   197   1e-47
ref|XP_007144318.1| hypothetical protein PHAVU_007G146100g [Phas...   196   2e-47
ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Popu...   190   1e-45
ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago ...   189   2e-45
ref|XP_002301478.1| hypothetical protein POPTR_0002s20610g [Popu...   180   1e-42
ref|XP_004172491.1| PREDICTED: uncharacterized LOC101206482, par...   163   1e-37
ref|XP_006396451.1| hypothetical protein EUTSA_v10028893mg [Eutr...   157   1e-35

>ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241933 [Vitis vinifera]
          Length = 269

 Score =  253 bits (645), Expect = 2e-64
 Identities = 142/284 (50%), Positives = 164/284 (57%), Gaps = 8/284 (2%)
 Frame = +3

Query: 219  LDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYS-----NAFHTKRPCLSD 374
            + P+ SIE   FQLH+WRPF LPTTP + ++   H  NSKPYS     N  H+KRPCLSD
Sbjct: 1    MSPKTSIESCTFQLHSWRPFQLPTTP-KTLEPDSH--NSKPYSITTSSNGLHSKRPCLSD 57

Query: 375  RATSFPLDTLDMSKLSLFDDDRPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 554
            R TSFP+D LD+SKLSL +DD+P  +    +                             
Sbjct: 58   RKTSFPIDALDISKLSLLEDDKPASSAPRNRGNVRWIDRKRRRRGSRSVSGRSSDRSGTR 117

Query: 555  XYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXXXXXX 734
              CS GAS AYATCSDFPVAAGTDSSGELFVNGD+NW+SDVSEAK               
Sbjct: 118  RCCSVGASAAYATCSDFPVAAGTDSSGELFVNGDSNWSSDVSEAKNSRKDRDGGSGEKEN 177

Query: 735  XXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRLGGK 914
                          +  GNESGYGSEPGYRGDAE GYG          RLLFWG++LG  
Sbjct: 178  LGSGFGHIGIF---ETQGNESGYGSEPGYRGDAEFGYGDELDEEEDDARLLFWGEQLG-- 232

Query: 915  LVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
                   D D+NME+V EN FS+QKAHHRCRRKK D+RM   LR
Sbjct: 233  -------DNDTNMEMVGENTFSEQKAHHRCRRKKHDYRMIDALR 269


>ref|XP_007200556.1| hypothetical protein PRUPE_ppa009673mg [Prunus persica]
            gi|462395956|gb|EMJ01755.1| hypothetical protein
            PRUPE_ppa009673mg [Prunus persica]
          Length = 282

 Score =  250 bits (639), Expect = 9e-64
 Identities = 146/293 (49%), Positives = 164/293 (55%), Gaps = 12/293 (4%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHL--PTTPT-RPIDSSDHSRNSKPYSNA-----FH 350
            MSHK L+ RH I+   FQLH+WRPFHL   TTPT + +DS     N KPY+++      H
Sbjct: 1    MSHKALEHRHPIDSCAFQLHSWRPFHLHQQTTPTSKTLDSDPSLPNPKPYNSSSNGLVVH 60

Query: 351  TKRPCLSDRATSFPLDTLDMSKLSLFDDDRPIRAGSH-KQXXXXXXXXXXXXXXXXXXXX 527
            TKRPCLS+RATSF +D +DMS+L+L DDDR I  G H +                     
Sbjct: 61   TKRPCLSNRATSFSIDAIDMSRLTLVDDDRTISGGHHNRHGSFRFIAKKRRRHGSRSVSG 120

Query: 528  XXXXXXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXX 707
                       CS GAS AY TCSDFPVA GTDSSGELF NGDANWASDVSEA+      
Sbjct: 121  RSSDRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGDANWASDVSEAR--NSRK 178

Query: 708  XXXXXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLL 887
                                   D  GNESGYGSEPGYRGDAE GYG          RLL
Sbjct: 179  ERDGGGSGEKENLGIGFGPIGGFDVQGNESGYGSEPGYRGDAEFGYGDELDEEEEDTRLL 238

Query: 888  FWGDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            FW         GD+FGD DS ME+V EN F DQK+HHRCRRKK D RM   LR
Sbjct: 239  FW---------GDQFGDADSMMEIVGENTFVDQKSHHRCRRKKHDCRMVDTLR 282


>ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313678 [Fragaria vesca
            subsp. vesca]
          Length = 271

 Score =  242 bits (618), Expect = 2e-61
 Identities = 142/281 (50%), Positives = 159/281 (56%), Gaps = 5/281 (1%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCLSD 374
            MSHK LD R S++   FQLH+WRPF L   PT+ +DS     N KPY    HTKRPCLS+
Sbjct: 1    MSHKALDSRPSLDSCTFQLHSWRPFQLQQQPTKTLDSDP--ANPKPY----HTKRPCLSN 54

Query: 375  RATS-FPLDTLDMSKLSLFDDDRPIRAGSH-KQXXXXXXXXXXXXXXXXXXXXXXXXXXX 548
            RATS F +D +DMS+L+L DDDR I  G H K                            
Sbjct: 55   RATSSFSIDAIDMSRLTLVDDDRTISGGHHHKHGSFRFLARKRRRHGSRSVSGRSSDRSG 114

Query: 549  XXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXXXX 728
                CS GAS A+ TCSDFPVA GTDSSGELF NGDANWASDVSEA+             
Sbjct: 115  TRRCCSVGASAAHGTCSDFPVAIGTDSSGELFGNGDANWASDVSEAR-NLRKERDGVGSG 173

Query: 729  XXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRLG 908
                            DA GNESGYGSEPGYRGDAE GYG          RLLFW     
Sbjct: 174  EKETTPGVGFGPGGGFDAQGNESGYGSEPGYRGDAEFGYGDELDEEEEDARLLFW----- 228

Query: 909  GKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM 1031
                G+RFGD+D+ MEVV EN F+DQK+HHRCRRKK D RM
Sbjct: 229  ----GNRFGDSDTMMEVVGENTFTDQKSHHRCRRKKHDCRM 265


>ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citrus clementina]
            gi|568852594|ref|XP_006479957.1| PREDICTED:
            uncharacterized protein LOC102627953 [Citrus sinensis]
            gi|557546612|gb|ESR57590.1| hypothetical protein
            CICLE_v10021537mg [Citrus clementina]
          Length = 283

 Score =  240 bits (612), Expect = 1e-60
 Identities = 145/295 (49%), Positives = 164/295 (55%), Gaps = 13/295 (4%)
 Frame = +3

Query: 201  SMSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCLS 371
            S SHK LD RHSI+    QLHNWRPFHL      P+DSSD ++ S   S+  HTKRPCLS
Sbjct: 2    SHSHKPLDSRHSIDSCALQLHNWRPFHLQN----PLDSSDSTKPSYSPSSWVHTKRPCLS 57

Query: 372  DRATSFPL---DTLDMSKLSLFDDD---RPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXX 533
            DRATSF +     +D+SKLSLFDDD   +P+ A +  Q                      
Sbjct: 58   DRATSFSIIDAAAIDLSKLSLFDDDNVIKPMTAATAPQSRGGYRLIARKRRRRGSRSVSG 117

Query: 534  XXXXXXXXY--CSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXX 707
                       CS GAS AY TCSDFPVA GTDSSGELF NG+ANWASDVSEA+      
Sbjct: 118  RSSDRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGEANWASDVSEARNSRRER 177

Query: 708  XXXXXXXXXXXXXXXXXXXXXXXDA--LGNESGYGSEPGYRGDAELGYGXXXXXXXXXHR 881
                                   +A  LGNESGYGSEPGYRGDAE GYG          +
Sbjct: 178  DNGNGSGEKENSGTGFGGQVGCLEAQVLGNESGYGSEPGYRGDAEFGYGDELDEEEEDAK 237

Query: 882  LLFWGDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            LLFW         G+RFGD DS ME+V EN F+DQK+HHRCRRKK D RM   LR
Sbjct: 238  LLFW---------GNRFGDVDSKMEMVGENTFTDQKSHHRCRRKKHDCRMVDALR 283


>ref|XP_007050920.1| LYR motif-containing protein 7 isoform 2 [Theobroma cacao]
            gi|508703181|gb|EOX95077.1| LYR motif-containing protein
            7 isoform 2 [Theobroma cacao]
          Length = 270

 Score =  231 bits (588), Expect = 7e-58
 Identities = 142/292 (48%), Positives = 161/292 (55%), Gaps = 11/292 (3%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRN--SKPYSNAFHTKRPCL 368
            MSHK L+PRHSI+   FQLH+WRPF L  T    +DSSD  +    +  +N FH+KRPCL
Sbjct: 1    MSHKALEPRHSIDSCTFQLHSWRPFQLQQT----LDSSDPQQTPPKRASTNCFHSKRPCL 56

Query: 369  SDRATSFPLDTLDMSKLSLFDDDR-----PIRAGSHKQXXXXXXXXXXXXXXXXXXXXXX 533
            SDR TSF   ++D+SKL+L DDD      PI A + K+                      
Sbjct: 57   SDRTTSF---SIDLSKLTLLDDDNNSSYNPI-AANPKRGSFRLFARKRRRRGSRSVSGRS 112

Query: 534  XXXXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNG-DANWASDVSEAKXXXXXXX 710
                     CS GAS AY TCSDFPVA GTDSSGELF NG DA WASDVSEA+       
Sbjct: 113  SDRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGADAYWASDVSEARNSRRERG 172

Query: 711  XXXXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLF 890
                                  DA GNESGYGSEPGYRGD E GYG          RLLF
Sbjct: 173  DGGSGEKESLGGQFGGF-----DAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLF 227

Query: 891  WGDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            WG           FGDTDS ME+V EN FSDQKAHHRCRRKK D+RM  ++R
Sbjct: 228  WGHH---------FGDTDSKMEMVGENTFSDQKAHHRCRRKKHDYRMVDSVR 270


>ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206482 [Cucumis sativus]
          Length = 266

 Score =  230 bits (586), Expect = 1e-57
 Identities = 137/290 (47%), Positives = 154/290 (53%), Gaps = 9/290 (3%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSR----NSKPY--SNAFHTK 356
            MS + LD RHSI+    + H W PFHLP T    +DS  H+     NSKPY  S   HTK
Sbjct: 1    MSRRPLDSRHSIDSCTLKFHGWTPFHLPKT----LDSDPHNTSAPTNSKPYYSSTPLHTK 56

Query: 357  RPCLSDRATSFPLDTLDMSKLSLFDDDRPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXXX 536
            RPCLSDR TSF +D +DMS LSL DDD+P    +                          
Sbjct: 57   RPCLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARS---FRLIARKRRRRGSRSVSGRSS 113

Query: 537  XXXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXX 716
                    CS GAS A+ TCSDFP+A GTDSSGELFVNGDANW+SDVSEAK         
Sbjct: 114  DRSGTRRCCSVGASAAHGTCSDFPIAVGTDSSGELFVNGDANWSSDVSEAK------NSR 167

Query: 717  XXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWG 896
                                DA GNESGYGSEPGYRGD E GYG          RLL WG
Sbjct: 168  REREEKDHLGSGFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWG 227

Query: 897  DRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            +RLG           DS ME+V EN F+DQK+HHRCRRKK + RM   LR
Sbjct: 228  ERLG-----------DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR 266


>ref|XP_007050919.1| LYR motif-containing protein 7 isoform 1 [Theobroma cacao]
            gi|508703180|gb|EOX95076.1| LYR motif-containing protein
            7 isoform 1 [Theobroma cacao]
          Length = 271

 Score =  229 bits (583), Expect = 3e-57
 Identities = 141/292 (48%), Positives = 160/292 (54%), Gaps = 11/292 (3%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRN--SKPYSNAFHTKRPCL 368
            MSHK L+PRHSI+   FQLH+WRPF L  T    +DSSD  +    +  +N FH+KRPCL
Sbjct: 1    MSHKALEPRHSIDSCTFQLHSWRPFQLQQT----LDSSDPQQTPPKRASTNCFHSKRPCL 56

Query: 369  SDRATSFPLDTLDMSKLSLFDDDR-----PIRAGSHKQXXXXXXXXXXXXXXXXXXXXXX 533
            SDR TSF   ++D+SKL+L DDD      PI A + K+                      
Sbjct: 57   SDRTTSF---SIDLSKLTLLDDDNNSSYNPI-AANPKRGSFRLFARKRRRRGSRSVSGRS 112

Query: 534  XXXXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNG-DANWASDVSEAKXXXXXXX 710
                     CS GAS AY TCSDFPVA GTDSSGELF NG DA WASDVSEA+       
Sbjct: 113  SDRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGADAYWASDVSEARNSRRERG 172

Query: 711  XXXXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLF 890
                                  DA GNESGYGSEPGYRGD E GYG          RLLF
Sbjct: 173  DGGSGEKESLGGQFGGF-----DAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLF 227

Query: 891  WGDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            WG   G         DTDS ME+V EN FSDQKAHHRCRRKK D+RM  ++R
Sbjct: 228  WGHHFG--------ADTDSKMEMVGENTFSDQKAHHRCRRKKHDYRMVDSVR 271


>gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis]
          Length = 275

 Score =  227 bits (579), Expect = 8e-57
 Identities = 136/287 (47%), Positives = 159/287 (55%), Gaps = 6/287 (2%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTP-TRPIDSSDHSRNSKPYSNAFH-TKRPCL 368
            MS K LD RHSI+   FQLH+WRPF   +TP T+ +D++++ R+ +    A   TKRPCL
Sbjct: 1    MSPKLLDSRHSIDSCAFQLHSWRPFQQHSTPPTKTLDAANNPRHYRSNGGAHAITKRPCL 60

Query: 369  SDRATSFPLDTLDMSKLSLFDDDRPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXXXXXXX 548
            SDRATSFP+D +DMS+LSL DDD         +                           
Sbjct: 61   SDRATSFPIDAIDMSRLSLVDDDTARPHHHQYRGSLRLLARKRRRRGSRSVSGRSSDRSG 120

Query: 549  XXXYCSAGASGAYATCSDFPVAAGTDSSGELFVN-GDANWASDVSEAKXXXXXXXXXXXX 725
                CS GAS AY TCSDFPVA GTDSSGELF+N GDANW+SDVSEA+            
Sbjct: 121  TRRCCSVGASAAYGTCSDFPVAVGTDSSGELFLNTGDANWSSDVSEARNSRRERDGAGGG 180

Query: 726  XXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRL 905
                             D+ G ESGYGSEPGYRGDAE GYG          RLLFWG+R 
Sbjct: 181  SGEKESFGGVIGGF---DSQGAESGYGSEPGYRGDAEFGYGDEHDEEEDDARLLFWGNR- 236

Query: 906  GGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
                    F DTDS  E+V EN FSDQK HHRCRRKK D RM  ++R
Sbjct: 237  --------FEDTDSMTEIVGENTFSDQKVHHRCRRKKHDCRMVDSVR 275


>ref|XP_002523082.1| conserved hypothetical protein [Ricinus communis]
            gi|223537644|gb|EEF39267.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 261

 Score =  223 bits (569), Expect = 1e-55
 Identities = 133/288 (46%), Positives = 156/288 (54%), Gaps = 7/288 (2%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCLSD 374
            MSH++LD RHSI+   FQLH+WRPFHL T  + P          KPYS+   TKRPCLSD
Sbjct: 1    MSHRSLDSRHSIDSCTFQLHSWRPFHLQTLDSDP---------PKPYSST--TKRPCLSD 49

Query: 375  RATSFPLDTLDMSKLSLFDDDRPIRAGS----HKQXXXXXXXXXXXXXXXXXXXXXXXXX 542
            R TSFP+D++D+SKLS+ DDD+PI   +    + +                         
Sbjct: 50   RTTSFPIDSIDISKLSIIDDDKPISVSAATAYNSRGSLRLIARKRRRRGSRSVSGRSSDR 109

Query: 543  XXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXX 722
                  CS GA G   TCSDFPVA GTDSSGELF NGD+NW SDVSEAK           
Sbjct: 110  SGTRRCCSVGAHG---TCSDFPVAVGTDSSGELFGNGDSNWGSDVSEAK----NSIKREK 162

Query: 723  XXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDR 902
                              +  GNESGYGSEPGYRGDAE GY           +LLFWGD 
Sbjct: 163  DREREEKENMGYGQFGTFENQGNESGYGSEPGYRGDAEFGYEDEIDEEEDDAKLLFWGDH 222

Query: 903  LGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
             GG         T   ME+V EN+FSDQK+HHRCRRKK D RM  ++R
Sbjct: 223  FGG---------TGPKMEMVGENSFSDQKSHHRCRRKKHDNRMLDSVR 261


>ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585515 [Solanum tuberosum]
          Length = 269

 Score =  211 bits (538), Expect = 5e-52
 Identities = 121/279 (43%), Positives = 139/279 (49%), Gaps = 3/279 (1%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCLSD 374
            MS KTLD RH+IE   + LH+W+PF  PT  ++ +D       S       HTKR C +D
Sbjct: 1    MSPKTLDSRHAIESCTYHLHSWKPFQFPTPNSKTLDLDSPKTYSPSTHGGVHTKRQCRAD 60

Query: 375  RATSFPLDTLDMSKLSLFDDDRPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 554
            R TS P++ LDMSKLSLF++DRP+     ++                             
Sbjct: 61   RTTSIPIEALDMSKLSLFEEDRPLSV-HKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRR 119

Query: 555  XYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXXXXXX 734
              CS GAS AY TCSDFPVA GTDSSGELFVNGD +W  DVSE                 
Sbjct: 120  RCCSVGASAAYGTCSDFPVAVGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVGER 179

Query: 735  XXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRLGGK 914
                          + LGNESGYGSEPGYRGDAE GYG          RL FWGD  G  
Sbjct: 180  ESNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFGAL 239

Query: 915  LVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM 1031
                      S ME V EN    QK HHRCRR+KQD RM
Sbjct: 240  ----------SRMEKVGENTL--QKVHHRCRRRKQDCRM 266


>ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258757 [Solanum
            lycopersicum]
          Length = 269

 Score =  211 bits (537), Expect = 6e-52
 Identities = 120/279 (43%), Positives = 141/279 (50%), Gaps = 3/279 (1%)
 Frame = +3

Query: 204  MSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCLSD 374
            MS KTLD RH+IE   + LH+W+PF  P+  ++ +D       S       HTKR C +D
Sbjct: 1    MSPKTLDSRHAIESCTYHLHSWKPFQFPSPNSKTLDLDSPKTYSPSTHGGLHTKRQCRAD 60

Query: 375  RATSFPLDTLDMSKLSLFDDDRPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 554
            R TS P++ LDMSKLSLF++D+P+     ++                             
Sbjct: 61   RTTSIPIEALDMSKLSLFEEDKPLSV-HKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRR 119

Query: 555  XYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXXXXXX 734
              CS GAS AY TCSDFPVAAGTDSSGELFVNGD +W  DVSE                 
Sbjct: 120  RCCSVGASAAYGTCSDFPVAAGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVGER 179

Query: 735  XXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRLGGK 914
                          + LGNESGYGSEPGYRGDAE GYG          RL FWGD  G  
Sbjct: 180  ENNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFGAL 239

Query: 915  LVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM 1031
                      S ME V EN+   QK HHRCRR+KQD RM
Sbjct: 240  ----------SRMEKVGENSL--QKVHHRCRRRKQDCRM 266


>ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493408 [Cicer arietinum]
          Length = 261

 Score =  202 bits (513), Expect = 4e-49
 Identities = 134/287 (46%), Positives = 154/287 (53%), Gaps = 3/287 (1%)
 Frame = +3

Query: 195  SKSMSHK-TLDPRHSIEFQLHNWRPFH-LPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCL 368
            ++SMSHK T+D   S   QL  WRPFH L    T  +D S +   +KP  N    KRPCL
Sbjct: 2    TQSMSHKSTID---SCVLQLRTWRPFHHLHPQTTSSLDGSHNP--TKPSLN----KRPCL 52

Query: 369  SDRAT-SFPLDTLDMSKLSLFDDDRPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXXXXXX 545
            SDR T SF   +LD+SKL+L DDDRPI   ++ +                          
Sbjct: 53   SDRTTTSF---SLDLSKLTLADDDRPINNTANHRLIARKRRRRCSRSVSGRSSDRSATRR 109

Query: 546  XXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXXX 725
                 CS GAS AY TCSDFPVA GTDSSGELF NGDANW+SDVSEAK            
Sbjct: 110  C----CSVGASAAYGTCSDFPVAMGTDSSGELFGNGDANWSSDVSEAK----NSRDGGSG 161

Query: 726  XXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRL 905
                             +A GNESGYGSEPGYRGDAE GYG         HR+LFWG++L
Sbjct: 162  EKEKENVALGFGVNGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDHRVLFWGNQL 221

Query: 906  GGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            GG  V       DS ME+V EN   DQK+HHR RR+K D RM   LR
Sbjct: 222  GGAAV-------DSKMEMVGENTLLDQKSHHRLRRRKNDCRMIDALR 261


>ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798288 [Glycine max]
          Length = 260

 Score =  198 bits (504), Expect = 4e-48
 Identities = 130/289 (44%), Positives = 146/289 (50%), Gaps = 5/289 (1%)
 Frame = +3

Query: 195  SKSMSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPC 365
            ++SMSHK LD RHSI+    QL +W+PF L      P          KPY    + KRPC
Sbjct: 2    TQSMSHKPLDSRHSIDSCLLQLRSWKPFKLQQDGPHP----------KPY----YHKRPC 47

Query: 366  LSDRAT-SFPLDTLDMSKLSLFDDDRPIR-AGSHKQXXXXXXXXXXXXXXXXXXXXXXXX 539
            LSDR T SF   +LDMSKL+L  DD  I    +++                         
Sbjct: 48   LSDRTTTSF---SLDMSKLTLAADDDTIHNPNNNRATNYRLVARKRRRRGSRSLSGRSSD 104

Query: 540  XXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXX 719
                   CS GAS AY TCSDFPVA GTDSSGELF NGD NW+SDVSEAK          
Sbjct: 105  RSGTRRCCSVGASAAYGTCSDFPVAMGTDSSGELFGNGDPNWSSDVSEAK----NSRRER 160

Query: 720  XXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGD 899
                               DA GNESGYGSEPGYRGDAE GYG          RLLFWGD
Sbjct: 161  ERDGEKENVGVGFGVSGCSDANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGD 220

Query: 900  RLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            +LG           DS  E+V EN   DQK+HHRCRR+K D RM   LR
Sbjct: 221  QLGA---------VDSKREMVGENTLLDQKSHHRCRRRKHDCRMVDALR 260


>ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795813 [Glycine max]
          Length = 260

 Score =  197 bits (500), Expect = 1e-47
 Identities = 128/288 (44%), Positives = 145/288 (50%), Gaps = 4/288 (1%)
 Frame = +3

Query: 195  SKSMSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPC 365
            ++SMSHK LD RH+ +    QL  W+PF L      P          KPY    + KRPC
Sbjct: 2    TQSMSHKPLDSRHTTDSCLLQLRTWKPFKLQQDGPHP----------KPY----YHKRPC 47

Query: 366  LSDRAT-SFPLDTLDMSKLSLFDDDRPIRAGSHKQXXXXXXXXXXXXXXXXXXXXXXXXX 542
            LSDR T SF   +LDMSKL+L DDD      +++                          
Sbjct: 48   LSDRTTTSF---SLDMSKLTLADDDN--HNPNNRATNYRLVARKRRRRGSRSVSGRSSDR 102

Query: 543  XXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXX 722
                  CS GAS AY TCSDFPVA GTDSSGELF NGD NW+SDVSEAK           
Sbjct: 103  SGTRRCCSVGASAAYGTCSDFPVAMGTDSSGELFGNGDPNWSSDVSEAKNSRRERERDGG 162

Query: 723  XXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDR 902
                              +A GNESGYGSEPGYRGDAE GYG          RLLFWGD+
Sbjct: 163  SGEKENLGVGFGVSGCS-EANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQ 221

Query: 903  LGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
            LG           DS ME+V EN   DQK+HHRCRR+K D RM   LR
Sbjct: 222  LGA---------VDSKMEMVGENTLLDQKSHHRCRRRKHDCRMVDALR 260


>ref|XP_007144318.1| hypothetical protein PHAVU_007G146100g [Phaseolus vulgaris]
            gi|561017508|gb|ESW16312.1| hypothetical protein
            PHAVU_007G146100g [Phaseolus vulgaris]
          Length = 261

 Score =  196 bits (498), Expect = 2e-47
 Identities = 130/291 (44%), Positives = 149/291 (51%), Gaps = 7/291 (2%)
 Frame = +3

Query: 195  SKSMSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPC 365
            ++SMSHK LD RHSI+    QL +W+PF L   P           + KPY    + KRPC
Sbjct: 2    TQSMSHKPLDSRHSIDSCMLQLRSWKPFKLQDGP-----------HPKPY----YYKRPC 46

Query: 366  LSDRAT-SFPLDTLDMSKLSLFD-DDRPIRAGS--HKQXXXXXXXXXXXXXXXXXXXXXX 533
            LSDRAT SF   +LD++KL+L D DD    A +  H+                       
Sbjct: 47   LSDRATTSF---SLDIAKLTLADADDTTTIANNPNHRATNYRLVARKRRRRGSRSVSGRS 103

Query: 534  XXXXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXX 713
                     CS GAS AY TCSDFPVA GTDSSGELF NGD NW+SDVSEAK        
Sbjct: 104  SDRSGTRRCCSVGASAAYGTCSDFPVAMGTDSSGELFGNGDPNWSSDVSEAK----NSRR 159

Query: 714  XXXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFW 893
                                 +A GNESGYGSEPGYRGDAE GYG          RLLFW
Sbjct: 160  ERERDGERENVGVGFGVSGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFW 219

Query: 894  GDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
                     GD+FG  DS  E+V EN   DQK+HHRCRR+K D RM   LR
Sbjct: 220  ---------GDQFGAVDSKREMVGENTLLDQKSHHRCRRRKHDCRMVDALR 261


>ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Populus trichocarpa]
            gi|550324059|gb|EEE99322.2| hypothetical protein
            POPTR_0014s12400g [Populus trichocarpa]
          Length = 279

 Score =  190 bits (483), Expect = 1e-45
 Identities = 133/301 (44%), Positives = 150/301 (49%), Gaps = 17/301 (5%)
 Frame = +3

Query: 195  SKSMSHKTLDPRHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPY-SNAFHTKRP 362
            S  MSH +   RHSI+    QLH+WRPF            SD   NSKPY S+    KRP
Sbjct: 16   SMVMSHNS---RHSIDSCTLQLHSWRPFL----------DSDPPTNSKPYASSRTLPKRP 62

Query: 363  CLSDRATSFP--LDTLDMSKLSLFDDD-----RPIRA------GSHKQXXXXXXXXXXXX 503
            CLSDRATSFP  +D++D+SKLSL  DD     +PI A        +K+            
Sbjct: 63   CLSDRATSFPSNIDSIDISKLSLLQDDDNNNNKPIPATPAVTNSPYKRGTLRLIERKRRR 122

Query: 504  XXXXXXXXXXXXXXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSE 683
                               CS GA  A+ TCSDFPVA GTDSSGELFVNGDANWASDVSE
Sbjct: 123  RGSRSVSGRSSDRSGTWRCCSVGA--AHGTCSDFPVAVGTDSSGELFVNGDANWASDVSE 180

Query: 684  AKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXX 863
            AK                               L +ESGYGSEPGYRGDAE GYG     
Sbjct: 181  AKNSIKEREEKENLLGVGSAFGN----------LDSESGYGSEPGYRGDAEFGYGDEVDE 230

Query: 864  XXXXHRLLFWGDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNL 1043
                 RLLFWG               DS ME+V EN F D K HHRCRR+K D+RM  +L
Sbjct: 231  EEDDARLLFWGHHF-----------QDSKMEMVGENTF-DPKTHHRCRRRKHDYRMVDSL 278

Query: 1044 R 1046
            R
Sbjct: 279  R 279


>ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago truncatula]
            gi|355480578|gb|AES61781.1| hypothetical protein
            MTR_1g088580 [Medicago truncatula]
          Length = 249

 Score =  189 bits (480), Expect = 2e-45
 Identities = 127/285 (44%), Positives = 144/285 (50%), Gaps = 4/285 (1%)
 Frame = +3

Query: 204  MSHK-TLDPRHSIEFQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCLSDRA 380
            MSHK TLD   +   QL  W+PFH             H   S  ++N    KRPCLSDR 
Sbjct: 1    MSHKPTLD---TCVLQLRTWKPFH-----------QIHDHGSHSHNNNNINKRPCLSDRT 46

Query: 381  T-SFPLDTLDMSKLSLFDDDRPI--RAGSHKQXXXXXXXXXXXXXXXXXXXXXXXXXXXX 551
            T SF   +LD+SKL+L D++ P   R  + K+                            
Sbjct: 47   TTSF---SLDLSKLTLTDNNPPANYRLIARKRRRRGSRSVSGRSSDRSATRRC------- 96

Query: 552  XXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXXXXX 731
               CS GAS AY TCSDFPVA GTDSSGELF NGDANW+SDVSEAK              
Sbjct: 97   ---CSVGASAAYGTCSDFPVAMGTDSSGELFGNGDANWSSDVSEAKNSRDCGGSGEKEKE 153

Query: 732  XXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRLGG 911
                           DA GNESGYGSEPGYRGDAE GYG         HRLLFWG++L  
Sbjct: 154  KENVGVGFGVNGCS-DANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDHRLLFWGNQL-- 210

Query: 912  KLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
                   G  DS ME+V EN   DQK+HHRCRR+K D RM   LR
Sbjct: 211  ------VGAVDSKMEMVGENTLLDQKSHHRCRRRKNDCRMIDALR 249


>ref|XP_002301478.1| hypothetical protein POPTR_0002s20610g [Populus trichocarpa]
            gi|222843204|gb|EEE80751.1| hypothetical protein
            POPTR_0002s20610g [Populus trichocarpa]
          Length = 263

 Score =  180 bits (457), Expect = 1e-42
 Identities = 122/286 (42%), Positives = 144/286 (50%), Gaps = 13/286 (4%)
 Frame = +3

Query: 204  MSHKTLDPRHSIEFQLHNWRPFHLPTTPTRPIDSSDHSRNSKPY-SNAFHTKRPCLSDRA 380
            MSH +     S   QLH+WRPF            SD + + KP+ S+   TKRPCLSDR+
Sbjct: 1    MSHNSRQSLDSCTLQLHSWRPFL----------DSDPTTSYKPHASSPTLTKRPCLSDRS 50

Query: 381  TSFP--LDTLDMSKLSLFDDD------RPIRAGS---HKQXXXXXXXXXXXXXXXXXXXX 527
            TSFP  +D++D+SKL+L +DD      +PI A +   +K+                    
Sbjct: 51   TSFPSNVDSIDLSKLTLLEDDHNNTNNKPIPAVTSRPYKRGTLRLIQRKRRRRGSRSVSG 110

Query: 528  XXXXXXXXXXYCSAGA-SGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXX 704
                       CS GA S A+ATCSDF VA GTDSSGELFVNGDANWASDVS+AK     
Sbjct: 111  RSSDRSGTRRCCSVGAASAAHATCSDFHVAVGTDSSGELFVNGDANWASDVSQAKNSVKE 170

Query: 705  XXXXXXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRL 884
                                      L +ESGYGSEPGYRGDAE+GYG          RL
Sbjct: 171  REEKENLLGVGNVIGN----------LDSESGYGSEPGYRGDAEVGYGDEVDEEEDDARL 220

Query: 885  LFWGDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQD 1022
            LFWG               DS ME+V EN F D K HHRCRRKK D
Sbjct: 221  LFWGHHF-----------QDSKMEMVGENTF-DSKTHHRCRRKKHD 254


>ref|XP_004172491.1| PREDICTED: uncharacterized LOC101206482, partial [Cucumis sativus]
          Length = 171

 Score =  163 bits (413), Expect = 1e-37
 Identities = 88/162 (54%), Positives = 96/162 (59%)
 Frame = +3

Query: 561  CSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXXXXXXXXXXX 740
            CS GAS A+ TCSDFP+A GTDSSGELFVNGDANW+SDVSEAK                 
Sbjct: 27   CSVGASAAHGTCSDFPIAVGTDSSGELFVNGDANWSSDVSEAK------NSRREREEKDH 80

Query: 741  XXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFWGDRLGGKLV 920
                        DA GNESGYGSEPGYRGD E GYG          RLL WG+RLG    
Sbjct: 81   LGSGFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG---- 136

Query: 921  GDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQDWRM*GNLR 1046
                   DS ME+V EN F+DQK+HHRCRRKK + RM   LR
Sbjct: 137  -------DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR 171


>ref|XP_006396451.1| hypothetical protein EUTSA_v10028893mg [Eutrema salsugineum]
            gi|557097468|gb|ESQ37904.1| hypothetical protein
            EUTSA_v10028893mg [Eutrema salsugineum]
          Length = 259

 Score =  157 bits (396), Expect = 1e-35
 Identities = 112/282 (39%), Positives = 134/282 (47%), Gaps = 10/282 (3%)
 Frame = +3

Query: 204  MSHKTLDP-RHSIE---FQLHNWRPFHLPTTPTRPIDSSDHSRNSKPYSNAFHTKRPCLS 371
            MS K L+  R SIE    QL +WRPFH   T    +DSSD S++ KPY +   TKRPC S
Sbjct: 1    MSQKHLESSRSSIESCTLQLLSWRPFHRSKT----LDSSDQSQSHKPYGS-ISTKRPCFS 55

Query: 372  DRATSFPLDTLDMSKLSLFDDDRPIRAGS------HKQXXXXXXXXXXXXXXXXXXXXXX 533
            DR+TSF ++   MS+LSL DDD     G       + +                      
Sbjct: 56   DRSTSFSIEA--MSRLSLADDDNNNNGGKLSASNYNSKGSFRLVARKRRRRNSRSVSGRS 113

Query: 534  XXXXXXXXYCSAGASGAYATCSDFPVAAGTDSSGELFVNGDANWASDVSEAKXXXXXXXX 713
                     CS GA G   TCSDFP A GTDSSGELF   +ANWASDVSEA+        
Sbjct: 114  SDRSGTRRCCSIGAHG---TCSDFPFAVGTDSSGELF--SEANWASDVSEARRERRDSGG 168

Query: 714  XXXXXXXXXXXXXXXXXXXXXDALGNESGYGSEPGYRGDAELGYGXXXXXXXXXHRLLFW 893
                                 D +GNESGYGSEPGYRGDAE GYG          + LFW
Sbjct: 169  EKEASGFGFAVGI--------DLMGNESGYGSEPGYRGDAEFGYGDEFDDEEEDVKPLFW 220

Query: 894  GDRLGGKLVGDRFGDTDSNMEVVEENAFSDQKAHHRCRRKKQ 1019
                         GDT S ME+  +  F++ K   RCRR++Q
Sbjct: 221  -------------GDTGSTMEMSGDTKFTESKHQFRCRRRRQ 249


Top