BLASTX nr result

ID: Catharanthus23_contig00007404 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00007404
         (1381 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241...   238   6e-60
ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258...   230   1e-57
ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585...   228   4e-57
gb|EMJ01755.1| hypothetical protein PRUPE_ppa009673mg [Prunus pe...   223   1e-55
ref|XP_002523082.1| conserved hypothetical protein [Ricinus comm...   210   1e-51
ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206...   209   3e-51
ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citr...   206   1e-50
ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313...   203   2e-49
gb|EOX95076.1| LYR motif-containing protein 7 isoform 1 [Theobro...   195   3e-47
gb|EOX95077.1| LYR motif-containing protein 7 isoform 2 [Theobro...   194   6e-47
ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795...   188   4e-45
gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis]     186   2e-44
gb|ESW16312.1| hypothetical protein PHAVU_007G146100g [Phaseolus...   179   2e-42
ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798...   178   5e-42
ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Popu...   175   4e-41
ref|XP_002301478.1| hypothetical protein POPTR_0002s20610g [Popu...   155   5e-35
ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493...   153   2e-34
ref|XP_006288487.1| hypothetical protein CARUB_v10001746mg [Caps...   142   4e-31
ref|NP_849288.1| uncharacterized protein [Arabidopsis thaliana] ...   140   1e-30
ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago ...   134   7e-29

>ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241933 [Vitis vinifera]
          Length = 269

 Score =  238 bits (606), Expect = 6e-60
 Identities = 142/273 (52%), Positives = 163/273 (59%), Gaps = 13/273 (4%)
 Frame = -3

Query: 1178 LDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDS--PKPYS--TTANGFHSKRPCRADRA 1011
            +  + SIESCT Q HSWRPFQ P   PKTL+ DS   KPYS  T++NG HSKRPC +DR 
Sbjct: 1    MSPKTSIESCTFQLHSWRPFQLPTT-PKTLEPDSHNSKPYSITTSSNGLHSKRPCLSDRK 59

Query: 1010 TSFSIEALDMSKLSLFDDDRPLSSAHK-----RWFAXXXXXXXXXXXXXXXXXXXGTHXX 846
            TSF I+ALD+SKLSL +DD+P SSA +     RW                     GT   
Sbjct: 60   TSFPIDALDISKLSLLEDDKPASSAPRNRGNVRWIDRKRRRRGSRSVSGRSSDRSGTRRC 119

Query: 845  XXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSGEK 666
                            DFP+AAGTDSSGELFVNGD NWSSDVSE AKNSR++RD GSGEK
Sbjct: 120  CSVGASAAYATCS---DFPVAAGTDSSGELFVNGDSNWSSDVSE-AKNSRKDRDGGSGEK 175

Query: 665  DNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFG--V 492
            +NL SGF  +G  E               GDAEFGYG        D+RLLFWG+  G   
Sbjct: 176  ENLGSGFGHIGIFETQGNESGYGSEPGYRGDAEFGYGDELDEEEDDARLLFWGEQLGDND 235

Query: 491  SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
            ++ME VGEN    QKAHHRCRRKKHD RM+D L
Sbjct: 236  TNMEMVGENTFSEQKAHHRCRRKKHDYRMIDAL 268


>ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258757 [Solanum
            lycopersicum]
          Length = 269

 Score =  230 bits (586), Expect = 1e-57
 Identities = 138/270 (51%), Positives = 160/270 (59%), Gaps = 8/270 (2%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYS-TTANGFHSKRPCRAD 1017
            MS +TLDSRH+IESCT   HSW+PFQFP    KTLD DSPK YS +T  G H+KR CRAD
Sbjct: 1    MSPKTLDSRHAIESCTYHLHSWKPFQFPSPNSKTLDLDSPKTYSPSTHGGLHTKRQCRAD 60

Query: 1016 RATSFSIEALDMSKLSLFDDDRPLSSAHKR----WFAXXXXXXXXXXXXXXXXXXXGTHX 849
            R TS  IEALDMSKLSLF++D+PLS  HKR      A                   GT  
Sbjct: 61   RTTSIPIEALDMSKLSLFEEDKPLS-VHKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRR 119

Query: 848  XXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGS-G 672
                             DFP+AAGTDSSGELFVNGD +W+ DVSE  K+ R+E++ G  G
Sbjct: 120  RCCSVGASAAYGTCS--DFPVAAGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVG 177

Query: 671  EKDNLSSGFA-QVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFG 495
            E++N  +G + Q GN E L             GDAEFGYG        D RL FWG  FG
Sbjct: 178  ERENNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFG 237

Query: 494  -VSSMERVGENMLQKAHHRCRRKKHDLRMV 408
             +S ME+VGEN LQK HHRCRR+K D RMV
Sbjct: 238  ALSRMEKVGENSLQKVHHRCRRRKQDCRMV 267


>ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585515 [Solanum tuberosum]
          Length = 269

 Score =  228 bits (582), Expect = 4e-57
 Identities = 138/270 (51%), Positives = 158/270 (58%), Gaps = 8/270 (2%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYS-TTANGFHSKRPCRAD 1017
            MS +TLDSRH+IESCT   HSW+PFQFP    KTLD DSPK YS +T  G H+KR CRAD
Sbjct: 1    MSPKTLDSRHAIESCTYHLHSWKPFQFPTPNSKTLDLDSPKTYSPSTHGGVHTKRQCRAD 60

Query: 1016 RATSFSIEALDMSKLSLFDDDRPLSSAHKR----WFAXXXXXXXXXXXXXXXXXXXGTHX 849
            R TS  IEALDMSKLSLF++DRPLS  HKR      A                   GT  
Sbjct: 61   RTTSIPIEALDMSKLSLFEEDRPLS-VHKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRR 119

Query: 848  XXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGS-G 672
                             DFP+A GTDSSGELFVNGD +W+ DVSE  K+ R+E++ G  G
Sbjct: 120  RCCSVGASAAYGTCS--DFPVAVGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVG 177

Query: 671  EKD-NLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFG 495
            E++ NL+    Q GN E L             GDAEFGYG        D RL FWG  FG
Sbjct: 178  ERESNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFG 237

Query: 494  -VSSMERVGENMLQKAHHRCRRKKHDLRMV 408
             +S ME+VGEN LQK HHRCRR+K D RMV
Sbjct: 238  ALSRMEKVGENTLQKVHHRCRRRKQDCRMV 267


>gb|EMJ01755.1| hypothetical protein PRUPE_ppa009673mg [Prunus persica]
          Length = 282

 Score =  223 bits (569), Expect = 1e-55
 Identities = 138/285 (48%), Positives = 164/285 (57%), Gaps = 20/285 (7%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPF---QFPVAIPKTLDSD----SPKPYSTTANGF--H 1041
            MSH+ L+ RH I+SC  Q HSWRPF   Q      KTLDSD    +PKPY++++NG   H
Sbjct: 1    MSHKALEHRHPIDSCAFQLHSWRPFHLHQQTTPTSKTLDSDPSLPNPKPYNSSSNGLVVH 60

Query: 1040 SKRPCRADRATSFSIEALDMSKLSLFDDDRPLSSAHK------RWFAXXXXXXXXXXXXX 879
            +KRPC ++RATSFSI+A+DMS+L+L DDDR +S  H       R+ A             
Sbjct: 61   TKRPCLSNRATSFSIDAIDMSRLTLVDDDRTISGGHHNRHGSFRFIAKKRRRHGSRSVSG 120

Query: 878  XXXXXXGTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNS 699
                  GT                   DFP+A GTDSSGELF NGD NW+SDVSE A+NS
Sbjct: 121  RSSDRSGTRRCCSVGASAAYGTCS---DFPVAVGTDSSGELFGNGDANWASDVSE-ARNS 176

Query: 698  RRERD-NGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSR 522
            R+ERD  GSGEK+NL  GF  +G  +               GDAEFGYG        D+R
Sbjct: 177  RKERDGGGSGEKENLGIGFGPIGGFDVQGNESGYGSEPGYRGDAEFGYGDELDEEEEDTR 236

Query: 521  LLFWGQGFG--VSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
            LLFWG  FG   S ME VGEN    QK+HHRCRRKKHD RMVD L
Sbjct: 237  LLFWGDQFGDADSMMEIVGENTFVDQKSHHRCRRKKHDCRMVDTL 281


>ref|XP_002523082.1| conserved hypothetical protein [Ricinus communis]
            gi|223537644|gb|EEF39267.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 261

 Score =  210 bits (534), Expect = 1e-51
 Identities = 124/270 (45%), Positives = 152/270 (56%), Gaps = 7/270 (2%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRADR 1014
            MSHR+LDSRHSI+SCT Q HSWRPF       +TLDSD PKPYS+T     +KRPC +DR
Sbjct: 1    MSHRSLDSRHSIDSCTFQLHSWRPFHL-----QTLDSDPPKPYSST-----TKRPCLSDR 50

Query: 1013 ATSFSIEALDMSKLSLFDDDRPLSSAHKRWF---AXXXXXXXXXXXXXXXXXXXGTHXXX 843
             TSF I+++D+SKLS+ DDD+P+S +    +                        +    
Sbjct: 51   TTSFPIDSIDISKLSIIDDDKPISVSAATAYNSRGSLRLIARKRRRRGSRSVSGRSSDRS 110

Query: 842  XXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSGEKD 663
                          SDFP+A GTDSSGELF NGD NW SDVSEA  + +RE+D    EK+
Sbjct: 111  GTRRCCSVGAHGTCSDFPVAVGTDSSGELFGNGDSNWGSDVSEAKNSIKREKDREREEKE 170

Query: 662  NLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFGVS-- 489
            N+  G+ Q G  EN              GDAEFGY         D++LLFWG  FG +  
Sbjct: 171  NM--GYGQFGTFENQGNESGYGSEPGYRGDAEFGYEDEIDEEEDDAKLLFWGDHFGGTGP 228

Query: 488  SMERVGENML--QKAHHRCRRKKHDLRMVD 405
             ME VGEN    QK+HHRCRRKKHD RM+D
Sbjct: 229  KMEMVGENSFSDQKSHHRCRRKKHDNRMLD 258


>ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206482 [Cucumis sativus]
          Length = 266

 Score =  209 bits (531), Expect = 3e-51
 Identities = 132/276 (47%), Positives = 154/276 (55%), Gaps = 11/276 (3%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSD--------SPKPYSTTANGFHS 1038
            MS R LDSRHSI+SCTL+FH W PF     +PKTLDSD        + KPY ++    H+
Sbjct: 1    MSRRPLDSRHSIDSCTLKFHGWTPFH----LPKTLDSDPHNTSAPTNSKPYYSSTP-LHT 55

Query: 1037 KRPCRADRATSFSIEALDMSKLSLFDDDRPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXG 858
            KRPC +DR TSF+++A+DMS LSL DDD+P S    R F                     
Sbjct: 56   KRPCLSDRTTSFNVDAIDMSALSLIDDDKP-SIPPARSFRLIARKRRRRGSRSVSGRSSD 114

Query: 857  THXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNG 678
                               SDFP+A GTDSSGELFVNGD NWSSDVSE AKNSRRER+  
Sbjct: 115  RSGTRRCCSVGASAAHGTCSDFPIAVGTDSSGELFVNGDANWSSDVSE-AKNSRRERE-- 171

Query: 677  SGEKDNLSSGF-AQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQG 501
              EKD+L SGF +  G  +               GD EFGYG        D+RLL WG+ 
Sbjct: 172  --EKDHLGSGFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 229

Query: 500  FGVSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
             G S ME VGEN    QK+HHRCRRKKH+ RMVD L
Sbjct: 230  LGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDAL 265


>ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citrus clementina]
            gi|568852594|ref|XP_006479957.1| PREDICTED:
            uncharacterized protein LOC102627953 [Citrus sinensis]
            gi|557546612|gb|ESR57590.1| hypothetical protein
            CICLE_v10021537mg [Citrus clementina]
          Length = 283

 Score =  206 bits (525), Expect = 1e-50
 Identities = 137/284 (48%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
 Frame = -3

Query: 1190 SHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDS-DSPKPYSTTANGFHSKRPCRADR 1014
            SH+ LDSRHSI+SC LQ H+WRPF     +   LDS DS KP  + ++  H+KRPC +DR
Sbjct: 4    SHKPLDSRHSIDSCALQLHNWRPFH----LQNPLDSSDSTKPSYSPSSWVHTKRPCLSDR 59

Query: 1013 ATSFSI---EALDMSKLSLFDDD---RPLSSA----HKRWFAXXXXXXXXXXXXXXXXXX 864
            ATSFSI    A+D+SKLSLFDDD   +P+++A     +  +                   
Sbjct: 60   ATSFSIIDAAAIDLSKLSLFDDDNVIKPMTAATAPQSRGGYRLIARKRRRRGSRSVSGRS 119

Query: 863  XGTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERD 684
                                 SDFP+A GTDSSGELF NG+ NW+SDVSE A+NSRRERD
Sbjct: 120  SDRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGEANWASDVSE-ARNSRRERD 178

Query: 683  --NGSGEKDNLSSGF-AQVGNLEN--LXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRL 519
              NGSGEK+N  +GF  QVG LE   L             GDAEFGYG        D++L
Sbjct: 179  NGNGSGEKENSGTGFGGQVGCLEAQVLGNESGYGSEPGYRGDAEFGYGDELDEEEEDAKL 238

Query: 518  LFWGQGFG--VSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
            LFWG  FG   S ME VGEN    QK+HHRCRRKKHD RMVD L
Sbjct: 239  LFWGNRFGDVDSKMEMVGENTFTDQKSHHRCRRKKHDCRMVDAL 282


>ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313678 [Fragaria vesca
            subsp. vesca]
          Length = 271

 Score =  203 bits (516), Expect = 2e-49
 Identities = 133/277 (48%), Positives = 154/277 (55%), Gaps = 15/277 (5%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSD--SPKPYSTTANGFHSKRPCRA 1020
            MSH+ LDSR S++SCT Q HSWRPFQ      KTLDSD  +PKPY       H+KRPC +
Sbjct: 1    MSHKALDSRPSLDSCTFQLHSWRPFQLQQQPTKTLDSDPANPKPY-------HTKRPCLS 53

Query: 1019 DRATS-FSIEALDMSKLSLFDDDRPLSSAHK------RWFAXXXXXXXXXXXXXXXXXXX 861
            +RATS FSI+A+DMS+L+L DDDR +S  H       R+ A                   
Sbjct: 54   NRATSSFSIDAIDMSRLTLVDDDRTISGGHHHKHGSFRFLARKRRRHGSRSVSGRSSDRS 113

Query: 860  GTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDN 681
            GT                   DFP+A GTDSSGELF NGD NW+SDVSE A+N R+ERD 
Sbjct: 114  GTRRCCSVGASAAHGTCS---DFPVAIGTDSSGELFGNGDANWASDVSE-ARNLRKERDG 169

Query: 680  -GSGEKDNLSS-GFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWG 507
             GSGEK+     GF   G  +               GDAEFGYG        D+RLLFWG
Sbjct: 170  VGSGEKETTPGVGFGPGGGFDAQGNESGYGSEPGYRGDAEFGYGDELDEEEEDARLLFWG 229

Query: 506  QGFGVSS--MERVGENML--QKAHHRCRRKKHDLRMV 408
              FG S   ME VGEN    QK+HHRCRRKKHD RMV
Sbjct: 230  NRFGDSDTMMEVVGENTFTDQKSHHRCRRKKHDCRMV 266


>gb|EOX95076.1| LYR motif-containing protein 7 isoform 1 [Theobroma cacao]
          Length = 271

 Score =  195 bits (496), Expect = 3e-47
 Identities = 131/279 (46%), Positives = 151/279 (54%), Gaps = 16/279 (5%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPK---PYSTTANGFHSKRPCR 1023
            MSH+ L+ RHSI+SCT Q HSWRPFQ    + +TLDS  P+   P   + N FHSKRPC 
Sbjct: 1    MSHKALEPRHSIDSCTFQLHSWRPFQ----LQQTLDSSDPQQTPPKRASTNCFHSKRPCL 56

Query: 1022 ADRATSFSIEALDMSKLSLFDDDR-----PLSSAHKRW-FAXXXXXXXXXXXXXXXXXXX 861
            +DR TSFSI   D+SKL+L DDD      P+++  KR  F                    
Sbjct: 57   SDRTTSFSI---DLSKLTLLDDDNNSSYNPIAANPKRGSFRLFARKRRRRGSRSVSGRSS 113

Query: 860  GTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNG-DPNWSSDVSEAAKNSRRER- 687
                                SDFP+A GTDSSGELF NG D  W+SDVSE A+NSRRER 
Sbjct: 114  DRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGADAYWASDVSE-ARNSRRERG 172

Query: 686  DNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWG 507
            D GSGEK++L     Q G  +               GD EFGYG        D+RLLFWG
Sbjct: 173  DGGSGEKESLG---GQFGGFDAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLFWG 229

Query: 506  QGFGV---SSMERVGENML--QKAHHRCRRKKHDLRMVD 405
              FG    S ME VGEN    QKAHHRCRRKKHD RMVD
Sbjct: 230  HHFGADTDSKMEMVGENTFSDQKAHHRCRRKKHDYRMVD 268


>gb|EOX95077.1| LYR motif-containing protein 7 isoform 2 [Theobroma cacao]
          Length = 270

 Score =  194 bits (494), Expect = 6e-47
 Identities = 131/278 (47%), Positives = 151/278 (54%), Gaps = 15/278 (5%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPK---PYSTTANGFHSKRPCR 1023
            MSH+ L+ RHSI+SCT Q HSWRPFQ    + +TLDS  P+   P   + N FHSKRPC 
Sbjct: 1    MSHKALEPRHSIDSCTFQLHSWRPFQ----LQQTLDSSDPQQTPPKRASTNCFHSKRPCL 56

Query: 1022 ADRATSFSIEALDMSKLSLFDDDR-----PLSSAHKRW-FAXXXXXXXXXXXXXXXXXXX 861
            +DR TSFSI   D+SKL+L DDD      P+++  KR  F                    
Sbjct: 57   SDRTTSFSI---DLSKLTLLDDDNNSSYNPIAANPKRGSFRLFARKRRRRGSRSVSGRSS 113

Query: 860  GTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNG-DPNWSSDVSEAAKNSRRER- 687
                                SDFP+A GTDSSGELF NG D  W+SDVSE A+NSRRER 
Sbjct: 114  DRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGADAYWASDVSE-ARNSRRERG 172

Query: 686  DNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWG 507
            D GSGEK++L     Q G  +               GD EFGYG        D+RLLFWG
Sbjct: 173  DGGSGEKESLG---GQFGGFDAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLFWG 229

Query: 506  QGFG--VSSMERVGENML--QKAHHRCRRKKHDLRMVD 405
              FG   S ME VGEN    QKAHHRCRRKKHD RMVD
Sbjct: 230  HHFGDTDSKMEMVGENTFSDQKAHHRCRRKKHDYRMVD 267


>ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795813 [Glycine max]
          Length = 260

 Score =  188 bits (478), Expect = 4e-45
 Identities = 126/273 (46%), Positives = 142/273 (52%), Gaps = 7/273 (2%)
 Frame = -3

Query: 1196 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRAD 1017
            +MSH+ LDSRH+ +SC LQ  +W+PF+         D   PKPY       + KRPC +D
Sbjct: 4    SMSHKPLDSRHTTDSCLLQLRTWKPFKLQQ------DGPHPKPY-------YHKRPCLSD 50

Query: 1016 RAT-SFSIEALDMSKLSLFDDDRPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXGTHXXXX 840
            R T SFS   LDMSKL+L DDD    +     +                           
Sbjct: 51   RTTTSFS---LDMSKLTLADDDNHNPNNRATNYRLVARKRRRRGSRSVSGRSSDRSGTRR 107

Query: 839  XXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSR--RERDNGSGEK 666
                         SDFP+A GTDSSGELF NGDPNWSSDVSE AKNSR  RERD GSGEK
Sbjct: 108  CCSVGASAAYGTCSDFPVAMGTDSSGELFGNGDPNWSSDVSE-AKNSRRERERDGGSGEK 166

Query: 665  DNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFGV-- 492
            +NL  GF   G  E               GDAEFGYG        D RLLFWG   G   
Sbjct: 167  ENLGVGFGVSGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQLGAVD 226

Query: 491  SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
            S ME VGEN L  QK+HHRCRR+KHD RMVD L
Sbjct: 227  SKMEMVGENTLLDQKSHHRCRRRKHDCRMVDAL 259


>gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis]
          Length = 275

 Score =  186 bits (473), Expect = 2e-44
 Identities = 126/276 (45%), Positives = 148/276 (53%), Gaps = 13/276 (4%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIP-KTLDSDSPKPYSTTANGFHS--KRPCR 1023
            MS + LDSRHSI+SC  Q HSWRPFQ     P KTLD+ +   +  +  G H+  KRPC 
Sbjct: 1    MSPKLLDSRHSIDSCAFQLHSWRPFQQHSTPPTKTLDAANNPRHYRSNGGAHAITKRPCL 60

Query: 1022 ADRATSFSIEALDMSKLSLFDDD--RPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXGTHX 849
            +DRATSF I+A+DMS+LSL DDD  RP    ++                           
Sbjct: 61   SDRATSFPIDAIDMSRLSLVDDDTARPHHHQYRGSLRLLARKRRRRGSRSVSGRSSDRSG 120

Query: 848  XXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVN-GDPNWSSDVSEAAKNSRRERD---N 681
                            SDFP+A GTDSSGELF+N GD NWSSDVSE A+NSRRERD    
Sbjct: 121  TRRCCSVGASAAYGTCSDFPVAVGTDSSGELFLNTGDANWSSDVSE-ARNSRRERDGAGG 179

Query: 680  GSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQG 501
            GSGEK++       +G  ++              GDAEFGYG        D+RLLFWG  
Sbjct: 180  GSGEKESFG---GVIGGFDSQGAESGYGSEPGYRGDAEFGYGDEHDEEEDDARLLFWGNR 236

Query: 500  F--GVSSMERVGENML--QKAHHRCRRKKHDLRMVD 405
            F    S  E VGEN    QK HHRCRRKKHD RMVD
Sbjct: 237  FEDTDSMTEIVGENTFSDQKVHHRCRRKKHDCRMVD 272


>gb|ESW16312.1| hypothetical protein PHAVU_007G146100g [Phaseolus vulgaris]
          Length = 261

 Score =  179 bits (455), Expect = 2e-42
 Identities = 127/279 (45%), Positives = 145/279 (51%), Gaps = 13/279 (4%)
 Frame = -3

Query: 1196 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRAD 1017
            +MSH+ LDSRHSI+SC LQ  SW+PF       K  D   PKPY       + KRPC +D
Sbjct: 4    SMSHKPLDSRHSIDSCMLQLRSWKPF-------KLQDGPHPKPY-------YYKRPCLSD 49

Query: 1016 RAT-SFSIEALDMSKLSLFDDDRPLSSAHK--------RWFAXXXXXXXXXXXXXXXXXX 864
            RAT SFS   LD++KL+L D D   + A+         R  A                  
Sbjct: 50   RATTSFS---LDIAKLTLADADDTTTIANNPNHRATNYRLVARKRRRRGSRSVSGRSSDR 106

Query: 863  XGTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERD 684
             GT                   DFP+A GTDSSGELF NGDPNWSSDVSE AKNSRRER+
Sbjct: 107  SGTRRCCSVGASAAYGTCS---DFPVAMGTDSSGELFGNGDPNWSSDVSE-AKNSRRERE 162

Query: 683  NGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQ 504
               GE++N+  GF   G  E               GDAEFGYG        D RLLFWG 
Sbjct: 163  R-DGERENVGVGFGVSGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGD 221

Query: 503  GFGV--SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
             FG   S  E VGEN L  QK+HHRCRR+KHD RMVD L
Sbjct: 222  QFGAVDSKREMVGENTLLDQKSHHRCRRRKHDCRMVDAL 260


>ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798288 [Glycine max]
          Length = 260

 Score =  178 bits (451), Expect = 5e-42
 Identities = 122/274 (44%), Positives = 141/274 (51%), Gaps = 8/274 (2%)
 Frame = -3

Query: 1196 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRAD 1017
            +MSH+ LDSRHSI+SC LQ  SW+PF+         D   PKPY       + KRPC +D
Sbjct: 4    SMSHKPLDSRHSIDSCLLQLRSWKPFKLQQ------DGPHPKPY-------YHKRPCLSD 50

Query: 1016 RAT-SFSIEALDMSKLSLFDDDRPLSSAHKRW---FAXXXXXXXXXXXXXXXXXXXGTHX 849
            R T SFS   LDMSKL+L  DD  + + +      +                        
Sbjct: 51   RTTTSFS---LDMSKLTLAADDDTIHNPNNNRATNYRLVARKRRRRGSRSLSGRSSDRSG 107

Query: 848  XXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSGE 669
                            SDFP+A GTDSSGELF NGDPNWSSDVSE AKNSRRER+   GE
Sbjct: 108  TRRCCSVGASAAYGTCSDFPVAMGTDSSGELFGNGDPNWSSDVSE-AKNSRRERER-DGE 165

Query: 668  KDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFGV- 492
            K+N+  GF   G  +               GDAEFGYG        D RLLFWG   G  
Sbjct: 166  KENVGVGFGVSGCSDANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQLGAV 225

Query: 491  -SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
             S  E VGEN L  QK+HHRCRR+KHD RMVD L
Sbjct: 226  DSKREMVGENTLLDQKSHHRCRRRKHDCRMVDAL 259


>ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Populus trichocarpa]
            gi|550324059|gb|EEE99322.2| hypothetical protein
            POPTR_0014s12400g [Populus trichocarpa]
          Length = 279

 Score =  175 bits (444), Expect = 4e-41
 Identities = 126/283 (44%), Positives = 152/283 (53%), Gaps = 18/283 (6%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSP---KPYSTTANGFHSKRPCR 1023
            MSH   +SRHSI+SCTLQ HSWRPF         LDSD P   KPY+++      KRPC 
Sbjct: 19   MSH---NSRHSIDSCTLQLHSWRPF---------LDSDPPTNSKPYASSRT--LPKRPCL 64

Query: 1022 ADRATSF--SIEALDMSKLSLFDDD-----RPL-------SSAHKRWFAXXXXXXXXXXX 885
            +DRATSF  +I+++D+SKLSL  DD     +P+       +S +KR              
Sbjct: 65   SDRATSFPSNIDSIDISKLSLLQDDDNNNNKPIPATPAVTNSPYKRG-TLRLIERKRRRR 123

Query: 884  XXXXXXXXGTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAK 705
                     +                  SDFP+A GTDSSGELFVNGD NW+SDVSE AK
Sbjct: 124  GSRSVSGRSSDRSGTWRCCSVGAAHGTCSDFPVAVGTDSSGELFVNGDANWASDVSE-AK 182

Query: 704  NSRRERDNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDS 525
            NS +ER+    EK+NL    +  GNL++              GDAEFGYG        D+
Sbjct: 183  NSIKERE----EKENLLGVGSAFGNLDS---ESGYGSEPGYRGDAEFGYGDEVDEEEDDA 235

Query: 524  RLLFWGQGFGVSSMERVGENMLQ-KAHHRCRRKKHDLRMVDIL 399
            RLLFWG  F  S ME VGEN    K HHRCRR+KHD RMVD L
Sbjct: 236  RLLFWGHHFQDSKMEMVGENTFDPKTHHRCRRRKHDYRMVDSL 278


>ref|XP_002301478.1| hypothetical protein POPTR_0002s20610g [Populus trichocarpa]
            gi|222843204|gb|EEE80751.1| hypothetical protein
            POPTR_0002s20610g [Populus trichocarpa]
          Length = 263

 Score =  155 bits (391), Expect = 5e-35
 Identities = 116/282 (41%), Positives = 141/282 (50%), Gaps = 19/282 (6%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANG-FHSKRPCRAD 1017
            MSH   +SR S++SCTLQ HSWRPF         LDSD    Y   A+    +KRPC +D
Sbjct: 1    MSH---NSRQSLDSCTLQLHSWRPF---------LDSDPTTSYKPHASSPTLTKRPCLSD 48

Query: 1016 RATSF--SIEALDMSKLSLFDDD--------------RPLSSAHKRWFAXXXXXXXXXXX 885
            R+TSF  +++++D+SKL+L +DD              RP      R              
Sbjct: 49   RSTSFPSNVDSIDLSKLTLLEDDHNNTNNKPIPAVTSRPYKRGTLRLIQRKRRRRGSRSV 108

Query: 884  XXXXXXXXGTHXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAK 705
                    GT                   DF +A GTDSSGELFVNGD NW+SDVS+ AK
Sbjct: 109  SGRSSDRSGTRRCCSVGAASAAHATCS--DFHVAVGTDSSGELFVNGDANWASDVSQ-AK 165

Query: 704  NSRRERDNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDS 525
            NS +ER+    EK+NL      +GNL++              GDAE GYG        D+
Sbjct: 166  NSVKERE----EKENLLGVGNVIGNLDS---ESGYGSEPGYRGDAEVGYGDEVDEEEDDA 218

Query: 524  RLLFWGQGFGVSSMERVGENML-QKAHHRCRRKKHDL-RMVD 405
            RLLFWG  F  S ME VGEN    K HHRCRRKKHD  RMVD
Sbjct: 219  RLLFWGHHFQDSKMEMVGENTFDSKTHHRCRRKKHDCSRMVD 260


>ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493408 [Cicer arietinum]
          Length = 261

 Score =  153 bits (386), Expect = 2e-34
 Identities = 119/278 (42%), Positives = 143/278 (51%), Gaps = 12/278 (4%)
 Frame = -3

Query: 1196 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDS--DSPKPYSTTANGFHSKRPCR 1023
            +MSH++     +I+SC LQ  +WRPF      P+T  S   S  P   + N    KRPC 
Sbjct: 4    SMSHKS-----TIDSCVLQLRTWRPFHH--LHPQTTSSLDGSHNPTKPSLN----KRPCL 52

Query: 1022 ADRAT-SFSIEALDMSKLSLFDDDRPLSS-AHKRWFAXXXXXXXXXXXXXXXXXXXGTHX 849
            +DR T SFS   LD+SKL+L DDDRP+++ A+ R  A                    T  
Sbjct: 53   SDRTTTSFS---LDLSKLTLADDDRPINNTANHRLIARKRRRRCSRSVSGRSSDRSATRR 109

Query: 848  XXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSG- 672
                             DFP+A GTDSSGELF NGD NWSSDVSE AKNS   RD GSG 
Sbjct: 110  CCSVGASAAYGTCS---DFPVAMGTDSSGELFGNGDANWSSDVSE-AKNS---RDGGSGE 162

Query: 671  -EKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFG 495
             EK+N++ GF   G  E               GDAEFGYG        D R+LFWG   G
Sbjct: 163  KEKENVALGFGVNGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDHRVLFWGNQLG 222

Query: 494  ----VSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
                 S ME VGEN L  QK+HHR RR+K+D RM+D L
Sbjct: 223  GAAVDSKMEMVGENTLLDQKSHHRLRRRKNDCRMIDAL 260


>ref|XP_006288487.1| hypothetical protein CARUB_v10001746mg [Capsella rubella]
            gi|482557193|gb|EOA21385.1| hypothetical protein
            CARUB_v10001746mg [Capsella rubella]
          Length = 261

 Score =  142 bits (357), Expect = 4e-31
 Identities = 110/270 (40%), Positives = 133/270 (49%), Gaps = 7/270 (2%)
 Frame = -3

Query: 1193 MSHRTLD-SRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHS--KRPCR 1023
            MS + L+ SR SIESCT Q  SWRPFQ      KTLDS    P +   NGFHS  KRPC 
Sbjct: 1    MSQKHLEPSRSSIESCTSQLLSWRPFQRS----KTLDSPDHPPQT---NGFHSTTKRPCF 53

Query: 1022 ADRATSFSIEALDMSKLSLFDDD---RPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXGTH 852
            +DR+TSFSIEA  MS+LSL DDD   + LS+++                         + 
Sbjct: 54   SDRSTSFSIEA--MSRLSLADDDNGGKTLSASNYSNRGSFRLVARKRRRRNSRSVSGRSS 111

Query: 851  XXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSG 672
                             SD P A GTDSSGELF  G+ NW SDVSEAA+NSRRER +  G
Sbjct: 112  DRSGTRRCCSIGAHGTCSDLPFAVGTDSSGELF--GEANWGSDVSEAARNSRRERRDSGG 169

Query: 671  EKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFGV 492
            EK+  S GF     ++ +             GDAEFGYG        D + LFWG     
Sbjct: 170  EKE-ASGGFGFAIGIDPMGNESGYGSEPGYRGDAEFGYGDEFDDEEEDVKPLFWGDTDST 228

Query: 491  SSMERVGENMLQKAHHRCRRKK-HDLRMVD 405
              M    +    K   RCRR++ HD + VD
Sbjct: 229  MGMAGDTKFSDNKPQFRCRRRRQHDYKTVD 258


>ref|NP_849288.1| uncharacterized protein [Arabidopsis thaliana]
            gi|26450275|dbj|BAC42254.1| unknown protein [Arabidopsis
            thaliana] gi|28973027|gb|AAO63838.1| unknown protein
            [Arabidopsis thaliana] gi|332656769|gb|AEE82169.1|
            uncharacterized protein AT4G02425 [Arabidopsis thaliana]
          Length = 262

 Score =  140 bits (354), Expect = 1e-30
 Identities = 108/271 (39%), Positives = 133/271 (49%), Gaps = 8/271 (2%)
 Frame = -3

Query: 1193 MSHRTLDS-RHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHS---KRPC 1026
            MS + L+S R SIESCT Q  SWRPF       KTLDS    P +   NGFHS   KRPC
Sbjct: 1    MSPKHLESSRSSIESCTSQLLSWRPFHRS----KTLDSSDQPPQT---NGFHSFTPKRPC 53

Query: 1025 RADRATSFSIEALDMSKLSLFDDD---RPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXGT 855
             +DR+TSF+IEA  MS+LSL DDD   + LS+++                         +
Sbjct: 54   FSDRSTSFTIEA--MSRLSLADDDNGGKTLSASNYSNRGSFRLVARKRRRRNSRSVSGRS 111

Query: 854  HXXXXXXXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGS 675
                              SD P A GTDSSGELF  G+ NW+SDVSEAA+NSRRER +  
Sbjct: 112  SDRSGTRRCCSIGAHGTCSDLPFAVGTDSSGELF--GEANWASDVSEAARNSRRERRDSG 169

Query: 674  GEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQGFG 495
            GEK+  S GF     ++ +             GDAEFGYG        D + LFWG    
Sbjct: 170  GEKE-ASGGFGFANGVDPMGNESGYGSEPGYRGDAEFGYGDEFDDEEEDVKPLFWGDTDS 228

Query: 494  VSSMERVGENMLQKAHHRCRRKK-HDLRMVD 405
               M    +    K   RCRR++ HD + VD
Sbjct: 229  TMGMSGETKFSDSKPQFRCRRRRQHDYKTVD 259


>ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago truncatula]
            gi|355480578|gb|AES61781.1| hypothetical protein
            MTR_1g088580 [Medicago truncatula]
          Length = 249

 Score =  134 bits (338), Expect = 7e-29
 Identities = 108/273 (39%), Positives = 132/273 (48%), Gaps = 8/273 (2%)
 Frame = -3

Query: 1193 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRADR 1014
            MSH+      ++++C LQ  +W+PF       +  D  S    +   N    KRPC +DR
Sbjct: 1    MSHKP-----TLDTCVLQLRTWKPFH------QIHDHGSHSHNNNNIN----KRPCLSDR 45

Query: 1013 AT-SFSIEALDMSKLSLFDDDRPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXGTHXXXXX 837
             T SFS   LD+SKL+L D++ P   A+ R  A                    T      
Sbjct: 46   TTTSFS---LDLSKLTLTDNNPP---ANYRLIARKRRRRGSRSVSGRSSDRSATRRCCSV 99

Query: 836  XXXXXXXXXXXXSDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNG--SGEKD 663
                         DFP+A GTDSSGELF NGD NWSSDVSE AKNSR    +G    EK+
Sbjct: 100  GASAAYGTCS---DFPVAMGTDSSGELFGNGDANWSSDVSE-AKNSRDCGGSGEKEKEKE 155

Query: 662  NLSSGFAQVGNLENLXXXXXXXXXXXXXGDAEFGYGXXXXXXXXDSRLLFWGQ---GFGV 492
            N+  GF   G  +               GDAEFGYG        D RLLFWG    G   
Sbjct: 156  NVGVGFGVNGCSDANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDHRLLFWGNQLVGAVD 215

Query: 491  SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 399
            S ME VGEN L  QK+HHRCRR+K+D RM+D L
Sbjct: 216  SKMEMVGENTLLDQKSHHRCRRRKNDCRMIDAL 248


Top