BLASTX nr result

ID: Jatropha_contig00039694

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00039694
         (698 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EEF01549.2| hypothetical protein POPTR_0010s25780g [Populus t...   121   2e-25
gb|ESR40999.1| hypothetical protein CICLE_v10025586mg [Citrus cl...   108   2e-21
gb|ESR40998.1| hypothetical protein CICLE_v10025586mg [Citrus cl...   108   2e-21
gb|EOY26364.1| Cation efflux family protein isoform 3 [Theobroma...   108   2e-21
gb|EOY26363.1| Cation efflux family protein isoform 2 [Theobroma...   108   2e-21
gb|EOY26362.1| Cation efflux family protein isoform 1 [Theobroma...   108   2e-21
ref|XP_004159008.1| PREDICTED: metal tolerance protein C4-like [...   103   6e-20
ref|XP_004141828.1| PREDICTED: metal tolerance protein C4-like [...   103   6e-20
ref|XP_004507015.1| PREDICTED: metal tolerance protein C4-like [...    94   5e-17
ref|XP_006304879.1| hypothetical protein CARUB_v10012642mg [Caps...    93   7e-17
ref|XP_002891647.1| hypothetical protein ARALYDRAFT_892136 [Arab...    93   7e-17
gb|ESQ30315.1| hypothetical protein EUTSA_v10011469mg [Eutrema s...    90   6e-16
gb|ESQ30314.1| hypothetical protein EUTSA_v10011469mg [Eutrema s...    90   6e-16
emb|CBI18478.3| unnamed protein product [Vitis vinifera]               89   1e-15
ref|XP_002262765.1| PREDICTED: metal tolerance protein C4 [Vitis...    89   1e-15
gb|AAK92793.1| unknown protein [Arabidopsis thaliana]                  88   3e-15
gb|AAG50870.1|AC025294_8 hypothetical protein [Arabidopsis thali...    88   3e-15
ref|NP_564594.1| metal tolerance protein C4 [Arabidopsis thalian...    88   3e-15
ref|XP_003530162.1| PREDICTED: metal tolerance protein C4-like [...    83   7e-14
gb|ESW14120.1| hypothetical protein PHAVU_008G2549001g, partial ...    83   9e-14

>gb|EEF01549.2| hypothetical protein POPTR_0010s25780g [Populus trichocarpa]
          Length = 459

 Score =  121 bits (304), Expect = 2e-25
 Identities = 68/140 (48%), Positives = 85/140 (60%), Gaps = 5/140 (3%)
 Frame = +2

Query: 293 LHFKTSSSSPILWNKDLIFLXXXXXXXXXXXXXXXXLDPNFIKQNLSSKYLFGSFTQSRG 472
           L+     +  IL N+DL+FL                L+PNF  QN S  + + S+TQS G
Sbjct: 11  LYHSQKRNPSILCNRDLLFLLTHDSNGNTTV-----LEPNFTAQNQS--FAYSSYTQSCG 63

Query: 473 FLTSNSSKRFLLLGLVAL-----DHQGHHHYITRRSFFRRAKQVQKIEINDQHSQRAVRT 637
           F  S SSKRF   GLV+L     D    + Y+  R FF RAK V++IEI+DQHSQRAV T
Sbjct: 64  FFASKSSKRFAFWGLVSLYGNQNDQNSSYSYLAHRGFFTRAKPVKRIEISDQHSQRAVTT 123

Query: 638 ALWCNFLVFSLKFGVWLASS 697
           ALWCNFLVFSLKFGVW +++
Sbjct: 124 ALWCNFLVFSLKFGVWFSTN 143


>gb|ESR40999.1| hypothetical protein CICLE_v10025586mg [Citrus clementina]
          Length = 355

 Score =  108 bits (269), Expect = 2e-21
 Identities = 54/82 (65%), Positives = 64/82 (78%), Gaps = 3/82 (3%)
 Frame = +2

Query: 461 QSRGFLTSNSSKRFLLLGLVALDHQG---HHHYITRRSFFRRAKQVQKIEINDQHSQRAV 631
           Q RGF + + SKRF+LLGLV+ D+ G   HH Y + R+FF RAKQV+KIE  D+HSQRAV
Sbjct: 83  QFRGFCSVHCSKRFVLLGLVSFDNSGSNQHHKYSSNRNFFTRAKQVKKIETTDEHSQRAV 142

Query: 632 RTALWCNFLVFSLKFGVWLASS 697
            TALW NFLVFSLKFGVWL +S
Sbjct: 143 TTALWGNFLVFSLKFGVWLGTS 164


>gb|ESR40998.1| hypothetical protein CICLE_v10025586mg [Citrus clementina]
          Length = 452

 Score =  108 bits (269), Expect = 2e-21
 Identities = 54/82 (65%), Positives = 64/82 (78%), Gaps = 3/82 (3%)
 Frame = +2

Query: 461 QSRGFLTSNSSKRFLLLGLVALDHQG---HHHYITRRSFFRRAKQVQKIEINDQHSQRAV 631
           Q RGF + + SKRF+LLGLV+ D+ G   HH Y + R+FF RAKQV+KIE  D+HSQRAV
Sbjct: 83  QFRGFCSVHCSKRFVLLGLVSFDNSGSNQHHKYSSNRNFFTRAKQVKKIETTDEHSQRAV 142

Query: 632 RTALWCNFLVFSLKFGVWLASS 697
            TALW NFLVFSLKFGVWL +S
Sbjct: 143 TTALWGNFLVFSLKFGVWLGTS 164


>gb|EOY26364.1| Cation efflux family protein isoform 3 [Theobroma cacao]
          Length = 361

 Score =  108 bits (269), Expect = 2e-21
 Identities = 62/108 (57%), Positives = 74/108 (68%), Gaps = 9/108 (8%)
 Frame = +2

Query: 401 LDPNFIKQNLSSKYLFGSFTQS---RGFLTSNSSKRFLLLGLVALDHQG----HHH--YI 553
           L+P+ I  N    + F S+T     RGF + NSS+RF+LLG V  ++      HH   Y 
Sbjct: 37  LEPSLIPHN-QGFFQFRSYTNIGPVRGFSSVNSSRRFVLLGFVLPENDPIIYQHHRQFYS 95

Query: 554 TRRSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
             RSFF RAKQ++KIEINDQH QRAV TALWCNFLVFSLKFGVWLA+S
Sbjct: 96  PYRSFFTRAKQIKKIEINDQHIQRAVTTALWCNFLVFSLKFGVWLATS 143


>gb|EOY26363.1| Cation efflux family protein isoform 2 [Theobroma cacao]
          Length = 377

 Score =  108 bits (269), Expect = 2e-21
 Identities = 62/108 (57%), Positives = 74/108 (68%), Gaps = 9/108 (8%)
 Frame = +2

Query: 401 LDPNFIKQNLSSKYLFGSFTQS---RGFLTSNSSKRFLLLGLVALDHQG----HHH--YI 553
           L+P+ I  N    + F S+T     RGF + NSS+RF+LLG V  ++      HH   Y 
Sbjct: 37  LEPSLIPHN-QGFFQFRSYTNIGPVRGFSSVNSSRRFVLLGFVLPENDPIIYQHHRQFYS 95

Query: 554 TRRSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
             RSFF RAKQ++KIEINDQH QRAV TALWCNFLVFSLKFGVWLA+S
Sbjct: 96  PYRSFFTRAKQIKKIEINDQHIQRAVTTALWCNFLVFSLKFGVWLATS 143


>gb|EOY26362.1| Cation efflux family protein isoform 1 [Theobroma cacao]
          Length = 459

 Score =  108 bits (269), Expect = 2e-21
 Identities = 62/108 (57%), Positives = 74/108 (68%), Gaps = 9/108 (8%)
 Frame = +2

Query: 401 LDPNFIKQNLSSKYLFGSFTQS---RGFLTSNSSKRFLLLGLVALDHQG----HHH--YI 553
           L+P+ I  N    + F S+T     RGF + NSS+RF+LLG V  ++      HH   Y 
Sbjct: 37  LEPSLIPHN-QGFFQFRSYTNIGPVRGFSSVNSSRRFVLLGFVLPENDPIIYQHHRQFYS 95

Query: 554 TRRSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
             RSFF RAKQ++KIEINDQH QRAV TALWCNFLVFSLKFGVWLA+S
Sbjct: 96  PYRSFFTRAKQIKKIEINDQHIQRAVTTALWCNFLVFSLKFGVWLATS 143


>ref|XP_004159008.1| PREDICTED: metal tolerance protein C4-like [Cucumis sativus]
          Length = 471

 Score =  103 bits (256), Expect = 6e-20
 Identities = 54/91 (59%), Positives = 65/91 (71%), Gaps = 6/91 (6%)
 Frame = +2

Query: 443 LFGSFTQSRGFLTSNS-SKRFLLLGLVALDHQG-----HHHYITRRSFFRRAKQVQKIEI 604
           L G++ QSR   + +S S+R +LLGL++LD        +HHY   R FF RAK VQ+IE 
Sbjct: 65  LIGTYFQSRRLSSPSSCSRRSVLLGLISLDSNPPRPPLNHHYAFHRGFFTRAKPVQRIEF 124

Query: 605 NDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           ND HSQRAV TALWCNFLVFSLKFGVW A+S
Sbjct: 125 NDYHSQRAVTTALWCNFLVFSLKFGVWFATS 155


>ref|XP_004141828.1| PREDICTED: metal tolerance protein C4-like [Cucumis sativus]
          Length = 449

 Score =  103 bits (256), Expect = 6e-20
 Identities = 54/91 (59%), Positives = 65/91 (71%), Gaps = 6/91 (6%)
 Frame = +2

Query: 443 LFGSFTQSRGFLTSNS-SKRFLLLGLVALDHQG-----HHHYITRRSFFRRAKQVQKIEI 604
           L G++ QSR   + +S S+R +LLGL++LD        +HHY   R FF RAK VQ+IE 
Sbjct: 43  LIGTYFQSRRLSSPSSCSRRSVLLGLISLDSNPPRPPLNHHYAFHRGFFTRAKPVQRIEF 102

Query: 605 NDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           ND HSQRAV TALWCNFLVFSLKFGVW A+S
Sbjct: 103 NDYHSQRAVTTALWCNFLVFSLKFGVWFATS 133


>ref|XP_004507015.1| PREDICTED: metal tolerance protein C4-like [Cicer arietinum]
          Length = 428

 Score = 93.6 bits (231), Expect = 5e-17
 Identities = 47/70 (67%), Positives = 53/70 (75%), Gaps = 1/70 (1%)
 Frame = +2

Query: 491 SKRFLLLGLVALDHQGH-HHYITRRSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFS 667
           S+RFL +G+ A  +  H  HY   RSFF RAK  Q IE ND+HSQRAV+TALWCNFLVFS
Sbjct: 46  SRRFLFIGIDARRNIHHSRHYSFHRSFFTRAKPAQIIEFNDRHSQRAVKTALWCNFLVFS 105

Query: 668 LKFGVWLASS 697
           LKFGVW ASS
Sbjct: 106 LKFGVWFASS 115


>ref|XP_006304879.1| hypothetical protein CARUB_v10012642mg [Capsella rubella]
           gi|482573590|gb|EOA37777.1| hypothetical protein
           CARUB_v10012642mg [Capsella rubella]
          Length = 457

 Score = 93.2 bits (230), Expect = 7e-17
 Identities = 53/88 (60%), Positives = 61/88 (69%), Gaps = 3/88 (3%)
 Frame = +2

Query: 443 LFGSFTQSRGFLTSNSSKRFLLLGL-VALDHQGH--HHYITRRSFFRRAKQVQKIEINDQ 613
           L  S  Q RGFL++N S + L +   V+LD        Y + R FF RAKQV++IEINDQ
Sbjct: 54  LIRSNPQLRGFLSTNCSNKGLGVRCSVSLDRDTPLIDSYSSHRGFFTRAKQVKRIEINDQ 113

Query: 614 HSQRAVRTALWCNFLVFSLKFGVWLASS 697
           HSQRAV TALWCNFLVFSLKFGVW  SS
Sbjct: 114 HSQRAVTTALWCNFLVFSLKFGVWWTSS 141


>ref|XP_002891647.1| hypothetical protein ARALYDRAFT_892136 [Arabidopsis lyrata subsp.
           lyrata] gi|297337489|gb|EFH67906.1| hypothetical protein
           ARALYDRAFT_892136 [Arabidopsis lyrata subsp. lyrata]
          Length = 451

 Score = 93.2 bits (230), Expect = 7e-17
 Identities = 50/98 (51%), Positives = 63/98 (64%)
 Frame = +2

Query: 404 DPNFIKQNLSSKYLFGSFTQSRGFLTSNSSKRFLLLGLVALDHQGHHHYITRRSFFRRAK 583
           D  F+  + S   L  S +  R F+++N S + L +     +      Y + R+FF RAK
Sbjct: 38  DKGFVDTHRSFSSLIHSNSHLRRFISTNCSNKGLGVRCSVSETPLIDTYSSHRNFFTRAK 97

Query: 584 QVQKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           QV++IEINDQHSQRAV TALWCNFLVFSLKFGVW  SS
Sbjct: 98  QVKRIEINDQHSQRAVTTALWCNFLVFSLKFGVWWTSS 135


>gb|ESQ30315.1| hypothetical protein EUTSA_v10011469mg [Eutrema salsugineum]
          Length = 452

 Score = 90.1 bits (222), Expect = 6e-16
 Identities = 54/96 (56%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
 Frame = +2

Query: 416 IKQNLSSKYLFGSFTQSRGFLTSNSSKRFLLLGLVALDHQGH--HHYITRRSFFRRAKQV 589
           ++ + S   L  S  Q RGFL   S K   +   V+LD        Y TRR+FF RAKQV
Sbjct: 42  VETHRSFSSLIRSNPQLRGFL---SRKGVGVRCSVSLDRDTPLIDSYSTRRNFFTRAKQV 98

Query: 590 QKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           ++IEI+DQHSQRAV TALWCNFLVFSLKFGVW  SS
Sbjct: 99  KRIEISDQHSQRAVTTALWCNFLVFSLKFGVWWTSS 134


>gb|ESQ30314.1| hypothetical protein EUTSA_v10011469mg [Eutrema salsugineum]
          Length = 375

 Score = 90.1 bits (222), Expect = 6e-16
 Identities = 54/96 (56%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
 Frame = +2

Query: 416 IKQNLSSKYLFGSFTQSRGFLTSNSSKRFLLLGLVALDHQGH--HHYITRRSFFRRAKQV 589
           ++ + S   L  S  Q RGFL   S K   +   V+LD        Y TRR+FF RAKQV
Sbjct: 42  VETHRSFSSLIRSNPQLRGFL---SRKGVGVRCSVSLDRDTPLIDSYSTRRNFFTRAKQV 98

Query: 590 QKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           ++IEI+DQHSQRAV TALWCNFLVFSLKFGVW  SS
Sbjct: 99  KRIEISDQHSQRAVTTALWCNFLVFSLKFGVWWTSS 134


>emb|CBI18478.3| unnamed protein product [Vitis vinifera]
          Length = 222

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 42/53 (79%), Positives = 47/53 (88%)
 Frame = +2

Query: 539 HHHYITRRSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           + HYI R +FF RAK+V+ IEINDQHSQRAVRTALWCNFLVFSLKFGVWL +S
Sbjct: 92  NRHYINR-NFFTRAKEVKNIEINDQHSQRAVRTALWCNFLVFSLKFGVWLTTS 143


>ref|XP_002262765.1| PREDICTED: metal tolerance protein C4 [Vitis vinifera]
           gi|297735796|emb|CBI18483.3| unnamed protein product
           [Vitis vinifera]
          Length = 459

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 42/53 (79%), Positives = 47/53 (88%)
 Frame = +2

Query: 539 HHHYITRRSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           + HYI R +FF RAK+V+ IEINDQHSQRAVRTALWCNFLVFSLKFGVWL +S
Sbjct: 92  NRHYINR-NFFTRAKEVKNIEINDQHSQRAVRTALWCNFLVFSLKFGVWLTTS 143


>gb|AAK92793.1| unknown protein [Arabidopsis thaliana]
          Length = 457

 Score = 87.8 bits (216), Expect = 3e-15
 Identities = 49/88 (55%), Positives = 61/88 (69%), Gaps = 3/88 (3%)
 Frame = +2

Query: 443 LFGSFTQSRGFLTSNSSKRFLLLGL-VALDHQGH--HHYITRRSFFRRAKQVQKIEINDQ 613
           L  S +  RG +++N   + L +   V+LD +      Y + R+FF RAKQV++IEINDQ
Sbjct: 52  LIRSSSHVRGLISTNCLNKGLGVRCSVSLDRETPLIDTYSSHRNFFTRAKQVKRIEINDQ 111

Query: 614 HSQRAVRTALWCNFLVFSLKFGVWLASS 697
           HSQRAV TALWCNFLVFSLKFGVW  SS
Sbjct: 112 HSQRAVTTALWCNFLVFSLKFGVWWTSS 139


>gb|AAG50870.1|AC025294_8 hypothetical protein [Arabidopsis thaliana]
           gi|12325358|gb|AAG52617.1|AC024261_4 unknown protein;
           4121-1125 [Arabidopsis thaliana]
          Length = 423

 Score = 87.8 bits (216), Expect = 3e-15
 Identities = 49/88 (55%), Positives = 61/88 (69%), Gaps = 3/88 (3%)
 Frame = +2

Query: 443 LFGSFTQSRGFLTSNSSKRFLLLGL-VALDHQGH--HHYITRRSFFRRAKQVQKIEINDQ 613
           L  S +  RG +++N   + L +   V+LD +      Y + R+FF RAKQV++IEINDQ
Sbjct: 52  LIRSSSHVRGLISTNCLNKGLGVRCSVSLDRETPLIDTYSSHRNFFTRAKQVKRIEINDQ 111

Query: 614 HSQRAVRTALWCNFLVFSLKFGVWLASS 697
           HSQRAV TALWCNFLVFSLKFGVW  SS
Sbjct: 112 HSQRAVTTALWCNFLVFSLKFGVWWTSS 139


>ref|NP_564594.1| metal tolerance protein C4 [Arabidopsis thaliana]
           gi|71151966|sp|Q8H1G3.1|MTPC4_ARATH RecName: Full=Metal
           tolerance protein C4; Short=AtMTPc4; AltName:
           Full=AtMTP7 gi|23297266|gb|AAN12928.1| unknown protein
           [Arabidopsis thaliana] gi|332194569|gb|AEE32690.1| metal
           tolerance protein C4 [Arabidopsis thaliana]
          Length = 457

 Score = 87.8 bits (216), Expect = 3e-15
 Identities = 49/88 (55%), Positives = 61/88 (69%), Gaps = 3/88 (3%)
 Frame = +2

Query: 443 LFGSFTQSRGFLTSNSSKRFLLLGL-VALDHQGH--HHYITRRSFFRRAKQVQKIEINDQ 613
           L  S +  RG +++N   + L +   V+LD +      Y + R+FF RAKQV++IEINDQ
Sbjct: 52  LIRSSSHVRGLISTNCLNKGLGVRCSVSLDRETPLIDTYSSHRNFFTRAKQVKRIEINDQ 111

Query: 614 HSQRAVRTALWCNFLVFSLKFGVWLASS 697
           HSQRAV TALWCNFLVFSLKFGVW  SS
Sbjct: 112 HSQRAVTTALWCNFLVFSLKFGVWWTSS 139


>ref|XP_003530162.1| PREDICTED: metal tolerance protein C4-like [Glycine max]
          Length = 425

 Score = 83.2 bits (204), Expect = 7e-14
 Identities = 46/70 (65%), Positives = 48/70 (68%)
 Frame = +2

Query: 488 SSKRFLLLGLVALDHQGHHHYITRRSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFS 667
           S   FLL G  A      HH    RSFF RAK    IE ND+HSQRAV+TALWCNFLVFS
Sbjct: 43  SHSLFLLHGFNA------HH----RSFFTRAKPATNIEFNDRHSQRAVKTALWCNFLVFS 92

Query: 668 LKFGVWLASS 697
           LKFGVWLASS
Sbjct: 93  LKFGVWLASS 102


>gb|ESW14120.1| hypothetical protein PHAVU_008G2549001g, partial [Phaseolus
           vulgaris]
          Length = 338

 Score = 82.8 bits (203), Expect = 9e-14
 Identities = 38/46 (82%), Positives = 40/46 (86%)
 Frame = +2

Query: 560 RSFFRRAKQVQKIEINDQHSQRAVRTALWCNFLVFSLKFGVWLASS 697
           RSFF RAK  Q IE ND+HSQRAV+TALWCNFLVFSLKFGVWL SS
Sbjct: 61  RSFFTRAKPAQNIEFNDRHSQRAVKTALWCNFLVFSLKFGVWLTSS 106

