BLASTX nr result

ID: Jatropha_contig00041031

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00041031
         (474 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOX91373.1| C-terminal LisH motif isoform 1 [Theobroma cacao]      112   6e-23
gb|EOX91374.1| C-terminal LisH motif isoform 2 [Theobroma cacao]      109   3e-22
emb|CBI21442.3| unnamed protein product [Vitis vinifera]              105   5e-21
ref|XP_002282018.1| PREDICTED: uncharacterized protein LOC100244...   105   5e-21
gb|ESR39101.1| hypothetical protein CICLE_v10025029mg [Citrus cl...   105   8e-21
gb|EMJ05826.1| hypothetical protein PRUPE_ppa002130mg [Prunus pe...   103   3e-20
gb|EEE94256.2| hypothetical protein POPTR_0005s18060g [Populus t...   102   4e-20
ref|XP_004172719.1| PREDICTED: uncharacterized LOC101218546, par...   102   4e-20
ref|XP_004142009.1| PREDICTED: uncharacterized protein LOC101218...   102   4e-20
ref|XP_002307260.1| predicted protein [Populus trichocarpa]           102   4e-20
ref|XP_002523556.1| conserved hypothetical protein [Ricinus comm...   102   5e-20
ref|XP_004288051.1| PREDICTED: uncharacterized protein LOC101299...   101   8e-20
ref|XP_002865049.1| hypothetical protein ARALYDRAFT_496919 [Arab...   100   2e-19
ref|XP_006339948.1| PREDICTED: uncharacterized protein LOC102581...   100   2e-19
ref|XP_004232041.1| PREDICTED: uncharacterized protein LOC101246...    99   4e-19
ref|NP_201482.2| uncharacterized protein [Arabidopsis thaliana] ...    99   6e-19
dbj|BAB08623.1| unnamed protein product [Arabidopsis thaliana]         99   6e-19
dbj|BAF00693.1| hypothetical protein [Arabidopsis thaliana]            99   6e-19
gb|ESQ31162.1| hypothetical protein EUTSA_v10003742mg [Eutrema s...    94   2e-17
ref|XP_006280114.1| hypothetical protein CARUB_v10026006mg [Caps...    94   2e-17

>gb|EOX91373.1| C-terminal LisH motif isoform 1 [Theobroma cacao]
          Length = 782

 Score =  112 bits (279), Expect = 6e-23
 Identities = 66/123 (53%), Positives = 75/123 (60%), Gaps = 12/123 (9%)
 Frame = +1

Query: 142 KNFPQE--KEMDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXX 315
           K FP+E  K MDS+PV+WEALD+L++DFAKSENLIED                       
Sbjct: 59  KTFPRELTKHMDSSPVNWEALDALILDFAKSENLIEDSSPPSSPSLTSPSSPSLSSSSYR 118

Query: 316 XXRLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ----------VCFLC 465
             RL+IRQIRR LE G+ID AID L  HAPFILDDHR LFRLQKQ            FLC
Sbjct: 119 S-RLIIRQIRRLLEAGDIDAAIDLLGAHAPFILDDHRFLFRLQKQFEFASVIEFFFFFLC 177

Query: 466 FLL 474
           F+L
Sbjct: 178 FVL 180


>gb|EOX91374.1| C-terminal LisH motif isoform 2 [Theobroma cacao]
          Length = 569

 Score =  109 bits (273), Expect = 3e-22
 Identities = 61/105 (58%), Positives = 69/105 (65%), Gaps = 2/105 (1%)
 Frame = +1

Query: 142 KNFPQE--KEMDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXX 315
           K FP+E  K MDS+PV+WEALD+L++DFAKSENLIED                       
Sbjct: 59  KTFPRELTKHMDSSPVNWEALDALILDFAKSENLIEDSSPPSSPSLTSPSSPSLSSSSYR 118

Query: 316 XXRLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
             RL+IRQIRR LE G+ID AID L  HAPFILDDHR LFRLQKQ
Sbjct: 119 S-RLIIRQIRRLLEAGDIDAAIDLLGAHAPFILDDHRFLFRLQKQ 162


>emb|CBI21442.3| unnamed protein product [Vitis vinifera]
          Length = 710

 Score =  105 bits (263), Expect = 5e-21
 Identities = 57/95 (60%), Positives = 63/95 (66%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXXRLVIRQIR 345
           MDS PV+WEALD+L++DFAKSENLIED                         RL+IRQIR
Sbjct: 1   MDSMPVNWEALDTLIIDFAKSENLIEDSVTCTSSSSPSSSPSSSSYHQ----RLIIRQIR 56

Query: 346 RCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           R LEVG+ID A D LR HAPFILDDHR LFRLQKQ
Sbjct: 57  RSLEVGDIDAATDLLRVHAPFILDDHRFLFRLQKQ 91


>ref|XP_002282018.1| PREDICTED: uncharacterized protein LOC100244129 [Vitis vinifera]
          Length = 690

 Score =  105 bits (263), Expect = 5e-21
 Identities = 57/95 (60%), Positives = 63/95 (66%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXXRLVIRQIR 345
           MDS PV+WEALD+L++DFAKSENLIED                         RL+IRQIR
Sbjct: 1   MDSMPVNWEALDTLIIDFAKSENLIEDSVTCTSSSSPSSSPSSSSYHQ----RLIIRQIR 56

Query: 346 RCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           R LEVG+ID A D LR HAPFILDDHR LFRLQKQ
Sbjct: 57  RSLEVGDIDAATDLLRVHAPFILDDHRFLFRLQKQ 91


>gb|ESR39101.1| hypothetical protein CICLE_v10025029mg [Citrus clementina]
          Length = 707

 Score =  105 bits (261), Expect = 8e-21
 Identities = 57/97 (58%), Positives = 65/97 (67%), Gaps = 2/97 (2%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXX--RLVIRQ 339
           M+STPV+WEALD+L+++FAKSENLIED                           RL+IRQ
Sbjct: 1   MESTPVNWEALDALILEFAKSENLIEDSIVSSPPSSPSSSSTSSVSLSSSSYHSRLIIRQ 60

Query: 340 IRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           IRR LE G+ID AID LR HAPFILDDHRLLFRLQKQ
Sbjct: 61  IRRSLEYGDIDAAIDLLRAHAPFILDDHRLLFRLQKQ 97


>gb|EMJ05826.1| hypothetical protein PRUPE_ppa002130mg [Prunus persica]
          Length = 712

 Score =  103 bits (256), Expect = 3e-20
 Identities = 54/99 (54%), Positives = 65/99 (65%), Gaps = 4/99 (4%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXX----RLVI 333
           MD+TP++WEALD+L++DFAKSE L+ED                             RL+I
Sbjct: 1   MDTTPINWEALDALIIDFAKSEKLVEDSSFTTSSSPPSSPPSSSSPSSISSSTYHSRLII 60

Query: 334 RQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           RQIRR LE G+ID AID LR+HAPFIL+DHRLLFRLQKQ
Sbjct: 61  RQIRRLLEAGDIDAAIDLLRSHAPFILEDHRLLFRLQKQ 99


>gb|EEE94256.2| hypothetical protein POPTR_0005s18060g [Populus trichocarpa]
          Length = 770

 Score =  102 bits (255), Expect = 4e-20
 Identities = 57/107 (53%), Positives = 70/107 (65%), Gaps = 5/107 (4%)
 Frame = +1

Query: 145 NFPQ-----EKEMDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXX 309
           NFP+     ++ MDSTPV+WEALD L++DFAKSENLI+D                     
Sbjct: 73  NFPEKIKTKQQIMDSTPVNWEALDRLILDFAKSENLIDDSASTSIISSPSSSPPSFSSSY 132

Query: 310 XXXXRLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
               R +IRQIRR LE G+ID+++  LR+HAPFILDDHRLLFRLQKQ
Sbjct: 133 QS--RFIIRQIRRFLESGDIDSSLHLLRSHAPFILDDHRLLFRLQKQ 177


>ref|XP_004172719.1| PREDICTED: uncharacterized LOC101218546, partial [Cucumis sativus]
          Length = 602

 Score =  102 bits (255), Expect = 4e-20
 Identities = 55/93 (59%), Positives = 63/93 (67%)
 Frame = +1

Query: 172 STPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXXRLVIRQIRRC 351
           STP++WEALD+L++DFA+SENLIED                         RL+IRQIRR 
Sbjct: 5   STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHS-----RLIIRQIRRS 59

Query: 352 LEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           LE G ID+AID LR HAPFILDDHRLLFRLQKQ
Sbjct: 60  LEAGHIDSAIDLLRLHAPFILDDHRLLFRLQKQ 92


>ref|XP_004142009.1| PREDICTED: uncharacterized protein LOC101218546 [Cucumis sativus]
          Length = 681

 Score =  102 bits (255), Expect = 4e-20
 Identities = 55/93 (59%), Positives = 63/93 (67%)
 Frame = +1

Query: 172 STPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXXRLVIRQIRRC 351
           STP++WEALD+L++DFA+SENLIED                         RL+IRQIRR 
Sbjct: 5   STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHS-----RLIIRQIRRS 59

Query: 352 LEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           LE G ID+AID LR HAPFILDDHRLLFRLQKQ
Sbjct: 60  LEAGHIDSAIDLLRLHAPFILDDHRLLFRLQKQ 92


>ref|XP_002307260.1| predicted protein [Populus trichocarpa]
          Length = 770

 Score =  102 bits (255), Expect = 4e-20
 Identities = 57/107 (53%), Positives = 70/107 (65%), Gaps = 5/107 (4%)
 Frame = +1

Query: 145 NFPQ-----EKEMDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXX 309
           NFP+     ++ MDSTPV+WEALD L++DFAKSENLI+D                     
Sbjct: 73  NFPEKIKTKQQIMDSTPVNWEALDRLILDFAKSENLIDDSASTSIISSPSSSPPSFSSSY 132

Query: 310 XXXXRLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
               R +IRQIRR LE G+ID+++  LR+HAPFILDDHRLLFRLQKQ
Sbjct: 133 QS--RFIIRQIRRFLESGDIDSSLHLLRSHAPFILDDHRLLFRLQKQ 177


>ref|XP_002523556.1| conserved hypothetical protein [Ricinus communis]
           gi|223537118|gb|EEF38751.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 685

 Score =  102 bits (254), Expect = 5e-20
 Identities = 53/95 (55%), Positives = 63/95 (66%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXXRLVIRQIR 345
           M++TPV+WEALD L+++FAKSE LIED                         RL+IRQIR
Sbjct: 1   METTPVNWEALDRLIIEFAKSEKLIEDSFSSPLSSPSPSSSSSSVSSSSYHSRLIIRQIR 60

Query: 346 RCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           R LE G+IDT I+ L +HAPFILDDHRLLFRLQKQ
Sbjct: 61  RFLESGDIDTTIELLGSHAPFILDDHRLLFRLQKQ 95


>ref|XP_004288051.1| PREDICTED: uncharacterized protein LOC101299124 [Fragaria vesca
           subsp. vesca]
          Length = 671

 Score =  101 bits (252), Expect = 8e-20
 Identities = 52/95 (54%), Positives = 63/95 (66%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXXRLVIRQIR 345
           MDS P++WE LD+L++DFAKSENLIED                         R +IR+IR
Sbjct: 1   MDSMPINWETLDALIIDFAKSENLIEDSSPTPPSSPSSVSSSSYHS------RRIIRRIR 54

Query: 346 RCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           R LE G+ID A+D LR+HAPF+LDDHRLLFRLQKQ
Sbjct: 55  RSLEAGDIDAAVDLLRSHAPFVLDDHRLLFRLQKQ 89


>ref|XP_002865049.1| hypothetical protein ARALYDRAFT_496919 [Arabidopsis lyrata subsp.
           lyrata] gi|297310884|gb|EFH41308.1| hypothetical protein
           ARALYDRAFT_496919 [Arabidopsis lyrata subsp. lyrata]
          Length = 745

 Score =  100 bits (249), Expect = 2e-19
 Identities = 52/104 (50%), Positives = 66/104 (63%), Gaps = 5/104 (4%)
 Frame = +1

Query: 154 QEKEMDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXX---- 321
           +E  MDSTPV+WEALD+L++DF  SENL+ED                             
Sbjct: 48  RETTMDSTPVNWEALDALIIDFVSSENLVEDDAAAANSSPSPLSSPSSSCSPSISSSSYH 107

Query: 322 -RLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
            RL+IR+IR  +E G+I+TAID LR+HAPF+LDDHR+LFRLQKQ
Sbjct: 108 SRLIIRRIRNSIESGDIETAIDILRSHAPFVLDDHRILFRLQKQ 151


>ref|XP_006339948.1| PREDICTED: uncharacterized protein LOC102581578 [Solanum tuberosum]
          Length = 719

 Score =  100 bits (248), Expect = 2e-19
 Identities = 55/98 (56%), Positives = 62/98 (63%), Gaps = 3/98 (3%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXX---RLVIR 336
           MDS PV+WEALDSL++DF KSENLIED                            RL+IR
Sbjct: 13  MDSLPVNWEALDSLIIDFVKSENLIEDSGSPSTSPSTSLSPSSSTSSSSSSSYQSRLLIR 72

Query: 337 QIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           QIRR +E G+ID AID LR HAPF+LDDHRLLF LQKQ
Sbjct: 73  QIRRSVEFGDIDAAIDLLRLHAPFVLDDHRLLFCLQKQ 110


>ref|XP_004232041.1| PREDICTED: uncharacterized protein LOC101246489, partial [Solanum
           lycopersicum]
          Length = 716

 Score = 99.4 bits (246), Expect = 4e-19
 Identities = 54/98 (55%), Positives = 62/98 (63%), Gaps = 3/98 (3%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXX---RLVIR 336
           MDS PV+WEALD+L++DF KSENLIED                            RL+IR
Sbjct: 8   MDSLPVNWEALDTLIIDFVKSENLIEDSGSPSTSPSTSLSPSSSTSSSSSSSYQSRLLIR 67

Query: 337 QIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           QIRR +E G+ID AID LR HAPF+LDDHRLLF LQKQ
Sbjct: 68  QIRRLVEFGDIDAAIDLLRVHAPFVLDDHRLLFCLQKQ 105


>ref|NP_201482.2| uncharacterized protein [Arabidopsis thaliana]
           gi|332010882|gb|AED98265.1| uncharacterized protein
           AT5G66810 [Arabidopsis thaliana]
          Length = 750

 Score = 99.0 bits (245), Expect = 6e-19
 Identities = 53/110 (48%), Positives = 70/110 (63%), Gaps = 7/110 (6%)
 Frame = +1

Query: 142 KNFPQEKE--MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXX 315
           K  P++++  MDSTPV+WEALD+L++DF  SENL+ED                       
Sbjct: 47  KEEPRKRKATMDSTPVNWEALDALIIDFVSSENLVEDAAAAVNSPPSPLSSPSSSSSPSI 106

Query: 316 XX-----RLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
                  RL+IR+IR  +E G+I+TAID LR+HAPF+LDDHR+LFRLQKQ
Sbjct: 107 SSSSYHSRLIIRRIRSSIESGDIETAIDILRSHAPFVLDDHRILFRLQKQ 156


>dbj|BAB08623.1| unnamed protein product [Arabidopsis thaliana]
          Length = 752

 Score = 99.0 bits (245), Expect = 6e-19
 Identities = 53/110 (48%), Positives = 70/110 (63%), Gaps = 7/110 (6%)
 Frame = +1

Query: 142 KNFPQEKE--MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXX 315
           K  P++++  MDSTPV+WEALD+L++DF  SENL+ED                       
Sbjct: 47  KEEPRKRKATMDSTPVNWEALDALIIDFVSSENLVEDAAAAVNSPPSPLSSPSSSSSPSI 106

Query: 316 XX-----RLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
                  RL+IR+IR  +E G+I+TAID LR+HAPF+LDDHR+LFRLQKQ
Sbjct: 107 SSSSYHSRLIIRRIRSSIESGDIETAIDILRSHAPFVLDDHRILFRLQKQ 156


>dbj|BAF00693.1| hypothetical protein [Arabidopsis thaliana]
          Length = 732

 Score = 99.0 bits (245), Expect = 6e-19
 Identities = 53/110 (48%), Positives = 70/110 (63%), Gaps = 7/110 (6%)
 Frame = +1

Query: 142 KNFPQEKE--MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXX 315
           K  P++++  MDSTPV+WEALD+L++DF  SENL+ED                       
Sbjct: 29  KEEPRKRKATMDSTPVNWEALDALIIDFVSSENLVEDAAAAVNSPPSPLSSPSSSSSPSI 88

Query: 316 XX-----RLVIRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
                  RL+IR+IR  +E G+I+TAID LR+HAPF+LDDHR+LFRLQKQ
Sbjct: 89  SSSSYHSRLIIRRIRSSIESGDIETAIDILRSHAPFVLDDHRILFRLQKQ 138


>gb|ESQ31162.1| hypothetical protein EUTSA_v10003742mg [Eutrema salsugineum]
          Length = 694

 Score = 93.6 bits (231), Expect = 2e-17
 Identities = 47/97 (48%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLI--EDXXXXXXXXXXXXXXXXXXXXXXXXXRLVIRQ 339
           MDSTPV+WEALD+L++DF  SENL+  ED                         RL+IR+
Sbjct: 1   MDSTPVNWEALDALIIDFVSSENLVVEEDTCANSSQSPLSSPSSPSISSSSYHARLIIRR 60

Query: 340 IRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           IR  +E G+I+ A+D +R++APF+LDDHR+LFRLQKQ
Sbjct: 61  IRNSIESGDIEAAMDIIRSNAPFVLDDHRILFRLQKQ 97


>ref|XP_006280114.1| hypothetical protein CARUB_v10026006mg [Capsella rubella]
           gi|482548818|gb|EOA13012.1| hypothetical protein
           CARUB_v10026006mg [Capsella rubella]
          Length = 686

 Score = 93.6 bits (231), Expect = 2e-17
 Identities = 48/100 (48%), Positives = 62/100 (62%), Gaps = 5/100 (5%)
 Frame = +1

Query: 166 MDSTPVSWEALDSLVVDFAKSENLIEDXXXXXXXXXXXXXXXXXXXXXXXXX-----RLV 330
           MDSTPV+WEALD+L++DF  SENL+E                               RL+
Sbjct: 1   MDSTPVNWEALDALIIDFVSSENLVEAAAAAAAANSSSSPSSSSPSSPSISSSSYHSRLI 60

Query: 331 IRQIRRCLEVGEIDTAIDPLRTHAPFILDDHRLLFRLQKQ 450
           I ++R  +E G+I+TAID LR+HAPF+LDDHR+LFRLQKQ
Sbjct: 61  IHRVRNSIESGDIETAIDILRSHAPFVLDDHRILFRLQKQ 100

