BLASTX nr result

ID: Jatropha_contig00001022 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00001022
         (576 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002515547.1| tso1, putative [Ricinus communis] gi|2235454...   159   6e-37
gb|EEE93134.2| cysteine-rich polycomb-like family protein [Popul...   125   9e-27
ref|XP_002309611.1| predicted protein [Populus trichocarpa]           125   9e-27
gb|ESR63721.1| hypothetical protein CICLE_v10007369mg [Citrus cl...   120   2e-25
gb|EOY31448.1| Tesmin/TSO1-like CXC domain-containing protein, p...   111   1e-22
gb|EOY31447.1| Tesmin/TSO1-like CXC domain-containing protein, p...   111   1e-22
gb|EOY31444.1| Tesmin/TSO1-like CXC domain-containing protein, p...   111   1e-22
gb|EOY31443.1| Tesmin/TSO1-like CXC domain-containing protein, p...   111   1e-22
ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261...   100   2e-19
emb|CBI21012.3| unnamed protein product [Vitis vinifera]              100   2e-19
ref|XP_006364287.1| PREDICTED: CRC domain-containing protein TSO...    79   1e-12
ref|XP_002324849.1| predicted protein [Populus trichocarpa]            67   2e-09
gb|EMJ26483.1| hypothetical protein PRUPE_ppa001375mg [Prunus pe...    64   2e-08
gb|ERP49533.1| hypothetical protein POPTR_0018s01480g [Populus t...    64   3e-08
ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine...    64   3e-08
emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japon...    57   3e-06

>ref|XP_002515547.1| tso1, putative [Ricinus communis] gi|223545491|gb|EEF46996.1| tso1,
           putative [Ricinus communis]
          Length = 873

 Score =  159 bits (401), Expect = 6e-37
 Identities = 89/146 (60%), Positives = 102/146 (69%)
 Frame = +3

Query: 87  NLEATLQGDIQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGS 266
           N  AT     QN+ + D EA QLQRG SRRCLQFEEA++KI +N   S + T NV   GS
Sbjct: 241 NAAATTHRANQNIGQRDPEAGQLQRGMSRRCLQFEEARQKITLNRIHSTDPTNNVNGSGS 300

Query: 267 PASAADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLN 446
           PAS  +LE+LD S++ I   S KK MINLS   TSMFP R SGKS +V SKPSGIGLHLN
Sbjct: 301 PASTTELENLDSSYIEIAAYSHKK-MINLSEPTTSMFP-RFSGKSLVVVSKPSGIGLHLN 358

Query: 447 SIVKALPMGHTATASMKSTPIMSFHQ 524
           SIV A+PMGH+   S KSTPIMS HQ
Sbjct: 359 SIVTAMPMGHSGAESNKSTPIMSCHQ 384


>gb|EEE93134.2| cysteine-rich polycomb-like family protein [Populus trichocarpa]
          Length = 847

 Score =  125 bits (313), Expect = 9e-27
 Identities = 77/147 (52%), Positives = 91/147 (61%)
 Frame = +3

Query: 114 IQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASAADLES 293
           IQ +    SE SQLQRG SRRCLQFE+AQ++   + T SPN   N+    SPAS+ +LE 
Sbjct: 260 IQLITYNGSEVSQLQRGMSRRCLQFEQAQQETTKDGTYSPNPAINLFGSISPASSTELEI 319

Query: 294 LDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVKALPMG 473
           LD S V +T SS K+Q      TM++MF    SGK P+  SKPSGIGLHLNSIV  LPMG
Sbjct: 320 LDSSQVELTISSHKEQ------TMSAMFSANISGKCPVAVSKPSGIGLHLNSIVNTLPMG 373

Query: 474 HTATASMKSTPIMSFHQVDYHNSSLKL 554
             A     S PIMS H V+   S  KL
Sbjct: 374 SGA-----SGPIMSHHLVENKISCSKL 395


>ref|XP_002309611.1| predicted protein [Populus trichocarpa]
          Length = 847

 Score =  125 bits (313), Expect = 9e-27
 Identities = 77/147 (52%), Positives = 91/147 (61%)
 Frame = +3

Query: 114 IQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASAADLES 293
           IQ +    SE SQLQRG SRRCLQFE+AQ++   + T SPN   N+    SPAS+ +LE 
Sbjct: 260 IQLITYNGSEVSQLQRGMSRRCLQFEQAQQETTKDGTYSPNPAINLFGSISPASSTELEI 319

Query: 294 LDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVKALPMG 473
           LD S V +T SS K+Q      TM++MF    SGK P+  SKPSGIGLHLNSIV  LPMG
Sbjct: 320 LDSSQVELTISSHKEQ------TMSAMFSANISGKCPVAVSKPSGIGLHLNSIVNTLPMG 373

Query: 474 HTATASMKSTPIMSFHQVDYHNSSLKL 554
             A     S PIMS H V+   S  KL
Sbjct: 374 SGA-----SGPIMSHHLVENKISCSKL 395


>gb|ESR63721.1| hypothetical protein CICLE_v10007369mg [Citrus clementina]
          Length = 952

 Score =  120 bits (302), Expect = 2e-25
 Identities = 71/159 (44%), Positives = 94/159 (59%), Gaps = 1/159 (0%)
 Frame = +3

Query: 9   CPRTTLLSNMM-IQRFRWFRHVNSIVANLEATLQGDIQNVAKCDSEASQLQRGFSRRCLQ 185
           CP  +L   +  IQ +  FR       N  A L G   N    D EA + QRG SRRCLQ
Sbjct: 272 CPSQSLREPLQTIQTYEDFRE------NAGAILYGPHDNPMH-DPEAGKHQRGMSRRCLQ 324

Query: 186 FEEAQKKIIVNSTSSPNLTKNVTSLGSPASAADLESLDPSHVGITESSQKKQMINLSRTM 365
           FEEAQ K+ V S++  N   +VTS   P +  + ES DPSHV +  +S K+Q+ +L   +
Sbjct: 325 FEEAQLKVTVCSSNPSNKLNDVTSSQLPTTPVESESPDPSHVDLNITSGKRQLASLPHPV 384

Query: 366 TSMFPLRRSGKSPIVASKPSGIGLHLNSIVKALPMGHTA 482
           T +FP   +GKSP+  SKPSGIGLHLN+++K+ P GH A
Sbjct: 385 TPVFPPHHTGKSPLTVSKPSGIGLHLNNLIKSSPEGHGA 423


>gb|EOY31448.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 6
           [Theobroma cacao] gi|508784194|gb|EOY31450.1|
           Tesmin/TSO1-like CXC domain-containing protein, putative
           isoform 6 [Theobroma cacao]
          Length = 668

 Score =  111 bits (277), Expect = 1e-22
 Identities = 62/137 (45%), Positives = 87/137 (63%)
 Frame = +3

Query: 87  NLEATLQGDIQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGS 266
           N E   +  + ++   D EAS+ QRG SRRCLQF +AQ +   N +SS +L  ++ +  S
Sbjct: 286 NFEGVAEVTVDSMTN-DLEASEHQRGMSRRCLQFGDAQPEATANCSSS-SLANDMITSRS 343

Query: 267 PASAADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLN 446
            A+ ++ E L  SHV ++  S+K+Q++NLS+   +M P     KS +  SKPSGIGLHLN
Sbjct: 344 VATTSETEGLGLSHVDLSVISRKRQLVNLSQLAINMIPQHYGEKSSLTVSKPSGIGLHLN 403

Query: 447 SIVKALPMGHTATASMK 497
           SIV A+PMG   TASMK
Sbjct: 404 SIVNAIPMGRGGTASMK 420


>gb|EOY31447.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 5
           [Theobroma cacao] gi|508784193|gb|EOY31449.1|
           Tesmin/TSO1-like CXC domain-containing protein, putative
           isoform 5 [Theobroma cacao]
          Length = 667

 Score =  111 bits (277), Expect = 1e-22
 Identities = 62/137 (45%), Positives = 87/137 (63%)
 Frame = +3

Query: 87  NLEATLQGDIQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGS 266
           N E   +  + ++   D EAS+ QRG SRRCLQF +AQ +   N +SS +L  ++ +  S
Sbjct: 286 NFEGVAEVTVDSMTN-DLEASEHQRGMSRRCLQFGDAQPEATANCSSS-SLANDMITSRS 343

Query: 267 PASAADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLN 446
            A+ ++ E L  SHV ++  S+K+Q++NLS+   +M P     KS +  SKPSGIGLHLN
Sbjct: 344 VATTSETEGLGLSHVDLSVISRKRQLVNLSQLAINMIPQHYGEKSSLTVSKPSGIGLHLN 403

Query: 447 SIVKALPMGHTATASMK 497
           SIV A+PMG   TASMK
Sbjct: 404 SIVNAIPMGRGGTASMK 420


>gb|EOY31444.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 2
           [Theobroma cacao]
          Length = 704

 Score =  111 bits (277), Expect = 1e-22
 Identities = 62/137 (45%), Positives = 87/137 (63%)
 Frame = +3

Query: 87  NLEATLQGDIQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGS 266
           N E   +  + ++   D EAS+ QRG SRRCLQF +AQ +   N +SS +L  ++ +  S
Sbjct: 50  NFEGVAEVTVDSMTN-DLEASEHQRGMSRRCLQFGDAQPEATANCSSS-SLANDMITSRS 107

Query: 267 PASAADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLN 446
            A+ ++ E L  SHV ++  S+K+Q++NLS+   +M P     KS +  SKPSGIGLHLN
Sbjct: 108 VATTSETEGLGLSHVDLSVISRKRQLVNLSQLAINMIPQHYGEKSSLTVSKPSGIGLHLN 167

Query: 447 SIVKALPMGHTATASMK 497
           SIV A+PMG   TASMK
Sbjct: 168 SIVNAIPMGRGGTASMK 184


>gb|EOY31443.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 1
           [Theobroma cacao] gi|508784189|gb|EOY31445.1|
           Tesmin/TSO1-like CXC domain-containing protein, putative
           isoform 1 [Theobroma cacao] gi|508784190|gb|EOY31446.1|
           Tesmin/TSO1-like CXC domain-containing protein, putative
           isoform 1 [Theobroma cacao]
          Length = 940

 Score =  111 bits (277), Expect = 1e-22
 Identities = 62/137 (45%), Positives = 87/137 (63%)
 Frame = +3

Query: 87  NLEATLQGDIQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGS 266
           N E   +  + ++   D EAS+ QRG SRRCLQF +AQ +   N +SS +L  ++ +  S
Sbjct: 286 NFEGVAEVTVDSMTN-DLEASEHQRGMSRRCLQFGDAQPEATANCSSS-SLANDMITSRS 343

Query: 267 PASAADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLN 446
            A+ ++ E L  SHV ++  S+K+Q++NLS+   +M P     KS +  SKPSGIGLHLN
Sbjct: 344 VATTSETEGLGLSHVDLSVISRKRQLVNLSQLAINMIPQHYGEKSSLTVSKPSGIGLHLN 403

Query: 447 SIVKALPMGHTATASMK 497
           SIV A+PMG   TASMK
Sbjct: 404 SIVNAIPMGRGGTASMK 420


>ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
          Length = 1001

 Score =  100 bits (249), Expect = 2e-19
 Identities = 61/133 (45%), Positives = 80/133 (60%), Gaps = 3/133 (2%)
 Frame = +3

Query: 108 GDIQNVAKCDSEASQL---QRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASA 278
           G ++N    D EASQ    QRG  RRCLQF EAQ  II N+ S    T    +    A+ 
Sbjct: 328 GPVENTVLHDPEASQQTQHQRGMLRRCLQFGEAQLNIITNNPSFSYPTSIAANSRLLATP 387

Query: 279 ADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVK 458
           +D E  + S V +T +S  KQ+   S+ +T+M P + SG S +  SKPSGIGLHLNSIV 
Sbjct: 388 SDSELPESSCVDLTTTSSNKQLA-YSQPVTAMLPPQNSGNSSLAGSKPSGIGLHLNSIVN 446

Query: 459 ALPMGHTATASMK 497
           A+PMG ++T S+K
Sbjct: 447 AVPMGFSSTTSLK 459


>emb|CBI21012.3| unnamed protein product [Vitis vinifera]
          Length = 1094

 Score =  100 bits (249), Expect = 2e-19
 Identities = 61/133 (45%), Positives = 80/133 (60%), Gaps = 3/133 (2%)
 Frame = +3

Query: 108 GDIQNVAKCDSEASQL---QRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASA 278
           G ++N    D EASQ    QRG  RRCLQF EAQ  II N+ S    T    +    A+ 
Sbjct: 357 GPVENTVLHDPEASQQTQHQRGMLRRCLQFGEAQLNIITNNPSFSYPTSIAANSRLLATP 416

Query: 279 ADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVK 458
           +D E  + S V +T +S  KQ+   S+ +T+M P + SG S +  SKPSGIGLHLNSIV 
Sbjct: 417 SDSELPESSCVDLTTTSSNKQLA-YSQPVTAMLPPQNSGNSSLAGSKPSGIGLHLNSIVN 475

Query: 459 ALPMGHTATASMK 497
           A+PMG ++T S+K
Sbjct: 476 AVPMGFSSTTSLK 488


>ref|XP_006364287.1| PREDICTED: CRC domain-containing protein TSO1-like [Solanum
           tuberosum]
          Length = 962

 Score = 78.6 bits (192), Expect = 1e-12
 Identities = 53/130 (40%), Positives = 73/130 (56%), Gaps = 2/130 (1%)
 Frame = +3

Query: 117 QNVAKCDSE--ASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASAADLE 290
           + +A  DS+  A Q Q G SRRCLQFE+AQ+K+   S+SS N +  V+    P S A +E
Sbjct: 305 EGIALHDSQTKAGQHQSGISRRCLQFEDAQQKMAPASSSSQNASGIVSCSIQPVSPAVIE 364

Query: 291 SLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVKALPM 470
            ++P  V    SS+  Q+++ S    S+          +  SKPSGIGLHLNSIV  +  
Sbjct: 365 VVEP--VSSNRSSKLTQLVSSSVNSESL---------NVKVSKPSGIGLHLNSIVNGMEA 413

Query: 471 GHTATASMKS 500
           G   T S+KS
Sbjct: 414 GSGVTVSVKS 423


>ref|XP_002324849.1| predicted protein [Populus trichocarpa]
          Length = 561

 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 38/85 (44%), Positives = 52/85 (61%)
 Frame = +3

Query: 102 LQGDIQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASAA 281
           L G +Q +    SE SQL RG SRRCLQFE+AQ+K  ++ T S +L   +    S +S  
Sbjct: 239 LHGPVQLITHHGSEVSQLHRGMSRRCLQFEKAQQKTPMDGTYSLDLAITIIGSISSSSNT 298

Query: 282 DLESLDPSHVGITESSQKKQMINLS 356
           +LE LD S V +  SS+KKQ + +S
Sbjct: 299 ELEILDSSQVELPSSSRKKQTVFVS 323


>gb|EMJ26483.1| hypothetical protein PRUPE_ppa001375mg [Prunus persica]
          Length = 842

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 45/144 (31%), Positives = 63/144 (43%)
 Frame = +3

Query: 141 EASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASAADLESLDPSHVGIT 320
           +A   Q G  RRCLQFEEA          S +  + V +   P+S  + + +  S+  + 
Sbjct: 253 QARSEQGGMHRRCLQFEEAPPCATGKRDCSLSSIQEVNNSEPPSSMGESKLVKLSYADLK 312

Query: 321 ESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVKALPMGHTATASMKS 500
            +S+        R M +  P R  G SP    KPSGIGLHLNSIV A P+   +  S   
Sbjct: 313 STSK--------RQMGTPLPPRCGGNSPSTVPKPSGIGLHLNSIVNAAPLVRASVMSSHL 364

Query: 501 TPIMSFHQVDYHNSSLKLIAPEDR 572
              +    +  +        PEDR
Sbjct: 365 PDNVRCRSISLNMVEKDSAGPEDR 388


>gb|ERP49533.1| hypothetical protein POPTR_0018s01480g [Populus trichocarpa]
          Length = 520

 Score = 63.9 bits (154), Expect = 3e-08
 Identities = 42/104 (40%), Positives = 54/104 (51%)
 Frame = +3

Query: 168 SRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASAADLESLDPSHVGITESSQKKQMI 347
           SRRCLQFE+AQ+K  ++ T S +L   +    S +S  +LE LD S V +  SS+KKQ  
Sbjct: 2   SRRCLQFEKAQQKTPMDGTYSLDLAITIIGSISSSSNTELEILDSSQVELPSSSRKKQ-- 59

Query: 348 NLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVKALPMGHT 479
                              +  S PSG GLHLNSIV  L +G T
Sbjct: 60  ------------------TVFVSMPSGFGLHLNSIVNTLLIGCT 85


>ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
           gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like
           protein [Glycine max]
          Length = 896

 Score = 63.9 bits (154), Expect = 3e-08
 Identities = 47/127 (37%), Positives = 63/127 (49%)
 Frame = +3

Query: 117 QNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPASAADLESL 296
           +N+ +  SEA+    G  RRCLQF EA         +S  L +NV       +AA     
Sbjct: 294 ENILQDGSEATLKHHGIRRRCLQFGEA---------ASNALGRNVK-----LNAA----- 334

Query: 297 DPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIVKALPMGH 476
             SH  IT        +  S  +TS+ P R SG  P  + KPSGIGLHLNSI+ A+P+  
Sbjct: 335 --SHTMIT--------VKPSELVTSLCPRRGSGNFPSTSPKPSGIGLHLNSIINAIPIDQ 384

Query: 477 TATASMK 497
            AT  ++
Sbjct: 385 AATTGVR 391


>emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
           gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like
           protein [Lotus japonicus]
          Length = 897

 Score = 57.0 bits (136), Expect = 3e-06
 Identities = 41/139 (29%), Positives = 70/139 (50%)
 Frame = +3

Query: 96  ATLQGDIQNVAKCDSEASQLQRGFSRRCLQFEEAQKKIIVNSTSSPNLTKNVTSLGSPAS 275
           +TL    +N+++  SEAS    G  RRCL+F EA                  ++LGS   
Sbjct: 296 STLHVRQENISQDGSEASLKYHGIRRRCLKFGEAAS----------------SALGS--- 336

Query: 276 AADLESLDPSHVGITESSQKKQMINLSRTMTSMFPLRRSGKSPIVASKPSGIGLHLNSIV 455
                  + S++ +  +S +   +N  + ++S++ L+R    P   SKP+GIGLHLNSI+
Sbjct: 337 -------NMSNMKLNATSSQMHFVNPFKPVSSLY-LQRG--IPETGSKPAGIGLHLNSII 386

Query: 456 KALPMGHTATASMKSTPIM 512
             +P    +T  M+S+ ++
Sbjct: 387 NGMPPSCASTTGMRSSDVL 405

