BLASTX nr result

ID: Jatropha_contig00039468 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00039468
         (523 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002523146.1| nucleic acid binding protein, putative [Rici...   118   7e-25
ref|XP_006306427.1| hypothetical protein CARUB_v10012366mg [Caps...    86   6e-15
gb|ESQ33790.1| hypothetical protein EUTSA_v10006950mg [Eutrema s...    84   2e-14
gb|ESQ33789.1| hypothetical protein EUTSA_v10006950mg [Eutrema s...    84   2e-14
gb|AAO00844.1| Unknown protein [Arabidopsis thaliana]                  77   2e-12
ref|NP_849735.1| toprim domain-containing protein [Arabidopsis t...    77   2e-12
ref|XP_003534794.1| PREDICTED: uncharacterized protein LOC100804...    77   2e-12
ref|XP_004512933.1| PREDICTED: DNA primase/helicase-like [Cicer ...    74   1e-11
ref|XP_003546288.1| PREDICTED: uncharacterized protein LOC100779...    74   2e-11
gb|AAD25755.1|AC007060_13 T5I8.13 [Arabidopsis thaliana]               72   5e-11
gb|EOY25656.1| Toprim domain-containing protein isoform 4 [Theob...    69   5e-10
gb|EOY25655.1| Toprim domain-containing protein isoform 3 [Theob...    69   5e-10
gb|EOY25654.1| Toprim domain-containing protein isoform 2 [Theob...    69   5e-10
gb|EOY25653.1| Toprim domain-containing protein isoform 1 [Theob...    69   5e-10
gb|ESW19430.1| hypothetical protein PHAVU_006G124400g [Phaseolus...    65   7e-09
ref|XP_002299018.1| predicted protein [Populus trichocarpa] gi|2...    62   7e-08
ref|XP_006360317.1| PREDICTED: twinkle homolog protein, chloropl...    59   8e-07
ref|XP_002268852.1| PREDICTED: uncharacterized protein LOC100257...    56   4e-06

>ref|XP_002523146.1| nucleic acid binding protein, putative [Ricinus communis]
           gi|223537553|gb|EEF39177.1| nucleic acid binding
           protein, putative [Ricinus communis]
          Length = 700

 Score =  118 bits (296), Expect = 7e-25
 Identities = 72/159 (45%), Positives = 89/159 (55%), Gaps = 4/159 (2%)
 Frame = +3

Query: 57  MLRFAHCNPQTHLRKLIFLPSSINLLHMGSKFFFKPATXXXXXXXXXXXXT--RNPYHTF 230
           M R+A+ +PQ HL KL    S +    MGSK F KP T            +  R  YHT 
Sbjct: 1   MFRYAYYSPQIHLYKLSSSSSKVGF--MGSKLFLKPTTTTLPPLSPFSYSSSGRLQYHTC 58

Query: 231 KRLLPVACSKPISKIPPY--KANGFTSQATVPTPVYGEEEDXXXXXXXXXXXXXXXXGIE 404
           +RLLPV CSKPISK  PY  K NGF   AT+P PV    ED                GI+
Sbjct: 59  RRLLPVFCSKPISKNRPYLPKTNGF---ATLPAPV--SSEDSEKPHLEKLRGKLEVLGIQ 113

Query: 405 MDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
           M+N VPGQ++ LLCP C GG S E++L+LFI+PDG+ A+
Sbjct: 114 MENLVPGQYSSLLCPMCNGGQSGERSLSLFISPDGANAT 152


>ref|XP_006306427.1| hypothetical protein CARUB_v10012366mg [Capsella rubella]
           gi|482575138|gb|EOA39325.1| hypothetical protein
           CARUB_v10012366mg [Capsella rubella]
          Length = 715

 Score = 85.5 bits (210), Expect = 6e-15
 Identities = 62/164 (37%), Positives = 83/164 (50%), Gaps = 11/164 (6%)
 Frame = +3

Query: 60  LRFAHCNPQTHLRKLIFLPSSINLLHMGSKFFFKPATXXXXXXXXXXXXT--RNPYHTFK 233
           +RF    PQTHLRKL     S++LL MGSK F +               +  R      +
Sbjct: 1   MRFLLRLPQTHLRKL---SCSMSLL-MGSKQFLEFCLLPSFAVCSSSSSSPGRQLSSVSR 56

Query: 234 RLLPVACSKPISKIPPY--KANGFTSQATVP---TPVYGEEEDXXXXXXXXXXXXXXXX- 395
           R  PV  S+P+SK  P+  K NG +S  ++P   TPV  EEE+                 
Sbjct: 57  RFRPVLASRPVSKNSPFHQKTNGLSSYTSIPRVQTPVDPEEEEADKRAVSSKLVTLRRKL 116

Query: 396 ---GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGA 518
              GI+  N  PGQH+ L CP+C GGDS EK+L+L+++PDGS A
Sbjct: 117 FEQGIDAQNCHPGQHSGLTCPQCEGGDSGEKSLSLYVSPDGSSA 160


>gb|ESQ33790.1| hypothetical protein EUTSA_v10006950mg [Eutrema salsugineum]
          Length = 708

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 64/164 (39%), Positives = 84/164 (51%), Gaps = 10/164 (6%)
 Frame = +3

Query: 60  LRFAHCNPQTHLRKLIFLPSSINLLHMGSKFF--FKPATXXXXXXXXXXXXTRNPYHTFK 233
           +RF    PQTHLRKL     S+++L MGSK F  F  A              R      K
Sbjct: 1   MRFLLRLPQTHLRKL---SCSMSVL-MGSKQFLEFCLAPSFAASPSYTPGRKRQLSSVSK 56

Query: 234 RLLPVACSKPISKIPPY--KANG---FTSQATVPTPVYGEEEDXXXXXXXXXXXXXXXX- 395
           RL+PV+ S+P+SK  PY  + NG   +TS + +PTPV  EEE                  
Sbjct: 57  RLVPVSASRPVSKNSPYQNRTNGLSSYTSVSRIPTPVDPEEEADKRAVQFRLANLRRRLA 116

Query: 396 --GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
             GI+  N   GQ++ L+CP C GGDS EK+L+L+I PD S A+
Sbjct: 117 ENGIDAQNCPSGQYSGLICPECEGGDSGEKSLSLYIAPDCSSAT 160


>gb|ESQ33789.1| hypothetical protein EUTSA_v10006950mg [Eutrema salsugineum]
          Length = 673

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 64/164 (39%), Positives = 84/164 (51%), Gaps = 10/164 (6%)
 Frame = +3

Query: 60  LRFAHCNPQTHLRKLIFLPSSINLLHMGSKFF--FKPATXXXXXXXXXXXXTRNPYHTFK 233
           +RF    PQTHLRKL     S+++L MGSK F  F  A              R      K
Sbjct: 1   MRFLLRLPQTHLRKL---SCSMSVL-MGSKQFLEFCLAPSFAASPSYTPGRKRQLSSVSK 56

Query: 234 RLLPVACSKPISKIPPY--KANG---FTSQATVPTPVYGEEEDXXXXXXXXXXXXXXXX- 395
           RL+PV+ S+P+SK  PY  + NG   +TS + +PTPV  EEE                  
Sbjct: 57  RLVPVSASRPVSKNSPYQNRTNGLSSYTSVSRIPTPVDPEEEADKRAVQFRLANLRRRLA 116

Query: 396 --GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
             GI+  N   GQ++ L+CP C GGDS EK+L+L+I PD S A+
Sbjct: 117 ENGIDAQNCPSGQYSGLICPECEGGDSGEKSLSLYIAPDCSSAT 160


>gb|AAO00844.1| Unknown protein [Arabidopsis thaliana]
          Length = 709

 Score = 77.4 bits (189), Expect = 2e-12
 Identities = 59/165 (35%), Positives = 81/165 (49%), Gaps = 11/165 (6%)
 Frame = +3

Query: 60  LRFAHCNPQTHLRKLIFLPSSINLLHMGSKFFFKPA---TXXXXXXXXXXXXTRNPYHTF 230
           +RF    PQ H RKL     S+++L MGSK F +     +            +R      
Sbjct: 1   MRFLLRLPQIHFRKL---SCSMSVL-MGSKQFLEFCLLPSFASYPSSPSYSSSRQVSSVS 56

Query: 231 KRLLPVACSKPISKIPPY--KANGFTSQAT---VPTPVYGEEEDXXXXXXXXXXXXXXXX 395
           +R  PV  S+P+SK  PY  + NG +S  +   VPTPV  E E                 
Sbjct: 57  RRFRPVLASRPVSKNSPYYQRTNGLSSYNSIPRVPTPVDTEVEADKRVVLSRLVTLRRKL 116

Query: 396 ---GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
              G++ +N  PGQH+ L+CP C GG+S EK+L+LFI PDGS A+
Sbjct: 117 AEQGVDAENCPPGQHSGLICPTCEGGNSGEKSLSLFIAPDGSSAT 161


>ref|NP_849735.1| toprim domain-containing protein [Arabidopsis thaliana]
           gi|487522982|sp|B5X582.1|TWIH_ARATH RecName:
           Full=Twinkle homolog protein,
           chloroplastic/mitochondrial; AltName: Full=DNA helicase;
           AltName: Full=DNA primase; Flags: Precursor
           gi|209529811|gb|ACI49800.1| At1g30680 [Arabidopsis
           thaliana] gi|332193138|gb|AEE31259.1| toprim
           domain-containing protein [Arabidopsis thaliana]
          Length = 709

 Score = 77.4 bits (189), Expect = 2e-12
 Identities = 59/165 (35%), Positives = 81/165 (49%), Gaps = 11/165 (6%)
 Frame = +3

Query: 60  LRFAHCNPQTHLRKLIFLPSSINLLHMGSKFFFKPA---TXXXXXXXXXXXXTRNPYHTF 230
           +RF    PQ H RKL     S+++L MGSK F +     +            +R      
Sbjct: 1   MRFLLRLPQIHFRKL---SCSMSVL-MGSKQFLEFCLLPSFASYPSSPSYSSSRQVSSVS 56

Query: 231 KRLLPVACSKPISKIPPY--KANGFTSQAT---VPTPVYGEEEDXXXXXXXXXXXXXXXX 395
           +R  PV  S+P+SK  PY  + NG +S  +   VPTPV  E E                 
Sbjct: 57  RRFRPVLASRPVSKNSPYYQRTNGLSSYNSIPRVPTPVDTEVEADKRVVLSRLVTLRRKL 116

Query: 396 ---GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
              G++ +N  PGQH+ L+CP C GG+S EK+L+LFI PDGS A+
Sbjct: 117 AEQGVDAENCPPGQHSGLICPTCEGGNSGEKSLSLFIAPDGSSAT 161


>ref|XP_003534794.1| PREDICTED: uncharacterized protein LOC100804637 [Glycine max]
          Length = 679

 Score = 77.0 bits (188), Expect = 2e-12
 Identities = 43/110 (39%), Positives = 59/110 (53%), Gaps = 6/110 (5%)
 Frame = +3

Query: 210 RNPYHTFKRLLPVACSKPISKIPPY--KANGF--TSQATVPTPVYGEE--EDXXXXXXXX 371
           R+ +   +    V CSKPIS+ PP   + NG+   SQA++P PV  E   E         
Sbjct: 23  RHRFPCHRPFFTVFCSKPISRNPPLPLRTNGYHGASQASIPRPVQLESPVEKNMELQLNI 82

Query: 372 XXXXXXXXGIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
                   G+E +   PGQ+NHLLCP C GGD  E++L+L+I PDG  A+
Sbjct: 83  LKKKLEAIGVETEMCEPGQYNHLLCPECLGGDQEERSLSLYIAPDGGSAA 132


>ref|XP_004512933.1| PREDICTED: DNA primase/helicase-like [Cicer arietinum]
          Length = 697

 Score = 74.3 bits (181), Expect = 1e-11
 Identities = 44/101 (43%), Positives = 57/101 (56%), Gaps = 5/101 (4%)
 Frame = +3

Query: 231 KRLLPVACSKPI-SKIPPY--KANGF--TSQATVPTPVYGEEEDXXXXXXXXXXXXXXXX 395
           + +  V CSK   SK PP   K NG+   SQA VP PVY  EE+                
Sbjct: 54  RTIFTVFCSKKRNSKYPPLPLKTNGYHGASQAKVPKPVY-LEENKLEMQFGVLKKKLEVV 112

Query: 396 GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGA 518
           GI+ +  VPGQ+NHLLCP C GGD+ EK+L++++ PDG  A
Sbjct: 113 GIDTEICVPGQYNHLLCPECQGGDAGEKSLSIYVAPDGGSA 153


>ref|XP_003546288.1| PREDICTED: uncharacterized protein LOC100779625 [Glycine max]
          Length = 678

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 42/110 (38%), Positives = 58/110 (52%), Gaps = 6/110 (5%)
 Frame = +3

Query: 210 RNPYHTFKRLLPVACSKPISKIPP--YKANGF--TSQATVPTPVYGEE--EDXXXXXXXX 371
           R  + + +    V CSKPIS+ PP   + NG+  +S A++P PV  E   E         
Sbjct: 23  RRRFPSHRPFFTVFCSKPISRNPPSPLRTNGYHGSSHASIPRPVQLESPMEKSVEFQLNI 82

Query: 372 XXXXXXXXGIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
                   G+E     PGQ+NHLLCP C GGD  E++L+L+I PDG  A+
Sbjct: 83  LKKKLEAIGMETGMCEPGQYNHLLCPECLGGDQEERSLSLYIAPDGGSAA 132


>gb|AAD25755.1|AC007060_13 T5I8.13 [Arabidopsis thaliana]
          Length = 670

 Score = 72.4 bits (176), Expect = 5e-11
 Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 8/105 (7%)
 Frame = +3

Query: 231 KRLLPVACSKPISKIPPY--KANGFTSQAT---VPTPVYGEEEDXXXXXXXXXXXXXXXX 395
           +R  PV  S+P+SK  PY  + NG +S  +   VPTPV  E E                 
Sbjct: 35  RRFRPVLASRPVSKNSPYYQRTNGLSSYNSIPRVPTPVDTEVEADKRVVLSRLVTLRRKL 94

Query: 396 ---GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
              G++ +N  PGQH+ L+CP C GG+S EK+L+LFI PDGS A+
Sbjct: 95  AEQGVDAENCPPGQHSGLICPTCEGGNSGEKSLSLFIAPDGSSAT 139


>gb|EOY25656.1| Toprim domain-containing protein isoform 4 [Theobroma cacao]
          Length = 578

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 46/104 (44%), Positives = 56/104 (53%), Gaps = 7/104 (6%)
 Frame = +3

Query: 231 KRLLPVACSKPISKIPPY--KANGFTS--QATVPTPVYGEE-EDXXXXXXXXXXXXXXXX 395
           KRL+P   SKP SK      + NGF+S   A V  PVY +E ED                
Sbjct: 56  KRLVPYLSSKPYSKNHSLSLRTNGFSSIPSANVSAPVYSKELEDRPLNMRSLEILKHKLK 115

Query: 396 --GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
             GI++   VPG+ N LLCP C GG+S E +L+LFIN DGS AS
Sbjct: 116 QLGIDISACVPGRENRLLCPSCNGGESEEISLSLFINQDGSSAS 159


>gb|EOY25655.1| Toprim domain-containing protein isoform 3 [Theobroma cacao]
          Length = 712

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 46/104 (44%), Positives = 56/104 (53%), Gaps = 7/104 (6%)
 Frame = +3

Query: 231 KRLLPVACSKPISKIPPY--KANGFTS--QATVPTPVYGEE-EDXXXXXXXXXXXXXXXX 395
           KRL+P   SKP SK      + NGF+S   A V  PVY +E ED                
Sbjct: 56  KRLVPYLSSKPYSKNHSLSLRTNGFSSIPSANVSAPVYSKELEDRPLNMRSLEILKHKLK 115

Query: 396 --GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
             GI++   VPG+ N LLCP C GG+S E +L+LFIN DGS AS
Sbjct: 116 QLGIDISACVPGRENRLLCPSCNGGESEEISLSLFINQDGSSAS 159


>gb|EOY25654.1| Toprim domain-containing protein isoform 2 [Theobroma cacao]
          Length = 682

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 46/104 (44%), Positives = 56/104 (53%), Gaps = 7/104 (6%)
 Frame = +3

Query: 231 KRLLPVACSKPISKIPPY--KANGFTS--QATVPTPVYGEE-EDXXXXXXXXXXXXXXXX 395
           KRL+P   SKP SK      + NGF+S   A V  PVY +E ED                
Sbjct: 56  KRLVPYLSSKPYSKNHSLSLRTNGFSSIPSANVSAPVYSKELEDRPLNMRSLEILKHKLK 115

Query: 396 --GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
             GI++   VPG+ N LLCP C GG+S E +L+LFIN DGS AS
Sbjct: 116 QLGIDISACVPGRENRLLCPSCNGGESEEISLSLFINQDGSSAS 159


>gb|EOY25653.1| Toprim domain-containing protein isoform 1 [Theobroma cacao]
          Length = 705

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 46/104 (44%), Positives = 56/104 (53%), Gaps = 7/104 (6%)
 Frame = +3

Query: 231 KRLLPVACSKPISKIPPY--KANGFTS--QATVPTPVYGEE-EDXXXXXXXXXXXXXXXX 395
           KRL+P   SKP SK      + NGF+S   A V  PVY +E ED                
Sbjct: 56  KRLVPYLSSKPYSKNHSLSLRTNGFSSIPSANVSAPVYSKELEDRPLNMRSLEILKHKLK 115

Query: 396 --GIEMDNFVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
             GI++   VPG+ N LLCP C GG+S E +L+LFIN DGS AS
Sbjct: 116 QLGIDISACVPGRENRLLCPSCNGGESEEISLSLFINQDGSSAS 159


>gb|ESW19430.1| hypothetical protein PHAVU_006G124400g [Phaseolus vulgaris]
          Length = 697

 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 6/110 (5%)
 Frame = +3

Query: 210 RNPYHTFKRLLPVACSKPISKIPP--YKANGF--TSQATVPTPVYGEEEDXXXXXXXXXX 377
           R+ +   +    V CSKP S+  P   + NG+   S A++P PV  E             
Sbjct: 43  RHRFLPHRPFFTVFCSKPTSRNSPSPLRTNGYHGASHASIPRPVQLESPGAKSVELQFNI 102

Query: 378 XXXXXXGIEMDN--FVPGQHNHLLCPRCYGGDSREKTLALFINPDGSGAS 521
                  + M+    VPGQ+NHLLCP C GG+  E++L+L+I PDG  A+
Sbjct: 103 LKKRLEAVGMETGICVPGQYNHLLCPECQGGERAERSLSLYIAPDGGSAA 152


>ref|XP_002299018.1| predicted protein [Populus trichocarpa] gi|222846276|gb|EEE83823.1|
           toprim domain-containing family protein [Populus
           trichocarpa]
          Length = 658

 Score = 62.0 bits (149), Expect = 7e-08
 Identities = 29/69 (42%), Positives = 39/69 (56%)
 Frame = +3

Query: 315 VPTPVYGEEEDXXXXXXXXXXXXXXXXGIEMDNFVPGQHNHLLCPRCYGGDSREKTLALF 494
           +P  VYG + +                GIE+D+F PGQ+N L CP C GG S+EK+ +LF
Sbjct: 9   LPQKVYGLDPEVKKSKLEILRFKLAEVGIELDHFAPGQYNALTCPMCKGGGSKEKSFSLF 68

Query: 495 INPDGSGAS 521
           I+ DG  AS
Sbjct: 69  ISADGGNAS 77


>ref|XP_006360317.1| PREDICTED: twinkle homolog protein,
           chloroplastic/mitochondrial-like [Solanum tuberosum]
          Length = 695

 Score = 58.5 bits (140), Expect = 8e-07
 Identities = 44/135 (32%), Positives = 60/135 (44%), Gaps = 7/135 (5%)
 Frame = +3

Query: 138 MGSKFFF-KPATXXXXXXXXXXXXTRNPYHTFKRLLPVACSKPISKIPPYKANGFTSQAT 314
           MGSK+F  KP+                 + T + +     SKPIS      +  +  Q  
Sbjct: 19  MGSKYFLHKPSITLPTIYKSIPVL----FQTQRLIFSAFASKPISPNRGTSSFSYRPQR- 73

Query: 315 VPTPVYG------EEEDXXXXXXXXXXXXXXXXGIEMDNFVPGQHNHLLCPRCYGGDSRE 476
           +P PV G      +EE                 GI++ +  PGQ+N LLCP C GG S E
Sbjct: 74  IPPPVSGVMLEDPKEEIAESDHEKALKQKLSQVGIDIGSCGPGQYNGLLCPMCKGGGSNE 133

Query: 477 KTLALFINPDGSGAS 521
           K+L+LFI PDG  A+
Sbjct: 134 KSLSLFITPDGHAAT 148


>ref|XP_002268852.1| PREDICTED: uncharacterized protein LOC100257655 [Vitis vinifera]
           gi|297740887|emb|CBI31069.3| unnamed protein product
           [Vitis vinifera]
          Length = 705

 Score = 56.2 bits (134), Expect = 4e-06
 Identities = 35/94 (37%), Positives = 43/94 (45%), Gaps = 7/94 (7%)
 Frame = +3

Query: 258 KPISKIPPYKANGF----TSQATVPTPVYGEEEDXXXXXXXXXXXXXXXX---GIEMDNF 416
           KP S+I P     F    TS + VP PVY E  +                   G +    
Sbjct: 66  KPNSRILPISLKTFALPYTSHSNVPGPVYSENPEDTSNSSARLNVLKKKLEVIGFDTQML 125

Query: 417 VPGQHNHLLCPRCYGGDSREKTLALFINPDGSGA 518
             GQ++HL CP C GGDS EK+L+LFI  DG  A
Sbjct: 126 KTGQYSHLTCPTCKGGDSMEKSLSLFITLDGDHA 159


Top