BLASTX nr result

ID: Jatropha_contig00018966 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00018966
         (639 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAT38786.2| Gag-pol polyprotein, putative [Solanum demissum]        56   1e-11
gb|EOY32548.1| Uncharacterized protein TCM_040529 [Theobroma cacao]    47   2e-09
gb|EOY08837.1| Uncharacterized protein TCM_024078 [Theobroma cacao]    46   4e-09
gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao]    47   1e-08
emb|CAN70258.1| hypothetical protein VITISV_024387 [Vitis vinifera]    60   7e-07
gb|EOY22757.1| Uncharacterized protein TCM_014834 [Theobroma cacao]    42   2e-06
ref|XP_003535080.1| PREDICTED: uncharacterized protein LOC100781...    40   3e-06
gb|EOY21703.1| Uncharacterized protein TCM_013805 [Theobroma cacao]    57   4e-06
gb|EOY20979.1| Uncharacterized protein TCM_012294 [Theobroma cacao]    57   4e-06
gb|EOY05264.1| CCHC-type integrase-like protein [Theobroma cacao]      57   4e-06
emb|CAN74228.1| hypothetical protein VITISV_000583 [Vitis vinifera]    44   6e-06
gb|EOY13296.1| Uncharacterized protein TCM_031836 [Theobroma cacao]    56   7e-06
ref|XP_003522041.1| PREDICTED: ARM REPEAT PROTEIN INTERACTING WI...    39   7e-06

>gb|AAT38786.2| Gag-pol polyprotein, putative [Solanum demissum]
          Length = 1140

 Score = 56.2 bits (134), Expect(2) = 1e-11
 Identities = 26/48 (54%), Positives = 39/48 (81%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           + R K+T+ +K YID +MK+ NQIRLLGE++ E+RI+E+V+V + EKF
Sbjct: 122 MLRMKETEKVKDYIDRIMKIFNQIRLLGEKVEEQRIVEKVIVTLPEKF 169



 Score = 39.3 bits (90), Expect(2) = 1e-11
 Identities = 29/86 (33%), Positives = 38/86 (44%), Gaps = 2/86 (2%)
 Frame = -3

Query: 412 AMKQRKVFRQVTVVEE-ALVENHAIKDQ-PSNYXXXXXXXXXXXXKDQHGGRFKGRRGNY 239
           A+++RK FR+     E ALV     K Q                 K+Q   R + RR  Y
Sbjct: 196 AVEKRKAFRKEESTSEIALVAAQRSKAQLDGEPKRQQNGRYGKEKKEQSNNRGRYRRSKY 255

Query: 238 PPCPCPHYSKRNHNGNYYYYWPRVQC 161
           PPCP  +  K NH   + +Y P VQC
Sbjct: 256 PPCP--YCKKTNHTDKFCWYRPGVQC 279


>gb|EOY32548.1| Uncharacterized protein TCM_040529 [Theobroma cacao]
          Length = 1266

 Score = 47.0 bits (110), Expect(2) = 2e-09
 Identities = 20/48 (41%), Positives = 34/48 (70%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           + + KD + +K Y D V++VVNQ+RL GE + E+R++ + +V + EKF
Sbjct: 123 VLKMKDEETMKDYSDKVLRVVNQLRLFGENITERRVVNKFLVSLPEKF 170



 Score = 41.2 bits (95), Expect(2) = 2e-09
 Identities = 30/113 (26%), Positives = 47/113 (41%)
 Frame = -3

Query: 499 KNFDSKIXXXXXXXXXXXXXXXXXXXXLQAMKQRKVFRQVTVVEEALVENHAIKDQPSNY 320
           + F+SKI                    LQA +QR+  RQ   VE AL      K + S+ 
Sbjct: 168 EKFESKISSLEDSKDLTTMSVSELINVLQAQEQRRALRQEDHVEAALAARRVDK-RTSSG 226

Query: 319 XXXXXXXXXXXXKDQHGGRFKGRRGNYPPCPCPHYSKRNHNGNYYYYWPRVQC 161
                         ++  + +G++G +PPC   +  K+NH   Y +Y P V+C
Sbjct: 227 SHKKSEYEKKDKDKRYEEKKQGKKGQFPPCS--YCKKKNHIERYCWYRPHVKC 277


>gb|EOY08837.1| Uncharacterized protein TCM_024078 [Theobroma cacao]
          Length = 703

 Score = 45.8 bits (107), Expect(2) = 4e-09
 Identities = 18/48 (37%), Positives = 37/48 (77%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           L R K+ + I  +++ +MK+VNQIRL+G+ L++ +++E++M+ + E+F
Sbjct: 98  LLRMKENQTIGEFVEDLMKLVNQIRLMGDSLIDLKVVEKIMLSLPERF 145



 Score = 41.2 bits (95), Expect(2) = 4e-09
 Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 1/87 (1%)
 Frame = -3

Query: 415 QAMKQRKVFRQVTVVEEALVENHAIK-DQPSNYXXXXXXXXXXXXKDQHGGRFKGRRGNY 239
           +A +QRK  R+   V+ AL      K    S++                 GRF+ ++G +
Sbjct: 171 EADEQRKAARRDERVDHALAARAKGKAPADSSFKKNSNETKEKDKTGTTAGRFRNKKGKF 230

Query: 238 PPCPCPHYSKRNHNGNYYYYWPRVQCS 158
           P CP  H  KR+H   Y ++ PRV+C+
Sbjct: 231 PICP--HCKKRSHYEAYCWFRPRVKCN 255


>gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao]
          Length = 1318

 Score = 47.0 bits (110), Expect(2) = 1e-08
 Identities = 20/48 (41%), Positives = 34/48 (70%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           + + KD + +K Y D V++VVNQ+RL GE + E+R++ + +V + EKF
Sbjct: 123 VLKMKDEETMKDYSDKVLRVVNQLRLFGENITERRVVNKFLVSLPEKF 170



 Score = 38.1 bits (87), Expect(2) = 1e-08
 Identities = 29/113 (25%), Positives = 46/113 (40%)
 Frame = -3

Query: 499 KNFDSKIXXXXXXXXXXXXXXXXXXXXLQAMKQRKVFRQVTVVEEALVENHAIKDQPSNY 320
           + F+SKI                    LQA +QR+  RQ   VE AL      K + S+ 
Sbjct: 168 EKFESKISSLEDSKDLTTMSVSELINALQAQEQRRALRQEDHVEAALAARRVDK-RTSSG 226

Query: 319 XXXXXXXXXXXXKDQHGGRFKGRRGNYPPCPCPHYSKRNHNGNYYYYWPRVQC 161
                         ++  + +G++  +PPC   +  K+NH   Y +Y P V+C
Sbjct: 227 SHKKSEYEKKDKDKRYEEKKQGKKWQFPPCS--YCKKKNHIERYCWYRPHVKC 277


>emb|CAN70258.1| hypothetical protein VITISV_024387 [Vitis vinifera]
          Length = 530

 Score = 59.7 bits (143), Expect = 7e-07
 Identities = 26/48 (54%), Positives = 41/48 (85%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           + R KD + IK YID +M+VVN+IRLLGE+L+++R++E+V+V +LE+F
Sbjct: 22  VLRMKDNESIKDYIDRLMEVVNKIRLLGEDLIDQRVVEKVLVSLLERF 69


>gb|EOY22757.1| Uncharacterized protein TCM_014834 [Theobroma cacao]
          Length = 996

 Score = 42.0 bits (97), Expect(2) = 2e-06
 Identities = 15/48 (31%), Positives = 36/48 (75%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           L R K+ + +  +++ +MK+VNQ +L+G+ L++ +++E++M+ + E+F
Sbjct: 70  LLRMKENQPVGEFVEDLMKLVNQSKLMGDSLIDLKVVEKIMLSLPERF 117



 Score = 36.2 bits (82), Expect(2) = 2e-06
 Identities = 27/89 (30%), Positives = 43/89 (48%), Gaps = 3/89 (3%)
 Frame = -3

Query: 415 QAMKQRKVFRQVTVVEEALVENHAIKDQPSNYXXXXXXXXXXXXKDQHG---GRFKGRRG 245
           +A +QRK  R+   V+ AL      KD+                KD+ G   GR + ++G
Sbjct: 143 EADEQRKAARRDERVDHALAAR--AKDKAPVDPSFKKNSNENREKDKAGTAAGRSQNKKG 200

Query: 244 NYPPCPCPHYSKRNHNGNYYYYWPRVQCS 158
            +P CP  +  KRNH+  Y ++ P V+C+
Sbjct: 201 KFPVCP--YCKKRNHSEAYCWFRPGVKCN 227


>ref|XP_003535080.1| PREDICTED: uncharacterized protein LOC100781109 [Glycine max]
          Length = 444

 Score = 39.7 bits (91), Expect(2) = 3e-06
 Identities = 16/48 (33%), Positives = 35/48 (72%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           L R ++++ IK Y++ ++ + N+I+LLG +  + RI+E+++V V E++
Sbjct: 124 LQRMEESETIKEYLNKLLGIANKIKLLGSDFTDSRIVEKILVTVPERY 171



 Score = 37.4 bits (85), Expect(2) = 3e-06
 Identities = 25/86 (29%), Positives = 35/86 (40%)
 Frame = -3

Query: 415 QAMKQRKVFRQVTVVEEALVENHAIKDQPSNYXXXXXXXXXXXXKDQHGGRFKGRRGNYP 236
           QA +QR++ RQ  VVE AL   H   D+                   +  + K ++ NYP
Sbjct: 197 QAQEQRRLMRQDRVVEGALPAKHHEFDESKKNFFKKNQPASSKNSTNNQNKGKDKKKNYP 256

Query: 235 PCPCPHYSKRNHNGNYYYYWPRVQCS 158
             PC H  K  H     +  P  +CS
Sbjct: 257 --PCQHCEKLGHPPYKCWKRPDTKCS 280


>gb|EOY21703.1| Uncharacterized protein TCM_013805 [Theobroma cacao]
          Length = 886

 Score = 57.0 bits (136), Expect = 4e-06
 Identities = 24/48 (50%), Positives = 39/48 (81%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           L R KD++ ++ +ID VMKVVNQIRLLGE L + +++E++++ +LE+F
Sbjct: 99  LLRMKDSQNVQKFIDAVMKVVNQIRLLGENLSDAKVVEKILISLLERF 146


>gb|EOY20979.1| Uncharacterized protein TCM_012294 [Theobroma cacao]
          Length = 434

 Score = 57.0 bits (136), Expect = 4e-06
 Identities = 27/48 (56%), Positives = 37/48 (77%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           + R K+ K IK Y D +MK+VNQ+RLLGE+L EKRI+ +V+V + EKF
Sbjct: 91  ILRMKEDKTIKEYSDKIMKLVNQLRLLGEDLSEKRIVNKVLVSLSEKF 138


>gb|EOY05264.1| CCHC-type integrase-like protein [Theobroma cacao]
          Length = 640

 Score = 57.0 bits (136), Expect = 4e-06
 Identities = 23/48 (47%), Positives = 38/48 (79%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           L R KDT+ +K Y+D VMK++NQIR+LGE+L E  +++++++ + EKF
Sbjct: 125 LMRMKDTQTVKEYVDQVMKLINQIRMLGEKLSETSVVQKILISIPEKF 172


>emb|CAN74228.1| hypothetical protein VITISV_000583 [Vitis vinifera]
          Length = 909

 Score = 43.5 bits (101), Expect(2) = 6e-06
 Identities = 16/48 (33%), Positives = 37/48 (77%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           + + K+T+ IK Y D ++ +VN++RLLG++  ++RI++++++ + EK+
Sbjct: 124 MLKMKETETIKDYSDKLLGIVNKVRLLGKDFSDERIVQKILITLPEKY 171



 Score = 32.7 bits (73), Expect(2) = 6e-06
 Identities = 28/89 (31%), Positives = 41/89 (46%), Gaps = 4/89 (4%)
 Frame = -3

Query: 415 QAMKQRKVFRQVTVVEEAL---VENHAI-KDQPSNYXXXXXXXXXXXXKDQHGGRFKGRR 248
           QA +QR++ R+   +E AL    EN    KD+ +N             K ++    K + 
Sbjct: 197 QAQEQRRMIRKEESMEGALQAKAENSGGGKDKKNN------------NKKKNNKIDKNKD 244

Query: 247 GNYPPCPCPHYSKRNHNGNYYYYWPRVQC 161
           G YPPCP  H  K NH     ++ P V+C
Sbjct: 245 GTYPPCP--HCKKTNHPQRKCWWRPDVKC 271


>gb|EOY13296.1| Uncharacterized protein TCM_031836 [Theobroma cacao]
          Length = 202

 Score = 56.2 bits (134), Expect = 7e-06
 Identities = 27/48 (56%), Positives = 38/48 (79%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           + + KD++ IK Y D VMKVVNQ+RLLGE+L EKRI+ +V+V + +KF
Sbjct: 124 VLKMKDSETIKEYSDKVMKVVNQLRLLGEDLSEKRIVNKVLVSLPDKF 171


>ref|XP_003522041.1| PREDICTED: ARM REPEAT PROTEIN INTERACTING WITH ABF2-like [Glycine
           max]
          Length = 710

 Score = 38.5 bits (88), Expect(2) = 7e-06
 Identities = 16/48 (33%), Positives = 34/48 (70%)
 Frame = -2

Query: 635 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 492
           L R ++++ IK Y + ++ + N+I+LLG +  + +I+E++MV V E++
Sbjct: 124 LQRMEESETIKEYSNKLLGIANKIKLLGSDFADSKIVEKIMVTVSERY 171



 Score = 37.4 bits (85), Expect(2) = 7e-06
 Identities = 24/86 (27%), Positives = 36/86 (41%)
 Frame = -3

Query: 415 QAMKQRKVFRQVTVVEEALVENHAIKDQPSNYXXXXXXXXXXXXKDQHGGRFKGRRGNYP 236
           QA +QR++ RQ  VVE AL   H   D+                   +  + K ++ NYP
Sbjct: 197 QAQEQRRLMRQDRVVEGALPAKHHEVDESKKNFFKKNQPASSENSANNQNKGKDKKKNYP 256

Query: 235 PCPCPHYSKRNHNGNYYYYWPRVQCS 158
             PC H  K+ H     +  P  +C+
Sbjct: 257 --PCQHCGKKGHAPFRCWRRPDAKCN 280


Top