BLASTX nr result
ID: Jatropha_contig00018967
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Jatropha_contig00018967 (668 letters) Database: NCBI-nr (updated 2014/02/11) 35,149,712 sequences; 12,374,887,350 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAT38786.2| Gag-pol polyprotein, putative [Solanum demissum] 58 4e-12 gb|EOY08837.1| Uncharacterized protein TCM_024078 [Theobroma cacao] 48 1e-09 gb|EOY32548.1| Uncharacterized protein TCM_040529 [Theobroma cacao] 47 2e-09 gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao] 47 2e-08 emb|CAN70258.1| hypothetical protein VITISV_024387 [Vitis vinifera] 61 3e-07 gb|EOY22757.1| Uncharacterized protein TCM_014834 [Theobroma cacao] 44 5e-07 ref|XP_003535080.1| PREDICTED: uncharacterized protein LOC100781... 40 2e-06 gb|EOY31124.1| Uncharacterized protein TCM_038123 [Theobroma cacao] 58 3e-06 gb|EOY20979.1| Uncharacterized protein TCM_012294 [Theobroma cacao] 58 3e-06 gb|EOY05264.1| CCHC-type integrase-like protein [Theobroma cacao] 58 3e-06 gb|EOY21703.1| Uncharacterized protein TCM_013805 [Theobroma cacao] 57 4e-06 ref|XP_003522041.1| PREDICTED: ARM REPEAT PROTEIN INTERACTING WI... 39 5e-06 emb|CAN74228.1| hypothetical protein VITISV_000583 [Vitis vinifera] 44 6e-06 gb|EOY13296.1| Uncharacterized protein TCM_031836 [Theobroma cacao] 56 8e-06 >gb|AAT38786.2| Gag-pol polyprotein, putative [Solanum demissum] Length = 1140 Score = 58.2 bits (139), Expect(2) = 4e-12 Identities = 27/55 (49%), Positives = 43/55 (78%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ K+ + R K+T+ +K YID +MK+ NQIRLLGE++ E+RI+E+V+V + EKF Sbjct: 115 NLRKEFEMLRMKETEKVKDYIDRIMKIFNQIRLLGEKVEEQRIVEKVIVTLPEKF 169 Score = 39.3 bits (90), Expect(2) = 4e-12 Identities = 29/86 (33%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Frame = -3 Query: 411 AMKQRKVFRQVTVVEE-ALVENHAIKDQ-PSNYXXXXXXXXXXXXKDQHGGRFKGRRGNY 238 A+++RK FR+ E ALV K Q K+Q R + RR Y Sbjct: 196 AVEKRKAFRKEESTSEIALVAAQRSKAQLDGEPKRQQNGRYGKEKKEQSNNRGRYRRSKY 255 Query: 237 PPCPCPHYSKRNHNGNYYYYWPRVQC 160 PPCP + K NH + +Y P VQC Sbjct: 256 PPCP--YCKKTNHTDKFCWYRPGVQC 279 >gb|EOY08837.1| Uncharacterized protein TCM_024078 [Theobroma cacao] Length = 703 Score = 47.8 bits (112), Expect(2) = 1e-09 Identities = 19/55 (34%), Positives = 41/55 (74%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ +K L R K+ + I +++ +MK+VNQIRL+G+ L++ +++E++M+ + E+F Sbjct: 91 NLRRKYELLRMKENQTIGEFVEDLMKLVNQIRLMGDSLIDLKVVEKIMLSLPERF 145 Score = 41.2 bits (95), Expect(2) = 1e-09 Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 1/87 (1%) Frame = -3 Query: 414 QAMKQRKVFRQVTVVEEALVENHAIK-DQPSNYXXXXXXXXXXXXKDQHGGRFKGRRGNY 238 +A +QRK R+ V+ AL K S++ GRF+ ++G + Sbjct: 171 EADEQRKAARRDERVDHALAARAKGKAPADSSFKKNSNETKEKDKTGTTAGRFRNKKGKF 230 Query: 237 PPCPCPHYSKRNHNGNYYYYWPRVQCS 157 P CP H KR+H Y ++ PRV+C+ Sbjct: 231 PICP--HCKKRSHYEAYCWFRPRVKCN 255 >gb|EOY32548.1| Uncharacterized protein TCM_040529 [Theobroma cacao] Length = 1266 Score = 47.0 bits (110), Expect(2) = 2e-09 Identities = 20/48 (41%), Positives = 34/48 (70%) Frame = -2 Query: 634 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 + + KD + +K Y D V++VVNQ+RL GE + E+R++ + +V + EKF Sbjct: 123 VLKMKDEETMKDYSDKVLRVVNQLRLFGENITERRVVNKFLVSLPEKF 170 Score = 41.2 bits (95), Expect(2) = 2e-09 Identities = 30/113 (26%), Positives = 47/113 (41%) Frame = -3 Query: 498 KNFDSKIXXXXXXXXXXXXXXXXXXXXLQAMKQRKVFRQVTVVEEALVENHAIKDQPSNY 319 + F+SKI LQA +QR+ RQ VE AL K + S+ Sbjct: 168 EKFESKISSLEDSKDLTTMSVSELINVLQAQEQRRALRQEDHVEAALAARRVDK-RTSSG 226 Query: 318 XXXXXXXXXXXXKDQHGGRFKGRRGNYPPCPCPHYSKRNHNGNYYYYWPRVQC 160 ++ + +G++G +PPC + K+NH Y +Y P V+C Sbjct: 227 SHKKSEYEKKDKDKRYEEKKQGKKGQFPPCS--YCKKKNHIERYCWYRPHVKC 277 >gb|EOY11267.1| Uncharacterized protein TCM_026511 [Theobroma cacao] Length = 1318 Score = 47.0 bits (110), Expect(2) = 2e-08 Identities = 20/48 (41%), Positives = 34/48 (70%) Frame = -2 Query: 634 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 + + KD + +K Y D V++VVNQ+RL GE + E+R++ + +V + EKF Sbjct: 123 VLKMKDEETMKDYSDKVLRVVNQLRLFGENITERRVVNKFLVSLPEKF 170 Score = 38.1 bits (87), Expect(2) = 2e-08 Identities = 29/113 (25%), Positives = 46/113 (40%) Frame = -3 Query: 498 KNFDSKIXXXXXXXXXXXXXXXXXXXXLQAMKQRKVFRQVTVVEEALVENHAIKDQPSNY 319 + F+SKI LQA +QR+ RQ VE AL K + S+ Sbjct: 168 EKFESKISSLEDSKDLTTMSVSELINALQAQEQRRALRQEDHVEAALAARRVDK-RTSSG 226 Query: 318 XXXXXXXXXXXXKDQHGGRFKGRRGNYPPCPCPHYSKRNHNGNYYYYWPRVQC 160 ++ + +G++ +PPC + K+NH Y +Y P V+C Sbjct: 227 SHKKSEYEKKDKDKRYEEKKQGKKWQFPPCS--YCKKKNHIERYCWYRPHVKC 277 >emb|CAN70258.1| hypothetical protein VITISV_024387 [Vitis vinifera] Length = 530 Score = 60.8 bits (146), Expect = 3e-07 Identities = 26/55 (47%), Positives = 46/55 (83%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 +++++ + R KD + IK YID +M+VVN+IRLLGE+L+++R++E+V+V +LE+F Sbjct: 15 NLKRQFEVLRMKDNESIKDYIDRLMEVVNKIRLLGEDLIDQRVVEKVLVSLLERF 69 >gb|EOY22757.1| Uncharacterized protein TCM_014834 [Theobroma cacao] Length = 996 Score = 43.9 bits (102), Expect(2) = 5e-07 Identities = 16/55 (29%), Positives = 40/55 (72%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ +K L R K+ + + +++ +MK+VNQ +L+G+ L++ +++E++M+ + E+F Sbjct: 63 NLRRKYELLRMKENQPVGEFVEDLMKLVNQSKLMGDSLIDLKVVEKIMLSLPERF 117 Score = 36.2 bits (82), Expect(2) = 5e-07 Identities = 27/89 (30%), Positives = 43/89 (48%), Gaps = 3/89 (3%) Frame = -3 Query: 414 QAMKQRKVFRQVTVVEEALVENHAIKDQPSNYXXXXXXXXXXXXKDQHG---GRFKGRRG 244 +A +QRK R+ V+ AL KD+ KD+ G GR + ++G Sbjct: 143 EADEQRKAARRDERVDHALAAR--AKDKAPVDPSFKKNSNENREKDKAGTAAGRSQNKKG 200 Query: 243 NYPPCPCPHYSKRNHNGNYYYYWPRVQCS 157 +P CP + KRNH+ Y ++ P V+C+ Sbjct: 201 KFPVCP--YCKKRNHSEAYCWFRPGVKCN 227 >ref|XP_003535080.1| PREDICTED: uncharacterized protein LOC100781109 [Glycine max] Length = 444 Score = 40.4 bits (93), Expect(2) = 2e-06 Identities = 16/55 (29%), Positives = 39/55 (70%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ ++ L R ++++ IK Y++ ++ + N+I+LLG + + RI+E+++V V E++ Sbjct: 117 NLRREFELQRMEESETIKEYLNKLLGIANKIKLLGSDFTDSRIVEKILVTVPERY 171 Score = 37.4 bits (85), Expect(2) = 2e-06 Identities = 25/86 (29%), Positives = 35/86 (40%) Frame = -3 Query: 414 QAMKQRKVFRQVTVVEEALVENHAIKDQPSNYXXXXXXXXXXXXKDQHGGRFKGRRGNYP 235 QA +QR++ RQ VVE AL H D+ + + K ++ NYP Sbjct: 197 QAQEQRRLMRQDRVVEGALPAKHHEFDESKKNFFKKNQPASSKNSTNNQNKGKDKKKNYP 256 Query: 234 PCPCPHYSKRNHNGNYYYYWPRVQCS 157 PC H K H + P +CS Sbjct: 257 --PCQHCEKLGHPPYKCWKRPDTKCS 280 >gb|EOY31124.1| Uncharacterized protein TCM_038123 [Theobroma cacao] Length = 586 Score = 57.8 bits (138), Expect = 3e-06 Identities = 25/56 (44%), Positives = 43/56 (76%) Frame = -2 Query: 658 SSIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 S++ ++ L R K+T+ +K YI+ VM++VNQIR+LGE L E R++E++++ + EKF Sbjct: 29 SNVRREFVLMRLKETQTVKEYINQVMRLVNQIRMLGENLPEVRVVEKILISIPEKF 84 >gb|EOY20979.1| Uncharacterized protein TCM_012294 [Theobroma cacao] Length = 434 Score = 57.8 bits (138), Expect = 3e-06 Identities = 27/55 (49%), Positives = 41/55 (74%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ ++ + R K+ K IK Y D +MK+VNQ+RLLGE+L EKRI+ +V+V + EKF Sbjct: 84 NLHREFEILRMKEDKTIKEYSDKIMKLVNQLRLLGEDLSEKRIVNKVLVSLSEKF 138 >gb|EOY05264.1| CCHC-type integrase-like protein [Theobroma cacao] Length = 640 Score = 57.8 bits (138), Expect = 3e-06 Identities = 23/55 (41%), Positives = 42/55 (76%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ ++ L R KDT+ +K Y+D VMK++NQIR+LGE+L E +++++++ + EKF Sbjct: 118 NLRREFELMRMKDTQTVKEYVDQVMKLINQIRMLGEKLSETSVVQKILISIPEKF 172 >gb|EOY21703.1| Uncharacterized protein TCM_013805 [Theobroma cacao] Length = 886 Score = 57.4 bits (137), Expect = 4e-06 Identities = 24/55 (43%), Positives = 43/55 (78%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ ++ L R KD++ ++ +ID VMKVVNQIRLLGE L + +++E++++ +LE+F Sbjct: 92 NLRRQYELLRMKDSQNVQKFIDAVMKVVNQIRLLGENLSDAKVVEKILISLLERF 146 >ref|XP_003522041.1| PREDICTED: ARM REPEAT PROTEIN INTERACTING WITH ABF2-like [Glycine max] Length = 710 Score = 39.3 bits (90), Expect(2) = 5e-06 Identities = 16/55 (29%), Positives = 38/55 (69%) Frame = -2 Query: 655 SIEKKI*LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 ++ ++ L R ++++ IK Y + ++ + N+I+LLG + + +I+E++MV V E++ Sbjct: 117 NLRREFELQRMEESETIKEYSNKLLGIANKIKLLGSDFADSKIVEKIMVTVSERY 171 Score = 37.4 bits (85), Expect(2) = 5e-06 Identities = 24/86 (27%), Positives = 36/86 (41%) Frame = -3 Query: 414 QAMKQRKVFRQVTVVEEALVENHAIKDQPSNYXXXXXXXXXXXXKDQHGGRFKGRRGNYP 235 QA +QR++ RQ VVE AL H D+ + + K ++ NYP Sbjct: 197 QAQEQRRLMRQDRVVEGALPAKHHEVDESKKNFFKKNQPASSENSANNQNKGKDKKKNYP 256 Query: 234 PCPCPHYSKRNHNGNYYYYWPRVQCS 157 PC H K+ H + P +C+ Sbjct: 257 --PCQHCGKKGHAPFRCWRRPDAKCN 280 >emb|CAN74228.1| hypothetical protein VITISV_000583 [Vitis vinifera] Length = 909 Score = 43.5 bits (101), Expect(2) = 6e-06 Identities = 16/48 (33%), Positives = 37/48 (77%) Frame = -2 Query: 634 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 + + K+T+ IK Y D ++ +VN++RLLG++ ++RI++++++ + EK+ Sbjct: 124 MLKMKETETIKDYSDKLLGIVNKVRLLGKDFSDERIVQKILITLPEKY 171 Score = 32.7 bits (73), Expect(2) = 6e-06 Identities = 28/89 (31%), Positives = 41/89 (46%), Gaps = 4/89 (4%) Frame = -3 Query: 414 QAMKQRKVFRQVTVVEEAL---VENHAI-KDQPSNYXXXXXXXXXXXXKDQHGGRFKGRR 247 QA +QR++ R+ +E AL EN KD+ +N K ++ K + Sbjct: 197 QAQEQRRMIRKEESMEGALQAKAENSGGGKDKKNN------------NKKKNNKIDKNKD 244 Query: 246 GNYPPCPCPHYSKRNHNGNYYYYWPRVQC 160 G YPPCP H K NH ++ P V+C Sbjct: 245 GTYPPCP--HCKKTNHPQRKCWWRPDVKC 271 >gb|EOY13296.1| Uncharacterized protein TCM_031836 [Theobroma cacao] Length = 202 Score = 56.2 bits (134), Expect = 8e-06 Identities = 27/48 (56%), Positives = 38/48 (79%) Frame = -2 Query: 634 LFRRKDTKGIK*YIDWVMKVVNQIRLLGEELLEKRIIERVMVGVLEKF 491 + + KD++ IK Y D VMKVVNQ+RLLGE+L EKRI+ +V+V + +KF Sbjct: 124 VLKMKDSETIKEYSDKVMKVVNQLRLLGEDLSEKRIVNKVLVSLPDKF 171