BLASTX nr result
ID: Jatropha_contig00000959
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00000959 (614 letters)

Database: NCBI-nr (updated 2014/02/11)
35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_003553268.1| PREDICTED: putative ribonuclease H protein A...     54   1e-10
emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...     59   2e-10
ref|XP_003521972.1| PREDICTED: uncharacterized protein LOC100800...     54   2e-10
gb|EOY20792.1| Uncharacterized protein TCM_012134 [Theobroma cacao]     47   6e-10
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...     50   8e-09
gb|EOY15545.1| Uncharacterized protein TCM_034564 [Theobroma cacao]     62   1e-07
ref|XP_003530391.1| PREDICTED: uncharacterized protein LOC100804...     50   1e-07
gb|EOY06142.1| Uncharacterized protein TCM_020959 [Theobroma cacao]     62   2e-07
ref|XP_003541643.1| PREDICTED: uncharacterized protein LOC100817...     58   2e-06
emb|CCA66140.1| hypothetical protein [Beta vulgaris subsp. vulga...     40   2e-06
emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp. vulga...     44   2e-06
gb|EOY06145.1| Uncharacterized protein TCM_020961 [Theobroma cacao]     57   3e-06
ref|XP_003528143.1| PREDICTED: uncharacterized protein LOC100778...     43   3e-06
emb|CAN71985.1| hypothetical protein VITISV_011667 [Vitis vinifera]     42   5e-06
emb|CAN63585.1| hypothetical protein VITISV_026806 [Vitis vinifera]     41   9e-06

>ref|XP_003553268.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max]
Length = 625

Score = 54.3 bits (129), Expect(2) = 1e-10
Identities = 34/143 (23%), Positives = 55/143 (38%), Gaps = 1/143 (0%)
Frame = +3

Query: 186 YGNALISKYGNEWTWNSNLGSKYWKKLSLVWRDISSIGTSSLKIIFKGGFRWFLGSGNFI 365
           + L+SKYG WN+ + S W+D+ S+ FRW +G G+ +
Sbjct: 288 WARVLLSKYGG---WNTLCSGRDNAHFSQWWKDLRSVFQQHHSNSLINNFRWKVGDGSRL 344

Query: 366 NFWLDLCID-DXXXXXXXXXXXXXAKNKRSKICECGFFNNGRWVWQIKWLRDFRSIEDEY 542
           NFW D + D + + I G + RW W+ +W R+ E +
Sbjct: 345 NFWKDKWREGDLSLKDKYPSLYNVSTQQNHLINSMGILVDNRWEWKFQWRRNLFDHEVDM 404

Query: 543 WAQLKRRLERLSIIPKSENQLIW 611
           + I P S + L+W
Sbjct: 405 AVAFMADIAEFQIQPASRDLLLW 427

Score = 37.7 bits (86), Expect(2) = 1e-10
Identities = 16/46 (34%), Positives = 24/46 (52%)
Frame = +2

Query: 53  LIWVVMSSTCCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLW 190
           + WV + C K+ GG+GI +L + N +L KW W + LW
Sbjct: 243 IAWVNWDTVCLPKSKGGLGIKDLTKFNEALLGKWGWELANNHNQLW 288

>emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp.
vulgaris]
Length = 1381

Score = 59.3 bits (142), Expect(2) = 2e-10
Identities = 34/115 (29%), Positives = 50/115 (43%), Gaps = 2/115 (1%)
Frame = +3

Query: 276 WRDI--SSIGTSSLKIIFKGGFRWFLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKR 449
           WR I S + S + K R +G+G FWLD + D N
Sbjct: 905 WRFICASILNHPSARSFVKTKLRKAVGNGVKTLFWLDTWLGDSPLKLRFPRLFTIVDNPM 964

Query: 450 SKICECGFFNNGRWVWQIKWLRDFRSIEDEYWAQLKRRLERLSIIPKSENQLIWT 614
           + I CG + WVW W R FR + E W +L+ L + + P ++++LIWT
Sbjct: 965 AYIASCGSWCGREWVWNFSWSRVFRPRDAEEWEELQGLLGSVCLSPSTDDRLIWT 1019

Score = 32.3 bits (72), Expect(2) = 2e-10
Identities = 16/51 (31%), Positives = 23/51 (45%)
Frame = +2

Query: 41  RLQALIWVVMSSTCCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLWK 193
           R +L V + K GG+ NL +N+SL KW W + LW+
Sbjct: 828 RKSSLALVAWNQVVLPKESGGLNCGNLLNRNISLLFKWIWRLSHDPESLWQ 878

>ref|XP_003521972.1| PREDICTED: uncharacterized protein LOC100800774 [Glycine max]
Length = 684

Score = 53.9 bits (128), Expect(2) = 2e-10
Identities = 34/143 (23%), Positives = 55/143 (38%), Gaps = 1/143 (0%)
Frame = +3

Query: 186 YGNALISKYGNEWTWNSNLGSKYWKKLSLVWRDISSIGTSSLKIIFKGGFRWFLGSGNFI 365
           + L+SKYG WN+ + S W+D+ S+ FRW +G G+ +
Sbjct: 288 WARVLLSKYGG---WNTLCSGRDSAHFSQWWKDLRSVFQQHHSNSLINNFRWKVGDGSRL 344

Query: 366 NFWLDLCID-DXXXXXXXXXXXXXAKNKRSKICECGFFNNGRWVWQIKWLRDFRSIEDEY 542
           FW D + D + + I G + RW W+ +W R+ E +
Sbjct: 345 KFWKDKWREGDLSLKDKYPSLYNVSTQQNHLINSMGILVDNRWEWKFQWRRNLFDHEVDM 404

Query: 543 WAQLKRRLERLSIIPKSENQLIW 611
           A + I P S + L+W
Sbjct: 405 AAAFMADIAEFQIQPASRDLLLW 427

Score = 37.7 bits (86), Expect(2) = 2e-10
Identities = 16/46 (34%), Positives = 24/46 (52%)
Frame = +2

Query: 53  LIWVVMSSTCCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLW 190
           + WV + C K+ GG+GI +L + N +L KW W + LW
Sbjct: 243 IAWVNWDTVCLPKSKGGLGIKDLTKFNEALLGKWGWELANNHNQLW 288

>gb|EOY20792.1| Uncharacterized protein TCM_012134 [Theobroma cacao]
Length = 356

Score = 46.6 bits (109), Expect(2) = 6e-10
Identities = 36/154 (23%), Positives = 65/154 (42%), Gaps = 9/154 (5%)
Frame = +3

Query: 174 KMAFYGNALISKYGNEWTWNSNLGSKYW-------KKLSLVWRDISSIGT--SSLKIIFK 326
           K A + ++ KYG G +W ++S +WR I + + ++
Sbjct: 49  KDALWRRLIMEKYG--------AGQPHWIPSSSSTARMSSIWRSIVQLPSIEGMQNLVGF 100

Query: 327 GGFRWFLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKRSKICECGFFNNGRWVWQIK 506
           +RW +G+G I FW D IDD A +K ++ + NG +W I
Sbjct: 101 HAYRWIVGNGETICFWFDKWIDDIPLASKFPRLFSLAVDKDMRVLDA--CQNG--LWSIN 156

Query: 507 WLRDFRSIEDEYWAQLKRRLERLSIIPKSENQLI 608
           + R S E E ++ L +S++P +++L+
Sbjct: 157 FRRVLYSWEKEDLDRILNSLSSVSLVPLRDDKLV 190

Score = 43.1 bits (100), Expect(2) = 6e-10
Identities = 17/38 (44%), Positives = 22/38 (57%)
Frame = +2

Query: 80 CCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLWK 193
          C K G +GI NL KNL+L KWWW + + LW+
Sbjct: 17 CKPKRFGELGITNLSYKNLTLLAKWWWRYGTDKDALWR 54

>emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1383

Score = 50.1 bits (118), Expect(2) = 8e-09
Identities = 31/114 (27%), Positives = 48/114 (42%), Gaps = 2/114 (1%)
Frame = +3

Query: 276 WRDISS--IGTSSLKIIFKGGFRWFLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKR 449
           W+ I + +G ++I G R +G+G FW D + + A NK
Sbjct: 906 WKSICAAVLGHEGARLIAVNGMRKNVGNGISSLFWHDTWLCEQPLKRIAPRLFSIAINKN 965

Query: 450 SKICECGFFNNGRWVWQIKWLRDFRSIEDEYWAQLKRRLERLSIIPKSENQLIW 611
           S I G + WVW W R R + A L L+ + + P +++QLIW
Sbjct: 966 SSIASYGVWEGFNWVWVFSWKRVLRPQDLVEKAHLDELLKSVRLDPNADDQLIW 1019

Score = 35.8 bits (81), Expect(2) = 8e-09
Identities = 18/30 (60%), Positives = 20/30 (66%)
Frame = +2

Query: 89  KTLGGVGIPNLKRKNLSLFEKWWWWFRFEN 178
           KT GG+GI N+ KNLSL KW W FEN
Sbjct: 845 KTSGGLGIGNILHKNLSLLFKWIWRL-FEN 873

>gb|EOY15545.1| Uncharacterized protein TCM_034564 [Theobroma cacao]
Length = 175

Score = 62.0 bits (149), Expect = 1e-07
Identities = 32/128 (25%), Positives = 53/128 (41%)
Frame = +3

Query: 228 WNSNLGSKYWKKLSLVWRDISSIGTSSLKIIFKGGFRWFLGSGNFINFWLDLCIDDXXXX 407
           W GS+YW+ ++ + S S F +G+G +NFW D I+
Sbjct: 31  WEDTFGSRYWRAGAMYRGNAPSPLESICLNWVHSNFGLIVGNGENLNFWQDEWIEGVVLA 90

Query: 408 XXXXXXXXXAKNKRSKICECGFFNNGRWVWQIKWLRDFRSIEDEYWAQLKRRLERLSIIP 587
           A K K+ E G + +GRW W +++ R E E W Q L+ +
Sbjct: 91  DAFPRMFALAVKKSGKVTEFGIWEDGRWAWNVQFRRQLFDWEVEQWEQFHDSLKEFHLCK 150

Query: 588 KSENQLIW 611
           +++L+W
Sbjct: 151 DFKDELVW 158

>ref|XP_003530391.1| PREDICTED: uncharacterized protein LOC100804594 [Glycine max]
Length = 4413

Score = 49.7 bits (117), Expect(2) = 1e-07
Identities = 34/144 (23%), Positives = 57/144 (39%), Gaps = 1/144 (0%)
Frame = +3

Query: 183  FYGNALISKYGNEWTWNSNLGSKYWKKLSLVWRDISSIGTSSLKIIFKGGFRWFLGSGNF 362
            F+ LISKYG W + K W S W+D+ + I + W +G G+
Sbjct: 3020 FWARVLISKYGG-WADLQSGRDKVWH--SQWWKDLRRLYHQPDFNIIQQNMIWKVGCGDQ 3076

Query: 363  INFWLDLCIDDXXXXXXXXXXXXXAKNKRS-KICECGFFNNGRWVWQIKWLRDFRSIEDE 539
            I FW D + + +++ I G F++ W W +KW R E E
Sbjct: 3077 IKFWQDSWLSEGCNLQQKYNQLFMISRQQNLPISNLGKFSHNLWSWDLKWRRRLFDHEYE 3136

Query: 540  YWAQLKRRLERLSIIPKSENQLIW 611
            + + I + ++Q++W
Sbjct: 3137 MAVAFMEEISDIPIQQQVQDQMLW 3160

Score = 32.0 bits (71), Expect(2) = 1e-07
Identities = 14/44 (31%), Positives = 20/44 (45%)
Frame = +2

Query: 59   WVVMSSTCCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLW 190
            WV C K GG+GI +L + N +L +W W + W
Sbjct: 2978 WVKWDVICLPKNEGGLGIKDLAKFNAALRGRWIWDLAANHNQFW 3021

>gb|EOY06142.1| Uncharacterized protein TCM_020959 [Theobroma cacao]
Length = 253

Score = 61.6 bits (148), Expect = 2e-07
Identities = 35/115 (30%), Positives = 54/115 (46%), Gaps = 2/115 (1%)
Frame = +3

Query: 273 VWRDISSIGTSS--LKIIFKGGFRWFLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNK 446
           +W++I S T S L + G + +GSG I FW D ID K
Sbjct: 1   MWKNIISPLTPSWNLSSVLHVGIGYLIGSGTRIKFWDDDWIDGIILRSTFSRIFSLTNKK 60

Query: 447 RSKICECGFFNNGRWVWQIKWLRDFRSIEDEYWAQLKRRLERLSIIPKSENQLIW 611
           K+ E G+++NG W WQ+ R E +YWA +K L + + ++ ++LIW
Sbjct: 61  FGKVSEFGYWDNGGWQWQMDLRRRLFDWEKDYWAHVKECLGHIHLDLETNDKLIW 115

>ref|XP_003541643.1| PREDICTED: uncharacterized protein LOC100817727 [Glycine max]
Length = 1869

Score = 57.8 bits (138), Expect = 2e-06
Identities = 49/170 (28%), Positives = 68/170 (40%), Gaps = 19/170 (11%)
Frame = +3

Query: 159 GGLDLK-MAFYGNALISKYGNEWTWNSN------LGSKY--WKKL---------SLVWRD 284
           GGL +K + + AL+SK+G E N N L SKY W L S W+D
Sbjct: 148 GGLGIKDLTKFNEALLSKWGWELANNQNHLWARTLMSKYGGWNALIHGRNCTGFSNWWKD 207

Query: 285 ISSIGTSSLKIIFKGGFRWFLGSGNFINFWLDLC-IDDXXXXXXXXXXXXXAKNKRSKIC 461
           + SI RW +G+G I FW D DD + + S I
Sbjct: 208 LKSIFQQQHSNSLTSNLRWKMGNGAKIKFWKDQWREDDLTLQEKYPTLYQVSYQQDSSIS 267

Query: 462 ECGFFNNGRWVWQIKWLRDFRSIEDEYWAQLKRRLERLSIIPKSENQLIW 611
           G + RW W++ W R F E + A ++ + I S + LIW
Sbjct: 268 LMGLLVDNRWEWKMHWRRSFFDHEIDMVAAFMDEIDAVQIRLSSMDSLIW 317

>emb|CCA66140.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1381

Score = 40.0 bits (92), Expect(2) = 2e-06
Identities = 27/114 (23%), Positives = 46/114 (40%), Gaps = 2/114 (1%)
Frame = +3

Query: 276 WRDI--SSIGTSSLKIIFKGGFRWFLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKR 449
           WR I S + +K + G R + +G+ FW D+ I + A +
Sbjct: 905 WRSICASYLRNQDVKDMAIKGVRKNVKNGHDSLFWHDVWIGEATLKSLFPRLFTIAMSPN 964

Query: 450 SKICECGFFNNGRWVWQIKWLRDFRSIEDEYWAQLKRRLERLSIIPKSENQLIW 611
           + GF++ WVW W R R + L L++ + + ++QLIW
Sbjct: 965 GSVASYGFWDGLAWVWSFSWRRMLRPQDLIEKTHLDSLLQQAHVAYEKKDQLIW 1018

Score = 37.4 bits (85), Expect(2) = 2e-06
Identities = 17/34 (50%), Positives = 21/34 (61%)
Frame = +2

Query: 89  KTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLW 190
           K+LGG+GI N+K KN +L KW W E LW
Sbjct: 844 KSLGGMGIGNIKHKNQALLFKWIWRLFDEPSQLW 877

>emb|CCA65974.1| hypothetical protein [Beta vulgaris subsp.
vulgaris]
Length = 1379

Score = 43.5 bits (101), Expect(2) = 2e-06
Identities = 26/107 (24%), Positives = 42/107 (39%)
Frame = +3

Query: 294 IGTSSLKIIFKGGFRWFLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKRSKICECGF 473
           + + K + K G R +G+G FWLD I A + + + GF
Sbjct: 913 LNDQAAKSVMKIGLRKIIGNGGNTLFWLDPWISSHPLKILYPRLFSIAIHPNASVAAHGF 972

Query: 474 FNNGRWVWQIKWLRDFRSIEDEYWAQLKRRLERLSIIPKSENQLIWT 614
           + WVW W R+ R + A + L+ + E++L WT
Sbjct: 973 WEGYFWVWSFSWRRNLRPRDKIEKANMDALLKSVCPSLLCEDKLAWT 1019

Score = 33.9 bits (76), Expect(2) = 2e-06
Identities = 15/34 (44%), Positives = 20/34 (58%)
Frame = +2

Query: 89  KTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLW 190
           K+ GG+ I N+ KNL++ KW W F E LW
Sbjct: 844 KSRGGLNIGNVMHKNLAMLFKWIWRFFQEPNNLW 877

>gb|EOY06145.1| Uncharacterized protein TCM_020961 [Theobroma cacao]
Length = 172

Score = 57.4 bits (137), Expect = 3e-06
Identities = 29/94 (30%), Positives = 44/94 (46%)
Frame = +3

Query: 330 GFRWFLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKRSKICECGFFNNGRWVWQIKW 509
           G + +GSG INFW D ID A K K+ E G+++N W WQ+
Sbjct: 6   GIGYLIGSGTGINFWDDQWIDGIILRSTFSRIFSLANKKFGKVYEFGYWDNWGWQWQMYL 65

Query: 510 LRDFRSIEDEYWAQLKRRLERLSIIPKSENQLIW 611
           + E +YWA K L + + ++ ++LIW
Sbjct: 66  RKRLFDWEKDYWAHFKECLVHIHLDRETNDKLIW 99

>ref|XP_003528143.1| PREDICTED: uncharacterized protein LOC100778359 [Glycine max]
Length = 2621

Score = 42.7 bits (99), Expect(2) = 3e-06
Identities = 15/49 (30%), Positives = 26/49 (53%)
Frame = +2

Query: 53   LIWVVMSSTCCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLWKCI 199
            + W+ + C K GG+GI ++ N++L KW W ++ G LW +
Sbjct: 1454 IAWISWKTVCLPKDRGGLGIKDIHTFNMALLGKWMWNLMYQQGALWVAV 1502

Score = 34.3 bits (77), Expect(2) = 3e-06
Identities = 31/148 (20%), Positives = 55/148 (37%), Gaps = 3/148 (2%)
Frame = +3

Query: 180  AFYGNALISKYGNEWTWNSNLGSKYWKKLSLVWRDISSIGTSSL--KIIFKGGFRWFLGS 353
            A + L +KYG W +G S+ WRD+ + K +++ +W + +
Sbjct: 1497 ALWVAVLEAKYGG---WRGLVGEGNSSCQSIWWRDLIKVMHMPYNGKTLYQQ-IKWKVEA 1552

Query: 354  GNFINFWLDLCID-DXXXXXXXXXXXXXAKNKRSKICECGFFNNGRWVWQIKWLRDFRSI 530
            G+ + FW D I D + ++ + G +N W W W
Sbjct: 1553 GDKVRFWEDRWISHDQSLAEKYPRLYVNSNHQYQLVGSLGQHSNLGWNWNFSWRCQLFDR 1612

Query: 531  EDEYWAQLKRRLERLSIIPKSENQLIWT 614
            E E +E +SI + + +WT
Sbjct: 1613 EIESAISFLSEVEGISINSQGSDTWVWT 1640

>emb|CAN71985.1| hypothetical protein VITISV_011667 [Vitis vinifera]
Length = 413

Score = 42.0 bits (97), Expect(2) = 5e-06
Identities = 36/151 (23%), Positives = 63/151 (41%), Gaps = 6/151 (3%)
Frame = +3

Query: 180 AFYGNALISKYG---NEWTWNSNLGSKY---WKKLSLVWRDISSIGTSSLKIIFKGGFRW 341
           A + ++S YG N W N+N+ + WK ++LV+++ S R+
Sbjct: 80  ALWHQVILSIYGSHSNGWDVNNNVRWSHRCPWKAIALVFQEFSKFT------------RF 127

Query: 342 FLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKRSKICECGFFNNGRWVWQIKWLRDF 521
           +G+G+ I FW DL D +K + I ++ + W + R+
Sbjct: 128 VVGNGDRIRFWDDLWWGDQPLGTQYPRLLSVVTDKNAPISSILGYSR-PFSWNFNFRRNL 186

Query: 522 RSIEDEYWAQLKRRLERLSIIPKSENQLIWT 614
           + E E L R L+RL I P ++ W+
Sbjct: 187 TNSEIEDLESLMRSLDRLHISPSVPDKRSWS 217

Score = 34.3 bits (77), Expect(2) = 5e-06
Identities = 13/37 (35%), Positives = 20/37 (54%)
Frame = +2

Query: 80 CCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLW 190
          C K+ GG+G + +N++L KW W + E LW
Sbjct: 46 CKPKSRGGLGFGKISMRNVALLGKWLWRYPREGSALW 82

>emb|CAN63585.1| hypothetical protein VITISV_026806 [Vitis vinifera]
Length = 684

Score = 41.2 bits (95), Expect(2) = 9e-06
Identities = 36/151 (23%), Positives = 61/151 (40%), Gaps = 6/151 (3%)
Frame = +3

Query: 180 AFYGNALISKYG---NEWTWNSNLGSKY---WKKLSLVWRDISSIGTSSLKIIFKGGFRW 341
           A + ++S YG N W N+N+ + WK ++LV+++ S R+
Sbjct: 348 ALWHQVILSIYGSHSNGWDVNNNVRWSHRCPWKAIALVFQEFSKFT------------RF 395

Query: 342 FLGSGNFINFWLDLCIDDXXXXXXXXXXXXXAKNKRSKICECGFFNNGRWVWQIKWLRDF 521
           +G G+ I FW DL D +K + I ++ + W + R+
Sbjct: 396 VVGDGDRIRFWDDLWWGDQTLGTQYPRLLSVVTDKNAPISSILGYSR-PFSWNFNFRRNL 454

Query: 522 RSIEDEYWAQLKRRLERLSIIPKSENQLIWT 614
           E E L R L+RL I P ++ W+
Sbjct: 455 TDSEIEDLESLMRSLDRLHISPSVPDKRSWS 485

Score = 34.3 bits (77), Expect(2) = 9e-06
Identities = 13/37 (35%), Positives = 20/37 (54%)
Frame = +2

Query: 80  CCSKTLGGVGIPNLKRKNLSLFEKWWWWFRFENGFLW 190
           C K+ GG+G + +N++L KW W + E LW
Sbjct: 314 CKPKSRGGLGFGKISMRNVALLGKWLWRYPREGSALW 350