BLASTX nr result
ID: Akebia24_contig00042930
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00042930
         (408 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...   127   1e-27
ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom...   120   2e-25
ref|XP_007048014.1| Gag-pol polyprotein-like protein [Theobroma ...   112   5e-23
ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom...   107   2e-21
ref|XP_004292437.1| PREDICTED: uncharacterized protein LOC101306...   103   3e-20
gb|ADP20181.1| mutant gag-pol polyprotein [Pisum sativum]              91   2e-16
ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial ...    88   1e-15
gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum]              86   4e-15
ref|XP_006494982.1| PREDICTED: uncharacterized protein LOC102624...    85   9e-15
ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [The...    84   2e-14
emb|CAN81775.1| hypothetical protein VITISV_020071 [Vitis vinifera]    84   2e-14
gb|AAM15062.1| putative retroelement integrase [Arabidopsis thal...    84   2e-14
gb|AAD17351.1| contains similarity to retrovirus-related polypro...    83   4e-14
ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobrom...    83   4e-14
gb|AAQ72729.1| putative gag-pol polyprotein [Petunia x hybrida]        83   5e-14
emb|CAN82678.1| hypothetical protein VITISV_009305 [Vitis vinifera]    82   6e-14
ref|XP_006607002.1| PREDICTED: uncharacterized protein LOC100788...    82   8e-14
ref|XP_002322039.2| hypothetical protein POPTR_0015s02634g [Popu...    82   8e-14
ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [The...    82   8e-14
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]      82   8e-14

>ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao]
 gi|508704828|gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao]
          Length = 794

 Score = 127 bits (320), Expect = 1e-27
 Identities = 63/112 (56%), Positives = 82/112 (73%), Gaps = 4/112 (3%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEASGEE----EFEDKKRGHNEEQEAEDITWSDHGETL 216
           FKC GFGHIA DCPNRR I+L EE EE E +D+ N E E E+++ +DHGE L
Sbjct: 299 FKCQGFGHIASDCPNRRIISLIEEEVMEEPSLEEVDDELEIFNNE-EIEEVS-ADHGEAL 356

Query: 215 VIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMV 60
           V+RR++N + + E ++WLR+NIFHTRCT+ GKVCN+IID GSCEN++ MV
Sbjct: 357 VVRRNLNTAMLTEDESWLRHNIFHTRCTSQGKVCNVIIDSGSCENVIANYMV 408

>ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
 gi|508718388|gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
          Length = 505

 Score = 120 bits (302), Expect = 2e-25
 Identities = 57/111 (51%), Positives = 83/111 (74%), Gaps = 2/111 (1%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEA--SGEEEFEDKKRGHNEEQEAEDITWSDHGETLVI 210
           FKC GFGHIA DCPNRR I+L EE + E+ E +++E E E+++ +DHGE L++
Sbjct: 150 FKCQGFGHIAFDCPNRRIISLVEEEDYANWEKLEPVYDEYDDE-EIEEVS-ADHGEALIV 207

Query: 209 RRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           RR++N + + + ++WLR+NIF+TRCT+ GKVCN+IID GSCEN++ MV+
Sbjct: 208 RRNLNTAMMTKDESWLRHNIFYTRCTSQGKVCNVIIDSGSCENVIANYMVE 258

>ref|XP_007048014.1| Gag-pol polyprotein-like protein [Theobroma cacao]
 gi|508700275|gb|EOX92171.1| Gag-pol polyprotein-like protein [Theobroma cacao]
          Length = 399

 Score = 112 bits (280), Expect = 5e-23
 Identities = 54/111 (48%), Positives = 80/111 (72%), Gaps = 2/111 (1%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNE--EQEAEDITWSDHGETLVI 210
           FKC GFGHIA DC NRR I+L EE +E K ++E ++E E+++ +DHGE L++
Sbjct: 275 FKCQGFGHIASDCSNRRIISLVEEED-YANWEKLKPVYDEYDDEEIEEVS-ADHGEALIV 332

Query: 209 RRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           RR++N + + + ++W R+NIF+TRCT+ GKVCN+IID GS EN++ MV+
Sbjct: 333 RRNLNTAMMTKDESWFRHNIFYTRCTSQGKVCNVIIDSGSYENVIANYMVE 383

>ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
 gi|508726763|gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
          Length = 625

 Score = 107 bits (266), Expect = 2e-21
 Identities = 53/111 (47%), Positives = 78/111 (70%), Gaps = 2/111 (1%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEAS--GEEEFEDKKRGHNEEQEAEDITWSDHGETLVI 210
           FKC FGHIA DCP+RR I+L EE E+ E +++E E E+++ +DHGE ++
Sbjct: 270 FKCQRFGHIASDCPSRRIISLVEEEDYVNWEKLEPVYDEYDDE-EIEEVS-ADHGEAFIV 327

Query: 209 RRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           RR++N + + + ++ LR+NIF+TRCT+ G VCN+IID GSCEN+V MV+
Sbjct: 328 RRNLNTALMTKDESCLRHNIFYTRCTSQGNVCNVIIDSGSCENVVANYMVE 378

>ref|XP_004292437.1| PREDICTED: uncharacterized protein LOC101306407 [Fragaria vesca subsp. vesca]
          Length = 1300

 Score = 103 bits (257), Expect = 3e-20
 Identities = 49/99 (49%), Positives = 67/99 (67%), Gaps = 3/99 (3%)
 Frame = -2

Query: 407 SNSKATQGFKCLGFGHIAMDCPNRRFINLTEEA--SGEEEFEDKKRGHNE-EQEAEDITW 237
           + K + FKC G GHIA DCPNRR +NL EE S +D + +Q+ E+ITW
Sbjct: 431 NKGKNVKCFKCSGLGHIASDCPNRRVVNLVEELGESSSAGLDDMPTSDDYGDQDEEEITW 490

Query: 236 SDHGETLVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGK 120
           SDHGE+LVIR++M+ S +E+ WL++NIFHT+CT+ GK
Sbjct: 491 SDHGESLVIRQTMSASKVEDDSEWLKHNIFHTKCTSNGK 529

>gb|ADP20181.1| mutant gag-pol polyprotein [Pisum sativum]
          Length = 572

 Score = 90.5 bits (223), Expect = 2e-16
 Identities = 53/116 (45%), Positives = 71/116 (61%)
 Frame = -2

Query: 407 SNSKATQGFKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAEDITWSDH 228
           S +K+ + FKC G GHIA CP +R + L EE G E ED G +E+ E+I
Sbjct: 288 STNKSVKCFKCQGQGHIASQCPTKRTM-LMEENEGIVEEED---GDYDEEFEEEIP---S 340

Query: 227 GETLVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMV 60
           G+ L++RR + S I+E+D R N+FHTRC GKVC+LIIDGGSC N+ +V
Sbjct: 341 GDLLMVRRMLG-SQIKEEDTGQRENLFHTRCFVQGKVCSLIIDGGSCTNVASTRLV 395

>ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
 gi|508712364|gb|EOY04261.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
          Length = 215

 Score = 88.2 bits (217), Expect = 1e-15
 Identities = 40/81 (49%), Positives = 61/81 (75%)
 Frame = -2

Query: 299 EEFEDKKRGHNEEQEAEDITWSDHGETLVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGK 120
           +E +D+ N E E E+++ +DHGE LV+RR++N + + + ++WLR+NIFH RCT+ GK
Sbjct: 7   KEVDDELEIFNNE-EIEEVS-ADHGEALVVRRNLNTAMMTKDESWLRHNIFHARCTSQGK 64

Query: 119 VCNLIIDGGSCENMVFQEMVD 57
           VCN+IID GSCEN++ MV+
Sbjct: 65  VCNVIIDSGSCENVIANYMVE 85

>gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum]
          Length = 1004

 Score = 86.3 bits (212), Expect = 4e-15
 Identities = 47/116 (40%), Positives = 70/116 (60%)
 Frame = -2

Query: 407 SNSKATQGFKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAEDITWSDH 228
           S +K+ + FKC G GHIA CP +R + + E EE +++ G +++ E+I
Sbjct: 288 STNKSVKCFKCQGQGHIASQCPTKRTMLMEEN----EEIVEEEDGDYDKEFGEEIP---S 340

Query: 227 GETLVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMV 60
           G+ L++RR + S I+E+D R N+FH RC GKVC+LIIDGGSC N+ +V
Sbjct: 341 GDLLMVRRMLG-SQIKEEDTSQRENLFHIRCFVQGKVCSLIIDGGSCTNVASTRLV 395

>ref|XP_006494982.1| PREDICTED: uncharacterized protein LOC102624489 [Citrus sinensis]
          Length = 1083

 Score = 85.1 bits (209), Expect = 9e-15
 Identities = 46/118 (38%), Positives = 69/118 (58%), Gaps = 1/118 (0%)
 Frame = -2

Query: 407 SNSKATQGFKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAED-ITWSD 231
           S ++ + FKCLG HIA CPN+R + L ++ E E E ++A D + +S
Sbjct: 147 SRNRDIKCFKCLGTCHIASQCPNKRAMILRDDGDVETESESDDDPMPPLEDANDGVEYSI 206

Query: 230 HGETLVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           G+ +V RR++N ++E R+NIFHTRC KVC++IIDGGSC N+ +V+
Sbjct: 207 DGKLMVARRALNTQ-VKEDAEVQRDNIFHTRCHIKDKVCSMIIDGGSCTNVAITSLVE 263

>ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508702148|gb|EOX94044.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 546

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 48/109 (44%), Positives = 65/109 (59%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVIRR 204
           F C GHI+ CP RR +NL E EE E + EE E D+ + GE+LV+RR
Sbjct: 262 FTCGEKGHISFACPQRR-VNLAELG---EELEPVYDEYEEEVEEIDV-YPAQGESLVVRR 316

Query: 203 SMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           M + EE ++W R +IF TR GKVC+L+IDGGS EN++ +E V+
Sbjct: 317 VMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAVN 365

>emb|CAN81775.1| hypothetical protein VITISV_020071 [Vitis vinifera]
          Length = 1159

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 42/105 (40%), Positives = 62/105 (59%), Gaps = 2/105 (1%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFI--NLTEEASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVI 210
           F+CLG GHIA CPN+R + + E E E +D + E+ +++ + GE+LV
Sbjct: 301 FRCLGVGHIASQCPNKRTMIARVDGEVETESEEDDDQMPSLEDACDDNVEYPVEGESLVA 360

Query: 209 RRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMV 75
           RR+++ ++ R NIFHTRC KVC++IIDGGSC N +
Sbjct: 361 RRALSAQVKKDDMEQQRENIFHTRCHINNKVCSMIIDGGSCTNFL 405

>gb|AAM15062.1| putative retroelement integrase [Arabidopsis thaliana]
          Length = 1215

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 46/110 (41%), Positives = 63/110 (57%), Gaps = 2/110 (1%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTE--EASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVI 210
           +KC G GH A +CPN+R + L + E EEE D E +E GE LV
Sbjct: 160 YKCQGKGHYANECPNKRVMILLDNGEIEPEEEIPDSPSSLKENEELPA-----QGELLVA 214

Query: 209 RRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMV 60
           RR++++ ++ R N+FHTRC +GKVC+LIIDGGSC N+ + MV
Sbjct: 215 RRTLSVQTKTDEQEQ-RKNLFHTRCHVHGKVCSLIIDGGSCTNVASETMV 263

>gb|AAD17351.1| contains similarity to retrovirus-related polyproteins and to
 CCHC zinc finger protein (Pfam: PF00098, Score=16.3, E=0.051, E= 1)
 [Arabidopsis thaliana]
 gi|7267432|emb|CAB77944.1| putative polyprotein [Arabidopsis thaliana]
          Length = 1138

 Score = 83.2 bits (204), Expect = 4e-14
 Identities = 49/109 (44%), Positives = 65/109 (59%), Gaps = 1/109 (0%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVIRR 204
           FKC G GH A +C N+R + + + SGE E ED+K E D+ + GE LV R
Sbjct: 281 FKCHGLGHYASECSNKRIMIIRD--SGEVESEDEK------PEESDVEEAPKGELLVTMR 332

Query: 203 SMN-LSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMV 60
           ++ L+ EE+ R N+FHTRC GKVC+LIIDGGSC N+ + MV
Sbjct: 333 VLSVLNKAEEQAQ--RENLFHTRCLIKGKVCSLIIDGGSCTNVASETMV 379

>ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
 gi|508716797|gb|EOY08694.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
          Length = 440

 Score = 83.2 bits (204), Expect = 4e-14
 Identities = 34/70 (48%), Positives = 57/70 (81%)
 Frame = -2

Query: 266 EEQEAEDITWSDHGETLVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSC 87
           +++E E+++ +DHGE L++RR++N + + + ++WLR+NIF+TR T+ GKVCN+IID GSC
Sbjct: 125 DDEEIEEVS-ADHGEALIVRRNLNTAMMTKDESWLRHNIFYTRYTSQGKVCNVIIDSGSC 183

Query: 86  ENMVFQEMVD 57
           EN++ MV+
Sbjct: 184 ENVIANYMVE 193

>gb|AAQ72729.1| putative gag-pol polyprotein [Petunia x hybrida]
          Length = 803

 Score = 82.8 bits (203), Expect = 5e-14
 Identities = 45/121 (37%), Positives = 65/121 (53%), Gaps = 7/121 (5%)
 Frame = -2

Query: 401 SKATQGFKCLGFGHIAMDCPNRRFINLT-----EEASGEEEFEDKKRGHNEEQEAEDITW 237
           S + Q KC G+GH A +CP +R + + E+ S EE E + E++ ED T
Sbjct: 377 SSSIQCHKCKGYGHFAKECPTKRTMVVVVEHAYEQESEPEEDEGVEGDEGVEEDGEDDTV 436

Query: 236 SDHGET--LVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEM 63
           D LV+RRS+ E+ + R N+FH RC G VC++IID GSC N+V Q +
Sbjct: 437 EDEEPHTFLVVRRSLGAMIAEDGETLQRENLFHARCRVKGVVCSMIIDSGSCTNVVSQSL 496

Query: 62  V 60
           +
Sbjct: 497 I 497

>emb|CAN82678.1| hypothetical protein VITISV_009305 [Vitis vinifera]
          Length = 417

 Score = 82.4 bits (202), Expect = 6e-14
 Identities = 43/111 (38%), Positives = 63/111 (56%), Gaps = 2/111 (1%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFI--NLTEEASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVI 210
           F+CLG GHIA CPN+R + + E E E +D E+ +++ + GE+LV
Sbjct: 306 FRCLGVGHIASQCPNKRIMIARVDGEVEIESEEDDDLMPSLEDACDDNVEYLVEGESLVA 365

Query: 209 RRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           R +++ E+ R NIFHTRC KVC++IIDGGSC N+ +V+
Sbjct: 366 RLALSSQVKEDDMEQQRENIFHTRCHINNKVCSMIIDGGSCTNVASTTLVE 416

>ref|XP_006607002.1| PREDICTED: uncharacterized protein LOC100788838 [Glycine max]
          Length = 519

 Score = 82.0 bits (201), Expect = 8e-14
 Identities = 43/108 (39%), Positives = 63/108 (58%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVIRR 204
           FKCLG GHIA +CP RR + + + E E + EE+ E+ G+ L++RR
Sbjct: 314 FKCLGRGHIASECPTRRTMIMKADGEITSESEISEEEVEEEEYGEEAM---QGDMLMVRR 370

Query: 203 SMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMV 60
           + + ++ D+ R NIFHTRC GK+C+LI+DGGSC N+ +V
Sbjct: 371 LLG-NQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVASSTLV 417

>ref|XP_002322039.2| hypothetical protein POPTR_0015s02634g [Populus trichocarpa]
 gi|550321818|gb|EEF06166.2| hypothetical protein POPTR_0015s02634g [Populus trichocarpa]
          Length = 283

 Score = 82.0 bits (201), Expect = 8e-14
 Identities = 45/103 (43%), Positives = 63/103 (61%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVIRR 204
           F CLG GHIA CPNRR + LT + +GE E ++ ++ E+I + GE LVI R
Sbjct: 158 FMCLGKGHIASQCPNRR-VMLTRD-NGEVGSESEEMPPLVDRSDEEIAYPVEGEALVISR 215

Query: 203 SMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMV 75
           ++N+ E+ + NIFHTRC KVC++IIDGGS N++
Sbjct: 216 ALNIQIKEDDVDQQWENIFHTRCHIQNKVCSMIIDGGSDANLM 258

>ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508716479|gb|EOY08376.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 558

 Score = 82.0 bits (201), Expect = 8e-14
 Identities = 47/109 (43%), Positives = 64/109 (58%)
 Frame = -2

Query: 383 FKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAEDITWSDHGETLVIRR 204
           F C GH + CP RR +NL E EE E + EE E D+ + GE+LV+RR
Sbjct: 258 FTCGEKGHTSFACPQRR-VNLAELG---EELEPVYDEYEEEVEEIDV-YPAQGESLVVRR 312

Query: 203 SMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           M + EE ++W R +IF TR GKVC+L+IDGGS EN++ +E V+
Sbjct: 313 VMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAVN 361

>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score = 82.0 bits (201), Expect = 8e-14
 Identities = 43/118 (36%), Positives = 67/118 (56%), Gaps = 1/118 (0%)
 Frame = -2

Query: 407 SNSKATQGFKCLGFGHIAMDCPNRRFINLTEEASGEEEFEDKKRGHNEEQEAE-DITWSD 231
           + S Q FKC G GH+A +CPN R I + ++ E E+++ EE E DI +
Sbjct: 381 TKSSGIQCFKCGGRGHVARECPNNRTIVVNDQGEYESTSEEEQEDSEEENNLEKDICEFE 440

Query: 230 HGETLVIRRSMNLSCIEEKDNWLRNNIFHTRCTAYGKVCNLIIDGGSCENMVFQEMVD 57
           G LV+ + +++ + + +N R+N+F TR KV +IIDGGSC N+ +EMV+
Sbjct: 441 SGAALVVTQILSVQ-MSDAENGQRHNLFQTRAKVQDKVVKVIIDGGSCHNLASKEMVE 497
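
Each hit in the report above carries the same set of statistics (Score, Expect, Identities, Positives, an optional Gaps field, and Frame). As a minimal sketch of how those fields could be pulled out for downstream filtering, the following uses only Python's standard `re` module. The names `STATS_RE` and `parse_hit_stats` are hypothetical helpers introduced here for illustration, not part of BLAST or of any BLAST parsing library, and the pattern assumes the field layout seen in this BLASTX 2.2.25 report.

```python
import re

# Hypothetical pattern for the per-hit statistics as they appear in this report.
# \s* between fields lets it match whether the fields share a line or not.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?\s*"
    r"Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hit_stats(text):
    """Return the first hit's statistics found in `text` as a dict, or None."""
    m = STATS_RE.search(text)
    if m is None:
        return None
    evalue = m.group("evalue")
    if evalue.lower().startswith("e"):
        # Some BLAST versions print very small values as "e-100"; read as 1e-100.
        evalue = "1" + evalue
    ident = int(m.group("ident"))
    length = int(m.group("length"))
    return {
        "bits": float(m.group("bits")),
        "raw_score": int(m.group("raw")),
        "evalue": float(evalue),
        "identities": ident,
        "positives": int(m.group("pos")),
        "gaps": int(m.group("gaps") or 0),  # Gaps field is absent for ungapped hits
        "align_length": length,
        "frame": int(m.group("frame")),
        "pct_identity": 100.0 * ident / length,  # unrounded, unlike the report
    }


if __name__ == "__main__":
    first_hit = (
        "Score = 127 bits (320), Expect = 1e-27 "
        "Identities = 63/112 (56%), Positives = 82/112 (73%), "
        "Gaps = 4/112 (3%) Frame = -2"
    )
    print(parse_hit_stats(first_hit))
```

Applied to the top hit (XP_007052567.1), this gives an alignment length of 112 with 63 identities and 4 gaps in frame -2. Hits such as ADP20181.1 print no Gaps field, which the sketch reports as zero gaps.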