BLASTX nr result
ID: Akebia27_contig00030739
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00030739
(325 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    77    3e-12
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...    75    7e-12
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    72    1e-10
gb|AAB84340.1| putative non-LTR retroelement reverse transcripta...    71    2e-10
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...    70    4e-10
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    69    5e-10
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    69    9e-10
gb|EPS58553.1| hypothetical protein M569_16261, partial [Genlise...    67    3e-09
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...    67    3e-09
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...    66    4e-09
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    66    4e-09
ref|XP_007031316.1| Uncharacterized protein TCM_016767 [Theobrom...    66    4e-09
ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobrom...    66    6e-09
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...    65    8e-09
ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250...    64    2e-08
ref|XP_004243111.1| PREDICTED: putative ribonuclease H protein A...    64    2e-08
ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260...    64    2e-08
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    64    2e-08
ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, part...    64    3e-08
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    63    4e-08

>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
Length = 1954

Score = 77.0 bits (188), Expect = 3e-12
Identities = 44/96 (45%), Positives = 54/96 (56%), Gaps = 1/96 (1%)
Frame = -1

Query: 322 EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTS-DQNSRLCISPSEKEVFQAIK 146
EDP+ I AV FQ+LL + Q C F L R S N LC +PS KE+ + +
Sbjct: 969 EDPQYIQNSAVQYFQNLLTAEQ-CDFSRFDPSLIPRTISITDNEFLCAAPSLKEIKEVVF 1027

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
++ GPDGFS LFY CWDIIK DLL AV +F
Sbjct: 1028 NIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDF 1063

>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
Length = 2214

Score = 75.5 bits (184), Expect = 7e-12
Identities = 42/99 (42%), Positives = 54/99 (54%), Gaps = 1/99 (1%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
LE+P I V FQ+LL + Q C + R+ S N LC +PS +EV +A+
Sbjct: 1229 LEEPHLIQNSGVEFFQNLLKAEQ-CDISRFDPSITPRIISTTDNEFLCATPSLQEVKEAV 1287

Query: 148 KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
+N GPDGFS LFY CWDIIK DL AV +F +
Sbjct: 1288 FNINKDSVAGPDGFSSLFYQHCWDIIKQDLFEAVLDFFK 1326

>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
Length = 2215

Score = 71.6 bits (174), Expect = 1e-10
Identities = 39/97 (40%), Positives = 55/97 (56%), Gaps = 1/97 (1%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
+EDP+ + + A++ F SLL + + C L + SD N LC P+ +EV +A+
Sbjct: 1228 IEDPEQLQQSAIDFFSSLLKA-ESCDDTRFQSSLCPSIISDTDNGFLCAEPTLQEVKEAV 1286

Query: 148 KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
G++ A GPDGFS FY +CWDII DL AV F
Sbjct: 1287 FGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEF 1323

>gb|AAB84340.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1094

Score = 70.9 bits (172), Expect = 2e-10
Identities = 32/91 (35%), Positives = 55/91 (60%)
Frame = -1

Query: 304 GREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHGA 125
G+ AV F+ L +S+ S ++LEG N RVT D N L +E+E+++A+ +N A
Sbjct: 115 GKIAVTFFEDLFSSSYPSSMDSVLEGFNKRVTEDMNQDLTKKVNEQEIYKAVFSINAESA 174

Query: 124 TGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
GPDGF+ LF+ + W ++K +++ + F +
Sbjct: 175 PGPDGFTALFFQRQWPLVKNQIISDIELFFQ 205

>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana (fragment)
Length = 1365

Score = 69.7 bits (169), Expect = 4e-10
Identities = 34/89 (38%), Positives = 52/89 (58%)
Frame = -1

Query: 304 GREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHGA 125
G+ A + F++L ST + N LEGL +VTS+ N L +E EV+ A+ +N A
Sbjct: 385 GKIASSFFENLFTSTYILTHNNHLEGLQAKVTSEMNHNLIQEVTELEVYNAVFSINKESA 444

Query: 124 TGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
GPDGF+ LF+ + WD++K +L + F
Sbjct: 445 PGPDGFTALFFQQHWDLVKHQILTEIFGF 473

>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
Length = 2249

Score = 69.3 bits (168), Expect = 5e-10
Identities = 39/97 (40%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
+ED + + A+ F SLL + C L + S+ +N LC PS +EV A+
Sbjct: 1263 IEDQEQLKHSAIEYFSSLLK-VEPCYDSRFQSSLIPSIISNSENELLCAEPSLQEVKDAV 1321

Query: 148 KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
G+N A GPDGFS FY +CW+II DLL AV +F
Sbjct: 1322 FGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDF 1358

>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
Length = 2127

Score = 68.6 bits (166), Expect = 9e-10
Identities = 38/98 (38%), Positives = 54/98 (55%), Gaps = 1/98 (1%)
Frame = -1

Query: 322 EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAIK 146
+D +I + A + F+ L+ + + C L R+ S N LC +P +E+ +A+
Sbjct: 1143 DDIHSIQKSATDFFRDLMQA-ENCDLSRFDPSLIPRIISSADNEFLCAAPPLQEIKEAVF 1201

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
+N GPDGFS LFY CWDIIK DLL AV +F R
Sbjct: 1202 NINKDSVAGPDGFSSLFYQHCWDIIKNDLLDAVLDFFR 1239

>gb|EPS58553.1| hypothetical protein M569_16261, partial [Genlisea aurea]
Length = 398

Score = 67.0 bits (162), Expect = 3e-09
Identities = 37/99 (37%), Positives = 56/99 (56%), Gaps = 2/99 (2%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKC--SFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQA 152
+EDP I RE + ++ L S+ C + ++ + RVT++ N +L + +E EV+ A
Sbjct: 259 IEDPADIQREFLAFYEQLFTSSAPCREAISEVVRTIPRRVTNEMNDKLIQAFTEDEVWFA 318

Query: 151 IKGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFL 35
+K MN A GPDGF LFY W IIK + +V +FL
Sbjct: 319 VKQMNAESAPGPDGFPPLFYQNYWPIIKEETCCSVLDFL 357

>ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum lycopersicum]
Length = 1333

Score = 67.0 bits (162), Expect = 3e-09
Identities = 35/96 (36%), Positives = 53/96 (55%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIK 146
++ + I + A + ++ + ++L+ +N +T +QN L P E+ + I
Sbjct: 331 IKGEEEIAKHACDYYEKIFTGMNGKIKEDILQCINPMITQEQNKDLDRIPDMDELRRTIM 390

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
MN H A GPDGF G FY C+DIIK DLLAAV +F
Sbjct: 391 SMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKHF 426

>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
Length = 2367

Score = 66.2 bits (160), Expect = 4e-09
Identities = 37/97 (38%), Positives = 54/97 (55%), Gaps = 1/97 (1%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
+ED + + + A+ F SLL + C L + S+ +N LC P+ +EV A+
Sbjct: 1435 IEDQEQLKQSAIKYFSSLLKF-EPCDDSRFQRSLIPSIISNSENELLCAEPNLQEVKDAV 1493

Query: 148 KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
G++ A GPDGFS FY +CW+II DLL AV +F
Sbjct: 1494 FGIDPESAAGPDGFSSYFYQQCWNIIAHDLLDAVRDF 1530

>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
gi|508715059|gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
Length = 1702

Score = 66.2 bits (160), Expect = 4e-09
Identities = 45/116 (38%), Positives = 57/116 (49%), Gaps = 9/116 (7%)
Frame = -1

Query: 322 EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTS-DQNSRLCISPSEKEVFQAIK 146
EDP I AV FQ LL + Q C L R S N L +PS KE+ + +
Sbjct: 625 EDPLYIQNSAVEFFQKLLRAEQ-CDISRFDFSLIPRTISITDNDFLYAAPSLKEIKEVVF 683

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR--------KVLSRKLSML 2
+ PDGFS LFY CWDIIK DLL AV +F + K+L+ +LS +
Sbjct: 684 NNDKDSVASPDGFSSLFYQHCWDIIKQDLLEAVLDFFKGTPMPQVTKLLANRLSKI 739

>ref|XP_007031316.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
gi|508710345|gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
Length = 1707

Score = 66.2 bits (160), Expect = 4e-09
Identities = 38/98 (38%), Positives = 54/98 (55%), Gaps = 1/98 (1%)
Frame = -1

Query: 322 EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAIK 146
+D +I + A + F++L+ + + C L R+ S N LC +PS +EV + +
Sbjct: 1100 DDTHSIQKSATDFFRNLMQA-ENCDNSRFDPSLIPRIISSADNEFLCAAPSLQEVKETVF 1158

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
+N G DGFS LFY CWDIIK DLL AV +F R
Sbjct: 1159 NINKDSVAGSDGFSSLFYQHCWDIIKHDLLDAVLDFFR 1196

>ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
gi|508704886|gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
Length = 1659

Score = 65.9 bits (159), Expect = 6e-09
Identities = 38/99 (38%), Positives = 52/99 (52%), Gaps = 4/99 (4%)
Frame = -1

Query: 322 EDPKAIGREAVNEFQSLLNSTQKCS----FGNLLEGLNCRVTSDQNSRLCISPSEKEVFQ 155
+DP +I R +N+ + LN F + L +T N LC +PS KE+ +
Sbjct: 864 QDPSSINRNLMNKAYAKLNRQLSIEELFWFDSSLIPRTISITD--NEFLCAAPSLKEINE 921

Query: 154 AIKGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
+ ++ GPDGFS LFY CWDIIK DLL AV +F
Sbjct: 922 VVFNIDKDSVVGPDGFSSLFYQHCWDIIKQDLLEAVLDF 960

>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
Length = 1946

Score = 65.5 bits (158), Expect = 8e-09
Identities = 34/87 (39%), Positives = 48/87 (55%)
Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
I + A FQ + + N+L+ + VT +QN L P+++E Q + MN +
Sbjct: 164 IAKAACVYFQETFTGHENRNAENILQCITRMVTEEQNQNLKALPTKEESKQVVYSMNPNS 223

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAV 47
A GPDGF G FY CWDII+ +LL AV
Sbjct: 224 APGPDGFGGKFYQACWDIIQDELLEAV 250

>ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250876, partial [Solanum lycopersicum]
Length = 445

Score = 64.3 bits (155), Expect = 2e-08
Identities = 33/90 (36%), Positives = 50/90 (55%)
Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
I ++A ++ + + ++L+ + +T +QN L P E+ + I MN H
Sbjct: 102 IAKKACEYYEEIFTGKNETIKEDILQCITPMITQEQNDGLDRLPDMDELRRIIMSMNPHS 161

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
A GPDGF G FY C+DIIK DLL AV++F
Sbjct: 162 APGPDGFGGKFYQVCFDIIKKDLLDAVNHF 191

>ref|XP_004243111.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum lycopersicum]
Length = 927

Score = 64.3 bits (155), Expect = 2e-08
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 1/101 (0%)
Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
I +EA + +Q++ +L++ + VT +QN L PS E+ I GMN +
Sbjct: 14 IAKEACDYYQNIFTGKSDKINEDLVQCIPELVTEEQNYDLDKMPSVDELKGIIMGMNPNS 73

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR-KVLSRKLS 8
A GPDG G FY C+DIIK DLLAAV++F K++ R ++
Sbjct: 74 APGPDGIGGKFYQFCFDIIKEDLLAAVNSFFSGKIMPRYMT 114

>ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260201 [Solanum lycopersicum]
Length = 1531

Score = 64.3 bits (155), Expect = 2e-08
Identities = 33/90 (36%), Positives = 54/90 (60%)
Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
+ +EA + +Q++ + LL+ + +T +QNS L P+ +E+ I MN +
Sbjct: 681 VAKEACDYYQNMFTGKSEKIKEELLQNIPELITLEQNSDLDKLPTVEELKNTIMSMNPNS 740

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
A GPDG G FY +C+DII+ D+LAAV++F
Sbjct: 741 APGPDGIGGKFYQECFDIIQEDMLAAVNSF 770

>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
Length = 2251

Score = 63.9 bits (154), Expect = 2e-08
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
+ED + + + A+ F SLL + + C L + S+ +N LC P+ +EV A+
Sbjct: 1265 IEDQEQLKQSAIEYFSSLLKA-EPCDISRFQNSLIPSIISNSENELLCAEPNLQEVKDAV 1323

Query: 148 KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
++ A GPDGFS FY +CW+ I DLL AV +F
Sbjct: 1324 FDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDF 1360

>ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, partial [Prunus persica]
gi|462399232|gb|EMJ04900.1| hypothetical protein PRUPE_ppa020995mg, partial [Prunus persica]
Length = 1367

Score = 63.5 bits (153), Expect = 3e-08
Identities = 33/94 (35%), Positives = 53/94 (56%)
Frame = -1

Query: 313 KAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNV 134
+ + + VN FQ L +ST + +++G+ RVT + N L + +E+ A+ M+
Sbjct: 657 QGLTQTVVNYFQHLFSSTGSSEYTGVVDGVRGRVTEEMNQTLLAEFTPEEIKIALFQMHP 716

Query: 133 HGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
A GPDGFS FY K W I+ D++AAV +F +
Sbjct: 717 SKAPGPDGFSPFFYQKYWQIVGEDVVAAVLHFFK 750

>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
Length = 1368

Score = 63.2 bits (152), Expect = 4e-08
Identities = 35/98 (35%), Positives = 51/98 (52%)
Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIK 146
+E+P I AV F++LL + E + ++ N+ LC P +EV A+
Sbjct: 349 MEEPGLIESSAVEFFENLLKAENYDLSRFKAEFIPQMLSDADNNLLCAEPQLQEVKDAVF 408

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
++ GPDGFS FY +CW II DLLAAV +F +
Sbjct: 409 AIDKDSVVGPDGFSSFFYQQCWPIIAEDLLAAVRDFFK 446