BLASTX nr result

ID: Akebia27_contig00030739 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00030739
         (325 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    77   3e-12
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...    75   7e-12
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    72   1e-10
gb|AAB84340.1| putative non-LTR retroelement reverse transcripta...    71   2e-10
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...    70   4e-10
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    69   5e-10
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    69   9e-10
gb|EPS58553.1| hypothetical protein M569_16261, partial [Genlise...    67   3e-09
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...    67   3e-09
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...    66   4e-09
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    66   4e-09
ref|XP_007031316.1| Uncharacterized protein TCM_016767 [Theobrom...    66   4e-09
ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobrom...    66   6e-09
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...    65   8e-09
ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250...    64   2e-08
ref|XP_004243111.1| PREDICTED: putative ribonuclease H protein A...    64   2e-08
ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260...    64   2e-08
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    64   2e-08
ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, part...    64   3e-08
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    63   4e-08

>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 77.0 bits (188), Expect = 3e-12
 Identities = 44/96 (45%), Positives = 54/96 (56%), Gaps = 1/96 (1%)
 Frame = -1

Query: 322  EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTS-DQNSRLCISPSEKEVFQAIK 146
            EDP+ I   AV  FQ+LL + Q C F      L  R  S   N  LC +PS KE+ + + 
Sbjct: 969  EDPQYIQNSAVQYFQNLLTAEQ-CDFSRFDPSLIPRTISITDNEFLCAAPSLKEIKEVVF 1027

Query: 145  GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
             ++     GPDGFS LFY  CWDIIK DLL AV +F
Sbjct: 1028 NIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDF 1063


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 75.5 bits (184), Expect = 7e-12
 Identities = 42/99 (42%), Positives = 54/99 (54%), Gaps = 1/99 (1%)
 Frame = -1

Query: 325  LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
            LE+P  I    V  FQ+LL + Q C        +  R+ S   N  LC +PS +EV +A+
Sbjct: 1229 LEEPHLIQNSGVEFFQNLLKAEQ-CDISRFDPSITPRIISTTDNEFLCATPSLQEVKEAV 1287

Query: 148  KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
              +N     GPDGFS LFY  CWDIIK DL  AV +F +
Sbjct: 1288 FNINKDSVAGPDGFSSLFYQHCWDIIKQDLFEAVLDFFK 1326


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 39/97 (40%), Positives = 55/97 (56%), Gaps = 1/97 (1%)
 Frame = -1

Query: 325  LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
            +EDP+ + + A++ F SLL + + C        L   + SD  N  LC  P+ +EV +A+
Sbjct: 1228 IEDPEQLQQSAIDFFSSLLKA-ESCDDTRFQSSLCPSIISDTDNGFLCAEPTLQEVKEAV 1286

Query: 148  KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
             G++   A GPDGFS  FY +CWDII  DL  AV  F
Sbjct: 1287 FGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEF 1323


>gb|AAB84340.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 1094

 Score = 70.9 bits (172), Expect = 2e-10
 Identities = 32/91 (35%), Positives = 55/91 (60%)
 Frame = -1

Query: 304 GREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHGA 125
           G+ AV  F+ L +S+   S  ++LEG N RVT D N  L    +E+E+++A+  +N   A
Sbjct: 115 GKIAVTFFEDLFSSSYPSSMDSVLEGFNKRVTEDMNQDLTKKVNEQEIYKAVFSINAESA 174

Query: 124 TGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
            GPDGF+ LF+ + W ++K  +++ +  F +
Sbjct: 175 PGPDGFTALFFQRQWPLVKNQIISDIELFFQ 205


>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana
           (fragment)
          Length = 1365

 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 34/89 (38%), Positives = 52/89 (58%)
 Frame = -1

Query: 304 GREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHGA 125
           G+ A + F++L  ST   +  N LEGL  +VTS+ N  L    +E EV+ A+  +N   A
Sbjct: 385 GKIASSFFENLFTSTYILTHNNHLEGLQAKVTSEMNHNLIQEVTELEVYNAVFSINKESA 444

Query: 124 TGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
            GPDGF+ LF+ + WD++K  +L  +  F
Sbjct: 445 PGPDGFTALFFQQHWDLVKHQILTEIFGF 473


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 39/97 (40%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
 Frame = -1

Query: 325  LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
            +ED + +   A+  F SLL   + C        L   + S+ +N  LC  PS +EV  A+
Sbjct: 1263 IEDQEQLKHSAIEYFSSLLK-VEPCYDSRFQSSLIPSIISNSENELLCAEPSLQEVKDAV 1321

Query: 148  KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
             G+N   A GPDGFS  FY +CW+II  DLL AV +F
Sbjct: 1322 FGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDF 1358


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 68.6 bits (166), Expect = 9e-10
 Identities = 38/98 (38%), Positives = 54/98 (55%), Gaps = 1/98 (1%)
 Frame = -1

Query: 322  EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAIK 146
            +D  +I + A + F+ L+ + + C        L  R+ S   N  LC +P  +E+ +A+ 
Sbjct: 1143 DDIHSIQKSATDFFRDLMQA-ENCDLSRFDPSLIPRIISSADNEFLCAAPPLQEIKEAVF 1201

Query: 145  GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
             +N     GPDGFS LFY  CWDIIK DLL AV +F R
Sbjct: 1202 NINKDSVAGPDGFSSLFYQHCWDIIKNDLLDAVLDFFR 1239


>gb|EPS58553.1| hypothetical protein M569_16261, partial [Genlisea aurea]
          Length = 398

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 37/99 (37%), Positives = 56/99 (56%), Gaps = 2/99 (2%)
 Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKC--SFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQA 152
           +EDP  I RE +  ++ L  S+  C  +   ++  +  RVT++ N +L  + +E EV+ A
Sbjct: 259 IEDPADIQREFLAFYEQLFTSSAPCREAISEVVRTIPRRVTNEMNDKLIQAFTEDEVWFA 318

Query: 151 IKGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFL 35
           +K MN   A GPDGF  LFY   W IIK +   +V +FL
Sbjct: 319 VKQMNAESAPGPDGFPPLFYQNYWPIIKEETCCSVLDFL 357


>ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum
           lycopersicum]
          Length = 1333

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 35/96 (36%), Positives = 53/96 (55%)
 Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIK 146
           ++  + I + A + ++ +          ++L+ +N  +T +QN  L   P   E+ + I 
Sbjct: 331 IKGEEEIAKHACDYYEKIFTGMNGKIKEDILQCINPMITQEQNKDLDRIPDMDELRRTIM 390

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
            MN H A GPDGF G FY  C+DIIK DLLAAV +F
Sbjct: 391 SMNPHSAPGPDGFGGKFYQVCFDIIKEDLLAAVKHF 426


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 37/97 (38%), Positives = 54/97 (55%), Gaps = 1/97 (1%)
 Frame = -1

Query: 325  LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
            +ED + + + A+  F SLL   + C        L   + S+ +N  LC  P+ +EV  A+
Sbjct: 1435 IEDQEQLKQSAIKYFSSLLKF-EPCDDSRFQRSLIPSIISNSENELLCAEPNLQEVKDAV 1493

Query: 148  KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
             G++   A GPDGFS  FY +CW+II  DLL AV +F
Sbjct: 1494 FGIDPESAAGPDGFSSYFYQQCWNIIAHDLLDAVRDF 1530


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
           gi|508715059|gb|EOY06956.1| Uncharacterized protein
           TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 45/116 (38%), Positives = 57/116 (49%), Gaps = 9/116 (7%)
 Frame = -1

Query: 322 EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTS-DQNSRLCISPSEKEVFQAIK 146
           EDP  I   AV  FQ LL + Q C        L  R  S   N  L  +PS KE+ + + 
Sbjct: 625 EDPLYIQNSAVEFFQKLLRAEQ-CDISRFDFSLIPRTISITDNDFLYAAPSLKEIKEVVF 683

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR--------KVLSRKLSML 2
             +      PDGFS LFY  CWDIIK DLL AV +F +        K+L+ +LS +
Sbjct: 684 NNDKDSVASPDGFSSLFYQHCWDIIKQDLLEAVLDFFKGTPMPQVTKLLANRLSKI 739


>ref|XP_007031316.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
            gi|508710345|gb|EOY02242.1| Uncharacterized protein
            TCM_016767 [Theobroma cacao]
          Length = 1707

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 38/98 (38%), Positives = 54/98 (55%), Gaps = 1/98 (1%)
 Frame = -1

Query: 322  EDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAIK 146
            +D  +I + A + F++L+ + + C        L  R+ S   N  LC +PS +EV + + 
Sbjct: 1100 DDTHSIQKSATDFFRNLMQA-ENCDNSRFDPSLIPRIISSADNEFLCAAPSLQEVKETVF 1158

Query: 145  GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
             +N     G DGFS LFY  CWDIIK DLL AV +F R
Sbjct: 1159 NINKDSVAGSDGFSSLFYQHCWDIIKHDLLDAVLDFFR 1196


>ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
            gi|508704886|gb|EOX96782.1| Uncharacterized protein
            TCM_005953 [Theobroma cacao]
          Length = 1659

 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 38/99 (38%), Positives = 52/99 (52%), Gaps = 4/99 (4%)
 Frame = -1

Query: 322  EDPKAIGREAVNEFQSLLNSTQKCS----FGNLLEGLNCRVTSDQNSRLCISPSEKEVFQ 155
            +DP +I R  +N+  + LN          F + L      +T   N  LC +PS KE+ +
Sbjct: 864  QDPSSINRNLMNKAYAKLNRQLSIEELFWFDSSLIPRTISITD--NEFLCAAPSLKEINE 921

Query: 154  AIKGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
             +  ++     GPDGFS LFY  CWDIIK DLL AV +F
Sbjct: 922  VVFNIDKDSVVGPDGFSSLFYQHCWDIIKQDLLEAVLDF 960


>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score = 65.5 bits (158), Expect = 8e-09
 Identities = 34/87 (39%), Positives = 48/87 (55%)
 Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
           I + A   FQ      +  +  N+L+ +   VT +QN  L   P+++E  Q +  MN + 
Sbjct: 164 IAKAACVYFQETFTGHENRNAENILQCITRMVTEEQNQNLKALPTKEESKQVVYSMNPNS 223

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAV 47
           A GPDGF G FY  CWDII+ +LL AV
Sbjct: 224 APGPDGFGGKFYQACWDIIQDELLEAV 250


>ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250876, partial [Solanum
           lycopersicum]
          Length = 445

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 33/90 (36%), Positives = 50/90 (55%)
 Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
           I ++A   ++ +     +    ++L+ +   +T +QN  L   P   E+ + I  MN H 
Sbjct: 102 IAKKACEYYEEIFTGKNETIKEDILQCITPMITQEQNDGLDRLPDMDELRRIIMSMNPHS 161

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
           A GPDGF G FY  C+DIIK DLL AV++F
Sbjct: 162 APGPDGFGGKFYQVCFDIIKKDLLDAVNHF 191


>ref|XP_004243111.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
           lycopersicum]
          Length = 927

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 1/101 (0%)
 Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
           I +EA + +Q++          +L++ +   VT +QN  L   PS  E+   I GMN + 
Sbjct: 14  IAKEACDYYQNIFTGKSDKINEDLVQCIPELVTEEQNYDLDKMPSVDELKGIIMGMNPNS 73

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR-KVLSRKLS 8
           A GPDG  G FY  C+DIIK DLLAAV++F   K++ R ++
Sbjct: 74  APGPDGIGGKFYQFCFDIIKEDLLAAVNSFFSGKIMPRYMT 114


>ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260201 [Solanum
           lycopersicum]
          Length = 1531

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 33/90 (36%), Positives = 54/90 (60%)
 Frame = -1

Query: 307 IGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNVHG 128
           + +EA + +Q++     +     LL+ +   +T +QNS L   P+ +E+   I  MN + 
Sbjct: 681 VAKEACDYYQNMFTGKSEKIKEELLQNIPELITLEQNSDLDKLPTVEELKNTIMSMNPNS 740

Query: 127 ATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
           A GPDG  G FY +C+DII+ D+LAAV++F
Sbjct: 741 APGPDGIGGKFYQECFDIIQEDMLAAVNSF 770


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
 Frame = -1

Query: 325  LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSD-QNSRLCISPSEKEVFQAI 149
            +ED + + + A+  F SLL + + C        L   + S+ +N  LC  P+ +EV  A+
Sbjct: 1265 IEDQEQLKQSAIEYFSSLLKA-EPCDISRFQNSLIPSIISNSENELLCAEPNLQEVKDAV 1323

Query: 148  KGMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNF 38
              ++   A GPDGFS  FY +CW+ I  DLL AV +F
Sbjct: 1324 FDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDF 1360


>ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, partial [Prunus persica]
           gi|462399232|gb|EMJ04900.1| hypothetical protein
           PRUPE_ppa020995mg, partial [Prunus persica]
          Length = 1367

 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 33/94 (35%), Positives = 53/94 (56%)
 Frame = -1

Query: 313 KAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIKGMNV 134
           + + +  VN FQ L +ST    +  +++G+  RVT + N  L    + +E+  A+  M+ 
Sbjct: 657 QGLTQTVVNYFQHLFSSTGSSEYTGVVDGVRGRVTEEMNQTLLAEFTPEEIKIALFQMHP 716

Query: 133 HGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
             A GPDGFS  FY K W I+  D++AAV +F +
Sbjct: 717 SKAPGPDGFSPFFYQKYWQIVGEDVVAAVLHFFK 750


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
           gi|508727303|gb|EOY19200.1| Retrotransposon,
           unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 35/98 (35%), Positives = 51/98 (52%)
 Frame = -1

Query: 325 LEDPKAIGREAVNEFQSLLNSTQKCSFGNLLEGLNCRVTSDQNSRLCISPSEKEVFQAIK 146
           +E+P  I   AV  F++LL +          E +   ++   N+ LC  P  +EV  A+ 
Sbjct: 349 MEEPGLIESSAVEFFENLLKAENYDLSRFKAEFIPQMLSDADNNLLCAEPQLQEVKDAVF 408

Query: 145 GMNVHGATGPDGFSGLFYLKCWDIIKLDLLAAVSNFLR 32
            ++     GPDGFS  FY +CW II  DLLAAV +F +
Sbjct: 409 AIDKDSVVGPDGFSSFFYQQCWPIIAEDLLAAVRDFFK 446

