BLASTX nr result

ID: Sinomenium21_contig00017181

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00017181
         (313 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   113   2e-23
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   112   5e-23
ref|XP_007028283.1| BZIP-like protein [Theobroma cacao] gi|50871...   111   1e-22
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   110   2e-22
ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobrom...   110   2e-22
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   110   3e-22
ref|XP_007008705.1| Uncharacterized protein TCM_042331 [Theobrom...   109   3e-22
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   108   6e-22
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   108   8e-22
emb|CAD45561.1| reverse transcriptase [Elaeis guineensis]             108   1e-21
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   107   2e-21
emb|CAD45562.1| reverse transcriptase [Elaeis guineensis]             107   2e-21
ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobrom...   106   3e-21
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   106   3e-21
ref|XP_007011457.1| Uncharacterized protein TCM_045622 [Theobrom...   106   4e-21
emb|CAD45564.1| reverse transcriptase [Elaeis guineensis]             105   8e-21
ref|XP_007031316.1| Uncharacterized protein TCM_016767 [Theobrom...   103   3e-20
emb|CAA12932.1| reverse transcriptase [Pinus elliottii]               102   5e-20
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...   102   7e-20
ref|XP_007032403.1| Uncharacterized protein TCM_018253 [Theobrom...   100   2e-19

>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
           gi|508727303|gb|EOY19200.1| Retrotransposon,
           unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  113 bits (283), Expect = 2e-23
 Identities = 56/99 (56%), Positives = 72/99 (72%)
 Frame = +2

Query: 5   EPASFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEM 184
           + A++  FRPISLC ++ K+ TKL+ NRLSKVL S+IS  Q  FV GR I  NI L QE+
Sbjct: 467 DAATWSDFRPISLCTILNKIVTKLLANRLSKVLPSLISENQSGFVSGRLINDNILLAQEL 526

Query: 185 LTLLHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
           +  +  K+RGGNV+LK+DM KAYDR+NW FL+ VL  FG
Sbjct: 527 IGKIDYKARGGNVVLKLDMMKAYDRLNWDFLILVLERFG 565


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  112 bits (280), Expect = 5e-23
 Identities = 53/97 (54%), Positives = 73/97 (75%)
 Frame = +2

Query: 11   ASFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLT 190
            + + +FRPISLC V+ K+ TKL+ NRLSK+L S+IS  Q  FV GR I  NI L QE++ 
Sbjct: 1349 SQWSEFRPISLCTVLNKIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVD 1408

Query: 191  LLHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
             ++ +SRGGNV+LK+DM+KAYDR+NW+FL  ++  FG
Sbjct: 1409 KINARSRGGNVVLKLDMAKAYDRLNWEFLYLMMEQFG 1445


>ref|XP_007028283.1| BZIP-like protein [Theobroma cacao] gi|508716888|gb|EOY08785.1|
           BZIP-like protein [Theobroma cacao]
          Length = 539

 Score =  111 bits (277), Expect = 1e-22
 Identities = 55/96 (57%), Positives = 69/96 (71%)
 Frame = +2

Query: 14  SFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTL 193
           S+  FRPISLC V  K+ TKL+VN+L+KVL S+IS  Q  FV GR I  NI L QE++  
Sbjct: 339 SWSDFRPISLCTVFNKIITKLLVNQLAKVLSSLISDNQSGFVSGRLISDNILLAQELVGK 398

Query: 194 LHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
           +  K+RGGNV+LK+DM KAYDR+NW FL  +L  FG
Sbjct: 399 IDYKARGGNVILKLDMMKAYDRLNWDFLYLILEHFG 434


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  110 bits (275), Expect = 2e-22
 Identities = 53/95 (55%), Positives = 70/95 (73%)
 Frame = +2

Query: 17   FDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLL 196
            + ++RPISLC V+ K+ TKL+ NRLSK+L S+IS  Q  FV GR I  NI L QE++  +
Sbjct: 1264 WSEYRPISLCTVLNKIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQELIGKI 1323

Query: 197  HRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
              KSRGGNV+LK+DM+KAYDR+NW FL  ++  FG
Sbjct: 1324 DAKSRGGNVVLKLDMAKAYDRLNWDFLYLMMEHFG 1358


>ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
            gi|508704886|gb|EOX96782.1| Uncharacterized protein
            TCM_005953 [Theobroma cacao]
          Length = 1659

 Score =  110 bits (275), Expect = 2e-22
 Identities = 53/92 (57%), Positives = 69/92 (75%)
 Frame = +2

Query: 26   FRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHRK 205
            FRPISLC V+ K+ TK++ NRLSK+L S+IS  Q  FV GR I  NI L QE++  L  K
Sbjct: 990  FRPISLCTVLNKIVTKMLANRLSKILPSIISENQSGFVNGRLISDNILLAQELIGKLDAK 1049

Query: 206  SRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
            +RGGNV+LK+DM+KAYDR+NW FL  +++ FG
Sbjct: 1050 ARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFG 1081


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  110 bits (274), Expect = 3e-22
 Identities = 54/100 (54%), Positives = 72/100 (72%)
 Frame = +2

Query: 11   ASFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLT 190
            + + +FRPISLC V+ K+ TKL+ NRL+K+L S+I+  Q  FV GR I  NI L QE++ 
Sbjct: 1385 SKWSEFRPISLCTVMNKIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIR 1444

Query: 191  LLHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFGLPE 310
             L  KSRGGN+ LK+DM KAYDR++W FL+ VL+ FG  E
Sbjct: 1445 KLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGFNE 1484


>ref|XP_007008705.1| Uncharacterized protein TCM_042331 [Theobroma cacao]
           gi|508725618|gb|EOY17515.1| Uncharacterized protein
           TCM_042331 [Theobroma cacao]
          Length = 1176

 Score =  109 bits (273), Expect = 3e-22
 Identities = 52/98 (53%), Positives = 71/98 (72%)
 Frame = +2

Query: 11  ASFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLT 190
           + + +F PISLC V+ K+ TKL+ NRLSK+L S+IS  Q  FV GR I  NI L QE++ 
Sbjct: 594 SQWSEFHPISLCTVLNKIVTKLLANRLSKILSSIISENQSGFVNGRLISDNILLAQELIG 653

Query: 191 LLHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFGL 304
            ++ +SRGGNV+LK+DM+KAYDR+NW FL  ++  F L
Sbjct: 654 KINARSRGGNVVLKLDMAKAYDRLNWDFLYLMMEHFAL 691


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  108 bits (271), Expect = 6e-22
 Identities = 53/92 (57%), Positives = 68/92 (73%)
 Frame = +2

Query: 26   FRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHRK 205
            FRPISLC V+ K+ TK + NRLSK+L S+IS  Q  FV GR I  NI L QE++  L  K
Sbjct: 1093 FRPISLCTVLNKIVTKTLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVGKLDAK 1152

Query: 206  SRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
            +RGGNV+LK+DM+KAYDR+NW FL  +++ FG
Sbjct: 1153 ARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFG 1184


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  108 bits (270), Expect = 8e-22
 Identities = 53/97 (54%), Positives = 71/97 (73%)
 Frame = +2

Query: 11   ASFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLT 190
            + +  FRPISLC V+ K+ TKL+ NRL+K+L S+I+  Q  FV GR I  NI L QE++ 
Sbjct: 1555 SKWSDFRPISLCTVMNKIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIG 1614

Query: 191  LLHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
             L+ KSRGGN+ LK+DM KAYDR++W FL+ VL+ FG
Sbjct: 1615 KLNTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFG 1651


>emb|CAD45561.1| reverse transcriptase [Elaeis guineensis]
          Length = 137

 Score =  108 bits (269), Expect = 1e-21
 Identities = 52/92 (56%), Positives = 67/92 (72%)
 Frame = +2

Query: 26  FRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHRK 205
           FRPISLCN+IYK+  +L  +RL+ +L  +IS  QGAFVKGR+I +NISL QE+    +RK
Sbjct: 3   FRPISLCNLIYKIIARLFNDRLASILPLIISENQGAFVKGRNILENISLAQELTQEFNRK 62

Query: 206 SRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
             G NV++K+DM KAYDR+ W FL  VL  FG
Sbjct: 63  CYGHNVIIKLDMGKAYDRLEWDFLFQVLLRFG 94


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  107 bits (267), Expect = 2e-21
 Identities = 53/97 (54%), Positives = 70/97 (72%)
 Frame = +2

Query: 11   ASFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLT 190
            + +  FRPISLC V+ K+ TKL+ NRL+KVL S+I+  Q  FV GR I  NI L QE++ 
Sbjct: 1383 SKWSDFRPISLCTVMNKIITKLLSNRLAKVLPSIITENQSGFVGGRLISDNILLAQELIG 1442

Query: 191  LLHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
             L+ KSRGGN+ LK+DM KAYD+++W FL  VL+ FG
Sbjct: 1443 KLNTKSRGGNLALKLDMMKAYDKLDWSFLFKVLQHFG 1479


>emb|CAD45562.1| reverse transcriptase [Elaeis guineensis]
          Length = 137

 Score =  107 bits (266), Expect = 2e-21
 Identities = 52/93 (55%), Positives = 67/93 (72%)
 Frame = +2

Query: 23  KFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHR 202
           KFRPISLCN+IYK+  +L  +RL+ +L  +IS  QGAFVKGR+I +NISL QE+    +R
Sbjct: 2   KFRPISLCNLIYKIIARLFNDRLASILPLIISENQGAFVKGRNILENISLAQELTQEFNR 61

Query: 203 KSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
           K  G NV++K+DM KAYD + W FL  VL  FG
Sbjct: 62  KCYGHNVIIKLDMGKAYDWLEWDFLFQVLLRFG 94


>ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobroma cacao]
            gi|508778193|gb|EOY25449.1| Uncharacterized protein
            TCM_016755 [Theobroma cacao]
          Length = 1245

 Score =  106 bits (265), Expect = 3e-21
 Identities = 53/92 (57%), Positives = 67/92 (72%)
 Frame = +2

Query: 26   FRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHRK 205
            FRPISLC V+ K+ TKL+ NRLSK L S+IS  Q  FV GR I  NI L QE++  L  K
Sbjct: 842  FRPISLCTVLNKIVTKLLANRLSKFLPSIISENQSGFVNGRLISDNILLAQELVGKLDAK 901

Query: 206  SRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
            +RGGNV+LK+DM+KAYDR++W FL  ++  FG
Sbjct: 902  ARGGNVVLKLDMAKAYDRLSWDFLYLMMEQFG 933


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  106 bits (265), Expect = 3e-21
 Identities = 51/97 (52%), Positives = 70/97 (72%)
 Frame = +2

Query: 11   ASFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLT 190
            + + +FRPISLC V+ K+ TK++ NRL+K+L S+I+  Q  FV GR I  NI L QE++ 
Sbjct: 1348 SKWSEFRPISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRLISDNILLAQELIG 1407

Query: 191  LLHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
             L +K+RGGNV LK+DM KAYDR++W FL  VL+  G
Sbjct: 1408 KLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLG 1444


>ref|XP_007011457.1| Uncharacterized protein TCM_045622 [Theobroma cacao]
           gi|508728370|gb|EOY20267.1| Uncharacterized protein
           TCM_045622 [Theobroma cacao]
          Length = 1232

 Score =  106 bits (264), Expect = 4e-21
 Identities = 53/92 (57%), Positives = 68/92 (73%)
 Frame = +2

Query: 26  FRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHRK 205
           FRPISLC V+ K+ TKL+ NRL+K+L S+IS  Q AFV  R I  NI L QE++  +  K
Sbjct: 706 FRPISLCTVLNKIVTKLLANRLAKLLPSIISENQSAFVNDRLISDNILLAQELIGKIDGK 765

Query: 206 SRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
           SRGGNV+LK+DM+KAYDR++W FL  +L  FG
Sbjct: 766 SRGGNVVLKLDMAKAYDRLHWDFLYLMLEHFG 797


>emb|CAD45564.1| reverse transcriptase [Elaeis guineensis]
          Length = 137

 Score =  105 bits (261), Expect = 8e-21
 Identities = 51/94 (54%), Positives = 67/94 (71%)
 Frame = +2

Query: 20  DKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLH 199
           + FRPISLCN+IYK+  +L  +RL+ +L  +IS  QGAFVKGR+I +NISL QE+    +
Sbjct: 1   NNFRPISLCNLIYKIIARLFNDRLASILPLIISENQGAFVKGRNILENISLAQELTQEFN 60

Query: 200 RKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
           RK  G NV++K DM +AYDR+ W FL  VL  FG
Sbjct: 61  RKCYGHNVIIKPDMGEAYDRLEWDFLFQVLLRFG 94


>ref|XP_007031316.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
            gi|508710345|gb|EOY02242.1| Uncharacterized protein
            TCM_016767 [Theobroma cacao]
          Length = 1707

 Score =  103 bits (256), Expect = 3e-20
 Identities = 50/92 (54%), Positives = 65/92 (70%)
 Frame = +2

Query: 26   FRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHRK 205
            + PISLC V+ K+ TKL+ NRLSK+L  +IS  Q  FV GR I  NI L  E++  +  K
Sbjct: 1224 YSPISLCTVLNKIVTKLLANRLSKILPLIISENQSGFVNGRLISDNILLAHELIGKIDAK 1283

Query: 206  SRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
            SRGGNV+LK+DM+KAYDR+NW FL  ++  FG
Sbjct: 1284 SRGGNVVLKLDMAKAYDRLNWDFLYLMMEHFG 1315


>emb|CAA12932.1| reverse transcriptase [Pinus elliottii]
          Length = 194

 Score =  102 bits (254), Expect = 5e-20
 Identities = 48/94 (51%), Positives = 69/94 (73%)
 Frame = +2

Query: 20  DKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLH 199
           +KFRPI+LCNVIYK+ +K+I NRL  +L  +IS EQ  +V+GR I  NI L QEM+  LH
Sbjct: 58  EKFRPIALCNVIYKIISKVIANRLKIILPGIISQEQSGYVEGRQILDNILLAQEMIHSLH 117

Query: 200 RKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
            +   G +L+++D+SKAYD+V+W +L  +L+ FG
Sbjct: 118 SRKVAG-MLIQLDLSKAYDKVSWTYLEAILKAFG 150


>ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
           tuberosum]
          Length = 885

 Score =  102 bits (253), Expect = 7e-20
 Identities = 51/99 (51%), Positives = 72/99 (72%)
 Frame = +2

Query: 14  SFDKFRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTL 193
           SF   RPISL   I K+ ++L+ +RL KVL ++IS  Q AFVKGRSI +N+ L QE++  
Sbjct: 31  SFGDLRPISLSTFINKIISRLLHDRLVKVLPTIISQNQAAFVKGRSITENVLLAQEIIRD 90

Query: 194 LHRKSRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFGLPE 310
           ++R+++  NV++K+DM+KAYDRV+W FL  VLR FG  E
Sbjct: 91  INRRNKNHNVVVKLDMAKAYDRVSWIFLTKVLRSFGCSE 129


>ref|XP_007032403.1| Uncharacterized protein TCM_018253 [Theobroma cacao]
           gi|508711432|gb|EOY03329.1| Uncharacterized protein
           TCM_018253 [Theobroma cacao]
          Length = 540

 Score =  100 bits (249), Expect = 2e-19
 Identities = 49/92 (53%), Positives = 65/92 (70%)
 Frame = +2

Query: 26  FRPISLCNVIYKVFTKLIVNRLSKVLGSMISLEQGAFVKGRSIFQNISLTQEMLTLLHRK 205
           F PISLC ++ K+ TKL+ NRL+K+L S+I   Q  FV GR I  NI L QE++  +  K
Sbjct: 141 FHPISLCTILNKIVTKLLGNRLAKILPSIILENQSGFVNGRFISDNILLVQELIGRIDAK 200

Query: 206 SRGGNVLLKIDMSKAYDRVNWKFLMHVLRGFG 301
           S GGNV+LK+DM+KAYDR+NW FL  ++  FG
Sbjct: 201 SWGGNVVLKLDMAKAYDRLNWDFLYLMMEYFG 232

