BLASTX nr result

ID: Glycyrrhiza32_contig00023618 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00023618
         (398 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU20014.1 hypothetical protein TSUD_273490 [Trifolium subterran...   100   2e-24
XP_018510856.1 PREDICTED: uncharacterized protein LOC103844431 [...    93   2e-19
BAB09815.1 non-LTR retroelement reverse transcriptase-like [Arab...    92   4e-19
AAD21515.1 putative reverse transcriptase [Arabidopsis thaliana]...    90   7e-19
ABK28199.1 unknown, partial [Arabidopsis thaliana]                     90   7e-19
XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [...    89   5e-18
OMO72395.1 hypothetical protein COLO4_27635 [Corchorus olitorius]      85   5e-18
ABE65462.1 hypothetical protein At2g27870 [Arabidopsis thaliana]       87   7e-18
AAD22368.1 putative non-LTR retroelement reverse transcriptase [...    87   1e-17
XP_015385965.1 PREDICTED: uncharacterized protein LOC107177137 [...    88   1e-17
JAU15471.1 Putative ribonuclease H protein, partial [Noccaea cae...    83   2e-17
XP_013668797.1 PREDICTED: uncharacterized protein LOC106373127 [...    87   3e-17
AAF79490.1 F1L3.4 [Arabidopsis thaliana]                               83   8e-17
AAF97302.1 Hypothetical protein [Arabidopsis thaliana] AAV68820....    83   1e-16
XP_013679846.1 PREDICTED: uncharacterized protein LOC106384431 [...    85   1e-16
XP_013710279.1 PREDICTED: uncharacterized protein LOC106414114 [...    85   2e-16
EOY16798.1 Uncharacterized protein TCM_035679 [Theobroma cacao]        81   2e-16
XP_013658112.1 PREDICTED: uncharacterized protein LOC106362816 [...    84   4e-16
KYP33915.1 Putative ribonuclease H protein At1g65750 family, par...    79   6e-16
XP_015390029.1 PREDICTED: uncharacterized protein LOC107178892 [...    82   1e-15

>GAU20014.1 hypothetical protein TSUD_273490 [Trifolium subterraneum]
          Length = 159

 Score =  100 bits (250), Expect = 2e-24
 Identities = 49/105 (46%), Positives = 69/105 (65%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RDDTGAW    A N+GSC++LM +LWGI +T+ M  + G R++ ++SDSA  +  +N+
Sbjct: 30  VIRDDTGAWIGEVARNLGSCTVLMAELWGILTTLQMVWDKGYRYVSLESDSAIVVSLINK 89

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHA 374
           G  PSH   S+V  I  L      VQI+H+YR+AN +ADW A +A
Sbjct: 90  GCPPSHPYVSIVSLINRLKMKDWQVQISHIYRQANQVADWIANYA 134


>XP_018510856.1 PREDICTED: uncharacterized protein LOC103844431 [Brassica rapa]
          Length = 1833

 Score = 92.8 bits (229), Expect = 2e-19
 Identities = 48/113 (42%), Positives = 66/113 (58%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            V RD+ G W+ GFA NIG CS  + +LWG++   V+A E G R L V+ DSAT +  +  
Sbjct: 1693 VMRDENGEWQGGFAVNIGICSATLAELWGVYYGFVIAWENGIRRLEVEVDSATVVGFLKT 1752

Query: 240  GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
            G+  +H  + LV            V+I+HVYREAN +AD  A +AFS   G+H
Sbjct: 1753 GIHDAHPLSFLVRLCYGFISRDWLVKISHVYREANCLADGLANYAFSLSFGVH 1805


>BAB09815.1 non-LTR retroelement reverse transcriptase-like [Arabidopsis
           thaliana]
          Length = 676

 Score = 92.0 bits (227), Expect = 4e-19
 Identities = 47/111 (42%), Positives = 66/111 (59%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD+ G+W +GFA NIG CS  + +LWG++  +V+A E G R + ++ DSA  +  +  
Sbjct: 536 VIRDEHGSWLVGFALNIGVCSAPLAELWGVYYGLVVAWERGWRRVRLEVDSALVVGFLQS 595

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIG 392
           G+  SH  A LV            V+ITHVYREAN +AD  A +AF+ P G
Sbjct: 596 GIGDSHPLAFLVRLCHGFISKDWIVRITHVYREANRLADGLANYAFTLPFG 646


>AAD21515.1 putative reverse transcriptase [Arabidopsis thaliana] AAM15081.1
           putative reverse transcriptase [Arabidopsis thaliana]
          Length = 314

 Score = 89.7 bits (221), Expect = 7e-19
 Identities = 46/113 (40%), Positives = 65/113 (57%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD+ GAWR GFA NIG CS  + +LWG++  + +A E     L ++ DS   +  +  
Sbjct: 174 VLRDEEGAWRGGFALNIGVCSAPLAELWGVYYGLYIAWERRVTRLEIEVDSEIVVGFLKI 233

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
           G++  H  + LV    D       V+I+HVYREAN +AD  A +AFS P+G H
Sbjct: 234 GINEVHPLSFLVRLCHDFISRDWRVRISHVYREANRLADGLANYAFSLPLGFH 286


>ABK28199.1 unknown, partial [Arabidopsis thaliana]
          Length = 315

 Score = 89.7 bits (221), Expect = 7e-19
 Identities = 46/113 (40%), Positives = 65/113 (57%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD+ GAWR GFA NIG CS  + +LWG++  + +A E     L ++ DS   +  +  
Sbjct: 174 VLRDEEGAWRGGFALNIGVCSAPLAELWGVYYGLYIAWERRVTRLEIEVDSEIVVGFLKI 233

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
           G++  H  + LV    D       V+I+HVYREAN +AD  A +AFS P+G H
Sbjct: 234 GINEVHPLSFLVRLCHDFISRDWRVRISHVYREANRLADGLANYAFSLPLGFH 286


>XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [Arachis ipaensis]
          Length = 1901

 Score = 89.0 bits (219), Expect = 5e-18
 Identities = 45/113 (39%), Positives = 68/113 (60%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            VFR+  G +  GF+ N+G+CSI+  +LW +   + +A   G + L+V+SDSA A++ +N+
Sbjct: 1763 VFRNSDGRFLQGFSCNLGNCSIMHAELWAVIHGLSIATTKGYQCLFVESDSAEAINFINR 1822

Query: 240  GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
            G SP+H CA LV  I+ L      +   H  REAN++AD  A      PIG+H
Sbjct: 1823 GCSPTHPCAPLVQDIRGLAARIQKITWLHSLREANSVADLLAKKGQELPIGLH 1875


>OMO72395.1 hypothetical protein COLO4_27635 [Corchorus olitorius]
          Length = 197

 Score = 85.1 bits (209), Expect = 5e-18
 Identities = 43/102 (42%), Positives = 65/102 (63%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V R   G W +GF+  IG CSI M +LWGIH  + +A   G R L ++SDSAT++D + +
Sbjct: 59  VIRGQCGEWLLGFSQAIGKCSIDMAELWGIHQGISLAWSRGFRALEIESDSATSVDMIRK 118

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFA 365
           GV+ +H  A +V AI++L     + +I++V R+ N +ADW A
Sbjct: 119 GVNTNHPLACVVEAIRELLAKDWNWRISYVPRQKNFVADWLA 160


>ABE65462.1 hypothetical protein At2g27870 [Arabidopsis thaliana]
          Length = 314

 Score = 87.0 bits (214), Expect = 7e-18
 Identities = 45/113 (39%), Positives = 64/113 (56%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD+ GAWR GFA NIG CS  + +LWG++  + +A E     L ++ DS   +  +  
Sbjct: 174 VLRDEEGAWRGGFALNIGVCSAPLAELWGVYYGLYIAWERRVTRLEIEVDSEIVVGFLKI 233

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
            ++  H  + LV    D       V+I+HVYREAN +AD  A +AFS P+G H
Sbjct: 234 XINEVHPLSFLVRLCHDFISRDWRVRISHVYREANRLADGLANYAFSLPLGFH 286


>AAD22368.1 putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 321

 Score = 86.7 bits (213), Expect = 1e-17
 Identities = 44/113 (38%), Positives = 65/113 (57%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD  GAW  GFA NIG CS  + +LWG++  + +A   G R + ++ DS   +  +  
Sbjct: 181 VLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVDSKMVVGFLTT 240

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
           G++ SH  + L+    D       V+I+HVYREAN +AD  A +AFS  +G+H
Sbjct: 241 GIADSHPLSFLLRLCYDFLSKGWIVRISHVYREANRLADGLANYAFSLSLGLH 293


>XP_015385965.1 PREDICTED: uncharacterized protein LOC107177137 [Citrus sinensis]
          Length = 1277

 Score = 87.8 bits (216), Expect = 1e-17
 Identities = 44/113 (38%), Positives = 66/113 (58%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            + RD  G W  GF  NIG  S+LM +LWG++  + +  E G + L V+ D+      +++
Sbjct: 1137 IIRDSVGHWITGFCMNIGESSVLMAELWGLYQGLRLTWEAGIKRLLVEVDNLCVTQLVSK 1196

Query: 240  GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
             V   +   +LV+AI++L      V ITH+YREAN+ AD+ A  A SYP G+H
Sbjct: 1197 QVVVPNEFYALVVAIRELISRNWQVSITHIYREANSAADFMANMAHSYPHGLH 1249


>JAU15471.1 Putative ribonuclease H protein, partial [Noccaea caerulescens]
          Length = 174

 Score = 83.2 bits (204), Expect = 2e-17
 Identities = 42/107 (39%), Positives = 62/107 (57%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD  G W  GF+ NIG C+  M +LWG++  + +A E G   + V+ DS   +  + +
Sbjct: 68  VIRDGDGKWCGGFSLNIGRCTAPMAELWGVYYGLCIAWEKGITRVEVEVDSVMVVGFLKE 127

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFS 380
           G+S +H  +SLV     L     +V++ HVYREAN +AD  A +AFS
Sbjct: 128 GISDTHPLSSLVHMCHGLLSKDWEVRVVHVYREANYLADGLANYAFS 174


>XP_013668797.1 PREDICTED: uncharacterized protein LOC106373127 [Brassica napus]
          Length = 1818

 Score = 86.7 bits (213), Expect = 3e-17
 Identities = 45/113 (39%), Positives = 64/113 (56%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            V R+  G W  GFA NIG CS  + +LWG++  +V+A E G   L ++ DSA  +  +  
Sbjct: 1677 VLRNKFGDWCGGFAMNIGRCSAPLAELWGVYYGLVLAWERGITRLELEVDSAVVVGFLKT 1736

Query: 240  GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
            G+  +H  + LV            V+I HVYREAN +AD  A +AF+ P+GIH
Sbjct: 1737 GIEETHPLSFLVRLCHGYLSKDWIVRIDHVYREANRLADGLANYAFTLPLGIH 1789


>AAF79490.1 F1L3.4 [Arabidopsis thaliana]
          Length = 253

 Score = 83.2 bits (204), Expect = 8e-17
 Identities = 44/113 (38%), Positives = 61/113 (53%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD  G W  GF+ NIG CS  + +LWG +  + +A E G   L ++ DS   +  +  
Sbjct: 113 VVRDGDGNWCYGFSLNIGICSAPLAELWGAYYGLNIAWERGVTQLEMEIDSEMVVGFLRT 172

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
           G+  SH  + LV     L      V+I+HVYREAN +AD  A +AF  P+G H
Sbjct: 173 GIDDSHPLSFLVRLCHGLLSKDWSVRISHVYREANRLADGLANYAFFLPLGFH 225


>AAF97302.1 Hypothetical protein [Arabidopsis thaliana] AAV68820.1 hypothetical
           protein AT1G17390 [Arabidopsis thaliana]
          Length = 272

 Score = 83.2 bits (204), Expect = 1e-16
 Identities = 44/113 (38%), Positives = 61/113 (53%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
           V RD  G W  GF+ NIG CS  + +LWG +  + +A E G   L ++ DS   +  +  
Sbjct: 132 VVRDGDGNWCYGFSLNIGICSAPLAELWGAYYGLNIAWERGVTQLEMEIDSEMVVGFLRT 191

Query: 240 GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
           G+  SH  + LV     L      V+I+HVYREAN +AD  A +AF  P+G H
Sbjct: 192 GIDDSHPLSFLVRLCHGLLSKDWSVRISHVYREANRLADGLANYAFFLPLGFH 244


>XP_013679846.1 PREDICTED: uncharacterized protein LOC106384431 [Brassica napus]
          Length = 1854

 Score = 85.1 bits (209), Expect = 1e-16
 Identities = 45/113 (39%), Positives = 63/113 (55%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            V R+  G W  GFA NIG CS  + +LWG++  +V+A E G   L +  DSA  +  +  
Sbjct: 1713 VLRNKFGDWCGGFALNIGRCSAPLAELWGVYYGLVLAWERGIARLELDVDSAVVVGFLKI 1772

Query: 240  GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
            G+  +H  + LV            V+I HVYREAN +AD  A +AF+ P+GIH
Sbjct: 1773 GIEQTHPLSFLVRLCHGYLSKDWIVRIDHVYREANRLADGLANYAFTLPLGIH 1825


>XP_013710279.1 PREDICTED: uncharacterized protein LOC106414114 [Brassica napus]
          Length = 1895

 Score = 84.7 bits (208), Expect = 2e-16
 Identities = 44/113 (38%), Positives = 63/113 (55%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            V R + G W  GFA NIG CS  + +LWG++  M +A + G R L V+ DS + +  +  
Sbjct: 1755 VVRGEYGTWEGGFAVNIGICSAPLAELWGVYYGMCIAWDRGIRQLEVEVDSESVVGFLQT 1814

Query: 240  GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
            G+  +H  + LV            V+ +HVYREAN +AD  A +AFS P G+H
Sbjct: 1815 GIHDAHPLSFLVRLCYGFVSRDWLVKFSHVYREANRLADELANYAFSLPFGLH 1867


>EOY16798.1 Uncharacterized protein TCM_035679 [Theobroma cacao]
          Length = 203

 Score = 81.3 bits (199), Expect = 2e-16
 Identities = 41/114 (35%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSM-N 236
           V  D+ G W +GF Y IG    L  +LW ++  + +  + G R + V+SDS  A+  + N
Sbjct: 74  VITDEVGNWLLGFNYKIGISCSLQVELWALYWGLTLCWDKGFRKVQVESDSLLAVQKISN 133

Query: 237 QGVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
           Q + P  + A L+  I++L++   D  +TH++REAN  A+W ATH  + P+G+H
Sbjct: 134 QSLQPKQN-AGLLKCIRELFQRSWDCTLTHIHREANQCANWMATHHENLPLGLH 186


>XP_013658112.1 PREDICTED: uncharacterized protein LOC106362816 [Brassica napus]
          Length = 1707

 Score = 83.6 bits (205), Expect = 4e-16
 Identities = 44/113 (38%), Positives = 63/113 (55%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            V R+  G W  GFA NIG CS  + +LWG++  +V+A E G   L ++ DSA  +  +  
Sbjct: 1566 VLRNKFGDWCGGFAMNIGRCSAPLAELWGVYYGLVLAWERGITRLELEVDSAVVVGFLKT 1625

Query: 240  GVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGIH 398
             +  +H  + LV            V+I HVYREAN +AD  A +AF+ P+GIH
Sbjct: 1626 RIEETHPLSFLVRLCHGYLSKDWIVRIDHVYREANRLADGLANYAFTLPLGIH 1678


>KYP33915.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 148

 Score = 78.6 bits (192), Expect = 6e-16
 Identities = 40/113 (35%), Positives = 64/113 (56%), Gaps = 1/113 (0%)
 Frame = +3

Query: 60  VFRDDTGAWRMGFAYNIGSCSILMGKLWGI-HSTMVMAKELGCRHLWVKSDSATAMDSMN 236
           V RD  G + + FA  +G+C+++  +LW I H   ++  +    H+ ++SDSA A+  +N
Sbjct: 28  VLRDSDGTFIVAFAGLLGTCTVVQAELWAIYHGLRLIKDKFISSHIIIESDSAIAVKFLN 87

Query: 237 QGVSPSHHCASLVMAIQDLWKDFSDVQITHVYREANTIADWFATHAFSYPIGI 395
           +G   +H C +LV  I  +  DF  ++  H  REAN +AD FA   FS P G+
Sbjct: 88  EGCPQAHPCYALVNHIVRMLGDFYKIECIHTLREANQVADGFAKIGFSIPEGV 140


>XP_015390029.1 PREDICTED: uncharacterized protein LOC107178892 [Citrus sinensis]
          Length = 1186

 Score = 82.4 bits (202), Expect = 1e-15
 Identities = 45/113 (39%), Positives = 67/113 (59%), Gaps = 1/113 (0%)
 Frame = +3

Query: 60   VFRDDTGAWRMGFAYNIGSCSILMGKLWGIHSTMVMAKELGCRHLWVKSDSATAMDSMNQ 239
            + RD +G W  GF   +GSCS+ M +L GI+  +++A   G R L +++DS  A   +  
Sbjct: 1045 LIRDYSGRWLTGFGLMLGSCSVTMAELRGIYQGLILAWNFGIRWLHMETDSLCATQMLAN 1104

Query: 240  GVSPSHHCASLVMAIQD-LWKDFSDVQITHVYREANTIADWFATHAFSYPIGI 395
             V  ++  ASL+ AI++ L K    V I+HVYREAN  AD+ A  A S P+G+
Sbjct: 1105 QVETTNEFASLIFAIKEYLQKKDWQVSISHVYREANLAADFMANLACSLPLGL 1157


Top