BLASTX nr result

ID: Glycyrrhiza30_contig00039868 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza30_contig00039868
         (378 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU35861.1 hypothetical protein TSUD_63510 [Trifolium subterraneum]   135   2e-35
KYP65965.1 Putative ribonuclease H protein At1g65750 family [Caj...    88   7e-18
KYP61064.1 Putative ribonuclease H protein At1g65750 family, par...    85   1e-16
CAB10337.1 reverse transcriptase like protein [Arabidopsis thali...    83   4e-16
ABW81176.1 non-LTR reverse transcriptase [Arabidopsis cebennensis]     82   1e-15
JAU13983.1 hypothetical protein GA_TR2951_c0_g1_i1_g.9318 [Nocca...    79   3e-15
GAU43826.1 hypothetical protein TSUD_399190 [Trifolium subterran...    79   9e-15
KYP71396.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca...    79   1e-14
XP_019091097.1 PREDICTED: uncharacterized protein LOC109128707 [...    79   1e-14
KHN27243.1 hypothetical protein glysoja_046611 [Glycine soja]          74   4e-14
GAU32702.1 hypothetical protein TSUD_145760 [Trifolium subterran...    77   8e-14
XP_013617633.1 PREDICTED: uncharacterized protein LOC106324165 [...    76   1e-13
KYP46870.1 Transposon TX1 uncharacterized [Cajanus cajan]              76   1e-13
KYP57513.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca...    75   2e-13
KYP33748.1 Putative ribonuclease H protein At1g65750 family [Caj...    75   2e-13
OMO60252.1 reverse transcriptase [Corchorus capsularis]                75   2e-13
AAD37019.2 putative non-LTR retrolelement reverse transcriptase ...    75   3e-13
XP_012435352.1 PREDICTED: uncharacterized protein LOC105761970 [...    74   4e-13
XP_018512865.1 PREDICTED: uncharacterized protein LOC103860211 [...    75   4e-13
GAU42562.1 hypothetical protein TSUD_240380 [Trifolium subterran...    75   4e-13

>GAU35861.1 hypothetical protein TSUD_63510 [Trifolium subterraneum]
          Length = 483

 Score =  135 bits (341), Expect = 2e-35
 Identities = 72/165 (43%), Positives = 88/165 (53%), Gaps = 40/165 (24%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE YWF QA+SKW+ LGD NTRYFHQSTL RRC N I AL+D  + W+ D++ +K+ VLD
Sbjct: 209 EENYWFQQARSKWITLGDNNTRYFHQSTLVRRCHNKIMALQDANDQWIYDEDDLKQHVLD 268

Query: 181 FYQDLFSLGDQ--------------------------------------LEW*SYSTFSF 246
           FY  L+S   Q                                      + W    T  F
Sbjct: 269 FYHQLYSTSGQVYPNFISITTFPNISDVDMNYLGSTVTSHECISSASMSINWNGDPTSKF 328

Query: 247 --APRARQGDPIFPYLFVIAMEWLGHKIIESVNEGSWSPLRFGRG 375
             +   RQGDP+ PYLFV+A+E LGH I + VN GSW PL FGRG
Sbjct: 329 YSSRGLRQGDPLSPYLFVLALERLGHYIQDRVNNGSWKPLSFGRG 373


>KYP65965.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1043

 Score = 88.2 bits (217), Expect = 7e-18
 Identities = 55/148 (37%), Positives = 75/148 (50%), Gaps = 23/148 (15%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISA-------------LKDGYE-- 135
           EE  WF +++ KWL +GDRNTR+FH ST+ RR +N I               L+  Y+  
Sbjct: 198 EELIWFQKSRCKWLLMGDRNTRFFHGSTIIRRRKNRIEKLLNENGELAIKIDLEKAYDRL 257

Query: 136 NWVCDKE-----GVKKVVLDFYQDLFSLGD-QLEW*SYSTFSFAPR--ARQGDPIFPYLF 291
           NW+  K+     G+    +D      S    Q+ W      +F+P    RQGDPI PYLF
Sbjct: 258 NWLFIKDTLEDIGLPSKFIDLVWSCISTASLQVLWNGEVLEAFSPSRGIRQGDPISPYLF 317

Query: 292 VIAMEWLGHKIIESVNEGSWSPLRFGRG 375
           V+ ME L H I  +V +  W P+R  RG
Sbjct: 318 VLCMERLFHLIDITVTQQLWKPIRLSRG 345


>KYP61064.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 977

 Score = 84.7 bits (208), Expect = 1e-16
 Identities = 50/134 (37%), Positives = 73/134 (54%), Gaps = 11/134 (8%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDK--EGVKKVV 174
           EE  W+ +++ KW+ LG++NT +FH  T+ RR RN I          +C+   + V KV+
Sbjct: 184 EETLWYQKSREKWIKLGNKNTAFFHTQTVVRRKRNRILGATTLPYISLCNVVYKLVSKVL 243

Query: 175 LDFYQDLF-------SLGDQLEW*SYSTFSFAPRA--RQGDPIFPYLFVIAMEWLGHKII 327
           ++  +          S    L W      SF+P+   RQGDP+ PYLFV+ ME LGH I 
Sbjct: 244 VNRLRPFLMRIVRVTSAKLSLLWNGECLESFSPKRGLRQGDPLSPYLFVLCMERLGHIIQ 303

Query: 328 ESVNEGSWSPLRFG 369
           E V+ G W+PL+ G
Sbjct: 304 EQVSAGVWAPLKLG 317


>CAB10337.1 reverse transcriptase like protein [Arabidopsis thaliana]
           CAB78601.1 reverse transcriptase like protein
           [Arabidopsis thaliana]
          Length = 929

 Score = 83.2 bits (204), Expect = 4e-16
 Identities = 36/70 (51%), Positives = 50/70 (71%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF +++ K LALGDRNT +FH ST+ RR RN I  LKD  + WV +KE ++K+ +D
Sbjct: 147 EETLWFQKSREKLLALGDRNTTFFHTSTVIRRRRNRIEMLKDSEDRWVTEKEALEKLAMD 206

Query: 181 FYQDLFSLGD 210
           +Y+ L+SL D
Sbjct: 207 YYRKLYSLED 216


>ABW81176.1 non-LTR reverse transcriptase [Arabidopsis cebennensis]
          Length = 464

 Score = 81.6 bits (200), Expect = 1e-15
 Identities = 35/70 (50%), Positives = 49/70 (70%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF +++ KW+ALGDRNT+YFH ST+ RR RN I  LKD    WV + E + K+ ++
Sbjct: 305 EEIVWFQKSREKWIALGDRNTKYFHTSTIIRRRRNQIEMLKDDEGKWVEEPEELAKMAVE 364

Query: 181 FYQDLFSLGD 210
           +YQ L+S+ D
Sbjct: 365 YYQKLYSVED 374


>JAU13983.1 hypothetical protein GA_TR2951_c0_g1_i1_g.9318 [Noccaea
           caerulescens]
          Length = 290

 Score = 79.3 bits (194), Expect = 3e-15
 Identities = 31/70 (44%), Positives = 49/70 (70%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF +++ KW+ALGDRNT+Y+H +T+ RR RN I  LKD    W+   E ++K+ ++
Sbjct: 172 EEILWFEKSREKWIALGDRNTKYYHTTTVVRRRRNRIETLKDDDGRWITKPEELEKIAIE 231

Query: 181 FYQDLFSLGD 210
           +Y+ L+S+ D
Sbjct: 232 YYRRLYSMED 241


>GAU43826.1 hypothetical protein TSUD_399190 [Trifolium subterraneum]
          Length = 1071

 Score = 79.3 bits (194), Expect = 9e-15
 Identities = 34/74 (45%), Positives = 51/74 (68%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF ++++KWL  GDR+TRY+H  T++RR +NNI  LKD    WV D + ++ +V D
Sbjct: 245 EELMWFQRSRAKWLTDGDRDTRYYHIKTISRRRKNNIVMLKDEQGQWVEDIQQLQSLVND 304

Query: 181 FYQDLFSLGDQLEW 222
           FY+ LF+L ++  W
Sbjct: 305 FYKQLFALNNRCNW 318


>KYP71396.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 733

 Score = 79.0 bits (193), Expect = 1e-14
 Identities = 43/110 (39%), Positives = 61/110 (55%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF +++ KWL LGDRNT+YFH +T+ RR RN +  L+D    W+ DK  ++ +V +
Sbjct: 400 EELLWFQKSRCKWLCLGDRNTKYFHGTTVIRRQRNKVKMLQDESGRWISDKAELEGLVTN 459

Query: 181 FYQDLFSLGDQLEW*SYSTFSFAPRARQGDPIFPYLFVIAMEWLGHKIIE 330
           FY+DLF   D      Y+ F+           FP +    M+ LG  IIE
Sbjct: 460 FYKDLFQNID-----PYTPFNLT-------GYFPEVHDSLMQNLGRDIIE 497


>XP_019091097.1 PREDICTED: uncharacterized protein LOC109128707 [Camelina sativa]
          Length = 850

 Score = 79.0 bits (193), Expect = 1e-14
 Identities = 30/70 (42%), Positives = 51/70 (72%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE +WF +++ KW+ +GDRN+R+FH ST+ RR RN+I +LKD    W+ D + ++K+ ++
Sbjct: 59  EELFWFQKSREKWINMGDRNSRFFHTSTIIRRRRNHIDSLKDDEGKWISDPQVLEKLAIE 118

Query: 181 FYQDLFSLGD 210
           F+  L+S+ D
Sbjct: 119 FFTRLYSVND 128


>KHN27243.1 hypothetical protein glysoja_046611 [Glycine soja]
          Length = 181

 Score = 74.3 bits (181), Expect = 4e-14
 Identities = 31/67 (46%), Positives = 46/67 (68%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE +W+ ++K+KWL LGDRN + FH  T+ RR RN    +KDG  NWV D E ++++   
Sbjct: 80  EEVHWYTKSKAKWLHLGDRNPKSFHGVTIIRRRRNRYDMIKDGDGNWVVDSEKLEEMATK 139

Query: 181 FYQDLFS 201
           FY+DL++
Sbjct: 140 FYKDLYT 146


>GAU32702.1 hypothetical protein TSUD_145760 [Trifolium subterraneum]
          Length = 739

 Score = 76.6 bits (187), Expect = 8e-14
 Identities = 34/71 (47%), Positives = 51/71 (71%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF ++++KWL  GDRNTRY+H  T++RR RNNI  LKD   NW+ D   ++++V +
Sbjct: 38  EELMWFQRSRAKWLIDGDRNTRYYHVKTISRRRRNNIVMLKDCDGNWIDDSLKLQELVNN 97

Query: 181 FYQDLFSLGDQ 213
           FY+ LF++ +Q
Sbjct: 98  FYKALFTIKNQ 108


>XP_013617633.1 PREDICTED: uncharacterized protein LOC106324165 [Brassica oleracea
           var. oleracea]
          Length = 636

 Score = 76.3 bits (186), Expect = 1e-13
 Identities = 30/67 (44%), Positives = 46/67 (68%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE YWF ++++ W + GDRNT ++H  T+ RR RN I  L D   NW+ D EGV+KV ++
Sbjct: 312 EEDYWFQKSRNMWYSSGDRNTEFYHALTMQRRVRNRIVGLHDADGNWITDDEGVEKVAVN 371

Query: 181 FYQDLFS 201
           ++ +LF+
Sbjct: 372 YFDELFT 378


>KYP46870.1 Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 544

 Score = 75.9 bits (185), Expect = 1e-13
 Identities = 31/67 (46%), Positives = 49/67 (73%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  W+ +++SKWL  GDRNT +FH ST+ RR +N+I +++D   +WV  K+ ++ +V +
Sbjct: 373 EELLWYQKSRSKWLTFGDRNTHFFHCSTIIRRRKNHILSIQDTNGSWVYTKDNLESMVTN 432

Query: 181 FYQDLFS 201
           FY+DLFS
Sbjct: 433 FYRDLFS 439


>KYP57513.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 520

 Score = 75.5 bits (184), Expect = 2e-13
 Identities = 33/73 (45%), Positives = 49/73 (67%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  W+ +++ KWL LGD+NTRYFH ST+ RR ++ I  L +  + WV  KE ++ +V +
Sbjct: 28  EEILWYQKSRCKWLLLGDKNTRYFHGSTIVRRRKSKIEKLMNDNDEWVTQKEELEGMVTN 87

Query: 181 FYQDLFSLGDQLE 219
           FY+ LFS  D+ E
Sbjct: 88  FYRTLFSDTDEAE 100


>KYP33748.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1133

 Score = 75.5 bits (184), Expect = 2e-13
 Identities = 34/73 (46%), Positives = 49/73 (67%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  W+ +++ KWL LGDRNTRYFH ST+ RR ++ I  L +  E WV  KE ++ +V +
Sbjct: 297 EEILWYQKSRCKWLHLGDRNTRYFHGSTIVRRRKSRIEKLMNDNEEWVSRKEDLEGMVTN 356

Query: 181 FYQDLFSLGDQLE 219
           FY+ LFS  ++ E
Sbjct: 357 FYKSLFSDTNEAE 369


>OMO60252.1 reverse transcriptase [Corchorus capsularis]
          Length = 2099

 Score = 75.5 bits (184), Expect = 2e-13
 Identities = 29/74 (39%), Positives = 50/74 (67%)
 Frame = +1

Query: 1    EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
            EE YWF +A+ KWL  GD+NT +FHQ+TL RR +N +  +K+   +WV +++ ++   LD
Sbjct: 1241 EEQYWFQRARVKWLCYGDQNTSFFHQTTLQRRQKNKVVKIKNRQGDWVDEEKDIRSCFLD 1300

Query: 181  FYQDLFSLGDQLEW 222
            F+++L++     +W
Sbjct: 1301 FFKNLYTSSGPRDW 1314


>AAD37019.2 putative non-LTR retrolelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 855

 Score = 75.1 bits (183), Expect = 3e-13
 Identities = 31/70 (44%), Positives = 47/70 (67%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF +++ KW+ LGDRNT+YFH  T+ RR RN I  LK    +WV  ++ ++K+ +D
Sbjct: 749 EEVLWFQKSREKWVELGDRNTKYFHTMTVVRRRRNRIEMLKADDGSWVSQQQELEKMAVD 808

Query: 181 FYQDLFSLGD 210
           +Y  L+S+ D
Sbjct: 809 YYSRLYSMED 818


>XP_012435352.1 PREDICTED: uncharacterized protein LOC105761970 [Gossypium
           raimondii]
          Length = 277

 Score = 73.6 bits (179), Expect = 4e-13
 Identities = 32/71 (45%), Positives = 43/71 (60%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           +E+YW  +++S+WL  GDRN RYFH     R  +NNI  LKD   NWV + +G+ KV  D
Sbjct: 173 KESYWAQRSRSRWLREGDRNIRYFHAKATGRLKKNNIEKLKDAEGNWVTNNKGISKVAKD 232

Query: 181 FYQDLFSLGDQ 213
           F+  LF    Q
Sbjct: 233 FFVRLFQSNGQ 243


>XP_018512865.1 PREDICTED: uncharacterized protein LOC103860211 [Brassica rapa]
          Length = 809

 Score = 74.7 bits (182), Expect = 4e-13
 Identities = 30/67 (44%), Positives = 45/67 (67%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE YWF ++++ W + GDRNT ++H  T  RR RN I  L D   NW+ D EGV+KV ++
Sbjct: 416 EENYWFQKSRNMWYSSGDRNTEFYHALTRQRRVRNRIVGLHDADGNWITDDEGVEKVAVN 475

Query: 181 FYQDLFS 201
           ++ +LF+
Sbjct: 476 YFDELFT 482


>GAU42562.1 hypothetical protein TSUD_240380 [Trifolium subterraneum]
          Length = 973

 Score = 74.7 bits (182), Expect = 4e-13
 Identities = 30/67 (44%), Positives = 46/67 (68%)
 Frame = +1

Query: 1   EEAYWFHQAKSKWLALGDRNTRYFHQSTLARRCRNNISALKDGYENWVCDKEGVKKVVLD 180
           EE  WF +++S+W+  GDRNT+Y+H  T+ RR +N I +L+D   NW+ + E +K +V  
Sbjct: 310 EECLWFQKSRSQWITDGDRNTKYYHSKTIIRRRKNKILSLRDDAGNWIDEPENLKDLVRQ 369

Query: 181 FYQDLFS 201
           FY DLF+
Sbjct: 370 FYVDLFT 376


Top