BLASTX nr result

ID: Cheilocostus21_contig00033387 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00033387
         (951 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PRQ37611.1| putative reverse transcriptase zinc-binding domai...    79   5e-13
ref|XP_014622286.1| PREDICTED: uncharacterized protein LOC106795...    71   8e-11
gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus]        74   1e-10
ref|XP_020096969.1| uncharacterized protein LOC109716078, partia...    74   1e-10
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...    72   3e-10
ref|XP_006603210.1| PREDICTED: uncharacterized protein LOC102670...    70   3e-10
ref|XP_019447270.1| PREDICTED: uncharacterized protein LOC109350...    72   3e-10
ref|XP_009799064.1| PREDICTED: uncharacterized protein LOC104245...    71   5e-10
ref|XP_006605024.1| PREDICTED: uncharacterized protein LOC102662...    69   1e-09
gb|KHN23636.1| hypothetical protein glysoja_044901, partial [Gly...    66   1e-09
ref|NP_189164.1| Ribonuclease H-like superfamily protein [Arabid...    69   2e-09
gb|ACF78995.1| unknown [Zea mays] >gi|645168529|gb|AIB05810.1| o...    69   2e-09
ref|XP_014626117.1| PREDICTED: uncharacterized protein LOC106796...    67   3e-09
ref|XP_006577422.1| PREDICTED: uncharacterized protein LOC102666...    67   4e-09
ref|XP_022041224.1| uncharacterized protein LOC110943800 [Helian...    66   5e-09
ref|XP_021844830.1| uncharacterized protein LOC110784683 [Spinac...    69   5e-09
ref|XP_006603334.1| PREDICTED: uncharacterized protein LOC102660...    67   5e-09
ref|XP_014631438.1| PREDICTED: uncharacterized protein LOC106798...    66   5e-09
ref|XP_013738859.1| uncharacterized protein LOC106441605 [Brassi...    67   5e-09
gb|ONM61175.1| hypothetical protein ZEAMMB73_Zm00001d022590 [Zea...    69   5e-09

>gb|PRQ37611.1| putative reverse transcriptase zinc-binding domain-containing
           protein [Rosa chinensis]
          Length = 308

 Score = 79.0 bits (193), Expect = 5e-13
 Identities = 42/103 (40%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNG-LSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGR 209
           D +IW    NG  S+  A +  L  + + +    +    L +L+ +PKIK+FGW L  GR
Sbjct: 199 DEYIWGPSSNGKFSIKSATW--LQYDHLRKHSQSKLINKLWKLNVQPKIKIFGWLLLRGR 256

Query: 210 LPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVWR 338
           L T DRL + G+ N  +C LCN++ ETADHLF  C +++ VWR
Sbjct: 257 LKTRDRLSRFGIINDNSCLLCNRDNETADHLFGYCEFTKEVWR 299


>ref|XP_014622286.1| PREDICTED: uncharacterized protein LOC106795879 [Glycine max]
          Length = 221

 Score = 71.2 bits (173), Expect = 8e-11
 Identities = 37/102 (36%), Positives = 54/102 (52%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D+W+W+   +G    K+ Y  LLM    +   DR +  L  L    K  +F W+L   RL
Sbjct: 20  DVWVWKADSSGNYSTKSAYR-LLMEATGEFPVDRTYVDLWNLKIPSKAIVFAWRLIKDRL 78

Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335
           PT+  L+   +  N + C LCN  EE A HLF++CT +E +W
Sbjct: 79  PTWTNLRVRQVELNDSRCPLCNSSEEDAAHLFFHCTKTEPLW 120


>gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus]
          Length = 851

 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 37/101 (36%), Positives = 52/101 (51%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D W+W     G +   ++Y+ L  +    D+    WK L  L   P++K F WK  + RL
Sbjct: 491 DKWVWSLHPQGKARAGSVYSFLNGHT---DNCWDGWKQLWGLAVAPRVKTFLWKYFWKRL 547

Query: 213 PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335
           PT D L++ GL  S  C LC +  E   HLF+ C YS+ VW
Sbjct: 548 PTKDFLQQRGLTQSNLCALCGEAAENIQHLFFQCRYSKEVW 588


>ref|XP_020096969.1| uncharacterized protein LOC109716078, partial [Ananas comosus]
          Length = 1220

 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 37/101 (36%), Positives = 52/101 (51%)
 Frame = +3

Query: 33   DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
            D W+W     G +   ++Y+ L  +    D+    WK L  L   P++K F WK  + RL
Sbjct: 1018 DKWVWSLHPQGKARAGSVYSFLNGHT---DNCWDGWKQLWGLAVAPRVKTFLWKYFWKRL 1074

Query: 213  PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335
            PT D L++ GL  S  C LC +  E   HLF+ C YS+ VW
Sbjct: 1075 PTKDFLQQRGLTQSNLCALCGEAAENIQHLFFQCRYSKEVW 1115


>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana
            (fragment)
          Length = 1365

 Score = 72.4 bits (176), Expect = 3e-10
 Identities = 34/109 (31%), Positives = 56/109 (51%), Gaps = 8/109 (7%)
 Frame = +3

Query: 33   DLWIWRRGMNGLS--------LNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFG 188
            D + W    NGL         L+K +++ L     V+   +  +  +  LHT PKI++F 
Sbjct: 999  DSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVKPSVNSLFDKIWNLHTAPKIRIFL 1058

Query: 189  WKLHYGRLPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335
            WK  +G +P  DRL+  G+R+   C +C+ E ET +H+ + C  +  VW
Sbjct: 1059 WKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHILFECPLARQVW 1107


>ref|XP_006603210.1| PREDICTED: uncharacterized protein LOC102670384 [Glycine max]
          Length = 224

 Score = 69.7 bits (169), Expect = 3e-10
 Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D W+W    NG+   K+ YN +    +++D  D  +  L  L   PK+  F W+L + RL
Sbjct: 22  DTWLWGAEPNGIFSTKSAYNLIKAEQILEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 80

Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335
           PT D L +  ++ ++  C LC  + ETA HLF+ C     +W
Sbjct: 81  PTKDNLSRRQIQLDNDLCPLCQTQPETASHLFFTCDKVLPLW 122


>ref|XP_019447270.1| PREDICTED: uncharacterized protein LOC109350495 [Lupinus
            angustifolius]
          Length = 1596

 Score = 72.4 bits (176), Expect = 3e-10
 Identities = 41/102 (40%), Positives = 52/102 (50%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33   DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMF-GWKLHYGR 209
            D   W+   +GL   K  Y  L    V       NW A+    T P  K F  W+L   +
Sbjct: 1354 DKLAWKSSTHGLLSAKDAYLHLNPGSV-----QHNWGAILWFDTIPPSKSFTAWRLLNNK 1408

Query: 210  LPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335
            +PT D LK+ G   ++ C LCNKEEETA HLF+NC +S  VW
Sbjct: 1409 MPTDDNLKRRGCVMASICNLCNKEEETATHLFFNCPFSVYVW 1450


>ref|XP_009799064.1| PREDICTED: uncharacterized protein LOC104245194 [Nicotiana
           sylvestris]
 ref|XP_016448778.1| PREDICTED: uncharacterized protein LOC107773866 [Nicotiana tabacum]
          Length = 383

 Score = 70.9 bits (172), Expect = 5e-10
 Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 1/98 (1%)
 Frame = +3

Query: 48  RRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*-QLHTKPKIKMFGWKLHYGRLPTFD 224
           +R  +G SL K IY  LL      D    NWK L  Q +T+PK +   W L +GRL   D
Sbjct: 238 QRNNSGTSLTKHIYLQLL-----GDRPHVNWKCLMFQNNTRPKAQFTMWMLLHGRLLITD 292

Query: 225 RLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVWR 338
           RL++ G+     C LC+  +E+ +HLF NC +++++WR
Sbjct: 293 RLRQWGMAVETQCALCHDRDESREHLFVNCVFTKTLWR 330


>ref|XP_006605024.1| PREDICTED: uncharacterized protein LOC102662505 [Glycine max]
          Length = 299

 Score = 68.9 bits (167), Expect = 1e-09
 Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D W+W    NG+   K+ YN +    +++D  D  +  L  L   PK+  F W+L + RL
Sbjct: 97  DTWLWGAEPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 155

Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335
           PT D L +  ++ ++  C LC  + ETA HLF+ C     +W
Sbjct: 156 PTKDNLSRRQIQLDNDLCPLCQTQPETASHLFFTCDKVLPLW 197


>gb|KHN23636.1| hypothetical protein glysoja_044901, partial [Glycine soja]
          Length = 139

 Score = 65.9 bits (159), Expect = 1e-09
 Identities = 39/105 (37%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
 Frame = +3

Query: 30  ADLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGR 209
           AD WIW++  +G+      Y   LM  +  D  D ++  L +L   PK K+F W+L   R
Sbjct: 36  ADSWIWKQHSSGIYSTNTAYK-FLMEEIRGDPVDGSFVFLWKLKIPPKAKIFTWRLIKDR 94

Query: 210 LPTFDRLKKMGLRNSAT---CTLCNKEEETADHLFYNCTYSESVW 335
           LPT  +L   G +   T   C LCN  EE A HLF+NC+    +W
Sbjct: 95  LPT--KLNLRGRQVEITDPMCPLCNNSEEDAAHLFFNCSKVLPLW 137


>ref|NP_189164.1| Ribonuclease H-like superfamily protein [Arabidopsis thaliana]
 dbj|BAB02086.1| reverse transcriptase-like protein [Arabidopsis thaliana]
 gb|AEE77003.1| Ribonuclease H-like superfamily protein [Arabidopsis thaliana]
          Length = 343

 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 31/62 (50%), Positives = 41/62 (66%)
 Frame = +3

Query: 153 QLHTKPKIKMFGWKLHYGRLPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESV 332
           +L T PKIK F WKL  G L T D LK+  +RN   C  C +E+ET+ HLF++C Y++ V
Sbjct: 20  KLKTAPKIKHFLWKLLSGALATGDNLKRRHIRNHPQCHRCCQEDETSQHLFFDCFYAQQV 79

Query: 333 WR 338
           WR
Sbjct: 80  WR 81


>gb|ACF78995.1| unknown [Zea mays]
 gb|AIB05810.1| orphans transcription factor, partial [Zea mays]
          Length = 572

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 37/101 (36%), Positives = 50/101 (49%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D  I+R   NG    KA Y  L +  V  +  +R WK        PK + F W + + + 
Sbjct: 376 DKHIFRLAANGKYSTKAAYEGLFIGSVEFEPFERIWKTW----APPKCRFFLWLVAHKKC 431

Query: 213 PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335
            T DRL+K GL ++  C LC +E ET DHL  NC +S   W
Sbjct: 432 WTADRLEKRGLDHTEKCPLCEQERETIDHLLVNCVFSRECW 472


>ref|XP_014626117.1| PREDICTED: uncharacterized protein LOC106796982 [Glycine max]
          Length = 247

 Score = 67.4 bits (163), Expect = 3e-09
 Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D   W+   NG+   K+ Y  +L      D  D   K++ +L   PK+  F W+    RL
Sbjct: 45  DFLCWKPDTNGIFSTKSAYK-VLQESHHSDSEDNVLKSMWKLKIPPKVSAFSWRFFKNRL 103

Query: 213 PTFDRLKKMGLRNSA-TCTLCNKEEETADHLFYNCTYSESVW 335
           PT D L+K  +  S+ +C LC+ EEE+  HL +NC  + S+W
Sbjct: 104 PTRDNLRKRQVTMSSYSCPLCDHEEESIYHLMFNCEKTRSLW 145


>ref|XP_006577422.1| PREDICTED: uncharacterized protein LOC102666164 [Glycine max]
          Length = 224

 Score = 66.6 bits (161), Expect = 4e-09
 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D W+W    NG+   K+ YN +    +++D  D  +  L  L   PK+  F W+L + RL
Sbjct: 22  DTWLWGAEPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 80

Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335
           PT D L +  ++ ++  C LC  + ETA +LF+ C     +W
Sbjct: 81  PTKDNLSRRQIQLDNDLCPLCQTQPETASYLFFTCDKVLPLW 122


>ref|XP_022041224.1| uncharacterized protein LOC110943800 [Helianthus annuus]
          Length = 217

 Score = 66.2 bits (160), Expect = 5e-09
 Identities = 34/107 (31%), Positives = 55/107 (51%), Gaps = 5/107 (4%)
 Frame = +3

Query: 30  ADLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDAD----RNWKAL*QLHTKPKIKMFGWKL 197
           +D W+W+         K++  TL     + +DAD     NW       T  K  MF W+ 
Sbjct: 17  SDKWVWKNDDKHEFTVKSVRCTLASQLNLNEDADSFMWNNW-------TTNKCSMFVWRA 69

Query: 198 HYGRLPTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335
             GR+PT  +L++ G++ +S  C +C +E+ET DH+   C Y++ VW
Sbjct: 70  VQGRIPTTTQLRQRGMQISSIICKVCGREDETPDHVLVKCDYAKEVW 116


>ref|XP_021844830.1| uncharacterized protein LOC110784683 [Spinacia oleracea]
          Length = 737

 Score = 68.6 bits (166), Expect = 5e-09
 Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 3/101 (2%)
 Frame = +3

Query: 45  WRRGMNGLSLNKAIYNTLLMNCVVQ---DDADRNWKAL*QLHTKPKIKMFGWKLHYGRLP 215
           W     G    K+IYN L+M   +Q   D+ D+ WK L +    PK ++F W+L    + 
Sbjct: 374 WMANRTGQPTVKSIYNQLIMEKNLQTPLDNKDKFWKRLWKSELIPKWRIFTWRLLNEAIA 433

Query: 216 TFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVWR 338
           T  RL+K G+     C LC K +E   HLF +C  S  VW+
Sbjct: 434 TRSRLRKRGMMVEDCCVLCKKSQENDKHLFRDCPISSHVWK 474


>ref|XP_006603334.1| PREDICTED: uncharacterized protein LOC102660367 [Glycine max]
          Length = 247

 Score = 66.6 bits (161), Expect = 5e-09
 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D W+W    NG+   K+ YN +    +++D  D  +  L  L   PK+  F W+L + RL
Sbjct: 45  DTWLWGAEPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 103

Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335
           PT D L +  ++ ++  C LC  + ETA +LF+ C     +W
Sbjct: 104 PTKDNLSRRQIQLDNDLCPLCQTQPETASYLFFTCDKVLPLW 145


>ref|XP_014631438.1| PREDICTED: uncharacterized protein LOC106798777 [Glycine max]
          Length = 224

 Score = 66.2 bits (160), Expect = 5e-09
 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 1/102 (0%)
 Frame = +3

Query: 33  DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
           D W+W    NG+   K+ YN +    +++D  D  +  L  L   PK+  F W+L + RL
Sbjct: 22  DTWLWGAKPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 80

Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335
           PT D L +  ++ ++  C LC  + ETA +LF+ C     +W
Sbjct: 81  PTKDNLARRQIQLDNDLCPLCQTQPETASYLFFTCDKVLPLW 122


>ref|XP_013738859.1| uncharacterized protein LOC106441605 [Brassica napus]
          Length = 279

 Score = 67.0 bits (162), Expect = 5e-09
 Identities = 28/69 (40%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
 Frame = +3

Query: 135 NW-KAL*QLHTKPKIKMFGWKLHYGRLPTFDRLKKMGLRNSATCTLCNKEEETADHLFYN 311
           NW K +  +H+ PK   F W +   RL T D+++   +   ATC LCN++EET DHLF+ 
Sbjct: 111 NWSKGIWFVHSTPKYSFFVWIVMRNRLQTGDKMRLWNVGIDATCILCNEDEETCDHLFFG 170

Query: 312 CTYSESVWR 338
           C Y++ +W+
Sbjct: 171 CRYTKQIWK 179


>gb|ONM61175.1| hypothetical protein ZEAMMB73_Zm00001d022590 [Zea mays]
 gb|ONM61177.1| hypothetical protein ZEAMMB73_Zm00001d022590 [Zea mays]
          Length = 1188

 Score = 68.6 bits (166), Expect = 5e-09
 Identities = 35/101 (34%), Positives = 53/101 (52%)
 Frame = +3

Query: 33   DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212
            D  I+R   +G+   KA YN L +   +     + W+ + +  + PK K F W    GR 
Sbjct: 992  DKHIFRFANDGIYSAKAAYNGLFIGSTMA----KYWELIWKTWSPPKCKFFLWLADLGRC 1047

Query: 213  PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335
             T DRL+K GL +   C LC++E+ET DH+   C ++ S W
Sbjct: 1048 WTADRLQKRGLSHPDKCVLCDQEQETIDHILVGCVFARSFW 1088


Top