BLASTX nr result
ID: Cheilocostus21_contig00033387
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cheilocostus21_contig00033387 (951 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PRQ37611.1| putative reverse transcriptase zinc-binding domai... 79 5e-13 ref|XP_014622286.1| PREDICTED: uncharacterized protein LOC106795... 71 8e-11 gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus] 74 1e-10 ref|XP_020096969.1| uncharacterized protein LOC109716078, partia... 74 1e-10 pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi... 72 3e-10 ref|XP_006603210.1| PREDICTED: uncharacterized protein LOC102670... 70 3e-10 ref|XP_019447270.1| PREDICTED: uncharacterized protein LOC109350... 72 3e-10 ref|XP_009799064.1| PREDICTED: uncharacterized protein LOC104245... 71 5e-10 ref|XP_006605024.1| PREDICTED: uncharacterized protein LOC102662... 69 1e-09 gb|KHN23636.1| hypothetical protein glysoja_044901, partial [Gly... 66 1e-09 ref|NP_189164.1| Ribonuclease H-like superfamily protein [Arabid... 69 2e-09 gb|ACF78995.1| unknown [Zea mays] >gi|645168529|gb|AIB05810.1| o... 69 2e-09 ref|XP_014626117.1| PREDICTED: uncharacterized protein LOC106796... 67 3e-09 ref|XP_006577422.1| PREDICTED: uncharacterized protein LOC102666... 67 4e-09 ref|XP_022041224.1| uncharacterized protein LOC110943800 [Helian... 66 5e-09 ref|XP_021844830.1| uncharacterized protein LOC110784683 [Spinac... 69 5e-09 ref|XP_006603334.1| PREDICTED: uncharacterized protein LOC102660... 67 5e-09 ref|XP_014631438.1| PREDICTED: uncharacterized protein LOC106798... 66 5e-09 ref|XP_013738859.1| uncharacterized protein LOC106441605 [Brassi... 67 5e-09 gb|ONM61175.1| hypothetical protein ZEAMMB73_Zm00001d022590 [Zea... 69 5e-09 >gb|PRQ37611.1| putative reverse transcriptase zinc-binding domain-containing protein [Rosa chinensis] Length = 308 Score = 79.0 bits (193), Expect = 5e-13 Identities = 42/103 (40%), Positives = 61/103 (59%), Gaps = 1/103 (0%) Frame = +3 Query: 33 DLWIWRRGMNG-LSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGR 209 D +IW NG S+ A + L + + + + L +L+ +PKIK+FGW L GR Sbjct: 199 DEYIWGPSSNGKFSIKSATW--LQYDHLRKHSQSKLINKLWKLNVQPKIKIFGWLLLRGR 256 Query: 210 LPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVWR 338 L T DRL + G+ N +C LCN++ ETADHLF C +++ VWR Sbjct: 257 LKTRDRLSRFGIINDNSCLLCNRDNETADHLFGYCEFTKEVWR 299 >ref|XP_014622286.1| PREDICTED: uncharacterized protein LOC106795879 [Glycine max] Length = 221 Score = 71.2 bits (173), Expect = 8e-11 Identities = 37/102 (36%), Positives = 54/102 (52%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D+W+W+ +G K+ Y LLM + DR + L L K +F W+L RL Sbjct: 20 DVWVWKADSSGNYSTKSAYR-LLMEATGEFPVDRTYVDLWNLKIPSKAIVFAWRLIKDRL 78 Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335 PT+ L+ + N + C LCN EE A HLF++CT +E +W Sbjct: 79 PTWTNLRVRQVELNDSRCPLCNSSEEDAAHLFFHCTKTEPLW 120 >gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus] Length = 851 Score = 73.6 bits (179), Expect = 1e-10 Identities = 37/101 (36%), Positives = 52/101 (51%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+W G + ++Y+ L + D+ WK L L P++K F WK + RL Sbjct: 491 DKWVWSLHPQGKARAGSVYSFLNGHT---DNCWDGWKQLWGLAVAPRVKTFLWKYFWKRL 547 Query: 213 PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335 PT D L++ GL S C LC + E HLF+ C YS+ VW Sbjct: 548 PTKDFLQQRGLTQSNLCALCGEAAENIQHLFFQCRYSKEVW 588 >ref|XP_020096969.1| uncharacterized protein LOC109716078, partial [Ananas comosus] Length = 1220 Score = 73.6 bits (179), Expect = 1e-10 Identities = 37/101 (36%), Positives = 52/101 (51%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+W G + ++Y+ L + D+ WK L L P++K F WK + RL Sbjct: 1018 DKWVWSLHPQGKARAGSVYSFLNGHT---DNCWDGWKQLWGLAVAPRVKTFLWKYFWKRL 1074 Query: 213 PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335 PT D L++ GL S C LC + E HLF+ C YS+ VW Sbjct: 1075 PTKDFLQQRGLTQSNLCALCGEAAENIQHLFFQCRYSKEVW 1115 >pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana (fragment) Length = 1365 Score = 72.4 bits (176), Expect = 3e-10 Identities = 34/109 (31%), Positives = 56/109 (51%), Gaps = 8/109 (7%) Frame = +3 Query: 33 DLWIWRRGMNGLS--------LNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFG 188 D + W NGL L+K +++ L V+ + + + LHT PKI++F Sbjct: 999 DSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVKPSVNSLFDKIWNLHTAPKIRIFL 1058 Query: 189 WKLHYGRLPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335 WK +G +P DRL+ G+R+ C +C+ E ET +H+ + C + VW Sbjct: 1059 WKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHILFECPLARQVW 1107 >ref|XP_006603210.1| PREDICTED: uncharacterized protein LOC102670384 [Glycine max] Length = 224 Score = 69.7 bits (169), Expect = 3e-10 Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+W NG+ K+ YN + +++D D + L L PK+ F W+L + RL Sbjct: 22 DTWLWGAEPNGIFSTKSAYNLIKAEQILEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 80 Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335 PT D L + ++ ++ C LC + ETA HLF+ C +W Sbjct: 81 PTKDNLSRRQIQLDNDLCPLCQTQPETASHLFFTCDKVLPLW 122 >ref|XP_019447270.1| PREDICTED: uncharacterized protein LOC109350495 [Lupinus angustifolius] Length = 1596 Score = 72.4 bits (176), Expect = 3e-10 Identities = 41/102 (40%), Positives = 52/102 (50%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMF-GWKLHYGR 209 D W+ +GL K Y L V NW A+ T P K F W+L + Sbjct: 1354 DKLAWKSSTHGLLSAKDAYLHLNPGSV-----QHNWGAILWFDTIPPSKSFTAWRLLNNK 1408 Query: 210 LPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335 +PT D LK+ G ++ C LCNKEEETA HLF+NC +S VW Sbjct: 1409 MPTDDNLKRRGCVMASICNLCNKEEETATHLFFNCPFSVYVW 1450 >ref|XP_009799064.1| PREDICTED: uncharacterized protein LOC104245194 [Nicotiana sylvestris] ref|XP_016448778.1| PREDICTED: uncharacterized protein LOC107773866 [Nicotiana tabacum] Length = 383 Score = 70.9 bits (172), Expect = 5e-10 Identities = 38/98 (38%), Positives = 56/98 (57%), Gaps = 1/98 (1%) Frame = +3 Query: 48 RRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*-QLHTKPKIKMFGWKLHYGRLPTFD 224 +R +G SL K IY LL D NWK L Q +T+PK + W L +GRL D Sbjct: 238 QRNNSGTSLTKHIYLQLL-----GDRPHVNWKCLMFQNNTRPKAQFTMWMLLHGRLLITD 292 Query: 225 RLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVWR 338 RL++ G+ C LC+ +E+ +HLF NC +++++WR Sbjct: 293 RLRQWGMAVETQCALCHDRDESREHLFVNCVFTKTLWR 330 >ref|XP_006605024.1| PREDICTED: uncharacterized protein LOC102662505 [Glycine max] Length = 299 Score = 68.9 bits (167), Expect = 1e-09 Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+W NG+ K+ YN + +++D D + L L PK+ F W+L + RL Sbjct: 97 DTWLWGAEPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 155 Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335 PT D L + ++ ++ C LC + ETA HLF+ C +W Sbjct: 156 PTKDNLSRRQIQLDNDLCPLCQTQPETASHLFFTCDKVLPLW 197 >gb|KHN23636.1| hypothetical protein glysoja_044901, partial [Glycine soja] Length = 139 Score = 65.9 bits (159), Expect = 1e-09 Identities = 39/105 (37%), Positives = 54/105 (51%), Gaps = 3/105 (2%) Frame = +3 Query: 30 ADLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGR 209 AD WIW++ +G+ Y LM + D D ++ L +L PK K+F W+L R Sbjct: 36 ADSWIWKQHSSGIYSTNTAYK-FLMEEIRGDPVDGSFVFLWKLKIPPKAKIFTWRLIKDR 94 Query: 210 LPTFDRLKKMGLRNSAT---CTLCNKEEETADHLFYNCTYSESVW 335 LPT +L G + T C LCN EE A HLF+NC+ +W Sbjct: 95 LPT--KLNLRGRQVEITDPMCPLCNNSEEDAAHLFFNCSKVLPLW 137 >ref|NP_189164.1| Ribonuclease H-like superfamily protein [Arabidopsis thaliana] dbj|BAB02086.1| reverse transcriptase-like protein [Arabidopsis thaliana] gb|AEE77003.1| Ribonuclease H-like superfamily protein [Arabidopsis thaliana] Length = 343 Score = 68.9 bits (167), Expect = 2e-09 Identities = 31/62 (50%), Positives = 41/62 (66%) Frame = +3 Query: 153 QLHTKPKIKMFGWKLHYGRLPTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESV 332 +L T PKIK F WKL G L T D LK+ +RN C C +E+ET+ HLF++C Y++ V Sbjct: 20 KLKTAPKIKHFLWKLLSGALATGDNLKRRHIRNHPQCHRCCQEDETSQHLFFDCFYAQQV 79 Query: 333 WR 338 WR Sbjct: 80 WR 81 >gb|ACF78995.1| unknown [Zea mays] gb|AIB05810.1| orphans transcription factor, partial [Zea mays] Length = 572 Score = 69.3 bits (168), Expect = 2e-09 Identities = 37/101 (36%), Positives = 50/101 (49%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D I+R NG KA Y L + V + +R WK PK + F W + + + Sbjct: 376 DKHIFRLAANGKYSTKAAYEGLFIGSVEFEPFERIWKTW----APPKCRFFLWLVAHKKC 431 Query: 213 PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335 T DRL+K GL ++ C LC +E ET DHL NC +S W Sbjct: 432 WTADRLEKRGLDHTEKCPLCEQERETIDHLLVNCVFSRECW 472 >ref|XP_014626117.1| PREDICTED: uncharacterized protein LOC106796982 [Glycine max] Length = 247 Score = 67.4 bits (163), Expect = 3e-09 Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+ NG+ K+ Y +L D D K++ +L PK+ F W+ RL Sbjct: 45 DFLCWKPDTNGIFSTKSAYK-VLQESHHSDSEDNVLKSMWKLKIPPKVSAFSWRFFKNRL 103 Query: 213 PTFDRLKKMGLRNSA-TCTLCNKEEETADHLFYNCTYSESVW 335 PT D L+K + S+ +C LC+ EEE+ HL +NC + S+W Sbjct: 104 PTRDNLRKRQVTMSSYSCPLCDHEEESIYHLMFNCEKTRSLW 145 >ref|XP_006577422.1| PREDICTED: uncharacterized protein LOC102666164 [Glycine max] Length = 224 Score = 66.6 bits (161), Expect = 4e-09 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+W NG+ K+ YN + +++D D + L L PK+ F W+L + RL Sbjct: 22 DTWLWGAEPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 80 Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335 PT D L + ++ ++ C LC + ETA +LF+ C +W Sbjct: 81 PTKDNLSRRQIQLDNDLCPLCQTQPETASYLFFTCDKVLPLW 122 >ref|XP_022041224.1| uncharacterized protein LOC110943800 [Helianthus annuus] Length = 217 Score = 66.2 bits (160), Expect = 5e-09 Identities = 34/107 (31%), Positives = 55/107 (51%), Gaps = 5/107 (4%) Frame = +3 Query: 30 ADLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDAD----RNWKAL*QLHTKPKIKMFGWKL 197 +D W+W+ K++ TL + +DAD NW T K MF W+ Sbjct: 17 SDKWVWKNDDKHEFTVKSVRCTLASQLNLNEDADSFMWNNW-------TTNKCSMFVWRA 69 Query: 198 HYGRLPTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335 GR+PT +L++ G++ +S C +C +E+ET DH+ C Y++ VW Sbjct: 70 VQGRIPTTTQLRQRGMQISSIICKVCGREDETPDHVLVKCDYAKEVW 116 >ref|XP_021844830.1| uncharacterized protein LOC110784683 [Spinacia oleracea] Length = 737 Score = 68.6 bits (166), Expect = 5e-09 Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 3/101 (2%) Frame = +3 Query: 45 WRRGMNGLSLNKAIYNTLLMNCVVQ---DDADRNWKAL*QLHTKPKIKMFGWKLHYGRLP 215 W G K+IYN L+M +Q D+ D+ WK L + PK ++F W+L + Sbjct: 374 WMANRTGQPTVKSIYNQLIMEKNLQTPLDNKDKFWKRLWKSELIPKWRIFTWRLLNEAIA 433 Query: 216 TFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVWR 338 T RL+K G+ C LC K +E HLF +C S VW+ Sbjct: 434 TRSRLRKRGMMVEDCCVLCKKSQENDKHLFRDCPISSHVWK 474 >ref|XP_006603334.1| PREDICTED: uncharacterized protein LOC102660367 [Glycine max] Length = 247 Score = 66.6 bits (161), Expect = 5e-09 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+W NG+ K+ YN + +++D D + L L PK+ F W+L + RL Sbjct: 45 DTWLWGAEPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 103 Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335 PT D L + ++ ++ C LC + ETA +LF+ C +W Sbjct: 104 PTKDNLSRRQIQLDNDLCPLCQTQPETASYLFFTCDKVLPLW 145 >ref|XP_014631438.1| PREDICTED: uncharacterized protein LOC106798777 [Glycine max] Length = 224 Score = 66.2 bits (160), Expect = 5e-09 Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 1/102 (0%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D W+W NG+ K+ YN + +++D D + L L PK+ F W+L + RL Sbjct: 22 DTWLWGAKPNGIFSTKSAYNLIKAEQLLEDQ-DSGFHQLWDLKVPPKVLSFAWRLLWDRL 80 Query: 213 PTFDRLKKMGLR-NSATCTLCNKEEETADHLFYNCTYSESVW 335 PT D L + ++ ++ C LC + ETA +LF+ C +W Sbjct: 81 PTKDNLARRQIQLDNDLCPLCQTQPETASYLFFTCDKVLPLW 122 >ref|XP_013738859.1| uncharacterized protein LOC106441605 [Brassica napus] Length = 279 Score = 67.0 bits (162), Expect = 5e-09 Identities = 28/69 (40%), Positives = 43/69 (62%), Gaps = 1/69 (1%) Frame = +3 Query: 135 NW-KAL*QLHTKPKIKMFGWKLHYGRLPTFDRLKKMGLRNSATCTLCNKEEETADHLFYN 311 NW K + +H+ PK F W + RL T D+++ + ATC LCN++EET DHLF+ Sbjct: 111 NWSKGIWFVHSTPKYSFFVWIVMRNRLQTGDKMRLWNVGIDATCILCNEDEETCDHLFFG 170 Query: 312 CTYSESVWR 338 C Y++ +W+ Sbjct: 171 CRYTKQIWK 179 >gb|ONM61175.1| hypothetical protein ZEAMMB73_Zm00001d022590 [Zea mays] gb|ONM61177.1| hypothetical protein ZEAMMB73_Zm00001d022590 [Zea mays] Length = 1188 Score = 68.6 bits (166), Expect = 5e-09 Identities = 35/101 (34%), Positives = 53/101 (52%) Frame = +3 Query: 33 DLWIWRRGMNGLSLNKAIYNTLLMNCVVQDDADRNWKAL*QLHTKPKIKMFGWKLHYGRL 212 D I+R +G+ KA YN L + + + W+ + + + PK K F W GR Sbjct: 992 DKHIFRFANDGIYSAKAAYNGLFIGSTMA----KYWELIWKTWSPPKCKFFLWLADLGRC 1047 Query: 213 PTFDRLKKMGLRNSATCTLCNKEEETADHLFYNCTYSESVW 335 T DRL+K GL + C LC++E+ET DH+ C ++ S W Sbjct: 1048 WTADRLQKRGLSHPDKCVLCDQEQETIDHILVGCVFARSFW 1088