BLASTX nr result
ID: Cheilocostus21_contig00041831
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cheilocostus21_contig00041831 (1237 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_020096969.1| uncharacterized protein LOC109716078, partia... 88 3e-15 gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus] 88 3e-15 ref|XP_010667870.1| PREDICTED: uncharacterized protein LOC104884... 79 9e-13 ref|XP_006606655.1| PREDICTED: uncharacterized protein LOC102668... 78 1e-12 ref|XP_006593201.1| PREDICTED: uncharacterized protein LOC102665... 76 9e-12 gb|PRQ37611.1| putative reverse transcriptase zinc-binding domai... 76 1e-11 ref|XP_010525577.1| PREDICTED: uncharacterized protein LOC104803... 74 3e-11 ref|XP_019197208.1| PREDICTED: uncharacterized protein LOC109191... 75 9e-11 ref|XP_023878301.1| uncharacterized protein LOC111990748 [Quercu... 75 1e-10 gb|OMO85278.1| hypothetical protein CCACVL1_10296 [Corchorus cap... 75 2e-10 ref|XP_006574189.1| PREDICTED: uncharacterized protein LOC102665... 72 2e-10 ref|XP_019166531.1| PREDICTED: uncharacterized protein LOC109162... 74 2e-10 ref|XP_020088996.1| uncharacterized protein LOC109710675 [Ananas... 64 2e-10 ref|XP_014630554.1| PREDICTED: uncharacterized protein LOC106798... 70 4e-10 ref|XP_004240331.1| PREDICTED: uncharacterized protein LOC101255... 68 5e-10 gb|KHN14498.1| Putative ribonuclease H protein, partial [Glycine... 69 6e-10 ref|XP_017251469.1| PREDICTED: uncharacterized protein LOC108222... 72 6e-10 ref|XP_021852226.1| uncharacterized protein LOC110791770, partia... 68 7e-10 gb|PNX58727.1| ribonuclease H [Trifolium pratense] 71 8e-10 ref|XP_019175849.1| PREDICTED: uncharacterized protein LOC109171... 72 8e-10 >ref|XP_020096969.1| uncharacterized protein LOC109716078, partial [Ananas comosus] Length = 1220 Score = 88.2 bits (217), Expect(2) = 3e-15 Identities = 43/117 (36%), Positives = 63/117 (53%) Frame = +1 Query: 880 WRGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFF 1059 W GWK LW L VAP++K F WK +LPT D L + G+ C LCG E+I+HLFF Sbjct: 1047 WDGWKQLWGLAVAPRVKTFLWKYFWKRLPTKDFLQQRGLTQSNLCALCGEAAENIQHLFF 1106 Query: 1060 QCEYILNVWLHIECNYGFKLHISFAWGSGLWILDSTDPHLHFRSLILILF*QVWKTR 1230 QC Y VW + ++G +++ G W+ L +++I + +WK+R Sbjct: 1107 QCRYSKEVWHIFQLDWGKVINVQ-QLHDGCWLTSKVPNDL--KAMIASILWCIWKSR 1160 Score = 23.5 bits (49), Expect(2) = 3e-15 Identities = 14/39 (35%), Positives = 18/39 (46%) Frame = +3 Query: 744 DLHYKITSINLISPPSVDNWVWKERTNGLPLNKSIYNSL 860 DL IT++ L P D WVW G S+Y+ L Sbjct: 1003 DLVDAITNLQLGEGP--DKWVWSLHPQGKARAGSVYSFL 1039 >gb|OAY74722.1| putative ribonuclease H protein [Ananas comosus] Length = 851 Score = 88.2 bits (217), Expect(2) = 3e-15 Identities = 43/117 (36%), Positives = 63/117 (53%) Frame = +1 Query: 880 WRGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFF 1059 W GWK LW L VAP++K F WK +LPT D L + G+ C LCG E+I+HLFF Sbjct: 520 WDGWKQLWGLAVAPRVKTFLWKYFWKRLPTKDFLQQRGLTQSNLCALCGEAAENIQHLFF 579 Query: 1060 QCEYILNVWLHIECNYGFKLHISFAWGSGLWILDSTDPHLHFRSLILILF*QVWKTR 1230 QC Y VW + ++G +++ G W+ L +++I + +WK+R Sbjct: 580 QCRYSKEVWHIFQLDWGKVINVQ-QLHDGCWLTSKVPNDL--KAMIASILWCIWKSR 633 Score = 23.5 bits (49), Expect(2) = 3e-15 Identities = 14/39 (35%), Positives = 18/39 (46%) Frame = +3 Query: 744 DLHYKITSINLISPPSVDNWVWKERTNGLPLNKSIYNSL 860 DL IT++ L P D WVW G S+Y+ L Sbjct: 476 DLVDAITNLQLGEGP--DKWVWSLHPQGKARAGSVYSFL 512 >ref|XP_010667870.1| PREDICTED: uncharacterized protein LOC104884866 [Beta vulgaris subsp. vulgaris] Length = 259 Score = 78.6 bits (192), Expect = 9e-13 Identities = 33/66 (50%), Positives = 42/66 (63%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQCE 1068 WK +W VAPK+K+F WKLLH LP D L+R G+ K CP CG +ES++HL QC+ Sbjct: 150 WKKVWRAKVAPKVKLFGWKLLHNGLPVNDNLARRGVIIDKQCPRCGDGEESVEHLLMQCD 209 Query: 1069 YILNVW 1086 VW Sbjct: 210 VSKQVW 215 >ref|XP_006606655.1| PREDICTED: uncharacterized protein LOC102668453 [Glycine max] Length = 247 Score = 77.8 bits (190), Expect = 1e-12 Identities = 40/83 (48%), Positives = 50/83 (60%), Gaps = 1/83 (1%) Frame = +1 Query: 841 NPFTTLFSPIKDRWRGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGI-CNPKTCP 1017 N T SP DR R ++LWN+ + PK VF+WKLL +LPT LSR G+ CP Sbjct: 64 NLLITPSSPTLDR-RTSQLLWNMKIPPKHAVFTWKLLSCRLPTRANLSRRGVNIQDTACP 122 Query: 1018 LCGIVDESIKHLFFQCEYILNVW 1086 LCG V E + HLFF C+ IL +W Sbjct: 123 LCGYVQEEVGHLFFNCKKILGLW 145 >ref|XP_006593201.1| PREDICTED: uncharacterized protein LOC102665828 [Glycine max] Length = 292 Score = 76.3 bits (186), Expect = 9e-12 Identities = 41/83 (49%), Positives = 51/83 (61%), Gaps = 1/83 (1%) Frame = +1 Query: 841 NPFTTLFSPIKDRWRGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKT-CP 1017 N TL SP DR R ++LWN+ + PK VF+WKLL +LPT LSR + T CP Sbjct: 109 NLLITLSSPALDR-RTSQLLWNMKIPPKHAVFTWKLLSGRLPTRANLSRRRVNIQDTACP 167 Query: 1018 LCGIVDESIKHLFFQCEYILNVW 1086 LCG V E + HLFF C+ IL +W Sbjct: 168 LCGDVQEEVGHLFFNCKKILGLW 190 >gb|PRQ37611.1| putative reverse transcriptase zinc-binding domain-containing protein [Rosa chinensis] Length = 308 Score = 75.9 bits (185), Expect = 1e-11 Identities = 35/63 (55%), Positives = 41/63 (65%) Frame = +1 Query: 898 LWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQCEYIL 1077 LW LNV PKIK+F W LL +L T DRLSRFGI N +C LC +E+ HLF CE+ Sbjct: 236 LWKLNVQPKIKIFGWLLLRGRLKTRDRLSRFGIINDNSCLLCNRDNETADHLFGYCEFTK 295 Query: 1078 NVW 1086 VW Sbjct: 296 EVW 298 >ref|XP_010525577.1| PREDICTED: uncharacterized protein LOC104803346 [Tarenaya hassleriana] Length = 269 Score = 74.3 bits (181), Expect = 3e-11 Identities = 31/74 (41%), Positives = 44/74 (59%) Frame = +1 Query: 880 WRGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFF 1059 WR +W P++ +F W+++H +LPT DRL R+G+ + TC LC DES HLFF Sbjct: 106 WRS--TVWFKQSTPRMAIFMWQMMHVRLPTKDRLMRWGVASVMTCGLCDAEDESHNHLFF 163 Query: 1060 QCEYILNVWLHIEC 1101 +C Y VW + C Sbjct: 164 ECRYSAAVWSYYAC 177 >ref|XP_019197208.1| PREDICTED: uncharacterized protein LOC109191092 [Ipomoea nil] Length = 1295 Score = 75.5 bits (184), Expect = 9e-11 Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 2/117 (1%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDR-LSRFGICNPKTCPLCGIVDESIKHLFFQC 1065 W+ +WNL V PK+K F W L +LPT D L + C+P C +CG +ES+ HLF C Sbjct: 974 WRGMWNLRVPPKVKCFFWNLCTKRLPTKDALLIKHVPCDP-VCVMCGKANESVVHLFINC 1032 Query: 1066 EYILNVWLHIECNYGFKLHISFAWGSGLWILDS-TDPHLHFRSLILILF*QVWKTRN 1233 EY WL + N+ +S+ +W+ + T P R ++++ +W RN Sbjct: 1033 EYAHKCWLMLNANW----ILSYVDSINVWLEEMWTIPSAEMREKLMMVAWAIWGARN 1085 >ref|XP_023878301.1| uncharacterized protein LOC111990748 [Quercus suber] Length = 1325 Score = 75.1 bits (183), Expect = 1e-10 Identities = 31/66 (46%), Positives = 41/66 (62%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQCE 1068 WK LW LN+ KIK+F+W+ LPT+D +S+ GIC TCP+CG+V E + H CE Sbjct: 1008 WKKLWLLNLPGKIKIFAWRACVDGLPTYDNISKRGICCSSTCPICGLVTEDVNHALLYCE 1067 Query: 1069 YILNVW 1086 VW Sbjct: 1068 AASLVW 1073 >gb|OMO85278.1| hypothetical protein CCACVL1_10296 [Corchorus capsularis] Length = 2943 Score = 74.7 bits (182), Expect = 2e-10 Identities = 40/123 (32%), Positives = 63/123 (51%), Gaps = 1/123 (0%) Frame = +1 Query: 868 IKDRWRGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIK 1047 I+DR W+ +W V PK+K F W++ H LP+ L R G+ + CP+CG +++S+ Sbjct: 67 IEDRTDYWRTIWGAPVQPKVKFFLWRVRHNILPSKINLQRRGVPVDELCPVCGSIEDSLV 126 Query: 1048 HLFFQCEYILNVWLHIECNYGFKLHISFAWGSGLW-ILDSTDPHLHFRSLILILF*QVWK 1224 H FF C + VW + C + ++ + G LW L + L +L+ L VW Sbjct: 127 HTFFTCSFSAKVWEN-SCPWVMEILQNMRDGDDLWNCLMAKAAQLGSLALMANLLWLVWH 185 Query: 1225 TRN 1233 RN Sbjct: 186 NRN 188 >ref|XP_006574189.1| PREDICTED: uncharacterized protein LOC102665138 [Glycine max] Length = 247 Score = 71.6 bits (174), Expect = 2e-10 Identities = 38/83 (45%), Positives = 49/83 (59%), Gaps = 1/83 (1%) Frame = +1 Query: 841 NPFTTLFSPIKDRWRGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGI-CNPKTCP 1017 N T SP D+ R ++LWN+ + PK VF+WKLL +LPT LSR G+ CP Sbjct: 64 NLLITPSSPALDK-RTSQLLWNMKIPPKHAVFTWKLLSGRLPTRANLSRRGVNIQDTACP 122 Query: 1018 LCGIVDESIKHLFFQCEYILNVW 1086 LCG V E + HLFF + IL +W Sbjct: 123 LCGDVQEEVGHLFFNSKKILGLW 145 >ref|XP_019166531.1| PREDICTED: uncharacterized protein LOC109162266 [Ipomoea nil] Length = 1590 Score = 74.3 bits (181), Expect = 2e-10 Identities = 41/117 (35%), Positives = 62/117 (52%), Gaps = 2/117 (1%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGI-CNPKTCPLCGIVDESIKHLFFQC 1065 W+ LW L + PK+K F W+L +LPT D L I C+P C LCG +E + HLF C Sbjct: 1036 WRGLWYLRIPPKVKCFFWRLCTLRLPTKDVLMTKRIQCDP-VCVLCGKANECVAHLFANC 1094 Query: 1066 EYILNVWLHIECNYGFKLHISF-AWGSGLWILDSTDPHLHFRSLILILF*QVWKTRN 1233 EY N WLH+ ++ S AW +W + P+ ++++ + +W+ RN Sbjct: 1095 EYAHNCWLHLNADWEMGYVDSLNAWLYEMW---AVLPNKILEQVVMVAW-AIWEARN 1147 >ref|XP_020088996.1| uncharacterized protein LOC109710675 [Ananas comosus] Length = 1113 Score = 63.5 bits (153), Expect(2) = 2e-10 Identities = 37/119 (31%), Positives = 53/119 (44%), Gaps = 4/119 (3%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQCE 1068 W +LW+L VAP++K F WKLL +LPT +R C C E H+F C Sbjct: 790 WIVLWSLPVAPRVKNFLWKLLWNRLPTNERCYSLNSAPSPFCIYCS-TPEDQNHIFLDCI 848 Query: 1069 YILNVWLHIECNYGFKLHISFAWGSGLWILDSTD----PHLHFRSLILILF*QVWKTRN 1233 +W + + G + W + WI + + R+LI F Q+WK RN Sbjct: 849 NARRIWDAVMSSTGILFSFNGDWITEEWIDEGKNLAVVQQQFIRALIANTFWQIWKERN 907 Score = 31.6 bits (70), Expect(2) = 2e-10 Identities = 15/47 (31%), Positives = 22/47 (46%) Frame = +3 Query: 720 KLTNWCCRDLHYKITSINLISPPSVDNWVWKERTNGLPLNKSIYNSL 860 +L W L + I I L D W+W + +G P KSIY+ + Sbjct: 732 RLVEWFGPILAHNICKIILSPDNGSDEWIWAPKKDGKPSVKSIYHHI 778 >ref|XP_014630554.1| PREDICTED: uncharacterized protein LOC106798474 [Glycine max] Length = 246 Score = 70.5 bits (171), Expect = 4e-10 Identities = 32/66 (48%), Positives = 41/66 (62%), Gaps = 1/66 (1%) Frame = +1 Query: 892 KILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKT-CPLCGIVDESIKHLFFQCE 1068 KI+WNLNV P+ +FSW+L+ +LPT L R + T CPLCG E + HLFF CE Sbjct: 79 KIVWNLNVPPRAVIFSWRLILDRLPTRRNLLRRNVQKQDTSCPLCGNAQEEVDHLFFNCE 138 Query: 1069 YILNVW 1086 L +W Sbjct: 139 MTLGLW 144 >ref|XP_004240331.1| PREDICTED: uncharacterized protein LOC101255200 [Solanum lycopersicum] Length = 138 Score = 67.8 bits (164), Expect = 5e-10 Identities = 33/67 (49%), Positives = 41/67 (61%), Gaps = 1/67 (1%) Frame = +1 Query: 889 WKILWNLNVA-PKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQC 1065 W+ L N N A PK W LL+ KL T DRL+++G+ KTC LC DESI H+F QC Sbjct: 23 WRCLMNKNAARPKATFTLWILLNRKLATVDRLAKWGMALDKTCVLCKSADESIDHMFIQC 82 Query: 1066 EYILNVW 1086 +Y VW Sbjct: 83 QYAGEVW 89 >gb|KHN14498.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 171 Score = 68.6 bits (166), Expect = 6e-10 Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 1/67 (1%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDRL-SRFGICNPKTCPLCGIVDESIKHLFFQC 1065 +KI+W L + P+ VFSW+L+ +LPT D L SR + CPLCG V E HLFF C Sbjct: 100 FKIIWKLKIPPRAAVFSWRLIKDRLPTRDNLLSRNVVIQEAVCPLCGFVQEEAGHLFFNC 159 Query: 1066 EYILNVW 1086 + + +W Sbjct: 160 KMKIGLW 166 >ref|XP_017251469.1| PREDICTED: uncharacterized protein LOC108222085 [Daucus carota subsp. sativus] Length = 378 Score = 71.6 bits (174), Expect = 6e-10 Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 2/119 (1%) Frame = +1 Query: 883 RGWKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQ 1062 +GW +W LN+ K+++F+W++ +P RLS GI P CP+C E + HLFF Sbjct: 57 QGWSKIWKLNLHHKVRIFTWRICRNSIPVRTRLSSRGITLPLECPMCDKAPEDMLHLFFN 116 Query: 1063 CEYILNVWLHIECNYGFKLHISFAWGSGLWILDSTD--PHLHFRSLILILF*QVWKTRN 1233 C++ L ++ +Y I A W+LD P F LI +L VW RN Sbjct: 117 CDFALECRNNVGLSYDMTGVIDVAG----WLLDKIVELPSDEFGKLITVLL-GVWFWRN 170 >ref|XP_021852226.1| uncharacterized protein LOC110791770, partial [Spinacia oleracea] Length = 166 Score = 68.2 bits (165), Expect = 7e-10 Identities = 29/77 (37%), Positives = 46/77 (59%) Frame = +1 Query: 892 KILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQCEY 1071 K++WN PK + W +H KL T D+L++ GI +C +CG+ DE+ +HLFFQC+Y Sbjct: 24 KMVWNRLNIPKHRFICWLAVHSKLQTTDKLAKIGISQSASCLICGLDDETHQHLFFQCQY 83 Query: 1072 ILNVWLHIECNYGFKLH 1122 + + + GF +H Sbjct: 84 SKQIIIAVHQWIGFSIH 100 >gb|PNX58727.1| ribonuclease H [Trifolium pratense] Length = 330 Score = 70.9 bits (172), Expect = 8e-10 Identities = 48/122 (39%), Positives = 57/122 (46%), Gaps = 6/122 (4%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGI-CNPKTCPLCGIVDESIKHLFFQC 1065 WK LW + PK + W+LLH LP D L + GI CNP CP C E I H+F C Sbjct: 7 WKTLWQQKIPPKYQHLIWRLLHNALPVTDNLQKKGIMCNP-LCPRCNAKIEDINHVFKDC 65 Query: 1066 EYILNVWLHIECNYGF-KLHISFAWGSGLWILDSTDPHLHFRS----LILILF*QVWKTR 1230 + VW N F KL SF WI DS H +S LI + +WK R Sbjct: 66 IWAQQVWFASPLNINFEKLRTSFI----DWIHDSFS---HNQSDTVELISSICYHIWKAR 118 Query: 1231 NL 1236 NL Sbjct: 119 NL 120 >ref|XP_019175849.1| PREDICTED: uncharacterized protein LOC109171175 [Ipomoea nil] Length = 1236 Score = 72.4 bits (176), Expect = 8e-10 Identities = 39/115 (33%), Positives = 54/115 (46%) Frame = +1 Query: 889 WKILWNLNVAPKIKVFSWKLLHYKLPTFDRLSRFGICNPKTCPLCGIVDESIKHLFFQCE 1068 W +W L V PK KVF W+ L LPT D+L + N TCP CG+ +E + HLF C Sbjct: 957 WNNVWKLQVPPKWKVFLWRALLNILPTLDKLLIKRVININTCPSCGLHEECVMHLFCTCP 1016 Query: 1069 YILNVWLHIECNYGFKLHISFAWGSGLWILDSTDPHLHFRSLILILF*QVWKTRN 1233 Y L +W + + +F + W+ S+ R I L +W RN Sbjct: 1017 YALKLWQLSQLQIPPIANRNFVQWAEEWLGGSSGYSSEVRGQICGLLHTIWAARN 1071