BLASTX nr result
ID: Rehmannia32_contig00024155
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia32_contig00024155 (449 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNY13385.1| ribonuclease H [Trifolium pratense] 92 2e-19 gb|KYP63926.1| Putative ribonuclease H protein At1g65750 family ... 91 1e-18 ref|XP_020219657.1| uncharacterized protein LOC109802651 [Cajanu... 91 2e-18 gb|PNX56550.1| ribonuclease H, partial [Trifolium pratense] 87 3e-18 dbj|GAU30587.1| hypothetical protein TSUD_392780 [Trifolium subt... 91 3e-18 gb|PNX67203.1| ribonuclease H, partial [Trifolium pratense] 83 5e-18 dbj|GAU27143.1| hypothetical protein TSUD_104520 [Trifolium subt... 87 2e-17 gb|KYP60015.1| Putative ribonuclease H protein At1g65750 family ... 85 7e-17 ref|XP_019197208.1| PREDICTED: uncharacterized protein LOC109191... 87 8e-17 dbj|GAU50545.1| hypothetical protein TSUD_409890 [Trifolium subt... 82 1e-16 gb|KHN04733.1| hypothetical protein glysoja_045595, partial [Gly... 79 1e-16 dbj|GAU10749.1| hypothetical protein TSUD_425820, partial [Trifo... 82 2e-16 gb|PNX78963.1| ribonuclease H, partial [Trifolium pratense] 81 2e-16 dbj|GAU39484.1| hypothetical protein TSUD_279050 [Trifolium subt... 84 2e-16 dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subt... 79 3e-16 dbj|GAU47981.1| hypothetical protein TSUD_87850 [Trifolium subte... 80 4e-16 gb|PNX89307.1| ribonuclease H, partial [Trifolium pratense] 80 5e-16 dbj|GAU26279.1| hypothetical protein TSUD_224710 [Trifolium subt... 82 5e-16 gb|PNX79373.1| pentatricopeptide repeat-containing protein [Trif... 80 6e-16 dbj|GAU46416.1| hypothetical protein TSUD_402050 [Trifolium subt... 82 7e-16 >gb|PNY13385.1| ribonuclease H [Trifolium pratense] Length = 333 Score = 92.4 bits (228), Expect = 2e-19 Identities = 43/120 (35%), Positives = 64/120 (53%), Gaps = 3/120 (2%) Frame = +3 Query: 3 GYRLAVDCSINGSDGN---SWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPA 173 GYRL + N + SW LWN++ PPK +H +WR C DCLPT L +RH+ Sbjct: 131 GYRLIMHDKWNNMERRGDLSW--SVLWNIKAPPKTRHTLWRICRDCLPTHVRLLQRHVDC 188 Query: 174 ESCCVLCNEELEMGWHLFISCKYCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDK 353 C +CN+ E +H+F +C + CW AGL ++ +S+ +V E + N H D+ Sbjct: 189 VPNCPMCNDGAEDDFHVFFTCPQVVDCWTAAGLYDVIYNRLSSFNNVAELLLNICNHEDE 248 >gb|KYP63926.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 404 Score = 91.3 bits (225), Expect = 1e-18 Identities = 37/95 (38%), Positives = 57/95 (60%) Frame = +3 Query: 63 RNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKY 242 +N+WNL IP K+KHFIWR D LPTR NL ++ I S C CN ++E WH F SC Sbjct: 237 KNIWNLRIPHKIKHFIWRLMRDILPTRPNLQKKGIRCPSTCFRCNTDIENTWHTFFSCPS 296 Query: 243 CIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHY 347 +CW + L + + +++ + V I++ ++H+ Sbjct: 297 AKLCWQGSNLHSCILNLINSSDGVFNLISHILQHW 331 >ref|XP_020219657.1| uncharacterized protein LOC109802651 [Cajanus cajan] Length = 729 Score = 91.3 bits (225), Expect = 2e-18 Identities = 37/95 (38%), Positives = 57/95 (60%) Frame = +3 Query: 63 RNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKY 242 +N+WNL IP K+KHFIWR D LPTR NL ++ I S C CN ++E WH F SC Sbjct: 562 KNIWNLRIPHKIKHFIWRLMRDILPTRPNLQKKGIRCPSTCFRCNTDIENTWHTFFSCPS 621 Query: 243 CIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHY 347 +CW + L + + +++ + V I++ ++H+ Sbjct: 622 AKLCWQGSNLHSCILNLINSSDGVFNLISHILQHW 656 >gb|PNX56550.1| ribonuclease H, partial [Trifolium pratense] Length = 371 Score = 86.7 bits (213), Expect(2) = 3e-18 Identities = 36/83 (43%), Positives = 45/83 (54%) Frame = +3 Query: 69 LWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKYCI 248 LW PPK KH +WR C DCLPTR L R++P S C LC E WH+F C Sbjct: 72 LWRSRAPPKTKHLLWRVCRDCLPTRLRLRERYVPCPSECALCVNNTEDDWHVFFGCSESR 131 Query: 249 MCWNVAGLKTLVDRWVSNVESVK 317 WN AGL ++D + ++VK Sbjct: 132 QVWNEAGLGGVIDPRIRQYDNVK 154 Score = 32.7 bits (73), Expect(2) = 3e-18 Identities = 12/23 (52%), Positives = 14/23 (60%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRTHATANG 447 + WS+W RNN VWN TA G Sbjct: 175 IAWSIWHNRNNWVWNGVKDTAKG 197 >dbj|GAU30587.1| hypothetical protein TSUD_392780 [Trifolium subterraneum] Length = 529 Score = 90.5 bits (223), Expect = 3e-18 Identities = 41/101 (40%), Positives = 53/101 (52%) Frame = +3 Query: 51 SWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFI 230 SW +NLW + PPK KH +WR C +CLPTRS L + H+ C LCN E WH Sbjct: 272 SW--KNLWQIRAPPKAKHLLWRICRECLPTRSQLWQHHVQCPINCELCNNADEDAWHALF 329 Query: 231 SCKYCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDK 353 C W VAGL T++ + + SVKE I + + K Sbjct: 330 DCAEVKSSWAVAGLNTVITTRLQSCASVKEVIMDICSNESK 370 >gb|PNX67203.1| ribonuclease H, partial [Trifolium pratense] Length = 229 Score = 82.8 bits (203), Expect(2) = 5e-18 Identities = 36/107 (33%), Positives = 54/107 (50%) Frame = +3 Query: 33 NGSDGNSWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEM 212 N + +W N+W + PPK KH +WR C +CLPTR L R +P C LC+ E Sbjct: 97 NTTQQENW--NNIWKIHAPPKAKHLLWRICKECLPTRVRLHERRVPCTLLCPLCDHCNED 154 Query: 213 GWHLFISCKYCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDK 353 WH+ +C + AGL+ + V +++S KE I + D+ Sbjct: 155 DWHMLFTCNVSVQARQAAGLEITLSHRVQHMQSAKEVIFSICAGEDR 201 Score = 35.8 bits (81), Expect(2) = 5e-18 Identities = 13/20 (65%), Positives = 15/20 (75%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRTHAT 438 +IW LW RNN+VWN TH T Sbjct: 210 LIWVLWNNRNNKVWNDTHET 229 >dbj|GAU27143.1| hypothetical protein TSUD_104520 [Trifolium subterraneum] Length = 699 Score = 87.0 bits (214), Expect(2) = 2e-17 Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 3/120 (2%) Frame = +3 Query: 3 GYRLAVDCSINGS---DGNSWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPA 173 GY L + ++ S + W + LW ++ PPK KH +WR C CLPTRS L R++ Sbjct: 331 GYNLLLQSTLEASAIQEEEDW--KWLWKIQAPPKTKHLLWRICRGCLPTRSRLKERNVQC 388 Query: 174 ESCCVLCNEELEMGWHLFISCKYCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDK 353 + C +C EE E WHL C+ W AG+ ++ V + KE I ++ D+ Sbjct: 389 TTSCPICEEEEENDWHLLYDCETSKRAWRSAGIVHIIQPHVQQAITTKECIMKLCRNSDR 448 Score = 29.3 bits (64), Expect(2) = 2e-17 Identities = 11/18 (61%), Positives = 13/18 (72%) Frame = +1 Query: 370 AS*VIWSLWMERNNEVWN 423 A+ +IW LW RNN VWN Sbjct: 454 AAMLIWMLWKNRNNCVWN 471 >gb|KYP60015.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 282 Score = 84.7 bits (208), Expect = 7e-17 Identities = 36/95 (37%), Positives = 56/95 (58%) Frame = +3 Query: 63 RNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKY 242 +N+WNL IP K+KHFIWR D LPTR NL ++ I S C CN ++E WH F SC Sbjct: 176 KNIWNLCIPHKIKHFIWRLMQDILPTRPNLQKKGICYPSTCFRCNIDIENTWHTFFSCPS 235 Query: 243 CIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHY 347 +CW + L + + +++ V I++ ++++ Sbjct: 236 AKLCWQGSNLHSCILNLINSSNGVFNLISHILQYW 270 >ref|XP_019197208.1| PREDICTED: uncharacterized protein LOC109191092 [Ipomoea nil] Length = 1295 Score = 86.7 bits (213), Expect = 8e-17 Identities = 40/85 (47%), Positives = 52/85 (61%) Frame = +3 Query: 3 GYRLAVDCSINGSDGNSWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESC 182 GYRLAV +NG +G W R +WNL +PPKVK F W C LPT+ L +H+P + Sbjct: 958 GYRLAVGGEVNG-EGVVW--RGMWNLRVPPKVKCFFWNLCTKRLPTKDALLIKHVPCDPV 1014 Query: 183 CVLCNEELEMGWHLFISCKYCIMCW 257 CV+C + E HLFI+C+Y CW Sbjct: 1015 CVMCGKANESVVHLFINCEYAHKCW 1039 >dbj|GAU50545.1| hypothetical protein TSUD_409890 [Trifolium subterraneum] Length = 607 Score = 81.6 bits (200), Expect(2) = 1e-16 Identities = 35/100 (35%), Positives = 48/100 (48%) Frame = +3 Query: 69 LWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKYCI 248 LW ++ PPK KH +WR C +CLPTR+ L R + C +C E E WH C Sbjct: 287 LWKIQAPPKAKHLLWRICKECLPTRTRLRERFVNCPLECPMCGNETESDWHFLFDCMDSK 346 Query: 249 MCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDKWNVVK 368 W AGL+ ++ V +VKE + + D N K Sbjct: 347 RAWQTAGLEAIISSQVQQHMTVKEALLKLCRGTDVQNAGK 386 Score = 32.3 bits (72), Expect(2) = 1e-16 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRT 429 V+W+LW RNN VWN T Sbjct: 390 VVWALWNNRNNWVWNHT 406 >gb|KHN04733.1| hypothetical protein glysoja_045595, partial [Glycine soja] Length = 74 Score = 79.0 bits (193), Expect = 1e-16 Identities = 31/69 (44%), Positives = 43/69 (62%) Frame = +3 Query: 84 IPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKYCIMCWNV 263 +PPK+KHF+WR CLPTR+NL RR I + CV C E E WH+F++C W Sbjct: 1 VPPKLKHFLWRVTRGCLPTRTNLRRRGIDCTTGCVFCQEHFESEWHVFVACSKAREMWTA 60 Query: 264 AGLKTLVDR 290 AG+ L+++ Sbjct: 61 AGIHYLLEQ 69 >dbj|GAU10749.1| hypothetical protein TSUD_425820, partial [Trifolium subterraneum] Length = 179 Score = 81.6 bits (200), Expect = 2e-16 Identities = 34/84 (40%), Positives = 46/84 (54%) Frame = +3 Query: 63 RNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKY 242 ++LW + PPK KH +WR CLPTR L RH+P S C LCN + E WH+F C Sbjct: 96 QSLWTILAPPKAKHLLWRISRGCLPTRMRLQTRHVPCPSSCPLCNHDSEDEWHVFFDCDV 155 Query: 243 CIMCWNVAGLKTLVDRWVSNVESV 314 I AGL+ L+ + ++V Sbjct: 156 SIQARQTAGLEQLLQNQIQQHQNV 179 >gb|PNX78963.1| ribonuclease H, partial [Trifolium pratense] Length = 757 Score = 81.3 bits (199), Expect(2) = 2e-16 Identities = 37/109 (33%), Positives = 56/109 (51%) Frame = +3 Query: 27 SINGSDGNSWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEEL 206 S+N + W + LW + PPK KH IWR C CLPTR L R +P + C LCN + Sbjct: 587 SVNDLHQDGW--KCLWKIHAPPKAKHLIWRICKKCLPTRIRLQERCVPCPTECPLCNNNV 644 Query: 207 EMGWHLFISCKYCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDK 353 E WH +C + AGL ++ V ++ + +E + + K+ D+ Sbjct: 645 EDDWHFLFNCSDSVEARIAAGLDCIIADRVRHMSTAEEVLLDICKNEDR 693 Score = 32.0 bits (71), Expect(2) = 2e-16 Identities = 10/18 (55%), Positives = 14/18 (77%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRTH 432 ++WSLW RN++VWN H Sbjct: 702 LVWSLWNNRNSKVWNGEH 719 >dbj|GAU39484.1| hypothetical protein TSUD_279050 [Trifolium subterraneum] Length = 310 Score = 84.0 bits (206), Expect = 2e-16 Identities = 39/120 (32%), Positives = 62/120 (51%), Gaps = 3/120 (2%) Frame = +3 Query: 3 GYRLAVDCSINGSDG---NSWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPA 173 GYRL + +G +W R +W + PPK +H +WR C +C+PTR L +R++ Sbjct: 83 GYRLLMKERWKHKEGAVTKNW--RGVWAVNAPPKARHAMWRVCSECIPTRVRLVQRNVEC 140 Query: 174 ESCCVLCNEELEMGWHLFISCKYCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDK 353 +CC LCNE LE +H F C + W AGL ++ R + +V + + + D+ Sbjct: 141 NTCCPLCNEGLEDDYHAFALCPDVVASWEAAGLSNILIRRLPKFNNVSDLLFDICSGEDQ 200 >dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subterraneum] Length = 343 Score = 79.3 bits (194), Expect(2) = 3e-16 Identities = 40/121 (33%), Positives = 63/121 (52%), Gaps = 5/121 (4%) Frame = +3 Query: 6 YRLAVDCSINGSDGNSWVRRN-----LWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIP 170 YR+ V I+ S++R N +WN+++PPKVK+ IWR C CLPTR L + + Sbjct: 120 YRICVQELID----TSYLRVNGNWNLVWNIKVPPKVKNLIWRICRRCLPTRVRLRDKGVE 175 Query: 171 AESCCVLCNEELEMGWHLFISCKYCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYD 350 C LCNEE E H+F C W++ G +V ++N + ++ I + ++ Sbjct: 176 CTQTCALCNEENEDSEHIFFKCPSSRNVWSMTGFFHVVSNAINNNNNAQDIIFHILQQLS 235 Query: 351 K 353 K Sbjct: 236 K 236 Score = 33.1 bits (74), Expect(2) = 3e-16 Identities = 9/22 (40%), Positives = 15/22 (68%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRTHATAN 444 ++WS+W +RNN++WN N Sbjct: 245 ILWSIWKQRNNQIWNNVTDAQN 266 >dbj|GAU47981.1| hypothetical protein TSUD_87850 [Trifolium subterraneum] Length = 200 Score = 80.5 bits (197), Expect(2) = 4e-16 Identities = 31/97 (31%), Positives = 50/97 (51%) Frame = +3 Query: 63 RNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKY 242 +N+W + + PK KH +WR C DCLPTR L +++ C +C E+E+ WH C Sbjct: 5 KNIWKVHVLPKAKHLLWRICKDCLPTRVQLQEKNVQCPLDCPICEHEVEIEWHSLFDCND 64 Query: 243 CIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKHYDK 353 W AGL ++ + + + + +E I + DK Sbjct: 65 SRNAWQAAGLTLIISQHMQSFTTTRECILHICSETDK 101 Score = 31.6 bits (70), Expect(2) = 4e-16 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRT 429 +IW LW RNN VWN+T Sbjct: 110 LIWVLWNNRNNAVWNQT 126 >gb|PNX89307.1| ribonuclease H, partial [Trifolium pratense] Length = 426 Score = 80.1 bits (196), Expect(2) = 5e-16 Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 5/113 (4%) Frame = +3 Query: 3 GYRLAVDCSINGSDG-----NSWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHI 167 GY+ +D + G G W +LW + PPK KH +WR CLPTR L +H+ Sbjct: 280 GYKCMLD--VTGKVGVQPQHEDW--HSLWKIHAPPKAKHLLWRISKGCLPTRIRLQEKHV 335 Query: 168 PAESCCVLCNEELEMGWHLFISCKYCIMCWNVAGLKTLVDRWVSNVESVKEFI 326 P C LC+ ++E WH +C+ I AGL+T++ + + KE I Sbjct: 336 PCPLICPLCSHDMEDDWHFLFACETSIQAHYAAGLETIIQPRIQLAGNAKEII 388 Score = 31.6 bits (70), Expect(2) = 5e-16 Identities = 10/18 (55%), Positives = 13/18 (72%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRTH 432 ++W LW RNN+VWN H Sbjct: 406 LVWVLWNNRNNKVWNDEH 423 >dbj|GAU26279.1| hypothetical protein TSUD_224710 [Trifolium subterraneum] Length = 398 Score = 81.6 bits (200), Expect(2) = 5e-16 Identities = 32/70 (45%), Positives = 41/70 (58%) Frame = +3 Query: 69 LWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCKYCI 248 LW PPK KH +WR C +CLPTR+ L R++P S C LC ++E WH+F C Sbjct: 86 LWRSSAPPKTKHLLWRVCRECLPTRTRLRERYVPCPSECPLCLNDMEDDWHVFFGCSDSR 145 Query: 249 MCWNVAGLKT 278 WN AG +T Sbjct: 146 QVWNEAGKET 155 Score = 30.0 bits (66), Expect(2) = 5e-16 Identities = 10/23 (43%), Positives = 14/23 (60%) Frame = +1 Query: 379 VIWSLWMERNNEVWNRTHATANG 447 + W +W RNN +WN T +A G Sbjct: 165 IAWCIWHNRNNWLWNGTKDSAKG 187 >gb|PNX79373.1| pentatricopeptide repeat-containing protein [Trifolium pratense] Length = 188 Score = 80.5 bits (197), Expect = 6e-16 Identities = 35/91 (38%), Positives = 48/91 (52%), Gaps = 3/91 (3%) Frame = +3 Query: 3 GYRLAVDCSINGSDGN---SWVRRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPA 173 GYR+ ++ G SW + LW + PPK KH +WR C DCLPTR+ L + ++ Sbjct: 21 GYRILMEAKEAGRRRGIEGSW--KCLWQIRAPPKAKHILWRICRDCLPTRAQLRQHYVQC 78 Query: 174 ESCCVLCNEELEMGWHLFISCKYCIMCWNVA 266 + C LCN E E WH+ C+ CW A Sbjct: 79 PAACELCNGENEDTWHVLFDCEMISNCWTAA 109 >dbj|GAU46416.1| hypothetical protein TSUD_402050 [Trifolium subterraneum] Length = 1421 Score = 82.4 bits (202), Expect(2) = 7e-16 Identities = 35/95 (36%), Positives = 54/95 (56%) Frame = +3 Query: 60 RRNLWNLEIPPKVKHFIWRACHDCLPTRSNLGRRHIPAESCCVLCNEELEMGWHLFISCK 239 R ++++ PPK KHFIWR C +CLPTR L RH+ C LC E E WH+F +C Sbjct: 1088 RDGVYSIHAPPKAKHFIWRICKECLPTRMRLRSRHVQCPLECPLCLVEPEDDWHIFFNCD 1147 Query: 240 YCIMCWNVAGLKTLVDRWVSNVESVKEFIANFMKH 344 WN G+ ++ +++ +++++FI N H Sbjct: 1148 NNNEAWNAMGIDHVLHPRLNSSQNIRDFIFNVCCH 1182 Score = 28.9 bits (63), Expect(2) = 7e-16 Identities = 8/15 (53%), Positives = 11/15 (73%) Frame = +1 Query: 379 VIWSLWMERNNEVWN 423 ++W +W RNN VWN Sbjct: 1194 LLWMIWQNRNNAVWN 1208