BLASTX nr result
ID: Rehmannia32_contig00005712
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia32_contig00005712 (418 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AJE29370.1| putative gag protein [Coffea canephora] 114 4e-27 gb|KYP36396.1| Retrovirus-related Pol polyprotein from transposo... 88 1e-17 gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo... 88 1e-17 dbj|GAU46320.1| hypothetical protein TSUD_401910 [Trifolium subt... 88 2e-17 gb|EOY18934.1| Uncharacterized protein TCM_043452 [Theobroma cacao] 82 7e-17 gb|EOY01950.1| Uncharacterized protein TCM_011728 [Theobroma cacao] 80 3e-16 gb|KYP74267.1| Retrovirus-related Pol polyprotein from transposo... 84 4e-16 gb|EOY05822.1| Uncharacterized protein TCM_020722 [Theobroma cacao] 81 5e-16 dbj|GAU51472.1| hypothetical protein TSUD_95870 [Trifolium subte... 84 6e-16 gb|PON41343.1| Zinc finger, CCHC-type [Parasponia andersonii] 82 7e-16 gb|PRQ56251.1| putative RNA-directed DNA polymerase [Rosa chinen... 82 1e-15 gb|KYP64657.1| Retrovirus-related Pol polyprotein from transposo... 81 1e-15 gb|PRQ51350.1| putative RNA-directed DNA polymerase [Rosa chinen... 79 2e-14 gb|PRQ41601.1| putative tripeptidyl-peptidase II [Rosa chinensis] 79 2e-14 gb|KYP74254.1| Retrovirus-related Pol polyprotein from transposo... 76 5e-14 gb|EOY12702.1| Uncharacterized protein TCM_031224 [Theobroma cacao] 78 7e-14 gb|PNX57240.1| copia LTR rider, partial [Trifolium pratense] 76 9e-14 gb|KYP36635.1| Retrovirus-related Pol polyprotein from transposo... 76 1e-13 ref|XP_020582332.1| uncharacterized protein LOC110025962 [Phalae... 77 2e-13 emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera] 76 3e-13 >gb|AJE29370.1| putative gag protein [Coffea canephora] Length = 433 Score = 114 bits (284), Expect = 4e-27 Identities = 65/126 (51%), Positives = 77/126 (61%), Gaps = 3/126 (2%) Frame = +1 Query: 49 IGCHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXX-LHSLRKYGYFFSR--WVDLDSG 219 I C CH+ GHIKR CP + ++ K SR W+ LDSG Sbjct: 230 IQCFGCHEFGHIKRYCPHQKKNDENDYDGVAGYASGGDILTISKGNNTSSRDGWI-LDSG 288 Query: 220 CVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVPKM 399 CV HVCSR DYFD+LQ K+AG + LGDGS C+V G GVV+IKM +G LGGVAYVPK+ Sbjct: 289 CVSHVCSRLDYFDTLQRKKAGFMCLGDGSTCQVKGVGVVKIKMLNGEIRSLGGVAYVPKL 348 Query: 400 RRNLIS 417 RRNLIS Sbjct: 349 RRNLIS 354 >gb|KYP36396.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 485 Score = 88.2 bits (217), Expect = 1e-17 Identities = 47/126 (37%), Positives = 64/126 (50%), Gaps = 5/126 (3%) Frame = +1 Query: 55 CHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLHSLRKYGYFFSR-----WVDLDSG 219 C CH GH K+NCP + + WV +DSG Sbjct: 179 CFYCHKVGHFKKNCPERNRDQKSSADSADIAAISDGYESADVLVVTTSQTQKDWV-MDSG 237 Query: 220 CVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVPKM 399 C H+C ++DYF++L+LK GT+ LGD C+V G G V +KM D ++IL V YVP + Sbjct: 238 CSYHMCPKKDYFETLKLKEGGTVLLGDDHPCQVQGIGTVRLKMFDNREYILKDVRYVPDL 297 Query: 400 RRNLIS 417 +RNLIS Sbjct: 298 KRNLIS 303 >gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 780 Score = 88.2 bits (217), Expect = 1e-17 Identities = 47/126 (37%), Positives = 64/126 (50%), Gaps = 5/126 (3%) Frame = +1 Query: 55 CHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLHSLRKYGYFFSR-----WVDLDSG 219 C CH GH K+NCP + + WV +DSG Sbjct: 233 CFYCHKVGHFKKNCPERNRDQKSSADSADIAAISDGYESADVLVVTTSQTQKDWV-MDSG 291 Query: 220 CVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVPKM 399 C H+C ++DYF++L+LK GT+ LGD C+V G G V +KM D ++IL V YVP + Sbjct: 292 CSYHMCPKKDYFETLKLKEGGTVLLGDDHPCQVQGIGTVRLKMFDNREYILKDVRYVPDL 351 Query: 400 RRNLIS 417 +RNLIS Sbjct: 352 KRNLIS 357 >dbj|GAU46320.1| hypothetical protein TSUD_401910 [Trifolium subterraneum] Length = 1006 Score = 88.2 bits (217), Expect = 2e-17 Identities = 42/123 (34%), Positives = 66/123 (53%), Gaps = 2/123 (1%) Frame = +1 Query: 55 CHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXX--LHSLRKYGYFFSRWVDLDSGCVM 228 C+ CH+ GH K++CP +L + + +DSGC Sbjct: 62 CYHCHEPGHFKKDCPQRRGGDSSSAQIAVSEEEGYESAGALTVTSWEPEKSWVMDSGCSC 121 Query: 229 HVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVPKMRRN 408 H+C R++YF++L+LK G + LG+ AC+V G G + +KM D D +L V Y+P+++RN Sbjct: 122 HICPRKEYFETLELKEGGVVRLGNNKACKVQGTGSIRLKMYDDRDFLLKNVRYIPELKRN 181 Query: 409 LIS 417 LIS Sbjct: 182 LIS 184 >gb|EOY18934.1| Uncharacterized protein TCM_043452 [Theobroma cacao] Length = 166 Score = 82.0 bits (201), Expect = 7e-17 Identities = 42/70 (60%), Positives = 50/70 (71%) Frame = +1 Query: 208 LDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAY 387 LDS H+C ++D FD LQ G L+LG+ S +VMG GVV+IKM DGV LGGVAY Sbjct: 9 LDSASATHICYQKDCFDLLQEVVVGNLTLGNKSIVKVMGIGVVKIKMFDGVVRSLGGVAY 68 Query: 388 VPKMRRNLIS 417 VPKMR+NLIS Sbjct: 69 VPKMRKNLIS 78 >gb|EOY01950.1| Uncharacterized protein TCM_011728 [Theobroma cacao] Length = 176 Score = 80.5 bits (197), Expect = 3e-16 Identities = 42/70 (60%), Positives = 50/70 (71%) Frame = +1 Query: 208 LDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAY 387 LDS H+C ++D FD LQ AG L+LG+ S +VMG VV+IKM DGV LGGVAY Sbjct: 9 LDSASATHICYQKDCFDLLQEGVAGNLTLGNKSIVKVMGIRVVKIKMFDGVVRSLGGVAY 68 Query: 388 VPKMRRNLIS 417 VPKMR+NLIS Sbjct: 69 VPKMRKNLIS 78 >gb|KYP74267.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 411 Score = 83.6 bits (205), Expect = 4e-16 Identities = 45/124 (36%), Positives = 64/124 (51%), Gaps = 1/124 (0%) Frame = +1 Query: 49 IGCHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLHSLRKYGYFFSRWVD-LDSGCV 225 I C++C GHI ++CP L G + D +DSGC Sbjct: 115 IQCYKCQKVGHIMKHCPEKGAKESRIQETTYVVEA--LEEYESAGVLVASSDDVMDSGCT 172 Query: 226 MHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVPKMRR 405 H+ +DYF++L+LK GT+ LG+ ACRV G G V +KM D + +L V YVPK++R Sbjct: 173 YHMFPVKDYFETLELKEYGTVLLGNNKACRVQGIGAVRLKMFDNQEMLLQNVRYVPKLKR 232 Query: 406 NLIS 417 L+S Sbjct: 233 KLMS 236 >gb|EOY05822.1| Uncharacterized protein TCM_020722 [Theobroma cacao] Length = 218 Score = 80.9 bits (198), Expect = 5e-16 Identities = 42/70 (60%), Positives = 50/70 (71%) Frame = +1 Query: 208 LDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAY 387 LDS H+C ++D FD LQ AG L+LG+ S +VMG VV+IKM DGV LGGVAY Sbjct: 51 LDSASATHICYQKDCFDLLQEGMAGNLTLGNKSIVKVMGLAVVKIKMFDGVVLSLGGVAY 110 Query: 388 VPKMRRNLIS 417 VPKMR+NLIS Sbjct: 111 VPKMRKNLIS 120 >dbj|GAU51472.1| hypothetical protein TSUD_95870 [Trifolium subterraneum] Length = 1682 Score = 83.6 bits (205), Expect = 6e-16 Identities = 43/128 (33%), Positives = 64/128 (50%), Gaps = 2/128 (1%) Frame = +1 Query: 40 GNSIGCHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLHSLRKYGYFFSR--WVDLD 213 G C+ CH+ GH K++CP + WV +D Sbjct: 198 GGKFKCYHCHEPGHFKKDCPQRKGGGSSSAQIATSDEGYESAGALTVTSWEPEKIWV-MD 256 Query: 214 SGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVP 393 SGC H+C R++YF +L+LK G + LG+ A +V G G + +KM D D +L V Y+P Sbjct: 257 SGCSDHMCLRKEYFKTLELKEGGVVRLGNNKAGKVQGTGTIRLKMYDDRDFLLKNVRYIP 316 Query: 394 KMRRNLIS 417 +++RNLIS Sbjct: 317 ELKRNLIS 324 >gb|PON41343.1| Zinc finger, CCHC-type [Parasponia andersonii] Length = 297 Score = 82.0 bits (201), Expect = 7e-16 Identities = 45/133 (33%), Positives = 66/133 (49%), Gaps = 12/133 (9%) Frame = +1 Query: 55 CHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLHSLRKYGYFFSR------------ 198 C CH EGH KR+CP S+ GY + Sbjct: 136 CFHCHKEGHFKRDCPDRKKKVHEKPKDPGEA------SVASDGYDSAEVLVVTDEDSSKE 189 Query: 199 WVDLDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGG 378 W+ +DSGC H+C + +F++L+ G++ LG+ C+V G G V I+M DG++ IL Sbjct: 190 WI-MDSGCSFHMCPTKSWFENLEKTDGGSVLLGNNKPCKVAGIGSVRIRMFDGMERILQQ 248 Query: 379 VAYVPKMRRNLIS 417 V YVP+++RNLIS Sbjct: 249 VRYVPELKRNLIS 261 >gb|PRQ56251.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 392 Score = 82.4 bits (202), Expect = 1e-15 Identities = 44/130 (33%), Positives = 65/130 (50%), Gaps = 8/130 (6%) Frame = +1 Query: 52 GCHRCHDEGHIKRNC--------PXXXXXXXXXXXXXXXXXXXXLHSLRKYGYFFSRWVD 207 GC +C H+KRNC L + F W+ Sbjct: 257 GCFKCGATDHLKRNCREGKMRAEAMAGSSNTANVVIKLDKDDGELLVVAASSNAFRNWI- 315 Query: 208 LDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAY 387 LD+GC H+C+ R++FD+ + +G + GD S+CR++G G V+I+M DGV L V Y Sbjct: 316 LDTGCTFHMCAIREWFDTFEDSSSGEVFRGDDSSCRILGIGSVKIRMHDGVVRTLENVRY 375 Query: 388 VPKMRRNLIS 417 +PK+R+NLIS Sbjct: 376 IPKLRKNLIS 385 >gb|KYP64657.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 259 Score = 80.9 bits (198), Expect = 1e-15 Identities = 35/70 (50%), Positives = 49/70 (70%) Frame = +1 Query: 208 LDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAY 387 +DSGC H+C ++DYF++L+ K GT+ LGD C+V G G V +KM D ++IL V Y Sbjct: 1 MDSGCSYHMCPKKDYFETLKFKEGGTVLLGDDHPCQVQGIGTVRLKMFDNREYILKDVRY 60 Query: 388 VPKMRRNLIS 417 VP ++RNLIS Sbjct: 61 VPDLKRNLIS 70 >gb|PRQ51350.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 460 Score = 79.3 bits (194), Expect = 2e-14 Identities = 49/134 (36%), Positives = 70/134 (52%), Gaps = 8/134 (5%) Frame = +1 Query: 40 GNSIGCHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLH------SLRKYGYFFSR- 198 G C++C + GHI+ +CP + ++ K Y S+ Sbjct: 227 GKGKQCYKCKEWGHIRPDCPLWKEKDDKGSDCSMTGIAQASNDFGEFLTVSKGNYTCSQR 286 Query: 199 -WVDLDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILG 375 W+ LD+G H+CSRR+YFD+ Q + G ++ GDG+ VMG G V+IKM DG LG Sbjct: 287 DWI-LDTGSSHHLCSRREYFDTFQEVK-GFVTWGDGTRRCVMGVGTVKIKMFDGAVRTLG 344 Query: 376 GVAYVPKMRRNLIS 417 V YVP+ RRNL+S Sbjct: 345 DVVYVPRFRRNLVS 358 >gb|PRQ41601.1| putative tripeptidyl-peptidase II [Rosa chinensis] Length = 1371 Score = 79.3 bits (194), Expect = 2e-14 Identities = 49/134 (36%), Positives = 70/134 (52%), Gaps = 8/134 (5%) Frame = +1 Query: 40 GNSIGCHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLH------SLRKYGYFFSR- 198 G C++C + GHI+ +CP + ++ K Y S+ Sbjct: 417 GKGKQCYKCKEWGHIRPDCPLWKEKDDKGSDCSMTGIAQASNDFGEFLTVSKGNYTCSQR 476 Query: 199 -WVDLDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILG 375 W+ LD+G H+CSRR+YFD+ Q + G ++ GDG+ VMG G V+IKM DG LG Sbjct: 477 DWI-LDTGSSHHLCSRREYFDTFQEVK-GFVTWGDGTRRCVMGVGTVKIKMFDGAVRTLG 534 Query: 376 GVAYVPKMRRNLIS 417 V YVP+ RRNL+S Sbjct: 535 DVVYVPRFRRNLVS 548 >gb|KYP74254.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 245 Score = 76.3 bits (186), Expect = 5e-14 Identities = 34/70 (48%), Positives = 48/70 (68%) Frame = +1 Query: 208 LDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAY 387 +DSGC H+ +DYF++L+LK GT+ LG+ ACRV G G V +KM D + +L V Y Sbjct: 1 MDSGCTYHMFPVKDYFETLELKEYGTVLLGNNKACRVQGIGAVRLKMFDNQEMLLQNVRY 60 Query: 388 VPKMRRNLIS 417 VPK++R L+S Sbjct: 61 VPKLKRKLMS 70 >gb|EOY12702.1| Uncharacterized protein TCM_031224 [Theobroma cacao] Length = 3109 Score = 77.8 bits (190), Expect = 7e-14 Identities = 41/63 (65%), Positives = 47/63 (74%) Frame = +1 Query: 229 HVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVPKMRRN 408 HVC ++DYFD LQ A L+LG+ S +VM GVVEIKM DGV H LGGVAYV KMR+N Sbjct: 2132 HVCYQKDYFDLLQEGVARNLTLGNKSIMKVMVLGVVEIKMFDGVMHSLGGVAYVSKMRKN 2191 Query: 409 LIS 417 LIS Sbjct: 2192 LIS 2194 >gb|PNX57240.1| copia LTR rider, partial [Trifolium pratense] Length = 291 Score = 76.3 bits (186), Expect = 9e-14 Identities = 31/70 (44%), Positives = 47/70 (67%) Frame = +1 Query: 208 LDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAY 387 +DSGC H+C R++YF+ L LK G + L + AC++ G G + +KM D D +L V Y Sbjct: 1 MDSGCSYHMCPRKEYFEILDLKEGGVVRLSNNKACKIQGTGTIRLKMFDDRDFLLKNVXY 60 Query: 388 VPKMRRNLIS 417 +P+++RNLIS Sbjct: 61 IPELKRNLIS 70 >gb|KYP36635.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 364 Score = 76.3 bits (186), Expect = 1e-13 Identities = 38/123 (30%), Positives = 64/123 (52%), Gaps = 2/123 (1%) Frame = +1 Query: 55 CHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXL--HSLRKYGYFFSRWVDLDSGCVM 228 C+ C + GH K++CP L + + +W+ LDSGC Sbjct: 239 CNYCKEPGHWKKDCPKKKGKPSAAVAKEESTSENELVLSIADQPQHSEDQWI-LDSGCSF 297 Query: 229 HVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVPKMRRN 408 H+C R +FD+ + K G + +G+ + C+ +G G ++IKM DG+ L V +VP++++N Sbjct: 298 HMCPNRTWFDTYEKKSGGNVFMGNDAPCKTIGIGTIKIKMHDGITRTLTEVRHVPELKKN 357 Query: 409 LIS 417 LIS Sbjct: 358 LIS 360 >ref|XP_020582332.1| uncharacterized protein LOC110025962 [Phalaenopsis equestris] Length = 975 Score = 76.6 bits (187), Expect = 2e-13 Identities = 36/72 (50%), Positives = 51/72 (70%) Frame = +1 Query: 199 WVDLDSGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGG 378 W+ LDSGC H+ + +F+SLQL+ G++ LGD ACRV+ G ++IKM DG + IL Sbjct: 219 WI-LDSGCSFHMRPHKYWFESLQLENGGSVLLGDNKACRVVDSGTIKIKMFDGAERILQH 277 Query: 379 VAYVPKMRRNLI 414 V YVP+++RNLI Sbjct: 278 VRYVPELKRNLI 289 >emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera] Length = 777 Score = 75.9 bits (185), Expect = 3e-13 Identities = 44/128 (34%), Positives = 59/128 (46%), Gaps = 4/128 (3%) Frame = +1 Query: 46 SIGCHRCHDEGHIKRNCPXXXXXXXXXXXXXXXXXXXXLHSLR----KYGYFFSRWVDLD 213 ++ C+ C+ G I+R CP S + WV LD Sbjct: 123 NVKCYHCNKIGQIRRICPDRQQEEKTQAQGSAAIIDDGYDSTEVLTIRLNPNHEEWV-LD 181 Query: 214 SGCVMHVCSRRDYFDSLQLKRAGTLSLGDGSACRVMGFGVVEIKMGDGVDHILGGVAYVP 393 SGC H+C RRD+F S Q G L LG+ +C V+G G + I M DG L V +VP Sbjct: 182 SGCTYHMCPRRDWFSSYQEVNGGKLLLGNNMSCNVVGIGTMAINMHDGKTRTLKEVRHVP 241 Query: 394 KMRRNLIS 417 ++RNLIS Sbjct: 242 DLKRNLIS 249