BLASTX nr result
ID: Glycyrrhiza35_contig00020505
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00020505
         (534 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits) Value

GAU18772.1 hypothetical protein TSUD_80610 [Trifolium subterraneum]  128   1e-31
GAU36816.1 hypothetical protein TSUD_219190 [Trifolium subterran...   95   2e-19
GAU40905.1 hypothetical protein TSUD_297100 [Trifolium subterran...   91   3e-19
KYP51521.1 Putative ribonuclease H protein At1g65750 family, par...   91   3e-18
KYP44518.1 Putative ribonuclease H protein At1g65750 family, par...   91   3e-18
KYP79070.1 Putative ribonuclease H protein At1g65750 family [Caj...   90   4e-18
KYP72596.1 Putative ribonuclease H protein At1g65750 family [Caj...   90   1e-17
KYP65942.1 Putative ribonuclease H protein At1g65750 family [Caj...   89   2e-17
XP_015935830.1 PREDICTED: uncharacterized protein LOC107461787 [...   88   4e-17
GAU31578.1 hypothetical protein TSUD_54010 [Trifolium subterraneum]   87   8e-17
XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [...   86   3e-16
KHN27242.1 Putative ribonuclease H protein [Glycine soja]             79   2e-15
XP_014619745.1 PREDICTED: uncharacterized protein LOC102664765 i...   81   2e-15
XP_014619740.1 PREDICTED: uncharacterized protein LOC102664765 i...   81   3e-15
KHN24231.1 Putative ribonuclease H protein [Glycine soja]             81   5e-15
XP_016172644.1 PREDICTED: uncharacterized protein LOC107615037 [...   82   6e-15
KHN13238.1 Putative ribonuclease H protein, partial [Glycine soja]    76   8e-15
ONI01039.1 hypothetical protein PRUPE_6G118100 [Prunus persica]       81   1e-14
KYP45885.1 Putative ribonuclease H protein At1g65750 family [Caj...
81 1e-14 KYP71397.1 Putative ribonuclease H protein At1g65750 family, par... 81 1e-14 >GAU18772.1 hypothetical protein TSUD_80610 [Trifolium subterraneum] Length = 482 Score = 128 bits (321), Expect = 1e-31 Identities = 56/120 (46%), Positives = 83/120 (69%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 DFPCW LS +G F+LKTAY+ + + + +PIF++VW W+GP R +A LWK Q RL Sbjct: 143 DFPCWKLSIDGYFSLKTAYEFMENQHQEDLYINPIFEKVWHWKGPNRIKAFLWKLSQGRL 202 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMDLDS 174 LTN ER R M + DLC RC +++MH + DCE+ +EF +++INP+ + +FFS+ L++ Sbjct: 203 LTNEERRHRNMTNSDLCPRCQDYPESIMHCLRDCEDAREFWTNIINPEVWSKFFSIGLNN 262 >GAU36816.1 hypothetical protein TSUD_219190 [Trifolium subterraneum] Length = 521 Score = 94.7 bits (234), Expect = 2e-19 Identities = 42/80 (52%), Positives = 54/80 (67%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 DFPCW LS +G F+LKTAY+ + + + +PIF++VW W+GP R +A LWK Q RL Sbjct: 312 DFPCWKLSVDGYFSLKTAYEFMENQHQEDLYINPIFEKVWHWKGPNRIKAFLWKLSQGRL 371 Query: 353 LTNTERVRRGMASEDLCSRC 294 LTN ER R M + DLC RC Sbjct: 372 LTNEERRHRNMTNSDLCPRC 391 >GAU40905.1 hypothetical protein TSUD_297100 [Trifolium subterraneum] Length = 264 Score = 91.3 bits (225), Expect = 3e-19 Identities = 44/123 (35%), Positives = 70/123 (56%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 DFP W S +G+F+ +AY ++M + + +F VWKW+GP R + LWK RL Sbjct: 75 DFPNWSASPDGKFSRNSAYSLLMDKKPIEKTDSHLFKLVWKWKGPNRIPSFLWKVAHCRL 134 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMDLDS 174 +TN ER + M + + C RC +++MHV+ D + S +I P+ + +FFS+ L S Sbjct: 135 MTNEERRKINMTAHNSCLRCQQGPESIMHVLRD--YAMDIWSPIIKPNNWAKFFSLGLTS 192 Query: 173 VVK 165 +K Sbjct: 193 WLK 195 >KYP51521.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 511 Score = 91.3 bits (225), Expect = 3e-18 Identities = 38/95 (40%), Positives = 61/95 (64%) Frame = -2 
Query: 521 WDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNT 342 W+ + +G FT+K+A+++ + A + P+F ++WKW GP+R +A LW+ E LLTN+ Sbjct: 260 WNPTTDGLFTIKSAHEIAAQQMLPA--KSPLFKQIWKWHGPERVRAFLWRVAHESLLTNS 317 Query: 341 ERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQE 237 RV+RGM S+ C C + +T +HV+ DC QE Sbjct: 318 CRVKRGMTSDSTCGECRQAMETTLHVLRDCPYAQE 352 >KYP44518.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 584 Score = 91.3 bits (225), Expect = 3e-18 Identities = 38/95 (40%), Positives = 61/95 (64%) Frame = -2 Query: 521 WDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNT 342 W+ + +G FT+K+A+++ + A + P+F ++WKW GP+R +A LW+ E LLTN+ Sbjct: 229 WNPTTDGLFTIKSAHEIAAQQMLPA--KSPLFKQIWKWHGPERVRAFLWRVAHESLLTNS 286 Query: 341 ERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQE 237 RV+RGM S+ C C + +T +HV+ DC QE Sbjct: 287 CRVKRGMTSDSTCGECRQAMETTLHVLRDCPYAQE 321 >KYP79070.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 339 Score = 89.7 bits (221), Expect = 4e-18 Identities = 38/95 (40%), Positives = 59/95 (62%) Frame = -2 Query: 521 WDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNT 342 W+ + +G FT+K+AY+ + A + P+F ++WKW GP+R + LW+ E LLTN Sbjct: 36 WNPTTDGLFTIKSAYETAAQQMLPA--KSPLFKQIWKWHGPERVRVFLWRVAHESLLTNF 93 Query: 341 ERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQE 237 RV+RGM+S+ C C + +T +HV+ DC QE Sbjct: 94 CRVKRGMSSDSTCGECRQAMETTLHVLRDCPYAQE 128 >KYP72596.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 646 Score = 89.7 bits (221), Expect = 1e-17 Identities = 41/106 (38%), Positives = 62/106 (58%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 D W S++G+F++K+AY V + +F +W+W+GP+R + LWK L Sbjct: 444 DSLAWLGSSDGEFSVKSAY--VCLDHNQNECNQVVFSTIWRWKGPERIKLMLWKTAHNSL 501 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLIN 216 LTNT R RRG+A ++LC RCH +T +H + DC +V+ S L N Sbjct: 502 LTNTARARRGLALDNLCPRCHQEPETGLHALRDCVDVKNVWSHLAN 547 >KYP65942.1 Putative ribonuclease H protein 
At1g65750 family [Cajanus cajan] Length = 457 Score = 89.0 bits (219), Expect = 2e-17 Identities = 41/106 (38%), Positives = 59/106 (55%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 D W +S NG+F++K+AY V + +F +W+W+GP+R + WK L Sbjct: 93 DSLAWLVSANGEFSVKSAY--VCLDHNQIECNQAVFSTIWRWKGPERIKLRFWKTAHNSL 150 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLIN 216 LTN R RRG+A E+LC RCH +T +H + DC V+ S L N Sbjct: 151 LTNIARERRGLALENLCPRCHQEPETGLHALRDCVVVKNVWSHLAN 196 >XP_015935830.1 PREDICTED: uncharacterized protein LOC107461787 [Arachis duranensis] Length = 1370 Score = 88.2 bits (217), Expect = 4e-17 Identities = 43/119 (36%), Positives = 65/119 (54%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 D W+ S++G F+ KTAYQV+M E + F VW+W+GP+R + LW + Sbjct: 919 DHIAWEPSSDGIFSTKTAYQVIMEEQHTQNQN---FRLVWRWQGPERIRTFLWLATHNVI 975 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMDLD 177 LTN+ER RR + ++D C RC ++ +HV+ DC + L P FF+ DL+ Sbjct: 976 LTNSERKRRHLTNDDSCPRCRCHEESTIHVLRDCFYAKSIWRKLFPPIGINSFFNTDLN 1034 >GAU31578.1 hypothetical protein TSUD_54010 [Trifolium subterraneum] Length = 402 Score = 86.7 bits (213), Expect = 8e-17 Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 5/129 (3%) Frame = -2 Query: 521 WDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNT 342 W+ SN QFT+++AY + + GD + + +W WEGP R Q +W VQER+LTN Sbjct: 166 WEGSNTHQFTVQSAY---LLQFGDILTQGGDWKSLWDWEGPHRIQTFIWMAVQERILTNL 222 Query: 341 ERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMD-----LD 177 R + G+ LC+RC +TM+HV+ DC + L+ ++ FF+ D + Sbjct: 223 RRSKWGVGISPLCTRCGRDDETMIHVLRDCIYSIQVWLHLVPSNFITDFFTFDCRNWNFN 282 Query: 176 SVVKLGCPA 150 ++ KLG A Sbjct: 283 NINKLGIGA 291 >XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [Arachis ipaensis] Length = 1901 Score = 85.9 bits (211), Expect = 3e-16 Identities = 41/119 (34%), Positives = 64/119 (53%) Frame = -2 Query: 533 
DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 D W LS++G F+ K+AYQ+ M K F VW W+GP+R + LW + Sbjct: 1543 DHLAWGLSSDGSFSTKSAYQLNMENQHAPNKN---FRLVWNWQGPERIRTFLWLVTHNAI 1599 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMDLD 177 LTN+E+ RR + ++D C RC ++ +HV+ DC + LI P+ FF+ +L+ Sbjct: 1600 LTNSEKRRRHLTNDDTCPRCRSHEESTIHVLRDCPYAMSIWNRLIPPNGRSSFFNTELN 1658 >KHN27242.1 Putative ribonuclease H protein [Glycine soja] Length = 140 Score = 78.6 bits (192), Expect = 2e-15 Identities = 37/99 (37%), Positives = 57/99 (57%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 D W + +G FTLK+AY VV D T + +F V +W GP+R + LWK E + Sbjct: 11 DTIAWKGTADGSFTLKSAYGVVC--DLQVTSDINLFKLVHRWPGPERIRMFLWKISHESI 68 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQE 237 LTN +R+R GM+ D CS C ++ ++H+ DC + ++ Sbjct: 69 LTNAKRMRIGMSVNDACSACQAEAEALLHLFRDCNDCKQ 107 >XP_014619745.1 PREDICTED: uncharacterized protein LOC102664765 isoform X2 [Glycine max] Length = 263 Score = 81.3 bits (199), Expect = 2e-15 Identities = 46/170 (27%), Positives = 84/170 (49%), Gaps = 16/170 (9%) Frame = -2 Query: 506 NGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNTERVRR 327 +G+FTL++ Y + ++ G A +P+F VW+W GP+R + LWK Q+ L+TN R ++ Sbjct: 57 DGRFTLESTYASISSQSGQA---NPLFKVVWRWSGPERTRILLWKTAQQALVTNDFRAKK 113 Query: 326 GMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFS--------MDLDSV 171 GM+ + C C ++S+ +H DC + S L+N + ++ S ++ DS+ Sbjct: 114 GMSLSNACHVCDVASENAVHCFRDCPHAIQMSSRLLNVNIGLKLASKKGFSDLIVESDSL 173 Query: 170 VKL-----GCPARPDPLGQIAKLGQVS---PFFIGHIFPGLKNTRSGLAR 45 V + C + +GQ++ H+F G+ LA+ Sbjct: 174 VAIKLLTGSCSTNHSSNQLVCNIGQLADQRSLVWHHVFWGVNQVADSLAK 223 >XP_014619740.1 PREDICTED: uncharacterized protein LOC102664765 isoform X1 [Glycine max] XP_014619741.1 PREDICTED: uncharacterized protein LOC102664765 isoform X1 [Glycine max] XP_014619742.1 PREDICTED: uncharacterized protein LOC102664765 isoform X1 [Glycine max] XP_014619743.1 PREDICTED: uncharacterized protein 
LOC102664765 isoform X1 [Glycine max] XP_014619744.1 PREDICTED: uncharacterized protein LOC102664765 isoform X1 [Glycine max] Length = 305 Score = 81.3 bits (199), Expect = 3e-15 Identities = 46/170 (27%), Positives = 84/170 (49%), Gaps = 16/170 (9%) Frame = -2 Query: 506 NGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNTERVRR 327 +G+FTL++ Y + ++ G A +P+F VW+W GP+R + LWK Q+ L+TN R ++ Sbjct: 57 DGRFTLESTYASISSQSGQA---NPLFKVVWRWSGPERTRILLWKTAQQALVTNDFRAKK 113 Query: 326 GMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFS--------MDLDSV 171 GM+ + C C ++S+ +H DC + S L+N + ++ S ++ DS+ Sbjct: 114 GMSLSNACHVCDVASENAVHCFRDCPHAIQMSSRLLNVNIGLKLASKKGFSDLIVESDSL 173 Query: 170 VKL-----GCPARPDPLGQIAKLGQVS---PFFIGHIFPGLKNTRSGLAR 45 V + C + +GQ++ H+F G+ LA+ Sbjct: 174 VAIKLLTGSCSTNHSSNQLVCNIGQLADQRSLVWHHVFWGVNQVADSLAK 223 >KHN24231.1 Putative ribonuclease H protein [Glycine soja] Length = 317 Score = 80.9 bits (198), Expect = 5e-15 Identities = 36/95 (37%), Positives = 57/95 (60%) Frame = -2 Query: 506 NGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNTERVRR 327 NG F++K+AY+ V D +D +F+ +W W+GP+R + LWK E LLTN RV R Sbjct: 110 NGLFSIKSAYRKVAGFSND---KDLLFNLIWSWKGPERMRILLWKIANEGLLTNKSRVTR 166 Query: 326 GMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSL 222 MA C RCH+ ++++H + DC ++ ++L Sbjct: 167 AMAESSECPRCHLQPESILHCLRDCFYAKQVWNTL 201 >XP_016172644.1 PREDICTED: uncharacterized protein LOC107615037 [Arachis ipaensis] Length = 914 Score = 82.0 bits (201), Expect = 6e-15 Identities = 37/91 (40%), Positives = 57/91 (62%), Gaps = 1/91 (1%) Frame = -2 Query: 521 WDLSNNGQFTLKTAYQVVMAEDGDATKE-DPIFDEVWKWEGPKRYQAHLWKFVQERLLTN 345 W L+++G F L++AYQ + D T + + IF VWKW+GP+R + LW + +LTN Sbjct: 707 WTLTSDGSFKLQSAYQDIQ----DTTSQPNNIFSLVWKWKGPERIRMFLWLVAHDAILTN 762 Query: 344 TERVRRGMASEDLCSRCHMSSKTMMHVVHDC 252 R RR M S++ C RC + ++ +HV+HDC Sbjct: 763 AARKRRHMTSDNRCPRCSSNEESTLHVLHDC 793 >KHN13238.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 118 Score = 76.3 bits (186), Expect = 8e-15 Identities = 
35/114 (30%), Positives = 60/114 (52%) Frame = -2 Query: 521 WDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNT 342 W L+ NG F+ KT +Q++ D ++ VWKW+GP+R + LW + L TN Sbjct: 1 WTLTANGVFSTKTTFQLISRNKHFIPNFD--WNNVWKWKGPERVRFFLWTVLHHGLKTNF 58 Query: 341 ERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMDL 180 R R G + CS CH+ S+ +H++ DC + +++ + FF+M++ Sbjct: 59 RRRRCGFTKDSSCSFCHLQSEDELHILRDCPFASQVWYAIVGHNLESSFFNMNM 112 >ONI01039.1 hypothetical protein PRUPE_6G118100 [Prunus persica] Length = 408 Score = 80.9 bits (198), Expect = 1e-14 Identities = 39/113 (34%), Positives = 65/113 (57%) Frame = -2 Query: 521 WDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERLLTNT 342 W L++NG+F++KTAY + E+ + T +D +WK + P + + LW +Q +LLTN Sbjct: 49 WQLTSNGEFSVKTAYLSLFTEETNYTWN---WDMIWKLQVPPKIKTFLWLLIQGKLLTNV 105 Query: 341 ERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMD 183 +RVRR +AS C C+ S +++ H+ C + +S+ P+ FSMD Sbjct: 106 QRVRRNLASNSNCPCCNGSMESLDHLFRRCRHATKMWNSIGIPNQVAHSFSMD 158 >KYP45885.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1192 Score = 81.3 bits (199), Expect = 1e-14 Identities = 35/94 (37%), Positives = 55/94 (58%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 D CW+L+N+G FT+K+AY+++ + A PIF +W W+GP+R + LWK L Sbjct: 1093 DRACWNLTNDGLFTIKSAYEIIGSMQ--APTSHPIFKVIWHWKGPERIRTLLWKIAHNSL 1150 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDC 252 LTN R++ G+ + C C + +HV+ DC Sbjct: 1151 LTNEVRMKLGLNNSFSCDICTTGIENTLHVLRDC 1184 >KYP71397.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 510 Score = 80.9 bits (198), Expect = 1e-14 Identities = 36/117 (30%), Positives = 59/117 (50%) Frame = -2 Query: 533 DFPCWDLSNNGQFTLKTAYQVVMAEDGDATKEDPIFDEVWKWEGPKRYQAHLWKFVQERL 354 D W + +G+F+LK Y+ + D + +F +W W GP+R + LW+ L Sbjct: 240 DVLAWKNAADGEFSLKKGYEFLCL--ADVSSRQKVFKLLWNWRGPERIRTFLWRLAHNSL 297 Query: 353 LTNTERVRRGMASEDLCSRCHMSSKTMMHVVHDCEEVQEFLSSLINPDYFIRFFSMD 183 LTN R+ RGM + LC CH 
+T++H + +C + ++ N FF+MD Sbjct: 298 LTNDLRMHRGMTMDPLCPVCHDELETLIHAMRECNVARSVWINIFNGRLHTIFFTMD 354