BLASTX nr result
ID: Glycyrrhiza35_contig00036402
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza35_contig00036402
         (375 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

KHN33900.1 Putative ribonuclease H protein, partial [Glycine soja]     141   4e-38
KHN40328.1 Putative ribonuclease H protein [Glycine soja]              121   2e-32
KYP45737.1 hypothetical protein KK1_032736 [Cajanus cajan]              96   5e-23
GAU25827.1 hypothetical protein TSUD_30910 [Trifolium subterraneum]     94   4e-20
XP_016165290.1 PREDICTED: uncharacterized protein LOC107607909 [...     91   2e-19
XP_016206298.1 PREDICTED: uncharacterized protein LOC107646634 [...     91   2e-19
GAU23316.1 hypothetical protein TSUD_237700 [Trifolium subterran...     92   3e-19
GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterran...     92   3e-19
XP_015971192.1 PREDICTED: uncharacterized protein LOC107494666 [...     91   1e-18
XP_016172534.1 PREDICTED: uncharacterized protein LOC107614924 [...     88   2e-18
KHN02883.1 Putative ribonuclease H protein [Glycine soja]               88   3e-18
XP_015970506.1 PREDICTED: uncharacterized protein LOC107493982 [...     89   3e-18
XP_016206120.1 PREDICTED: uncharacterized protein LOC107646451 [...     87   3e-18
XP_016165109.1 PREDICTED: uncharacterized protein LOC107607701 [...     88   4e-18
XP_016164363.1 PREDICTED: uncharacterized protein LOC107606867 [...     89   5e-18
XP_016178564.1 PREDICTED: uncharacterized protein LOC107621028 [...     89   5e-18
XP_016168263.1 PREDICTED: uncharacterized protein LOC107610776 [...     89   5e-18
XP_019181390.1 PREDICTED: uncharacterized protein LOC109176414 [...     88   9e-18
XP_016178767.1 PREDICTED: uncharacterized protein LOC107621248 [...     87   1e-17
AID60103.1 hypothetical protein [Brassica napus]                        87   2e-17

>KHN33900.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 385

 Score = 141 bits (355), Expect = 4e-38
 Identities = 60/124 (48%), Positives = 83/124 (66%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           +A +WL+ L I+Q NWSW+WKL +PENIKHF WL HGSLPTN FR RH+S+ SS
Sbjct: 178 TAFKWLIAVNSLPISQFGNWSWIWKLQIPENIKHFFWLTFHGSLPTNEFRVFRHLSIVSS 237

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVS 361
           CNRC QE I H +R+ + +RQ+W+ L + +FY +W+ +N+ + I+FAV+
Sbjct: 238 CNRCMHGQEIILHLLRDCHYSRQVWQFLQLDHDSNFYVANYMDWIVNNINSVRGILFAVT 297

Query: 362 CWII 373
           CW I
Sbjct: 298 CWTI 301

>KHN40328.1 Putative ribonuclease H protein [Glycine soja]
          Length = 175

 Score = 121 bits (303), Expect = 2e-32
 Identities = 54/109 (49%), Positives = 67/109 (61%)
 Frame = +2

Query: 2  SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
          SA+ W +HEP H N +W+WVWKL PENIKHF WLIL SLPTN FR RHIS D
Sbjct: 66 SAYEWWIHEPANHSNFPGDWNWVWKLQTPENIKHFTWLILQNSLPTNFFRTKRHISFDPF 125

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNL 328
           C RCG+ ESI H +R+ +AR+IW ++M F+ W + L
Sbjct: 126 CCRCGLEDESIIHLLRDCSEAREIWSHINMYADTDFFAMNGDTWFVTQL 174

>KYP45737.1 hypothetical protein KK1_032736 [Cajanus cajan]
          Length = 121

 Score = 95.5 bits (236), Expect = 5e-23
 Identities = 38/99 (38%), Positives = 59/99 (59%)
 Frame = +2

Query: 32 LLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSSCNRCGVAQES 211
          ++H+ NW+W+WKL L E+IKHFIWL G LP N +H+S D S +CG +E
Sbjct: 21 IIHLTLTRNWTWIWKLKLIEHIKHFIWLAFRGKLPINQIHVRQHLSFDLSYCKCGAGEED 80

Query: 212 ISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNL 328
           ++H + + ++++W L QQQ F +P W+Y+ L
Sbjct: 81  LTHVLCDCPSSKKVWDHLCFTQQQSFNQRPASSWIYTFL 119

>GAU25827.1 hypothetical protein TSUD_30910 [Trifolium subterraneum]
          Length = 592

 Score = 94.4 bits (233), Expect = 4e-20
 Identities = 42/124 (33%), Positives = 64/124 (51%), Gaps = 2/124 (1%)
 Frame = +2

Query: 2   SAHRWLV--HEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLD 175
           S + WL+ + +++ N +WSW+WK+ LPE IK F WL H +PT +R +S
Sbjct: 277 SGYNWLLSLRDLVINHNPSHSWSWIWKIQLPEKIKFFFWLACHNFVPTLSLLNHRKMSHS 336

Query: 176 SSCNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFA 355
           ++C RCG+ ES H IR+ AR +W + F + +WL TG Q+ F+
Sbjct: 337 ATCTRCGLQDESFLHCIRDCEFARSLWNHIGFNNMDFFSNMDVYDWLKLGATGSQTTTFS 396

Query: 356 VSCW 367
           W
Sbjct: 397 TGVW 400

>XP_016165290.1 PREDICTED: uncharacterized protein LOC107607909 [Arachis ipaensis]
          Length = 294

 Score = 90.5 bits (223), Expect = 2e-19
 Identities = 43/124 (34%), Positives = 64/124 (51%)
 Frame = +2

Query: 2  SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
          S + WL N+ +NW WVW+L +PE K IWL LH ++PT FR R ++L S+
Sbjct: 18 SGYSWLAKRKF-DWNEHDNWLWVWRLHIPEKYKFLIWLSLHNAIPTAEFRLGRDLALSST 76

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVS 361
           C+RC + ESI H +R A+++W +L + + L WLY +F +
Sbjct: 77  CHRCQNSSESILHCLRECPSAKEVWNLLGL----YSDNSNLHNWLYRGARSENVFLFFST 132

Query: 362 CWII 373
           W I
Sbjct: 133 IWWI 136

>XP_016206298.1 PREDICTED: uncharacterized protein LOC107646634 [Arachis ipaensis]
          Length = 329

 Score = 90.9 bits (224), Expect = 2e-19
 Identities = 40/123 (32%), Positives = 67/123 (54%)
 Frame = +2

Query: 2 SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
         S + WL+ E + + E+W W+W + +PE +K IWL LH ++PT FR++R++++
Sbjct: 7 SGYEWLL-EQKVQWDANESWLWLWHMNIPEKVKGLIWLCLHNAIPTASFRSSRNLTITDI 65

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVS 361
           C RC A E+ H +R K + IW+ L +G + + WL+ N ++I A
Sbjct: 66  CPRCNEAPETTEHCLRFCIKVQSIWKRLEVGMSRRDAFLEFRAWLWVNFRATEAIFVAGL 125

Query: 362 CWI 370
           WI
Sbjct: 126 WWI 128

>GAU23316.1 hypothetical protein TSUD_237700 [Trifolium subterraneum]
          Length = 418

 Score = 91.7 bits (226), Expect = 3e-19
 Identities = 41/124 (33%), Positives = 65/124 (52%), Gaps = 2/124 (1%)
 Frame = +2

Query: 2   SAHRWL--VHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLD 175
           S WL + P+ N +WSW+WKL LPE IK F WL+ H S+PT +R ++L
Sbjct: 106 SGFNWLFSLQNPVTPHNPSFSWSWIWKLQLPEKIKFFFWLVCHNSVPTLSLLDHRKMNLS 165

Query: 176 SSCNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFA 355
           ++C RCG+ +E+ H +R+ + IW + F + +WL TG ++ +F+
Sbjct: 166 ATCARCGLREETFLHCVRDCDFSISIWHHIGFDNPDFFSSMDAHDWLKWGSTGSKAFIFS 225

Query: 356 VSCW 367
           W
Sbjct: 226 AGVW 229

>GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterraneum]
          Length = 968

 Score = 92.0 bits (227), Expect = 3e-19
 Identities = 39/108 (36%), Positives = 60/108 (55%)
 Frame = +2

Query: 44  NQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSSCNRCGVAQESISHA 223
           N ++W+W+WKL LPE IK F+WL H S+PT +R ++ ++C RCG+ ES H
Sbjct: 645 NPSQSWTWIWKLHLPEKIKFFLWLACHNSVPTLSLLNHRKMNPSTTCVRCGLQDESFLHC 704

Query: 224 IRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVSCW 367
           IR+ +R +W + F + +WL TG QS++F+ W
Sbjct: 705 IRDCDFSRSLWHHIGFTNPNFFSNMDVYDWLKMGATGTQSLIFSAGVW 752

>XP_015971192.1 PREDICTED: uncharacterized protein LOC107494666 [Arachis duranensis]
          Length = 901

 Score = 90.5 bits (223), Expect = 1e-18
 Identities = 43/124 (34%), Positives = 65/124 (52%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           S + WL N+ +NW WVW+L +PE K IWL LH ++PT FR R ++L S+
Sbjct: 571 SGYSWLAKRKF-DWNEHDNWLWVWRLHIPEKYKFLIWLSLHNAIPTAEFRLGRGLALSST 629

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVS 361
           C+RC ESI H +R A+++W +L + + L +WLY + +F +
Sbjct: 630 CHRCQNGSESILHCLRECPSAKEVWTLLGL----YSDNSNLHDWLYRGARSGDAFLFFST 685

Query: 362 CWII 373
           W I
Sbjct: 686 IWWI 689

>XP_016172534.1 PREDICTED: uncharacterized protein LOC107614924 [Arachis ipaensis]
          Length = 328

 Score = 88.2 bits (217), Expect = 2e-18
 Identities = 40/120 (33%), Positives = 63/120 (52%)
 Frame = +2

Query: 8  HRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSSCN 187
          +RWL+ + L + N NW+W+W +PE K +WL LH +LPT FR RH++ C
Sbjct: 84 YRWLLKKTL-NWNANSNWNWLWNTNIPEKFKFTMWLGLHDTLPTETFRFKRHLASSDMCK 142

Query: 188 RCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVSCW 367
           RC AQE++ H +R+ +++ IW +L L+EW L +++ F W
Sbjct: 143 RCNKAQETMEHCLRDCERSKAIWHMLDPSILDSTAGTALEEWFQKALANNEA-SFGTGLW 201

>KHN02883.1 Putative ribonuclease H protein [Glycine soja]
          Length = 313

 Score = 87.8 bits (216), Expect = 3e-18
 Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 1/125 (0%)
 Frame = +2

Query: 2  SAHRWLVHEPLLHIN-QEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDS 178
          +A+ WL + + + Q EN SW+W + +P+NIK F+WL H SLPT F RH+S +
Sbjct: 83 TAYWWLQSQANVSLGAQAENNSWMWLMKIPQNIKFFLWLTSHKSLPTKFFLVYRHLSSNP 142

Query: 179 SCNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAV 358
           C RC ES+ H +R+ KA +W + F WL+ + T +F +
Sbjct: 143 FCCRCSNQVESVLHLLRDCDKACSVWSMFQPTLVVDFAEHDSSVWLHKHATCATGALFCL 202

Query: 359 SCWII 373
           CW I
Sbjct: 203 ICWFI 207

>XP_015970506.1 PREDICTED: uncharacterized protein LOC107493982 [Arachis duranensis]
          Length = 641

 Score = 89.4 bits (220), Expect = 3e-18
 Identities = 42/106 (39%), Positives = 59/106 (55%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           S H WL N+ +NW WVW+L +PE K IWL LH ++PT FR R ++L S+
Sbjct: 414 SGHSWLAKRTF-DWNEHDNWLWVWRLHIPEKYKFLIWLSLHNAIPTTEFRLGRGLALSST 472

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLY 319
           C+RC ESI H +R A++IW +L + + L +WLY
Sbjct: 473 CHRCQNGSESILHCLRECPSAKEIWTLLGL----YSDNSNLHDWLY 514

>XP_016206120.1 PREDICTED: uncharacterized protein LOC107646451 [Arachis ipaensis]
          Length = 300

 Score = 87.4 bits (215), Expect = 3e-18
 Identities = 40/110 (36%), Positives = 58/110 (52%)
 Frame = +2

Query: 44 NQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSSCNRCGVAQESISHA 223
          N+ +NW WVW+L +PE K IWL LH ++P FR R + L S+C+RC ESI H
Sbjct: 59 NEHDNWLWVWRLHIPEKYKFLIWLSLHNAIPATEFRLGRGLVLSSTCHRCQNGYESILHC 118

Query: 224 IRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVSCWII 373
           +R A+++W +L M + L +WLY +F + W I
Sbjct: 119 LRECPSAKEVWNLLGM----YSDNSNLHDWLYRGARSGDIFLFFSTIWWI 164

>XP_016165109.1 PREDICTED: uncharacterized protein LOC107607701 [Arachis ipaensis]
          Length = 356

 Score = 87.8 bits (216), Expect = 4e-18
 Identities = 40/124 (32%), Positives = 65/124 (52%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           S + WL N+ +NW WVW+L +P+ K IWL LH ++PT FR +R ++L S+
Sbjct: 126 SGYSWLTKRKF-DWNERDNWLWVWRLHIPKKYKFLIWLSLHNAIPTAKFRLSRGLTLSST 184

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVS 361
           C+RC ESI H + A+++W +L + + L++WLY +F +
Sbjct: 185 CHRCQNGFESILHCLHECPSAKEVWNLLGL----YSDNSDLRDWLYRGTRSGDVFLFFST 240

Query: 362 CWII 373
           W +
Sbjct: 241 IWFL 244

>XP_016164363.1 PREDICTED: uncharacterized protein LOC107606867 [Arachis ipaensis]
          Length = 524

 Score = 88.6 bits (218), Expect = 5e-18
 Identities = 43/124 (34%), Positives = 65/124 (52%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           S + WL + N+ +NW WVW+L +PE K IWL LH ++PT FR R ++L +
Sbjct: 261 SGYSWLAKRKF-NWNEHDNWLWVWRLHIPEKYKFLIWLSLHNAIPTAEFRLGRGLALSRT 319

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVS 361
           C+RC ESI H +R A++IW +L + + L +WLY +F ++
Sbjct: 320 CHRCQNDFESILHCLRECPSAKEIWNLLGL----YSDNSDLCDWLYRGARSEDVFLFFLT 375

Query: 362 CWII 373
           W I
Sbjct: 376 IWWI 379

>XP_016178564.1 PREDICTED: uncharacterized protein LOC107621028 [Arachis ipaensis]
          Length = 570

 Score = 88.6 bits (218), Expect = 5e-18
 Identities = 43/124 (34%), Positives = 63/124 (50%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           S + WL N+ +NW WVW+L +PE K IWL LH ++P FR +R + L S+
Sbjct: 271 SGYSWLAKRKF-DWNEHDNWLWVWRLHIPEKYKFLIWLSLHNAIPMAEFRLSRGLDLSST 329

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVS 361
           C+RC ESI H +R A+++W +L + + L +WLY +F S
Sbjct: 330 CHRCQNGSESILHCLRECPSAKEVWNLLGL----YSDNSNLHDWLYRGARSGDVFLFFSS 385

Query: 362 CWII 373
           W I
Sbjct: 386 IWWI 389

>XP_016168263.1 PREDICTED: uncharacterized protein LOC107610776 [Arachis ipaensis]
          Length = 1591

 Score = 88.6 bits (218), Expect = 5e-18
 Identities = 44/122 (36%), Positives = 62/122 (50%)
 Frame = +2

Query: 8    HRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSSCN 187
            + WL+ + + ENW W+W L +P+ +K IWLILHG++PT R R ++ C
Sbjct: 1358 YEWLLERKVAW-DTNENWLWLWHLGIPKKVKGLIWLILHGAIPTASLRYRRRLTATDLCP 1416

Query: 188  RCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQSIMFAVSCW 367
            RC A ES+ H +R K IW L +G Q + WL L ++ I FAV W
Sbjct: 1417 RCNEAPESVEHCLRLCNKVSPIWESLKVGMQTWDVALDFQSWLRIKLRTNEGI-FAVGLW 1475

Query: 368  II 373
            I
Sbjct: 1476 WI 1477

>XP_019181390.1 PREDICTED: uncharacterized protein LOC109176414 [Ipomoea nil]
          Length = 1289

 Score = 87.8 bits (216), Expect = 9e-18
 Identities = 44/109 (40%), Positives = 60/109 (55%), Gaps = 1/109 (0%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           SA+ LV P N EENW+W+WKL + E +K F+WL+L L TNL R RH++ DSS
Sbjct: 945 SAYNSLVGTP----NDEENWAWIWKLKVVEKVKTFVWLLLKDKLLTNLERMKRHMTTDSS 1000

Query: 182  CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYT-QPLKEWLYSN 325
            C CG +ES SH +R+ A + W + G P+ W+ N
Sbjct: 1001 CASCGFGEESTSHLLRDCPLAEECWDLAKDGGGTGLVRYSPISTWIKEN 1049

>XP_016178767.1 PREDICTED: uncharacterized protein LOC107621248 [Arachis ipaensis]
          Length = 578

 Score = 87.4 bits (215), Expect = 1e-17
 Identities = 42/124 (33%), Positives = 64/124 (51%), Gaps = 1/124 (0%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           S + WL N+ +NW W+W+L +PE K IWL LH ++PT FR R ++L S+
Sbjct: 258 SGYSWLAKRKF-DWNEHDNWLWIWRLHIPEKYKFLIWLSLHNAIPTAEFRLGRGLALSST 316

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHMGQQQHFYTQPLKEWLYSNLTGHQ-SIMFAV 358
           C RC ESI H +R A+++W +L + + L +WLY ++F+
Sbjct: 317 CQRCQNGSESILHCLRECPSAKEVWNLLGL----YSDNSNLHDWLYRGARSENIFLLFST 372

Query: 359 SCWI 370
           WI
Sbjct: 373 IWWI 376

>AID60103.1 hypothetical protein [Brassica napus]
          Length = 620

 Score = 87.0 bits (214), Expect = 2e-17
 Identities = 44/130 (33%), Positives = 73/130 (56%), Gaps = 8/130 (6%)
 Frame = +2

Query: 2   SAHRWLVHEPLLHINQEENWSWVWKLPLPENIKHFIWLILHGSLPTNLFRANRHISLDSS 181
           +A+ +L + + N E+ +S VW++ PE ++ F+WL+ H + TN+ R RHIS + +
Sbjct: 269 AAYAFLTKDAVPRPNMEDLYSRVWRVTAPERVRVFLWLVTHQVIMTNMERKRRHISENGT 328

Query: 182 CNRCGVAQESISHAIRNYYKARQIWRVLHM-GQQQHFYTQPLKEWLYSNLTGHQSI---- 346
           C C E+I H +R+ A +WR L + +QQ F+ L EWLY NL +S+
Sbjct: 329 CPLCKSGDETILHVLRDCPAAAGLWRKLVLPTRQQRFFNLTLFEWLYENLANDKSVNGDQ 388

Query: 347 ---MFAVSCW 367
           +FA++ W
Sbjct: 389 WPSLFALTVW 398