BLASTX nr result
ID: Glycyrrhiza28_contig00029714
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza28_contig00029714
         (558 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

GAU29765.1 hypothetical protein TSUD_161700 [Trifolium subterran...  132   1e-32
KYP52222.1 hypothetical protein KK1_025963 [Cajanus cajan]           122   1e-31
GAU45350.1 hypothetical protein TSUD_84730 [Trifolium subterraneum]  122   8e-29
GAU50378.1 hypothetical protein TSUD_368580 [Trifolium subterran...  116   4e-28
GAU44816.1 hypothetical protein TSUD_400350 [Trifolium subterran...  114   5e-26
GAU47548.1 hypothetical protein TSUD_284150 [Trifolium subterran...  110   1e-24
GAU17551.1 hypothetical protein TSUD_340950 [Trifolium subterran...  107   1e-23
XP_014625195.1 PREDICTED: uncharacterized protein LOC102663263 [...  100   2e-23
GAU50686.1 hypothetical protein TSUD_410420 [Trifolium subterran...  101   2e-23
XP_004492078.1 PREDICTED: uncharacterized protein LOC101505740 [...  100   2e-22
XP_006606642.1 PREDICTED: uncharacterized protein LOC102664916 [...  100   3e-22
GAU49951.1 hypothetical protein TSUD_408430 [Trifolium subterran...   87   4e-18
XP_003599950.2 hypothetical protein MTR_3g049490 [Medicago trunc...   85   2e-17
KHN08396.1 hypothetical protein glysoja_035935 [Glycine soja]         84   5e-17
XP_014626943.1 PREDICTED: uncharacterized protein LOC102664952 i...   81   9e-16
XP_014626942.1 PREDICTED: uncharacterized protein LOC102664952 i...   81   9e-16
XP_006603876.1 PREDICTED: uncharacterized protein LOC102664952 i...   81   2e-15
XP_017256079.1 PREDICTED: uncharacterized protein LOC108225664 [...   77   1e-13
GAU21349.1 hypothetical protein TSUD_189360 [Trifolium subterran...
75 2e-13 XP_017217391.1 PREDICTED: uncharacterized protein LOC108194968 [... 77 3e-13 >GAU29765.1 hypothetical protein TSUD_161700 [Trifolium subterraneum] Length = 911 Score = 132 bits (333), Expect = 1e-32 Identities = 69/128 (53%), Positives = 79/128 (61%) Frame = +2 Query: 116 IRSNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGE 295 I SN PK WP IDSEPINPPLM+RAPGRPKK RNKTNDEP+ + PR +T+ CT C + Sbjct: 784 IPSNGPKHWPIIDSEPINPPLMRRAPGRPKKKRNKTNDEPKNRNSLPRYLSTLTCTNCNK 843 Query: 296 LGHNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQATFTPTEAT 475 +GHN RTCK K ADR IPKG NK K PS ++ + A TQGS T Sbjct: 844 VGHNRRTCKGKTAADRDIPKGGNKKQKTTTPSHAQTSAQG-VAQATTQGSAQVQTVLTQG 902 Query: 476 QSAPQPFQ 499 AP Q Sbjct: 903 SQAPSTSQ 910 Score = 57.4 bits (137), Expect = 2e-06 Identities = 26/46 (56%), Positives = 35/46 (76%) Frame = +2 Query: 2 WGVELSTHKAYMAKRRALEFIQGAGSDQSNHLKSDAEEIRSNPPKS 139 WGV+LS +AY AKR+A+E IQGA S+Q HL+S A+E++S P S Sbjct: 395 WGVKLSQDQAYRAKRKAIEMIQGASSEQYTHLRSYADELKSTNPNS 440 >KYP52222.1 hypothetical protein KK1_025963 [Cajanus cajan] Length = 194 Score = 122 bits (305), Expect = 1e-31 Identities = 55/88 (62%), Positives = 61/88 (69%) Frame = +2 Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301 SN P WP + EPI PPLM+RAPGRPKK R ++ DEPQ PHK PRQHTT+KC C G Sbjct: 107 SNGPNLWPIVHCEPIKPPLMRRAPGRPKKQRKRSIDEPQNPHKLPRQHTTIKCKKCENFG 166 Query: 302 HNVRTCKRKATADRAIPKGRNKGAKRKK 385 HN RTCK K ADR IPKG NK + K Sbjct: 167 HNKRTCKGKTVADREIPKGENKVILKSK 194 >GAU45350.1 hypothetical protein TSUD_84730 [Trifolium subterraneum] Length = 912 Score = 122 bits (305), Expect = 8e-29 Identities = 74/137 (54%), Positives = 86/137 (62%), Gaps = 3/137 (2%) Frame = +2 Query: 122 SNPPKSWPKIDSEPI-NPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGEL 298 S PK WP IDSE I NPP M+RAPGRPKK RNK+NDEP+ PR TV+C C +L Sbjct: 785 STGPKLWPIIDSESIINPPRMRRAPGRPKKQRNKSNDEPKNSKILPRYLKTVECKKCKKL 844 Query: 299 GHNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQV--TQGSQATFTPTEA 472 GHN+RTCK K ADR IPKG NK K 
KK + HA E+SQ QGSQA PT Sbjct: 845 GHNMRTCKGKTAADREIPKGGNKKQKSKKKT------HAGESSQAPQPQGSQA---PTVL 895 Query: 473 TQSAPQPFQDSQGASAH 523 +Q + P Q SQ +S + Sbjct: 896 SQGSQAP-QGSQVSSQY 911 >GAU50378.1 hypothetical protein TSUD_368580 [Trifolium subterraneum] Length = 333 Score = 116 bits (291), Expect = 4e-28 Identities = 69/135 (51%), Positives = 83/135 (61%), Gaps = 1/135 (0%) Frame = +2 Query: 122 SNPPKSWPKIDSEPI-NPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGEL 298 S PK WP IDSE I NPP M+RAPGRPKK RNK+NDEP+ PR TV+C C +L Sbjct: 206 STGPKLWPIIDSESIINPPRMRRAPGRPKKQRNKSNDEPKNSKILPRYLKTVECKKCKKL 265 Query: 299 GHNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQATFTPTEATQ 478 GHN+RTCK K A R IPKG NK K KK + + ++ A + QGSQA PT +Q Sbjct: 266 GHNMRTCKGKTAAYREIPKGGNKKQKSKKKTRAGESSQAPQ----PQGSQA---PTVLSQ 318 Query: 479 SAPQPFQDSQGASAH 523 + P Q SQ S + Sbjct: 319 GSQAP-QGSQVPSQY 332 >GAU44816.1 hypothetical protein TSUD_400350 [Trifolium subterraneum] Length = 729 Score = 114 bits (284), Expect = 5e-26 Identities = 61/117 (52%), Positives = 77/117 (65%), Gaps = 7/117 (5%) Frame = +2 Query: 122 SNPPKSWPKIDSEPI-NPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGEL 298 S PK WP IDSE I NPP M+RA GRPKK RN++NDEP+ PR TV+C C +L Sbjct: 602 STGPKLWPIIDSELIINPPRMRRASGRPKKQRNRSNDEPKNSKILPRYLKTVECKKCKKL 661 Query: 299 GHNVRTCKRKATADRAIPKGRNKGAKRKK------PSESEQAKHAEEASQVTQGSQA 451 GHN+RTCK K ADR IPKG NK K KK S++ Q + ++ ++ ++QGSQA Sbjct: 662 GHNMRTCKGKTAADREIPKGGNKKQKSKKKTHAGESSQAPQPQRSQASTVLSQGSQA 718 >GAU47548.1 hypothetical protein TSUD_284150 [Trifolium subterraneum] Length = 747 Score = 110 bits (274), Expect = 1e-24 Identities = 62/155 (40%), Positives = 85/155 (54%), Gaps = 10/155 (6%) Frame = +2 Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301 SN P+ WP +++ INPPLM+RAPGRPKK RNK+NDEP+ P PR +TV C C + G Sbjct: 584 SNGPRVWPVTNTDAINPPLMRRAPGRPKKQRNKSNDEPKNPTVLPRHLSTVHCKKCKKPG 643 Query: 302 HNVRTCKRKATADRAIPKGRNK------GAKRKKPSESEQAK-HA---EEASQVTQGSQA 451 HN+ +CK K ADR IPKG NK G PS +Q K HA + 
A + + Sbjct: 644 HNISSCKGKTAADREIPKGGNKQKKTNVGPSDATPSAKKQKKSHAGPFDAAPTSAKKQKK 703 Query: 452 TFTPTEATQSAPQPFQDSQGASAHPQSMEEGSEAS 556 T P + +P ++ + + + S+AS Sbjct: 704 TTKPKKTNTYETRPSMQTESSVQVAVVVTQSSQAS 738 >GAU17551.1 hypothetical protein TSUD_340950 [Trifolium subterraneum] Length = 716 Score = 107 bits (266), Expect = 1e-23 Identities = 49/82 (59%), Positives = 60/82 (73%) Frame = +2 Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301 +N P+ WP ++ EPI PP M+R+ GRPKK RNK NDEP+ P+ PR TV+CT C ELG Sbjct: 633 TNGPQLWPLLEPEPIKPPYMRRSIGRPKKNRNKRNDEPRNPNIVPRTLPTVQCTQCTELG 692 Query: 302 HNVRTCKRKATADRAIPKGRNK 367 HN R+CK K A+RAIPKG NK Sbjct: 693 HNKRSCKGKRAAERAIPKGGNK 714 >XP_014625195.1 PREDICTED: uncharacterized protein LOC102663263 [Glycine max] Length = 154 Score = 99.8 bits (247), Expect = 2e-23 Identities = 46/88 (52%), Positives = 55/88 (62%) Frame = +2 Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304 N P WP + + + PP+M+RAPGRPKK RNK NDE PRQ +V C C +GH Sbjct: 33 NGPNLWPPLQTPVMLPPIMRRAPGRPKKARNKKNDESTKRPYLPRQSRSVVCKNCKAIGH 92 Query: 305 NVRTCKRKATADRAIPKGRNKGAKRKKP 388 N RTCK K + DR IPKG NK KR+ P Sbjct: 93 NRRTCKGKTSTDRTIPKGGNKNLKRQSP 120 >GAU50686.1 hypothetical protein TSUD_410420 [Trifolium subterraneum] Length = 230 Score = 101 bits (252), Expect = 2e-23 Identities = 46/82 (56%), Positives = 56/82 (68%) Frame = +2 Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301 S+ PK WP +D+ PI PP ++RAPGRPKKLR K NDE + ++ R T+ CT C LG Sbjct: 56 SDGPKGWPIVDATPIAPPYVRRAPGRPKKLRRKANDESRGSARRKRNQHTITCTRCKTLG 115 Query: 302 HNVRTCKRKATADRAIPKGRNK 367 HN R+CK K ADR IPKG NK Sbjct: 116 HNRRSCKGKTVADRMIPKGGNK 137 >XP_004492078.1 PREDICTED: uncharacterized protein LOC101505740 [Cicer arietinum] Length = 305 Score = 100 bits (250), Expect = 2e-22 Identities = 49/88 (55%), Positives = 57/88 (64%) Frame = +2 Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301 SN PK W + E IN P+M+R+PGRPKK 
RNK+ND+P+ PRQ VKC C +L Sbjct: 212 SNGPKLWQASNVEHINSPIMRRSPGRPKKKRNKSNDKPKGSKILPRQFVAVKCKNCRKLD 271 Query: 302 HNVRTCKRKATADRAIPKGRNKGAKRKK 385 HN RTCK K ADR IPKG N K KK Sbjct: 272 HNTRTCKGKNAADREIPKGGNNVNKTKK 299 >XP_006606642.1 PREDICTED: uncharacterized protein LOC102664916 [Glycine max] Length = 333 Score = 100 bits (250), Expect = 3e-22 Identities = 53/124 (42%), Positives = 70/124 (56%), Gaps = 4/124 (3%) Frame = +2 Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304 N P WP + + + PP+M+RAPGRPKK RNK NDE RQ +V C C ++GH Sbjct: 207 NEPNLWPPLQTPVMLPPMMRRAPGRPKKARNKKNDEFTKRSNLARQAKSVVCKKCRKIGH 266 Query: 305 NVRTCKRKATADRAIPKGRNKGAKRKKPSE----SEQAKHAEEASQVTQGSQATFTPTEA 472 N RTCK K + DR+IPKG NK KR+ P +++ K+A SQ T + T Sbjct: 267 NKRTCKGKTSTDRSIPKGGNKNLKRQAPCPTPVVTKKQKNAFTTSQTTSTAHQGGDITNE 326 Query: 473 TQSA 484 TQ + Sbjct: 327 TQQS 330 >GAU49951.1 hypothetical protein TSUD_408430 [Trifolium subterraneum] Length = 174 Score = 86.7 bits (213), Expect = 4e-18 Identities = 45/97 (46%), Positives = 55/97 (56%) Frame = +2 Query: 161 PINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGHNVRTCKRKATAD 340 P+ PP ++RAPGRPKK R K NDEP P + + TV C C E GHN R+CK K AD Sbjct: 45 PLQPPYVRRAPGRPKKARRKANDEPN-PKRMKKAPGTVTCNRCLEAGHNQRSCKGKTAAD 103 Query: 341 RAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQA 451 R IPKG NK K E++ + QGSQ+ Sbjct: 104 RLIPKGGNKSNTTMKHPEAQPGASVSK----IQGSQS 136 >XP_003599950.2 hypothetical protein MTR_3g049490 [Medicago truncatula] AES70201.2 hypothetical protein MTR_3g049490 [Medicago truncatula] Length = 186 Score = 85.1 bits (209), Expect = 2e-17 Identities = 41/75 (54%), Positives = 50/75 (66%), Gaps = 1/75 (1%) Frame = +2 Query: 146 KIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQ-PRQHTTVKCTTCGELGHNVRTCK 322 ++++EPI PP +RAPGRPKK R K NDEP+ K+ R T++C C ELGHN RTC Sbjct: 11 EVNTEPILPPGARRAPGRPKKARRKENDEPKTASKKGKRNQVTLRCRRCKELGHNTRTCG 70 Query: 323 RKATADRAIPKGRNK 367 K ADR IP G NK Sbjct: 71 GKTGADRRIPVGGNK 85 >KHN08396.1 hypothetical protein glysoja_035935 [Glycine soja] 
Length = 164 Score = 83.6 bits (205), Expect = 5e-17 Identities = 42/90 (46%), Positives = 51/90 (56%) Frame = +2 Query: 176 LMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGHNVRTCKRKATADRAIPK 355 +M+RA GRP K RNK NDE K PRQ V C CG++ HN RTCK K T DR IPK Sbjct: 58 MMRRALGRPNKARNKRNDESTNRFKLPRQSKLVVCKKCGKIDHNKRTCKGKTTTDRNIPK 117 Query: 356 GRNKGAKRKKPSESEQAKHAEEASQVTQGS 445 G NK KR+ S + ++ T G+ Sbjct: 118 GGNKNLKRQTTSPTNVVTKKQKNVSCTSGT 147 >XP_014626943.1 PREDICTED: uncharacterized protein LOC102664952 isoform X3 [Glycine max] Length = 186 Score = 80.9 bits (198), Expect = 9e-16 Identities = 38/81 (46%), Positives = 47/81 (58%) Frame = +2 Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304 N P WP + + + PP+M++A GRPKK RNK NDE RQ +V C C +GH Sbjct: 22 NGPNLWPPLQTPVMLPPIMRKAHGRPKKARNKKNDESTKRPNLARQSRSVVCKNCRTIGH 81 Query: 305 NVRTCKRKATADRAIPKGRNK 367 N R CK K + DR IPK NK Sbjct: 82 NRRACKGKTSTDRIIPKEGNK 102 >XP_014626942.1 PREDICTED: uncharacterized protein LOC102664952 isoform X2 [Glycine max] Length = 188 Score = 80.9 bits (198), Expect = 9e-16 Identities = 38/81 (46%), Positives = 47/81 (58%) Frame = +2 Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304 N P WP + + + PP+M++A GRPKK RNK NDE RQ +V C C +GH Sbjct: 24 NGPNLWPPLQTPVMLPPIMRKAHGRPKKARNKKNDESTKRPNLARQSRSVVCKNCRTIGH 83 Query: 305 NVRTCKRKATADRAIPKGRNK 367 N R CK K + DR IPK NK Sbjct: 84 NRRACKGKTSTDRIIPKEGNK 104 >XP_006603876.1 PREDICTED: uncharacterized protein LOC102664952 isoform X1 [Glycine max] Length = 216 Score = 80.9 bits (198), Expect = 2e-15 Identities = 38/81 (46%), Positives = 47/81 (58%) Frame = +2 Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304 N P WP + + + PP+M++A GRPKK RNK NDE RQ +V C C +GH Sbjct: 52 NGPNLWPPLQTPVMLPPIMRKAHGRPKKARNKKNDESTKRPNLARQSRSVVCKNCRTIGH 111 Query: 305 NVRTCKRKATADRAIPKGRNK 367 N R CK K + DR IPK NK Sbjct: 112 NRRACKGKTSTDRIIPKEGNK 132 >XP_017256079.1 PREDICTED: uncharacterized protein LOC108225664 [Daucus 
carota subsp. sativus] Length = 264 Score = 76.6 bits (187), Expect = 1e-13 Identities = 44/108 (40%), Positives = 54/108 (50%) Frame = +2 Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304 N + W K S PPL+K PGRPKK RNK ND P K RQ+T V C+ C E H Sbjct: 89 NSSEYWEKTGSPGPLPPLIKVQPGRPKKKRNKKNDIPLDTTKLRRQNTNVFCSYCKEKSH 148 Query: 305 NVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQ 448 N RTC K T + K K K KK S+ +K + ++ T Q Sbjct: 149 NARTCPAKKTDQASGVKTNVKARKPKKAVASKVSKKQDTSTSATPSDQ 196 >GAU21349.1 hypothetical protein TSUD_189360 [Trifolium subterraneum] Length = 177 Score = 74.7 bits (182), Expect = 2e-13 Identities = 44/103 (42%), Positives = 51/103 (49%) Frame = +2 Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301 SN WPK D I PP KR PGRPKK+R + DE P + R +TT +C C E G Sbjct: 5 SNGRNRWPKTDDPDILPPQYKRGPGRPKKMRRRDPDEAD-PLRWTRSNTTHQCQRCLEYG 63 Query: 302 HNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQ 430 HN RTCK A PK +N A + S S ASQ Sbjct: 64 HNARTCKLPA------PKKKNDVADKNVASTSNDETPNAHASQ 100 >XP_017217391.1 PREDICTED: uncharacterized protein LOC108194968 [Daucus carota subsp. sativus] Length = 532 Score = 77.4 bits (189), Expect = 3e-13 Identities = 47/138 (34%), Positives = 65/138 (47%), Gaps = 4/138 (2%) Frame = +2 Query: 134 KSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGHNVR 313 + W + + PP+MK GRPKK R+K ND P + RQ+T V+C+ C E HN+R Sbjct: 357 EQWEQTEYPRPLPPVMKVQTGRPKKSRSKKNDTPAGATRLKRQNTKVRCSYCTEYSHNLR 416 Query: 314 TCKR----KATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQATFTPTEATQS 481 TC KA + K RNK K KK E +E ++ G T PTE +Q Sbjct: 417 TCPARAHDKANGCEKVVKRRNK--KTKKDDIEEGTAADDEGNEDAAGDTQTDAPTEGSQG 474 Query: 482 APQPFQDSQGASAHPQSM 535 Q ++A P + Sbjct: 475 GVFNMQQPHDSTARPSPL 492