BLASTX nr result

ID: Glycyrrhiza28_contig00029714 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza28_contig00029714
         (558 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU29765.1 hypothetical protein TSUD_161700 [Trifolium subterran...   132   1e-32
KYP52222.1 hypothetical protein KK1_025963 [Cajanus cajan]            122   1e-31
GAU45350.1 hypothetical protein TSUD_84730 [Trifolium subterraneum]   122   8e-29
GAU50378.1 hypothetical protein TSUD_368580 [Trifolium subterran...   116   4e-28
GAU44816.1 hypothetical protein TSUD_400350 [Trifolium subterran...   114   5e-26
GAU47548.1 hypothetical protein TSUD_284150 [Trifolium subterran...   110   1e-24
GAU17551.1 hypothetical protein TSUD_340950 [Trifolium subterran...   107   1e-23
XP_014625195.1 PREDICTED: uncharacterized protein LOC102663263 [...   100   2e-23
GAU50686.1 hypothetical protein TSUD_410420 [Trifolium subterran...   101   2e-23
XP_004492078.1 PREDICTED: uncharacterized protein LOC101505740 [...   100   2e-22
XP_006606642.1 PREDICTED: uncharacterized protein LOC102664916 [...   100   3e-22
GAU49951.1 hypothetical protein TSUD_408430 [Trifolium subterran...    87   4e-18
XP_003599950.2 hypothetical protein MTR_3g049490 [Medicago trunc...    85   2e-17
KHN08396.1 hypothetical protein glysoja_035935 [Glycine soja]          84   5e-17
XP_014626943.1 PREDICTED: uncharacterized protein LOC102664952 i...    81   9e-16
XP_014626942.1 PREDICTED: uncharacterized protein LOC102664952 i...    81   9e-16
XP_006603876.1 PREDICTED: uncharacterized protein LOC102664952 i...    81   2e-15
XP_017256079.1 PREDICTED: uncharacterized protein LOC108225664 [...    77   1e-13
GAU21349.1 hypothetical protein TSUD_189360 [Trifolium subterran...    75   2e-13
XP_017217391.1 PREDICTED: uncharacterized protein LOC108194968 [...    77   3e-13

>GAU29765.1 hypothetical protein TSUD_161700 [Trifolium subterraneum]
          Length = 911

 Score =  132 bits (333), Expect = 1e-32
 Identities = 69/128 (53%), Positives = 79/128 (61%)
 Frame = +2

Query: 116  IRSNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGE 295
            I SN PK WP IDSEPINPPLM+RAPGRPKK RNKTNDEP+  +  PR  +T+ CT C +
Sbjct: 784  IPSNGPKHWPIIDSEPINPPLMRRAPGRPKKKRNKTNDEPKNRNSLPRYLSTLTCTNCNK 843

Query: 296  LGHNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQATFTPTEAT 475
            +GHN RTCK K  ADR IPKG NK  K   PS ++ +     A   TQGS    T     
Sbjct: 844  VGHNRRTCKGKTAADRDIPKGGNKKQKTTTPSHAQTSAQG-VAQATTQGSAQVQTVLTQG 902

Query: 476  QSAPQPFQ 499
              AP   Q
Sbjct: 903  SQAPSTSQ 910



 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 26/46 (56%), Positives = 35/46 (76%)
 Frame = +2

Query: 2   WGVELSTHKAYMAKRRALEFIQGAGSDQSNHLKSDAEEIRSNPPKS 139
           WGV+LS  +AY AKR+A+E IQGA S+Q  HL+S A+E++S  P S
Sbjct: 395 WGVKLSQDQAYRAKRKAIEMIQGASSEQYTHLRSYADELKSTNPNS 440


>KYP52222.1 hypothetical protein KK1_025963 [Cajanus cajan]
          Length = 194

 Score =  122 bits (305), Expect = 1e-31
 Identities = 55/88 (62%), Positives = 61/88 (69%)
 Frame = +2

Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301
           SN P  WP +  EPI PPLM+RAPGRPKK R ++ DEPQ PHK PRQHTT+KC  C   G
Sbjct: 107 SNGPNLWPIVHCEPIKPPLMRRAPGRPKKQRKRSIDEPQNPHKLPRQHTTIKCKKCENFG 166

Query: 302 HNVRTCKRKATADRAIPKGRNKGAKRKK 385
           HN RTCK K  ADR IPKG NK   + K
Sbjct: 167 HNKRTCKGKTVADREIPKGENKVILKSK 194


>GAU45350.1 hypothetical protein TSUD_84730 [Trifolium subterraneum]
          Length = 912

 Score =  122 bits (305), Expect = 8e-29
 Identities = 74/137 (54%), Positives = 86/137 (62%), Gaps = 3/137 (2%)
 Frame = +2

Query: 122  SNPPKSWPKIDSEPI-NPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGEL 298
            S  PK WP IDSE I NPP M+RAPGRPKK RNK+NDEP+     PR   TV+C  C +L
Sbjct: 785  STGPKLWPIIDSESIINPPRMRRAPGRPKKQRNKSNDEPKNSKILPRYLKTVECKKCKKL 844

Query: 299  GHNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQV--TQGSQATFTPTEA 472
            GHN+RTCK K  ADR IPKG NK  K KK +      HA E+SQ    QGSQA   PT  
Sbjct: 845  GHNMRTCKGKTAADREIPKGGNKKQKSKKKT------HAGESSQAPQPQGSQA---PTVL 895

Query: 473  TQSAPQPFQDSQGASAH 523
            +Q +  P Q SQ +S +
Sbjct: 896  SQGSQAP-QGSQVSSQY 911


>GAU50378.1 hypothetical protein TSUD_368580 [Trifolium subterraneum]
          Length = 333

 Score =  116 bits (291), Expect = 4e-28
 Identities = 69/135 (51%), Positives = 83/135 (61%), Gaps = 1/135 (0%)
 Frame = +2

Query: 122 SNPPKSWPKIDSEPI-NPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGEL 298
           S  PK WP IDSE I NPP M+RAPGRPKK RNK+NDEP+     PR   TV+C  C +L
Sbjct: 206 STGPKLWPIIDSESIINPPRMRRAPGRPKKQRNKSNDEPKNSKILPRYLKTVECKKCKKL 265

Query: 299 GHNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQATFTPTEATQ 478
           GHN+RTCK K  A R IPKG NK  K KK + + ++  A +     QGSQA   PT  +Q
Sbjct: 266 GHNMRTCKGKTAAYREIPKGGNKKQKSKKKTRAGESSQAPQ----PQGSQA---PTVLSQ 318

Query: 479 SAPQPFQDSQGASAH 523
            +  P Q SQ  S +
Sbjct: 319 GSQAP-QGSQVPSQY 332


>GAU44816.1 hypothetical protein TSUD_400350 [Trifolium subterraneum]
          Length = 729

 Score =  114 bits (284), Expect = 5e-26
 Identities = 61/117 (52%), Positives = 77/117 (65%), Gaps = 7/117 (5%)
 Frame = +2

Query: 122 SNPPKSWPKIDSEPI-NPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGEL 298
           S  PK WP IDSE I NPP M+RA GRPKK RN++NDEP+     PR   TV+C  C +L
Sbjct: 602 STGPKLWPIIDSELIINPPRMRRASGRPKKQRNRSNDEPKNSKILPRYLKTVECKKCKKL 661

Query: 299 GHNVRTCKRKATADRAIPKGRNKGAKRKK------PSESEQAKHAEEASQVTQGSQA 451
           GHN+RTCK K  ADR IPKG NK  K KK       S++ Q + ++ ++ ++QGSQA
Sbjct: 662 GHNMRTCKGKTAADREIPKGGNKKQKSKKKTHAGESSQAPQPQRSQASTVLSQGSQA 718


>GAU47548.1 hypothetical protein TSUD_284150 [Trifolium subterraneum]
          Length = 747

 Score =  110 bits (274), Expect = 1e-24
 Identities = 62/155 (40%), Positives = 85/155 (54%), Gaps = 10/155 (6%)
 Frame = +2

Query: 122  SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301
            SN P+ WP  +++ INPPLM+RAPGRPKK RNK+NDEP+ P   PR  +TV C  C + G
Sbjct: 584  SNGPRVWPVTNTDAINPPLMRRAPGRPKKQRNKSNDEPKNPTVLPRHLSTVHCKKCKKPG 643

Query: 302  HNVRTCKRKATADRAIPKGRNK------GAKRKKPSESEQAK-HA---EEASQVTQGSQA 451
            HN+ +CK K  ADR IPKG NK      G     PS  +Q K HA   + A    +  + 
Sbjct: 644  HNISSCKGKTAADREIPKGGNKQKKTNVGPSDATPSAKKQKKSHAGPFDAAPTSAKKQKK 703

Query: 452  TFTPTEATQSAPQPFQDSQGASAHPQSMEEGSEAS 556
            T  P +      +P   ++ +      + + S+AS
Sbjct: 704  TTKPKKTNTYETRPSMQTESSVQVAVVVTQSSQAS 738


>GAU17551.1 hypothetical protein TSUD_340950 [Trifolium subterraneum]
          Length = 716

 Score =  107 bits (266), Expect = 1e-23
 Identities = 49/82 (59%), Positives = 60/82 (73%)
 Frame = +2

Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301
           +N P+ WP ++ EPI PP M+R+ GRPKK RNK NDEP+ P+  PR   TV+CT C ELG
Sbjct: 633 TNGPQLWPLLEPEPIKPPYMRRSIGRPKKNRNKRNDEPRNPNIVPRTLPTVQCTQCTELG 692

Query: 302 HNVRTCKRKATADRAIPKGRNK 367
           HN R+CK K  A+RAIPKG NK
Sbjct: 693 HNKRSCKGKRAAERAIPKGGNK 714


>XP_014625195.1 PREDICTED: uncharacterized protein LOC102663263 [Glycine max]
          Length = 154

 Score = 99.8 bits (247), Expect = 2e-23
 Identities = 46/88 (52%), Positives = 55/88 (62%)
 Frame = +2

Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304
           N P  WP + +  + PP+M+RAPGRPKK RNK NDE       PRQ  +V C  C  +GH
Sbjct: 33  NGPNLWPPLQTPVMLPPIMRRAPGRPKKARNKKNDESTKRPYLPRQSRSVVCKNCKAIGH 92

Query: 305 NVRTCKRKATADRAIPKGRNKGAKRKKP 388
           N RTCK K + DR IPKG NK  KR+ P
Sbjct: 93  NRRTCKGKTSTDRTIPKGGNKNLKRQSP 120


>GAU50686.1 hypothetical protein TSUD_410420 [Trifolium subterraneum]
          Length = 230

 Score =  101 bits (252), Expect = 2e-23
 Identities = 46/82 (56%), Positives = 56/82 (68%)
 Frame = +2

Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301
           S+ PK WP +D+ PI PP ++RAPGRPKKLR K NDE +   ++ R   T+ CT C  LG
Sbjct: 56  SDGPKGWPIVDATPIAPPYVRRAPGRPKKLRRKANDESRGSARRKRNQHTITCTRCKTLG 115

Query: 302 HNVRTCKRKATADRAIPKGRNK 367
           HN R+CK K  ADR IPKG NK
Sbjct: 116 HNRRSCKGKTVADRMIPKGGNK 137


>XP_004492078.1 PREDICTED: uncharacterized protein LOC101505740 [Cicer arietinum]
          Length = 305

 Score =  100 bits (250), Expect = 2e-22
 Identities = 49/88 (55%), Positives = 57/88 (64%)
 Frame = +2

Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301
           SN PK W   + E IN P+M+R+PGRPKK RNK+ND+P+     PRQ   VKC  C +L 
Sbjct: 212 SNGPKLWQASNVEHINSPIMRRSPGRPKKKRNKSNDKPKGSKILPRQFVAVKCKNCRKLD 271

Query: 302 HNVRTCKRKATADRAIPKGRNKGAKRKK 385
           HN RTCK K  ADR IPKG N   K KK
Sbjct: 272 HNTRTCKGKNAADREIPKGGNNVNKTKK 299


>XP_006606642.1 PREDICTED: uncharacterized protein LOC102664916 [Glycine max]
          Length = 333

 Score =  100 bits (250), Expect = 3e-22
 Identities = 53/124 (42%), Positives = 70/124 (56%), Gaps = 4/124 (3%)
 Frame = +2

Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304
           N P  WP + +  + PP+M+RAPGRPKK RNK NDE        RQ  +V C  C ++GH
Sbjct: 207 NEPNLWPPLQTPVMLPPMMRRAPGRPKKARNKKNDEFTKRSNLARQAKSVVCKKCRKIGH 266

Query: 305 NVRTCKRKATADRAIPKGRNKGAKRKKPSE----SEQAKHAEEASQVTQGSQATFTPTEA 472
           N RTCK K + DR+IPKG NK  KR+ P      +++ K+A   SQ T  +      T  
Sbjct: 267 NKRTCKGKTSTDRSIPKGGNKNLKRQAPCPTPVVTKKQKNAFTTSQTTSTAHQGGDITNE 326

Query: 473 TQSA 484
           TQ +
Sbjct: 327 TQQS 330


>GAU49951.1 hypothetical protein TSUD_408430 [Trifolium subterraneum]
          Length = 174

 Score = 86.7 bits (213), Expect = 4e-18
 Identities = 45/97 (46%), Positives = 55/97 (56%)
 Frame = +2

Query: 161 PINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGHNVRTCKRKATAD 340
           P+ PP ++RAPGRPKK R K NDEP  P +  +   TV C  C E GHN R+CK K  AD
Sbjct: 45  PLQPPYVRRAPGRPKKARRKANDEPN-PKRMKKAPGTVTCNRCLEAGHNQRSCKGKTAAD 103

Query: 341 RAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQA 451
           R IPKG NK     K  E++      +     QGSQ+
Sbjct: 104 RLIPKGGNKSNTTMKHPEAQPGASVSK----IQGSQS 136


>XP_003599950.2 hypothetical protein MTR_3g049490 [Medicago truncatula] AES70201.2
           hypothetical protein MTR_3g049490 [Medicago truncatula]
          Length = 186

 Score = 85.1 bits (209), Expect = 2e-17
 Identities = 41/75 (54%), Positives = 50/75 (66%), Gaps = 1/75 (1%)
 Frame = +2

Query: 146 KIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQ-PRQHTTVKCTTCGELGHNVRTCK 322
           ++++EPI PP  +RAPGRPKK R K NDEP+   K+  R   T++C  C ELGHN RTC 
Sbjct: 11  EVNTEPILPPGARRAPGRPKKARRKENDEPKTASKKGKRNQVTLRCRRCKELGHNTRTCG 70

Query: 323 RKATADRAIPKGRNK 367
            K  ADR IP G NK
Sbjct: 71  GKTGADRRIPVGGNK 85


>KHN08396.1 hypothetical protein glysoja_035935 [Glycine soja]
          Length = 164

 Score = 83.6 bits (205), Expect = 5e-17
 Identities = 42/90 (46%), Positives = 51/90 (56%)
 Frame = +2

Query: 176 LMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGHNVRTCKRKATADRAIPK 355
           +M+RA GRP K RNK NDE     K PRQ   V C  CG++ HN RTCK K T DR IPK
Sbjct: 58  MMRRALGRPNKARNKRNDESTNRFKLPRQSKLVVCKKCGKIDHNKRTCKGKTTTDRNIPK 117

Query: 356 GRNKGAKRKKPSESEQAKHAEEASQVTQGS 445
           G NK  KR+  S +      ++    T G+
Sbjct: 118 GGNKNLKRQTTSPTNVVTKKQKNVSCTSGT 147


>XP_014626943.1 PREDICTED: uncharacterized protein LOC102664952 isoform X3 [Glycine
           max]
          Length = 186

 Score = 80.9 bits (198), Expect = 9e-16
 Identities = 38/81 (46%), Positives = 47/81 (58%)
 Frame = +2

Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304
           N P  WP + +  + PP+M++A GRPKK RNK NDE        RQ  +V C  C  +GH
Sbjct: 22  NGPNLWPPLQTPVMLPPIMRKAHGRPKKARNKKNDESTKRPNLARQSRSVVCKNCRTIGH 81

Query: 305 NVRTCKRKATADRAIPKGRNK 367
           N R CK K + DR IPK  NK
Sbjct: 82  NRRACKGKTSTDRIIPKEGNK 102


>XP_014626942.1 PREDICTED: uncharacterized protein LOC102664952 isoform X2 [Glycine
           max]
          Length = 188

 Score = 80.9 bits (198), Expect = 9e-16
 Identities = 38/81 (46%), Positives = 47/81 (58%)
 Frame = +2

Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304
           N P  WP + +  + PP+M++A GRPKK RNK NDE        RQ  +V C  C  +GH
Sbjct: 24  NGPNLWPPLQTPVMLPPIMRKAHGRPKKARNKKNDESTKRPNLARQSRSVVCKNCRTIGH 83

Query: 305 NVRTCKRKATADRAIPKGRNK 367
           N R CK K + DR IPK  NK
Sbjct: 84  NRRACKGKTSTDRIIPKEGNK 104


>XP_006603876.1 PREDICTED: uncharacterized protein LOC102664952 isoform X1 [Glycine
           max]
          Length = 216

 Score = 80.9 bits (198), Expect = 2e-15
 Identities = 38/81 (46%), Positives = 47/81 (58%)
 Frame = +2

Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304
           N P  WP + +  + PP+M++A GRPKK RNK NDE        RQ  +V C  C  +GH
Sbjct: 52  NGPNLWPPLQTPVMLPPIMRKAHGRPKKARNKKNDESTKRPNLARQSRSVVCKNCRTIGH 111

Query: 305 NVRTCKRKATADRAIPKGRNK 367
           N R CK K + DR IPK  NK
Sbjct: 112 NRRACKGKTSTDRIIPKEGNK 132


>XP_017256079.1 PREDICTED: uncharacterized protein LOC108225664 [Daucus carota
           subsp. sativus]
          Length = 264

 Score = 76.6 bits (187), Expect = 1e-13
 Identities = 44/108 (40%), Positives = 54/108 (50%)
 Frame = +2

Query: 125 NPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGH 304
           N  + W K  S    PPL+K  PGRPKK RNK ND P    K  RQ+T V C+ C E  H
Sbjct: 89  NSSEYWEKTGSPGPLPPLIKVQPGRPKKKRNKKNDIPLDTTKLRRQNTNVFCSYCKEKSH 148

Query: 305 NVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQ 448
           N RTC  K T   +  K   K  K KK   S+ +K  + ++  T   Q
Sbjct: 149 NARTCPAKKTDQASGVKTNVKARKPKKAVASKVSKKQDTSTSATPSDQ 196


>GAU21349.1 hypothetical protein TSUD_189360 [Trifolium subterraneum]
          Length = 177

 Score = 74.7 bits (182), Expect = 2e-13
 Identities = 44/103 (42%), Positives = 51/103 (49%)
 Frame = +2

Query: 122 SNPPKSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELG 301
           SN    WPK D   I PP  KR PGRPKK+R +  DE   P +  R +TT +C  C E G
Sbjct: 5   SNGRNRWPKTDDPDILPPQYKRGPGRPKKMRRRDPDEAD-PLRWTRSNTTHQCQRCLEYG 63

Query: 302 HNVRTCKRKATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQ 430
           HN RTCK  A      PK +N  A +   S S        ASQ
Sbjct: 64  HNARTCKLPA------PKKKNDVADKNVASTSNDETPNAHASQ 100


>XP_017217391.1 PREDICTED: uncharacterized protein LOC108194968 [Daucus carota
           subsp. sativus]
          Length = 532

 Score = 77.4 bits (189), Expect = 3e-13
 Identities = 47/138 (34%), Positives = 65/138 (47%), Gaps = 4/138 (2%)
 Frame = +2

Query: 134 KSWPKIDSEPINPPLMKRAPGRPKKLRNKTNDEPQIPHKQPRQHTTVKCTTCGELGHNVR 313
           + W + +     PP+MK   GRPKK R+K ND P    +  RQ+T V+C+ C E  HN+R
Sbjct: 357 EQWEQTEYPRPLPPVMKVQTGRPKKSRSKKNDTPAGATRLKRQNTKVRCSYCTEYSHNLR 416

Query: 314 TCKR----KATADRAIPKGRNKGAKRKKPSESEQAKHAEEASQVTQGSQATFTPTEATQS 481
           TC      KA     + K RNK  K KK    E     +E ++   G   T  PTE +Q 
Sbjct: 417 TCPARAHDKANGCEKVVKRRNK--KTKKDDIEEGTAADDEGNEDAAGDTQTDAPTEGSQG 474

Query: 482 APQPFQDSQGASAHPQSM 535
                Q    ++A P  +
Sbjct: 475 GVFNMQQPHDSTARPSPL 492


Top