BLASTX nr result

ID: Glycyrrhiza23_contig00017322 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00017322
         (523 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003610734.1| Pentatricopeptide repeat-containing protein ...   330   6e-89
ref|XP_003538644.1| PREDICTED: pentatricopeptide repeat-containi...   325   2e-87
ref|XP_002278762.1| PREDICTED: pentatricopeptide repeat-containi...   281   3e-74
ref|XP_002315764.1| predicted protein [Populus trichocarpa] gi|2...   273   8e-72
ref|NP_193218.1| pentatricopeptide repeat-containing protein [Ar...   265   2e-69

>ref|XP_003610734.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355512069|gb|AES93692.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 726

 Score =  330 bits (847), Expect = 6e-89
 Identities = 159/173 (91%), Positives = 165/173 (95%)
 Frame = +3

Query: 3   GFGRVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLF 182
           GFGR LSVNNALIDMYAKCGNLV AREVF+NMPRKNVISWSSMINAFAMHG+ADSAI LF
Sbjct: 384 GFGRALSVNNALIDMYAKCGNLVKAREVFENMPRKNVISWSSMINAFAMHGNADSAIKLF 443

Query: 183 QRMKEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCR 362
           +RMKE NIEPNGVTFIGVLYAC HAGLVEEG+K FSSMINEHGISP+REHYGCMVDLYCR
Sbjct: 444 RRMKEVNIEPNGVTFIGVLYACGHAGLVEEGEKLFSSMINEHGISPTREHYGCMVDLYCR 503

Query: 363 ANLLRKAIEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLELEPDH 521
           AN LRKAIE+IETMPFAPNVIIWGSLMSACQVHGE ELGEFAAKRLLELEPDH
Sbjct: 504 ANFLRKAIELIETMPFAPNVIIWGSLMSACQVHGEAELGEFAAKRLLELEPDH 556



 Score = 95.9 bits (237), Expect = 3e-18
 Identities = 52/169 (30%), Positives = 92/169 (54%), Gaps = 2/169 (1%)
 Frame = +3

Query: 18  LSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRMKE 197
           L V+ A++  YAK G + +AR +FD M  ++++ WS+MI+ +A       A+ LF  M +
Sbjct: 288 LIVSTAMLSGYAKLGMVKDARFIFDQMIERDLVCWSAMISGYAESDQPQEALKLFDEMLQ 347

Query: 198 ENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANLLR 377
           +   P+ +T + V+ ACSH G + +   +  + ++  G   +      ++D+Y +   L 
Sbjct: 348 KRSVPDQITMLSVISACSHVGALAQA-NWIHTYVDRSGFGRALSVNNALIDMYAKCGNLV 406

Query: 378 KAIEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLE--LEPD 518
           KA EV E MP   NVI W S+++A  +HG  +      +R+ E  +EP+
Sbjct: 407 KAREVFENMP-RKNVISWSSMINAFAMHGNADSAIKLFRRMKEVNIEPN 454



 Score = 73.6 bits (179), Expect = 2e-11
 Identities = 38/161 (23%), Positives = 84/161 (52%)
 Frame = +3

Query: 24  VNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRMKEEN 203
           +   LI MYA C  +++AR +FD M   + ++W+ +I+ +  +G  D A+ LF+ M+  +
Sbjct: 158 IQTGLIAMYASCRRIMDARLLFDKMCHPDAVAWNMIIDGYCQNGHYDDALRLFEDMRSSD 217

Query: 204 IEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANLLRKA 383
           ++P+ V    VL AC HAG +  G +     + ++G +        ++++Y     +  A
Sbjct: 218 MKPDSVILCTVLSACGHAGNLSYG-RTIHEFVKDNGYAIDSHLQTALINMYANCGAMDLA 276

Query: 384 IEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLE 506
            ++ + +  + ++I+  +++S     G V+   F   +++E
Sbjct: 277 RKIYDGLS-SKHLIVSTAMLSGYAKLGMVKDARFIFDQMIE 316


>ref|XP_003538644.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g14820-like [Glycine max]
          Length = 721

 Score =  325 bits (833), Expect = 2e-87
 Identities = 156/173 (90%), Positives = 163/173 (94%)
 Frame = +3

Query: 3   GFGRVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLF 182
           GFGR L +NNALIDMYAKCGNLV AREVF+NMPRKNVISWSSMINAFAMHGDADSAI+LF
Sbjct: 379 GFGRTLPINNALIDMYAKCGNLVKAREVFENMPRKNVISWSSMINAFAMHGDADSAIALF 438

Query: 183 QRMKEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCR 362
            RMKE+NIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEH ISP REHYGCMVDLYCR
Sbjct: 439 HRMKEQNIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHRISPQREHYGCMVDLYCR 498

Query: 363 ANLLRKAIEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLELEPDH 521
           AN LRKA+E+IETMPF PNVIIWGSLMSACQ HGE+ELGEFAA RLLELEPDH
Sbjct: 499 ANHLRKAMELIETMPFPPNVIIWGSLMSACQNHGEIELGEFAATRLLELEPDH 551



 Score = 93.6 bits (231), Expect = 2e-17
 Identities = 52/167 (31%), Positives = 93/167 (55%), Gaps = 2/167 (1%)
 Frame = +3

Query: 24  VNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRMKEEN 203
           V+ A++  YAK G + +AR +FD M  K+++ WS+MI+ +A       A+ LF  M+   
Sbjct: 285 VSTAMLSGYAKLGMVQDARFIFDRMVEKDLVCWSAMISGYAESYQPLEALQLFNEMQRRR 344

Query: 204 IEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANLLRKA 383
           I P+ +T + V+ AC++ G + +  K+  +  +++G   +      ++D+Y +   L KA
Sbjct: 345 IVPDQITMLSVISACANVGALVQA-KWIHTYADKNGFGRTLPINNALIDMYAKCGNLVKA 403

Query: 384 IEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLE--LEPD 518
            EV E MP   NVI W S+++A  +HG+ +       R+ E  +EP+
Sbjct: 404 REVFENMP-RKNVISWSSMINAFAMHGDADSAIALFHRMKEQNIEPN 449



 Score = 89.0 bits (219), Expect = 4e-16
 Identities = 46/161 (28%), Positives = 87/161 (54%)
 Frame = +3

Query: 24  VNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRMKEEN 203
           + +ALI MYA CG +++AR +FD M  ++V++W+ MI+ ++ +   D  + L++ MK   
Sbjct: 153 IQSALIAMYAACGRIMDARFLFDKMSHRDVVTWNIMIDGYSQNAHYDHVLKLYEEMKTSG 212

Query: 204 IEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANLLRKA 383
            EP+ +    VL AC+HAG +  G K     I ++G          +V++Y     +  A
Sbjct: 213 TEPDAIILCTVLSACAHAGNLSYG-KAIHQFIKDNGFRVGSHIQTSLVNMYANCGAMHLA 271

Query: 384 IEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLE 506
            EV + +P + ++++  +++S     G V+   F   R++E
Sbjct: 272 REVYDQLP-SKHMVVSTAMLSGYAKLGMVQDARFIFDRMVE 311


>ref|XP_002278762.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14820
           [Vitis vinifera] gi|297737070|emb|CBI26271.3| unnamed
           protein product [Vitis vinifera]
          Length = 727

 Score =  281 bits (720), Expect = 3e-74
 Identities = 129/173 (74%), Positives = 152/173 (87%)
 Frame = +3

Query: 3   GFGRVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLF 182
           GFG  L +NNALI+MYAKCG+L  AR +FD MPRKNVISW+ MI+AFAMHGDA SA+  F
Sbjct: 385 GFGGALPINNALIEMYAKCGSLERARRIFDKMPRKNVISWTCMISAFAMHGDAGSALRFF 444

Query: 183 QRMKEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCR 362
            +M++ENIEPNG+TF+GVLYACSHAGLVEEG+K F SMINEH I+P   HYGCMVDL+ R
Sbjct: 445 HQMEDENIEPNGITFVGVLYACSHAGLVEEGRKIFYSMINEHNITPKHVHYGCMVDLFGR 504

Query: 363 ANLLRKAIEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLELEPDH 521
           ANLLR+A+E++E MP APNVIIWGSLM+AC+VHGE+ELGEFAAKRLLEL+PDH
Sbjct: 505 ANLLREALELVEAMPLAPNVIIWGSLMAACRVHGEIELGEFAAKRLLELDPDH 557



 Score = 95.1 bits (235), Expect = 5e-18
 Identities = 45/150 (30%), Positives = 87/150 (58%)
 Frame = +3

Query: 18  LSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRMKE 197
           L  + A++  Y+K G + NAR VF+ M +K+++ WS+MI+ +A       A++LF  M+ 
Sbjct: 289 LVASTAMVTGYSKLGQIENARSVFNQMVKKDLVCWSAMISGYAESDSPQEALNLFNEMQS 348

Query: 198 ENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANLLR 377
             I+P+ VT + V+ AC+H G +++  K+    ++++G   +      ++++Y +   L 
Sbjct: 349 LGIKPDQVTMLSVITACAHLGALDQA-KWIHLFVDKNGFGGALPINNALIEMYAKCGSLE 407

Query: 378 KAIEVIETMPFAPNVIIWGSLMSACQVHGE 467
           +A  + + MP   NVI W  ++SA  +HG+
Sbjct: 408 RARRIFDKMP-RKNVISWTCMISAFAMHGD 436



 Score = 79.0 bits (193), Expect = 4e-13
 Identities = 47/157 (29%), Positives = 77/157 (49%)
 Frame = +3

Query: 3   GFGRVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLF 182
           GF     V   L+ MYA CG +  AR +FD M  ++V++WS MI+ +   G  + A+ LF
Sbjct: 152 GFDSDPFVQTGLVRMYAACGRIAEARLMFDKMFHRDVVTWSIMIDGYCQSGLFNDALLLF 211

Query: 183 QRMKEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCR 362
           + MK  N+EP+ +    VL AC  AG +  G K     I E+ I         +V +Y  
Sbjct: 212 EEMKNYNVEPDEMMLSTVLSACGRAGNLSYG-KMIHDFIMENNIVVDPHLQSALVTMYAS 270

Query: 363 ANLLRKAIEVIETMPFAPNVIIWGSLMSACQVHGEVE 473
              +  A+ + E M    N++   ++++     G++E
Sbjct: 271 CGSMDLALNLFEKMT-PKNLVASTAMVTGYSKLGQIE 306


>ref|XP_002315764.1| predicted protein [Populus trichocarpa] gi|222864804|gb|EEF01935.1|
           predicted protein [Populus trichocarpa]
          Length = 452

 Score =  273 bits (699), Expect = 8e-72
 Identities = 127/173 (73%), Positives = 150/173 (86%)
 Frame = +3

Query: 3   GFGRVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLF 182
           G G  L VNNALIDMYAKCGNL  AR VF+ M  +NVISW+SMINAFA+HGDA +A+  F
Sbjct: 110 GLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFAIHGDASNALKFF 169

Query: 183 QRMKEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCR 362
            +MK+ENI+PNGVTF+GVLYACSHAGLVEEG++ F+SM NEH I+P  EHYGCMVDL+ R
Sbjct: 170 YQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRTFASMTNEHNITPKHEHYGCMVDLFGR 229

Query: 363 ANLLRKAIEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLELEPDH 521
           ANLLR A+E++ETMP APNV+IWGSLM+ACQ+HGE ELGEFAAK++LELEPDH
Sbjct: 230 ANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLELEPDH 282



 Score = 88.6 bits (218), Expect = 5e-16
 Identities = 47/152 (30%), Positives = 87/152 (57%)
 Frame = +3

Query: 12  RVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRM 191
           R L V  A+I  Y++ G + +AR +FD M  K+++ WS+MI+ +A       A++LF  M
Sbjct: 12  RNLVVLTAMISGYSRVGRVEDARLIFDQMEEKDLVCWSAMISGYAESDKPQEALNLFSEM 71

Query: 192 KEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANL 371
           +   I+P+ VT + V+ AC+  G+++   K+    ++++G+  +      ++D+Y +   
Sbjct: 72  QVFGIKPDQVTILSVISACARLGVLDRA-KWIHMYVDKNGLGGALPVNNALIDMYAKCGN 130

Query: 372 LRKAIEVIETMPFAPNVIIWGSLMSACQVHGE 467
           L  A  V E M  + NVI W S+++A  +HG+
Sbjct: 131 LGAARGVFEKMQ-SRNVISWTSMINAFAIHGD 161


>ref|NP_193218.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75274931|sp|O23337.1|PP311_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g14820 gi|2244839|emb|CAB10261.1| hypothetical
           protein [Arabidopsis thaliana]
           gi|7268228|emb|CAB78524.1| hypothetical protein
           [Arabidopsis thaliana] gi|332658106|gb|AEE83506.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 722

 Score =  265 bits (678), Expect = 2e-69
 Identities = 122/173 (70%), Positives = 150/173 (86%)
 Frame = +3

Query: 3   GFGRVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLF 182
           G    LS+NNALI+MYAKCG L   R+VF+ MPR+NV+SWSSMINA +MHG+A  A+SLF
Sbjct: 374 GLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLF 433

Query: 183 QRMKEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCR 362
            RMK+EN+EPN VTF+GVLY CSH+GLVEEG+K F+SM +E+ I+P  EHYGCMVDL+ R
Sbjct: 434 ARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGR 493

Query: 363 ANLLRKAIEVIETMPFAPNVIIWGSLMSACQVHGEVELGEFAAKRLLELEPDH 521
           ANLLR+A+EVIE+MP A NV+IWGSLMSAC++HGE+ELG+FAAKR+LELEPDH
Sbjct: 494 ANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDH 546



 Score = 92.4 bits (228), Expect = 3e-17
 Identities = 50/171 (29%), Positives = 96/171 (56%), Gaps = 2/171 (1%)
 Frame = +3

Query: 12  RVLSVNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRM 191
           R L V+ A++  Y+KCG L +A+ +FD   +K+++ W++MI+A+        A+ +F+ M
Sbjct: 276 RNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVCWTTMISAYVESDYPQEALRVFEEM 335

Query: 192 KEENIEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANL 371
               I+P+ V+   V+ AC++ G++++  K+  S I+ +G+         ++++Y +   
Sbjct: 336 CCSGIKPDVVSMFSVISACANLGILDKA-KWVHSCIHVNGLESELSINNALINMYAKCGG 394

Query: 372 LRKAIEVIETMPFAPNVIIWGSLMSACQVHGEVE--LGEFAAKRLLELEPD 518
           L    +V E MP   NV+ W S+++A  +HGE    L  FA  +   +EP+
Sbjct: 395 LDATRDVFEKMP-RRNVVSWSSMINALSMHGEASDALSLFARMKQENVEPN 444



 Score = 74.7 bits (182), Expect = 7e-12
 Identities = 37/127 (29%), Positives = 62/127 (48%)
 Frame = +3

Query: 24  VNNALIDMYAKCGNLVNAREVFDNMPRKNVISWSSMINAFAMHGDADSAISLFQRMKEEN 203
           V    +DMYA CG +  AR VFD M  ++V++W++MI  +   G  D A  LF+ MK+ N
Sbjct: 148 VETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFGLVDEAFKLFEEMKDSN 207

Query: 204 IEPNGVTFIGVLYACSHAGLVEEGQKFFSSMINEHGISPSREHYGCMVDLYCRANLLRKA 383
           + P+ +    ++ AC   G +   +  +  +I E+ +         +V +Y  A  +  A
Sbjct: 208 VMPDEMILCNIVSACGRTGNMRYNRAIYEFLI-ENDVRMDTHLLTALVTMYAGAGCMDMA 266

Query: 384 IEVIETM 404
            E    M
Sbjct: 267 REFFRKM 273


Top