BLASTX nr result

ID: Glycyrrhiza23_contig00023590

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00023590
         (1436 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003626687.1| Pentatricopeptide repeat-containing protein ...   516   e-144
ref|XP_003530173.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   511   e-142
ref|XP_002521681.1| pentatricopeptide repeat-containing protein,...   484   e-134
ref|XP_003595941.1| Pentatricopeptide repeat-containing protein ...   483   e-134
emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]   478   e-132

>ref|XP_003626687.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355520709|gb|AET01163.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 501

 Score =  516 bits (1330), Expect = e-144
 Identities = 253/314 (80%), Positives = 280/314 (89%)
 Frame = -1

Query: 1436 LNLMVSDGISPDKVTVDLAVRALCLAGRLDHAVELIRELSSKHCSPDAYTFNFLVKHLCK 1257
            LNLMVSDGISPDK TVDLAVR+LC A R+D AVELI+ELSSKHCSPD Y++NFLVK+LCK
Sbjct: 151  LNLMVSDGISPDKGTVDLAVRSLCTADRVDDAVELIKELSSKHCSPDIYSYNFLVKNLCK 210

Query: 1256 CRALSSVYGFIDEMRSNFGLKPDLVTYTILIDNVCNTKNLREALRLVSVLHEEGFKPDCF 1077
             R LS VY FIDEMR+ F +KP+LVTYTILIDNVCNTKNLREA RLV +L EEGFKPDCF
Sbjct: 211  SRTLSLVYAFIDEMRTKFDVKPNLVTYTILIDNVCNTKNLREATRLVDILEEEGFKPDCF 270

Query: 1076 VYNTIMKGYCMLSRGSEAIEVYNKMKEEGVEPDLVTYNTLIYGLSKSGRVAEAKKLLRVM 897
            +YNTIMKGYCMLSRGSEAIEVYN+MKE+GVEPDL+TYNTLI+GLSKSGRV+EAKKLLRVM
Sbjct: 271  LYNTIMKGYCMLSRGSEAIEVYNRMKEKGVEPDLITYNTLIFGLSKSGRVSEAKKLLRVM 330

Query: 896  AEKGHFPDEVTYTTLMNGLCRKGXXXXXXXXXXXXEVKGCSPNSCTYNTLLHGLCKSRML 717
            AEKGHFPDEVTYT+LMNG+CRKG            E+KGCSPN+CTYNTLLHGLCKSRM 
Sbjct: 331  AEKGHFPDEVTYTSLMNGMCRKGETLAALALLEEMEMKGCSPNTCTYNTLLHGLCKSRMF 390

Query: 716  DKAVELYGVMKSGGLKLETASYATFVRALCRDGKIAEAYEVFDYAVESKSLTDVAAYSAL 537
            DKA+ELYG MKS GLKL+ ASYATFVRALC  G++A+AYEVFDYAVESKSL+DVAAYS L
Sbjct: 391  DKAMELYGAMKSDGLKLDMASYATFVRALCSVGRVADAYEVFDYAVESKSLSDVAAYSTL 450

Query: 536  ESTLKWLKKAKEQG 495
            ESTLKW KKAKE+G
Sbjct: 451  ESTLKWFKKAKEEG 464


>ref|XP_003530173.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g17670-like [Glycine max]
          Length = 456

 Score =  511 bits (1317), Expect = e-142
 Identities = 252/319 (78%), Positives = 281/319 (88%), Gaps = 2/319 (0%)
 Frame = -1

Query: 1436 LNLMVSDGISPDKVTVDLAVRALCLAGRLDHAVELIRELSSKHCSPDAYTFNFLVKHLCK 1257
            LNLM++ GI+PD  T D+AVR+LC AGRLDHAVELI+E +SKHC PD YTFNFLVKHLCK
Sbjct: 138  LNLMLAAGITPDTATADVAVRSLCSAGRLDHAVELIKEFASKHCPPDTYTFNFLVKHLCK 197

Query: 1256 CRALSSVYGFIDEMRSNFGLKPDLVTYTILIDNVCNTKNL--REALRLVSVLHEEGFKPD 1083
               +++VY FIDEMR  F +KPDLVTYTILIDNVCN KNL  REA+RLVSVLHEEGFK D
Sbjct: 198  SSTITTVYAFIDEMREKFDVKPDLVTYTILIDNVCNGKNLNLREAMRLVSVLHEEGFKLD 257

Query: 1082 CFVYNTIMKGYCMLSRGSEAIEVYNKMKEEGVEPDLVTYNTLIYGLSKSGRVAEAKKLLR 903
            CFVYNTIMKGYC+LSRGSEAIEVYNKMKEEGVEPDLVTYNTLI+GLSKSGRV EA+KLLR
Sbjct: 258  CFVYNTIMKGYCVLSRGSEAIEVYNKMKEEGVEPDLVTYNTLIFGLSKSGRVTEARKLLR 317

Query: 902  VMAEKGHFPDEVTYTTLMNGLCRKGXXXXXXXXXXXXEVKGCSPNSCTYNTLLHGLCKSR 723
            VMAEKG+FPDEVTYT+LMNGLCRKG            E KGCSPN+CTYNTLLHGLCK+R
Sbjct: 318  VMAEKGYFPDEVTYTSLMNGLCRKGDALGALALLGEMEAKGCSPNACTYNTLLHGLCKAR 377

Query: 722  MLDKAVELYGVMKSGGLKLETASYATFVRALCRDGKIAEAYEVFDYAVESKSLTDVAAYS 543
            +++KAV+ Y V+++GGLKL+TASY TFVRALCRDG+IAEAYEVFDYAVESKSLTDVAAYS
Sbjct: 378  LVEKAVKFYQVIRAGGLKLDTASYGTFVRALCRDGRIAEAYEVFDYAVESKSLTDVAAYS 437

Query: 542  ALESTLKWLKKAKEQGHAI 486
             LESTLKWL+KAKEQG AI
Sbjct: 438  TLESTLKWLRKAKEQGLAI 456



 Score = 87.8 bits (216), Expect = 6e-15
 Identities = 59/228 (25%), Positives = 102/228 (44%), Gaps = 5/228 (2%)
 Frame = -1

Query: 1193 PDLVTYTILIDNVCNTKNLREAL-RLVSVLHEEGFKPDCFVYNTIMKGYCMLSRGSEAIE 1017
            PD  T+ IL+ +  N  NL   + + ++++   G  PD    +  ++  C   R   A+E
Sbjct: 112  PDRSTFHILLSHASNNSNLLSPIHQXLNLMLAAGITPDTATADVAVRSLCSAGRLDHAVE 171

Query: 1016 VYNKMKEEGVEPDLVTYNTLIYGLSKSGRVAEAKKLLRVMAEKGHF-PDEVTYTTLMNGL 840
            +  +   +   PD  T+N L+  L KS  +      +  M EK    PD VTYT L++ +
Sbjct: 172  LIKEFASKHCPPDTYTFNFLVKHLCKSSTITTVYAFIDEMREKFDVKPDLVTYTILIDNV 231

Query: 839  CRKGXXXXXXXXXXXXEV---KGCSPNSCTYNTLLHGLCKSRMLDKAVELYGVMKSGGLK 669
            C  G             V   +G   +   YNT++ G C      +A+E+Y  MK  G++
Sbjct: 232  C-NGKNLNLREAMRLVSVLHEEGFKLDCFVYNTIMKGYCVLSRGSEAIEVYNKMKEEGVE 290

Query: 668  LETASYATFVRALCRDGKIAEAYEVFDYAVESKSLTDVAAYSALESTL 525
             +  +Y T +  L + G++ EA ++     E     D   Y++L + L
Sbjct: 291  PDLVTYNTLIFGLSKSGRVTEARKLLRVMAEKGYFPDEVTYTSLMNGL 338


>ref|XP_002521681.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539072|gb|EEF40668.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 458

 Score =  484 bits (1246), Expect = e-134
 Identities = 230/317 (72%), Positives = 276/317 (87%)
 Frame = -1

Query: 1436 LNLMVSDGISPDKVTVDLAVRALCLAGRLDHAVELIRELSSKHCSPDAYTFNFLVKHLCK 1257
            LNLMV++G  P +VTVD+AVRALC AG+ D AV+L++ELS KH  PD++T+NFLVK LCK
Sbjct: 142  LNLMVNNGFMPTQVTVDIAVRALCSAGKEDDAVKLVKELSLKHSKPDSFTYNFLVKCLCK 201

Query: 1256 CRALSSVYGFIDEMRSNFGLKPDLVTYTILIDNVCNTKNLREALRLVSVLHEEGFKPDCF 1077
            CRALS+VY FIDEMRS+F L+P+LVTYTILIDNVCN+KNLREA+RL+ +L E GFKPDCF
Sbjct: 202  CRALSNVYSFIDEMRSSFDLEPNLVTYTILIDNVCNSKNLREAMRLLGILRECGFKPDCF 261

Query: 1076 VYNTIMKGYCMLSRGSEAIEVYNKMKEEGVEPDLVTYNTLIYGLSKSGRVAEAKKLLRVM 897
            VYNTIMKGYCMLS+GS+AI+V+ KMKEEG+EPDL+TYNTLI+GLSK GRV+EAK+ L++M
Sbjct: 262  VYNTIMKGYCMLSKGSDAIQVFKKMKEEGIEPDLITYNTLIFGLSKGGRVSEAKRYLKIM 321

Query: 896  AEKGHFPDEVTYTTLMNGLCRKGXXXXXXXXXXXXEVKGCSPNSCTYNTLLHGLCKSRML 717
             E GHFPD VTYT+LMNGLCRKG            E+KGCSPNSCTYNTLL+GLCK R+L
Sbjct: 322  VESGHFPDAVTYTSLMNGLCRKGDALGALALLEDMEMKGCSPNSCTYNTLLYGLCKERLL 381

Query: 716  DKAVELYGVMKSGGLKLETASYATFVRALCRDGKIAEAYEVFDYAVESKSLTDVAAYSAL 537
            +K +ELY V+K GG+ L+TASYATFVRALCR+GK+AEAYEVFDYAVESKSLT+ AAY+ L
Sbjct: 382  EKGIELYNVIKEGGMLLDTASYATFVRALCREGKVAEAYEVFDYAVESKSLTNAAAYTTL 441

Query: 536  ESTLKWLKKAKEQGHAI 486
            ESTLKWLKKA+EQG ++
Sbjct: 442  ESTLKWLKKAREQGLSV 458



 Score = 84.3 bits (207), Expect = 7e-14
 Identities = 52/226 (23%), Positives = 100/226 (44%), Gaps = 3/226 (1%)
 Frame = -1

Query: 1193 PDLVTYTILIDNVCNTKN--LREALRLVSVLHEEGFKPDCFVYNTIMKGYCMLSRGSEAI 1020
            P + TY IL+   C   +  L    ++++++   GF P     +  ++  C   +  +A+
Sbjct: 115  PTISTYHILLSQSCKAPDPTLSPVHQILNLMVNNGFMPTQVTVDIAVRALCSAGKEDDAV 174

Query: 1019 EVYNKMKEEGVEPDLVTYNTLIYGLSKSGRVAEAKKLLRVMAEKGHF-PDEVTYTTLMNG 843
            ++  ++  +  +PD  TYN L+  L K   ++     +  M       P+ VTYT L++ 
Sbjct: 175  KLVKELSLKHSKPDSFTYNFLVKCLCKCRALSNVYSFIDEMRSSFDLEPNLVTYTILIDN 234

Query: 842  LCRKGXXXXXXXXXXXXEVKGCSPNSCTYNTLLHGLCKSRMLDKAVELYGVMKSGGLKLE 663
            +C                  G  P+   YNT++ G C       A++++  MK  G++ +
Sbjct: 235  VCNSKNLREAMRLLGILRECGFKPDCFVYNTIMKGYCMLSKGSDAIQVFKKMKEEGIEPD 294

Query: 662  TASYATFVRALCRDGKIAEAYEVFDYAVESKSLTDVAAYSALESTL 525
              +Y T +  L + G+++EA       VES    D   Y++L + L
Sbjct: 295  LITYNTLIFGLSKGGRVSEAKRYLKIMVESGHFPDAVTYTSLMNGL 340


>ref|XP_003595941.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355484989|gb|AES66192.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 472

 Score =  483 bits (1242), Expect = e-134
 Identities = 241/313 (76%), Positives = 268/313 (85%)
 Frame = -1

Query: 1436 LNLMVSDGISPDKVTVDLAVRALCLAGRLDHAVELIRELSSKHCSPDAYTFNFLVKHLCK 1257
            LNLMVSDGISPDK TVDLAVR+LC A R+D AVELI+ELSSKHCSPD Y++NFLVK+LCK
Sbjct: 167  LNLMVSDGISPDKGTVDLAVRSLCTADRVDDAVELIKELSSKHCSPDIYSYNFLVKNLCK 226

Query: 1256 CRALSSVYGFIDEMRSNFGLKPDLVTYTILIDNVCNTKNLREALRLVSVLHEEGFKPDCF 1077
             R LS VY             P+LVTYTILIDNVCNTKNLREA RLV +L EEGFKPDCF
Sbjct: 227  SRTLSLVY-------------PNLVTYTILIDNVCNTKNLREATRLVDILEEEGFKPDCF 273

Query: 1076 VYNTIMKGYCMLSRGSEAIEVYNKMKEEGVEPDLVTYNTLIYGLSKSGRVAEAKKLLRVM 897
            +YNTIMKGYCMLSRGSEAIEVYN+MKE+GVEPDL+TYNTLI+GLSKSGRV+EAKKLLRVM
Sbjct: 274  LYNTIMKGYCMLSRGSEAIEVYNRMKEKGVEPDLITYNTLIFGLSKSGRVSEAKKLLRVM 333

Query: 896  AEKGHFPDEVTYTTLMNGLCRKGXXXXXXXXXXXXEVKGCSPNSCTYNTLLHGLCKSRML 717
            AEKGHFPDEVTYT+LMNG+CRKG            E+KGCSPN+CTYNTLLHGLCKSRM 
Sbjct: 334  AEKGHFPDEVTYTSLMNGMCRKGETLAALALLEEMEMKGCSPNTCTYNTLLHGLCKSRMF 393

Query: 716  DKAVELYGVMKSGGLKLETASYATFVRALCRDGKIAEAYEVFDYAVESKSLTDVAAYSAL 537
            DKA+ELYG MKS GLKL+ ASYATFVRALC  G++A+AYEVFDYAVESKSL+DVAAYS L
Sbjct: 394  DKAMELYGAMKSDGLKLDMASYATFVRALCSVGRVADAYEVFDYAVESKSLSDVAAYSTL 453

Query: 536  ESTLKWLKKAKEQ 498
            ESTLKW +K K++
Sbjct: 454  ESTLKWSRKQKKK 466


>emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]
          Length = 472

 Score =  478 bits (1231), Expect = e-132
 Identities = 228/316 (72%), Positives = 268/316 (84%)
 Frame = -1

Query: 1436 LNLMVSDGISPDKVTVDLAVRALCLAGRLDHAVELIRELSSKHCSPDAYTFNFLVKHLCK 1257
            LNLMV+ G  PD+VT D+AVR+LC AGR +HA+EL++ELS KH  PD++T+NF+++HLCK
Sbjct: 156  LNLMVTHGFPPDRVTTDIAVRSLCSAGREEHAIELVKELSLKHSPPDSFTYNFIIRHLCK 215

Query: 1256 CRALSSVYGFIDEMRSNFGLKPDLVTYTILIDNVCNTKNLREALRLVSVLHEEGFKPDCF 1077
             RALS+VY FIDE++++F LKPDLVTYTILIDNVCN KNLREA RL+ VL E GFKPDC+
Sbjct: 216  TRALSTVYNFIDELQNSFQLKPDLVTYTILIDNVCNGKNLREATRLLEVLGEAGFKPDCY 275

Query: 1076 VYNTIMKGYCMLSRGSEAIEVYNKMKEEGVEPDLVTYNTLIYGLSKSGRVAEAKKLLRVM 897
            VYNTIMKGYC+L +GSEAI VY KMKEEGVEPDLVTYNTLI+GLSKSGRV EA+K L +M
Sbjct: 276  VYNTIMKGYCILDKGSEAIGVYKKMKEEGVEPDLVTYNTLIFGLSKSGRVKEARKFLDIM 335

Query: 896  AEKGHFPDEVTYTTLMNGLCRKGXXXXXXXXXXXXEVKGCSPNSCTYNTLLHGLCKSRML 717
            AE GHFPD VTYT+LMNGLCR+G            E KGCSPNSCTYNTLLHGLCK RML
Sbjct: 336  AEMGHFPDAVTYTSLMNGLCREGNALGALALLEEMEAKGCSPNSCTYNTLLHGLCKLRML 395

Query: 716  DKAVELYGVMKSGGLKLETASYATFVRALCRDGKIAEAYEVFDYAVESKSLTDVAAYSAL 537
            ++ +ELYGVMKSGG+KLE ASYATFVRALC++G++AEAYE FDY VESKS  DV AYS L
Sbjct: 396  ERGIELYGVMKSGGMKLEKASYATFVRALCKEGRVAEAYEAFDYVVESKSFDDVTAYSTL 455

Query: 536  ESTLKWLKKAKEQGHA 489
            E++LKWL+KA+EQG A
Sbjct: 456  ENSLKWLRKAREQGLA 471



 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 52/225 (23%), Positives = 91/225 (40%), Gaps = 6/225 (2%)
 Frame = -1

Query: 1157 VCNTKNLREALRLVSVLHEEGFKP-DCFVYNTIMKGYCMLSRGSEAIEVYNKM--KEEGV 987
            + N+ NL +A +L + +      P D   +N +++ Y  +S  +++I     M   +   
Sbjct: 68   IFNSPNLLDAKKLFASITTTSTTPLDLRFHNALLQSYSSISTVNDSISFLRHMIKSQPSF 127

Query: 986  EPDLVTYNTLIYGLSKSGR--VAEAKKLLRVMAEKGHFPDEVTYTTLMNGLCRKGXXXXX 813
             P+  TY+ L+    KS    ++   + L +M   G  PD VT    +  LC  G     
Sbjct: 128  SPERSTYHILLSQSCKSPNSDLSAVHQTLNLMVTHGFPPDRVTTDIAVRSLCSAGREEHA 187

Query: 812  XXXXXXXEVKGCSPNSCTYNTLLHGLCKSRMLDKAVELYG-VMKSGGLKLETASYATFVR 636
                    +K   P+S TYN ++  LCK+R L         +  S  LK +  +Y   + 
Sbjct: 188  IELVKELSLKHSPPDSFTYNFIIRHLCKTRALSTVYNFIDELQNSFQLKPDLVTYTILID 247

Query: 635  ALCRDGKIAEAYEVFDYAVESKSLTDVAAYSALESTLKWLKKAKE 501
             +C    + EA  + +   E+    D   Y+ +      L K  E
Sbjct: 248  NVCNGKNLREATRLLEVLGEAGFKPDCYVYNTIMKGYCILDKGSE 292

