BLASTX nr result

ID: Glycyrrhiza36_contig00034152

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00034152
         (353 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004486343.1 PREDICTED: pentatricopeptide repeat-containing pr...    87   1e-17
XP_013462936.1 pentatricopeptide (PPR) repeat protein [Medicago ...    85   6e-17
XP_019440002.1 PREDICTED: pentatricopeptide repeat-containing pr...    68   5e-11
XP_019434918.1 PREDICTED: pentatricopeptide repeat-containing pr...    67   1e-10
KRH11579.1 hypothetical protein GLYMA_15G118300 [Glycine max]          67   1e-10
XP_014623746.1 PREDICTED: pentatricopeptide repeat-containing pr...    67   1e-10
KHN34221.1 Pentatricopeptide repeat-containing protein, chloropl...    67   1e-10
XP_015940563.1 PREDICTED: pentatricopeptide repeat-containing pr...    66   3e-10
XP_016195774.1 PREDICTED: pentatricopeptide repeat-containing pr...    66   3e-10
KRH36593.1 hypothetical protein GLYMA_09G013500 [Glycine max]          65   5e-10
XP_014617315.1 PREDICTED: pentatricopeptide repeat-containing pr...    65   5e-10
XP_006597613.1 PREDICTED: pentatricopeptide repeat-containing pr...    65   9e-10
KYP62645.1 hypothetical protein KK1_017186, partial [Cajanus cajan]    65   9e-10
XP_015958976.1 PREDICTED: pentatricopeptide repeat-containing pr...    64   1e-09
XP_003534744.1 PREDICTED: pentatricopeptide repeat-containing pr...    64   2e-09
KHN40797.1 Pentatricopeptide repeat-containing protein, chloropl...    64   2e-09
XP_016197567.1 PREDICTED: pentatricopeptide repeat-containing pr...    64   2e-09
KHN40799.1 Pentatricopeptide repeat-containing protein, chloropl...    64   2e-09
XP_006607098.1 PREDICTED: pentatricopeptide repeat-containing pr...    64   2e-09
XP_014617314.1 PREDICTED: pentatricopeptide repeat-containing pr...    63   4e-09

>XP_004486343.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Cicer arietinum]
          Length = 692

 Score = 87.4 bits (215), Expect = 1e-17
 Identities = 51/87 (58%), Positives = 57/87 (65%), Gaps = 6/87 (6%)
 Frame = +1

Query: 109 NPNFKFKTFSHPNQASLQVADSHDP------VGKNKIWINPNSPRAKLLRKKLSGTSTNS 270
           N NFKFKTFSH NQ S     SH P      + K KIW+NP +P+ KL R K S  S NS
Sbjct: 28  NRNFKFKTFSHLNQES----HSHHPPTNSSSLSKTKIWVNPYTPKPKLHRNKPSN-SNNS 82

Query: 271 FLVKLAESLDSCDPTQPRVSAILNCLG 351
           FL+KLAESLD CDPTQ +V AILN  G
Sbjct: 83  FLLKLAESLDLCDPTQQQVYAILNGFG 109


>XP_013462936.1 pentatricopeptide (PPR) repeat protein [Medicago truncatula]
           KEH36981.1 pentatricopeptide (PPR) repeat protein
           [Medicago truncatula]
          Length = 687

 Score = 85.1 bits (209), Expect = 6e-17
 Identities = 46/87 (52%), Positives = 58/87 (66%), Gaps = 9/87 (10%)
 Frame = +1

Query: 109 NPNFKFKTFSHPNQASLQVADSHDPV------GKNKIWINPNSPRAKLLRKKLSGTSTNS 270
           NPNFKFKTFSH NQ       SH+PV       K KIW+NPN+P++K L+ K +   +NS
Sbjct: 22  NPNFKFKTFSHVNQQH-----SHNPVTNSSSLSKPKIWVNPNNPKSKPLQNKNNNKPSNS 76

Query: 271 ---FLVKLAESLDSCDPTQPRVSAILN 342
              FL+K  +SLDSCDPT  +V+AILN
Sbjct: 77  RNHFLLKFVQSLDSCDPTHQQVNAILN 103


>XP_019440002.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Lupinus angustifolius] OIW13865.1
           hypothetical protein TanjilG_31754 [Lupinus
           angustifolius]
          Length = 695

 Score = 68.2 bits (165), Expect = 5e-11
 Identities = 51/119 (42%), Positives = 63/119 (52%), Gaps = 8/119 (6%)
 Frame = +1

Query: 19  MASRRCSSSYSLLFSDCQXXXXXXXXXXXRNPNFKFKTFSHPNQASLQVADSHDP----- 183
           MA R  SS  SL F+  +           + PN+   T   P   S+  A+SHDP     
Sbjct: 1   MAYRLFSSHPSLFFNGRKPTSLSSSSRNFKFPNYPSPT---PTSHSITHANSHDPDANSL 57

Query: 184 -VGKNKIWINPNSPRAK-LLRKKLSGTSTNSFLVKLAESLDSCDPTQPRVSAIL-NCLG 351
            + KNKIW+NP SPR K LL+K  S    N+ LVK+ ESLDSC  T+  VS IL N LG
Sbjct: 58  SLSKNKIWVNPKSPRVKQLLKKSSSSRYNNNPLVKVTESLDSC-RTENEVSIILKNALG 115


>XP_019434918.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Lupinus angustifolius] OIV89284.1
           hypothetical protein TanjilG_23744 [Lupinus
           angustifolius]
          Length = 688

 Score = 67.4 bits (163), Expect = 1e-10
 Identities = 45/88 (51%), Positives = 48/88 (54%), Gaps = 14/88 (15%)
 Frame = +1

Query: 127 KTFSHPNQASLQVADSH-----------DPVGK---NKIWINPNSPRAKLLRKKLSGTST 264
           K   H N  SL     H           DP GK   N IW+NPNSPRAK LRKK      
Sbjct: 38  KIILHSNHVSLHEPIPHITNHDKDSNFDDPDGKSSKNYIWVNPNSPRAKQLRKKSYDARY 97

Query: 265 NSFLVKLAESLDSCDPTQPRVSAILNCL 348
           NS L+KLA SLDSC+PTQ  VS ILN L
Sbjct: 98  NS-LLKLAHSLDSCNPTQNDVSEILNGL 124


>KRH11579.1 hypothetical protein GLYMA_15G118300 [Glycine max]
          Length = 1016

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
 Frame = +1

Query: 112 PNFKFKTFSHPNQASLQVADSHDP-VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKLA 288
           P F  +  +    A L   D+  P + KN IW+NP SPRAK L+K  S  + +S L KLA
Sbjct: 39  PRFSLQPVNTLQDAKLDDPDAKSPSLSKNSIWVNPRSPRAKHLQKN-SPHARSSSLTKLA 97

Query: 289 ESLDSCDPTQPRVSAILNCL 348
           +SLDSC+PT+  VS ILN L
Sbjct: 98  KSLDSCNPTEQHVSEILNVL 117


>XP_014623746.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Glycine max]
          Length = 1046

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
 Frame = +1

Query: 112 PNFKFKTFSHPNQASLQVADSHDP-VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKLA 288
           P F  +  +    A L   D+  P + KN IW+NP SPRAK L+K  S  + +S L KLA
Sbjct: 69  PRFSLQPVNTLQDAKLDDPDAKSPSLSKNSIWVNPRSPRAKHLQKN-SPHARSSSLTKLA 127

Query: 289 ESLDSCDPTQPRVSAILNCL 348
           +SLDSC+PT+  VS ILN L
Sbjct: 128 KSLDSCNPTEQHVSEILNVL 147


>KHN34221.1 Pentatricopeptide repeat-containing protein, chloroplastic [Glycine
           soja]
          Length = 1875

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 38/80 (47%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
 Frame = +1

Query: 112 PNFKFKTFSHPNQASLQVADSHDP-VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKLA 288
           P F  +  +    A L   D+  P + KN IW+NP SPRAK L+K  S  + +S L KLA
Sbjct: 39  PRFSLQPVNTLQDAKLDDPDAKSPSLSKNSIWVNPRSPRAKHLQKN-SPHARSSSLTKLA 97

Query: 289 ESLDSCDPTQPRVSAILNCL 348
           +SLDSC+PT+  VS ILN L
Sbjct: 98  KSLDSCNPTEQHVSEILNVL 117


>XP_015940563.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic [Arachis duranensis]
          Length = 705

 Score = 66.2 bits (160), Expect = 3e-10
 Identities = 44/95 (46%), Positives = 53/95 (55%), Gaps = 20/95 (21%)
 Frame = +1

Query: 127 KTFSHPNQASLQVA------------DS--HDPV------GKNKIWINPNSPRAKLLRKK 246
           +TF H N+ SLQ +            DS   DPV       KN +W+NP SPRAK LRKK
Sbjct: 51  RTFLHANRVSLQESVPKLAEDADIEKDSKFEDPVEKSSSSSKNYVWVNPKSPRAKQLRKK 110

Query: 247 LSGTSTNSFLVKLAESLDSCDPTQPRVSAILNCLG 351
              T   + LVK+A SLDSC+PT+  VS IL  LG
Sbjct: 111 SYNTRYTN-LVKIANSLDSCNPTEHDVSEILKSLG 144


>XP_016195774.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic [Arachis ipaensis]
          Length = 705

 Score = 65.9 bits (159), Expect = 3e-10
 Identities = 42/95 (44%), Positives = 51/95 (53%), Gaps = 20/95 (21%)
 Frame = +1

Query: 127 KTFSHPNQASLQ--------------VADSHDPV------GKNKIWINPNSPRAKLLRKK 246
           +TF H N  SLQ              V+   DP+       KN +W+NP SPRAK LRKK
Sbjct: 51  RTFLHANCVSLQESVLKLAEDADIEKVSKFEDPIEKSSSSSKNYVWVNPKSPRAKQLRKK 110

Query: 247 LSGTSTNSFLVKLAESLDSCDPTQPRVSAILNCLG 351
              T   + LVK+A SLDSC+PT+  VS IL  LG
Sbjct: 111 SYNTRYTN-LVKIANSLDSCNPTEHDVSEILKSLG 144


>KRH36593.1 hypothetical protein GLYMA_09G013500 [Glycine max]
          Length = 693

 Score = 65.5 bits (158), Expect = 5e-10
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
 Frame = +1

Query: 112 PNFKFKTFSH-PNQASLQVADSHDP-VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKL 285
           P F F+ F+   + A L   D+  P + K   W+NP SPRAK L++  S  + +S L KL
Sbjct: 39  PRFSFQPFNTLQDAAKLDDPDAKSPSLSKKSFWVNPRSPRAKQLQQN-SPHARSSSLTKL 97

Query: 286 AESLDSCDPTQPRVSAILNCLG 351
           A+SLDSC+PT+  VS ILN LG
Sbjct: 98  AKSLDSCNPTEEHVSEILNVLG 119


>XP_014617315.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Glycine max] KHN40798.1
           Pentatricopeptide repeat-containing protein,
           chloroplastic [Glycine soja]
          Length = 1017

 Score = 65.5 bits (158), Expect = 5e-10
 Identities = 38/82 (46%), Positives = 51/82 (62%), Gaps = 2/82 (2%)
 Frame = +1

Query: 112 PNFKFKTFSH-PNQASLQVADSHDP-VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKL 285
           P F F+ F+   + A L   D+  P + K   W+NP SPRAK L++  S  + +S L KL
Sbjct: 39  PRFSFQPFNTLQDAAKLDDPDAKSPSLSKKSFWVNPRSPRAKQLQQN-SPHARSSSLTKL 97

Query: 286 AESLDSCDPTQPRVSAILNCLG 351
           A+SLDSC+PT+  VS ILN LG
Sbjct: 98  AKSLDSCNPTEEHVSEILNVLG 119


>XP_006597613.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Glycine max] KRH11577.1 hypothetical
           protein GLYMA_15G118100 [Glycine max]
          Length = 691

 Score = 64.7 bits (156), Expect = 9e-10
 Identities = 39/87 (44%), Positives = 49/87 (56%), Gaps = 8/87 (9%)
 Frame = +1

Query: 115 NFKFKTFSHPNQASLQVADSH--------DPVGKNKIWINPNSPRAKLLRKKLSGTSTNS 270
           +FK   FS  +Q    + D+          P+ K  IW+NP SPRAK L K  S  + +S
Sbjct: 43  HFKLPNFSSSHQPKTLLQDAKLDDPNAKSSPLSKTSIWVNPKSPRAKHLWKN-SYHARSS 101

Query: 271 FLVKLAESLDSCDPTQPRVSAILNCLG 351
            L KLA+SLDSC+PTQ  VS IL  LG
Sbjct: 102 SLTKLAKSLDSCNPTQEHVSEILKVLG 128


>KYP62645.1 hypothetical protein KK1_017186, partial [Cajanus cajan]
          Length = 1470

 Score = 64.7 bits (156), Expect = 9e-10
 Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 1/79 (1%)
 Frame = +1

Query: 118 FKFKTFSHPNQASLQVADSHDPVG-KNKIWINPNSPRAKLLRKKLSGTSTNSFLVKLAES 294
           F F+ F+    A L   D++     K  IW+NP SPR K L K +S  + +SFL KL +S
Sbjct: 326 FSFQVFNTIPHAKLDDPDANSSSSTKTSIWVNPRSPRGKRLWK-ISPNARSSFLNKLTKS 384

Query: 295 LDSCDPTQPRVSAILNCLG 351
           LDSC+PT+  VS ILN LG
Sbjct: 385 LDSCNPTEKHVSEILNVLG 403


>XP_015958976.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Arachis duranensis]
          Length = 706

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 45/92 (48%), Positives = 56/92 (60%), Gaps = 10/92 (10%)
 Frame = +1

Query: 106 RNPNFKFKTFSHPNQASLQVADSHDP------VGKNKIWINPNSPRAKLLRKK--LSGTS 261
           +NP    +   HPN  S + A+ HDP      + KN IW+NP S RAK LRKK  LS  S
Sbjct: 55  QNPFTFTQHHEHPN--SPEAANPHDPHAKSQPLSKNHIWVNPRSSRAKHLRKKKSLSSIS 112

Query: 262 TNSF--LVKLAESLDSCDPTQPRVSAILNCLG 351
            +SF  L++LA+SLDSC  +Q  VS ILN LG
Sbjct: 113 ASSFSSLLRLAKSLDSC--SQQDVSKILNSLG 142


>XP_003534744.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Glycine max] KRH36594.1 hypothetical
           protein GLYMA_09G013600 [Glycine max]
          Length = 705

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 48/124 (38%), Positives = 60/124 (48%), Gaps = 14/124 (11%)
 Frame = +1

Query: 19  MASRRCSSSYSLLFSDCQXXXXXXXXXXXRNPNFKFKTFSHPN--------QASLQVADS 174
           MAS  C+S  SL                 RN  F     SH +        QA  Q A S
Sbjct: 1   MASCICASHSSLFHDRHYVSTSISSSFSLRNFKFPNAASSHKSLSLHSKASQALHQDAAS 60

Query: 175 HDP------VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKLAESLDSCDPTQPRVSAI 336
           HDP      + K +IW+NPNSPRAK L+ K S ++  S+L +L ESL+SC P+   VS I
Sbjct: 61  HDPDANSLSLSKTRIWVNPNSPRAKHLQPK-SPSARYSYLARLTESLNSCTPSAQHVSTI 119

Query: 337 LNCL 348
           L  L
Sbjct: 120 LKGL 123


>KHN40797.1 Pentatricopeptide repeat-containing protein, chloroplastic [Glycine
           soja]
          Length = 1127

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 48/124 (38%), Positives = 60/124 (48%), Gaps = 14/124 (11%)
 Frame = +1

Query: 19  MASRRCSSSYSLLFSDCQXXXXXXXXXXXRNPNFKFKTFSHPN--------QASLQVADS 174
           MAS  C+S  SL                 RN  F     SH +        QA  Q A S
Sbjct: 1   MASCICASHSSLFHDRHYVSTSISSSFSLRNFKFPNAASSHKSLSLHSKASQALHQDAAS 60

Query: 175 HDP------VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKLAESLDSCDPTQPRVSAI 336
           HDP      + K +IW+NPNSPRAK L+ K S ++  S+L +L ESL+SC P+   VS I
Sbjct: 61  HDPDANSLSLSKTRIWVNPNSPRAKHLQPK-SPSARYSYLARLTESLNSCTPSAQHVSTI 119

Query: 337 LNCL 348
           L  L
Sbjct: 120 LKGL 123


>XP_016197567.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Arachis ipaensis]
          Length = 744

 Score = 63.5 bits (153), Expect = 2e-09
 Identities = 48/116 (41%), Positives = 58/116 (50%), Gaps = 5/116 (4%)
 Frame = +1

Query: 19  MASRRCSSSYSLLFSDCQXXXXXXXXXXXRNPNFKFKTFSHPNQASLQV-----ADSHDP 183
           MASR C SS S L    Q            +P  K KT       SLQ       ++   
Sbjct: 48  MASRLCCSSPSPLSH--QHRCSQTHSRISPSPTTKPKTSLQLIHVSLQQHQHPEQEAPSS 105

Query: 184 VGKNKIWINPNSPRAKLLRKKLSGTSTNSFLVKLAESLDSCDPTQPRVSAILNCLG 351
             K +IW+NPN+P AK LR+K S +S    LVK A+SLDSCDPT  +VS IL   G
Sbjct: 106 SSKPRIWVNPNNPLAKRLRRK-SNSSRYVSLVKFAQSLDSCDPTSEQVSTILASFG 160


>KHN40799.1 Pentatricopeptide repeat-containing protein, chloroplastic [Glycine
           soja]
          Length = 995

 Score = 63.5 bits (153), Expect = 2e-09
 Identities = 41/90 (45%), Positives = 51/90 (56%), Gaps = 11/90 (12%)
 Frame = +1

Query: 115 NFKFKTF--SHPNQASLQVADSHDP---------VGKNKIWINPNSPRAKLLRKKLSGTS 261
           +FK  TF  SH  +  LQV  ++ P         + K  IW+NP SPR K L K      
Sbjct: 45  HFKLPTFPLSHNPKTLLQVQATYTPQDPDAKSSSLSKTSIWVNPKSPRVKHLWKNPYHAR 104

Query: 262 TNSFLVKLAESLDSCDPTQPRVSAILNCLG 351
           ++S L KLA+SLDSC+PTQ RVS IL  LG
Sbjct: 105 SSS-LTKLAKSLDSCNPTQQRVSQILQVLG 133


>XP_006607098.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Glycine max] KRH36592.1 hypothetical
           protein GLYMA_09G013400 [Glycine max]
          Length = 1024

 Score = 63.5 bits (153), Expect = 2e-09
 Identities = 41/90 (45%), Positives = 51/90 (56%), Gaps = 11/90 (12%)
 Frame = +1

Query: 115 NFKFKTF--SHPNQASLQVADSHDP---------VGKNKIWINPNSPRAKLLRKKLSGTS 261
           +FK  TF  SH  +  LQV  ++ P         + K  IW+NP SPR K L K      
Sbjct: 45  HFKLPTFPLSHNPKTLLQVQATYTPQDPDAKSSSLSKTSIWVNPKSPRVKHLWKNPYHAR 104

Query: 262 TNSFLVKLAESLDSCDPTQPRVSAILNCLG 351
           ++S L KLA+SLDSC+PTQ RVS IL  LG
Sbjct: 105 SSS-LTKLAKSLDSCNPTQQRVSQILQVLG 133


>XP_014617314.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390,
           chloroplastic-like [Glycine max] KRH36591.1 hypothetical
           protein GLYMA_09G013300 [Glycine max]
          Length = 692

 Score = 62.8 bits (151), Expect = 4e-09
 Identities = 41/86 (47%), Positives = 48/86 (55%), Gaps = 6/86 (6%)
 Frame = +1

Query: 112 PNFKFKTFSHPNQASLQVADSHDP------VGKNKIWINPNSPRAKLLRKKLSGTSTNSF 273
           PNF     SH ++  LQ     DP      + K  IW+NP SPRAK L K    T ++S 
Sbjct: 48  PNFPS---SHQSKTLLQDTILDDPDAKSSSLSKTSIWVNPKSPRAKHLWKNSYHTRSSS- 103

Query: 274 LVKLAESLDSCDPTQPRVSAILNCLG 351
           L KLA SLDSC+PTQ  VS IL  LG
Sbjct: 104 LTKLARSLDSCNPTQQHVSEILKVLG 129

