BLASTX nr result

ID: Glycyrrhiza34_contig00023004 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza34_contig00023004
         (320 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KRH40926.1 hypothetical protein GLYMA_09G286100 [Glycine max]         129   5e-33
XP_003556973.2 PREDICTED: pentatricopeptide repeat-containing pr...   129   7e-33
XP_007153011.1 hypothetical protein PHAVU_003G000200g [Phaseolus...   126   5e-32
XP_004498089.1 PREDICTED: pentatricopeptide repeat-containing pr...   121   3e-30
XP_003589826.1 PPR containing plant-like protein [Medicago trunc...   113   3e-27
XP_019420700.1 PREDICTED: pentatricopeptide repeat-containing pr...   100   2e-22
KHN08008.1 Pentatricopeptide repeat-containing protein [Glycine ...    94   4e-20
KYP45778.1 Pentatricopeptide repeat-containing protein At5g56310...    90   6e-19
XP_002303975.2 pentatricopeptide repeat-containing family protei...    90   9e-19
XP_011025095.1 PREDICTED: pentatricopeptide repeat-containing pr...    90   9e-19
EEF38963.1 pentatricopeptide repeat-containing protein, putative...    88   4e-18
XP_015577407.1 PREDICTED: pentatricopeptide repeat-containing pr...    88   4e-18
KDP30643.1 hypothetical protein JCGZ_16208 [Jatropha curcas]           86   2e-17
GAV81085.1 PPR domain-containing protein/PPR_2 domain-containing...    77   3e-14
OIW17364.1 hypothetical protein TanjilG_22476 [Lupinus angustifo...    76   3e-14
CAN72397.1 hypothetical protein VITISV_041201 [Vitis vinifera]         75   1e-13
XP_019075855.1 PREDICTED: pentatricopeptide repeat-containing pr...    75   1e-13
EOY32273.1 Pentatricopeptide repeat (PPR) superfamily protein, p...    75   1e-13
XP_017983451.1 PREDICTED: pentatricopeptide repeat-containing pr...    72   1e-12
XP_018507338.1 PREDICTED: pentatricopeptide repeat-containing pr...    72   2e-12

>KRH40926.1 hypothetical protein GLYMA_09G286100 [Glycine max]
          Length = 549

 Score =  129 bits (324), Expect = 5e-33
 Identities = 71/106 (66%), Positives = 84/106 (79%)
 Frame = -1

Query: 320 SQQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHF 141
           SQ  EQLL HCTNLSHL+QTQ FMLTR LDQ+DILL+RFI+TSA+LG   +AYSVF    
Sbjct: 30  SQHAEQLLCHCTNLSHLQQTQGFMLTRGLDQDDILLARFIYTSASLGLSSYAYSVF---I 86

Query: 140 NNHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           +NHRP  I  +NN I+AL  SSSNP RA+SLFN+IR LG+ PDSY+
Sbjct: 87  SNHRP-SIFFYNNVIWAL--SSSNPTRAISLFNAIRLLGMPPDSYS 129


>XP_003556973.2 PREDICTED: pentatricopeptide repeat-containing protein At5g56310
           [Glycine max]
          Length = 580

 Score =  129 bits (324), Expect = 7e-33
 Identities = 71/106 (66%), Positives = 84/106 (79%)
 Frame = -1

Query: 320 SQQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHF 141
           SQ  EQLL HCTNLSHL+QTQ FMLTR LDQ+DILL+RFI+TSA+LG   +AYSVF    
Sbjct: 61  SQHAEQLLCHCTNLSHLQQTQGFMLTRGLDQDDILLARFIYTSASLGLSSYAYSVF---I 117

Query: 140 NNHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           +NHRP  I  +NN I+AL  SSSNP RA+SLFN+IR LG+ PDSY+
Sbjct: 118 SNHRP-SIFFYNNVIWAL--SSSNPTRAISLFNAIRLLGMPPDSYS 160


>XP_007153011.1 hypothetical protein PHAVU_003G000200g [Phaseolus vulgaris]
           ESW25005.1 hypothetical protein PHAVU_003G000200g
           [Phaseolus vulgaris]
          Length = 555

 Score =  126 bits (317), Expect = 5e-32
 Identities = 67/104 (64%), Positives = 84/104 (80%)
 Frame = -1

Query: 314 QVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNN 135
           + E+LL  C+NLSHL+QTQCFMLTRALD++DILL+RFI+ SA+LGFP  AYS+F    +N
Sbjct: 38  EAEKLLRRCSNLSHLQQTQCFMLTRALDKDDILLARFIYASASLGFPSFAYSIF---IHN 94

Query: 134 HRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           HRP  I ++NN I+AL  SS NP RA+SLFN+IR LGL PDSY+
Sbjct: 95  HRP-SIYLYNNLIWAL--SSHNPTRAISLFNAIRLLGLRPDSYS 135


>XP_004498089.1 PREDICTED: pentatricopeptide repeat-containing protein At5g56310
           [Cicer arietinum]
          Length = 546

 Score =  121 bits (304), Expect = 3e-30
 Identities = 64/105 (60%), Positives = 80/105 (76%)
 Frame = -1

Query: 317 QQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFN 138
           Q ++QLL HCTNL+HL+QT  FML  +L QNDI LSRFIH SA+L FP +++S+FT   N
Sbjct: 26  QILQQLLSHCTNLTHLQQTHSFMLKTSLFQNDIHLSRFIHKSASLNFPNYSFSIFT-SLN 84

Query: 137 NHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           ++RP PI V+NN I+A   SSSNP RAVSLFN +R LGL  DSY+
Sbjct: 85  HNRPFPIFVYNNIIYAF--SSSNPTRAVSLFNVVRRLGLSLDSYS 127


>XP_003589826.1 PPR containing plant-like protein [Medicago truncatula] AES60077.1
           PPR containing plant-like protein [Medicago truncatula]
          Length = 526

 Score =  113 bits (283), Expect = 3e-27
 Identities = 60/106 (56%), Positives = 79/106 (74%)
 Frame = -1

Query: 320 SQQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHF 141
           +QQ++QLL  CT L+HL+QT  F+L  AL QNDI LSRFIH +A+L +P ++YS+FT  F
Sbjct: 13  TQQLQQLLSQCTTLTHLQQTHTFILKHALFQNDINLSRFIHKTASLNYPSYSYSIFT--F 70

Query: 140 NNHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           N++RP PI V+NN I+AL   SSN   AVS+F S+R LGL  DSY+
Sbjct: 71  NHNRPFPIFVYNNIIYAL--YSSNAKLAVSIFRSVRRLGLSFDSYS 114


>XP_019420700.1 PREDICTED: pentatricopeptide repeat-containing protein At5g56310
           [Lupinus angustifolius]
          Length = 572

 Score =  100 bits (248), Expect = 2e-22
 Identities = 55/105 (52%), Positives = 74/105 (70%)
 Frame = -1

Query: 317 QQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFN 138
           +++  LL  CTNL+HL Q Q FMLT ALD+++I +S+ IHTSA+LGF   +YS+FT+   
Sbjct: 49  EKLHNLLNQCTNLNHLHQIQAFMLTTALDKDNIFISQLIHTSASLGFSSFSYSLFTY--- 105

Query: 137 NHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           NHRP  I ++NN I+AL  S  N  RA+ LF  IR LGL PD+Y+
Sbjct: 106 NHRP-NIFLYNNTIWAL--SKFNAPRAILLFKGIRLLGLKPDNYS 147


>KHN08008.1 Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 497

 Score = 93.6 bits (231), Expect = 4e-20
 Identities = 53/83 (63%), Positives = 65/83 (78%)
 Frame = -1

Query: 251 MLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNHRPLPICVHNNAIFALSTSSS 72
           MLTR LDQ+DILL+RFI+TSA+LG   +AYSVF    +NHRP  I  +NN I+AL  SSS
Sbjct: 1   MLTRGLDQDDILLARFIYTSASLGLSSYAYSVF---ISNHRP-SIFFYNNVIWAL--SSS 54

Query: 71  NPARAVSLFNSIRHLGLFPDSYT 3
           NP RA+SLFN+IR LG+ PDSY+
Sbjct: 55  NPTRAISLFNAIRLLGMPPDSYS 77


>KYP45778.1 Pentatricopeptide repeat-containing protein At5g56310 family
           [Cajanus cajan]
          Length = 495

 Score = 90.1 bits (222), Expect = 6e-19
 Identities = 53/81 (65%), Positives = 62/81 (76%)
 Frame = -1

Query: 251 MLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNHRPLPICVHNNAIFALSTSSS 72
           MLTR LDQ+DILL+RFI TSA+LG   +AYSVF +   NHR   I V+NN I+AL  SSS
Sbjct: 1   MLTRGLDQDDILLARFIQTSASLGLSSYAYSVFMY---NHR-ASIFVYNNVIWAL--SSS 54

Query: 71  NPARAVSLFNSIRHLGLFPDS 9
           NP RA+SLFN+IR LGL PDS
Sbjct: 55  NPTRAISLFNAIRLLGLRPDS 75


>XP_002303975.2 pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] EEE78954.2 pentatricopeptide
           repeat-containing family protein [Populus trichocarpa]
          Length = 543

 Score = 89.7 bits (221), Expect = 9e-19
 Identities = 48/100 (48%), Positives = 70/100 (70%)
 Frame = -1

Query: 302 LLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNHRPL 123
           L+ H ++L H+ QT  FML RALD +++LLSRFIH  ++LGF  +AYS+FT     H P 
Sbjct: 19  LIAHSSHLKHIHQTHAFMLLRALDTDNLLLSRFIHACSSLGFYSYAYSLFT--SITHAP- 75

Query: 122 PICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
            I ++NN I ALS+S ++P  ++ L+N+I+  GL PDSY+
Sbjct: 76  DIYLYNNIIKALSSSPTHPKASIFLYNNIQLAGLRPDSYS 115


>XP_011025095.1 PREDICTED: pentatricopeptide repeat-containing protein
           At5g56310-like [Populus euphratica] XP_011025096.1
           PREDICTED: pentatricopeptide repeat-containing protein
           At5g56310-like [Populus euphratica] XP_011025097.1
           PREDICTED: pentatricopeptide repeat-containing protein
           At5g56310-like [Populus euphratica]
          Length = 558

 Score = 89.7 bits (221), Expect = 9e-19
 Identities = 48/100 (48%), Positives = 70/100 (70%)
 Frame = -1

Query: 302 LLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNHRPL 123
           L+ H ++L H+ QT  FML RALD +++LLSRFIH  ++LGF  +AYS+FT     H P 
Sbjct: 34  LIAHSSHLKHIHQTHAFMLLRALDTDNLLLSRFIHACSSLGFYSYAYSLFT--SITHAP- 90

Query: 122 PICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
            I ++NN I ALS+S ++P  ++ L+N+I+  GL PDSY+
Sbjct: 91  DIYLYNNIIKALSSSPTHPKASIFLYNNIQLAGLRPDSYS 130


>EEF38963.1 pentatricopeptide repeat-containing protein, putative [Ricinus
           communis]
          Length = 538

 Score = 87.8 bits (216), Expect = 4e-18
 Identities = 51/105 (48%), Positives = 69/105 (65%)
 Frame = -1

Query: 317 QQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFN 138
           +QV  LL HC+NL HL QT  FML RALD +++LLS FI +S++LGF  +AYS+FT   +
Sbjct: 12  KQVIWLLNHCSNLKHLHQTHAFMLCRALDHDNLLLSLFIQSSSSLGFSLYAYSLFT---S 68

Query: 137 NHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
              P  I ++N  I ALS  S  P+ ++ LFN I+   L PDSY+
Sbjct: 69  LTHPPNIFLYNTIIRALSL-SPQPSLSIFLFNRIQSARLRPDSYS 112


>XP_015577407.1 PREDICTED: pentatricopeptide repeat-containing protein At5g56310
           [Ricinus communis] XP_015577408.1 PREDICTED:
           pentatricopeptide repeat-containing protein At5g56310
           [Ricinus communis] XP_015577409.1 PREDICTED:
           pentatricopeptide repeat-containing protein At5g56310
           [Ricinus communis] XP_002523384.2 PREDICTED:
           pentatricopeptide repeat-containing protein At5g56310
           [Ricinus communis]
          Length = 556

 Score = 87.8 bits (216), Expect = 4e-18
 Identities = 51/105 (48%), Positives = 69/105 (65%)
 Frame = -1

Query: 317 QQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFN 138
           +QV  LL HC+NL HL QT  FML RALD +++LLS FI +S++LGF  +AYS+FT   +
Sbjct: 30  KQVIWLLNHCSNLKHLHQTHAFMLCRALDHDNLLLSLFIQSSSSLGFSLYAYSLFT---S 86

Query: 137 NHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
              P  I ++N  I ALS  S  P+ ++ LFN I+   L PDSY+
Sbjct: 87  LTHPPNIFLYNTIIRALSL-SPQPSLSIFLFNRIQSARLRPDSYS 130


>KDP30643.1 hypothetical protein JCGZ_16208 [Jatropha curcas]
          Length = 564

 Score = 85.9 bits (211), Expect = 2e-17
 Identities = 45/106 (42%), Positives = 70/106 (66%)
 Frame = -1

Query: 320 SQQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHF 141
           ++Q+  LL HC+NL H++QT  FM++  L  +++LLSRFI+  ++LG   ++YSVFT   
Sbjct: 24  AEQLIWLLSHCSNLKHIQQTHGFMVSTGLHHDNLLLSRFINACSSLGLSLYSYSVFT--- 80

Query: 140 NNHRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           +  RP  I ++N  I +LS S +    A+ LFN+I+  GL PDSY+
Sbjct: 81  HKTRPPDIYLYNTMIRSLSFSQTQSQAAIFLFNNIQVAGLRPDSYS 126


>GAV81085.1 PPR domain-containing protein/PPR_2 domain-containing protein
           [Cephalotus follicularis]
          Length = 582

 Score = 77.0 bits (188), Expect = 3e-14
 Identities = 44/103 (42%), Positives = 65/103 (63%)
 Frame = -1

Query: 311 VEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNH 132
           ++QLL  C NL H++QT  +M+ RALDQ+++LL R I   ++LGF  +A  +FT   +N 
Sbjct: 62  LKQLLKQCFNLRHIQQTHGYMVLRALDQHNLLLGRLIDACSSLGFSSYASLLFT---HNK 118

Query: 131 RPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
               I ++N  + ALS     P +A+ L+NSIR  GL PDSY+
Sbjct: 119 SQTDIYLYNATLKALS-----PVKAIILYNSIRLAGLRPDSYS 156


>OIW17364.1 hypothetical protein TanjilG_22476 [Lupinus angustifolius]
          Length = 328

 Score = 76.3 bits (186), Expect = 3e-14
 Identities = 44/83 (53%), Positives = 59/83 (71%)
 Frame = -1

Query: 251 MLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNHRPLPICVHNNAIFALSTSSS 72
           MLT ALD+++I +S+ IHTSA+LGF   +YS+FT+   NHRP  I ++NN I+AL  S  
Sbjct: 1   MLTTALDKDNIFISQLIHTSASLGFSSFSYSLFTY---NHRP-NIFLYNNTIWAL--SKF 54

Query: 71  NPARAVSLFNSIRHLGLFPDSYT 3
           N  RA+ LF  IR LGL PD+Y+
Sbjct: 55  NAPRAILLFKGIRLLGLKPDNYS 77


>CAN72397.1 hypothetical protein VITISV_041201 [Vitis vinifera]
          Length = 576

 Score = 75.5 bits (184), Expect = 1e-13
 Identities = 46/107 (42%), Positives = 65/107 (60%), Gaps = 1/107 (0%)
 Frame = -1

Query: 320 SQQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHF 141
           S  +  LL  C+NL HL QT CFML+R LDQ++ILLSRFI   ++LGF  +++S+FT   
Sbjct: 41  SYHLLSLLKQCSNLKHLHQTHCFMLSRGLDQDNILLSRFIEACSSLGFSHYSHSIFT--- 97

Query: 140 NNHRPLP-ICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
             H+  P I ++N  I ALS +      A+ L+N I    L  D+Y+
Sbjct: 98  --HKTRPDIYLYNTIIKALS-NPELATEAILLYNRILASDLRFDTYS 141


>XP_019075855.1 PREDICTED: pentatricopeptide repeat-containing protein At5g56310
           [Vitis vinifera]
          Length = 576

 Score = 75.5 bits (184), Expect = 1e-13
 Identities = 46/107 (42%), Positives = 65/107 (60%), Gaps = 1/107 (0%)
 Frame = -1

Query: 320 SQQVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHF 141
           S  +  LL  C+NL HL QT CFML+R LDQ++ILLSRFI   ++LGF  +++S+FT   
Sbjct: 41  SYHLLSLLKQCSNLKHLHQTHCFMLSRGLDQDNILLSRFIEACSSLGFSHYSHSIFT--- 97

Query: 140 NNHRPLP-ICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
             H+  P I ++N  I ALS +      A+ L+N I    L  D+Y+
Sbjct: 98  --HKTRPDIYLYNTIIKALS-NPELATEAILLYNRILASDLRFDTYS 141


>EOY32273.1 Pentatricopeptide repeat (PPR) superfamily protein, putative
           isoform 1 [Theobroma cacao] EOY32274.1 Pentatricopeptide
           repeat (PPR) superfamily protein, putative isoform 1
           [Theobroma cacao]
          Length = 540

 Score = 75.1 bits (183), Expect = 1e-13
 Identities = 45/100 (45%), Positives = 60/100 (60%)
 Frame = -1

Query: 302 LLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNHRPL 123
           LL  C+NL H+EQ   FM+  ALD ++ILLS+FI   ++LGFP +AYSVF F+       
Sbjct: 9   LLKRCSNLKHVEQAHGFMVRTALDHDEILLSQFIEACSSLGFPGYAYSVFAFNLQPR--- 65

Query: 122 PICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
            I V N  I AL T S +   A+ ++ SI    L PDSY+
Sbjct: 66  -IYVFNTMIKAL-TLSHSAFEALHVYKSIARAKLRPDSYS 103


>XP_017983451.1 PREDICTED: pentatricopeptide repeat-containing protein At5g56310
           [Theobroma cacao] XP_007014654.2 PREDICTED:
           pentatricopeptide repeat-containing protein At5g56310
           [Theobroma cacao]
          Length = 540

 Score = 72.4 bits (176), Expect = 1e-12
 Identities = 46/100 (46%), Positives = 61/100 (61%)
 Frame = -1

Query: 302 LLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNNHRPL 123
           LL  C+NL H+EQ   FM+  ALD ++ILLS+FI   ++LGF  +AYSVF F   N +P 
Sbjct: 9   LLKRCSNLKHVEQAHGFMVRTALDHDEILLSQFIEACSSLGFSGYAYSVFAF---NSQP- 64

Query: 122 PICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
            I V N  I AL T S +   A+ ++ SI    L PDSY+
Sbjct: 65  RIYVFNTMIKAL-TLSHSAFEALHVYKSIARAKLRPDSYS 103


>XP_018507338.1 PREDICTED: pentatricopeptide repeat-containing protein At5g56310
           [Pyrus x bretschneideri]
          Length = 539

 Score = 72.0 bits (175), Expect = 2e-12
 Identities = 44/104 (42%), Positives = 65/104 (62%)
 Frame = -1

Query: 314 QVEQLLYHCTNLSHLEQTQCFMLTRALDQNDILLSRFIHTSATLGFPFHAYSVFTFHFNN 135
           +V  LL  C+   H++Q   FM+ R+L  ++++L+RFI T ++LG   +A SVF      
Sbjct: 11  RVSWLLRQCSTPKHIQQAHGFMVPRSLVHDNLILARFIDTCSSLGLSDYASSVFAHCPAT 70

Query: 134 HRPLPICVHNNAIFALSTSSSNPARAVSLFNSIRHLGLFPDSYT 3
           H+P  I ++N  I A + S S P+RAVSLFNSI+   L PDSY+
Sbjct: 71  HQP-TIYLYNTMIKAHALSPS-PSRAVSLFNSIQLAALRPDSYS 112


Top