BLASTX nr result

ID: Glycyrrhiza36_contig00023547

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00023547
         (406 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_012573667.1 PREDICTED: pentatricopeptide repeat-containing pr...   225   5e-68
KYP42158.1 Pentatricopeptide repeat-containing protein At3g12770...   202   4e-60
XP_007155935.1 hypothetical protein PHAVU_003G244800g [Phaseolus...   196   5e-57
KHN26464.1 Pentatricopeptide repeat-containing protein [Glycine ...   190   6e-56
XP_003551036.1 PREDICTED: pentatricopeptide repeat-containing pr...   190   8e-55
XP_014506479.1 PREDICTED: pentatricopeptide repeat-containing pr...   189   2e-54
XP_017410302.1 PREDICTED: putative pentatricopeptide repeat-cont...   186   2e-53
XP_016206998.1 PREDICTED: pentatricopeptide repeat-containing pr...   181   2e-51
XP_009366958.1 PREDICTED: pentatricopeptide repeat-containing pr...   181   3e-51
XP_015954901.1 PREDICTED: pentatricopeptide repeat-containing pr...   181   3e-51
XP_002277337.2 PREDICTED: pentatricopeptide repeat-containing pr...   180   4e-51
KRH04632.1 hypothetical protein GLYMA_17G175800 [Glycine max]         176   6e-51
GAV80726.1 PPR domain-containing protein/PPR_1 domain-containing...   179   7e-51
ONH96536.1 hypothetical protein PRUPE_7G135300 [Prunus persica]       177   5e-50
XP_008344308.1 PREDICTED: pentatricopeptide repeat-containing pr...   177   8e-50
XP_008340659.1 PREDICTED: pentatricopeptide repeat-containing pr...   179   8e-50
XP_007047218.1 PREDICTED: pentatricopeptide repeat-containing pr...   176   1e-49
XP_015890127.1 PREDICTED: pentatricopeptide repeat-containing pr...   173   2e-48
KDP31910.1 hypothetical protein JCGZ_12371 [Jatropha curcas]          162   4e-46
XP_010263222.1 PREDICTED: pentatricopeptide repeat-containing pr...   167   5e-46

>XP_012573667.1 PREDICTED: pentatricopeptide repeat-containing protein
           At3g16610-like [Cicer arietinum]
          Length = 653

 Score =  225 bits (574), Expect = 5e-68
 Identities = 109/134 (81%), Positives = 117/134 (87%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALLTLYARC R  D E VFR MD SDVVTWNAMILG+ID GLG LA ECFREMQ G
Sbjct: 311 SAGAALLTLYARCNRLHDAEKVFRVMDDSDVVTWNAMILGYIDTGLGRLAFECFREMQ-G 369

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RGV ID TTISTILPVCDLRCGKQIHAYV+KS+FDC V VYNAL+HMYSICGCI+YA S+
Sbjct: 370 RGVRIDQTTISTILPVCDLRCGKQIHAYVRKSNFDCAVGVYNALIHMYSICGCISYAYSL 429

Query: 363 FSTMAKKDLVTWNT 404
           FSTM KKDL++WNT
Sbjct: 430 FSTMVKKDLISWNT 443



 Score = 55.5 bits (132), Expect = 3e-06
 Identities = 39/119 (32%), Positives = 59/119 (49%), Gaps = 7/119 (5%)
 Frame = +3

Query: 66  VFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNIDHTTISTILPVC---- 233
           VF  +   +V++W  +I G+  +G   +ALE FR+M     +  D   +S IL  C    
Sbjct: 227 VFEQIKDPNVISWTILISGYSGVGKHVVALEIFRDMVNVGMIIPDVDALSGILVSCKFLG 286

Query: 234 DLRCGKQIHAYVKKSDFDCMV---PVYNALVHMYSICGCIAYACSVFSTMAKKDLVTWN 401
           +L  G++IH Y  K+ F   V       AL+ +Y+ C  +  A  VF  M   D+VTWN
Sbjct: 287 NLTSGREIHGYGLKNGFRNDVFYKSAGAALLTLYARCNRLHDAEKVFRVMDDSDVVTWN 345


>KYP42158.1 Pentatricopeptide repeat-containing protein At3g12770 family
           [Cajanus cajan]
          Length = 549

 Score =  202 bits (515), Expect = 4e-60
 Identities = 102/134 (76%), Positives = 109/134 (81%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALLTLYA C R     +VFRGMDK DVVTWNAMI G +D GLGHLALECFREMQ G
Sbjct: 241 SAGAALLTLYAGCGRVDRAYSVFRGMDKGDVVTWNAMIFGLVDAGLGHLALECFREMQ-G 299

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RGV ID TT+STILPVCDLR GKQ+HAYV K +F C VPV+NALVHMYSI GCI YA SV
Sbjct: 300 RGVEIDGTTVSTILPVCDLRRGKQLHAYVSKWNFSCGVPVFNALVHMYSIRGCIVYAHSV 359

Query: 363 FSTMAKKDLVTWNT 404
           FS M  KDLV+WNT
Sbjct: 360 FSMMENKDLVSWNT 373



 Score = 67.4 bits (163), Expect = 2e-10
 Identities = 48/139 (34%), Positives = 72/139 (51%), Gaps = 10/139 (7%)
 Frame = +3

Query: 15  ALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVN 194
           +LL +YA+C      E VF  M + DV +WN+++ G++  GL   A+E F  M++     
Sbjct: 141 SLLGMYAKCGDMASAERVFGEMPQRDVFSWNSVMSGYVCNGLPDRAVEVFGVMKE--ECQ 198

Query: 195 IDHTTISTILP------VCDLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCI 344
            D  T +T++       + DL  G++IH Y  K    C    Y     AL+ +Y+ CG +
Sbjct: 199 PDVVTWNTLMDAYWKWFLGDLASGREIHGYGLK--IMCGDVFYRSAGAALLTLYAGCGRV 256

Query: 345 AYACSVFSTMAKKDLVTWN 401
             A SVF  M K D+VTWN
Sbjct: 257 DRAYSVFRGMDKGDVVTWN 275


>XP_007155935.1 hypothetical protein PHAVU_003G244800g [Phaseolus vulgaris]
           ESW27929.1 hypothetical protein PHAVU_003G244800g
           [Phaseolus vulgaris]
          Length = 619

 Score =  196 bits (497), Expect = 5e-57
 Identities = 96/134 (71%), Positives = 110/134 (82%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALL LYA C R    + VFR MDKSDVVTWNAMI G +D+GLG LALECFREMQ+ 
Sbjct: 311 SAGAALLALYAGCGRLDRADVVFRRMDKSDVVTWNAMIFGLVDVGLGDLALECFREMQE- 369

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RG+ ID TT++TILPVCDLRCGK++HAYV+K     ++PV NALVHMYSI GCIAYAC+V
Sbjct: 370 RGLRIDGTTVATILPVCDLRCGKEMHAYVRKCCLSSVIPVNNALVHMYSIRGCIAYACAV 429

Query: 363 FSTMAKKDLVTWNT 404
           FSTM  KDLV+WNT
Sbjct: 430 FSTMVAKDLVSWNT 443



 Score = 60.8 bits (146), Expect = 3e-08
 Identities = 42/134 (31%), Positives = 66/134 (49%), Gaps = 6/134 (4%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           L+  Y R  +  +   VF  ++  +V++W  +I G+  +G  H++L  FREM     V+ 
Sbjct: 212 LMDAYCRMGKCCEAWRVFGEIEIPNVISWTILISGYASVGRHHVSLGIFREMVNVGMVSP 271

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMV--PVYNALVHMYSICGCIAYACS 359
           D   +S +L  C     L  G +IH Y  K  +  +       AL+ +Y+ CG +  A  
Sbjct: 272 DVDALSGVLVSCRALGALASGMEIHGYGLKIMYGDVFYRSAGAALLALYAGCGRLDRADV 331

Query: 360 VFSTMAKKDLVTWN 401
           VF  M K D+VTWN
Sbjct: 332 VFRRMDKSDVVTWN 345


>KHN26464.1 Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 473

 Score =  190 bits (482), Expect = 6e-56
 Identities = 96/134 (71%), Positives = 109/134 (81%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALL LYA   R    +NVF  MDKSDVVTWNAMI G +D+GL  LAL+CFREMQ G
Sbjct: 165 SAGAALLMLYAGWGRLDCADNVFWRMDKSDVVTWNAMIFGLVDVGLVDLALDCFREMQ-G 223

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RGV ID  TIS+ILPVCDLRCGK+IHAYV+K +F  ++PVYNAL+HMYSI GCIAYA SV
Sbjct: 224 RGVGIDGRTISSILPVCDLRCGKEIHAYVRKCNFSGVIPVYNALIHMYSIRGCIAYAYSV 283

Query: 363 FSTMAKKDLVTWNT 404
           FSTM  +DLV+WNT
Sbjct: 284 FSTMVARDLVSWNT 297



 Score = 55.1 bits (131), Expect = 3e-06
 Identities = 41/136 (30%), Positives = 66/136 (48%), Gaps = 8/136 (5%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y R  +  +   VF  ++  +V++W  +I G+  +G   ++L  FR+M     V+ 
Sbjct: 66  VMDAYCRMGQCCEASRVFGEIEDPNVISWTILISGYAGVGRHDVSLGIFRQMVNVGMVSP 125

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353
           D   +S +L  C     L  GK+IH Y  K    C    Y     AL+ +Y+  G +  A
Sbjct: 126 DVDALSGVLVSCRHLGALASGKEIHGYGLK--IMCGDVFYRSAGAALLMLYAGWGRLDCA 183

Query: 354 CSVFSTMAKKDLVTWN 401
            +VF  M K D+VTWN
Sbjct: 184 DNVFWRMDKSDVVTWN 199


>XP_003551036.1 PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Glycine max]
          Length = 619

 Score =  190 bits (482), Expect = 8e-55
 Identities = 96/134 (71%), Positives = 109/134 (81%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALL LYA   R    +NVF  MDKSDVVTWNAMI G +D+GL  LAL+CFREMQ G
Sbjct: 311 SAGAALLMLYAGWGRLDCADNVFWRMDKSDVVTWNAMIFGLVDVGLVDLALDCFREMQ-G 369

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RGV ID  TIS+ILPVCDLRCGK+IHAYV+K +F  ++PVYNAL+HMYSI GCIAYA SV
Sbjct: 370 RGVGIDGRTISSILPVCDLRCGKEIHAYVRKCNFSGVIPVYNALIHMYSIRGCIAYAYSV 429

Query: 363 FSTMAKKDLVTWNT 404
           FSTM  +DLV+WNT
Sbjct: 430 FSTMVARDLVSWNT 443



 Score = 55.1 bits (131), Expect = 3e-06
 Identities = 41/136 (30%), Positives = 66/136 (48%), Gaps = 8/136 (5%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y R  +  +   VF  ++  +V++W  +I G+  +G   ++L  FR+M     V+ 
Sbjct: 212 VMDAYCRMGQCCEASRVFGEIEDPNVISWTILISGYAGVGRHDVSLGIFRQMVNVGMVSP 271

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353
           D   +S +L  C     L  GK+IH Y  K    C    Y     AL+ +Y+  G +  A
Sbjct: 272 DVDALSGVLVSCRHLGALASGKEIHGYGLK--IMCGDVFYRSAGAALLMLYAGWGRLDCA 329

Query: 354 CSVFSTMAKKDLVTWN 401
            +VF  M K D+VTWN
Sbjct: 330 DNVFWRMDKSDVVTWN 345


>XP_014506479.1 PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Vigna radiata var. radiata]
          Length = 619

 Score =  189 bits (479), Expect = 2e-54
 Identities = 93/134 (69%), Positives = 108/134 (80%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALLTLYA C R    + VFR MDKSDVVTWNAMI G +D+G G LALECFR+MQ+ 
Sbjct: 311 SAGAALLTLYAGCGRLDRADIVFRRMDKSDVVTWNAMIFGLVDVGSGDLALECFRKMQE- 369

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RGV ID TT+ST+LPVCDLRCGK++HAYV+K     ++PV NALVHMYS+ GCIAYA +V
Sbjct: 370 RGVRIDGTTVSTVLPVCDLRCGKEMHAYVRKCCLSSVIPVNNALVHMYSVRGCIAYAFAV 429

Query: 363 FSTMAKKDLVTWNT 404
           FS M  KDLV+WNT
Sbjct: 430 FSAMLAKDLVSWNT 443



 Score = 54.7 bits (130), Expect = 5e-06
 Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 6/134 (4%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           L+  Y R  +  +    F  ++  +V++W  ++ G+   G   ++L  FR+M     V+ 
Sbjct: 212 LMDAYCRMGKCCEAWRAFGEIEVPNVISWTILMSGYASAGRHDVSLGIFRKMMNVGMVSP 271

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMV--PVYNALVHMYSICGCIAYACS 359
           D  T+S +L  C     L  G +IH Y  K  +  +       AL+ +Y+ CG +  A  
Sbjct: 272 DVDTLSGMLVSCRCLGALASGMEIHGYGLKIMYGDVFYRSAGAALLTLYAGCGRLDRADI 331

Query: 360 VFSTMAKKDLVTWN 401
           VF  M K D+VTWN
Sbjct: 332 VFRRMDKSDVVTWN 345


>XP_017410302.1 PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g17630 [Vigna angularis] KOM29543.1 hypothetical
           protein LR48_Vigan727s000200 [Vigna angularis]
           BAT75575.1 hypothetical protein VIGAN_01345300 [Vigna
           angularis var. angularis]
          Length = 619

 Score =  186 bits (473), Expect = 2e-53
 Identities = 94/134 (70%), Positives = 106/134 (79%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALLTLYA C R    + VFR MDKSDVVTWNAMI   +D+G G LALECFREMQ+ 
Sbjct: 311 SAGAALLTLYAGCGRLDRADIVFRRMDKSDVVTWNAMIFCLVDVGSGDLALECFREMQE- 369

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RGV ID TT+STILPVCDLRCGK++HAYV+K     ++PV NALVHMYS+ GCIAYA  V
Sbjct: 370 RGVRIDGTTVSTILPVCDLRCGKEMHAYVRKCCLSSVIPVNNALVHMYSVRGCIAYAFVV 429

Query: 363 FSTMAKKDLVTWNT 404
           FS M  KDLV+WNT
Sbjct: 430 FSAMVAKDLVSWNT 443



 Score = 55.5 bits (132), Expect = 3e-06
 Identities = 39/134 (29%), Positives = 63/134 (47%), Gaps = 6/134 (4%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           L+  Y R  +  +    F  ++  +V++W  ++ G+   G   ++L  FREM     V+ 
Sbjct: 212 LMDAYCRMGKCCEAWRAFGEIEVPNVISWTILLSGYASAGRHDVSLGIFREMMNVGMVSP 271

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMV--PVYNALVHMYSICGCIAYACS 359
           D   +S +L  C     L  G +IH Y  K  +  +       AL+ +Y+ CG +  A  
Sbjct: 272 DVDALSGVLVSCRSLGALASGMEIHGYGLKIMYGDVFYRSAGAALLTLYAGCGRLDRADI 331

Query: 360 VFSTMAKKDLVTWN 401
           VF  M K D+VTWN
Sbjct: 332 VFRRMDKSDVVTWN 345


>XP_016206998.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis ipaensis]
          Length = 609

 Score =  181 bits (458), Expect = 2e-51
 Identities = 89/134 (66%), Positives = 105/134 (78%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALLTLYA C R  D ENVF  MDKSDVVTWNAMI G IDMGL + A++CF+EMQ  
Sbjct: 327 SAGAALLTLYANCGRLNDAENVFDRMDKSDVVTWNAMIYGLIDMGLANEAVQCFKEMQAS 386

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
             V +D TT+ST+L  CDLR GK++HAYV K  ++ ++PV NAL+H YS CGCIAYA SV
Sbjct: 387 N-VKVDQTTVSTLLLACDLRRGKEMHAYVLKHRYNWVIPVCNALIHTYSKCGCIAYAYSV 445

Query: 363 FSTMAKKDLVTWNT 404
           FSTMA +DLV+WNT
Sbjct: 446 FSTMAVRDLVSWNT 459



 Score = 58.5 bits (140), Expect = 2e-07
 Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 6/134 (4%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y +     +   VF  +   +V++W  +I G+  +G   LAL  FR+M     V  
Sbjct: 228 MMDAYCKMGLCSEALRVFHQIKDPNVISWTTLISGYAGVGRHDLALGTFRDMVNFGMVLP 287

Query: 198 DHTTISTILPVC----DLRCGKQIHAY-VKKSDFDCMVPVYN-ALVHMYSICGCIAYACS 359
           D  ++S IL  C     L  G ++H Y VK    D        AL+ +Y+ CG +  A +
Sbjct: 288 DVDSLSGILVSCRFLGSLTSGNEVHCYGVKVISGDAFYKSAGAALLTLYANCGRLNDAEN 347

Query: 360 VFSTMAKKDLVTWN 401
           VF  M K D+VTWN
Sbjct: 348 VFDRMDKSDVVTWN 361


>XP_009366958.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39350
           isoform X1 [Pyrus x bretschneideri]
          Length = 690

 Score =  181 bits (460), Expect = 3e-51
 Identities = 87/134 (64%), Positives = 108/134 (80%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAG ALL LYA C R +D  NVFR M+ +DVV+WNAMILGFID+GL  LALECFR+MQ+ 
Sbjct: 382 SAGPALLILYANCSRIQDAINVFRLMNPADVVSWNAMILGFIDLGLEDLALECFRKMQRA 441

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           + V +D TTIST+LP C+L+ GKQIHA+V+K+ FD + PV+NAL+HMY+ICGCI  A SV
Sbjct: 442 Q-VKVDQTTISTVLPTCNLKFGKQIHAFVRKNSFDLVAPVWNALIHMYAICGCIESAYSV 500

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +DLVTWN+
Sbjct: 501 FSNMVHRDLVTWNS 514



 Score = 59.3 bits (142), Expect = 1e-07
 Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 6/134 (4%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y R       + +F  + + ++++W  +I GF  +G    +L+ FR+M  G  V  
Sbjct: 283 VMDAYCRLGHCDKAKRIFEQIKEPNIISWTTLISGFSRIGNHESSLKIFRDMMDGSRVYP 342

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKK--SDFDCMVPVYNALVHMYSICGCIAYACS 359
           D  ++S +L  C     L  GK+IH Y  K  S          AL+ +Y+ C  I  A +
Sbjct: 343 DLDSLSAVLVSCRHLGSLLNGKEIHGYGIKIGSGIAFYSSAGPALLILYANCSRIQDAIN 402

Query: 360 VFSTMAKKDLVTWN 401
           VF  M   D+V+WN
Sbjct: 403 VFRLMNPADVVSWN 416


>XP_015954901.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis duranensis]
          Length = 641

 Score =  181 bits (458), Expect = 3e-51
 Identities = 89/134 (66%), Positives = 105/134 (78%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALLTLYA C R  D ENVF  MDKSDVVTWNAMI G IDMGL + A++CF+EMQ  
Sbjct: 327 SAGAALLTLYANCGRLNDAENVFDRMDKSDVVTWNAMIYGLIDMGLANEAVQCFKEMQAS 386

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
             V +D TT+ST+L  CDLR GK++HAYV K  ++ ++PV NAL+H YS CGCIAYA SV
Sbjct: 387 N-VKVDQTTVSTLLLACDLRRGKEMHAYVLKHRYNWVIPVCNALIHTYSKCGCIAYAYSV 445

Query: 363 FSTMAKKDLVTWNT 404
           FSTMA +DLV+WNT
Sbjct: 446 FSTMAVRDLVSWNT 459



 Score = 58.2 bits (139), Expect = 3e-07
 Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 6/134 (4%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y +  R  +   VF  +   +V++W  +I G+  +    LAL  FR+M     V  
Sbjct: 228 MMDAYCKMGRCSEALRVFHQIKDPNVISWTTLISGYAGVRRHDLALGTFRDMVNFGMVLP 287

Query: 198 DHTTISTILPVC----DLRCGKQIHAY-VKKSDFDCMVPVYN-ALVHMYSICGCIAYACS 359
           D  ++S IL  C     L  G ++H Y VK    D        AL+ +Y+ CG +  A +
Sbjct: 288 DVDSLSGILVSCRFLGSLTSGNEVHCYGVKVISGDAFYKSAGAALLTLYANCGRLNDAEN 347

Query: 360 VFSTMAKKDLVTWN 401
           VF  M K D+VTWN
Sbjct: 348 VFDRMDKSDVVTWN 361


>XP_002277337.2 PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Vitis vinifera]
          Length = 634

 Score =  180 bits (457), Expect = 4e-51
 Identities = 82/134 (61%), Positives = 106/134 (79%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGAALLT+Y +C+R +D  NVF  MD+ DVVTWNAMILGF+D+ +GHLALECF +MQ+ 
Sbjct: 330 SAGAALLTMYVKCKRIQDALNVFELMDRFDVVTWNAMILGFVDLEMGHLALECFSKMQRS 389

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
            G+  +  TIST+LP CDL+ GKQ+HAY+ K+ F  ++PV+NAL+HMYS CGCI  A S+
Sbjct: 390 -GIMNNQITISTVLPACDLKSGKQVHAYITKNSFSSVIPVWNALIHMYSKCGCIGTAYSI 448

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +DLV+WNT
Sbjct: 449 FSNMISRDLVSWNT 462


>KRH04632.1 hypothetical protein GLYMA_17G175800 [Glycine max]
          Length = 456

 Score =  176 bits (447), Expect = 6e-51
 Identities = 85/115 (73%), Positives = 98/115 (85%)
 Frame = +3

Query: 60  ENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNIDHTTISTILPVCDL 239
           +NVF  MDKSDVVTWNAMI G +D+GL  LAL+CFREMQ GRGV ID  TIS+ILPVCDL
Sbjct: 167 DNVFWRMDKSDVVTWNAMIFGLVDVGLVDLALDCFREMQ-GRGVGIDGRTISSILPVCDL 225

Query: 240 RCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSVFSTMAKKDLVTWNT 404
           RCGK+IHAYV+K +F  ++PVYNAL+HMYSI GCIAYA SVFSTM  +DLV+WNT
Sbjct: 226 RCGKEIHAYVRKCNFSGVIPVYNALIHMYSIRGCIAYAYSVFSTMVARDLVSWNT 280


>GAV80726.1 PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2
           domain-containing protein [Cephalotus follicularis]
          Length = 581

 Score =  179 bits (453), Expect = 7e-51
 Identities = 86/134 (64%), Positives = 105/134 (78%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           S+G ALLT+YA C R  + +NVF  +DKSDVVTWNAMILGF+D+GLGHLALECF +MQ+ 
Sbjct: 270 SSGPALLTMYANCGRIWEAKNVFELLDKSDVVTWNAMILGFVDLGLGHLALECFSDMQR- 328

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RG   D TTIST+LPVCDL  GKQ+HA + +S  D  V V+NAL+HMYS CGCI  A SV
Sbjct: 329 RGFKNDDTTISTVLPVCDLTSGKQVHALIWRSHLDSAVSVWNALIHMYSKCGCIGSAYSV 388

Query: 363 FSTMAKKDLVTWNT 404
           F++M  +DLV+WNT
Sbjct: 389 FTSMVTRDLVSWNT 402


>ONH96536.1 hypothetical protein PRUPE_7G135300 [Prunus persica]
          Length = 646

 Score =  177 bits (450), Expect = 5e-50
 Identities = 85/134 (63%), Positives = 106/134 (79%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAG ALLT+YA CRR  D  NVF+ M+ + VV+WNAMILGFID+GL  LAL+ FR MQ+ 
Sbjct: 330 SAGPALLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALDSFRRMQRA 389

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           R +N+D TTISTILP C+L+ GKQIHA+++K  FD +VPV+NAL+HMYS CGCI  A SV
Sbjct: 390 R-INVDQTTISTILPACNLKFGKQIHAFIRKISFDLVVPVWNALIHMYSKCGCIGSAYSV 448

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +DLV+WN+
Sbjct: 449 FSNMINRDLVSWNS 462



 Score = 57.0 bits (136), Expect = 7e-07
 Identities = 38/136 (27%), Positives = 66/136 (48%), Gaps = 8/136 (5%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y R     +   +F  + + ++++W  +I G+  +G    +L  FR+M     V+ 
Sbjct: 231 VMDAYCRMGHCNEATRIFEQIKEPNIISWTTLISGYSRIGSHEASLRIFRDMIGSSMVDP 290

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353
           D  ++ST+L  C     L  GK+IH Y  K +    +  Y+    AL+ MY+ C  I  A
Sbjct: 291 DLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRESG--IAFYHSAGPALLTMYANCRRIHDA 348

Query: 354 CSVFSTMAKKDLVTWN 401
            +VF  M    +V+WN
Sbjct: 349 TNVFKLMNPAHVVSWN 364


>XP_008344308.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like isoform X1 [Malus domestica]
          Length = 690

 Score =  177 bits (450), Expect = 8e-50
 Identities = 87/134 (64%), Positives = 105/134 (78%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAG ALL LYA C R +D  NVFR M+ +DVV+WNAMILGFID+GL  LALECFR+MQ+ 
Sbjct: 382 SAGPALLILYANCSRIQDAINVFRLMNPADVVSWNAMILGFIDLGLXDLALECFRKMQRA 441

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           + V  D TTIST LP C+L+ GKQIHA+V+KS FD + PV+NAL+HMY+ CGCI  A SV
Sbjct: 442 Q-VKADQTTISTXLPTCNLKFGKQIHAFVRKSSFDLVAPVWNALIHMYAKCGCIESAYSV 500

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +DLVTWN+
Sbjct: 501 FSNMVNRDLVTWNS 514



 Score = 59.7 bits (143), Expect = 9e-08
 Identities = 38/136 (27%), Positives = 66/136 (48%), Gaps = 8/136 (5%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y R     + + +F  +   ++++W  +I GF  +G    +L+ FR+M  G  V  
Sbjct: 283 VMDAYCRLGHCDEAKRIFEQIKDPNIISWTTLISGFSRIGNHESSLKIFRDMMDGSRVYP 342

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353
           D  ++S ++  C     L  GK+IH Y  K      +  Y+    AL+ +Y+ C  I  A
Sbjct: 343 DLDSLSAVJVSCRHLGSLLNGKEIHGYGIK--IGSXIAFYSSAGPALLILYANCSRIQDA 400

Query: 354 CSVFSTMAKKDLVTWN 401
            +VF  M   D+V+WN
Sbjct: 401 INVFRLMNPADVVSWN 416


>XP_008340659.1 PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Malus domestica]
          Length = 817

 Score =  179 bits (453), Expect = 8e-50
 Identities = 87/134 (64%), Positives = 106/134 (79%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAG ALL LYA C R +D  NVFR M+ +DVV+WNAMILGFID+GL  LALECFR+MQ+ 
Sbjct: 382 SAGPALLILYANCSRIQDAINVFRLMNPADVVSWNAMILGFIDLGLEDLALECFRKMQRA 441

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           + V  D TTIST+LP C+L+ GKQIHA+V+KS FD + PV+NAL+HMY+ CGCI  A SV
Sbjct: 442 Q-VKADQTTISTVLPTCNLKFGKQIHAFVRKSSFDLVAPVWNALIHMYAKCGCIESAYSV 500

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +DLVTWN+
Sbjct: 501 FSNMVNRDLVTWNS 514



 Score = 59.7 bits (143), Expect = 9e-08
 Identities = 38/136 (27%), Positives = 66/136 (48%), Gaps = 8/136 (5%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y R     + + +F  +   ++++W  +I GF  +G    +L+ FR+M  G  V  
Sbjct: 283 VMDAYCRLGHCDEAKRIFEQIKDPNIISWTTLISGFSRIGNHESSLKIFRDMMDGSRVYP 342

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYN----ALVHMYSICGCIAYA 353
           D  ++S ++  C     L  GK+IH Y  K      +  Y+    AL+ +Y+ C  I  A
Sbjct: 343 DLDSLSAVJVSCRHLGSLLNGKEIHGYGIK--IGSXIAFYSSAGPALLILYANCSRIQDA 400

Query: 354 CSVFSTMAKKDLVTWN 401
            +VF  M   D+V+WN
Sbjct: 401 INVFRLMNPADVVSWN 416


>XP_007047218.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic [Theobroma cacao] EOX91375.1
           Pentatricopeptide repeat-containing protein, putative
           [Theobroma cacao]
          Length = 635

 Score =  176 bits (447), Expect = 1e-49
 Identities = 83/134 (61%), Positives = 104/134 (77%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAG ALLTL+++C R RD  N+F  MDKSD VTWNAMILGF+D GLGH+A++CF EMQ+ 
Sbjct: 327 SAGPALLTLHSKCGRSRDAGNIFELMDKSDTVTWNAMILGFVDRGLGHMAVDCFGEMQR- 385

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
            G+  D TTI T+LPVC+LR GKQ+HAY+++   D + P++NALVHMYS CG I  A SV
Sbjct: 386 MGIKNDQTTICTVLPVCELRQGKQLHAYIRRQYSDSICPIWNALVHMYSKCGSIGSAYSV 445

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +DLV+WNT
Sbjct: 446 FSNMVARDLVSWNT 459


>XP_015890127.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Ziziphus jujuba]
          Length = 635

 Score =  173 bits (438), Expect = 2e-48
 Identities = 82/134 (61%), Positives = 106/134 (79%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAGA LLT+YA+  R +D +NVF+ MD++DVVTWNAMILGF D+GL H ALECF +MQ+ 
Sbjct: 327 SAGATLLTMYAKYGRLQDAKNVFKLMDQADVVTWNAMILGFADVGLEHSALECFSKMQRA 386

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
            G+  D TTIST+LPVCDL+ GKQIHA+++K  FD + PV+NAL++MYS CGCI  A  V
Sbjct: 387 -GIKNDRTTISTVLPVCDLKSGKQIHAFIRKGCFDLVTPVWNALIYMYSKCGCIRSASLV 445

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +D+V+WN+
Sbjct: 446 FSNMLTRDVVSWNS 459



 Score = 60.8 bits (146), Expect = 3e-08
 Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 4/132 (3%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           L+ +YA C        +F  + + +V  W A+I  +   G+    +  + EM    GV+ 
Sbjct: 61  LVQMYADCDHLLSARILFDQLSQPNVFAWTAIIGFYSRHGMYQKCVRTYAEMSL-MGVSP 119

Query: 198 DHTTISTILPVCD----LRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSVF 365
           D      +L VC     L+ G QIH  V  S F+    V N+L+ MYS C  +  A  VF
Sbjct: 120 DEYVFPKVLKVCAQSSCLKAGMQIHKDVITSGFEFSSEVCNSLIEMYSKCMDVQNAKRVF 179

Query: 366 STMAKKDLVTWN 401
             +  +DL++WN
Sbjct: 180 DVIVGRDLLSWN 191



 Score = 60.1 bits (144), Expect = 6e-08
 Identities = 42/136 (30%), Positives = 65/136 (47%), Gaps = 6/136 (4%)
 Frame = +3

Query: 15  ALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVN 194
           AL+ +Y++C   R    VF  M   DVV+WN+M+ GF   GLG  ALE  +EM+Q   + 
Sbjct: 428 ALIYMYSKCGCIRSASLVFSNMLTRDVVSWNSMMGGFRMHGLGQAALELLKEMRQS-ALE 486

Query: 195 IDHTTISTILPVCDLR--CGKQIHAYVKKSDFDCMVPV---YNALVHMYSICGCIAYACS 359
            D  T +++L  C       + +  + K + + C+ P    Y  +V M +  G +  A S
Sbjct: 487 PDSMTFTSVLSACSHSGLVNEGLEVFHKMTKYYCLTPSMEHYACIVDMLARAGRLQDAVS 546

Query: 360 VFSTM-AKKDLVTWNT 404
               M  + D   W T
Sbjct: 547 FIQNMPLEPDKSIWGT 562



 Score = 57.8 bits (138), Expect = 4e-07
 Identities = 37/136 (27%), Positives = 69/136 (50%), Gaps = 8/136 (5%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           ++  Y + R   +  N+F  + + ++++W  +I G+  +G   ++L  FR+M     ++ 
Sbjct: 228 VMDAYCQMRLCDEAWNIFERIKEPNIISWTTLIKGYSRIGNHEVSLRIFRDMISSGMISP 287

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYNA----LVHMYSICGCIAYA 353
           D   +S +L  C     L  G++IH+Y  K      +  YN+    L+ MY+  G +  A
Sbjct: 288 DLDCLSGVLVSCRHLGSLSGGREIHSYGIK--MKSCIAFYNSAGATLLTMYAKYGRLQDA 345

Query: 354 CSVFSTMAKKDLVTWN 401
            +VF  M + D+VTWN
Sbjct: 346 KNVFKLMDQADVVTWN 361


>KDP31910.1 hypothetical protein JCGZ_12371 [Jatropha curcas]
          Length = 377

 Score =  162 bits (410), Expect = 4e-46
 Identities = 84/135 (62%), Positives = 101/135 (74%), Gaps = 1/135 (0%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           SAG ALLT+YA+C   +    VF  MDKSDVVTWNAMILGF+++ L  LALECF  MQ+ 
Sbjct: 57  SAGPALLTMYAKCGIIQYARFVFELMDKSDVVTWNAMILGFVELQLVQLALECFSGMQRS 116

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYV-KKSDFDCMVPVYNALVHMYSICGCIAYACS 359
            GV  D TTISTILPVC L+CGKQIHAY+ + S  + +VPV++A++HMY   GCI  A S
Sbjct: 117 -GVKNDQTTISTILPVCGLKCGKQIHAYILRSSSLNSVVPVWSAMIHMYCKSGCIRSAYS 175

Query: 360 VFSTMAKKDLVTWNT 404
           VFS MA KD+VTWNT
Sbjct: 176 VFSNMAVKDIVTWNT 190


>XP_010263222.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Nelumbo nucifera]
          Length = 654

 Score =  167 bits (422), Expect = 5e-46
 Identities = 81/134 (60%), Positives = 103/134 (76%)
 Frame = +3

Query: 3   SAGAALLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQG 182
           S+G ALLT+YA   R RD  NVF+ MDKSDVVTWNAMILG + +GLG LA++  REMQ  
Sbjct: 330 SSGPALLTVYATSGRLRDARNVFQLMDKSDVVTWNAMILGLVHLGLGDLAIKYVREMQS- 388

Query: 183 RGVNIDHTTISTILPVCDLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSV 362
           RG+  D TT+ST+LPVCDLR GKQIHAY++++  D  + V+NAL++MYS CGCI  A +V
Sbjct: 389 RGLQYDETTVSTVLPVCDLRFGKQIHAYIRRNALDSAISVWNALINMYSKCGCIRSAYTV 448

Query: 363 FSTMAKKDLVTWNT 404
           FS M  +D+V+WNT
Sbjct: 449 FSKMDSRDVVSWNT 462



 Score = 58.5 bits (140), Expect = 2e-07
 Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 4/133 (3%)
 Frame = +3

Query: 18  LLTLYARCRRPRDVENVFRGMDKSDVVTWNAMILGFIDMGLGHLALECFREMQQGRGVNI 197
           L+ +YA C        +F  + + +V  W ++I  +   G+    +  + EM+  +G+  
Sbjct: 64  LVQMYAACNDLISARILFDELPRPNVFAWTSIISFYSRNGMFKECVRTYNEMKL-QGIGP 122

Query: 198 DHTTISTILPVC----DLRCGKQIHAYVKKSDFDCMVPVYNALVHMYSICGCIAYACSVF 365
           D      +L  C     L  G +IH  + +   +  + V N+L+ MYS CG +  A  +F
Sbjct: 123 DGYVFPKVLRACTQSLSLAEGIRIHKDIIELGAEHNLQVCNSLIDMYSKCGDVQTAQRIF 182

Query: 366 STMAKKDLVTWNT 404
           + MA+KDL+TWN+
Sbjct: 183 NGMAEKDLLTWNS 195

