BLASTX nr result

ID: Astragalus22_contig00035420 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00035420
         (444 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012573667.1| PREDICTED: pentatricopeptide repeat-containi...   224   4e-67
ref|XP_016206998.1| pentatricopeptide repeat-containing protein ...   206   8e-61
gb|KHN26464.1| Pentatricopeptide repeat-containing protein [Glyc...   199   5e-59
ref|XP_015954901.1| pentatricopeptide repeat-containing protein ...   201   1e-58
ref|XP_003551036.1| PREDICTED: pentatricopeptide repeat-containi...   199   8e-58
ref|XP_007155935.1| hypothetical protein PHAVU_003G244800g [Phas...   188   7e-54
ref|XP_014506479.1| pentatricopeptide repeat-containing protein ...   185   1e-52
ref|XP_017410302.1| PREDICTED: putative pentatricopeptide repeat...   184   2e-52
ref|XP_020239719.1| pentatricopeptide repeat-containing protein ...   181   5e-51
ref|XP_015890127.1| PREDICTED: pentatricopeptide repeat-containi...   178   5e-50
gb|ONH96536.1| hypothetical protein PRUPE_7G135300 [Prunus persica]   176   4e-49
ref|XP_021897472.1| pentatricopeptide repeat-containing protein ...   176   4e-49
ref|XP_020424413.1| pentatricopeptide repeat-containing protein ...   176   7e-49
ref|XP_021825144.1| pentatricopeptide repeat-containing protein ...   173   2e-48
ref|XP_024183907.1| pentatricopeptide repeat-containing protein ...   168   2e-47
gb|KZM88651.1| hypothetical protein DCAR_025726 [Daucus carota s...   169   9e-47
ref|XP_017219367.1| PREDICTED: pentatricopeptide repeat-containi...   169   2e-46
ref|XP_008344308.1| PREDICTED: pentatricopeptide repeat-containi...   169   3e-46
ref|XP_007047218.1| PREDICTED: pentatricopeptide repeat-containi...   168   4e-46
ref|XP_024183906.1| putative pentatricopeptide repeat-containing...   168   5e-46

>ref|XP_012573667.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g16610-like [Cicer arietinum]
          Length = 653

 Score =  224 bits (570), Expect = 4e-67
 Identities = 110/147 (74%), Positives = 122/147 (82%)
 Frame = -3

Query: 442 LFEKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTL 263
           L ++ L +  FMGMDGCE  PDVVTWNTVMDAYCK+GL+ +A RVFE+IKDPNVISWT L
Sbjct: 185 LSDRVLGMLEFMGMDGCE--PDVVTWNTVMDAYCKMGLVDEALRVFEQIKDPNVISWTIL 242

Query: 262 ILGYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNL 83
           I GYSG GKH VALE FR MVN+GM+ PDVDALSGILVSC+ LG L  GREIHGYG+KN 
Sbjct: 243 ISGYSGVGKHVVALEIFRDMVNVGMIIPDVDALSGILVSCKFLGNLTSGREIHGYGLKNG 302

Query: 82  SLNNAFYKSAGAALLTLYARCGRLHDA 2
             N+ FYKSAGAALLTLYARC RLHDA
Sbjct: 303 FRNDVFYKSAGAALLTLYARCNRLHDA 329



 Score = 54.7 bits (130), Expect = 9e-06
 Identities = 39/115 (33%), Positives = 57/115 (49%), Gaps = 1/115 (0%)
 Frame = -3

Query: 358 VMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVFP 179
           ++  Y +   + DA +VF  + D +V++W  +ILGY   G   +A E FR+M   G V  
Sbjct: 316 LLTLYARCNRLHDAEKVFRVMDDSDVVTWNAMILGYIDTGLGRLAFECFREMQGRG-VRI 374

Query: 178 DVDALSGILVSCRSLGALRIGREIHGYGIK-NLSLNNAFYKSAGAALLTLYARCG 17
           D   +S IL  C     LR G++IH Y  K N       Y     AL+ +Y+ CG
Sbjct: 375 DQTTISTILPVC----DLRCGKQIHAYVRKSNFDCAVGVYN----ALIHMYSICG 421


>ref|XP_016206998.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis ipaensis]
 ref|XP_020973997.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis ipaensis]
          Length = 609

 Score =  206 bits (525), Expect = 8e-61
 Identities = 102/147 (69%), Positives = 120/147 (81%)
 Frame = -3

Query: 442 LFEKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTL 263
           LFE+A+ L G M   G +C+PDVVTWNT+MDAYCK+GL  +A RVF +IKDPNVISWTTL
Sbjct: 201 LFEEAVRLLGLMRASG-DCEPDVVTWNTMMDAYCKMGLCSEALRVFHQIKDPNVISWTTL 259

Query: 262 ILGYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNL 83
           I GY+G G+HD+AL  FR MVN GMV PDVD+LSGILVSCR LG+L  G E+H YG+K +
Sbjct: 260 ISGYAGVGRHDLALGTFRDMVNFGMVLPDVDSLSGILVSCRFLGSLTSGNEVHCYGVKVI 319

Query: 82  SLNNAFYKSAGAALLTLYARCGRLHDA 2
           S  +AFYKSAGAALLTLYA CGRL+DA
Sbjct: 320 S-GDAFYKSAGAALLTLYANCGRLNDA 345


>gb|KHN26464.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 473

 Score =  199 bits (505), Expect = 5e-59
 Identities = 96/141 (68%), Positives = 115/141 (81%)
 Frame = -3

Query: 433 KALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILG 254
           KA+E+ G M  DGC C+PDVVTWNTVMDAYC++G   +ASRVF +I+DPNVISWT LI G
Sbjct: 41  KAVEVLGVMKKDGCGCEPDVVTWNTVMDAYCRMGQCCEASRVFGEIEDPNVISWTILISG 100

Query: 253 YSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLN 74
           Y+G G+HDV+L  FR+MVN+GMV PDVDALSG+LVSCR LGAL  G+EIHGYG+K +   
Sbjct: 101 YAGVGRHDVSLGIFRQMVNVGMVSPDVDALSGVLVSCRHLGALASGKEIHGYGLK-IMCG 159

Query: 73  NAFYKSAGAALLTLYARCGRL 11
           + FY+SAGAALL LYA  GRL
Sbjct: 160 DVFYRSAGAALLMLYAGWGRL 180


>ref|XP_015954901.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis duranensis]
 ref|XP_020993529.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis duranensis]
          Length = 641

 Score =  201 bits (511), Expect = 1e-58
 Identities = 100/147 (68%), Positives = 118/147 (80%)
 Frame = -3

Query: 442 LFEKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTL 263
           LFE+A+ L G M   G +C+PDVVTWNT+MDAYCK+G   +A RVF +IKDPNVISWTTL
Sbjct: 201 LFEEAVRLLGLMRASG-DCEPDVVTWNTMMDAYCKMGRCSEALRVFHQIKDPNVISWTTL 259

Query: 262 ILGYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNL 83
           I GY+G  +HD+AL  FR MVN GMV PDVD+LSGILVSCR LG+L  G E+H YG+K +
Sbjct: 260 ISGYAGVRRHDLALGTFRDMVNFGMVLPDVDSLSGILVSCRFLGSLTSGNEVHCYGVKVI 319

Query: 82  SLNNAFYKSAGAALLTLYARCGRLHDA 2
           S  +AFYKSAGAALLTLYA CGRL+DA
Sbjct: 320 S-GDAFYKSAGAALLTLYANCGRLNDA 345


>ref|XP_003551036.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Glycine max]
          Length = 619

 Score =  199 bits (505), Expect = 8e-58
 Identities = 96/141 (68%), Positives = 115/141 (81%)
 Frame = -3

Query: 433 KALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILG 254
           KA+E+ G M  DGC C+PDVVTWNTVMDAYC++G   +ASRVF +I+DPNVISWT LI G
Sbjct: 187 KAVEVLGVMKKDGCGCEPDVVTWNTVMDAYCRMGQCCEASRVFGEIEDPNVISWTILISG 246

Query: 253 YSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLN 74
           Y+G G+HDV+L  FR+MVN+GMV PDVDALSG+LVSCR LGAL  G+EIHGYG+K +   
Sbjct: 247 YAGVGRHDVSLGIFRQMVNVGMVSPDVDALSGVLVSCRHLGALASGKEIHGYGLK-IMCG 305

Query: 73  NAFYKSAGAALLTLYARCGRL 11
           + FY+SAGAALL LYA  GRL
Sbjct: 306 DVFYRSAGAALLMLYAGWGRL 326


>ref|XP_007155935.1| hypothetical protein PHAVU_003G244800g [Phaseolus vulgaris]
 gb|ESW27929.1| hypothetical protein PHAVU_003G244800g [Phaseolus vulgaris]
          Length = 619

 Score =  188 bits (478), Expect = 7e-54
 Identities = 92/145 (63%), Positives = 114/145 (78%)
 Frame = -3

Query: 436 EKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLIL 257
           ++A+E+F  M  +GCEC PDVVTWNT+MDAYC++G   +A RVF +I+ PNVISWT LI 
Sbjct: 186 QRAVEVFRVMKGNGCECAPDVVTWNTLMDAYCRMGKCCEAWRVFGEIEIPNVISWTILIS 245

Query: 256 GYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSL 77
           GY+  G+H V+L  FR+MVN+GMV PDVDALSG+LVSCR+LGAL  G EIHGYG+K +  
Sbjct: 246 GYASVGRHHVSLGIFREMVNVGMVSPDVDALSGVLVSCRALGALASGMEIHGYGLK-IMY 304

Query: 76  NNAFYKSAGAALLTLYARCGRLHDA 2
            + FY+SAGAALL LYA CGRL  A
Sbjct: 305 GDVFYRSAGAALLALYAGCGRLDRA 329


>ref|XP_014506479.1| pentatricopeptide repeat-containing protein At5g39350-like [Vigna
           radiata var. radiata]
          Length = 619

 Score =  185 bits (470), Expect = 1e-52
 Identities = 91/145 (62%), Positives = 112/145 (77%)
 Frame = -3

Query: 436 EKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLIL 257
           ++A+E+F  M  DG EC PDVVTWNT+MDAYC++G   +A R F +I+ PNVISWT L+ 
Sbjct: 186 QRAVEVFRVMKRDGRECAPDVVTWNTLMDAYCRMGKCCEAWRAFGEIEVPNVISWTILMS 245

Query: 256 GYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSL 77
           GY+  G+HDV+L  FRKM+N+GMV PDVD LSG+LVSCR LGAL  G EIHGYG+K +  
Sbjct: 246 GYASAGRHDVSLGIFRKMMNVGMVSPDVDTLSGMLVSCRCLGALASGMEIHGYGLK-IMY 304

Query: 76  NNAFYKSAGAALLTLYARCGRLHDA 2
            + FY+SAGAALLTLYA CGRL  A
Sbjct: 305 GDVFYRSAGAALLTLYAGCGRLDRA 329


>ref|XP_017410302.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g17630 [Vigna angularis]
 gb|KOM29543.1| hypothetical protein LR48_Vigan727s000200 [Vigna angularis]
 dbj|BAT75575.1| hypothetical protein VIGAN_01345300 [Vigna angularis var.
           angularis]
          Length = 619

 Score =  184 bits (468), Expect = 2e-52
 Identities = 90/145 (62%), Positives = 112/145 (77%)
 Frame = -3

Query: 436 EKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLIL 257
           ++A+ +F  M  DG EC PD VTWNT+MDAYC++G   +A R F +I+ PNVISWT L+ 
Sbjct: 186 QRAVGVFRVMKRDGFECAPDAVTWNTLMDAYCRMGKCCEAWRAFGEIEVPNVISWTILLS 245

Query: 256 GYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSL 77
           GY+  G+HDV+L  FR+M+N+GMV PDVDALSG+LVSCRSLGAL  G EIHGYG+K +  
Sbjct: 246 GYASAGRHDVSLGIFREMMNVGMVSPDVDALSGVLVSCRSLGALASGMEIHGYGLK-IMY 304

Query: 76  NNAFYKSAGAALLTLYARCGRLHDA 2
            + FY+SAGAALLTLYA CGRL  A
Sbjct: 305 GDVFYRSAGAALLTLYAGCGRLDRA 329


>ref|XP_020239719.1| pentatricopeptide repeat-containing protein At5g39350-like [Cajanus
           cajan]
          Length = 614

 Score =  181 bits (458), Expect = 5e-51
 Identities = 91/145 (62%), Positives = 111/145 (76%)
 Frame = -3

Query: 436 EKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLIL 257
           ++A+E+FG M     EC PDVVTWNT+MDAYCK+GL  +A+RVF +I+ PNVISWT LI 
Sbjct: 184 DRAVEVFGVMKE---ECQPDVVTWNTLMDAYCKMGLCCEAARVFGEIEVPNVISWTILIS 240

Query: 256 GYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSL 77
           GY   G+H V LE FR+MV++G V PDVDALS +LVSCR LG L  GREIHGYG+K +  
Sbjct: 241 GYGSVGRHGVCLEIFREMVSVGRVLPDVDALSCVLVSCRFLGDLASGREIHGYGLK-IMC 299

Query: 76  NNAFYKSAGAALLTLYARCGRLHDA 2
            + FY+SAGAALLTLYA CGR+  A
Sbjct: 300 GDVFYRSAGAALLTLYAGCGRVDRA 324


>ref|XP_015890127.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Ziziphus jujuba]
          Length = 635

 Score =  178 bits (452), Expect = 5e-50
 Identities = 90/147 (61%), Positives = 109/147 (74%)
 Frame = -3

Query: 442 LFEKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTL 263
           L E A++L   M  DGCE  PDVVTWN VMDAYC++ L  +A  +FE+IK+PN+ISWTTL
Sbjct: 202 LLESAVKLLDRMRFDGCE--PDVVTWNIVMDAYCQMRLCDEAWNIFERIKEPNIISWTTL 259

Query: 262 ILGYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNL 83
           I GYS  G H+V+L  FR M++ GM+ PD+D LSG+LVSCR LG+L  GREIH YGIK  
Sbjct: 260 IKGYSRIGNHEVSLRIFRDMISSGMISPDLDCLSGVLVSCRHLGSLSGGREIHSYGIKMK 319

Query: 82  SLNNAFYKSAGAALLTLYARCGRLHDA 2
           S   AFY SAGA LLT+YA+ GRL DA
Sbjct: 320 SC-IAFYNSAGATLLTMYAKYGRLQDA 345



 Score = 55.8 bits (133), Expect = 4e-06
 Identities = 36/120 (30%), Positives = 59/120 (49%)
 Frame = -3

Query: 361 TVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVF 182
           T++  Y K G + DA  VF+ +   +V++W  +ILG++  G    ALE F KM   G + 
Sbjct: 331 TLLTMYAKYGRLQDAKNVFKLMDQADVVTWNAMILGFADVGLEHSALECFSKMQRAG-IK 389

Query: 181 PDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYARCGRLHDA 2
            D   +S +L  C     L+ G++IH +  K              AL+ +Y++CG +  A
Sbjct: 390 NDRTTISTVLPVC----DLKSGKQIHAFIRKGCF---DLVTPVWNALIYMYSKCGCIRSA 442



 Score = 55.8 bits (133), Expect = 4e-06
 Identities = 39/128 (30%), Positives = 61/128 (47%), Gaps = 2/128 (1%)
 Frame = -3

Query: 379 DVVT--WNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRK 206
           D+VT  WN ++  Y K G I  AS VF  +   +V+SW +++ G+   G    ALE  ++
Sbjct: 420 DLVTPVWNALIYMYSKCGCIRSASLVFSNMLTRDVVSWNSMMGGFRMHGLGQAALELLKE 479

Query: 205 MVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYA 26
           M    +  PD    + +L +C   G +  G E+     K   L  +      A ++ + A
Sbjct: 480 MRQSALE-PDSMTFTSVLSACSHSGLVNEGLEVFHKMTKYYCLTPSM--EHYACIVDMLA 536

Query: 25  RCGRLHDA 2
           R GRL DA
Sbjct: 537 RAGRLQDA 544


>gb|ONH96536.1| hypothetical protein PRUPE_7G135300 [Prunus persica]
          Length = 646

 Score =  176 bits (446), Expect = 4e-49
 Identities = 87/143 (60%), Positives = 108/143 (75%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A+ELF  M + GCE  PD+VT NTVMDAYC++G   +A+R+FE+IK+PN+ISWTTLI GY
Sbjct: 209 AVELFDCMNLGGCE--PDIVTLNTVMDAYCRMGHCNEATRIFEQIKEPNIISWTTLISGY 266

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G H+ +L  FR M+   MV PD+D+LS +LVSCR LG+L  G+EIHGYGIK  S   
Sbjct: 267 SRIGSHEASLRIFRDMIGSSMVDPDLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRES-GI 325

Query: 70  AFYKSAGAALLTLYARCGRLHDA 2
           AFY SAG ALLT+YA C R+HDA
Sbjct: 326 AFYHSAGPALLTMYANCRRIHDA 348



 Score = 57.8 bits (138), Expect = 8e-07
 Identities = 36/119 (30%), Positives = 63/119 (52%)
 Frame = -3

Query: 358 VMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVFP 179
           ++  Y     I DA+ VF+ +   +V+SW  +ILG+   G  D+AL+ FR+M     +  
Sbjct: 335 LLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALDSFRRMQR-ARINV 393

Query: 178 DVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYARCGRLHDA 2
           D   +S IL +C     L+ G++IH + I+ +S +         AL+ +Y++CG +  A
Sbjct: 394 DQTTISTILPACN----LKFGKQIHAF-IRKISFD--LVVPVWNALIHMYSKCGCIGSA 445


>ref|XP_021897472.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Carica papaya]
          Length = 627

 Score =  176 bits (445), Expect = 4e-49
 Identities = 86/147 (58%), Positives = 110/147 (74%)
 Frame = -3

Query: 442 LFEKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTL 263
           L   A++LF  +  +GCE  PD+VT+NTV+DAYC++GL  +A ++F +IKDPN+ISWTTL
Sbjct: 196 LLRLAVQLFSSVRANGCE--PDIVTFNTVLDAYCRMGLCEEAWKIFGQIKDPNIISWTTL 253

Query: 262 ILGYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNL 83
           I GYS  GKH++AL  FR MVN+G VFPD+ +LS +LVSCR LGAL  GREIHGYG K +
Sbjct: 254 ISGYSRTGKHEIALRKFRTMVNMGRVFPDLGSLSSVLVSCRHLGALMSGREIHGYGTK-M 312

Query: 82  SLNNAFYKSAGAALLTLYARCGRLHDA 2
                FY SAG ALLT+Y +C R+ DA
Sbjct: 313 ERGTKFYSSAGPALLTMYTKCHRIQDA 339



 Score = 54.7 bits (130), Expect = 9e-06
 Identities = 36/136 (26%), Positives = 62/136 (45%), Gaps = 5/136 (3%)
 Frame = -3

Query: 394 CECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEF 215
           C    D  T   ++  Y     +  A  +F+K+  PNV +W++++  YS  G ++  L  
Sbjct: 43  CGSSRDPFTLTKLLQLYVDCDDLDSAQNLFDKLPQPNVFAWSSILAFYSRHGSYEECLHS 102

Query: 214 FRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGI-----KNLSLNNAFYKSAG 50
           +R M  +  V PD      +L +C    +L  G +IH + I      NL + N       
Sbjct: 103 YRDM-KVKGVSPDNYVFPQVLRACAQSSSLEEGIQIHKHVIVYGSELNLQVCN------- 154

Query: 49  AALLTLYARCGRLHDA 2
            +L+ +YA+CG +  A
Sbjct: 155 -SLIDMYAKCGDVESA 169


>ref|XP_020424413.1| pentatricopeptide repeat-containing protein At5g39350 isoform X1
           [Prunus persica]
          Length = 699

 Score =  176 bits (446), Expect = 7e-49
 Identities = 87/143 (60%), Positives = 108/143 (75%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A+ELF  M + GCE  PD+VT NTVMDAYC++G   +A+R+FE+IK+PN+ISWTTLI GY
Sbjct: 262 AVELFDCMNLGGCE--PDIVTLNTVMDAYCRMGHCNEATRIFEQIKEPNIISWTTLISGY 319

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G H+ +L  FR M+   MV PD+D+LS +LVSCR LG+L  G+EIHGYGIK  S   
Sbjct: 320 SRIGSHEASLRIFRDMIGSSMVDPDLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRES-GI 378

Query: 70  AFYKSAGAALLTLYARCGRLHDA 2
           AFY SAG ALLT+YA C R+HDA
Sbjct: 379 AFYHSAGPALLTMYANCRRIHDA 401



 Score = 57.8 bits (138), Expect = 8e-07
 Identities = 36/119 (30%), Positives = 63/119 (52%)
 Frame = -3

Query: 358 VMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVFP 179
           ++  Y     I DA+ VF+ +   +V+SW  +ILG+   G  D+AL+ FR+M     +  
Sbjct: 388 LLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALDSFRRMQR-ARINV 446

Query: 178 DVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYARCGRLHDA 2
           D   +S IL +C     L+ G++IH + I+ +S +         AL+ +Y++CG +  A
Sbjct: 447 DQTTISTILPACN----LKFGKQIHAF-IRKISFD--LVVPVWNALIHMYSKCGCIGSA 498


>ref|XP_021825144.1| pentatricopeptide repeat-containing protein At5g39350-like [Prunus
           avium]
          Length = 580

 Score =  173 bits (438), Expect = 2e-48
 Identities = 86/143 (60%), Positives = 106/143 (74%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A++LF  M + GCE  PD+VT NTVMDAYC++G    A R+FE+IK+PN+ISWTTLI GY
Sbjct: 143 AVKLFDCMNLGGCE--PDIVTLNTVMDAYCRMGHCNKAKRIFEQIKEPNIISWTTLISGY 200

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G H+ +L  FR M+   MV PD+D+LS +LVSCR LG+L  G+EIHGYGIK  S   
Sbjct: 201 SRIGSHEASLRIFRDMIGSSMVDPDLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRES-GI 259

Query: 70  AFYKSAGAALLTLYARCGRLHDA 2
           AFY SAG ALLT+YA C R+HDA
Sbjct: 260 AFYHSAGPALLTMYANCRRIHDA 282



 Score = 57.8 bits (138), Expect = 7e-07
 Identities = 36/119 (30%), Positives = 63/119 (52%)
 Frame = -3

Query: 358 VMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVFP 179
           ++  Y     I DA+ VF+ +   +V+SW  +ILG+   G  D+AL+ FR+M     +  
Sbjct: 269 LLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALDSFRRMQR-ARINV 327

Query: 178 DVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYARCGRLHDA 2
           D   +S IL +C     L+ G++IH + I+ +S +         AL+ +Y++CG +  A
Sbjct: 328 DQTTISTILPACN----LKFGKQIHAF-IRKISFD--LVVPVWNALIHMYSKCGCIGSA 379


>ref|XP_024183907.1| pentatricopeptide repeat-containing protein At2g13600 isoform X2
           [Rosa chinensis]
          Length = 433

 Score =  168 bits (425), Expect = 2e-47
 Identities = 85/143 (59%), Positives = 105/143 (73%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A+ELFG M  DGCE  PDVVT NTVMDAYC++GL  +A  +F+ IK+PN+ISWTTLI GY
Sbjct: 5   AIELFGCMNSDGCE--PDVVTLNTVMDAYCRMGLCDEAKGIFKHIKEPNIISWTTLISGY 62

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G H+ +L  FR M+N  MV+PD+D+LS  LVS R LG+L  G+EIHGYG+K  S   
Sbjct: 63  SRIGNHEASLGIFRDMMNSSMVYPDLDSLSIALVSSRHLGSLLSGKEIHGYGLKRES-GI 121

Query: 70  AFYKSAGAALLTLYARCGRLHDA 2
            FY SAG ALLT+YA C ++ DA
Sbjct: 122 VFYISAGPALLTMYANCRKIQDA 144



 Score = 56.6 bits (135), Expect = 2e-06
 Identities = 37/114 (32%), Positives = 57/114 (50%)
 Frame = -3

Query: 358 VMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVFP 179
           ++  Y     I DA  VF  +     +SW  +ILG+   G  D+ALE FRKM  I  +  
Sbjct: 131 LLTMYANCRKIQDAENVFRFMDPAQAVSWNAMILGFIDLGLEDLALECFRKM-QIAEIKL 189

Query: 178 DVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYARCG 17
           D   LS +L +C     L+ G++IH + I+  S +         AL+ +Y++CG
Sbjct: 190 DQTTLSTVLPTCN----LKFGKQIHAF-IRKSSFD--LVVPVWNALIHMYSKCG 236


>gb|KZM88651.1| hypothetical protein DCAR_025726 [Daucus carota subsp. sativus]
          Length = 577

 Score =  169 bits (427), Expect = 9e-47
 Identities = 82/140 (58%), Positives = 105/140 (75%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A+ELFGFM M G E  PD VTWNT++DAYC++G   +AS VF+KIK+PN+ISWTTLI GY
Sbjct: 150 AIELFGFMRMGGFE--PDTVTWNTIVDAYCRMGQCDEASNVFKKIKEPNIISWTTLISGY 207

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G+H+V L  FR+M++IG V PD+D LS +LVSCR +     GREIH +GIK +++  
Sbjct: 208 SRIGEHEVTLSIFREMMSIGKVCPDLDCLSSVLVSCRHVEGFNFGREIHAHGIKTINI-T 266

Query: 70  AFYKSAGAALLTLYARCGRL 11
           AFYKSAG ALL +YA   R+
Sbjct: 267 AFYKSAGPALLVMYATNRRM 286


>ref|XP_017219367.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Daucus carota subsp. sativus]
          Length = 647

 Score =  169 bits (427), Expect = 2e-46
 Identities = 82/140 (58%), Positives = 105/140 (75%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A+ELFGFM M G E  PD VTWNT++DAYC++G   +AS VF+KIK+PN+ISWTTLI GY
Sbjct: 220 AIELFGFMRMGGFE--PDTVTWNTIVDAYCRMGQCDEASNVFKKIKEPNIISWTTLISGY 277

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G+H+V L  FR+M++IG V PD+D LS +LVSCR +     GREIH +GIK +++  
Sbjct: 278 SRIGEHEVTLSIFREMMSIGKVCPDLDCLSSVLVSCRHVEGFNFGREIHAHGIKTINI-T 336

Query: 70  AFYKSAGAALLTLYARCGRL 11
           AFYKSAG ALL +YA   R+
Sbjct: 337 AFYKSAGPALLVMYATNRRM 356


>ref|XP_008344308.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like isoform X1 [Malus domestica]
          Length = 690

 Score =  169 bits (427), Expect = 3e-46
 Identities = 84/143 (58%), Positives = 107/143 (74%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A+ELF  M +D CE  PDVVT NTVMDAYC++G   +A R+FE+IKDPN+ISWTTLI G+
Sbjct: 261 AVELFDCMHLDVCE--PDVVTLNTVMDAYCRLGHCDEAKRIFEQIKDPNIISWTTLISGF 318

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G H+ +L+ FR M++   V+PD+D+LS ++VSCR LG+L  G+EIHGYGIK +    
Sbjct: 319 SRIGNHESSLKIFRDMMDGSRVYPDLDSLSAVJVSCRHLGSLLNGKEIHGYGIK-IGSXI 377

Query: 70  AFYKSAGAALLTLYARCGRLHDA 2
           AFY SAG ALL LYA C R+ DA
Sbjct: 378 AFYSSAGPALLILYANCSRIQDA 400



 Score = 58.2 bits (139), Expect = 6e-07
 Identities = 38/115 (33%), Positives = 57/115 (49%)
 Frame = -3

Query: 346 YCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVFPDVDA 167
           Y     I DA  VF  +   +V+SW  +ILG+   G  D+ALE FRKM     V  D   
Sbjct: 391 YANCSRIQDAINVFRLMNPADVVSWNAMILGFIDLGLXDLALECFRKMQR-AQVKADQTT 449

Query: 166 LSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYARCGRLHDA 2
           +S  L +C     L+ G++IH + ++  S +         AL+ +YA+CG +  A
Sbjct: 450 ISTXLPTCN----LKFGKQIHAF-VRKSSFD--LVAPVWNALIHMYAKCGCIESA 497


>ref|XP_007047218.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic [Theobroma cacao]
 gb|EOX91375.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
           cacao]
          Length = 635

 Score =  168 bits (425), Expect = 4e-46
 Identities = 82/147 (55%), Positives = 107/147 (72%)
 Frame = -3

Query: 442 LFEKALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTL 263
           + E  LE+   M +DG E  PDVVTWN VMD YC++G   +A ++FE IK+PN+ISWTTL
Sbjct: 202 MLEFGLEILNCMRLDGFE--PDVVTWNMVMDGYCRMGRCDEALKIFEYIKEPNIISWTTL 259

Query: 262 ILGYSGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNL 83
           I GYS  G+H+ +L  F+ M+N G+V PD+D LS  LVSCR LGAL  G+EIHG+GIK +
Sbjct: 260 ISGYSRIGQHESSLRIFKDMLNKGVVLPDLDCLSSALVSCRHLGALLSGKEIHGFGIK-M 318

Query: 82  SLNNAFYKSAGAALLTLYARCGRLHDA 2
            +  +FY SAG ALLTL+++CGR  DA
Sbjct: 319 MIGRSFYGSAGPALLTLHSKCGRSRDA 345



 Score = 57.4 bits (137), Expect = 1e-06
 Identities = 37/126 (29%), Positives = 62/126 (49%), Gaps = 4/126 (3%)
 Frame = -3

Query: 367 WNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGM 188
           WN ++  Y K G IG A  VF  +   +++SW T+I G++  G  + AL+  ++M  +G 
Sbjct: 426 WNALVHMYSKCGSIGSAYSVFSNMVARDLVSWNTMIGGFALHGLGEAALQLLKEMNYLG- 484

Query: 187 VFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAG----AALLTLYARC 20
           V P    L+  L +C   G +  G ++        S+   F+ S      A ++ + +R 
Sbjct: 485 VCPSPVTLTSALSACNHSGLVDEGLKVFS------SMTRGFHLSPSMEHFACVVDMLSRA 538

Query: 19  GRLHDA 2
           GRL DA
Sbjct: 539 GRLEDA 544


>ref|XP_024183906.1| putative pentatricopeptide repeat-containing protein At3g23330
           isoform X1 [Rosa chinensis]
          Length = 666

 Score =  168 bits (425), Expect = 5e-46
 Identities = 85/143 (59%), Positives = 105/143 (73%)
 Frame = -3

Query: 430 ALELFGFMGMDGCECDPDVVTWNTVMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGY 251
           A+ELFG M  DGCE  PDVVT NTVMDAYC++GL  +A  +F+ IK+PN+ISWTTLI GY
Sbjct: 238 AIELFGCMNSDGCE--PDVVTLNTVMDAYCRMGLCDEAKGIFKHIKEPNIISWTTLISGY 295

Query: 250 SGFGKHDVALEFFRKMVNIGMVFPDVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNN 71
           S  G H+ +L  FR M+N  MV+PD+D+LS  LVS R LG+L  G+EIHGYG+K  S   
Sbjct: 296 SRIGNHEASLGIFRDMMNSSMVYPDLDSLSIALVSSRHLGSLLSGKEIHGYGLKRES-GI 354

Query: 70  AFYKSAGAALLTLYARCGRLHDA 2
            FY SAG ALLT+YA C ++ DA
Sbjct: 355 VFYISAGPALLTMYANCRKIQDA 377



 Score = 56.6 bits (135), Expect = 2e-06
 Identities = 37/114 (32%), Positives = 57/114 (50%)
 Frame = -3

Query: 358 VMDAYCKVGLIGDASRVFEKIKDPNVISWTTLILGYSGFGKHDVALEFFRKMVNIGMVFP 179
           ++  Y     I DA  VF  +     +SW  +ILG+   G  D+ALE FRKM  I  +  
Sbjct: 364 LLTMYANCRKIQDAENVFRFMDPAQAVSWNAMILGFIDLGLEDLALECFRKM-QIAEIKL 422

Query: 178 DVDALSGILVSCRSLGALRIGREIHGYGIKNLSLNNAFYKSAGAALLTLYARCG 17
           D   LS +L +C     L+ G++IH + I+  S +         AL+ +Y++CG
Sbjct: 423 DQTTLSTVLPTCN----LKFGKQIHAF-IRKSSFD--LVVPVWNALIHMYSKCG 469


Top