BLASTX nr result

ID: Rehmannia29_contig00007142 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00007142
         (860 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011082312.1| pentatricopeptide repeat-containing protein ...   454   e-155
gb|PIN17871.1| hypothetical protein CDL12_09472 [Handroanthus im...   444   e-151
gb|EYU23303.1| hypothetical protein MIMGU_mgv1a003335mg [Erythra...   444   e-151
ref|XP_012854228.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-150
ref|XP_019252293.1| PREDICTED: pentatricopeptide repeat-containi...   412   e-138
gb|KZV46461.1| pentatricopeptide repeat-containing protein-like ...   409   e-138
ref|XP_009625891.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-136
ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-136
ref|XP_015084956.1| PREDICTED: pentatricopeptide repeat-containi...   405   e-136
ref|XP_009797211.1| PREDICTED: pentatricopeptide repeat-containi...   405   e-136
ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containi...   405   e-135
ref|XP_022736385.1| pentatricopeptide repeat-containing protein ...   401   e-135
gb|PHT58485.1| Pentatricopeptide repeat-containing protein [Caps...   402   e-134
ref|XP_022875458.1| pentatricopeptide repeat-containing protein ...   402   e-134
ref|XP_017973405.1| PREDICTED: pentatricopeptide repeat-containi...   397   e-134
gb|OVA02841.1| Pentatricopeptide repeat [Macleaya cordata]            400   e-134
ref|XP_022736384.1| pentatricopeptide repeat-containing protein ...   401   e-134
ref|XP_016571538.1| PREDICTED: pentatricopeptide repeat-containi...   401   e-134
ref|XP_018843924.1| PREDICTED: pentatricopeptide repeat-containi...   400   e-133
ref|XP_017973404.1| PREDICTED: pentatricopeptide repeat-containi...   397   e-132

>ref|XP_011082312.1| pentatricopeptide repeat-containing protein At1g31920 [Sesamum
            indicum]
          Length = 606

 Score =  454 bits (1169), Expect = e-155
 Identities = 222/285 (77%), Positives = 246/285 (86%)
 Frame = -2

Query: 856  LFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMY 677
            LF  M+ EGCWRAEESTLVSV+SAC HLGALDLG+ THGYLLRNLSG NVAV+T L+DMY
Sbjct: 220  LFGEMNFEGCWRAEESTLVSVVSACAHLGALDLGRSTHGYLLRNLSGLNVAVQTSLIDMY 279

Query: 676  IRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIYV 497
            I+CGSLDKGMSLFQ  V KN  SY+V++SGLA HG GE+AL IFEQML EGLKPDDV+YV
Sbjct: 280  IKCGSLDKGMSLFQRTVRKNRNSYTVVVSGLANHGRGEEALNIFEQMLGEGLKPDDVVYV 339

Query: 496  GVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMPM 317
            G+LSAC + GLVEKGM+ FDRMR EH IEPTIQHYGC+VDLMGR G + E FELIK MPM
Sbjct: 340  GLLSACSHAGLVEKGMRCFDRMRFEHGIEPTIQHYGCMVDLMGRAGKISEGFELIKCMPM 399

Query: 316  DPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSLT 137
             PNDVIWRSLLSSCKVHQNVELGEIAA +LF+L TQN GDY+MLSNIYA+A RW DVS+T
Sbjct: 400  APNDVIWRSLLSSCKVHQNVELGEIAAGHLFQLKTQNAGDYLMLSNIYAEAGRWQDVSMT 459

Query: 136  RVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            RVKMA  GLGQV GSS VE+KRKVHKF+SNDKSH +  EIYEMLH
Sbjct: 460  RVKMACMGLGQVPGSSSVEVKRKVHKFVSNDKSHPQCYEIYEMLH 504



 Score = 73.6 bits (179), Expect = 6e-11
 Identities = 60/226 (26%), Positives = 102/226 (45%), Gaps = 39/226 (17%)
 Frame = -2

Query: 808 TLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEM 629
           T  S+L AC  L A++ GK  HG +L+     +V V+  L++MY +CG +    ++F++M
Sbjct: 134 TYPSLLKACASLSAVEEGKQIHGQVLKLGFVEDVFVQNSLINMYGKCGQIKHSRAVFEQM 193

Query: 628 VEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIYVGVLSACRN------- 473
             K   S+S +I+  A  G  ++ L++F +M  EG  + ++   V V+SAC +       
Sbjct: 194 DRKTVASWSAVIAAYANLGMWDECLSLFGEMNFEGCWRAEESTLVSVVSACAHLGALDLG 253

Query: 472 ----------------------------GGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVD 377
                                        G ++KGM  F R   ++R       Y  +V 
Sbjct: 254 RSTHGYLLRNLSGLNVAVQTSLIDMYIKCGSLDKGMSLFQRTVRKNR-----NSYTVVVS 308

Query: 376 LMGRVGMVYEAFELIKKM---PMDPNDVIWRSLLSSCKVHQNVELG 248
            +   G   EA  + ++M    + P+DV++  LLS+C     VE G
Sbjct: 309 GLANHGRGEEALNIFEQMLGEGLKPDDVVYVGLLSACSHAGLVEKG 354


>gb|PIN17871.1| hypothetical protein CDL12_09472 [Handroanthus impetiginosus]
          Length = 606

 Score =  444 bits (1143), Expect = e-151
 Identities = 215/286 (75%), Positives = 243/286 (84%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            +LFS M+REGCWRAEESTLVSVL ACTHLG LD G+  HGYLLRNL+G NVAVET L+DM
Sbjct: 219  KLFSQMNREGCWRAEESTLVSVLLACTHLGVLDSGRAIHGYLLRNLTGLNVAVETSLIDM 278

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            YI CG+LDKG+ LF+ M  +N KSYSV+ISGLATHG+GE+AL IFEQMLEE  +P+DV+Y
Sbjct: 279  YIHCGNLDKGICLFRGMGRRNRKSYSVVISGLATHGHGEEALKIFEQMLEERSEPEDVVY 338

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC    LV++G KYFDRMR EH IEPT+QHYGCIVDLMGR GMVYEA E IK MP
Sbjct: 339  VGVLSACSRARLVKEGKKYFDRMRFEHGIEPTMQHYGCIVDLMGRAGMVYEALEFIKAMP 398

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            ++PNDVIWR  LSSCK+HQNVELGEIAAENLFK  TQN  DY+MLSN+YA AQRWDDVSL
Sbjct: 399  IEPNDVIWRIFLSSCKIHQNVELGEIAAENLFKKTTQNACDYLMLSNMYAHAQRWDDVSL 458

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            TRVKMA KGL  V GSS VE+KRKVH+F+SND+SH +  EIYEMLH
Sbjct: 459  TRVKMAQKGLDHVPGSSFVEVKRKVHRFVSNDRSHPQCHEIYEMLH 504


>gb|EYU23303.1| hypothetical protein MIMGU_mgv1a003335mg [Erythranthe guttata]
          Length = 592

 Score =  444 bits (1141), Expect = e-151
 Identities = 214/287 (74%), Positives = 252/287 (87%), Gaps = 1/287 (0%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            RLFS+M+ EG WRAEESTLVSVLSACT LG LD G+CTHGYL+RNL+GFNVAV+T LMDM
Sbjct: 204  RLFSDMNWEGKWRAEESTLVSVLSACTRLGVLDSGRCTHGYLIRNLTGFNVAVQTSLMDM 263

Query: 679  YIRCGSLDKGMSLFQEMVEK-NHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVI 503
            Y+R GSLDKGMSLF EM EK N KSYSV+ISGLATHG+GE+AL +F++MLE GLKPDDV 
Sbjct: 264  YVRSGSLDKGMSLFLEMGEKKNRKSYSVVISGLATHGHGEEALKVFDEMLERGLKPDDVA 323

Query: 502  YVGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKM 323
            YVGVLSAC + GLVE+G KYFDRMR+EHR+EPTIQH GC+VDLMGR G++ EA E IK M
Sbjct: 324  YVGVLSACSHAGLVEEGKKYFDRMRIEHRVEPTIQHCGCMVDLMGRAGLIREALEFIKNM 383

Query: 322  PMDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVS 143
             ++PN+VIWRSLLSSC+VHQNVELGE+AAENLFK+NT+N GDY+ L NIYAQA+RW+++S
Sbjct: 384  KIEPNEVIWRSLLSSCRVHQNVELGELAAENLFKMNTRNAGDYLNLCNIYAQARRWEEMS 443

Query: 142  LTRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            +TRVKMA+ GLGQ  GSS VE+KRKVHKF+S+D SH +  EIYEMLH
Sbjct: 444  ITRVKMASNGLGQEPGSSSVEVKRKVHKFVSSDTSHSQCDEIYEMLH 490


>ref|XP_012854228.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Erythranthe guttata]
          Length = 607

 Score =  444 bits (1141), Expect = e-150
 Identities = 214/287 (74%), Positives = 252/287 (87%), Gaps = 1/287 (0%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            RLFS+M+ EG WRAEESTLVSVLSACT LG LD G+CTHGYL+RNL+GFNVAV+T LMDM
Sbjct: 219  RLFSDMNWEGKWRAEESTLVSVLSACTRLGVLDSGRCTHGYLIRNLTGFNVAVQTSLMDM 278

Query: 679  YIRCGSLDKGMSLFQEMVEK-NHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVI 503
            Y+R GSLDKGMSLF EM EK N KSYSV+ISGLATHG+GE+AL +F++MLE GLKPDDV 
Sbjct: 279  YVRSGSLDKGMSLFLEMGEKKNRKSYSVVISGLATHGHGEEALKVFDEMLERGLKPDDVA 338

Query: 502  YVGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKM 323
            YVGVLSAC + GLVE+G KYFDRMR+EHR+EPTIQH GC+VDLMGR G++ EA E IK M
Sbjct: 339  YVGVLSACSHAGLVEEGKKYFDRMRIEHRVEPTIQHCGCMVDLMGRAGLIREALEFIKNM 398

Query: 322  PMDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVS 143
             ++PN+VIWRSLLSSC+VHQNVELGE+AAENLFK+NT+N GDY+ L NIYAQA+RW+++S
Sbjct: 399  KIEPNEVIWRSLLSSCRVHQNVELGELAAENLFKMNTRNAGDYLNLCNIYAQARRWEEMS 458

Query: 142  LTRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            +TRVKMA+ GLGQ  GSS VE+KRKVHKF+S+D SH +  EIYEMLH
Sbjct: 459  ITRVKMASNGLGQEPGSSSVEVKRKVHKFVSSDTSHSQCDEIYEMLH 505



 Score = 58.2 bits (139), Expect = 8e-06
 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 1/120 (0%)
 Frame = -2

Query: 808 TLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEM 629
           T   +L AC+ L A   G   HG + +     +V V+  L+++Y +CG + +  ++F+ M
Sbjct: 134 TYPPLLKACSILSAFAEGAQIHGQIYKMGFVEDVMVQNSLINVYGKCGRVKRSCAVFRRM 193

Query: 628 VEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEG-LKPDDVIYVGVLSACRNGGLVEKG 452
             K   S+S +I+  A  G  ++ L +F  M  EG  + ++   V VLSAC   G+++ G
Sbjct: 194 DHKTIASWSALIAAHANLGMWKECLRLFSDMNWEGKWRAEESTLVSVLSACTRLGVLDSG 253


>ref|XP_019252293.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Nicotiana attenuata]
 gb|OIS99560.1| pentatricopeptide repeat-containing protein [Nicotiana attenuata]
          Length = 605

 Score =  412 bits (1060), Expect = e-138
 Identities = 195/286 (68%), Positives = 237/286 (82%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            +LF  M+ EGCWRAEESTLVSV+SACTHL ALD GK THGYL RN+SG N  VET L+DM
Sbjct: 218  KLFGEMNYEGCWRAEESTLVSVISACTHLDALDFGKATHGYLARNMSGLNNIVETSLIDM 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CGSLDKG+ LFQ M  KN  SYS IISGLA HG GE+AL+I+ +ML+E L PDDV+Y
Sbjct: 278  YVKCGSLDKGLFLFQRMANKNQMSYSAIISGLALHGRGEEALSIYHEMLKERLDPDDVVY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K FDRMRLEH+IEPTIQHYGC+VDL+GR G + EA ELIK MP
Sbjct: 338  VGVLSACSHAGLVEQGLKCFDRMRLEHQIEPTIQHYGCMVDLLGRAGRLEEALELIKGMP 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLS+C+VH NVELGE AAE+LF++N++N  DY+ML N+YAQAQ W+  ++
Sbjct: 398  MEPNDVLWRSLLSACRVHHNVELGEFAAEHLFQMNSRNASDYVMLCNMYAQAQMWEKKAV 457

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            TR KMAN+G+ QV GS LVEI RK++KF+S D+SH    E+YEMLH
Sbjct: 458  TRTKMANEGISQVPGSCLVEINRKMYKFVSQDRSHPCSEEVYEMLH 503


>gb|KZV46461.1| pentatricopeptide repeat-containing protein-like [Dorcoceras
           hygrometricum]
          Length = 524

 Score =  409 bits (1052), Expect = e-138
 Identities = 197/286 (68%), Positives = 240/286 (83%)
 Frame = -2

Query: 859 RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
           RLF  M+ +  WRAEES LVSVLS+CTHLGALDLG+CTHGYLLRNLSGFNVAVET L+DM
Sbjct: 138 RLFREMNNDNAWRAEESILVSVLSSCTHLGALDLGRCTHGYLLRNLSGFNVAVETSLIDM 197

Query: 679 YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
           YI+CGS+DKG+SLF++M  KN  SYSVIISGLA HG G++AL  FE+MLEEGLKPDD+IY
Sbjct: 198 YIKCGSIDKGLSLFRKMRLKNEMSYSVIISGLAHHGRGKEALETFEKMLEEGLKPDDIIY 257

Query: 499 VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
           VGVL+AC + GLVE+G+KYF +MR EH I P IQHYGCIVDLMGR GMV  AFE IK MP
Sbjct: 258 VGVLNACSHDGLVEEGLKYFKKMR-EHGIVPAIQHYGCIVDLMGRAGMVDRAFETIKSMP 316

Query: 319 MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
           M PND+IWRSLL+SCK+HQNVE+ E+AA++L   ++QN  D++M+SN+YAQA+RW+DVS 
Sbjct: 317 MKPNDIIWRSLLTSCKIHQNVEIAEVAAKSLLHTHSQNASDFLMMSNVYAQAKRWEDVSY 376

Query: 139 TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            R +MA   + QV GSSLVE++RKV+KF+S+D+SH    +IYEMLH
Sbjct: 377 IRTQMARLRVPQVPGSSLVEVRRKVYKFVSSDQSHPNSQDIYEMLH 422



 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 57/222 (25%), Positives = 99/222 (44%), Gaps = 39/222 (17%)
 Frame = -2

Query: 796 VLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEMVEKN 617
           +L A T L A+  G   HG +L+     +V V+  L++MY RCG +      F++M +K+
Sbjct: 57  LLKAITQLSAIGEGTQIHGQILKKGFVEDVLVQNSLINMYGRCGKIRHSCVAFEQMAKKS 116

Query: 616 HKSYSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIYVGVLSACRN----------- 473
             S+S +I   A  G  ++ L +F +M  +   + ++ I V VLS+C +           
Sbjct: 117 IASWSALIGSHANLGMWDECLRLFREMNNDNAWRAEESILVSVLSSCTHLGALDLGRCTH 176

Query: 472 ------------------------GGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGR 365
                                    G ++KG+  F +MRL++ +      Y  I+  +  
Sbjct: 177 GYLLRNLSGFNVAVETSLIDMYIKCGSIDKGLSLFRKMRLKNEMS-----YSVIISGLAH 231

Query: 364 VGMVYEAFELIKKM---PMDPNDVIWRSLLSSCKVHQNVELG 248
            G   EA E  +KM    + P+D+I+  +L++C     VE G
Sbjct: 232 HGRGKEALETFEKMLEEGLKPDDIIYVGVLNACSHDGLVEEG 273


>ref|XP_009625891.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Nicotiana tomentosiformis]
 ref|XP_016508238.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Nicotiana tabacum]
          Length = 605

 Score =  407 bits (1047), Expect = e-136
 Identities = 189/286 (66%), Positives = 239/286 (83%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            +LF  M+ EGCWRAEES LVSV+SACTHL ALD GK THGYL+RN++G N+ VET L+DM
Sbjct: 218  KLFGEMNSEGCWRAEESALVSVISACTHLDALDFGKATHGYLVRNMNGLNIIVETSLIDM 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CGSLDKG+ LFQ M +KN  SYS IISGLA HG GE+AL+I+ +ML+E L+PDDV+Y
Sbjct: 278  YVKCGSLDKGLFLFQRMAKKNQMSYSAIISGLALHGRGEEALSIYHEMLKERLEPDDVVY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K F+RMRL+H+IEPTIQHYGC+VDL+GR G + EA ELIK MP
Sbjct: 338  VGVLSACSHAGLVEEGLKCFNRMRLKHQIEPTIQHYGCMVDLLGRAGRLDEALELIKGMP 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLS+C+ HQNVELGE+AAENLF++N++N  DY+ML N+YAQAQ W+  ++
Sbjct: 398  MEPNDVLWRSLLSACRAHQNVELGELAAENLFQMNSRNASDYVMLCNMYAQAQMWEKKAV 457

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
               KMAN+G+ QV GS LVE+ RK++KF+S D+SH    E+YEMLH
Sbjct: 458  IWTKMANEGISQVPGSCLVEVNRKMYKFVSQDRSHPCSEEVYEMLH 503


>ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Solanum tuberosum]
          Length = 605

 Score =  407 bits (1047), Expect = e-136
 Identities = 191/286 (66%), Positives = 236/286 (82%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            ++F  M+ EGCWRAEESTLVSV+SACTHL ALD GK THGYLLRN++G NV VET L+DM
Sbjct: 218  KVFGEMNSEGCWRAEESTLVSVISACTHLDALDFGKATHGYLLRNMTGLNVIVETSLIDM 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CG L+KG+ LFQ M  KN  SYS IISGLA HG GE+AL I+ +ML+E ++PDDV+Y
Sbjct: 278  YVKCGCLEKGLFLFQRMANKNQMSYSAIISGLALHGRGEEALRIYHEMLKERIEPDDVVY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K FDRMRLEHRIEPTIQHYGC+VDL+GR G + EA ELIK MP
Sbjct: 338  VGVLSACSHAGLVEEGLKCFDRMRLEHRIEPTIQHYGCMVDLLGRAGRLEEALELIKGMP 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLSSC+VHQNVELGE+AA+NLF L ++N  DY+ML NIYAQA+ W+ +++
Sbjct: 398  MEPNDVLWRSLLSSCRVHQNVELGEVAAKNLFMLKSRNASDYVMLCNIYAQAKMWEKMAV 457

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
             R KM N+G+ QV GS LVE  RK++KF+S D+SH    E+YEM+H
Sbjct: 458  IRTKMVNEGIIQVPGSCLVEADRKLYKFVSQDRSHTCSDEVYEMIH 503


>ref|XP_015084956.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Solanum pennellii]
          Length = 605

 Score =  405 bits (1042), Expect = e-136
 Identities = 190/286 (66%), Positives = 235/286 (82%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            R+F+ M+ EGCWRAEESTLVSV+SACTHL ALD GK THGYLLRN++G NV VE+ L+DM
Sbjct: 218  RVFAEMNSEGCWRAEESTLVSVISACTHLNALDFGKATHGYLLRNMTGLNVIVESSLIDM 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CG L+KG+ LFQ M  KN  SYS IISGLA HG GE+AL I+ +ML+  ++PDDV+Y
Sbjct: 278  YVKCGCLEKGLFLFQRMANKNQMSYSAIISGLALHGRGEEALRIYHEMLKARIEPDDVVY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K FDRMRLEHRIEPTIQHYGC+VDL+GR G + EA ELIK MP
Sbjct: 338  VGVLSACSHAGLVEEGLKCFDRMRLEHRIEPTIQHYGCMVDLLGRTGRLEEALELIKGMP 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLS+C+VHQNVELGE+AA+NLF L ++N  DY+ML NIYAQA+ W+ +S 
Sbjct: 398  MEPNDVLWRSLLSACRVHQNVELGEVAAKNLFMLKSRNASDYVMLCNIYAQAKMWEKMSA 457

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
             R KM N+G+ QV GS LVE  RK++KF+S D+SH    E+YEM+H
Sbjct: 458  IRTKMVNEGIIQVTGSCLVEADRKLYKFVSQDRSHTCSDEVYEMIH 503


>ref|XP_009797211.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Nicotiana sylvestris]
 ref|XP_016506179.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like
            [Nicotiana tabacum]
          Length = 605

 Score =  405 bits (1042), Expect = e-136
 Identities = 190/286 (66%), Positives = 238/286 (83%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            +LF +++ E CWRAEESTLVSV+SACTHL ALD GK THGYL+RN+SG NV VET L+DM
Sbjct: 218  KLFGDLNSERCWRAEESTLVSVISACTHLDALDFGKATHGYLVRNMSGLNVIVETSLIDM 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CGSLDKG+ LFQ M  KN  SYS IISGLA HG GE+AL I+ +ML+E ++ DDV+Y
Sbjct: 278  YVKCGSLDKGLFLFQIMANKNQMSYSAIISGLALHGRGEEALRIYHEMLKERIEADDVVY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K F+RMRLEHRIEPTIQHYGC++DL+GR G + EA ELIK MP
Sbjct: 338  VGVLSACSHAGLVEEGLKCFNRMRLEHRIEPTIQHYGCMIDLLGRAGRLDEALELIKGMP 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PN V+WRSLLS C+VHQNVELGE+AAENLF+LN++N  DY+ML N+YA+A+ W+  ++
Sbjct: 398  MEPNSVLWRSLLSGCRVHQNVELGELAAENLFQLNSRNASDYVMLCNMYAKAKMWEKKAV 457

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            TR KMAN+G+ QV GS LVE+ RK++KF+S D+SH    E+YEMLH
Sbjct: 458  TRTKMANEGISQVPGSCLVEVNRKMYKFVSQDRSHPCSEEVYEMLH 503


>ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Solanum lycopersicum]
          Length = 605

 Score =  405 bits (1041), Expect = e-135
 Identities = 190/286 (66%), Positives = 235/286 (82%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            R+F+ M+ EGCWRAEESTLVSV+SACTHL ALD GK THGYLLRN++G NV VET L+DM
Sbjct: 218  RVFAEMNSEGCWRAEESTLVSVISACTHLNALDFGKATHGYLLRNMTGLNVIVETSLIDM 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CG L+KG+ LFQ M  KN  SYS IISGLA HG GE+AL I+ +ML+  ++PDDV+Y
Sbjct: 278  YVKCGCLEKGLFLFQRMANKNQMSYSAIISGLALHGRGEEALRIYHEMLKARIEPDDVVY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K FDRMRLEHRIEPTIQHYGC+VDL+GR G + EA ELIK MP
Sbjct: 338  VGVLSACSHAGLVEEGLKCFDRMRLEHRIEPTIQHYGCMVDLLGRTGRLKEALELIKGMP 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLS+C+VHQNVELGE+AA+NLF L ++N  DY+ML NIYAQA+ W+ +S 
Sbjct: 398  MEPNDVLWRSLLSACRVHQNVELGEVAAKNLFMLKSRNASDYVMLCNIYAQAKMWEKMSA 457

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
             R KM N+G+ QV GS LVE  RK++KF+S D+SH    E+Y+M+H
Sbjct: 458  IRTKMVNEGIIQVPGSCLVEADRKLYKFVSQDRSHTCSDEVYDMIH 503


>ref|XP_022736385.1| pentatricopeptide repeat-containing protein At1g31920 isoform X2
           [Durio zibethinus]
          Length = 510

 Score =  401 bits (1030), Expect = e-135
 Identities = 183/285 (64%), Positives = 239/285 (83%)
 Frame = -2

Query: 856 LFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMY 677
           +F NMS EGCWR EESTLV+VLSAC +LGALDLGKCT G LLRN+S  NV V+T L+DMY
Sbjct: 124 IFGNMSSEGCWRPEESTLVTVLSACAYLGALDLGKCTQGSLLRNISELNVIVQTSLIDMY 183

Query: 676 IRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIYV 497
           ++CG L+KG+S+F++M ++N  SY+VIISGLATHG+GE+AL IF +MLEEGL PDDV+YV
Sbjct: 184 VKCGCLEKGLSVFRKMAKRNQMSYTVIISGLATHGHGEEALRIFLEMLEEGLDPDDVVYV 243

Query: 496 GVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMPM 317
           GVLSAC + GL+ +G ++FDR++ EH I+PT+QHYGC+VDLMG+ GM+ EA E IK MP+
Sbjct: 244 GVLSACSHAGLIHEGFQFFDRLKSEHGIKPTVQHYGCMVDLMGKAGMIDEALEFIKSMPI 303

Query: 316 DPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSLT 137
            PNDV+WRSLLS+C+VH N+E+ EIAA++LF+LN+QN GDY++LSN+YA+AQRW +V+  
Sbjct: 304 KPNDVVWRSLLSACRVHCNLEIAEIAAKHLFQLNSQNPGDYVILSNMYARAQRWQEVAKI 363

Query: 136 RVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
           RV+MA KGL QV G SLVE+ R++HKF+S D SH +   +YEM+H
Sbjct: 364 RVEMARKGLHQVPGYSLVEVGRRIHKFVSQDTSHHQCESVYEMIH 408



 Score = 67.4 bits (163), Expect = 6e-09
 Identities = 62/240 (25%), Positives = 108/240 (45%), Gaps = 44/240 (18%)
 Frame = -2

Query: 787 ACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEMVEKNHKS 608
           AC  L A + G   HG+  +     ++ V+  L++MY +CG +    ++F++M EK+  S
Sbjct: 45  ACAWLQAQEEGMQIHGHAFKLGFENDLYVQNSLINMYGKCGEIKHSCAVFEQMDEKSVAS 104

Query: 607 YSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIYVGVLSAC---------------- 479
           +S II+  A+ G   + L IF  M  EG  +P++   V VLSAC                
Sbjct: 105 WSAIIAAHASVGMWYECLMIFGNMSSEGCWRPEESTLVTVLSACAYLGALDLGKCTQGSL 164

Query: 478 -RN------------------GGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGM 356
            RN                   G +EKG+  F +M   +++  T+     I+  +   G 
Sbjct: 165 LRNISELNVIVQTSLIDMYVKCGCLEKGLSVFRKMAKRNQMSYTV-----IISGLATHGH 219

Query: 355 VYEAFELIKKM---PMDPNDVIWRSLLSSCK----VHQNVE-LGEIAAENLFKLNTQNGG 200
             EA  +  +M    +DP+DV++  +LS+C     +H+  +    + +E+  K   Q+ G
Sbjct: 220 GEEALRIFLEMLEEGLDPDDVVYVGVLSACSHAGLIHEGFQFFDRLKSEHGIKPTVQHYG 279


>gb|PHT58485.1| Pentatricopeptide repeat-containing protein [Capsicum baccatum]
          Length = 605

 Score =  402 bits (1034), Expect = e-134
 Identities = 188/286 (65%), Positives = 234/286 (81%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            ++F  M+ EGCWRAEESTLVSV+SAC HL ALD GK THGYLLRN++G NV VET L+DM
Sbjct: 218  KVFGEMNTEGCWRAEESTLVSVISACAHLDALDFGKATHGYLLRNMTGLNVIVETSLIDM 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CG L+KG+ LFQ M  KN  SYS IISGLA HG GE+AL I+ +ML+E L+PDDV+Y
Sbjct: 278  YVKCGCLEKGLFLFQRMTNKNQMSYSTIISGLAMHGRGEEALRIYHEMLKERLEPDDVVY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K FDRMRLEHRIEPTIQHYGC+VDL+GR G + EA ELIK M 
Sbjct: 338  VGVLSACSHAGLVEEGLKCFDRMRLEHRIEPTIQHYGCMVDLLGRAGRLEEALELIKGMS 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLS+C+VH+NVELGE+AA+NLFKL ++N  DY+ML N+YAQA+ W+ ++ 
Sbjct: 398  MEPNDVLWRSLLSACRVHRNVELGEVAAKNLFKLKSRNASDYVMLCNMYAQAKMWEKMAA 457

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
             R KM N+G+ QV GS LVE+ RKV+KF+S D+SH     +Y+MLH
Sbjct: 458  IRTKMVNEGISQVPGSCLVEVNRKVYKFVSQDRSHTCSDVVYDMLH 503


>ref|XP_022875458.1| pentatricopeptide repeat-containing protein At1g31920 [Olea europaea
            var. sylvestris]
 ref|XP_022875459.1| pentatricopeptide repeat-containing protein At1g31920 [Olea europaea
            var. sylvestris]
 ref|XP_022875460.1| pentatricopeptide repeat-containing protein At1g31920 [Olea europaea
            var. sylvestris]
          Length = 603

 Score =  402 bits (1033), Expect = e-134
 Identities = 193/287 (67%), Positives = 241/287 (83%), Gaps = 1/287 (0%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            RLF  M+  G  RAEES LV+VL ACTHLGALDL  CTHGYLLRNL+G NV VET L+D+
Sbjct: 219  RLFGEMNEIGL-RAEESILVNVLCACTHLGALDLAMCTHGYLLRNLTGLNVIVETTLIDV 277

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CG  D+G+ LF++M E+N  S+SV+ISGLA HG  ++A  IFEQM+E+GLKPDDVIY
Sbjct: 278  YMKCGFPDRGLLLFEKMSERNQMSHSVVISGLAIHGRAQEAFKIFEQMIEQGLKPDDVIY 337

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLV++G KYFD++R EHRI+PTIQH+GC+VDLMGR GM+ EA ELI+ MP
Sbjct: 338  VGVLSACNHAGLVQEGFKYFDKLRFEHRIQPTIQHFGCMVDLMGRAGMIDEAIELIRSMP 397

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLSSCK+HQNVELGE AA+NLF+LN++N  DYI++SN+YA+AQRW DVS+
Sbjct: 398  MEPNDVLWRSLLSSCKIHQNVELGEFAAKNLFQLNSRNASDYILISNMYAKAQRWHDVSM 457

Query: 139  TRVKMANKGL-GQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            TR +MANKGL  QV G SLVE+K+K++KF+SND SH    ++YEMLH
Sbjct: 458  TRTEMANKGLINQVPGYSLVEVKKKMYKFVSNDTSH---SQVYEMLH 501



 Score = 77.4 bits (189), Expect = 3e-12
 Identities = 55/215 (25%), Positives = 102/215 (47%), Gaps = 37/215 (17%)
 Frame = -2

Query: 808 TLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEM 629
           T  ++L ACT L A+D G   HG + +     +V V+  L++MY +CG + +   +F++M
Sbjct: 132 TYPALLKACTRLSAVDQGMQIHGQIFKLGFVDDVFVQNSLINMYGKCGDIKRSCVVFEQM 191

Query: 628 VE--KNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIYVGVLSACRNGGLV-- 461
            +  K   S+S +IS  A+ G   + L +F +M E GL+ ++ I V VL AC + G +  
Sbjct: 192 EDSRKTIASWSSVISAYASTGMWSECLRLFGEMNEIGLRAEESILVNVLCACTHLGALDL 251

Query: 460 ---------------------------------EKGMKYFDRMRLEHRIEPTIQHYGCIV 380
                                            ++G+  F++M   +++  ++   G  +
Sbjct: 252 AMCTHGYLLRNLTGLNVIVETTLIDVYMKCGFPDRGLLLFEKMSERNQMSHSVVISGLAI 311

Query: 379 DLMGRVGMVYEAFELIKKMPMDPNDVIWRSLLSSC 275
              GR    ++ FE + +  + P+DVI+  +LS+C
Sbjct: 312 H--GRAQEAFKIFEQMIEQGLKPDDVIYVGVLSAC 344


>ref|XP_017973405.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
           isoform X2 [Theobroma cacao]
          Length = 484

 Score =  397 bits (1021), Expect = e-134
 Identities = 182/285 (63%), Positives = 237/285 (83%)
 Frame = -2

Query: 856 LFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMY 677
           +F NMS EGCWR EESTLV+VLSACTHLGALDLGKCTHG LLRN+S  NV V+T LMDMY
Sbjct: 98  MFGNMSSEGCWRPEESTLVTVLSACTHLGALDLGKCTHGSLLRNISELNVIVQTSLMDMY 157

Query: 676 IRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIYV 497
           ++CG L+KG+SLF++M  ++  SY+V+ISGLA HG+G +AL I+ +ML++GL PDDV+YV
Sbjct: 158 VKCGCLEKGLSLFRKMGNRSQMSYTVMISGLAMHGHGAEALRIYSEMLKDGLDPDDVVYV 217

Query: 496 GVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMPM 317
           GVLSAC + GLV++G + FDRM+ EH I PT+QHYGC+VDLMG+ GM+ EA E IK MP+
Sbjct: 218 GVLSACSHAGLVDEGFRCFDRMKSEHGITPTVQHYGCMVDLMGKAGMINEALEFIKSMPI 277

Query: 316 DPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSLT 137
            PNDV+WRSLLS+C+VH N+E+GEIAA++LF+  +QN GDY++LSN+YA+AQRW +V+  
Sbjct: 278 KPNDVVWRSLLSACRVHCNLEIGEIAAKHLFQSKSQNPGDYVILSNMYARAQRWQEVAKI 337

Query: 136 RVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
           RV+MA KGL QV G SLVE+ R++HKF+S D SH + + +YEM+H
Sbjct: 338 RVEMARKGLHQVPGFSLVEVGRRIHKFVSQDTSHPQCVSVYEMIH 382



 Score = 68.2 bits (165), Expect = 3e-09
 Identities = 52/207 (25%), Positives = 94/207 (45%), Gaps = 36/207 (17%)
 Frame = -2

Query: 787 ACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEMVEKNHKS 608
           AC  L A + GK  HG+  +     ++ V+  L++MY +CG ++   ++F++M +K+  S
Sbjct: 19  ACACLQAQEEGKQIHGHAFKLGLESDLYVQNSLINMYGKCGEIEHSCAIFEQMDQKSVAS 78

Query: 607 YSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIYVGVLSACRN-------------- 473
           +S II+  A+ G   + L +F  M  EG  +P++   V VLSAC +              
Sbjct: 79  WSAIIAAHASFGKWYECLMMFGNMSSEGCWRPEESTLVTVLSACTHLGALDLGKCTHGSL 138

Query: 472 ---------------------GGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGM 356
                                 G +EKG+  F +M    ++  T+   G  + + G    
Sbjct: 139 LRNISELNVIVQTSLMDMYVKCGCLEKGLSLFRKMGNRSQMSYTVMISG--LAMHGHGAE 196

Query: 355 VYEAFELIKKMPMDPNDVIWRSLLSSC 275
               +  + K  +DP+DV++  +LS+C
Sbjct: 197 ALRIYSEMLKDGLDPDDVVYVGVLSAC 223


>gb|OVA02841.1| Pentatricopeptide repeat [Macleaya cordata]
          Length = 563

 Score =  400 bits (1027), Expect = e-134
 Identities = 188/286 (65%), Positives = 232/286 (81%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            RLF  MS EG WRA+ESTLVSVLS+CTHLGALDLG+  HG+L RN+S  NV V+T L+DM
Sbjct: 176  RLFGEMSSEGFWRADESTLVSVLSSCTHLGALDLGRSIHGFLSRNISDLNVIVQTSLIDM 235

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CGS++KG+ LFQ M +KN  SY+VIISGLA HG  E+AL IF+ ML+EGL PD+++Y
Sbjct: 236  YVKCGSIEKGLILFQNMPKKNQLSYTVIISGLAIHGRAEEALKIFQNMLKEGLDPDEIVY 295

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVL+AC +GGLV++G K+FD+MR EH+IEPTIQHYGC+VDL+ R G + EAFELIK MP
Sbjct: 296  VGVLTACSHGGLVDEGRKFFDKMRFEHQIEPTIQHYGCMVDLISRAGKIDEAFELIKSMP 355

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
              PNDVIWRSLLSSCKVHQ+ E  EIA+ NLF+L+ QN  DY++LSNIYA+A RW+D + 
Sbjct: 356  KKPNDVIWRSLLSSCKVHQDFEFAEIASRNLFELDPQNTSDYVLLSNIYAKAHRWEDAAK 415

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            TR  M NKGL Q+ G S VE+K+KVHKF+S DKSH E   IYEM+H
Sbjct: 416  TRKDMVNKGLNQIPGCSSVEVKKKVHKFVSQDKSHPESYAIYEMIH 461



 Score = 67.8 bits (164), Expect = 5e-09
 Identities = 71/321 (22%), Positives = 143/321 (44%), Gaps = 41/321 (12%)
 Frame = -2

Query: 856 LFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMY 677
           L+  M + G  + +  T  ++L AC++L A+  G   HG++ +     +V V+  L++MY
Sbjct: 76  LYKEMHQRGI-KPDNFTYPALLKACSYLSAIREGLQIHGHVFKFGFESDVFVQNSLINMY 134

Query: 676 IRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIY 500
            +CG +     +F++M + +  S+S +++     G   + L +F +M  EG  + D+   
Sbjct: 135 GKCGEIKLCCRVFEKMDQISVASWSALLASHTKLGLWRECLRLFGEMSSEGFWRADESTL 194

Query: 499 VGVLSACRN-----------------------------------GGLVEKGMKYFDRMRL 425
           V VLS+C +                                    G +EKG+  F  M  
Sbjct: 195 VSVLSSCTHLGALDLGRSIHGFLSRNISDLNVIVQTSLIDMYVKCGSIEKGLILFQNMPK 254

Query: 424 EHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMPMDPNDVIWRSLLSSCKVHQNVELG- 248
           ++++  T+   G  +   GR     + F+ + K  +DP+++++  +L++C     V+ G 
Sbjct: 255 KNQLSYTVIISGLAIH--GRAEEALKIFQNMLKEGLDPDEIVYVGVLTACSHGGLVDEGR 312

Query: 247 ----EIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSLTRVKMANKGLGQVVGSSLVE 80
               ++  E+  +   Q+   Y  + ++ ++A + D+ +   +K   K    V+  SL+ 
Sbjct: 313 KFFDKMRFEHQIEPTIQH---YGCMVDLISRAGKIDE-AFELIKSMPKKPNDVIWRSLLS 368

Query: 79  IKRKVHKFISNDKSHFEFLEI 17
              KVH+        FEF EI
Sbjct: 369 -SCKVHQ-------DFEFAEI 381


>ref|XP_022736384.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
            [Durio zibethinus]
          Length = 605

 Score =  401 bits (1030), Expect = e-134
 Identities = 183/285 (64%), Positives = 239/285 (83%)
 Frame = -2

Query: 856  LFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMY 677
            +F NMS EGCWR EESTLV+VLSAC +LGALDLGKCT G LLRN+S  NV V+T L+DMY
Sbjct: 219  IFGNMSSEGCWRPEESTLVTVLSACAYLGALDLGKCTQGSLLRNISELNVIVQTSLIDMY 278

Query: 676  IRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIYV 497
            ++CG L+KG+S+F++M ++N  SY+VIISGLATHG+GE+AL IF +MLEEGL PDDV+YV
Sbjct: 279  VKCGCLEKGLSVFRKMAKRNQMSYTVIISGLATHGHGEEALRIFLEMLEEGLDPDDVVYV 338

Query: 496  GVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMPM 317
            GVLSAC + GL+ +G ++FDR++ EH I+PT+QHYGC+VDLMG+ GM+ EA E IK MP+
Sbjct: 339  GVLSACSHAGLIHEGFQFFDRLKSEHGIKPTVQHYGCMVDLMGKAGMIDEALEFIKSMPI 398

Query: 316  DPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSLT 137
             PNDV+WRSLLS+C+VH N+E+ EIAA++LF+LN+QN GDY++LSN+YA+AQRW +V+  
Sbjct: 399  KPNDVVWRSLLSACRVHCNLEIAEIAAKHLFQLNSQNPGDYVILSNMYARAQRWQEVAKI 458

Query: 136  RVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            RV+MA KGL QV G SLVE+ R++HKF+S D SH +   +YEM+H
Sbjct: 459  RVEMARKGLHQVPGYSLVEVGRRIHKFVSQDTSHHQCESVYEMIH 503



 Score = 70.1 bits (170), Expect = 9e-10
 Identities = 64/247 (25%), Positives = 111/247 (44%), Gaps = 44/247 (17%)
 Frame = -2

Query: 808 TLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEM 629
           T  S+  AC  L A + G   HG+  +     ++ V+  L++MY +CG +    ++F++M
Sbjct: 133 TYPSLFKACAWLQAQEEGMQIHGHAFKLGFENDLYVQNSLINMYGKCGEIKHSCAVFEQM 192

Query: 628 VEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIYVGVLSAC--------- 479
            EK+  S+S II+  A+ G   + L IF  M  EG  +P++   V VLSAC         
Sbjct: 193 DEKSVASWSAIIAAHASVGMWYECLMIFGNMSSEGCWRPEESTLVTVLSACAYLGALDLG 252

Query: 478 --------RN------------------GGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVD 377
                   RN                   G +EKG+  F +M   +++  T+     I+ 
Sbjct: 253 KCTQGSLLRNISELNVIVQTSLIDMYVKCGCLEKGLSVFRKMAKRNQMSYTV-----IIS 307

Query: 376 LMGRVGMVYEAFELIKKM---PMDPNDVIWRSLLSSCK----VHQNVE-LGEIAAENLFK 221
            +   G   EA  +  +M    +DP+DV++  +LS+C     +H+  +    + +E+  K
Sbjct: 308 GLATHGHGEEALRIFLEMLEEGLDPDDVVYVGVLSACSHAGLIHEGFQFFDRLKSEHGIK 367

Query: 220 LNTQNGG 200
              Q+ G
Sbjct: 368 PTVQHYG 374


>ref|XP_016571538.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Capsicum annuum]
 gb|PHT93075.1| Pentatricopeptide repeat-containing protein [Capsicum annuum]
          Length = 606

 Score =  401 bits (1030), Expect = e-134
 Identities = 188/286 (65%), Positives = 233/286 (81%)
 Frame = -2

Query: 859  RLFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDM 680
            ++F  M+ EGCWRAEESTLVSV+SAC HL ALD GK THGYLLRN++G NV VET L+DM
Sbjct: 219  KVFGEMNTEGCWRAEESTLVSVISACAHLDALDFGKATHGYLLRNMTGLNVIVETSLIDM 278

Query: 679  YIRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIY 500
            Y++CG L+KG+ LFQ M  KN  SYS IISGLA HG GE+AL I+ +ML+E L+PDDV+Y
Sbjct: 279  YVKCGCLEKGLFLFQRMTNKNQMSYSTIISGLAMHGRGEEALRIYHEMLKERLEPDDVVY 338

Query: 499  VGVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMP 320
            VGVLSAC + GLVE+G+K FDRMRLEHRIEPTIQHYGC+VDL+GR G + EA ELIK M 
Sbjct: 339  VGVLSACSHAGLVEEGLKCFDRMRLEHRIEPTIQHYGCMVDLLGRAGRLEEALELIKGMS 398

Query: 319  MDPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSL 140
            M+PNDV+WRSLLS+C+VH+NVELGE+AA+NLFKL ++N  DY+ML N+YAQA+ W+ ++ 
Sbjct: 399  MEPNDVLWRSLLSACRVHRNVELGEVAAKNLFKLKSRNASDYVMLCNMYAQAKMWEKMAA 458

Query: 139  TRVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
             R KM N+G+ QV GS LVE  RKV+KF+S D+SH     +Y+MLH
Sbjct: 459  IRTKMVNEGISQVPGSCLVEANRKVYKFVSQDRSHTCSDVVYDMLH 504


>ref|XP_018843924.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Juglans regia]
          Length = 605

 Score =  400 bits (1027), Expect = e-133
 Identities = 187/285 (65%), Positives = 236/285 (82%)
 Frame = -2

Query: 856  LFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMY 677
            LF  M+REGCWRAEESTLVSVLSACTHLG+ DLG+CTHG LLRN+SG NV VET L+DMY
Sbjct: 219  LFGEMNREGCWRAEESTLVSVLSACTHLGSFDLGRCTHGSLLRNISGINVIVETSLIDMY 278

Query: 676  IRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIYV 497
            ++ G L+KG+S+FQ M +KN  SY+V+ISGLA HG G +AL +F +MLEEG+ PDDV+YV
Sbjct: 279  VKSGCLEKGLSIFQNMSKKNQFSYTVMISGLAMHGRGREALRVFSEMLEEGMVPDDVVYV 338

Query: 496  GVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMPM 317
            GVLSAC + GLV++G++ FDRM+ EHRI+PTIQHYGC+VDLMG+ GM+ EA ELI  MP+
Sbjct: 339  GVLSACSHAGLVKEGLQCFDRMKYEHRIDPTIQHYGCLVDLMGQAGMLNEASELINSMPI 398

Query: 316  DPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSLT 137
            +PNDV+WRSLLS+CK H N+ELGEIAA+NL +L +QN  DY++LSN+YA+AQRW+DV+  
Sbjct: 399  EPNDVVWRSLLSACKAHHNLELGEIAAKNLGQLKSQNPSDYVILSNMYARAQRWEDVARI 458

Query: 136  RVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            R +M +KG+ Q  G SLVE+KRKVHKF+S D SH +   I EM+H
Sbjct: 459  RTEMVDKGIEQKPGCSLVEVKRKVHKFVSQDMSHPQCHGIQEMIH 503



 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 54/214 (25%), Positives = 94/214 (43%), Gaps = 36/214 (16%)
 Frame = -2

Query: 808 TLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEM 629
           T  ++L AC    AL+ G   HG++ +     +V V+  L+ MY +CG ++   ++F+ M
Sbjct: 133 TYPALLKACARRWALEKGMQIHGHIFKLGLEVDVFVQNSLISMYGKCGKIELACAVFELM 192

Query: 628 VEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIYVGVLSACRN------- 473
            +K   S+S II   A+ G   + L +F +M  EG  + ++   V VLSAC +       
Sbjct: 193 NQKTVASWSAIIGAHASLGLWCECLMLFGEMNREGCWRAEESTLVSVLSACTHLGSFDLG 252

Query: 472 ----------------------------GGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVD 377
                                        G +EKG+  F  M  +++   T+   G  + 
Sbjct: 253 RCTHGSLLRNISGINVIVETSLIDMYVKSGCLEKGLSIFQNMSKKNQFSYTVMISGLAMH 312

Query: 376 LMGRVGMVYEAFELIKKMPMDPNDVIWRSLLSSC 275
             GR  +    F  + +  M P+DV++  +LS+C
Sbjct: 313 GRGREAL--RVFSEMLEEGMVPDDVVYVGVLSAC 344


>ref|XP_017973404.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            isoform X1 [Theobroma cacao]
          Length = 605

 Score =  397 bits (1021), Expect = e-132
 Identities = 182/285 (63%), Positives = 237/285 (83%)
 Frame = -2

Query: 856  LFSNMSREGCWRAEESTLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMY 677
            +F NMS EGCWR EESTLV+VLSACTHLGALDLGKCTHG LLRN+S  NV V+T LMDMY
Sbjct: 219  MFGNMSSEGCWRPEESTLVTVLSACTHLGALDLGKCTHGSLLRNISELNVIVQTSLMDMY 278

Query: 676  IRCGSLDKGMSLFQEMVEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGLKPDDVIYV 497
            ++CG L+KG+SLF++M  ++  SY+V+ISGLA HG+G +AL I+ +ML++GL PDDV+YV
Sbjct: 279  VKCGCLEKGLSLFRKMGNRSQMSYTVMISGLAMHGHGAEALRIYSEMLKDGLDPDDVVYV 338

Query: 496  GVLSACRNGGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVDLMGRVGMVYEAFELIKKMPM 317
            GVLSAC + GLV++G + FDRM+ EH I PT+QHYGC+VDLMG+ GM+ EA E IK MP+
Sbjct: 339  GVLSACSHAGLVDEGFRCFDRMKSEHGITPTVQHYGCMVDLMGKAGMINEALEFIKSMPI 398

Query: 316  DPNDVIWRSLLSSCKVHQNVELGEIAAENLFKLNTQNGGDYIMLSNIYAQAQRWDDVSLT 137
             PNDV+WRSLLS+C+VH N+E+GEIAA++LF+  +QN GDY++LSN+YA+AQRW +V+  
Sbjct: 399  KPNDVVWRSLLSACRVHCNLEIGEIAAKHLFQSKSQNPGDYVILSNMYARAQRWQEVAKI 458

Query: 136  RVKMANKGLGQVVGSSLVEIKRKVHKFISNDKSHFEFLEIYEMLH 2
            RV+MA KGL QV G SLVE+ R++HKF+S D SH + + +YEM+H
Sbjct: 459  RVEMARKGLHQVPGFSLVEVGRRIHKFVSQDTSHPQCVSVYEMIH 503



 Score = 69.7 bits (169), Expect = 1e-09
 Identities = 53/214 (24%), Positives = 97/214 (45%), Gaps = 36/214 (16%)
 Frame = -2

Query: 808 TLVSVLSACTHLGALDLGKCTHGYLLRNLSGFNVAVETCLMDMYIRCGSLDKGMSLFQEM 629
           T  ++  AC  L A + GK  HG+  +     ++ V+  L++MY +CG ++   ++F++M
Sbjct: 133 TYPALFKACACLQAQEEGKQIHGHAFKLGLESDLYVQNSLINMYGKCGEIEHSCAIFEQM 192

Query: 628 VEKNHKSYSVIISGLATHGYGEKALTIFEQMLEEGL-KPDDVIYVGVLSACRN------- 473
            +K+  S+S II+  A+ G   + L +F  M  EG  +P++   V VLSAC +       
Sbjct: 193 DQKSVASWSAIIAAHASFGKWYECLMMFGNMSSEGCWRPEESTLVTVLSACTHLGALDLG 252

Query: 472 ----------------------------GGLVEKGMKYFDRMRLEHRIEPTIQHYGCIVD 377
                                        G +EKG+  F +M    ++  T+   G  + 
Sbjct: 253 KCTHGSLLRNISELNVIVQTSLMDMYVKCGCLEKGLSLFRKMGNRSQMSYTVMISG--LA 310

Query: 376 LMGRVGMVYEAFELIKKMPMDPNDVIWRSLLSSC 275
           + G        +  + K  +DP+DV++  +LS+C
Sbjct: 311 MHGHGAEALRIYSEMLKDGLDPDDVVYVGVLSAC 344


Top