BLASTX nr result

ID: Chrysanthemum21_contig00015158 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00015158
         (1649 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVI08246.1| hypothetical protein Ccrd_013385 [Cynara carduncu...   350   e-114
ref|XP_022035185.1| pentatricopeptide repeat-containing protein ...   324   e-104
ref|XP_023734219.1| pentatricopeptide repeat-containing protein ...   308   1e-98
ref|XP_023913802.1| pentatricopeptide repeat-containing protein ...   249   3e-75
dbj|GAV62919.1| hypothetical protein CFOL_v3_06441 [Cephalotus f...   249   3e-75
emb|CBI39461.3| unnamed protein product, partial [Vitis vinifera]     246   9e-74
ref|XP_002269673.1| PREDICTED: pentatricopeptide repeat-containi...   246   1e-73
ref|XP_018842584.1| PREDICTED: pentatricopeptide repeat-containi...   243   3e-73
ref|XP_011091155.1| pentatricopeptide repeat-containing protein ...   242   2e-72
ref|XP_022967610.1| pentatricopeptide repeat-containing protein ...   242   2e-72
gb|EYU32987.1| hypothetical protein MIMGU_mgv1a019936mg, partial...   241   3e-72
ref|XP_012842714.1| PREDICTED: pentatricopeptide repeat-containi...   241   4e-72
ref|XP_023511579.1| pentatricopeptide repeat-containing protein ...   241   4e-72
ref|XP_022967611.1| pentatricopeptide repeat-containing protein ...   240   8e-72
ref|XP_017235677.1| PREDICTED: pentatricopeptide repeat-containi...   239   1e-71
ref|XP_021810292.1| pentatricopeptide repeat-containing protein ...   240   1e-71
ref|XP_023511580.1| pentatricopeptide repeat-containing protein ...   239   2e-71
ref|XP_021683105.1| pentatricopeptide repeat-containing protein ...   239   2e-71
ref|XP_021683104.1| pentatricopeptide repeat-containing protein ...   239   2e-71
ref|XP_021683102.1| pentatricopeptide repeat-containing protein ...   239   2e-71

>gb|KVI08246.1| hypothetical protein Ccrd_013385 [Cynara cardunculus var. scolymus]
          Length = 280

 Score =  350 bits (897), Expect = e-114
 Identities = 181/277 (65%), Positives = 214/277 (77%)
 Frame = -3

Query: 1452 MLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQKHFKEQAMIPYI 1273
            MLRS+ +I GLIRPM Q G+ + PIS+ S+ T+IQD IPT V E+ LQ HF        I
Sbjct: 1    MLRSN-AIAGLIRPMRQLGIFRVPISNCSQLTVIQDHIPTPVDENYLQDHFN-------I 52

Query: 1272 DKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAE 1093
             K           DAE   KL K  +VSKKDKI+V +  LL+L+D+KEAVYGALDAWV  
Sbjct: 53   GK-----------DAEGIGKLQKTYNVSKKDKISVLVRSLLDLEDSKEAVYGALDAWVVG 101

Query: 1092 EKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEA 913
            E+EFPIGRLKTALI LE+ ++WHKVVQVIKWMLSKGQG+TVGTYGQLIRALDMD RVEEA
Sbjct: 102  EREFPIGRLKTALIALEKMQEWHKVVQVIKWMLSKGQGVTVGTYGQLIRALDMDLRVEEA 161

Query: 912  KKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVA 733
             KLW +KL RDV +VPWKVC+I++SVY+RNEMWEEL  L+K LE HGRK PD+ I +KVA
Sbjct: 162  NKLWAKKLGRDVQSVPWKVCDIMISVYYRNEMWEELVKLFKGLEAHGRKPPDQSIVKKVA 221

Query: 732  DSYEKLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSK 622
            +SYEKLGLVEEKER +EKYKS F KTRG+YGR+ S +
Sbjct: 222  ESYEKLGLVEEKERFVEKYKSSFTKTRGKYGRKASKE 258


>ref|XP_022035185.1| pentatricopeptide repeat-containing protein At4g21190-like
            [Helianthus annuus]
 gb|OTG28771.1| hypothetical protein HannXRQ_Chr04g0114961 [Helianthus annuus]
          Length = 267

 Score =  324 bits (830), Expect = e-104
 Identities = 165/273 (60%), Positives = 208/273 (76%)
 Frame = -3

Query: 1440 SKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQKHFKEQAMIPYIDKEG 1261
            SK+IIGLIRPM Q  + +  I+SY + T+ +  + T  + + L   F +Q     ++  G
Sbjct: 4    SKAIIGLIRPMRQLIISRVSITSYEQNTMTRYHLHTPANVNHLHDRFNKQTTSHDVETMG 63

Query: 1260 ERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEEKEF 1081
            +                +K+  VSKK KI+ F+S L++LDD+KE+VYGALDAWVAEE+EF
Sbjct: 64   K----------------VKSAHVSKKVKISDFVSTLVDLDDSKESVYGALDAWVAEEQEF 107

Query: 1080 PIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAKKLW 901
            PIGR+KTALITLERK++WHKVVQVIKW LSKGQG+TVGTYGQLIRALDMD+RVEEA +LW
Sbjct: 108  PIGRVKTALITLERKQQWHKVVQVIKWTLSKGQGVTVGTYGQLIRALDMDHRVEEANRLW 167

Query: 900  VRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVADSYE 721
            V+KLSRDV ++PWKVC+II+ VY+RN MW+EL  L+K+LE HGRK PD+ I +KVA+SYE
Sbjct: 168  VKKLSRDVQSIPWKVCDIIILVYYRNGMWKELVKLFKELESHGRKPPDRAIVRKVAESYE 227

Query: 720  KLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSK 622
             LGLVEEK+RVMEKY SLF KTRGRYGR P  +
Sbjct: 228  ALGLVEEKDRVMEKYASLF-KTRGRYGRNPGKE 259


>ref|XP_023734219.1| pentatricopeptide repeat-containing protein At4g21190-like [Lactuca
            sativa]
 gb|PLY73481.1| hypothetical protein LSAT_2X43240 [Lactuca sativa]
          Length = 244

 Score =  308 bits (789), Expect = 1e-98
 Identities = 165/278 (59%), Positives = 201/278 (72%)
 Frame = -3

Query: 1452 MLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQKHFKEQAMIPYI 1273
            MLRS K+I GLI+P S+  + K PIS +S+TT  QDQ P L       + F+        
Sbjct: 1    MLRS-KAITGLIKPTSRLSIFKIPISPFSQTTNTQDQTPIL-------QRFQ-------- 44

Query: 1272 DKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAE 1093
                                     +VSKKDKI V +SKL+ +++ KEAVYG LDAWVAE
Sbjct: 45   -------------------------NVSKKDKITVLVSKLVAVENTKEAVYGTLDAWVAE 79

Query: 1092 EKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEA 913
            E+EFPIGR+KTAL+TLE+KEKWHKVVQVIKWMLSKG G T+GTYGQLIRALDMDNRVEEA
Sbjct: 80   EREFPIGRVKTALLTLEKKEKWHKVVQVIKWMLSKGHGTTIGTYGQLIRALDMDNRVEEA 139

Query: 912  KKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVA 733
            K+LW RKL R+V+ VPWKVCEI+VSVY+RNEMW+E+  L+++LE   RK  DK I ++VA
Sbjct: 140  KRLWGRKLGRNVELVPWKVCEIMVSVYYRNEMWKEIVKLFEELEGRNRKCTDKVIVERVA 199

Query: 732  DSYEKLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSKE 619
            +SY KLGLVEEKE V+ KYKSLF K+RG+YGR+ SSKE
Sbjct: 200  ESYGKLGLVEEKEGVLVKYKSLFSKSRGKYGRK-SSKE 236


>ref|XP_023913802.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X1 [Quercus suber]
 ref|XP_023913803.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Quercus suber]
 ref|XP_023913805.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X3 [Quercus suber]
 ref|XP_023913806.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Quercus suber]
          Length = 275

 Score =  249 bits (635), Expect = 3e-75
 Identities = 127/265 (47%), Positives = 181/265 (68%), Gaps = 2/265 (0%)
 Frame = -3

Query: 1440 SKSIIGLIRPMSQFGMLKGPI--SSYSRTTIIQDQIPTLVSEDELQKHFKEQAMIPYIDK 1267
            S ++  L+  ++Q G ++     SSYS   + Q QI    S  +    F+++      D 
Sbjct: 4    SSTMSSLLHRLAQRGAVRAQFLNSSYSTMLLHQSQISNR-STTKAMASFQDEC-----DS 57

Query: 1266 EGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEEK 1087
              +           + R++ +  +VS+KDKIN  ++ LL++ D+KEAVYGALDAWVA E+
Sbjct: 58   PAKSQFPDQNAGGVQGRQIGE--NVSRKDKINFLVNTLLDIKDSKEAVYGALDAWVAWEQ 115

Query: 1086 EFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAKK 907
             FPI  LK AL+ LE++++WH+++QVIKWMLSKGQG T+GTYGQLIRALDMD+R EEA K
Sbjct: 116  NFPIASLKRALLVLEKEQQWHRIIQVIKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAHK 175

Query: 906  LWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVADS 727
            +WV+KL  D+ +VPW++C +++SVYHRN M E+L  L+K LE   RK P+K I Q+VAD+
Sbjct: 176  VWVQKLGMDLHSVPWQLCRLMISVYHRNNMLEDLVKLFKNLEAFDRKPPEKSIVQRVADA 235

Query: 726  YEKLGLVEEKERVMEKYKSLFIKTR 652
            YE LGL+EEK+RV+EKY  LF + +
Sbjct: 236  YEMLGLLEEKQRVLEKYNDLFTENK 260


>dbj|GAV62919.1| hypothetical protein CFOL_v3_06441 [Cephalotus follicularis]
          Length = 276

 Score =  249 bits (635), Expect = 3e-75
 Identities = 134/283 (47%), Positives = 189/283 (66%), Gaps = 5/283 (1%)
 Frame = -3

Query: 1452 MLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQI-PTLVSEDELQKHFKEQAMIPY 1276
            M RSS  +  L+R  +Q G+ +  I + S  T++Q QI     ++   +  +   A   +
Sbjct: 1    MWRSSSVMSYLVRRSTQLGVFRVKILNASYGTMVQAQIYKQSTTKTTPEDRYNSPATCQH 60

Query: 1275 IDKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVA 1096
             +K           +   T+K     +VS KDKI    + LLEL+D+KEAVYGALDAWVA
Sbjct: 61   EEK-----------NVGGTQKNHTGANVSGKDKITFLTNTLLELNDSKEAVYGALDAWVA 109

Query: 1095 EEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEE 916
             E+ FPI RLK  L+ LE++++WH+V+QVIKWMLSKGQG T+GTYGQLI+ALDMD+R EE
Sbjct: 110  WEQNFPIARLKNVLLALEKEQQWHRVIQVIKWMLSKGQGTTMGTYGQLIKALDMDHRTEE 169

Query: 915  AKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKV 736
            A KLW +K+  D+ +VPW++C  ++S+Y+RN M E+L  L+K LE   RK P+K I QKV
Sbjct: 170  AHKLWEKKIGSDLHSVPWQLCNRMISIYYRNNMLEKLVKLFKGLEAFDRKPPEKSIVQKV 229

Query: 735  ADSYEKLGLVEEKERVMEKYKSLFIKT-RG---RYGRRPSSKE 619
            A++YE LGL+EEK+RV+EKYK LF +T +G   ++G+  S K+
Sbjct: 230  ANAYEMLGLLEEKDRVLEKYKDLFTQTGKGNLKKFGKSSSKKK 272


>emb|CBI39461.3| unnamed protein product, partial [Vitis vinifera]
          Length = 296

 Score =  246 bits (627), Expect = 9e-74
 Identities = 127/265 (47%), Positives = 173/265 (65%)
 Frame = -3

Query: 1449 LRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQKHFKEQAMIPYID 1270
            +  SK+++ L+R  +Q G  +    + S +T  Q Q+    +  E+     +    P   
Sbjct: 1    MSKSKAMVNLVRQFTQLGATRVQTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNNQPMYH 60

Query: 1269 KEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEE 1090
              G+ A    K    E        +VS+KDKIN  ++ LL+L D+KEAVYGALDAWVA E
Sbjct: 61   DSGKDAASVHKHQIGE--------NVSRKDKINFLVTTLLDLKDSKEAVYGALDAWVAWE 112

Query: 1089 KEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAK 910
            + FPI  LK  LITLE++++WH+V+QV+KWMLSKGQG T+GTYGQLIRALDMD+R EEA 
Sbjct: 113  QNFPIASLKRVLITLEKEQQWHRVIQVVKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAH 172

Query: 909  KLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVAD 730
            + WV+K+  D+ +VPW +C  ++SVY+RN M E L  L+K LE   RK  DK + +KVAD
Sbjct: 173  EFWVKKIGTDLHSVPWHLCHRMISVYYRNNMLENLVKLFKGLEAFDRKPQDKLVVKKVAD 232

Query: 729  SYEKLGLVEEKERVMEKYKSLFIKT 655
            +YE LGL+EEKER+ EKY  LF +T
Sbjct: 233  AYEMLGLLEEKERIFEKYDYLFTET 257


>ref|XP_002269673.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Vitis vinifera]
          Length = 300

 Score =  246 bits (627), Expect = 1e-73
 Identities = 127/265 (47%), Positives = 173/265 (65%)
 Frame = -3

Query: 1449 LRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQKHFKEQAMIPYID 1270
            +  SK+++ L+R  +Q G  +    + S +T  Q Q+    +  E+     +    P   
Sbjct: 5    MSKSKAMVNLVRQFTQLGATRVQTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNNQPMYH 64

Query: 1269 KEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEE 1090
              G+ A    K    E        +VS+KDKIN  ++ LL+L D+KEAVYGALDAWVA E
Sbjct: 65   DSGKDAASVHKHQIGE--------NVSRKDKINFLVTTLLDLKDSKEAVYGALDAWVAWE 116

Query: 1089 KEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAK 910
            + FPI  LK  LITLE++++WH+V+QV+KWMLSKGQG T+GTYGQLIRALDMD+R EEA 
Sbjct: 117  QNFPIASLKRVLITLEKEQQWHRVIQVVKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAH 176

Query: 909  KLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVAD 730
            + WV+K+  D+ +VPW +C  ++SVY+RN M E L  L+K LE   RK  DK + +KVAD
Sbjct: 177  EFWVKKIGTDLHSVPWHLCHRMISVYYRNNMLENLVKLFKGLEAFDRKPQDKLVVKKVAD 236

Query: 729  SYEKLGLVEEKERVMEKYKSLFIKT 655
            +YE LGL+EEKER+ EKY  LF +T
Sbjct: 237  AYEMLGLLEEKERIFEKYDYLFTET 261


>ref|XP_018842584.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Juglans regia]
          Length = 273

 Score =  243 bits (621), Expect = 3e-73
 Identities = 129/273 (47%), Positives = 178/273 (65%), Gaps = 6/273 (2%)
 Frame = -3

Query: 1422 LIRPMSQFGMLKGPI---SSYSRTTIIQDQI---PTLVSEDELQKHFKEQAMIPYIDKEG 1261
            L+R ++Q G ++      S+YS  +    +    PT  +E   Q     Q M P      
Sbjct: 10   LVRRLTQLGEVRVRYFNSSNYSNMSFPSQKSHPGPTATTETSFQDECNGQPMRP------ 63

Query: 1260 ERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEEKEF 1081
                + +  D +E+R      +VS+KDK+N  ++ LL++ D+KEAVYGALDAWVA E+ F
Sbjct: 64   ----EKNAGDVQESRI---GENVSRKDKVNFLVNTLLDIKDSKEAVYGALDAWVAWEQNF 116

Query: 1080 PIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAKKLW 901
            PI  +K AL+ LE++++WHKVVQVIKWMLSKGQG T+GTYGQLIRALDMD+R EEA K+W
Sbjct: 117  PIVSIKRALLALEKEQQWHKVVQVIKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAHKIW 176

Query: 900  VRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVADSYE 721
             RK+  D+ +VPW++C  ++S+Y+RN M + L  L+K LE   RK P+K I Q+VAD+YE
Sbjct: 177  ERKIGMDLHSVPWQLCRQMISIYYRNNMLKSLVKLFKDLEAFDRKPPEKSIVQRVADAYE 236

Query: 720  KLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSK 622
             LGL+EEKERV+EKY  LF        ++  SK
Sbjct: 237  MLGLLEEKERVLEKYNDLFTGNESEKHKKAPSK 269


>ref|XP_011091155.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            [Sesamum indicum]
          Length = 277

 Score =  242 bits (617), Expect = 2e-72
 Identities = 110/190 (57%), Positives = 153/190 (80%)
 Frame = -3

Query: 1197 DVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEEKEFPIGRLKTALITLERKEKWHKV 1018
            +VS+KDKI+  +S L++L D+KEAVY  LDAWVA E+ FPIG LK  L+ LE++++WH++
Sbjct: 77   NVSRKDKISFLVSTLMDLQDSKEAVYSTLDAWVAWERNFPIGALKQVLVALEKEQQWHRI 136

Query: 1017 VQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAKKLWVRKLSRDVDTVPWKVCEIIVS 838
            +QVIKWMLSKGQG T GTYGQLI+ALDMD+RVEEA+++W +KL+ D+ +VPWK+C++++S
Sbjct: 137  IQVIKWMLSKGQGTTRGTYGQLIQALDMDHRVEEAQEIWKKKLAFDLHSVPWKLCKLMIS 196

Query: 837  VYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVADSYEKLGLVEEKERVMEKYKSLFIK 658
            VY+RN M ++L  L+K LE   RK P+K I QKVAD+YE LGL EEKER++EKYK LF++
Sbjct: 197  VYYRNNMLDDLVKLFKGLEAFDRKPPEKSIVQKVADAYELLGLPEEKERILEKYKDLFVE 256

Query: 657  TRGRYGRRPS 628
            +     ++ S
Sbjct: 257  SSNEKAKKIS 266


>ref|XP_022967610.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X1 [Cucurbita maxima]
          Length = 296

 Score =  242 bits (618), Expect = 2e-72
 Identities = 133/294 (45%), Positives = 189/294 (64%), Gaps = 5/294 (1%)
 Frame = -3

Query: 1488 RYYFRETTLSIAMLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQ 1309
            R + R  T +  +LR +      +  + + G+ K  I +    T++Q+Q+P         
Sbjct: 4    RRFHRAATWATPLLRDTT-----VGQVMELGVNKLQIGNSCYCTMLQNQMP--------- 49

Query: 1308 KHFKEQAMIPYIDKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKE 1129
            K F ++ M        +   ++S+ +  + RK     +VS+KDKIN  ++ L++L D+KE
Sbjct: 50   KRFADKDMTDKDVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKE 109

Query: 1128 AVYGALDAWVAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLI 949
            AVYGALDAWVA E++FPI  LK AL  LE++ +WH+VVQVIKWMLSKGQG T+  YGQLI
Sbjct: 110  AVYGALDAWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLI 169

Query: 948  RALDMDNRVEEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGR 769
            RALDMD+R EEA K WV K+  D+ +VPW++C  ++S+Y+RN+M E+L  L+K LE  GR
Sbjct: 170  RALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGR 229

Query: 768  KVPDKYIAQKVADSYEKLGLVEEKERVMEKYKSLFIKTR----GRY-GRRPSSK 622
            K P+K I Q+VAD+ E LGLVEEKERV+ KY  LF   +     +Y G+R S+K
Sbjct: 230  KPPEKSIVQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYKGKRKSTK 283


>gb|EYU32987.1| hypothetical protein MIMGU_mgv1a019936mg, partial [Erythranthe
            guttata]
          Length = 266

 Score =  241 bits (614), Expect = 3e-72
 Identities = 112/193 (58%), Positives = 153/193 (79%)
 Frame = -3

Query: 1233 DAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEEKEFPIGRLKTAL 1054
            D +   KL    ++ ++DKI+  ++ L++L DNKE++Y  LDAWVA E+EFPIG LK  L
Sbjct: 65   DIKSLPKLEIGENIPRRDKISFLVTTLIDLQDNKESIYNTLDAWVAWEREFPIGALKNVL 124

Query: 1053 ITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAKKLWVRKLSRDVD 874
            + LE++++WHKV+QVIKWMLSKGQG T GTYGQLIRALDMD+RVEEA ++W +KL  D+ 
Sbjct: 125  LALEKQQQWHKVIQVIKWMLSKGQGNTRGTYGQLIRALDMDHRVEEAHEIWKKKLGFDLH 184

Query: 873  TVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVADSYEKLGLVEEKE 694
            +VPWK+C++++SVY+RN M E+L  L+K LE   RK P+K I Q+VAD+YE LGL EEKE
Sbjct: 185  SVPWKLCKLMISVYYRNNMLEDLVKLFKGLEGFDRKPPEKSIVQRVADAYEVLGLSEEKE 244

Query: 693  RVMEKYKSLFIKT 655
            RV+EKYK+LF+++
Sbjct: 245  RVLEKYKTLFVES 257


>ref|XP_012842714.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Erythranthe guttata]
          Length = 273

 Score =  241 bits (614), Expect = 4e-72
 Identities = 112/193 (58%), Positives = 153/193 (79%)
 Frame = -3

Query: 1233 DAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEEKEFPIGRLKTAL 1054
            D +   KL    ++ ++DKI+  ++ L++L DNKE++Y  LDAWVA E+EFPIG LK  L
Sbjct: 65   DIKSLPKLEIGENIPRRDKISFLVTTLIDLQDNKESIYNTLDAWVAWEREFPIGALKNVL 124

Query: 1053 ITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAKKLWVRKLSRDVD 874
            + LE++++WHKV+QVIKWMLSKGQG T GTYGQLIRALDMD+RVEEA ++W +KL  D+ 
Sbjct: 125  LALEKQQQWHKVIQVIKWMLSKGQGNTRGTYGQLIRALDMDHRVEEAHEIWKKKLGFDLH 184

Query: 873  TVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVADSYEKLGLVEEKE 694
            +VPWK+C++++SVY+RN M E+L  L+K LE   RK P+K I Q+VAD+YE LGL EEKE
Sbjct: 185  SVPWKLCKLMISVYYRNNMLEDLVKLFKGLEGFDRKPPEKSIVQRVADAYEVLGLSEEKE 244

Query: 693  RVMEKYKSLFIKT 655
            RV+EKYK+LF+++
Sbjct: 245  RVLEKYKTLFVES 257


>ref|XP_023511579.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X1 [Cucurbita pepo subsp. pepo]
          Length = 296

 Score =  241 bits (616), Expect = 4e-72
 Identities = 133/294 (45%), Positives = 188/294 (63%), Gaps = 5/294 (1%)
 Frame = -3

Query: 1488 RYYFRETTLSIAMLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQ 1309
            R + R  T +  +LR        +  + + G+ K  I + S  T++Q+Q         + 
Sbjct: 4    RRFHRAATWATPLLRDKT-----VGQIMELGVNKLQIGNSSYCTMLQNQ---------MS 49

Query: 1308 KHFKEQAMIPYIDKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKE 1129
            K F ++ M        +   ++S+ +  + RK     +VS+KDKIN  ++ L++L D+KE
Sbjct: 50   KRFADKDMTDKDVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKE 109

Query: 1128 AVYGALDAWVAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLI 949
            AVYGALDAWVA E++FPI  LK AL  LE++ +WH+VVQVIKWMLSKGQG T+  YGQLI
Sbjct: 110  AVYGALDAWVAWEQDFPIASLKHALTVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLI 169

Query: 948  RALDMDNRVEEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGR 769
            RALDMD+R EEA K WV K+  D+ +VPW++C  ++S+Y+RN+M E+L  L+K LE  GR
Sbjct: 170  RALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGR 229

Query: 768  KVPDKYIAQKVADSYEKLGLVEEKERVMEKYKSLFIKTR----GRY-GRRPSSK 622
            K P+K I Q+VAD+ E LGLVEEKERV+ KY  LF   +     +Y G+R S+K
Sbjct: 230  KPPEKSIVQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYKGKRKSTK 283


>ref|XP_022967611.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Cucurbita maxima]
          Length = 286

 Score =  240 bits (613), Expect = 8e-72
 Identities = 134/294 (45%), Positives = 188/294 (63%), Gaps = 5/294 (1%)
 Frame = -3

Query: 1488 RYYFRETTLSIAMLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQ 1309
            R + R  T +  +LR +      +  + + G+ K  I +    T++Q+Q+P         
Sbjct: 4    RRFHRAATWATPLLRDTT-----VGQVMELGVNKLQIGNSCYCTMLQNQMP--------- 49

Query: 1308 KHFKEQAMIPYIDKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKE 1129
            K F ++ M            K+S+ +  + RK     +VS+KDKIN  ++ L++L D+KE
Sbjct: 50   KRFADKDMTD----------KTSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKE 99

Query: 1128 AVYGALDAWVAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLI 949
            AVYGALDAWVA E++FPI  LK AL  LE++ +WH+VVQVIKWMLSKGQG T+  YGQLI
Sbjct: 100  AVYGALDAWVAWEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLI 159

Query: 948  RALDMDNRVEEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGR 769
            RALDMD+R EEA K WV K+  D+ +VPW++C  ++S+Y+RN+M E+L  L+K LE  GR
Sbjct: 160  RALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGR 219

Query: 768  KVPDKYIAQKVADSYEKLGLVEEKERVMEKYKSLFIKTR----GRY-GRRPSSK 622
            K P+K I Q+VAD+ E LGLVEEKERV+ KY  LF   +     +Y G+R S+K
Sbjct: 220  KPPEKSIVQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYKGKRKSTK 273


>ref|XP_017235677.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Daucus carota subsp. sativus]
 gb|KZN10647.1| hypothetical protein DCAR_003303 [Daucus carota subsp. sativus]
          Length = 267

 Score =  239 bits (610), Expect = 1e-71
 Identities = 118/273 (43%), Positives = 184/273 (67%)
 Frame = -3

Query: 1440 SKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQKHFKEQAMIPYIDKEG 1261
            S +++ L+R +SQF +++    SY   T+ Q + P+L +   +             D +G
Sbjct: 4    SVAMVKLVRRVSQFDLVRAQTLSYC--TMAQGRNPSLNNGSRMPS-----------DDDG 50

Query: 1260 ERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAWVAEEKEF 1081
                  SK   ++ +  +   +VSK+D+++  +S LL+L D+KEAVYG LD+W A E+EF
Sbjct: 51   SNGVSMSKNLGKDIKHRI-GKNVSKRDRVSFLVSTLLDLQDSKEAVYGTLDSWAAWEREF 109

Query: 1080 PIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRVEEAKKLW 901
            PIG LK ALI LE++++WH+VVQVIKWMLSKGQG T+ TYGQLIRALDMD+RV+EA ++W
Sbjct: 110  PIGHLKQALIALEKEQQWHRVVQVIKWMLSKGQGTTMNTYGQLIRALDMDHRVKEAHEIW 169

Query: 900  VRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQKVADSYE 721
            V+K+  D+ +VPW++C++++ VY+RN M +++  L K +E + RK+ +K +  K+AD+YE
Sbjct: 170  VKKVGDDLHSVPWQLCKLMIGVYYRNNMLDKVVKLSKSMEAYDRKIYEKSVLMKIADAYE 229

Query: 720  KLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSK 622
             LGL EEK R++EK+  L  +T  R+ ++   K
Sbjct: 230  MLGLAEEKNRILEKHSDLLDETSKRHTKQSRGK 262


>ref|XP_021810292.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X1 [Prunus avium]
 ref|XP_021810293.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Prunus avium]
          Length = 304

 Score =  240 bits (613), Expect = 1e-71
 Identities = 121/282 (42%), Positives = 186/282 (65%), Gaps = 1/282 (0%)
 Frame = -3

Query: 1461 SIAMLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQKHFKEQAMI 1282
            +I  L  S ++ G +  ++Q G+++  I + + +T++Q QI   ++           AM+
Sbjct: 27   AILSLNCSSAVGGQVGRLTQLGVIRAQILNSTYSTVVQAQISNQITAT---------AMV 77

Query: 1281 PYIDKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAW 1102
               D+   +     + +A   ++     +VS+KDK+N  +  LL+L+D+KEAVYG LD W
Sbjct: 78   SLEDQHNHQNSYYPEKNAGGEKRDQIGWNVSRKDKVNFLVRTLLDLNDSKEAVYGTLDGW 137

Query: 1101 VAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRV 922
            VA E++FPIG+++ AL +LE++++WH++VQVIKWMLSKGQG T+GTYGQLIRALDMD R 
Sbjct: 138  VAWEQDFPIGKIRMALTSLEKEQQWHRIVQVIKWMLSKGQGNTMGTYGQLIRALDMDQRP 197

Query: 921  EEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQ 742
            EEA K W +K+  D+ +VPW++C+ ++ +Y+RN M E L  L++ LE   RK P K I Q
Sbjct: 198  EEAHKFWDKKIGIDLHSVPWQLCKSMIGIYYRNNMLESLVKLFEGLEAFDRKPPVKSIVQ 257

Query: 741  KVADSYEKLGLVEEKERVMEKYKSLFIKTRG-RYGRRPSSKE 619
            +VAD+YE LG +EEKERV++KY  LF +    +  R  S+K+
Sbjct: 258  RVADAYEMLGRIEEKERVLQKYNYLFTENASLKKSREASAKK 299


>ref|XP_023511580.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Cucurbita pepo subsp. pepo]
          Length = 286

 Score =  239 bits (611), Expect = 2e-71
 Identities = 134/294 (45%), Positives = 187/294 (63%), Gaps = 5/294 (1%)
 Frame = -3

Query: 1488 RYYFRETTLSIAMLRSSKSIIGLIRPMSQFGMLKGPISSYSRTTIIQDQIPTLVSEDELQ 1309
            R + R  T +  +LR        +  + + G+ K  I + S  T++Q+Q         + 
Sbjct: 4    RRFHRAATWATPLLRDKT-----VGQIMELGVNKLQIGNSSYCTMLQNQ---------MS 49

Query: 1308 KHFKEQAMIPYIDKEGERARKSSKTDAEETRKLLKALDVSKKDKINVFISKLLELDDNKE 1129
            K F ++ M            K+S+ +  + RK     +VS+KDKIN  ++ L++L D+KE
Sbjct: 50   KRFADKDMTD----------KTSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKE 99

Query: 1128 AVYGALDAWVAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLI 949
            AVYGALDAWVA E++FPI  LK AL  LE++ +WH+VVQVIKWMLSKGQG T+  YGQLI
Sbjct: 100  AVYGALDAWVAWEQDFPIASLKHALTVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLI 159

Query: 948  RALDMDNRVEEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGR 769
            RALDMD+R EEA K WV K+  D+ +VPW++C  ++S+Y+RN+M E+L  L+K LE  GR
Sbjct: 160  RALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGR 219

Query: 768  KVPDKYIAQKVADSYEKLGLVEEKERVMEKYKSLFIKTR----GRY-GRRPSSK 622
            K P+K I Q+VAD+ E LGLVEEKERV+ KY  LF   +     +Y G+R S+K
Sbjct: 220  KPPEKSIVQRVADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYKGKRKSTK 273


>ref|XP_021683105.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X4 [Hevea brasiliensis]
          Length = 266

 Score =  239 bits (609), Expect = 2e-71
 Identities = 117/220 (53%), Positives = 162/220 (73%), Gaps = 2/220 (0%)
 Frame = -3

Query: 1275 IDKEGERARKSSKTD--AEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAW 1102
            ++ + +R  K   +D  A   +K     +VS+KDKIN  ++ LL+L D+KEAVYGALDAW
Sbjct: 18   LENQYDRKAKCENSDQNAGGVQKFRIGENVSRKDKINFLVNTLLDLKDSKEAVYGALDAW 77

Query: 1101 VAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRV 922
            VA E+ FPIG LK  L+TLE++++WH+VVQVIKWMLSKGQG T+GTYGQLI+ALD D+R 
Sbjct: 78   VAWERNFPIGSLKMVLLTLEKEQQWHRVVQVIKWMLSKGQGNTMGTYGQLIQALDKDHRA 137

Query: 921  EEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQ 742
            EEA   W++K+  D+ +VPW++C+ ++S+Y+RN M E L  L+K LE   RK P+K I Q
Sbjct: 138  EEAHVFWLKKIGTDLHSVPWQLCKCMISIYYRNNMLENLIKLFKGLEAFDRKPPEKSIVQ 197

Query: 741  KVADSYEKLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSK 622
            KVAD+YE LG++EEKER+++KY  LF K + + G + S K
Sbjct: 198  KVADAYEMLGMLEEKERLLQKYNDLF-KEKEKEGLKKSRK 236


>ref|XP_021683104.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X3 [Hevea brasiliensis]
          Length = 274

 Score =  239 bits (609), Expect = 2e-71
 Identities = 117/220 (53%), Positives = 162/220 (73%), Gaps = 2/220 (0%)
 Frame = -3

Query: 1275 IDKEGERARKSSKTD--AEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAW 1102
            ++ + +R  K   +D  A   +K     +VS+KDKIN  ++ LL+L D+KEAVYGALDAW
Sbjct: 49   LENQYDRKAKCENSDQNAGGVQKFRIGENVSRKDKINFLVNTLLDLKDSKEAVYGALDAW 108

Query: 1101 VAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRV 922
            VA E+ FPIG LK  L+TLE++++WH+VVQVIKWMLSKGQG T+GTYGQLI+ALD D+R 
Sbjct: 109  VAWERNFPIGSLKMVLLTLEKEQQWHRVVQVIKWMLSKGQGNTMGTYGQLIQALDKDHRA 168

Query: 921  EEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQ 742
            EEA   W++K+  D+ +VPW++C+ ++S+Y+RN M E L  L+K LE   RK P+K I Q
Sbjct: 169  EEAHVFWLKKIGTDLHSVPWQLCKCMISIYYRNNMLENLIKLFKGLEAFDRKPPEKSIVQ 228

Query: 741  KVADSYEKLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSK 622
            KVAD+YE LG++EEKER+++KY  LF K + + G + S K
Sbjct: 229  KVADAYEMLGMLEEKERLLQKYNDLF-KEKEKEGLKKSRK 267


>ref|XP_021683102.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Hevea brasiliensis]
 ref|XP_021683103.1| pentatricopeptide repeat-containing protein At4g18975, chloroplastic
            isoform X2 [Hevea brasiliensis]
          Length = 278

 Score =  239 bits (609), Expect = 2e-71
 Identities = 117/220 (53%), Positives = 162/220 (73%), Gaps = 2/220 (0%)
 Frame = -3

Query: 1275 IDKEGERARKSSKTD--AEETRKLLKALDVSKKDKINVFISKLLELDDNKEAVYGALDAW 1102
            ++ + +R  K   +D  A   +K     +VS+KDKIN  ++ LL+L D+KEAVYGALDAW
Sbjct: 49   LENQYDRKAKCENSDQNAGGVQKFRIGENVSRKDKINFLVNTLLDLKDSKEAVYGALDAW 108

Query: 1101 VAEEKEFPIGRLKTALITLERKEKWHKVVQVIKWMLSKGQGITVGTYGQLIRALDMDNRV 922
            VA E+ FPIG LK  L+TLE++++WH+VVQVIKWMLSKGQG T+GTYGQLI+ALD D+R 
Sbjct: 109  VAWERNFPIGSLKMVLLTLEKEQQWHRVVQVIKWMLSKGQGNTMGTYGQLIQALDKDHRA 168

Query: 921  EEAKKLWVRKLSRDVDTVPWKVCEIIVSVYHRNEMWEELATLYKKLEVHGRKVPDKYIAQ 742
            EEA   W++K+  D+ +VPW++C+ ++S+Y+RN M E L  L+K LE   RK P+K I Q
Sbjct: 169  EEAHVFWLKKIGTDLHSVPWQLCKCMISIYYRNNMLENLIKLFKGLEAFDRKPPEKSIVQ 228

Query: 741  KVADSYEKLGLVEEKERVMEKYKSLFIKTRGRYGRRPSSK 622
            KVAD+YE LG++EEKER+++KY  LF K + + G + S K
Sbjct: 229  KVADAYEMLGMLEEKERLLQKYNDLF-KEKEKEGLKKSRK 267