BLASTX nr result

ID: Cheilocostus21_contig00003980 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00003980
         (1978 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009416281.1| PREDICTED: pentatricopeptide repeat-containi...   290   3e-90
ref|XP_010937828.1| PREDICTED: pentatricopeptide repeat-containi...   244   9e-73
ref|XP_008780605.2| PREDICTED: pentatricopeptide repeat-containi...   235   5e-69
ref|XP_020574427.1| pentatricopeptide repeat-containing protein ...   203   5e-57
ref|XP_020696423.1| pentatricopeptide repeat-containing protein ...   201   3e-56
gb|PKA56205.1| Pentatricopeptide repeat-containing protein [Apos...   194   1e-53
gb|OAY82212.1| Pentatricopeptide repeat-containing protein [Anan...   189   7e-52
ref|XP_020087607.1| pentatricopeptide repeat-containing protein ...   186   1e-50
ref|XP_008783728.1| PREDICTED: uncharacterized protein LOC103702...   175   9e-47
gb|KMZ75150.1| Pentatricopeptide repeat-containing protein [Zost...   170   4e-45
ref|XP_010922535.1| PREDICTED: uncharacterized protein LOC105045...   169   2e-44
gb|PIA64861.1| hypothetical protein AQUCO_00100375v1 [Aquilegia ...   154   2e-39
ref|XP_010249644.1| PREDICTED: uncharacterized protein LOC104592...   154   4e-39
ref|XP_022033658.1| pentatricopeptide repeat-containing protein ...   152   2e-38
ref|XP_003632859.1| PREDICTED: uncharacterized protein LOC100251...   150   5e-38
ref|XP_024178616.1| protein THYLAKOID ASSEMBLY 8, chloroplastic ...   149   2e-37
gb|KVH97988.1| hypothetical protein Ccrd_023796 [Cynara carduncu...   148   4e-37
ref|XP_006850605.1| pentatricopeptide repeat-containing protein ...   148   9e-37
gb|PAN32358.1| hypothetical protein PAHAL_E04053 [Panicum hallii]     147   2e-36
gb|OVA20311.1| hypothetical protein BVC80_157g118 [Macleaya cord...   146   2e-36

>ref|XP_009416281.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g62350-like [Musa acuminata subsp. malaccensis]
 ref|XP_009416282.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g62350-like [Musa acuminata subsp. malaccensis]
          Length = 242

 Score =  290 bits (741), Expect = 3e-90
 Identities = 146/210 (69%), Positives = 170/210 (80%), Gaps = 1/210 (0%)
 Frame = +2

Query: 143 VSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSD 322
           ++ PV CGPRDNRGKL RGRTLSTE              G+EARV+RI+S D  RLIK+D
Sbjct: 33  LAAPVTCGPRDNRGKLLRGRTLSTEAILAVQALKRAAAAGDEARVHRIISVDLGRLIKAD 92

Query: 323 LLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADL 502
           LLAA+AELQRQ++W L+ KAFAAAR EPWYRTDLALYAEMVS+LARCG   E+DALVA L
Sbjct: 93  LLAALAELQRQNEWGLSSKAFAAARREPWYRTDLALYAEMVSSLARCGASDEIDALVACL 152

Query: 503 LK-EEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRR 679
           L+ EEGW+SS N K I R V+ALMAAE+ KL++DVY  LK GGFEPDEFLFK++IRGLRR
Sbjct: 153 LEDEEGWISSENTKEISRFVRALMAAEKAKLVRDVYGNLKSGGFEPDEFLFKFLIRGLRR 212

Query: 680 LGEDSSADEVERDFEAWDDCGSLPGEPLPV 769
           LGED++A+EVERDFE W +CGSLP EPLPV
Sbjct: 213 LGEDAAAEEVERDFEVWYECGSLPLEPLPV 242


>ref|XP_010937828.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870
           [Elaeis guineensis]
          Length = 234

 Score =  244 bits (622), Expect = 9e-73
 Identities = 126/215 (58%), Positives = 162/215 (75%), Gaps = 3/215 (1%)
 Frame = +2

Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313
           P   + P+ CGPRDNRG L+RGRTLSTE              G+E +V  IVS+  SRLI
Sbjct: 23  PKPHATPISCGPRDNRGPLRRGRTLSTEAILAIQALKRTR--GDEPKVEHIVSTTLSRLI 80

Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493
           K+DLLAA+AELQRQDQWRLA+  FAAAR EPWY+ D +LYA MVSTLARCG   E+D LV
Sbjct: 81  KADLLAALAELQRQDQWRLALTVFAAARREPWYKPDFSLYAAMVSTLARCGVAEEIDILV 140

Query: 494 ADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMI 664
           ++L KE   EG +SS +++G+ +L KAL+AA RGK+L++VY E+K+GG++PDE+LFK MI
Sbjct: 141 SNLFKEKEMEGGISSEDMRGLVQLSKALVAAGRGKVLREVYREIKRGGWDPDEYLFKLMI 200

Query: 665 RGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769
           RGLRRLGE  +ADEVE+D++ W + G++  EPLPV
Sbjct: 201 RGLRRLGEGEAADEVEKDYKIWFEGGAI-SEPLPV 234


>ref|XP_008780605.2| PREDICTED: pentatricopeptide repeat-containing protein
           At1g62350-like [Phoenix dactylifera]
          Length = 261

 Score =  235 bits (599), Expect = 5e-69
 Identities = 122/208 (58%), Positives = 156/208 (75%), Gaps = 3/208 (1%)
 Frame = +2

Query: 155 VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLLAA 334
           + CGPRDNRG L+RGRTLS+E              G+E +V  IVS+  SRLIK+DLLAA
Sbjct: 57  IFCGPRDNRGPLRRGRTLSSEAILAIQALKRAR--GDEHKVEHIVSTTLSRLIKADLLAA 114

Query: 335 IAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLKE- 511
           +AELQRQDQWRLA+  FAAA  EPWY+ D +LYA MVSTLARCG   E+D LV++LLKE 
Sbjct: 115 LAELQRQDQWRLALTVFAAAGREPWYKPDFSLYAAMVSTLARCGVAEEIDVLVSNLLKEK 174

Query: 512 --EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRRLG 685
             EG +S  +++G+ +L KAL+AA RGK+L+D+Y E+K+GG +PDE+LFK MIRGLR LG
Sbjct: 175 EMEGGISLEDIRGLTQLSKALVAAGRGKVLRDIYREIKRGGCDPDEYLFKLMIRGLRSLG 234

Query: 686 EDSSADEVERDFEAWDDCGSLPGEPLPV 769
           E  +ADEVE+D+E W + G+L  +PLPV
Sbjct: 235 EGEAADEVEKDYEVWFEGGALT-QPLPV 261


>ref|XP_020574427.1| pentatricopeptide repeat-containing protein At3g46870-like
           [Phalaenopsis equestris]
          Length = 252

 Score =  203 bits (516), Expect = 5e-57
 Identities = 108/205 (52%), Positives = 145/205 (70%), Gaps = 6/205 (2%)
 Frame = +2

Query: 134 PTAVSIP---VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFS 304
           P   S P   +V GPRDNR  L+RGRTLS+E               +EA VNRI+S+  +
Sbjct: 38  PNPTSSPASIIVSGPRDNRQPLRRGRTLSSEAILTVQALKRAR--NDEAEVNRIISTSVA 95

Query: 305 RLIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVD 484
           RLI++DLLAA+ ELQRQDQW+LA+K F AAR E WYR D ALYA+MV +L R     E+D
Sbjct: 96  RLIRADLLAALTELQRQDQWQLALKVFEAARRENWYRIDCALYADMVGSLTRSKTDSEID 155

Query: 485 ALVADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFK 655
            L+ +LL+E    G V++ +++G  RLVKAL+AA +GK +KDVY  +K+GG EP+E+LFK
Sbjct: 156 LLMVELLEELQLGGGVAAGDLRGPARLVKALVAAGKGKAVKDVYKMMKRGGCEPNEYLFK 215

Query: 656 YMIRGLRRLGEDSSADEVERDFEAW 730
           +MI+GLRRLGE+ +A E+E+D+E W
Sbjct: 216 FMIKGLRRLGEEDAACEIEKDYELW 240


>ref|XP_020696423.1| pentatricopeptide repeat-containing protein At3g46870-like
           [Dendrobium catenatum]
 ref|XP_020696424.1| pentatricopeptide repeat-containing protein At3g46870-like
           [Dendrobium catenatum]
 ref|XP_020696425.1| pentatricopeptide repeat-containing protein At3g46870-like
           [Dendrobium catenatum]
 gb|PKU76528.1| Pentatricopeptide repeat-containing protein [Dendrobium catenatum]
          Length = 251

 Score =  201 bits (510), Expect = 3e-56
 Identities = 111/216 (51%), Positives = 148/216 (68%), Gaps = 4/216 (1%)
 Frame = +2

Query: 134 PTAVSIPV-VCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRL 310
           PT+ S  + V GPRDNR  ++RGRTLSTE               +EA V RIVS+  +RL
Sbjct: 38  PTSSSTSIIVSGPRDNRQPIRRGRTLSTEAILTVQALKRAR--NDEAAVERIVSTSVARL 95

Query: 311 IKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDAL 490
           I++DLLAA+ ELQRQDQW+LA+K F  AR E WYRTD ALYA+MVS+L R     E+D L
Sbjct: 96  IRADLLAALTELQRQDQWQLALKLFEVARRENWYRTDCALYADMVSSLTRSKRDSEIDIL 155

Query: 491 VADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYM 661
           + +L++E    G   S +++G  RLVKAL+AA +GK +KDVY  +K+GG EP+E+LFK+M
Sbjct: 156 MVELMEELQMGGGALSGDLRGPARLVKALVAAGKGKAVKDVYEMMKRGGCEPNEYLFKFM 215

Query: 662 IRGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769
           I+GLR LGE+ +A E+E+D+E W   G    EPL +
Sbjct: 216 IKGLRGLGEEDAACEIEKDYELW-GFGGFGAEPLAI 250


>gb|PKA56205.1| Pentatricopeptide repeat-containing protein [Apostasia shenzhenica]
          Length = 256

 Score =  194 bits (493), Expect = 1e-53
 Identities = 106/214 (49%), Positives = 145/214 (67%), Gaps = 3/214 (1%)
 Frame = +2

Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313
           P+ ++   V GPRDNR  ++RGRTLS+E               ++  V+R+VS+  +RLI
Sbjct: 45  PSYLASTTVSGPRDNRQPIRRGRTLSSEAILAVQALKRSRR--DDMAVDRVVSTSVARLI 102

Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493
           K+DLLAA+ EL RQ+QWRLA+K F AAR E WYR D  LYA+MVS LAR G   E+  L+
Sbjct: 103 KADLLAALGELLRQNQWRLALKVFDAARREEWYRNDCGLYADMVSALARSGIDSEIAPLM 162

Query: 494 ADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMI 664
           A+L+KE    G +   +++G  RLVKAL+AA +G+ +KDVY  +K+G  +P+EFLFK++I
Sbjct: 163 AELMKELEKIGGIVEGDLRGPARLVKALLAAGKGEAVKDVYMMMKRGSCQPNEFLFKFII 222

Query: 665 RGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLP 766
           +GLR LG++  A EVE+DFE W   GS   EPLP
Sbjct: 223 KGLRGLGKEDMASEVEKDFELWVH-GSFGMEPLP 255


>gb|OAY82212.1| Pentatricopeptide repeat-containing protein [Ananas comosus]
          Length = 241

 Score =  189 bits (479), Expect = 7e-52
 Identities = 110/220 (50%), Positives = 145/220 (65%), Gaps = 7/220 (3%)
 Frame = +2

Query: 131 APTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVN-RIVSSDFSR 307
           +P+  S+ + CGPRDNRG LQRGRTLSTE              G+ A  +    ++   R
Sbjct: 23  SPSTRSLAITCGPRDNRGPLQRGRTLSTEAILAVQALKRAALSGDGAVPSPAAAAAALGR 82

Query: 308 LIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRT-DLALYAEMVSTLARCGFPGE-V 481
           L+K DLLAA+AELQRQ +WRLA+  FAAAR E WY   D ALYAEM S +AR G   E +
Sbjct: 83  LLKPDLLAALAELQRQGRWRLALVVFAAARRETWYTNPDFALYAEMASAMARGGAAAEEI 142

Query: 482 DALVADLLKEE----GWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFL 649
           DALVA+LL+E+    G+  S +V  + RLV+ L+AA RG+ ++D+Y  +K+GG   DE+L
Sbjct: 143 DALVAELLEEKEGSGGFSPSDDVWKLTRLVRVLIAAGRGEAVRDLYKRMKRGGCVGDEYL 202

Query: 650 FKYMIRGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769
           F+ +IRGLRRLGE  +A EVERDF+ W + G +  E LPV
Sbjct: 203 FRVLIRGLRRLGEGEAAGEVERDFDEWYE-GGIITETLPV 241


>ref|XP_020087607.1| pentatricopeptide repeat-containing protein At3g46870-like [Ananas
           comosus]
          Length = 241

 Score =  186 bits (471), Expect = 1e-50
 Identities = 108/220 (49%), Positives = 144/220 (65%), Gaps = 7/220 (3%)
 Frame = +2

Query: 131 APTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVN-RIVSSDFSR 307
           +P+  S+ + CGPRDNRG LQRGRTLSTE              G+ A  +    ++   R
Sbjct: 23  SPSTRSLAITCGPRDNRGPLQRGRTLSTEAILAVQALKRAALSGDGAVPSPAAAAAALGR 82

Query: 308 LIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRT-DLALYAEMVSTLAR-CGFPGEV 481
           L+K DLLAA+AELQRQ +WRLA+  FAAAR E WY   D +LYAEM S +AR      E+
Sbjct: 83  LLKPDLLAALAELQRQGRWRLALVVFAAARRETWYTNPDFSLYAEMASAMARGAAAAEEI 142

Query: 482 DALVADLLKEE----GWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFL 649
           DALVA+LL+E+    G+  S +V  + RLV+ L+AA RG+ ++D+Y  +K+GG   DE+L
Sbjct: 143 DALVAELLEEKEGSGGFSPSDDVWKLTRLVRVLIAAGRGEAVRDLYKRMKRGGCVGDEYL 202

Query: 650 FKYMIRGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769
           F+ +IRGLRRLGE  +A EVERDF+ W + G +  E LPV
Sbjct: 203 FRVLIRGLRRLGEGEAAGEVERDFDEWYE-GGIITETLPV 241


>ref|XP_008783728.1| PREDICTED: uncharacterized protein LOC103702875 [Phoenix
           dactylifera]
          Length = 238

 Score =  175 bits (443), Expect = 9e-47
 Identities = 98/200 (49%), Positives = 136/200 (68%), Gaps = 3/200 (1%)
 Frame = +2

Query: 155 VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLLAA 334
           + CGPRD+R  L RGRTLS E              G++++V ++VS+ F RL+K DL+AA
Sbjct: 34  ISCGPRDHRWPLLRGRTLSAEAILAVQALKRAR--GDDSKVEQVVSTAFIRLLKPDLVAA 91

Query: 335 IAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL--K 508
           +AEL+RQ QWRLA K F AAR +   + D +LYAEMV+T+AR G   E+   V+DLL  +
Sbjct: 92  LAELRRQGQWRLAGKVFVAARKDFSSKPDYSLYAEMVATMARNGMREEIGLSVSDLLAAR 151

Query: 509 EEG-WVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRRLG 685
           E+G   S+ +++G+ R+ KAL+ A  GK +K++Y ELK+    PDEFLFK +I+GLR +G
Sbjct: 152 EKGDGFSADDLRGLARVFKALIGAGSGKAVKNMYRELKRRNCVPDEFLFKDLIKGLRGMG 211

Query: 686 EDSSADEVERDFEAWDDCGS 745
           E  +ADEVERDFE W + GS
Sbjct: 212 EGEAADEVERDFEVWSNGGS 231


>gb|KMZ75150.1| Pentatricopeptide repeat-containing protein [Zostera marina]
          Length = 234

 Score =  170 bits (431), Expect = 4e-45
 Identities = 91/197 (46%), Positives = 127/197 (64%), Gaps = 3/197 (1%)
 Frame = +2

Query: 149 IPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLL 328
           + V CGPR NR +L RGRTLSTE              G++A+V+ IVS+   RLIKSDLL
Sbjct: 33  LSVTCGPRGNRSQLVRGRTLSTEAIHAVQALKRAS--GDDAKVDSIVSTSVVRLIKSDLL 90

Query: 329 AAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL- 505
           AA+ ELQRQ+QW++A+K F A R E W   D  LYA+M+  L R G   E+D ++ DLL 
Sbjct: 91  AALKELQRQEQWKIALKVFVAMRKEHWRNVDYGLYADMIIALGRNGMTNEIDTMIGDLLE 150

Query: 506 --KEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRR 679
             K+EG +S  +++G+ RL+KA++ A +G   K +Y  +K G    DE+LFK + +GL+R
Sbjct: 151 EIKKEG-ISGDDLRGLARLLKAVIGAGKGNAAKMLYKAMKIGDCVGDEYLFKILSKGLKR 209

Query: 680 LGEDSSADEVERDFEAW 730
           LGE  +A EV+RD+  W
Sbjct: 210 LGESEAAVEVDRDYAIW 226


>ref|XP_010922535.1| PREDICTED: uncharacterized protein LOC105045819 [Elaeis guineensis]
          Length = 238

 Score =  169 bits (427), Expect = 2e-44
 Identities = 98/207 (47%), Positives = 133/207 (64%), Gaps = 3/207 (1%)
 Frame = +2

Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313
           PT  SI   CGPRD+R  L RGRTLSTE               ++++V ++VS+ F RL+
Sbjct: 29  PTLFSIS--CGPRDHRWPLLRGRTLSTEAILAVQALKRALD--DDSKVEQVVSTAFVRLL 84

Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493
           K DL+AA+AEL+RQ QWRLA K F AAR E     D +LYA MV+ +AR G   E+  LV
Sbjct: 85  KPDLVAALAELRRQGQWRLAGKVFVAARKEFSSNPDYSLYAAMVAAMARNGMREEIGLLV 144

Query: 494 ADLLKEE---GWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMI 664
           +DLL E       S+ +++G+ R+V+AL+    GK +K +Y E+K+    P+EFLFK ++
Sbjct: 145 SDLLAEREKGDGFSADDIRGLARVVQALIGVGSGKAVKYMYREMKRRNCVPNEFLFKDLM 204

Query: 665 RGLRRLGEDSSADEVERDFEAWDDCGS 745
           RGLR LGE  +AD+VERDFE W + GS
Sbjct: 205 RGLRGLGEGEAADDVERDFEVWSNGGS 231


>gb|PIA64861.1| hypothetical protein AQUCO_00100375v1 [Aquilegia coerulea]
          Length = 232

 Score =  154 bits (390), Expect = 2e-39
 Identities = 87/199 (43%), Positives = 130/199 (65%), Gaps = 4/199 (2%)
 Frame = +2

Query: 137 TAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIK 316
           T  ++ + CGPRDNRG +Q+GR LS E               +E ++  ++S   +RL+K
Sbjct: 24  TRRNVTIRCGPRDNRGPIQKGRILSIEAIQAIQALKRAK--SDETKLATLISKTLTRLVK 81

Query: 317 SDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVA 496
           +DL+A++ EL RQ+Q  LA+K F+  RSE WY+TD +LYA MVS LAR G   +++ L+ 
Sbjct: 82  NDLIASLNELLRQNQCDLALKVFSTVRSELWYKTDWSLYANMVSGLARNGMSEDINRLML 141

Query: 497 DLLKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYMI 664
           D L+EEG V + +  GI RL+KAL+AAE  + +  +Y  +KKGG+    E DE++ K + 
Sbjct: 142 D-LEEEGLVKN-DSNGISRLLKALIAAEMNECVVRIYGIMKKGGWECKDEKDEYVVKVLN 199

Query: 665 RGLRRLGEDSSADEVERDF 721
           RGLRRLGE+  ADEV++++
Sbjct: 200 RGLRRLGEEEVADEVQKEY 218


>ref|XP_010249644.1| PREDICTED: uncharacterized protein LOC104592133 [Nelumbo nucifera]
 ref|XP_010249645.1| PREDICTED: uncharacterized protein LOC104592133 [Nelumbo nucifera]
          Length = 237

 Score =  154 bits (389), Expect = 4e-39
 Identities = 85/197 (43%), Positives = 128/197 (64%), Gaps = 4/197 (2%)
 Frame = +2

Query: 146 SIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDL 325
           ++ + CGPRDNRG + +GR LSTE               +EA+V+ +VS   SRLIK+DL
Sbjct: 28  AVVIRCGPRDNRGPIVKGRVLSTEAIHAVQALKRAQR-ADEAKVDELVSRALSRLIKADL 86

Query: 326 LAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL 505
           LA++ EL RQDQ  LA+K F+A RSE WY+TD +LYA++V+ LAR G   E+D L++++ 
Sbjct: 87  LASLGELLRQDQCHLALKVFSAVRSELWYKTDCSLYADVVAALARNGMSEEIDRLISEM- 145

Query: 506 KEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYMIRGL 673
            EE  +   + +G+ RL+KAL+AAER +    +Y  +K+ G+      DE++ K + RG+
Sbjct: 146 -EEEGLGGSDGRGLSRLIKALVAAERVESTVGIYRLMKRAGWGSSPTADEYVVKVLSRGM 204

Query: 674 RRLGEDSSADEVERDFE 724
           RRLG+   ADE++  F+
Sbjct: 205 RRLGKRDVADELDSAFD 221


>ref|XP_022033658.1| pentatricopeptide repeat-containing protein At3g46870 [Helianthus
           annuus]
 gb|OTG27087.1| hypothetical protein HannXRQ_Chr04g0096241 [Helianthus annuus]
          Length = 229

 Score =  152 bits (383), Expect = 2e-38
 Identities = 88/201 (43%), Positives = 121/201 (60%), Gaps = 4/201 (1%)
 Frame = +2

Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313
           P     P+ CGPRDNRG L +GRTLS E               ++   N  VS   SRL+
Sbjct: 21  PRCHHTPIRCGPRDNRGPLYKGRTLSIEAIQAVQSLKRSHR--SDPTNNDTVSRTLSRLV 78

Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493
           KSDL+AA  EL RQDQ+ LAVK F+A RSE WY+TDL LYA++VS +A  G   E+D L+
Sbjct: 79  KSDLIAAFNELIRQDQFDLAVKVFSAIRSEDWYKTDLNLYAKLVSAMASKGMTDEIDRLM 138

Query: 494 ADLLKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYM 661
            D+  E   V S   KG+  ++KAL+AA+R +    +Y  +K GG+      D+++ K +
Sbjct: 139 CDV--EPADVVSAEGKGLVTVIKALLAADRAESTVRIYEMMKAGGWRCNSSADDYVGKVL 196

Query: 662 IRGLRRLGEDSSADEVERDFE 724
            RGLRRLG++  ADE++ + E
Sbjct: 197 SRGLRRLGKNKVADEIDLEIE 217


>ref|XP_003632859.1| PREDICTED: uncharacterized protein LOC100251441 [Vitis vinifera]
 emb|CBI36107.3| unnamed protein product, partial [Vitis vinifera]
          Length = 230

 Score =  150 bits (380), Expect = 5e-38
 Identities = 89/191 (46%), Positives = 122/191 (63%), Gaps = 1/191 (0%)
 Frame = +2

Query: 152 PVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLLA 331
           P+ CGPRDNRG L +GR LS E              G+  +++  +S   SRL+K+DLLA
Sbjct: 30  PIRCGPRDNRGPLMKGRVLSIEAIQAIQSLKRAHR-GDPTKIDDFLSKTLSRLVKADLLA 88

Query: 332 AIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLKE 511
            + EL RQDQ  LA++ F+A RSE WY+T+L+LYA++VS LAR G   E+D L+ D L+ 
Sbjct: 89  TLNELLRQDQCDLALRVFSAVRSELWYKTELSLYADLVSALARKGMKEEIDRLICD-LEG 147

Query: 512 EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF-EPDEFLFKYMIRGLRRLGE 688
           EG V   + KGI RLVKA++AAER      +Y  +K+ G    DE++ + + RGLRRLGE
Sbjct: 148 EGSV-RCDDKGIVRLVKAVIAAERRDSTVRIYGLMKRSGCGGGDEYVGRVLSRGLRRLGE 206

Query: 689 DSSADEVERDF 721
              ADEV+ +F
Sbjct: 207 LGVADEVDLEF 217


>ref|XP_024178616.1| protein THYLAKOID ASSEMBLY 8, chloroplastic [Rosa chinensis]
 gb|PRQ50986.1| hypothetical protein RchiOBHm_Chr2g0139281 [Rosa chinensis]
          Length = 237

 Score =  149 bits (376), Expect = 2e-37
 Identities = 90/204 (44%), Positives = 123/204 (60%), Gaps = 8/204 (3%)
 Frame = +2

Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNE---ARVNRIVSSDFS 304
           PT V +PV CGPRD RG L +GR LS E                +   + +  +VS   S
Sbjct: 28  PTVVRVPVRCGPRDKRGPLVKGRVLSIEAIQAVQALKRAQRSDPDPDPSHLPALVSKTLS 87

Query: 305 RLIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVD 484
           RLIKSDL+AA+ EL RQDQ  LA++AF+A RSE  Y+ DL++YAE+   LAR G   E+D
Sbjct: 88  RLIKSDLVAALKELLRQDQCHLALQAFSAFRSE--YQPDLSVYAEVALALARNGMVEEID 145

Query: 485 ALVADLLKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGG-----FEPDEFL 649
            LV +L KE G V   + KG+ RL+KA++ A+R +    +Y  LK+ G     F+ DE++
Sbjct: 146 TLVCELEKESGGVQWDSDKGLIRLIKAVIGADRRESTVRIYEVLKRKGWGSSSFKADEYM 205

Query: 650 FKYMIRGLRRLGEDSSADEVERDF 721
            + + +GLRRLGE   ADEV+  F
Sbjct: 206 VRVLSKGLRRLGEAELADEVDVKF 229


>gb|KVH97988.1| hypothetical protein Ccrd_023796 [Cynara cardunculus var. scolymus]
          Length = 235

 Score =  148 bits (374), Expect = 4e-37
 Identities = 84/194 (43%), Positives = 122/194 (62%), Gaps = 4/194 (2%)
 Frame = +2

Query: 149 IPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLL 328
           + + CGPRDNRG L +GRTLS E               ++   N  VS   SRL+KSD++
Sbjct: 32  LTIRCGPRDNRGPLHKGRTLSIEAIQAVQSLKRSHR--SDPANNDAVSKTLSRLVKSDVV 89

Query: 329 AAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLK 508
           AA  EL RQDQ+ LA+K F+A RSE WY+T+L+LYA++VS++A  G   ++D L+ D+  
Sbjct: 90  AAFNELIRQDQFDLALKVFSAIRSEDWYKTELSLYAKLVSSMASKGMADDIDRLILDVEP 149

Query: 509 EEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYMIRGLR 676
           E   V S + KG+  L+KAL+AA+R +    +Y  +K GG+      D++L K + RGLR
Sbjct: 150 EA--VISADSKGLITLIKALIAADRAESTVVIYEMMKAGGWGCNSVTDDYLGKVLSRGLR 207

Query: 677 RLGEDSSADEVERD 718
           RLG+   ADE++R+
Sbjct: 208 RLGKKKVADEIDRE 221


>ref|XP_006850605.1| pentatricopeptide repeat-containing protein At1g62350 [Amborella
           trichopoda]
 gb|ERN12186.1| hypothetical protein AMTR_s00034p00130400 [Amborella trichopoda]
          Length = 250

 Score =  148 bits (373), Expect = 9e-37
 Identities = 87/204 (42%), Positives = 119/204 (58%), Gaps = 1/204 (0%)
 Frame = +2

Query: 146 SIPVVCGPRD-NRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSD 322
           S  + CGPRD NRG L RGR LSTE                  R   +++   SRL+K+D
Sbjct: 51  STGIWCGPRDQNRGPLARGRLLSTEAMLAIQSLK---------RSPNLLAQTTSRLLKAD 101

Query: 323 LLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADL 502
           LLA + ELQRQDQ  LA++ F   R E WY+TD  LYAEMV+ L+R G   E+D+L+AD 
Sbjct: 102 LLAVLKELQRQDQCHLALQVFGVVRKEVWYKTDFGLYAEMVTALSRNGMTEEIDSLIADA 161

Query: 503 LKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRRL 682
           L+++      + +GI RLV+AL+ A   +    +Y   K  GF PD+FLF+ +IRGL+RL
Sbjct: 162 LQDK---FETDNRGIARLVRALIGAGNAEGAVSIYEMTKGSGFLPDDFLFRVLIRGLKRL 218

Query: 683 GEDSSADEVERDFEAWDDCGSLPG 754
           G+ + A +V  DF  +     L G
Sbjct: 219 GKQAHAAKVMDDFREFSKKSVLVG 242


>gb|PAN32358.1| hypothetical protein PAHAL_E04053 [Panicum hallii]
          Length = 256

 Score =  147 bits (371), Expect = 2e-36
 Identities = 91/206 (44%), Positives = 120/206 (58%), Gaps = 7/206 (3%)
 Frame = +2

Query: 155 VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNE-ARVNRIVSSDFSRLIKSDLLA 331
           + CGPRDNRG LQRGR+LSTE                  A  +   SS   RL+K+DL+A
Sbjct: 31  ITCGPRDNRGPLQRGRSLSTEAILAIQSLKRLTAADRSPAAASAAASSALGRLLKADLVA 90

Query: 332 AIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLKE 511
           AIAELQRQ  W LA+ A   AR+EPWYR D ALYA  VS+ A       VDALV   L+E
Sbjct: 91  AIAELQRQGHWSLALAALHVARAEPWYRPDPALYATFVSS-APAASDDAVDALVEAFLEE 149

Query: 512 E----GWV-SSVNVKGIFRLVKALMAAERGKLLKDVY-SELKKGGFEPDEFLFKYMIRGL 673
           +    G+V    +V  + RL++AL+A  RG+    VY + +++GG + DE++++ M RG+
Sbjct: 150 KARGGGFVDGEEDVYKLTRLLRALVAKGRGRAAWKVYEAAVRRGGLDVDEYVYRVMARGM 209

Query: 674 RRLGEDSSADEVERDFEAWDDCGSLP 751
           RRLG D  A E E D   W+   S P
Sbjct: 210 RRLGLDEEASEAEADLAEWEATISPP 235


>gb|OVA20311.1| hypothetical protein BVC80_157g118 [Macleaya cordata]
          Length = 238

 Score =  146 bits (369), Expect = 2e-36
 Identities = 82/202 (40%), Positives = 124/202 (61%), Gaps = 5/202 (2%)
 Frame = +2

Query: 146 SIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDL 325
           ++ + CGPR+NRG L +GR LSTE              G+E ++N+++S   +RLIK+DL
Sbjct: 33  NVTIRCGPRNNRGPLVKGRILSTEAMQAVQALKRAK--GDETKINQLISKTLTRLIKNDL 90

Query: 326 LAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL 505
           LA++ EL RQ    LA+K F A RS+ W + D +LYA++V  L +CG   ++D L+ DL 
Sbjct: 91  LASLNELLRQGHCELALKVFCAVRSDIWSKIDCSLYADLVLALTKCGMLEDIDRLICDL- 149

Query: 506 KEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF-----EPDEFLFKYMIRG 670
            E   +     +G+ RL+KAL+AAER + +  VY  ++ GG+       DE++ K + RG
Sbjct: 150 -EGEVLVGGGDRGLSRLIKALIAAERTESVVRVYGLMRDGGWGSVGSHVDEYVVKILSRG 208

Query: 671 LRRLGEDSSADEVERDFEAWDD 736
           LRRLGE   ADEV+R F ++ +
Sbjct: 209 LRRLGESGVADEVDRKFGSFSN 230


Top