BLASTX nr result
ID: Cheilocostus21_contig00003980
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00003980
         (1978 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009416281.1| PREDICTED: pentatricopeptide repeat-containi...   290   3e-90
ref|XP_010937828.1| PREDICTED: pentatricopeptide repeat-containi...   244   9e-73
ref|XP_008780605.2| PREDICTED: pentatricopeptide repeat-containi...   235   5e-69
ref|XP_020574427.1| pentatricopeptide repeat-containing protein ...   203   5e-57
ref|XP_020696423.1| pentatricopeptide repeat-containing protein ...   201   3e-56
gb|PKA56205.1| Pentatricopeptide repeat-containing protein [Apos...   194   1e-53
gb|OAY82212.1| Pentatricopeptide repeat-containing protein [Anan...   189   7e-52
ref|XP_020087607.1| pentatricopeptide repeat-containing protein ...   186   1e-50
ref|XP_008783728.1| PREDICTED: uncharacterized protein LOC103702...   175   9e-47
gb|KMZ75150.1| Pentatricopeptide repeat-containing protein [Zost...   170   4e-45
ref|XP_010922535.1| PREDICTED: uncharacterized protein LOC105045...   169   2e-44
gb|PIA64861.1| hypothetical protein AQUCO_00100375v1 [Aquilegia ...   154   2e-39
ref|XP_010249644.1| PREDICTED: uncharacterized protein LOC104592...   154   4e-39
ref|XP_022033658.1| pentatricopeptide repeat-containing protein ...   152   2e-38
ref|XP_003632859.1| PREDICTED: uncharacterized protein LOC100251...   150   5e-38
ref|XP_024178616.1| protein THYLAKOID ASSEMBLY 8, chloroplastic ...   149   2e-37
gb|KVH97988.1| hypothetical protein Ccrd_023796 [Cynara carduncu...
148 4e-37 ref|XP_006850605.1| pentatricopeptide repeat-containing protein ... 148 9e-37 gb|PAN32358.1| hypothetical protein PAHAL_E04053 [Panicum hallii] 147 2e-36 gb|OVA20311.1| hypothetical protein BVC80_157g118 [Macleaya cord... 146 2e-36 >ref|XP_009416281.1| PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Musa acuminata subsp. malaccensis] ref|XP_009416282.1| PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Musa acuminata subsp. malaccensis] Length = 242 Score = 290 bits (741), Expect = 3e-90 Identities = 146/210 (69%), Positives = 170/210 (80%), Gaps = 1/210 (0%) Frame = +2 Query: 143 VSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSD 322 ++ PV CGPRDNRGKL RGRTLSTE G+EARV+RI+S D RLIK+D Sbjct: 33 LAAPVTCGPRDNRGKLLRGRTLSTEAILAVQALKRAAAAGDEARVHRIISVDLGRLIKAD 92 Query: 323 LLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADL 502 LLAA+AELQRQ++W L+ KAFAAAR EPWYRTDLALYAEMVS+LARCG E+DALVA L Sbjct: 93 LLAALAELQRQNEWGLSSKAFAAARREPWYRTDLALYAEMVSSLARCGASDEIDALVACL 152 Query: 503 LK-EEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRR 679 L+ EEGW+SS N K I R V+ALMAAE+ KL++DVY LK GGFEPDEFLFK++IRGLRR Sbjct: 153 LEDEEGWISSENTKEISRFVRALMAAEKAKLVRDVYGNLKSGGFEPDEFLFKFLIRGLRR 212 Query: 680 LGEDSSADEVERDFEAWDDCGSLPGEPLPV 769 LGED++A+EVERDFE W +CGSLP EPLPV Sbjct: 213 LGEDAAAEEVERDFEVWYECGSLPLEPLPV 242 >ref|XP_010937828.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46870 [Elaeis guineensis] Length = 234 Score = 244 bits (622), Expect = 9e-73 Identities = 126/215 (58%), Positives = 162/215 (75%), Gaps = 3/215 (1%) Frame = +2 Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313 P + P+ CGPRDNRG L+RGRTLSTE G+E +V IVS+ SRLI Sbjct: 23 PKPHATPISCGPRDNRGPLRRGRTLSTEAILAIQALKRTR--GDEPKVEHIVSTTLSRLI 80 Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493 K+DLLAA+AELQRQDQWRLA+ FAAAR EPWY+ D +LYA MVSTLARCG E+D LV Sbjct: 81 
KADLLAALAELQRQDQWRLALTVFAAARREPWYKPDFSLYAAMVSTLARCGVAEEIDILV 140 Query: 494 ADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMI 664 ++L KE EG +SS +++G+ +L KAL+AA RGK+L++VY E+K+GG++PDE+LFK MI Sbjct: 141 SNLFKEKEMEGGISSEDMRGLVQLSKALVAAGRGKVLREVYREIKRGGWDPDEYLFKLMI 200 Query: 665 RGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769 RGLRRLGE +ADEVE+D++ W + G++ EPLPV Sbjct: 201 RGLRRLGEGEAADEVEKDYKIWFEGGAI-SEPLPV 234 >ref|XP_008780605.2| PREDICTED: pentatricopeptide repeat-containing protein At1g62350-like [Phoenix dactylifera] Length = 261 Score = 235 bits (599), Expect = 5e-69 Identities = 122/208 (58%), Positives = 156/208 (75%), Gaps = 3/208 (1%) Frame = +2 Query: 155 VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLLAA 334 + CGPRDNRG L+RGRTLS+E G+E +V IVS+ SRLIK+DLLAA Sbjct: 57 IFCGPRDNRGPLRRGRTLSSEAILAIQALKRAR--GDEHKVEHIVSTTLSRLIKADLLAA 114 Query: 335 IAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLKE- 511 +AELQRQDQWRLA+ FAAA EPWY+ D +LYA MVSTLARCG E+D LV++LLKE Sbjct: 115 LAELQRQDQWRLALTVFAAAGREPWYKPDFSLYAAMVSTLARCGVAEEIDVLVSNLLKEK 174 Query: 512 --EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRRLG 685 EG +S +++G+ +L KAL+AA RGK+L+D+Y E+K+GG +PDE+LFK MIRGLR LG Sbjct: 175 EMEGGISLEDIRGLTQLSKALVAAGRGKVLRDIYREIKRGGCDPDEYLFKLMIRGLRSLG 234 Query: 686 EDSSADEVERDFEAWDDCGSLPGEPLPV 769 E +ADEVE+D+E W + G+L +PLPV Sbjct: 235 EGEAADEVEKDYEVWFEGGALT-QPLPV 261 >ref|XP_020574427.1| pentatricopeptide repeat-containing protein At3g46870-like [Phalaenopsis equestris] Length = 252 Score = 203 bits (516), Expect = 5e-57 Identities = 108/205 (52%), Positives = 145/205 (70%), Gaps = 6/205 (2%) Frame = +2 Query: 134 PTAVSIP---VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFS 304 P S P +V GPRDNR L+RGRTLS+E +EA VNRI+S+ + Sbjct: 38 PNPTSSPASIIVSGPRDNRQPLRRGRTLSSEAILTVQALKRAR--NDEAEVNRIISTSVA 95 Query: 305 RLIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVD 484 RLI++DLLAA+ ELQRQDQW+LA+K F AAR E WYR D ALYA+MV +L R E+D Sbjct: 96 
RLIRADLLAALTELQRQDQWQLALKVFEAARRENWYRIDCALYADMVGSLTRSKTDSEID 155 Query: 485 ALVADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFK 655 L+ +LL+E G V++ +++G RLVKAL+AA +GK +KDVY +K+GG EP+E+LFK Sbjct: 156 LLMVELLEELQLGGGVAAGDLRGPARLVKALVAAGKGKAVKDVYKMMKRGGCEPNEYLFK 215 Query: 656 YMIRGLRRLGEDSSADEVERDFEAW 730 +MI+GLRRLGE+ +A E+E+D+E W Sbjct: 216 FMIKGLRRLGEEDAACEIEKDYELW 240 >ref|XP_020696423.1| pentatricopeptide repeat-containing protein At3g46870-like [Dendrobium catenatum] ref|XP_020696424.1| pentatricopeptide repeat-containing protein At3g46870-like [Dendrobium catenatum] ref|XP_020696425.1| pentatricopeptide repeat-containing protein At3g46870-like [Dendrobium catenatum] gb|PKU76528.1| Pentatricopeptide repeat-containing protein [Dendrobium catenatum] Length = 251 Score = 201 bits (510), Expect = 3e-56 Identities = 111/216 (51%), Positives = 148/216 (68%), Gaps = 4/216 (1%) Frame = +2 Query: 134 PTAVSIPV-VCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRL 310 PT+ S + V GPRDNR ++RGRTLSTE +EA V RIVS+ +RL Sbjct: 38 PTSSSTSIIVSGPRDNRQPIRRGRTLSTEAILTVQALKRAR--NDEAAVERIVSTSVARL 95 Query: 311 IKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDAL 490 I++DLLAA+ ELQRQDQW+LA+K F AR E WYRTD ALYA+MVS+L R E+D L Sbjct: 96 IRADLLAALTELQRQDQWQLALKLFEVARRENWYRTDCALYADMVSSLTRSKRDSEIDIL 155 Query: 491 VADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYM 661 + +L++E G S +++G RLVKAL+AA +GK +KDVY +K+GG EP+E+LFK+M Sbjct: 156 MVELMEELQMGGGALSGDLRGPARLVKALVAAGKGKAVKDVYEMMKRGGCEPNEYLFKFM 215 Query: 662 IRGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769 I+GLR LGE+ +A E+E+D+E W G EPL + Sbjct: 216 IKGLRGLGEEDAACEIEKDYELW-GFGGFGAEPLAI 250 >gb|PKA56205.1| Pentatricopeptide repeat-containing protein [Apostasia shenzhenica] Length = 256 Score = 194 bits (493), Expect = 1e-53 Identities = 106/214 (49%), Positives = 145/214 (67%), Gaps = 3/214 (1%) Frame = +2 Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313 P+ ++ V GPRDNR ++RGRTLS+E ++ V+R+VS+ +RLI Sbjct: 45 
PSYLASTTVSGPRDNRQPIRRGRTLSSEAILAVQALKRSRR--DDMAVDRVVSTSVARLI 102 Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493 K+DLLAA+ EL RQ+QWRLA+K F AAR E WYR D LYA+MVS LAR G E+ L+ Sbjct: 103 KADLLAALGELLRQNQWRLALKVFDAARREEWYRNDCGLYADMVSALARSGIDSEIAPLM 162 Query: 494 ADLLKE---EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMI 664 A+L+KE G + +++G RLVKAL+AA +G+ +KDVY +K+G +P+EFLFK++I Sbjct: 163 AELMKELEKIGGIVEGDLRGPARLVKALLAAGKGEAVKDVYMMMKRGSCQPNEFLFKFII 222 Query: 665 RGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLP 766 +GLR LG++ A EVE+DFE W GS EPLP Sbjct: 223 KGLRGLGKEDMASEVEKDFELWVH-GSFGMEPLP 255 >gb|OAY82212.1| Pentatricopeptide repeat-containing protein [Ananas comosus] Length = 241 Score = 189 bits (479), Expect = 7e-52 Identities = 110/220 (50%), Positives = 145/220 (65%), Gaps = 7/220 (3%) Frame = +2 Query: 131 APTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVN-RIVSSDFSR 307 +P+ S+ + CGPRDNRG LQRGRTLSTE G+ A + ++ R Sbjct: 23 SPSTRSLAITCGPRDNRGPLQRGRTLSTEAILAVQALKRAALSGDGAVPSPAAAAAALGR 82 Query: 308 LIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRT-DLALYAEMVSTLARCGFPGE-V 481 L+K DLLAA+AELQRQ +WRLA+ FAAAR E WY D ALYAEM S +AR G E + Sbjct: 83 LLKPDLLAALAELQRQGRWRLALVVFAAARRETWYTNPDFALYAEMASAMARGGAAAEEI 142 Query: 482 DALVADLLKEE----GWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFL 649 DALVA+LL+E+ G+ S +V + RLV+ L+AA RG+ ++D+Y +K+GG DE+L Sbjct: 143 DALVAELLEEKEGSGGFSPSDDVWKLTRLVRVLIAAGRGEAVRDLYKRMKRGGCVGDEYL 202 Query: 650 FKYMIRGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769 F+ +IRGLRRLGE +A EVERDF+ W + G + E LPV Sbjct: 203 FRVLIRGLRRLGEGEAAGEVERDFDEWYE-GGIITETLPV 241 >ref|XP_020087607.1| pentatricopeptide repeat-containing protein At3g46870-like [Ananas comosus] Length = 241 Score = 186 bits (471), Expect = 1e-50 Identities = 108/220 (49%), Positives = 144/220 (65%), Gaps = 7/220 (3%) Frame = +2 Query: 131 APTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVN-RIVSSDFSR 307 +P+ S+ + CGPRDNRG LQRGRTLSTE G+ A + ++ R Sbjct: 23 SPSTRSLAITCGPRDNRGPLQRGRTLSTEAILAVQALKRAALSGDGAVPSPAAAAAALGR 82 
Query: 308 LIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRT-DLALYAEMVSTLAR-CGFPGEV 481 L+K DLLAA+AELQRQ +WRLA+ FAAAR E WY D +LYAEM S +AR E+ Sbjct: 83 LLKPDLLAALAELQRQGRWRLALVVFAAARRETWYTNPDFSLYAEMASAMARGAAAAEEI 142 Query: 482 DALVADLLKEE----GWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFL 649 DALVA+LL+E+ G+ S +V + RLV+ L+AA RG+ ++D+Y +K+GG DE+L Sbjct: 143 DALVAELLEEKEGSGGFSPSDDVWKLTRLVRVLIAAGRGEAVRDLYKRMKRGGCVGDEYL 202 Query: 650 FKYMIRGLRRLGEDSSADEVERDFEAWDDCGSLPGEPLPV 769 F+ +IRGLRRLGE +A EVERDF+ W + G + E LPV Sbjct: 203 FRVLIRGLRRLGEGEAAGEVERDFDEWYE-GGIITETLPV 241 >ref|XP_008783728.1| PREDICTED: uncharacterized protein LOC103702875 [Phoenix dactylifera] Length = 238 Score = 175 bits (443), Expect = 9e-47 Identities = 98/200 (49%), Positives = 136/200 (68%), Gaps = 3/200 (1%) Frame = +2 Query: 155 VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLLAA 334 + CGPRD+R L RGRTLS E G++++V ++VS+ F RL+K DL+AA Sbjct: 34 ISCGPRDHRWPLLRGRTLSAEAILAVQALKRAR--GDDSKVEQVVSTAFIRLLKPDLVAA 91 Query: 335 IAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL--K 508 +AEL+RQ QWRLA K F AAR + + D +LYAEMV+T+AR G E+ V+DLL + Sbjct: 92 LAELRRQGQWRLAGKVFVAARKDFSSKPDYSLYAEMVATMARNGMREEIGLSVSDLLAAR 151 Query: 509 EEG-WVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRRLG 685 E+G S+ +++G+ R+ KAL+ A GK +K++Y ELK+ PDEFLFK +I+GLR +G Sbjct: 152 EKGDGFSADDLRGLARVFKALIGAGSGKAVKNMYRELKRRNCVPDEFLFKDLIKGLRGMG 211 Query: 686 EDSSADEVERDFEAWDDCGS 745 E +ADEVERDFE W + GS Sbjct: 212 EGEAADEVERDFEVWSNGGS 231 >gb|KMZ75150.1| Pentatricopeptide repeat-containing protein [Zostera marina] Length = 234 Score = 170 bits (431), Expect = 4e-45 Identities = 91/197 (46%), Positives = 127/197 (64%), Gaps = 3/197 (1%) Frame = +2 Query: 149 IPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLL 328 + V CGPR NR +L RGRTLSTE G++A+V+ IVS+ RLIKSDLL Sbjct: 33 LSVTCGPRGNRSQLVRGRTLSTEAIHAVQALKRAS--GDDAKVDSIVSTSVVRLIKSDLL 90 Query: 329 AAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL- 505 AA+ ELQRQ+QW++A+K F A R E W 
D LYA+M+ L R G E+D ++ DLL Sbjct: 91 AALKELQRQEQWKIALKVFVAMRKEHWRNVDYGLYADMIIALGRNGMTNEIDTMIGDLLE 150 Query: 506 --KEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRR 679 K+EG +S +++G+ RL+KA++ A +G K +Y +K G DE+LFK + +GL+R Sbjct: 151 EIKKEG-ISGDDLRGLARLLKAVIGAGKGNAAKMLYKAMKIGDCVGDEYLFKILSKGLKR 209 Query: 680 LGEDSSADEVERDFEAW 730 LGE +A EV+RD+ W Sbjct: 210 LGESEAAVEVDRDYAIW 226 >ref|XP_010922535.1| PREDICTED: uncharacterized protein LOC105045819 [Elaeis guineensis] Length = 238 Score = 169 bits (427), Expect = 2e-44 Identities = 98/207 (47%), Positives = 133/207 (64%), Gaps = 3/207 (1%) Frame = +2 Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313 PT SI CGPRD+R L RGRTLSTE ++++V ++VS+ F RL+ Sbjct: 29 PTLFSIS--CGPRDHRWPLLRGRTLSTEAILAVQALKRALD--DDSKVEQVVSTAFVRLL 84 Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493 K DL+AA+AEL+RQ QWRLA K F AAR E D +LYA MV+ +AR G E+ LV Sbjct: 85 KPDLVAALAELRRQGQWRLAGKVFVAARKEFSSNPDYSLYAAMVAAMARNGMREEIGLLV 144 Query: 494 ADLLKEE---GWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMI 664 +DLL E S+ +++G+ R+V+AL+ GK +K +Y E+K+ P+EFLFK ++ Sbjct: 145 SDLLAEREKGDGFSADDIRGLARVVQALIGVGSGKAVKYMYREMKRRNCVPNEFLFKDLM 204 Query: 665 RGLRRLGEDSSADEVERDFEAWDDCGS 745 RGLR LGE +AD+VERDFE W + GS Sbjct: 205 RGLRGLGEGEAADDVERDFEVWSNGGS 231 >gb|PIA64861.1| hypothetical protein AQUCO_00100375v1 [Aquilegia coerulea] Length = 232 Score = 154 bits (390), Expect = 2e-39 Identities = 87/199 (43%), Positives = 130/199 (65%), Gaps = 4/199 (2%) Frame = +2 Query: 137 TAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIK 316 T ++ + CGPRDNRG +Q+GR LS E +E ++ ++S +RL+K Sbjct: 24 TRRNVTIRCGPRDNRGPIQKGRILSIEAIQAIQALKRAK--SDETKLATLISKTLTRLVK 81 Query: 317 SDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVA 496 +DL+A++ EL RQ+Q LA+K F+ RSE WY+TD +LYA MVS LAR G +++ L+ Sbjct: 82 NDLIASLNELLRQNQCDLALKVFSTVRSELWYKTDWSLYANMVSGLARNGMSEDINRLML 141 Query: 497 
DLLKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYMI 664 D L+EEG V + + GI RL+KAL+AAE + + +Y +KKGG+ E DE++ K + Sbjct: 142 D-LEEEGLVKN-DSNGISRLLKALIAAEMNECVVRIYGIMKKGGWECKDEKDEYVVKVLN 199 Query: 665 RGLRRLGEDSSADEVERDF 721 RGLRRLGE+ ADEV++++ Sbjct: 200 RGLRRLGEEEVADEVQKEY 218 >ref|XP_010249644.1| PREDICTED: uncharacterized protein LOC104592133 [Nelumbo nucifera] ref|XP_010249645.1| PREDICTED: uncharacterized protein LOC104592133 [Nelumbo nucifera] Length = 237 Score = 154 bits (389), Expect = 4e-39 Identities = 85/197 (43%), Positives = 128/197 (64%), Gaps = 4/197 (2%) Frame = +2 Query: 146 SIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDL 325 ++ + CGPRDNRG + +GR LSTE +EA+V+ +VS SRLIK+DL Sbjct: 28 AVVIRCGPRDNRGPIVKGRVLSTEAIHAVQALKRAQR-ADEAKVDELVSRALSRLIKADL 86 Query: 326 LAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL 505 LA++ EL RQDQ LA+K F+A RSE WY+TD +LYA++V+ LAR G E+D L++++ Sbjct: 87 LASLGELLRQDQCHLALKVFSAVRSELWYKTDCSLYADVVAALARNGMSEEIDRLISEM- 145 Query: 506 KEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYMIRGL 673 EE + + +G+ RL+KAL+AAER + +Y +K+ G+ DE++ K + RG+ Sbjct: 146 -EEEGLGGSDGRGLSRLIKALVAAERVESTVGIYRLMKRAGWGSSPTADEYVVKVLSRGM 204 Query: 674 RRLGEDSSADEVERDFE 724 RRLG+ ADE++ F+ Sbjct: 205 RRLGKRDVADELDSAFD 221 >ref|XP_022033658.1| pentatricopeptide repeat-containing protein At3g46870 [Helianthus annuus] gb|OTG27087.1| hypothetical protein HannXRQ_Chr04g0096241 [Helianthus annuus] Length = 229 Score = 152 bits (383), Expect = 2e-38 Identities = 88/201 (43%), Positives = 121/201 (60%), Gaps = 4/201 (1%) Frame = +2 Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLI 313 P P+ CGPRDNRG L +GRTLS E ++ N VS SRL+ Sbjct: 21 PRCHHTPIRCGPRDNRGPLYKGRTLSIEAIQAVQSLKRSHR--SDPTNNDTVSRTLSRLV 78 Query: 314 KSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALV 493 KSDL+AA EL RQDQ+ LAVK F+A RSE WY+TDL LYA++VS +A G E+D L+ Sbjct: 79 KSDLIAAFNELIRQDQFDLAVKVFSAIRSEDWYKTDLNLYAKLVSAMASKGMTDEIDRLM 138 Query: 494 
ADLLKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYM 661 D+ E V S KG+ ++KAL+AA+R + +Y +K GG+ D+++ K + Sbjct: 139 CDV--EPADVVSAEGKGLVTVIKALLAADRAESTVRIYEMMKAGGWRCNSSADDYVGKVL 196 Query: 662 IRGLRRLGEDSSADEVERDFE 724 RGLRRLG++ ADE++ + E Sbjct: 197 SRGLRRLGKNKVADEIDLEIE 217 >ref|XP_003632859.1| PREDICTED: uncharacterized protein LOC100251441 [Vitis vinifera] emb|CBI36107.3| unnamed protein product, partial [Vitis vinifera] Length = 230 Score = 150 bits (380), Expect = 5e-38 Identities = 89/191 (46%), Positives = 122/191 (63%), Gaps = 1/191 (0%) Frame = +2 Query: 152 PVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLLA 331 P+ CGPRDNRG L +GR LS E G+ +++ +S SRL+K+DLLA Sbjct: 30 PIRCGPRDNRGPLMKGRVLSIEAIQAIQSLKRAHR-GDPTKIDDFLSKTLSRLVKADLLA 88 Query: 332 AIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLKE 511 + EL RQDQ LA++ F+A RSE WY+T+L+LYA++VS LAR G E+D L+ D L+ Sbjct: 89 TLNELLRQDQCDLALRVFSAVRSELWYKTELSLYADLVSALARKGMKEEIDRLICD-LEG 147 Query: 512 EGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF-EPDEFLFKYMIRGLRRLGE 688 EG V + KGI RLVKA++AAER +Y +K+ G DE++ + + RGLRRLGE Sbjct: 148 EGSV-RCDDKGIVRLVKAVIAAERRDSTVRIYGLMKRSGCGGGDEYVGRVLSRGLRRLGE 206 Query: 689 DSSADEVERDF 721 ADEV+ +F Sbjct: 207 LGVADEVDLEF 217 >ref|XP_024178616.1| protein THYLAKOID ASSEMBLY 8, chloroplastic [Rosa chinensis] gb|PRQ50986.1| hypothetical protein RchiOBHm_Chr2g0139281 [Rosa chinensis] Length = 237 Score = 149 bits (376), Expect = 2e-37 Identities = 90/204 (44%), Positives = 123/204 (60%), Gaps = 8/204 (3%) Frame = +2 Query: 134 PTAVSIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNE---ARVNRIVSSDFS 304 PT V +PV CGPRD RG L +GR LS E + + + +VS S Sbjct: 28 PTVVRVPVRCGPRDKRGPLVKGRVLSIEAIQAVQALKRAQRSDPDPDPSHLPALVSKTLS 87 Query: 305 RLIKSDLLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVD 484 RLIKSDL+AA+ EL RQDQ LA++AF+A RSE Y+ DL++YAE+ LAR G E+D Sbjct: 88 RLIKSDLVAALKELLRQDQCHLALQAFSAFRSE--YQPDLSVYAEVALALARNGMVEEID 145 Query: 485 ALVADLLKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGG-----FEPDEFL 649 
LV +L KE G V + KG+ RL+KA++ A+R + +Y LK+ G F+ DE++ Sbjct: 146 TLVCELEKESGGVQWDSDKGLIRLIKAVIGADRRESTVRIYEVLKRKGWGSSSFKADEYM 205 Query: 650 FKYMIRGLRRLGEDSSADEVERDF 721 + + +GLRRLGE ADEV+ F Sbjct: 206 VRVLSKGLRRLGEAELADEVDVKF 229 >gb|KVH97988.1| hypothetical protein Ccrd_023796 [Cynara cardunculus var. scolymus] Length = 235 Score = 148 bits (374), Expect = 4e-37 Identities = 84/194 (43%), Positives = 122/194 (62%), Gaps = 4/194 (2%) Frame = +2 Query: 149 IPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDLL 328 + + CGPRDNRG L +GRTLS E ++ N VS SRL+KSD++ Sbjct: 32 LTIRCGPRDNRGPLHKGRTLSIEAIQAVQSLKRSHR--SDPANNDAVSKTLSRLVKSDVV 89 Query: 329 AAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLK 508 AA EL RQDQ+ LA+K F+A RSE WY+T+L+LYA++VS++A G ++D L+ D+ Sbjct: 90 AAFNELIRQDQFDLALKVFSAIRSEDWYKTELSLYAKLVSSMASKGMADDIDRLILDVEP 149 Query: 509 EEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF----EPDEFLFKYMIRGLR 676 E V S + KG+ L+KAL+AA+R + +Y +K GG+ D++L K + RGLR Sbjct: 150 EA--VISADSKGLITLIKALIAADRAESTVVIYEMMKAGGWGCNSVTDDYLGKVLSRGLR 207 Query: 677 RLGEDSSADEVERD 718 RLG+ ADE++R+ Sbjct: 208 RLGKKKVADEIDRE 221 >ref|XP_006850605.1| pentatricopeptide repeat-containing protein At1g62350 [Amborella trichopoda] gb|ERN12186.1| hypothetical protein AMTR_s00034p00130400 [Amborella trichopoda] Length = 250 Score = 148 bits (373), Expect = 9e-37 Identities = 87/204 (42%), Positives = 119/204 (58%), Gaps = 1/204 (0%) Frame = +2 Query: 146 SIPVVCGPRD-NRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSD 322 S + CGPRD NRG L RGR LSTE R +++ SRL+K+D Sbjct: 51 STGIWCGPRDQNRGPLARGRLLSTEAMLAIQSLK---------RSPNLLAQTTSRLLKAD 101 Query: 323 LLAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADL 502 LLA + ELQRQDQ LA++ F R E WY+TD LYAEMV+ L+R G E+D+L+AD Sbjct: 102 LLAVLKELQRQDQCHLALQVFGVVRKEVWYKTDFGLYAEMVTALSRNGMTEEIDSLIADA 161 Query: 503 LKEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGFEPDEFLFKYMIRGLRRL 682 L+++ + +GI RLV+AL+ A + +Y K GF PD+FLF+ +IRGL+RL Sbjct: 162 
LQDK---FETDNRGIARLVRALIGAGNAEGAVSIYEMTKGSGFLPDDFLFRVLIRGLKRL 218 Query: 683 GEDSSADEVERDFEAWDDCGSLPG 754 G+ + A +V DF + L G Sbjct: 219 GKQAHAAKVMDDFREFSKKSVLVG 242 >gb|PAN32358.1| hypothetical protein PAHAL_E04053 [Panicum hallii] Length = 256 Score = 147 bits (371), Expect = 2e-36 Identities = 91/206 (44%), Positives = 120/206 (58%), Gaps = 7/206 (3%) Frame = +2 Query: 155 VVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNE-ARVNRIVSSDFSRLIKSDLLA 331 + CGPRDNRG LQRGR+LSTE A + SS RL+K+DL+A Sbjct: 31 ITCGPRDNRGPLQRGRSLSTEAILAIQSLKRLTAADRSPAAASAAASSALGRLLKADLVA 90 Query: 332 AIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLLKE 511 AIAELQRQ W LA+ A AR+EPWYR D ALYA VS+ A VDALV L+E Sbjct: 91 AIAELQRQGHWSLALAALHVARAEPWYRPDPALYATFVSS-APAASDDAVDALVEAFLEE 149 Query: 512 E----GWV-SSVNVKGIFRLVKALMAAERGKLLKDVY-SELKKGGFEPDEFLFKYMIRGL 673 + G+V +V + RL++AL+A RG+ VY + +++GG + DE++++ M RG+ Sbjct: 150 KARGGGFVDGEEDVYKLTRLLRALVAKGRGRAAWKVYEAAVRRGGLDVDEYVYRVMARGM 209 Query: 674 RRLGEDSSADEVERDFEAWDDCGSLP 751 RRLG D A E E D W+ S P Sbjct: 210 RRLGLDEEASEAEADLAEWEATISPP 235 >gb|OVA20311.1| hypothetical protein BVC80_157g118 [Macleaya cordata] Length = 238 Score = 146 bits (369), Expect = 2e-36 Identities = 82/202 (40%), Positives = 124/202 (61%), Gaps = 5/202 (2%) Frame = +2 Query: 146 SIPVVCGPRDNRGKLQRGRTLSTEXXXXXXXXXXXXXXGNEARVNRIVSSDFSRLIKSDL 325 ++ + CGPR+NRG L +GR LSTE G+E ++N+++S +RLIK+DL Sbjct: 33 NVTIRCGPRNNRGPLVKGRILSTEAMQAVQALKRAK--GDETKINQLISKTLTRLIKNDL 90 Query: 326 LAAIAELQRQDQWRLAVKAFAAARSEPWYRTDLALYAEMVSTLARCGFPGEVDALVADLL 505 LA++ EL RQ LA+K F A RS+ W + D +LYA++V L +CG ++D L+ DL Sbjct: 91 LASLNELLRQGHCELALKVFCAVRSDIWSKIDCSLYADLVLALTKCGMLEDIDRLICDL- 149 Query: 506 KEEGWVSSVNVKGIFRLVKALMAAERGKLLKDVYSELKKGGF-----EPDEFLFKYMIRG 670 E + +G+ RL+KAL+AAER + + VY ++ GG+ DE++ K + RG Sbjct: 150 -EGEVLVGGGDRGLSRLIKALIAAERTESVVRVYGLMRDGGWGSVGSHVDEYVVKILSRG 208 Query: 671 LRRLGEDSSADEVERDFEAWDD 736 LRRLGE ADEV+R F ++ + Sbjct: 209 LRRLGESGVADEVDRKFGSFSN 230