BLASTX nr result
ID: Rehmannia23_contig00020933
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia23_contig00020933 (1227 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006343601.1| PREDICTED: pentatricopeptide repeat-containi... 334 5e-89 ref|XP_004242995.1| PREDICTED: pentatricopeptide repeat-containi... 332 2e-88 ref|XP_003634022.1| PREDICTED: pentatricopeptide repeat-containi... 277 7e-72 gb|EPS64936.1| hypothetical protein M569_09839, partial [Genlise... 248 5e-63 gb|EOY31499.1| Tetratricopeptide repeat (TPR)-like superfamily p... 246 1e-62 ref|XP_004287149.1| PREDICTED: pentatricopeptide repeat-containi... 233 1e-58 gb|EMJ04907.1| hypothetical protein PRUPE_ppa019391mg, partial [... 215 3e-53 ref|XP_002528404.1| pentatricopeptide repeat-containing protein,... 214 6e-53 gb|EXB42922.1| Pentatricopeptide repeat-containing protein [Moru... 212 3e-52 ref|XP_006474045.1| PREDICTED: pentatricopeptide repeat-containi... 211 5e-52 ref|XP_004146719.1| PREDICTED: pentatricopeptide repeat-containi... 211 6e-52 ref|XP_006453565.1| hypothetical protein CICLE_v10007430mg [Citr... 207 7e-51 ref|XP_006285536.1| hypothetical protein CARUB_v10006977mg [Caps... 206 2e-50 ref|XP_002869359.1| pentatricopeptide repeat-containing protein ... 192 2e-46 gb|ESW24614.1| hypothetical protein PHAVU_004G145400g [Phaseolus... 191 4e-46 ref|NP_567856.1| pentatricopeptide repeat-containing protein [Ar... 188 3e-45 emb|CAA18211.1| puative protein [Arabidopsis thaliana] gi|726998... 188 3e-45 ref|XP_006412665.1| hypothetical protein EUTSA_v10024344mg [Eutr... 186 1e-44 ref|XP_004975413.1| PREDICTED: pentatricopeptide repeat-containi... 185 3e-44 ref|XP_006857035.1| hypothetical protein AMTR_s00065p00020910 [A... 181 4e-43 >ref|XP_006343601.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like isoform X1 [Solanum tuberosum] gi|565353364|ref|XP_006343602.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like isoform X2 [Solanum tuberosum] Length = 937 Score = 334 bits (856), Expect = 5e-89 Identities = 170/326 (52%), Positives = 222/326 (68%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGALIVKPFCKLKHIRVSR 800 MASLKL + D+ S+ES KL +L F + + GA +V PFC LKHIRVSR Sbjct: 1 MASLKLPLYVDS-SWESKKLNCTVKALNFTDSKCWVPSFLGGGAFVVSPFCNLKHIRVSR 59 Query: 799 LDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTVR 620 L+ E L+T E +LD +D E + G+D+ + E + DS K K N+WK+FR K V Sbjct: 60 LETEELETSELSLDNEGVDGFEGEL-GNDSFVTERPNLGRDSQKGKFNVWKRFRRVKKVP 118 Query: 619 KNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERCNFI 440 +++ F K ENPM+ NS ++D + VD ++G + S ++CN I Sbjct: 119 RDSNHRSSFRLKDRKNGMEENPMIAFDVNSDESVIDSQNGVDFPDENIGSDSSLDQCNAI 178 Query: 439 LEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDSD 260 L++LE+ ND +AL+FF WM+ NGKLK NVTAYN ILRVLGR+GDWDGAE MI EM +S Sbjct: 179 LKELERGNDGKALSFFRWMRKNGKLKQNVTAYNLILRVLGRRGDWDGAEGMIKEMSMESG 238 Query: 259 CELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEAE 80 C+L Y++FNTLIYAC+K GLV+LG++WF MML+ VQPN+ATFGMLM+LYQKG VEEAE Sbjct: 239 CKLTYQVFNTLIYACHKKGLVELGAKWFHMMLENGVQPNIATFGMLMALYQKGWHVEEAE 298 Query: 79 YTFSQMRNLKITCQSAYSALITIYTR 2 + FS MRNLKI CQSAYS+++TIYTR Sbjct: 299 FAFSMMRNLKIMCQSAYSSMLTIYTR 324 >ref|XP_004242995.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Solanum lycopersicum] Length = 1201 Score = 332 bits (850), Expect = 2e-88 Identities = 168/330 (50%), Positives = 223/330 (67%) Frame = -3 Query: 991 PVCFMASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGALIVKPFCKLKHI 812 P+C MASLKL + D+ S+ES KL L F + GA +V PFC LKHI Sbjct: 261 PLCLMASLKLPLYVDS-SWESKKLNCTVKPLIFTDSKCCVPSFLGGGAFVVSPFCNLKHI 319 Query: 811 RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGA 632 RVSRL+ E L+T E ++D +D E + G+++ + E + DS K K N+W++FR Sbjct: 320 RVSRLETEELETSELSIDNEGVDGFEGEL-GNESFVTERPNLGRDSKKGKFNVWRRFRRV 378 Query: 631 KTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSER 452 K V K++ F KY ENP + NS ++D + VD ++G + S ++ Sbjct: 379 KKVPKDSNYRSSFRLKDRKYGTEENPRIVFDVNSDENVIDSQNGVDFHDENIGSDSSLDQ 438 Query: 451 CNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMV 272 CN IL++LE+ +D +AL+FF WM+ NGKLK NVTAYN ILRVLGR+GDWDGAE MI EM Sbjct: 439 CNAILKELERGDDGKALSFFRWMRKNGKLKQNVTAYNLILRVLGRRGDWDGAEGMIKEMS 498 Query: 271 SDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVV 92 +S C+L Y++FNTLIYAC+K GLV+LG++WF MML+ VQPN+ATFG+LM+LYQKG V Sbjct: 499 MESGCKLTYQVFNTLIYACHKKGLVELGAKWFHMMLENGVQPNIATFGLLMALYQKGWHV 558 Query: 91 EEAEYTFSQMRNLKITCQSAYSALITIYTR 2 EEAE+ FS MRNLKI CQSAYS+++TIYTR Sbjct: 559 EEAEFAFSMMRNLKIMCQSAYSSMLTIYTR 588 >ref|XP_003634022.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Vitis vinifera] gi|297745081|emb|CBI38673.3| unnamed protein product [Vitis vinifera] Length = 900 Score = 277 bits (708), Expect = 7e-72 Identities = 155/329 (47%), Positives = 214/329 (65%), Gaps = 3/329 (0%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGAL-IVKPFCKLKHIRVS 803 MASLK SVS D +Y+SNK + S + +L I+ F ++K I +S Sbjct: 1 MASLKFSVSVD--TYDSNKF-----------------HFSVNPSLPIINSFARVKPINIS 41 Query: 802 RLDNESLDTCESNLDGFSIDNLEKYV--AGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAK 629 RL+ ES DT +SN +DN++ + +G +NLI+E +F D IW++ +G K Sbjct: 42 RLEAESWDTSDSNS---VVDNIKTWNKDSGSENLILESSNFRND-------IWRRVQGVK 91 Query: 628 TVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERC 449 VR+ +K++ N ++ S N D +D++ +GPELS ERC Sbjct: 92 RVRRRD--------PNSKFRSIRNDNGHEEQKSVNHFDDE---IDVNEYGIGPELSVERC 140 Query: 448 NFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVS 269 N IL+ LE+ +DS+ + FFEWM+ NGKL+ NV+AYN LRVLGR+GDWD AE MI EM Sbjct: 141 NAILKGLERCSDSKTMKFFEWMRENGKLEGNVSAYNLALRVLGRRGDWDAAETMIWEMNG 200 Query: 268 DSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVE 89 DSDC++N++++NTLIYACYK G V+LG++WF++ML+ V+PNVATFGM+MSLYQKG V Sbjct: 201 DSDCQVNFQVYNTLIYACYKQGHVELGTKWFRLMLENGVRPNVATFGMVMSLYQKGWNVA 260 Query: 88 EAEYTFSQMRNLKITCQSAYSALITIYTR 2 ++EY FSQMR+ ITCQSAYSA+ITIYTR Sbjct: 261 DSEYAFSQMRSFGITCQSAYSAMITIYTR 289 >gb|EPS64936.1| hypothetical protein M569_09839, partial [Genlisea aurea] Length = 865 Score = 248 bits (632), Expect = 5e-63 Identities = 139/271 (51%), Positives = 183/271 (67%) Frame = -3 Query: 814 IRVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRG 635 I VS L+N+ D+ ES + +D+ +K + + +G+D + K+ R Sbjct: 1 ITVSNLENDVPDSSESKSN---LDSRKK----NRDFTAQGKD-----------VSKQCRI 42 Query: 634 AKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSE 455 AK R++ ++LD H K +K P Q+ S+ L + + LD DV PE + E Sbjct: 43 AKMWREHKKQSLDPHLQSKKSRK-VRPTSLQQRASSGSALGSETDLCLDSWDVRPEETIE 101 Query: 454 RCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEM 275 RCN ILE+LEKS+DS+A++FF+WM++N KLK NV A+N ILRVL RK DWDGAE ++ EM Sbjct: 102 RCNMILERLEKSDDSKAISFFKWMRLNQKLKKNVIAHNVILRVLTRKDDWDGAEGLVKEM 161 Query: 274 VSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLV 95 VSDS C LNY+IFNT+IYACYK GL D+ +RWFKMML+Y+V PNVAT+GMLMSLYQK Sbjct: 162 VSDSGCLLNYQIFNTVIYACYKKGLSDVATRWFKMMLNYQVDPNVATYGMLMSLYQKNWA 221 Query: 94 VEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 VEEAE T + MR LKITC SAYS++ITIY R Sbjct: 222 VEEAESTLTHMRKLKITCNSAYSSMITIYIR 252 >gb|EOY31499.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] Length = 916 Score = 246 bits (629), Expect = 1e-62 Identities = 146/331 (44%), Positives = 197/331 (59%), Gaps = 5/331 (1%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVS----THGALIVKPFCKLKHI 812 MASLKL +S D + +S KL N + + S T A + +LKH Sbjct: 1 MASLKLPISLD--TVDSKKLNFYVNPSHVPDHCSIFSFTSCIHVTKAASNLTSLTRLKHF 58 Query: 811 RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDN-LIIEGQDFHGDSGKRKVNIWKKFRG 635 +VSR + E + E + I K ++N +EGQ G + K Sbjct: 59 KVSRFETEFPNIPEPSPVDKDIHFSSKIDLVNENPKFVEGQK--GQNPK----------- 105 Query: 634 AKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSE 455 K +RKN F RN N+ ++ + + +S +D+D++ + P L+ Sbjct: 106 -KGIRKNVGFKFRFRRNRNEIEREDL------------FVHNNSGLDVDYSAIKPNLNLP 152 Query: 454 RCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEM 275 CNFIL++LE+SNDS AL FFEWM+ NGKLK NVTAY +LRVLGR+ DWD AE+M+ + Sbjct: 153 HCNFILKRLERSNDSNALRFFEWMRSNGKLKGNVTAYRLVLRVLGRREDWDAAEMMLRQA 212 Query: 274 VSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLV 95 DS C+LN+++FNT+IYAC K GLV+LG++WF+MML++ +PNVATFGMLM LYQKG Sbjct: 213 NGDSGCKLNFQVFNTIIYACSKKGLVELGAKWFRMMLEHGFRPNVATFGMLMGLYQKGWN 272 Query: 94 VEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 EAE+TFSQMRN I CQSAYSA+ITIYTR Sbjct: 273 ASEAEFTFSQMRNSGIVCQSAYSAMITIYTR 303 >ref|XP_004287149.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 885 Score = 233 bits (594), Expect = 1e-58 Identities = 137/300 (45%), Positives = 186/300 (62%), Gaps = 4/300 (1%) Frame = -3 Query: 889 SNTLFSGYVSTHGALIVKPFCKLKHIRVSRLDNESLDTCES----NLDGFSIDNLEKYVA 722 S+ F+ + + +L+V ++ I+V+R +E L+ ES N D S + K ++ Sbjct: 15 SSKKFNSFCYSRASLVVNSLNRVNAIKVNRFQSE-LNVAESLNEQNPD-CSRHEIGKGIS 72 Query: 721 GDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPL 542 G L KR+V + R +K VRK EN V Sbjct: 73 GTKRL-----------SKREVGLRSSSRKSKWVRKL-----------------ENVFVN- 103 Query: 541 QKNSANPILDGHSVVDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLK 362 DG D+D++ + ++S E CN IL++LE+S+D + L FFEWM++NGKLK Sbjct: 104 ---------DGE--FDVDYSVIKSDMSLEHCNDILKRLERSSDFKTLKFFEWMRINGKLK 152 Query: 361 NNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSR 182 NV+A+NS+ RVLGR+ +WD AE +I EMV++ CELNY++FNTLIYAC K G V+LG++ Sbjct: 153 GNVSAFNSVFRVLGRRENWDAAENLIQEMVTEFGCELNYQVFNTLIYACSKLGRVELGAK 212 Query: 181 WFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 WF MML+Y VQPNVATFGMLM+LYQKG VEEAE+TFS+MRN I CQSAYSA+ITIYTR Sbjct: 213 WFAMMLEYGVQPNVATFGMLMALYQKGWNVEEAEFTFSRMRNFGIVCQSAYSAMITIYTR 272 >gb|EMJ04907.1| hypothetical protein PRUPE_ppa019391mg, partial [Prunus persica] Length = 766 Score = 215 bits (548), Expect = 3e-53 Identities = 97/152 (63%), Positives = 127/152 (83%) Frame = -3 Query: 457 ERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIME 278 E CN IL++LE+ +D + L FFEWM+ NGKL+ NV+A+N +LRV+GR+ DWDGAE ++ E Sbjct: 2 EHCNDILKRLERCSDVKTLRFFEWMRSNGKLERNVSAFNLVLRVMGRREDWDGAEKLVQE 61 Query: 277 MVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGL 98 +++D CELNY++FNTLIYAC K G ++LG +WF+MML+++VQPN+ATFGMLM LYQKG Sbjct: 62 VIADLGCELNYQVFNTLIYACCKLGRLELGGKWFRMMLEHEVQPNIATFGMLMVLYQKGW 121 Query: 97 VVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 VEEAE+TF QMRN I CQSAYS++ITIYTR Sbjct: 122 NVEEAEFTFFQMRNFGILCQSAYSSMITIYTR 153 >ref|XP_002528404.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223532192|gb|EEF33997.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 955 Score = 214 bits (545), Expect = 6e-53 Identities = 136/332 (40%), Positives = 199/332 (59%), Gaps = 6/332 (1%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTL---FSGYVSTHGALIVKPFCKLKHIR 809 MASL+L++S D +++S K N L+ ++T S + GA I+ ++ Sbjct: 37 MASLRLTISLD--TFDSKKPNFSRNPLQLSTHTSPFSISSSTPSPGACIITTLTTFSPVK 94 Query: 808 VSRLDNESLDTCESNLDGFSIDNLEKYVAGD--DNLIIEGQDFHGDSGKRKVNIWKKFRG 635 VSR++ E + D++ + D I EG + KR++ KK+RG Sbjct: 95 VSRIETELFE-----------DDVVLSTSNDLPHECINEGLIDRNPNSKREIR--KKYRG 141 Query: 634 AKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSE 455 +K R + F N YK++ +++ + ++G + D++++ + LS E Sbjct: 142 G--AKKRGKRKVGFKFN---YKRNG-----IEQEIEDLFVEGGEL-DVNYSVIHCNLSLE 190 Query: 454 RCNFILEQLEK-SNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIME 278 CN IL++LE+ S+D ++L FFEWM+ NGKL+ N+ AYN ILRVLGR+ DW AE MI E Sbjct: 191 HCNLILKRLERCSSDDKSLRFFEWMRNNGKLEKNLNAYNVILRVLGRREDWGTAERMIGE 250 Query: 277 MVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGL 98 + EL++R+FNTLIYAC + G + LG +WF+MML+ VQPN+ATFGMLM LYQKG Sbjct: 251 VSDSFGSELDFRVFNTLIYACSRRGNMLLGGKWFRMMLELGVQPNIATFGMLMGLYQKGW 310 Query: 97 VVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 VEEAE+ FS+MR+ I CQSAYSA+ITIYTR Sbjct: 311 NVEEAEFVFSKMRSFGIICQSAYSAMITIYTR 342 >gb|EXB42922.1| Pentatricopeptide repeat-containing protein [Morus notabilis] Length = 889 Score = 212 bits (539), Expect = 3e-52 Identities = 109/215 (50%), Positives = 151/215 (70%) Frame = -3 Query: 646 KFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPE 467 KFRG+K K + + + G K + E + L N DG +D++++ + + Sbjct: 74 KFRGSKKEAKRFLGS----KVGMKKNRWERELENLFVN------DGE--IDVNYSVIRSD 121 Query: 466 LSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVM 287 LS E+CN +L++LE +DS+ L FFEWM+ +GKL+ N++AYN + RVL RK DW AE M Sbjct: 122 LSLEQCNSVLKRLESCSDSKTLRFFEWMRSHGKLEGNISAYNLVFRVLSRKEDWGTAEKM 181 Query: 286 IMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQ 107 I E+ ++ CE+ Y++FNTLIYAC K G V+LG++WF+MML++ V+PNVATFGMLM LYQ Sbjct: 182 IWELKNELGCEMGYQVFNTLIYACSKLGRVELGAKWFRMMLEHGVRPNVATFGMLMGLYQ 241 Query: 106 KGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 K VEEAE+TF++MR+L CQSAYSALITIYTR Sbjct: 242 KSWNVEEAEFTFTRMRDLGTVCQSAYSALITIYTR 276 >ref|XP_006474045.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Citrus sinensis] Length = 915 Score = 211 bits (537), Expect = 5e-52 Identities = 138/332 (41%), Positives = 199/332 (59%), Gaps = 6/332 (1%) Frame = -3 Query: 979 MASLKL-SVSPDNNSYESNKLISGFNSLKFVSN-TLFSGYVSTHGALIVKPFCKLKHIRV 806 MASLKL S+S D + +S KL N + + +FS +S IV ++KH++ Sbjct: 1 MASLKLLSISLD--TVDSRKLNFAANPPQLSDHFPIFSFTMSC----IVTASNRVKHVK- 53 Query: 805 SRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKT 626 + + D C N + +E V + + G+ + RKV +G Sbjct: 54 -NVSSSETDLCSMNESKETDIGIENDVGSE---VFVGEC---SNVSRKVK-----KGRYG 101 Query: 625 VRKNTIRNLD----FHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSS 458 V+K + R++D F R+ + ++ + AN DG +D++++ +G +LS Sbjct: 102 VKKGSKRDVDMSLRFRRSAREQER--------EYFFAN---DGE--LDVNYSVIGADLSL 148 Query: 457 ERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIME 278 + CN IL++LEK +DS++L FFEWM+ NGKL+ NVTAYN +LRV R+ DWD AE MI E Sbjct: 149 DECNAILKRLEKYSDSKSLKFFEWMRTNGKLEKNVTAYNLVLRVFSRREDWDAAEKMIRE 208 Query: 277 MVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGL 98 + +LN+++FNTLIYAC K G V+LG++WF MML+ VQPNVATFGMLM LY+K Sbjct: 209 VRMSLGAKLNFQLFNTLIYACNKRGCVELGAKWFHMMLECDVQPNVATFGMLMGLYKKSW 268 Query: 97 VVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 VEEAE+ F+QMR L + C+SAYSA+ITIYTR Sbjct: 269 NVEEAEFAFNQMRKLGLVCESAYSAMITIYTR 300 >ref|XP_004146719.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like [Cucumis sativus] Length = 894 Score = 211 bits (536), Expect = 6e-52 Identities = 129/330 (39%), Positives = 193/330 (58%), Gaps = 4/330 (1%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNS---LKFVSNTLFSGYVSTHGALIVKPFCKL-KHI 812 MASLKLS S +S++SNK NS + S + ++ + + I+ ++ K Sbjct: 1 MASLKLSFSL--HSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPS 58 Query: 811 RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGA 632 +VS+++ ++ D +S D + VA RK K F Sbjct: 59 KVSQVEQDASDVSQSRFD--------EIVA------------------RK----KYFTSK 88 Query: 631 KTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSER 452 K ++ + F RN N + IL +D++++ + +LS E Sbjct: 89 KPSKRAAGSHFSFSRNCN-----------------DNILFNGGELDVNYSTISSDLSLED 131 Query: 451 CNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMV 272 CN IL++LEK NDS+ L FFEWM+ NGKLK+NV+AYN +LRVLGR+ DWD AE +I E+ Sbjct: 132 CNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDWDAAEKLIEEVR 191 Query: 271 SDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVV 92 ++ +L++++FNTLIYACYKS V+ G++WF+MML+ +VQPNVATFGMLM LYQK + Sbjct: 192 AELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGMLMGLYQKKCDI 251 Query: 91 EEAEYTFSQMRNLKITCQSAYSALITIYTR 2 +E+E+ F+QMRN I C++AY+++ITIY R Sbjct: 252 KESEFAFNQMRNFGIVCETAYASMITIYIR 281 >ref|XP_006453565.1| hypothetical protein CICLE_v10007430mg [Citrus clementina] gi|557556791|gb|ESR66805.1| hypothetical protein CICLE_v10007430mg [Citrus clementina] Length = 851 Score = 207 bits (527), Expect = 7e-51 Identities = 97/166 (58%), Positives = 129/166 (77%) Frame = -3 Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320 +D++++ +G +LS + CN IL++LEK +DS++L FFEWM+ NGKL+ NV AYN +LRV Sbjct: 71 LDVNYSVIGADLSLDECNAILKRLEKYSDSKSLKFFEWMRTNGKLEKNVIAYNLVLRVFS 130 Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140 R+ DWD AE MI E+ +LN+++FNTLIYAC K G V+LG++WF MML+ VQPNV Sbjct: 131 RREDWDAAEKMIREVRMSLGTKLNFQLFNTLIYACNKRGCVELGAKWFHMMLECDVQPNV 190 Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 ATFGMLM LY+K VEEAE+ F+QMR L + C+SAYSA+ITIYTR Sbjct: 191 ATFGMLMGLYKKSWSVEEAEFAFNQMRKLGLVCESAYSAMITIYTR 236 >ref|XP_006285536.1| hypothetical protein CARUB_v10006977mg [Capsella rubella] gi|482554241|gb|EOA18434.1| hypothetical protein CARUB_v10006977mg [Capsella rubella] Length = 907 Score = 206 bits (523), Expect = 2e-50 Identities = 125/327 (38%), Positives = 181/327 (55%), Gaps = 1/327 (0%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSN-TLFSGYVSTHGALIVKPFCKLKHIRVS 803 M SL+ S+ D ++S + N +F +FS S A + + + IRVS Sbjct: 1 MGSLRFSIPLD--PFDSKRFHFSANPFQFPDQFPIFSVTSSYVPATRIGSLVRAEKIRVS 58 Query: 802 RLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTV 623 RLD E+ +T E+ +D S +E+ S K K + Sbjct: 59 RLDVEAEET-ENAIDSASAAKVER----------------SSSSKLKSGKTVSSGNKRGT 101 Query: 622 RKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERCNF 443 +K+ ++ F R + E +L + +D++++ + P LS E CN Sbjct: 102 KKDVVKKFSFRRESINLELEE-------------LLVNNGEMDVNYSAIKPTLSLEHCNG 148 Query: 442 ILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDS 263 IL++LE +DS A+ FF+WM NGKL+ N +AY+ ILRVLGR+ DWD AE +I E+ Sbjct: 149 ILKRLESCSDSNAVKFFDWMSCNGKLQGNFSAYSLILRVLGRRQDWDRAEDLIKELCGFQ 208 Query: 262 DCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEA 83 + ++++FNT+IYAC K G V LGS+WF++ML+ V+PNVAT GMLM LYQK V+EA Sbjct: 209 GFQQSFQVFNTVIYACAKKGNVKLGSKWFQLMLELGVRPNVATIGMLMGLYQKNWNVDEA 268 Query: 82 EYTFSQMRNLKITCQSAYSALITIYTR 2 E+ FSQMR I C+SAYSA+ITIYTR Sbjct: 269 EFAFSQMRKFGIVCESAYSAMITIYTR 295 >ref|XP_002869359.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297315195|gb|EFH45618.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 906 Score = 192 bits (489), Expect = 2e-46 Identities = 120/326 (36%), Positives = 182/326 (55%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSNTLFSGYVSTHGALIVKPFCKLKHIRVSR 800 M SL+LS+ D ++S + N +F ++ A + ++K IRVSR Sbjct: 1 MGSLRLSIPLD--PFDSKRFHFSANPFQFPDQVPIFSVSTSVPATRIGSLIRVKKIRVSR 58 Query: 799 LDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGAKTVR 620 LD E+ + E+ +D S+ N+E+ N ++G + +R + Sbjct: 59 LDIEAKEA-ENAIDSDSV-NVER----SSNSKLKGSNTVTSGNQRGT------------K 100 Query: 619 KNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSERCNFI 440 K+ R F R N + EN V + +D++++ + P LS E N I Sbjct: 101 KDVARKFSFRRESNDLEL-ENLFV------------NNGEMDVNYSAIKPGLSLEHYNAI 147 Query: 439 LEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMVSDSD 260 L++LE +D+ A+ FF+WM+ GKL+ N AY+ ILRVLGR+ +W+ AE +I E+ Sbjct: 148 LKRLESCSDTNAIKFFDWMRCKGKLEGNFGAYSLILRVLGRREEWNRAEDLIEELCGFQG 207 Query: 259 CELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVVEEAE 80 + ++++FNT+IYAC K G V L S+WF+MML+ V+PNVAT GMLM LYQK V+EAE Sbjct: 208 FQQSFQVFNTVIYACTKKGNVKLASKWFQMMLELGVRPNVATIGMLMGLYQKNWNVDEAE 267 Query: 79 YTFSQMRNLKITCQSAYSALITIYTR 2 + FS MR +I C+SAYS++ITIYTR Sbjct: 268 FAFSHMRKFEIVCESAYSSMITIYTR 293 >gb|ESW24614.1| hypothetical protein PHAVU_004G145400g [Phaseolus vulgaris] Length = 852 Score = 191 bits (486), Expect = 4e-46 Identities = 96/167 (57%), Positives = 126/167 (75%), Gaps = 2/167 (1%) Frame = -3 Query: 496 DLDFNDVGPELSSERCNFILEQLEKS--NDSRALTFFEWMKVNGKLKNNVTAYNSILRVL 323 D++F+ ELS+ +CN IL++LE+S +D+ L+FFE M+ GKL+ N AYN ILRV+ Sbjct: 75 DVEFSS---ELSTAQCNAILKRLEESAEDDAETLSFFEKMREGGKLERNAGAYNVILRVV 131 Query: 322 GRKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPN 143 R+GDW+GAE +I EM + EL++ +FNTLIYAC K LV LG++WF+MMLDY V PN Sbjct: 132 SRRGDWEGAEKLISEMKASFGSELSFNVFNTLIYACCKRNLVKLGTKWFRMMLDYGVAPN 191 Query: 142 VATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 VAT GMLM LY+KG +EEAE+ FSQMR I C+SAYS++ITIYTR Sbjct: 192 VATVGMLMGLYRKGWNLEEAEFAFSQMRGFGIVCESAYSSMITIYTR 238 >ref|NP_567856.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635625|sp|O65567.2|PP342_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g30825, chloroplastic; Flags: Precursor gi|332660415|gb|AEE85815.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 904 Score = 188 bits (478), Expect = 3e-45 Identities = 89/166 (53%), Positives = 121/166 (72%) Frame = -3 Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320 +D++++ + P S E CN IL++LE +D+ A+ FF+WM+ NGKL N AY+ ILRVLG Sbjct: 126 IDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLILRVLG 185 Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140 R+ +WD AE +I E+ + + +Y++FNT+IYAC K G V L S+WF MML++ V+PNV Sbjct: 186 RREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFGVRPNV 245 Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 AT GMLM LYQK VEEAE+ FS MR I C+SAYS++ITIYTR Sbjct: 246 ATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTR 291 >emb|CAA18211.1| puative protein [Arabidopsis thaliana] gi|7269983|emb|CAB79800.1| puative protein [Arabidopsis thaliana] Length = 1075 Score = 188 bits (478), Expect = 3e-45 Identities = 89/166 (53%), Positives = 121/166 (72%) Frame = -3 Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320 +D++++ + P S E CN IL++LE +D+ A+ FF+WM+ NGKL N AY+ ILRVLG Sbjct: 297 IDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLILRVLG 356 Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140 R+ +WD AE +I E+ + + +Y++FNT+IYAC K G V L S+WF MML++ V+PNV Sbjct: 357 RREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFGVRPNV 416 Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 AT GMLM LYQK VEEAE+ FS MR I C+SAYS++ITIYTR Sbjct: 417 ATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTR 462 >ref|XP_006412665.1| hypothetical protein EUTSA_v10024344mg [Eutrema salsugineum] gi|557113835|gb|ESQ54118.1| hypothetical protein EUTSA_v10024344mg [Eutrema salsugineum] Length = 916 Score = 186 bits (473), Expect = 1e-44 Identities = 116/330 (35%), Positives = 179/330 (54%), Gaps = 4/330 (1%) Frame = -3 Query: 979 MASLKLSVSPDNNSYESNKLISGFNSLKFVSN----TLFSGYVSTHGALIVKPFCKLKHI 812 M SL+LS D ++S + N +F ++ S +T I P + Sbjct: 1 MVSLRLSTPLD--PFDSKRFHFSANPFQFTDQFPIFSVTSSISATRTFTIGSPI-SVNKT 57 Query: 811 RVSRLDNESLDTCESNLDGFSIDNLEKYVAGDDNLIIEGQDFHGDSGKRKVNIWKKFRGA 632 RV+RLD E+ + E+ +D S +D+ + E S K K Sbjct: 58 RVARLDTEA-NEAENAIDRSS----------EDDSVSEASVGRSWSSKLKGGNNVTSSNK 106 Query: 631 KTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDLDFNDVGPELSSER 452 + ++K+ R F R N+ + + + +D++++ + P+LS E Sbjct: 107 RGIKKDVTRKSSFRRESNELE-------------LEGLFVNNGEMDVNYSAMKPDLSLEH 153 Query: 451 CNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKGDWDGAEVMIMEMV 272 N IL++LE +D+ A+ FF+WM+ GKL+ N+ AY+ ILRVL R+ +WD AE +I E+ Sbjct: 154 YNGILKRLECCSDTNAVKFFDWMRCKGKLEGNIVAYSLILRVLARREEWDRAEDLIKELC 213 Query: 271 SDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATFGMLMSLYQKGLVV 92 + ++++FNT+IYAC K G V LGS+WF++ML+ V+PNVAT GMLM LYQK V Sbjct: 214 GFQGFQQSFQVFNTVIYACSKKGNVKLGSKWFQLMLELGVRPNVATIGMLMGLYQKNRNV 273 Query: 91 EEAEYTFSQMRNLKITCQSAYSALITIYTR 2 +EAE+ F+ MR I C+SAYSA+IT+YTR Sbjct: 274 DEAEFAFTHMRRFGIVCESAYSAMITLYTR 303 >ref|XP_004975413.1| PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic-like isoform X1 [Setaria italica] Length = 957 Score = 185 bits (470), Expect = 3e-44 Identities = 100/223 (44%), Positives = 138/223 (61%) Frame = -3 Query: 670 KRKVNIWKKFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSVVDL 491 K++ +W++ G K +R++ ++G +H V K+ N +L Sbjct: 132 KKEGKLWRRLGGGKKLRRHRAP-----KHGPGKDRHVRRSVV--KDDVNVVL-------- 176 Query: 490 DFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLGRKG 311 + + E S E CN L LEK +D +AL FF+WMK NGKLK N AY+ L+ + K Sbjct: 177 --SCISQESSIEECNSALIHLEKHSDEKALNFFDWMKANGKLKGNAYAYHLALQAIAWKE 234 Query: 310 DWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNVATF 131 +W AE+++ EMV+DSDC L+ R FN LIY C K L D G+RWF+MMLD +VQPNV+T Sbjct: 235 NWKMAELLLHEMVADSDCTLDARAFNGLIYVCAKRRLDDWGTRWFRMMLDSEVQPNVSTI 294 Query: 130 GMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 GMLM LYQK + EAE+TF++MRN I C +AYSA+IT+YTR Sbjct: 295 GMLMGLYQKTGNLSEAEFTFAKMRNYNIKCVNAYSAMITLYTR 337 >ref|XP_006857035.1| hypothetical protein AMTR_s00065p00020910 [Amborella trichopoda] gi|548861118|gb|ERN18502.1| hypothetical protein AMTR_s00065p00020910 [Amborella trichopoda] Length = 903 Score = 181 bits (460), Expect = 4e-43 Identities = 102/226 (45%), Positives = 138/226 (61%) Frame = -3 Query: 679 DSGKRKVNIWKKFRGAKTVRKNTIRNLDFHRNGNKYKKHENPMVPLQKNSANPILDGHSV 500 +SG++ +WK+ RG K R ++ + + K E L + + + S Sbjct: 71 NSGRK---LWKRLRGFK-------RPIESEVSARRLAKTEQ-CPSLDRKDGDSLSSTESE 119 Query: 499 VDLDFNDVGPELSSERCNFILEQLEKSNDSRALTFFEWMKVNGKLKNNVTAYNSILRVLG 320 ++ + + P S E CN L+ LEKSND++AL FEWMK NGKL N TAYN LRVL Sbjct: 120 LEAKLSTLEPLSSIENCNNYLKLLEKSNDAKALQLFEWMKSNGKLDRNPTAYNLALRVLS 179 Query: 319 RKGDWDGAEVMIMEMVSDSDCELNYRIFNTLIYACYKSGLVDLGSRWFKMMLDYKVQPNV 140 RK DW +E ++ EM + S+C + ++FNTLIY C K LV G++WF+MML V+PN Sbjct: 180 RKEDWKASEELLREMPTVSNCSPSSQMFNTLIYVCSKRELVGWGTKWFRMMLYCGVKPNQ 239 Query: 139 ATFGMLMSLYQKGLVVEEAEYTFSQMRNLKITCQSAYSALITIYTR 2 AT GMLMSLYQKG +EEAE+T QMR + C AYSA++TIYTR Sbjct: 240 ATIGMLMSLYQKGGNLEEAEFTLGQMRTHGLHCCVAYSAMMTIYTR 285