BLASTX nr result
ID: Rauwolfia21_contig00030622
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rauwolfia21_contig00030622 (1055 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containi... 286 7e-75 ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containi... 284 4e-74 gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlise... 254 5e-65 ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Popu... 248 4e-63 gb|EOX95524.1| Pentatricopeptide repeat-containing protein, puta... 244 4e-62 ref|XP_002334407.1| predicted protein [Populus trichocarpa] 243 1e-61 ref|XP_006386676.1| pentatricopeptide repeat-containing family p... 239 1e-60 ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutr... 239 2e-60 ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Caps... 235 2e-59 ref|XP_006444679.1| hypothetical protein CICLE_v10023806mg [Citr... 227 7e-57 gb|AAC62783.1| F11O4.7 [Arabidopsis thaliana] 226 9e-57 ref|NP_192066.2| pentatricopeptide repeat-containing protein [Ar... 226 9e-57 ref|XP_002874971.1| pentatricopeptide repeat-containing protein ... 226 9e-57 ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containi... 225 3e-56 ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containi... 224 4e-56 ref|XP_002515124.1| pentatricopeptide repeat-containing protein,... 219 1e-54 ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containi... 219 1e-54 ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containi... 210 9e-52 gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis] 204 4e-50 gb|EMJ21345.1| hypothetical protein PRUPE_ppa019625mg [Prunus pe... 197 6e-48 >ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like isoform X1 [Solanum tuberosum] Length = 816 Score = 286 bits (733), Expect = 7e-75 Identities = 145/244 (59%), Positives = 183/244 (75%) Frame = +1 Query: 322 SKVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC 501 SKVGNL++VASIAKAL +PGGTRNLE+ SI LSE LVLQVL RN+L+A KKLDFF WC Sbjct: 35 SKVGNLLVVASIAKALIKPGGTRNLEQYGDSIPLSESLVLQVLRRNNLDAEKKLDFFKWC 94 Query: 502 SLKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 SL+P + HS TYSQ+F++I + + I LL M D ++L++ATFKL+L++F R+G Sbjct: 95 SLRPSFKHSTETYSQMFKSICYSHNHREAIFVLLNSMKDDKVLLNAATFKLLLDSFTRTG 154 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNG 861 FDSALEIL +E DL SC PD+Y++VLIAL++KNQ+ +ALSIFLKLL+ + D N Sbjct: 155 NFDSALEILEFVEGDLDNSSCLSPDVYNSVLIALVQKNQVNLALSIFLKLLETN--DGNS 212 Query: 862 TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLST 1041 A++CNELLVGL++ NM+ +F VF KLR FP DR GYNICIH FGCWGDLS+ Sbjct: 213 IGVSSAVACNELLVGLKRGNMRAEFKQVFDKLRGGNVFPFDRWGYNICIHTFGCWGDLSS 272 Query: 1042 SLIL 1053 SL L Sbjct: 273 SLSL 276 >ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like [Solanum lycopersicum] Length = 819 Score = 284 bits (727), Expect = 4e-74 Identities = 145/244 (59%), Positives = 180/244 (73%) Frame = +1 Query: 322 SKVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC 501 SKVGNL++VASIAKAL + GGTRNLEK I LSE LVLQVL RN+L+A KKLDFF WC Sbjct: 38 SKVGNLIVVASIAKALIKRGGTRNLEKYGDLIPLSESLVLQVLRRNNLDAEKKLDFFKWC 97 Query: 502 SLKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 SL+P + HS TYSQ+F+ I + +++ LL M D ++L+SATFKL+L++F R+G Sbjct: 98 SLRPNFKHSTETYSQMFKCICYSRNHREDVFVLLNSMKDDEVLLNSATFKLLLDSFTRTG 157 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNG 861 FDSALEIL +E DL SC PD+Y++VLIAL++KNQ+ +ALSIFLKLL+ + D N Sbjct: 158 NFDSALEILEFVEGDLANSSCLSPDVYNSVLIALVQKNQVNLALSIFLKLLETN--DGNS 215 Query: 862 TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLST 1041 AI+CNELLVGL++ NM+ +F VF KLR FP DR GYNICIH FGCWGDLS Sbjct: 216 IGVSSAIACNELLVGLKRGNMRAEFKQVFDKLRGGNVFPFDRWGYNICIHAFGCWGDLSR 275 Query: 1042 SLIL 1053 SL L Sbjct: 276 SLSL 279 >gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlisea aurea] Length = 770 Score = 254 bits (648), Expect = 5e-65 Identities = 130/241 (53%), Positives = 171/241 (70%), Gaps = 1/241 (0%) Frame = +1 Query: 334 NLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWCSLKP 513 N+++VASI K LS+ G + LEK+A SI LSED+VLQ++H SL SKKL+FF WCS +P Sbjct: 1 NILVVASITKILSKFGALQYLEKNADSIPLSEDVVLQIVHHRSLVISKKLEFFRWCSSRP 60 Query: 514 CYSHSASTYSQIFRTISSCP-QYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKFD 690 Y+H+A+ YS++ R I P Q+H+ ++ LL LM DG++LDS T K ILN IR+ KFD Sbjct: 61 DYNHTANAYSEMLRAIFRFPNQHHNNVIELLALMKRDGVILDSDTLKRILNGLIRAQKFD 120 Query: 691 SALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNGTDT 870 AL++L+++EKD PD+YS VL+AL+RK+Q+ IAL +F KLL + D Sbjct: 121 YALDVLDYIEKDSVIAGNLSPDVYSPVLVALVRKDQISIALPVFFKLLHSQFED----YI 176 Query: 871 PDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLSTSLI 1050 PDA +CNELL GL+K MK++F VF KLRE +P DR GYNICIH FGCWGDLST+L Sbjct: 177 PDAFACNELLAGLKKKKMKNEFREVFAKLRETARYPSDRWGYNICIHSFGCWGDLSTALS 236 Query: 1051 L 1053 L Sbjct: 237 L 237 >ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Populus trichocarpa] gi|550345304|gb|EEE81962.2| hypothetical protein POPTR_0002s18390g [Populus trichocarpa] Length = 776 Score = 248 bits (632), Expect = 4e-63 Identities = 135/245 (55%), Positives = 178/245 (72%), Gaps = 3/245 (1%) Frame = +1 Query: 328 VGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWCSL 507 +GN++LVA + K LSE GTR+L+ D SI LSE LVLQ+L RNSL++SKK++FF WCS+ Sbjct: 1 MGNILLVAYLTKTLSE-SGTRSLDPD--SIPLSESLVLQILRRNSLDSSKKMEFFKWCSV 57 Query: 508 KPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKF 687 + Y HS STYSQ+F T+ Y DE+ +LL M +DG+V+ S TFKL+L+AFIRSGKF Sbjct: 58 RHIYKHSVSTYSQMFSTLCRSG-YLDEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGKF 116 Query: 688 DSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDN--- 858 DSAL+IL+HME +LG S +P +Y ++++AL +KNQ+ +ALSI KLL+ S ++ Sbjct: 117 DSALDILDHME-ELG--SNPNPHMYDSIIVALAKKNQVGLALSIMFKLLEASDGNEENAV 173 Query: 859 GTDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLS 1038 G P +++CN LLV LR MK +F VF KLR KG F L+ GYNICIH FGCWGDL+ Sbjct: 174 GVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRGKGGFELNTWGYNICIHAFGCWGDLT 233 Query: 1039 TSLIL 1053 TSL L Sbjct: 234 TSLRL 238 >gb|EOX95524.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 807 Score = 244 bits (623), Expect = 4e-62 Identities = 138/247 (55%), Positives = 178/247 (72%), Gaps = 5/247 (2%) Frame = +1 Query: 328 VGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC-S 504 +GN++L+AS+ K LSE GTRNL D SI +SE LV+Q+L ++SLE SKKLDFF+WC S Sbjct: 23 LGNILLIASLTKTLSE-SGTRNL--DPNSIPISEPLVIQILRKHSLEPSKKLDFFNWCRS 79 Query: 505 LKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGK 684 +KP + HSA TYS IFRT+ + +E+ NLL M DG+++DS TFK +L+AFIRSGK Sbjct: 80 VKPNFKHSAVTYSHIFRTLCRSG-FVEEVPNLLFAMKEDGVLVDSDTFKFLLDAFIRSGK 138 Query: 685 FDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNGT 864 FDSALEIL+ ME +LGA + +Y +VL+ALIRK+Q+ +ALS+F KLL+ +D+G Sbjct: 139 FDSALEILDFME-ELGA--GLNLRVYDSVLVALIRKDQVGLALSLFFKLLEACNGNDDGN 195 Query: 865 DT----PDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGD 1032 P +I+ NELLV LRKA+M+ +F VF LREK F D CGYNICIH FGCWGD Sbjct: 196 SVDSSLPGSIAINELLVALRKAHMRREFKQVFDILREKREFEFDTCGYNICIHSFGCWGD 255 Query: 1033 LSTSLIL 1053 L SL L Sbjct: 256 LGASLKL 262 >ref|XP_002334407.1| predicted protein [Populus trichocarpa] Length = 513 Score = 243 bits (619), Expect = 1e-61 Identities = 133/245 (54%), Positives = 177/245 (72%), Gaps = 3/245 (1%) Frame = +1 Query: 328 VGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWCSL 507 +GN++LVA + K LSE GTR+L+ D SI LSE LVLQ+L RNSL++SKK++FF WCS+ Sbjct: 1 MGNILLVAYLTKTLSE-SGTRSLDPD--SIPLSESLVLQILRRNSLDSSKKMEFFKWCSV 57 Query: 508 KPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKF 687 + Y HS STYSQ+F T+ Y +E+ +LL M +DG+V+ S TFKL+L+AFIRSGKF Sbjct: 58 RHIYKHSVSTYSQMFSTLCRSG-YLEEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGKF 116 Query: 688 DSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDN--- 858 DSAL+IL+HME +LG S +P +Y ++++AL +KNQ+ +ALSI KLL+ S ++ Sbjct: 117 DSALDILDHME-ELG--SNPNPHMYDSIIVALAKKNQVGLALSIMFKLLEASDGNEENAV 173 Query: 859 GTDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLS 1038 G P +++CN LLV LR MK +F VF KLR K F L+ GYNICIH FGCWGDL+ Sbjct: 174 GVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRGKVGFELNTWGYNICIHAFGCWGDLT 233 Query: 1039 TSLIL 1053 TSL L Sbjct: 234 TSLRL 238 >ref|XP_006386676.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550345301|gb|ERP64473.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 776 Score = 239 bits (610), Expect = 1e-60 Identities = 132/245 (53%), Positives = 176/245 (71%), Gaps = 3/245 (1%) Frame = +1 Query: 328 VGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWCSL 507 +GN++LVA + K LSE GTR+L+ D SI LSE LVLQ+L RNSL++SKK++FF WCS+ Sbjct: 1 MGNILLVAYLTKTLSE-SGTRSLDPD--SIPLSEYLVLQILRRNSLDSSKKMEFFKWCSV 57 Query: 508 KPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKF 687 + Y HS STYSQ+F T+ Y +E+ +LL M +DG+V+ S TFKL+L+AFIRSGKF Sbjct: 58 RHIYKHSVSTYSQMFSTLCRSG-YLEEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGKF 116 Query: 688 DSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNGT- 864 DSAL+IL+HME +LG S +P +Y ++++AL +KNQ+ +ALSI KLL+ S ++ Sbjct: 117 DSALDILDHME-ELG--SNPNPHMYDSIIVALAKKNQVGLALSIMFKLLEASDGNEENAV 173 Query: 865 --DTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLS 1038 P +++CN LLV LR MK +F VF KLR K F L+ GYNICIH FGCWGDL+ Sbjct: 174 RVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRGKVGFKLNTWGYNICIHAFGCWGDLT 233 Query: 1039 TSLIL 1053 TSL L Sbjct: 234 TSLRL 238 >ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutrema salsugineum] gi|557097371|gb|ESQ37807.1| hypothetical protein EUTSA_v10028437mg [Eutrema salsugineum] Length = 801 Score = 239 bits (609), Expect = 2e-60 Identities = 139/249 (55%), Positives = 174/249 (69%), Gaps = 9/249 (3%) Frame = +1 Query: 334 NLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC-SLK 510 N+++VAS++K LS GTRNL DA S +SE +VLQ+L RNSL+ SKKLDFF WC SL+ Sbjct: 29 NVLVVASLSKTLSH-SGTRNL--DANSTPISEPIVLQILRRNSLDPSKKLDFFRWCFSLR 85 Query: 511 PCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKFD 690 P Y HSAS YSQIFRT+ EI NLL M DG+ LD T KL+L++ IRSGK+D Sbjct: 86 PGYKHSASAYSQIFRTVCRTGLL-GEIPNLLGSMKEDGVNLDQTTSKLLLDSLIRSGKYD 144 Query: 691 SALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNGTDT 870 SAL +L++ME +LG C +P +Y +VLIAL++KN+LR+ALSIF KLL+ A DN ++T Sbjct: 145 SALGVLDYME-ELGG--CLNPRLYDSVLIALVKKNELRLALSIFFKLLE---ASDNPSET 198 Query: 871 --------PDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCW 1026 P ++ NELLVGLRKANMK +F VF KL+ F D GYNICIHGFGCW Sbjct: 199 GGVSVSYLPGTVAVNELLVGLRKANMKLEFKGVFDKLKGMERFKFDTWGYNICIHGFGCW 258 Query: 1027 GDLSTSLIL 1053 GDL +L L Sbjct: 259 GDLDAALSL 267 >ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Capsella rubella] gi|482558640|gb|EOA22832.1| hypothetical protein CARUB_v10003556mg [Capsella rubella] Length = 802 Score = 235 bits (600), Expect = 2e-59 Identities = 131/247 (53%), Positives = 174/247 (70%), Gaps = 7/247 (2%) Frame = +1 Query: 334 NLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC-SLK 510 N++LVAS++K LS+ GTR+L DA SI +SE +VLQ+L R+S+++SKKLDFF WC SL+ Sbjct: 29 NVLLVASLSKTLSQ-SGTRSL--DANSIPISESVVLQILRRSSIDSSKKLDFFRWCFSLR 85 Query: 511 PCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKFD 690 P Y HSAS YSQIFRT+ E+ +LL M DG+ LD K++L++ IRSGKFD Sbjct: 86 PGYKHSASAYSQIFRTVCRTGLI-GEVPDLLGSMKDDGVNLDQTMAKVLLDSLIRSGKFD 144 Query: 691 SALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNGTD- 867 SAL +L++ME +LG C +P +Y +VL+AL++KN++R+ALSIF KLL+ S +GT Sbjct: 145 SALGVLDYME-ELG--DCLNPGLYDSVLVALVKKNEMRLALSIFFKLLEASDNHSDGTGG 201 Query: 868 -----TPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGD 1032 P ++ NELLVGLR+A M+ +F VF KLRE F D GYNICIHGFGCWGD Sbjct: 202 VIVSYLPGTVAVNELLVGLRRAGMRSEFKRVFEKLREVKRFKFDTWGYNICIHGFGCWGD 261 Query: 1033 LSTSLIL 1053 L +L L Sbjct: 262 LDAALSL 268 >ref|XP_006444679.1| hypothetical protein CICLE_v10023806mg [Citrus clementina] gi|557546941|gb|ESR57919.1| hypothetical protein CICLE_v10023806mg [Citrus clementina] Length = 619 Score = 227 bits (578), Expect = 7e-57 Identities = 127/248 (51%), Positives = 172/248 (69%), Gaps = 5/248 (2%) Frame = +1 Query: 325 KVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWCS 504 ++G+++L+A + K L E GTRNL D SI +SE LVLQVL +NSL++SKKLDFF WCS Sbjct: 18 QLGSILLLAFVTKTLKE-SGTRNL--DPRSIPISEPLVLQVLGKNSLDSSKKLDFFRWCS 74 Query: 505 -LKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 L+P Y H+A TYS IFRT+ + +E+ +LL M D +V+DS TFKL+L A I+SG Sbjct: 75 SLRPIYKHTACTYSHIFRTVCRAG-FLEEVPSLLNSMQEDDVVVDSETFKLLLEACIKSG 133 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLD--NSRADD 855 K D A+EIL++ME +LG + P++Y +VL++L+RK QL +A+SI KLL+ N D Sbjct: 134 KIDFAIEILDYME-ELG--TSLSPNVYDSVLVSLVRKKQLGLAMSILFKLLEACNDNTAD 190 Query: 856 NGT--DTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWG 1029 N P ++CNELLV LRK++ + +F VF +L+E+ F D GYNICIH FGCWG Sbjct: 191 NSVVESLPGCVACNELLVALRKSDRRSEFKQVFERLKEQKEFEFDIYGYNICIHAFGCWG 250 Query: 1030 DLSTSLIL 1053 DL TSL L Sbjct: 251 DLHTSLRL 258 >gb|AAC62783.1| F11O4.7 [Arabidopsis thaliana] Length = 508 Score = 226 bits (577), Expect = 9e-57 Identities = 129/249 (51%), Positives = 172/249 (69%), Gaps = 9/249 (3%) Frame = +1 Query: 334 NLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC-SLK 510 N++LVAS++K LS+ GTR+L DA SI +SE +VLQ+L RNS++ SKKLDFF WC SL+ Sbjct: 29 NVLLVASLSKTLSQ-SGTRSL--DANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLR 85 Query: 511 PCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKFD 690 P Y HSA+ YSQIFRT+ E+ +LL M DG+ LD K++L++ IRSGKF+ Sbjct: 86 PGYKHSATAYSQIFRTVCRTGLL-GEVPDLLGSMKEDGVNLDQTMAKILLDSLIRSGKFE 144 Query: 691 SALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLL---DNSRADDNG 861 SAL +L++ME +LG C +P +Y +VLIAL++K++LR+ALSI KLL DN DD G Sbjct: 145 SALGVLDYME-ELG--DCLNPSVYDSVLIALVKKHELRLALSILFKLLEASDNHSDDDTG 201 Query: 862 -----TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCW 1026 + P ++ NELLVGLR+A+M+ +F VF KL+ F D YNICIHGFGCW Sbjct: 202 RVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKGMKRFKFDTWSYNICIHGFGCW 261 Query: 1027 GDLSTSLIL 1053 GDL +L L Sbjct: 262 GDLDAALSL 270 >ref|NP_192066.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75161629|sp|Q8VZE4.1|PP299_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g01570 gi|18086402|gb|AAL57659.1| AT4g01570/T15B16_21 [Arabidopsis thaliana] gi|24797024|gb|AAN64524.1| At4g01570/T15B16_21 [Arabidopsis thaliana] gi|332656643|gb|AEE82043.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 805 Score = 226 bits (577), Expect = 9e-57 Identities = 129/249 (51%), Positives = 172/249 (69%), Gaps = 9/249 (3%) Frame = +1 Query: 334 NLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC-SLK 510 N++LVAS++K LS+ GTR+L DA SI +SE +VLQ+L RNS++ SKKLDFF WC SL+ Sbjct: 29 NVLLVASLSKTLSQ-SGTRSL--DANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLR 85 Query: 511 PCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKFD 690 P Y HSA+ YSQIFRT+ E+ +LL M DG+ LD K++L++ IRSGKF+ Sbjct: 86 PGYKHSATAYSQIFRTVCRTGLL-GEVPDLLGSMKEDGVNLDQTMAKILLDSLIRSGKFE 144 Query: 691 SALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLL---DNSRADDNG 861 SAL +L++ME +LG C +P +Y +VLIAL++K++LR+ALSI KLL DN DD G Sbjct: 145 SALGVLDYME-ELG--DCLNPSVYDSVLIALVKKHELRLALSILFKLLEASDNHSDDDTG 201 Query: 862 -----TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCW 1026 + P ++ NELLVGLR+A+M+ +F VF KL+ F D YNICIHGFGCW Sbjct: 202 RVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKGMKRFKFDTWSYNICIHGFGCW 261 Query: 1027 GDLSTSLIL 1053 GDL +L L Sbjct: 262 GDLDAALSL 270 >ref|XP_002874971.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297320808|gb|EFH51230.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 802 Score = 226 bits (577), Expect = 9e-57 Identities = 130/249 (52%), Positives = 171/249 (68%), Gaps = 9/249 (3%) Frame = +1 Query: 334 NLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC-SLK 510 N++LVAS++K LS+ GTR L DA SI +SE +VLQ+L RNS++ SKKLDFF WC SL+ Sbjct: 29 NVLLVASLSKTLSQ-SGTRGL--DANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLR 85 Query: 511 PCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSGKFD 690 Y HS S YSQIFRT+ E+ +LL M DG+ LD K++L++ IRSGKF+ Sbjct: 86 TGYKHSVSAYSQIFRTVCRTGLL-GEVPDLLCSMKEDGVNLDQTMAKILLDSLIRSGKFE 144 Query: 691 SALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNGTDT 870 SAL +L++ME +LG C +P +Y +VLIAL +KN+LR+ALSIF KLL+ S D++G DT Sbjct: 145 SALGVLDYME-ELG--DCLNPSLYDSVLIALAKKNELRLALSIFFKLLEAS--DNHGDDT 199 Query: 871 --------PDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCW 1026 P ++ NELLVGLR+A+M+ +F VF KL+ F D YNICIHGFGCW Sbjct: 200 SGVTVSYLPGRVAVNELLVGLRRADMRSEFKTVFEKLKGMNRFKFDTWSYNICIHGFGCW 259 Query: 1027 GDLSTSLIL 1053 GDL +L L Sbjct: 260 GDLDAALSL 268 >ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like [Citrus sinensis] Length = 790 Score = 225 bits (573), Expect = 3e-56 Identities = 126/248 (50%), Positives = 171/248 (68%), Gaps = 5/248 (2%) Frame = +1 Query: 325 KVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWCS 504 ++G+++L+A + K L E GTRNL D SI +SE LVLQVL +NSL++SKKLDFF WCS Sbjct: 18 QLGSILLLAFVTKTLKE-SGTRNL--DPRSIPISEPLVLQVLGKNSLDSSKKLDFFRWCS 74 Query: 505 -LKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 L+P Y H+A TYS IFRT+ + +E+ +LL M D +V+DS TFKL+L I+SG Sbjct: 75 SLRPIYKHTACTYSHIFRTVCRAG-FLEEVPSLLNSMQEDDVVVDSETFKLLLEPCIKSG 133 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLD--NSRADD 855 K D A+EIL++ME +LG + P++Y +VL++L+RK QL +A+SI KLL+ N D Sbjct: 134 KIDFAIEILDYME-ELG--TSLSPNVYDSVLVSLVRKKQLGLAMSILFKLLEACNDNTAD 190 Query: 856 NGT--DTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWG 1029 N P ++CNELLV LRK++ + +F VF +L+E+ F D GYNICIH FGCWG Sbjct: 191 NSVVESLPGCVACNELLVALRKSDRRSEFKQVFERLKEQKEFEFDIYGYNICIHAFGCWG 250 Query: 1030 DLSTSLIL 1053 DL TSL L Sbjct: 251 DLHTSLRL 258 >ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Vitis vinifera] Length = 792 Score = 224 bits (571), Expect = 4e-56 Identities = 124/244 (50%), Positives = 168/244 (68%), Gaps = 1/244 (0%) Frame = +1 Query: 325 KVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWCS 504 K+G+++LVASI+K LSE G TR+ D SI +SE LV+Q+L RNS++ +K++FF WCS Sbjct: 18 KLGDMLLVASISKTLSERG-TRS--PDLESIPISESLVVQILGRNSIDVFRKVEFFRWCS 74 Query: 505 LKPCYSHSASTYSQIFRTISSC-PQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 + Y HS YS IFR + ++ D++ L+ M DG+V+ TFKL+L++ IR+G Sbjct: 75 FRHNYKHSVGAYSHIFRIVCRAGAEFLDQVPLLMSSMKDDGVVVGQETFKLLLDSLIRAG 134 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNG 861 KFDSALEIL+H+E +LG + + +Y +VL+ALIRKNQL +AL +F KLL G Sbjct: 135 KFDSALEILDHIE-ELG--TGLNSYVYDSVLVALIRKNQLGLALPLFFKLLGGDEGQ-GG 190 Query: 862 TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLST 1041 P++ +CN+LLV LRKA+MK +F VF KLR K F LD GYNICIH FGCWGDL T Sbjct: 191 VPVPESNACNQLLVALRKADMKIEFRNVFEKLRAKKDFDLDTQGYNICIHAFGCWGDLGT 250 Query: 1042 SLIL 1053 +L L Sbjct: 251 ALNL 254 >ref|XP_002515124.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223545604|gb|EEF47108.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 898 Score = 219 bits (559), Expect = 1e-54 Identities = 122/244 (50%), Positives = 166/244 (68%) Frame = +1 Query: 322 SKVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC 501 +++ +++LVA + KALSE G RNL+ D I LSE L+LQ+L +NSL+ASKK++FF WC Sbjct: 47 NQLESILLVAFLNKALSE-SGVRNLDPDF--IPLSEPLILQILRQNSLDASKKIEFFKWC 103 Query: 502 SLKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 S Y HSA YS +FRT+ + Y +E+ +LL M D ++ + TFK +L+ FI G Sbjct: 104 SFSHNYKHSACVYSHMFRTVCNAG-YFEEVRSLLNSMKDDCAIVGTGTFKFLLDTFINLG 162 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNG 861 FD ALE+L+ ME +LG + +P +Y +VL+AL RKNQ+ +ALSIF KLL+ S D G Sbjct: 163 NFDFALELLDVME-ELG--TNLNPHMYDSVLVALTRKNQIGLALSIFFKLLETSNDIDIG 219 Query: 862 TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLST 1041 P +++CN LLV LRKA+M+ +F VF KL+ G F LD GYNICIH FGCW DL T Sbjct: 220 VSVPGSVACNTLLVALRKADMRVEFKKVFDKLKGMG-FELDTWGYNICIHAFGCWSDLGT 278 Query: 1042 SLIL 1053 +L L Sbjct: 279 ALRL 282 >ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like [Cucumis sativus] gi|449523383|ref|XP_004168703.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like [Cucumis sativus] Length = 803 Score = 219 bits (558), Expect = 1e-54 Identities = 129/251 (51%), Positives = 169/251 (67%), Gaps = 7/251 (2%) Frame = +1 Query: 322 SKVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC 501 S + +L+L+ASI K LSE GTR L+ S+ +S L+LQ+LH SL S KLDFF WC Sbjct: 24 SHLSHLLLLASITKTLSE-SGTRTLQHH--SLPISHPLLLQILHSRSLNPSHKLDFFKWC 80 Query: 502 SLKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 SL P ++HS STYSQIF + H E+ LL M DG+ +DS TFK++L+AFIRSG Sbjct: 81 SLAPNFNHSPSTYSQIFHILCRSGYLH-EVPPLLDSMKRDGVSVDSHTFKVLLDAFIRSG 139 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLD---NSRAD 852 K+D+ALEIL+HME DLG + + + Y++VL+AL+RKNQ+ +ALSIF KLLD N Sbjct: 140 KYDAALEILDHME-DLG--TSLELNTYNSVLVALLRKNQVGLALSIFFKLLDGFNNGGQV 196 Query: 853 DNGTDT----PDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFG 1020 D+ T P++++CNELLV LRK +M+ +F VF KLR F GYNICI+ FG Sbjct: 197 DSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVFDKLRAIESFEFSVYGYNICIYAFG 256 Query: 1021 CWGDLSTSLIL 1053 CWG L T+L L Sbjct: 257 CWGYLDTALSL 267 >ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like [Fragaria vesca subsp. vesca] Length = 789 Score = 210 bits (534), Expect = 9e-52 Identities = 117/244 (47%), Positives = 159/244 (65%) Frame = +1 Query: 322 SKVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC 501 +++G+++LVASI K LS+ GTRNL + + L+E L+LQ+L SL SKKLDFF WC Sbjct: 17 AELGDILLVASITKTLSQ-SGTRNLPQP---LPLTEPLLLQILRTQSLHPSKKLDFFKWC 72 Query: 502 SLKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 SL S +S + T + EI LL +M D L +DS TFK +L+AFIR G Sbjct: 73 SLTHSIPPSPRAFSHVLHTACRAG-FLAEIPELLTIMRRDSLAVDSGTFKSLLDAFIREG 131 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNG 861 KFD A+EIL+ M++ ++ D+Y++VL+AL+RK QLR+A+SI ++LL+ D Sbjct: 132 KFDMAIEILDTMQEVNAELNA---DMYNSVLVALVRKGQLRLAMSILVRLLEGGSCDQ-- 186 Query: 862 TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLST 1041 P I+CNELLVGLRK +M+ +F V+ KLR +F +D GYNICIH FGCWGDL T Sbjct: 187 --VPSCIACNELLVGLRKGDMRVEFKQVYDKLRGNEWFEMDTWGYNICIHAFGCWGDLGT 244 Query: 1042 SLIL 1053 SL L Sbjct: 245 SLSL 248 >gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis] Length = 788 Score = 204 bits (520), Expect = 4e-50 Identities = 125/245 (51%), Positives = 163/245 (66%), Gaps = 1/245 (0%) Frame = +1 Query: 322 SKVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC 501 S++ +++LVAS+ K LSE TR L D SI LSE ++LQ+L NSL SKKLDFF W Sbjct: 18 SQLADVLLVASLTKTLSE-SSTRYLP-DPRSIPLSEPILLQILRNNSLHISKKLDFFTWF 75 Query: 502 SLKPCYSHSASTYSQIFRTISSCPQYH-DEILNLLKLMSSDGLVLDSATFKLILNAFIRS 678 SL SA +YSQ+ R + C + H E NLL M +G+++DS TFK +L+ FIRS Sbjct: 76 SLNSDLKPSAHSYSQVLRAL--CREGHLHEASNLLGSMRQNGVIIDSWTFKTLLDTFIRS 133 Query: 679 GKFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDN 858 GKFD ALEIL+ ME +LG S +Y +VLIAL+RK+QL ALSIF K+L++S Sbjct: 134 GKFDFALEILDTME-ELGVTLNSH--MYDSVLIALVRKDQLSFALSIFFKILEDS----- 185 Query: 859 GTDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLS 1038 + P +I CNELLV L+K++M+ +F VF +REK F ++ GYNICIH FG WGDL Sbjct: 186 -SHVPSSIGCNELLVALKKSDMRVEFKQVFDGIREKKGFGMNVWGYNICIHAFGFWGDLG 244 Query: 1039 TSLIL 1053 TSL L Sbjct: 245 TSLSL 249 >gb|EMJ21345.1| hypothetical protein PRUPE_ppa019625mg [Prunus persica] Length = 558 Score = 197 bits (501), Expect = 6e-48 Identities = 120/244 (49%), Positives = 151/244 (61%) Frame = +1 Query: 322 SKVGNLVLVASIAKALSEPGGTRNLEKDAGSIHLSEDLVLQVLHRNSLEASKKLDFFHWC 501 S++G+++LVASI K LS GTRNL D ++ LSE L+LQ+L SL SKK+DFF WC Sbjct: 16 SQLGDILLVASITKTLSS-SGTRNLP-DPHTLSLSEPLLLQILRAQSLHPSKKVDFFKWC 73 Query: 502 SLKPCYSHSASTYSQIFRTISSCPQYHDEILNLLKLMSSDGLVLDSATFKLILNAFIRSG 681 SL HSA TYS I RT S H E+ +LL M DG+V+DS TFK +L+AFIRSG Sbjct: 74 SLTHNIKHSARTYSHILRTASRAGFLH-EVPHLLHSMKEDGVVIDSQTFKALLDAFIRSG 132 Query: 682 KFDSALEILNHMEKDLGAISCSDPDIYSTVLIALIRKNQLRIALSIFLKLLDNSRADDNG 861 KFD ALEIL+ ME ++GA + D+Y++VL+AL+RKNQ+ +A+SI LKLL+ + Sbjct: 133 KFDYALEILDIME-EVGA--SLNTDMYNSVLVALVRKNQVGLAMSILLKLLEGGCSSQQ- 188 Query: 862 TDTPDAISCNELLVGLRKANMKDQFMLVFHKLREKGFFPLDRCGYNICIHGFGCWGDLST 1041 VF KLRE F +D GYNICIH FGCWGDL T Sbjct: 189 ---------------------------VFDKLRENKGFEMDNWGYNICIHAFGCWGDLGT 221 Query: 1042 SLIL 1053 SL L Sbjct: 222 SLSL 225