BLASTX nr result
ID: Catharanthus22_contig00014370
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00014370 (2679 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi... 466 e-128 ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi... 461 e-127 ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi... 449 e-123 gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein... 444 e-122 ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr... 431 e-118 gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus pe... 431 e-117 gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis] 418 e-114 ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi... 417 e-113 ref|XP_002521239.1| pentatricopeptide repeat-containing protein,... 409 e-111 ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223... 408 e-111 ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204... 408 e-111 gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu... 407 e-110 gb|AGH33847.1| PPR [Cucumis melo] 406 e-110 ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps... 400 e-108 ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr... 395 e-107 ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar... 393 e-106 dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] 393 e-106 ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar... 393 e-106 ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu... 389 e-105 ref|XP_002884032.1| pentatricopeptide repeat-containing protein ... 388 e-105 >ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Solanum tuberosum] Length = 459 Score = 466 bits (1199), Expect = e-128 Identities = 241/437 (55%), Positives = 318/437 (72%), Gaps = 1/437 (0%) Frame = -1 Query: 2466 RRCSPCRRPGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVA 2287 RR PC R +LSKQGHRF D SAT R L+RKFV SS KHVA Sbjct: 23 RRPRPCPRC-------SLSKQGHRFLSTLIAADS---EDISAT-RHLLRKFVASSSKHVA 71 Query: 2286 LDXXXXXXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAET 2110 L ++A+PLYL I+EASWF+WN+KLVAD++A++YK E+FDEAET Sbjct: 72 LSTLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAET 131 Query: 2109 LILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAY 1930 L+ ET+ K+G +ER++C+FY LI S +KH + V D +K + SSS Y+K+R Y Sbjct: 132 LVTETVSKLGSRERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGY 191 Query: 1929 ESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQN 1750 SMV C IG PR+AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E+++ Sbjct: 192 ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMES 251 Query: 1749 QGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQ 1570 GF+LDTV +NMVL+S G+H ELSE+VS LQ++++ + FS+RTYNSVLNSCPTI L+LQ Sbjct: 252 MGFQLDTVSSNMVLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQ 311 Query: 1569 DIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIF 1390 D+KSVP+S+E L+ NL ++E ++V L+GSSVL+E M+W SE+KLDLHGMHL+ +Y+I Sbjct: 312 DLKSVPLSLEELMGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVII 371 Query: 1389 LQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDR 1210 LQW ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDR Sbjct: 372 LQWFHQLQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDR 430 Query: 1209 KNVGCFIAKGKVFRDWL 1159 KN+GCFIAKGK F +WL Sbjct: 431 KNIGCFIAKGKSFMEWL 447 >ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Solanum lycopersicum] Length = 459 Score = 461 bits (1187), Expect = e-127 Identities = 240/431 (55%), Positives = 316/431 (73%), Gaps = 1/431 (0%) Frame = -1 Query: 2448 RRPGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXX 2269 RRP R +LSKQGHRF D SAT R L+RKFV SS KHVAL Sbjct: 23 RRPRPGPRC-SLSKQGHRFLSTLIATDSD---DISAT-RHLLRKFVGSSSKHVALSTLSH 77 Query: 2268 XXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETM 2092 ++A+PLYL I+EASWF+WN+KLVA+++A++YK E+FDEAETL+ E++ Sbjct: 78 LVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETLVTESV 137 Query: 2091 KKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRS 1912 K+G +ER++C+FY LI S +KH + V D +K + SSS Y+K+R Y SMV Sbjct: 138 SKLGSRERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLLHSSSVYLKQRGYASMVEG 197 Query: 1911 LCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELD 1732 C IG PR+AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E++ GF+LD Sbjct: 198 FCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMERMGFQLD 257 Query: 1731 TVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVP 1552 TV +NMVL+S G+H ELSE+VS LQ++++ + FS+RTYNSVLNSCPTI L+LQD+KSVP Sbjct: 258 TVGSNMVLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQDLKSVP 317 Query: 1551 ISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDV 1372 +S+E L+ NL ++E ++V L+GSSVL+E M+W E+KLDLHGMHL+ +YLI LQW Sbjct: 318 LSLEELMGNLDENEAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLIILQWFHQ 377 Query: 1371 MRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCF 1192 ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDRKNVGCF Sbjct: 378 LQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKNVGCF 436 Query: 1191 IAKGKVFRDWL 1159 IAKGKVF +WL Sbjct: 437 IAKGKVFMEWL 447 >ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed protein product [Vitis vinifera] Length = 435 Score = 449 bits (1155), Expect = e-123 Identities = 228/421 (54%), Positives = 309/421 (73%) Frame = -1 Query: 2418 ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 2239 ALSKQG F RD SA+ R LI KF+ SS K +AL+ Sbjct: 24 ALSKQGQLFLSSV-------ARDPSASNR-LICKFIASSSKSIALNALSHLLSPTTTHPY 75 Query: 2238 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 2059 S++A+PLY I+EASWF+WN KL+ADVIA++YK Q EAETL+ ET+ K+G +ER++ Sbjct: 76 LSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLV 135 Query: 2058 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAE 1879 +FYCNLI+S +KH + V D+ + + I + SSS YVK+RAY+SM+ SLC +G P EAE Sbjct: 136 SFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAE 195 Query: 1878 DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 1699 +L+EEMR GLK S FE R++VY YG++GL EDM+R ++++ N+GFELDTV +NMVLSS Sbjct: 196 NLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSY 255 Query: 1698 GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 1519 G + + SEMVSWLQRMK+ I FS+RTYNSVLNSCP I+ +LQD+K+ P +++ L++ L Sbjct: 256 GAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLK 315 Query: 1518 KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 1339 DE L+V EL+GS VL E+MEW+ SE KLDLHGMHL +YLI LQW + +R+R ++ + Sbjct: 316 GDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAA-EY 374 Query: 1338 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 1159 ++P EITVVCG GKHS+VRG+SPVK +++EM+ R + P+KIDRKN+GCF+AK KV ++WL Sbjct: 375 VMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKNWL 434 Query: 1158 C 1156 C Sbjct: 435 C 435 >gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative [Theobroma cacao] Length = 456 Score = 444 bits (1143), Expect = e-122 Identities = 221/420 (52%), Positives = 302/420 (71%), Gaps = 1/420 (0%) Frame = -1 Query: 2415 LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 2236 L+KQGHRF + AT LI+KFV SSPK +AL+ Sbjct: 34 LTKQGHRFFSSLAATADV---NDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHL 90 Query: 2235 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 2056 SA+A PLY I+E SW+NWN KLVA++IA++ K ++DE+E LI + + K+ +ER++ Sbjct: 91 SALAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQ 150 Query: 2055 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAED 1876 FYCN IES +KH KE +D Y Y+ + SSS YVK++ Y+SMV SLC++ +P EAE+ Sbjct: 151 FYCNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAEN 210 Query: 1875 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 1696 L+EEMR+ GL + FE R + Y YG++GL EDM+R V E++ +GFE+DT+C+NMVLSS G Sbjct: 211 LVEEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYG 270 Query: 1695 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 1516 + S+MV WLQ+MK+L+I FS+RTYNSVLNSCP I+ ++Q + SVP+S+ L K L + Sbjct: 271 AYNAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILNE 330 Query: 1515 DEVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 1339 DE L+V EL+ SSVLDE MEWN SE KLDLHGMHL +YLI LQWI+ M+ RF + Sbjct: 331 DEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKV-EEC 389 Query: 1338 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 1159 ++P +IT+VCG GKHS+VRG+SPVK+LM++M+++MK P+KIDRKN+GCFIAKG+V ++WL Sbjct: 390 VIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449 >ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] gi|568866680|ref|XP_006486677.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Citrus sinensis] gi|557524456|gb|ESR35762.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] Length = 451 Score = 431 bits (1109), Expect = e-118 Identities = 228/441 (51%), Positives = 299/441 (67%), Gaps = 5/441 (1%) Frame = -1 Query: 2463 RCSPCRRPGLSL---RLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKH 2293 RC R+ L+L L+KQG RF RDS A R LI KFV SSP+ Sbjct: 16 RCCRLRQQRLTLVQCLTARLTKQGQRFLSSLALAV---TRDSKAASR-LISKFVASSPQF 71 Query: 2292 VALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAE 2113 +AL+ S++A PLY+ ITE SWF WN KLVA++IA + K Q +EAE Sbjct: 72 IALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAE 131 Query: 2112 TLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRA 1933 TLILET+ K+G +ER + FYCNLI+S KH K D Y + + SSS YVK++A Sbjct: 132 TLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVNSSSSVYVKRQA 191 Query: 1932 YESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQ 1753 +SM+ LC++GQP EAE+L+EEMR GL+ S FE + ++Y YG++GL+EDM+R V +++ Sbjct: 192 LKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQME 251 Query: 1752 NQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLML 1573 + G +DTVC+NMVLSS G H ELS MV WLQ+MK I FSVRTYNSVLNSC TI+ ML Sbjct: 252 SDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSML 311 Query: 1572 QDIKS--VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSY 1399 QD+ S P+S+ L + L ++EV VV EL SSVLDE M+W+S E KLDLHGMHL +Y Sbjct: 312 QDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAY 371 Query: 1398 LIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLK 1219 I LQW+D MR RF++ + ++P EITVVCG GKHS VRG+S VK+++K+M++R P++ Sbjct: 372 FIILQWMDEMRNRFNN-EKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKMMVRTSSPMR 430 Query: 1218 IDRKNVGCFIAKGKVFRDWLC 1156 + R N+GCFIAKG V +DWLC Sbjct: 431 VHRNNIGCFIAKGHVVKDWLC 451 >gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica] Length = 447 Score = 431 bits (1107), Expect = e-117 Identities = 218/421 (51%), Positives = 292/421 (69%) Frame = -1 Query: 2418 ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 2239 A++KQG RF RD+ T + LI KF+ SS K +AL+ Sbjct: 33 AVTKQGQRFLTKLAAN----ARDAKVTNK-LIAKFLTSSTKSIALNTLSYLLSPDTTLPH 87 Query: 2238 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 2059 S++A+P Y ITEASWF WN KLVA ++A++ K Q +EAE LI ET+ K+G +ER + Sbjct: 88 LSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELA 147 Query: 2058 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAE 1879 F+C L+ES +K K Y+Y+ + SSS YVK RA+ESMV LC++ +PREA+ Sbjct: 148 LFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGLCEMDRPREAD 207 Query: 1878 DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 1699 +L+EEMR GLK S FE R++VY YG++GL EDM + V +++NQG +DT+C+NMVLSS Sbjct: 208 NLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSY 267 Query: 1698 GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 1519 G H EL+ M+ WL++MKSL + FS+RTYNSVLNSC TI+ MLQ+ K P S+E L L Sbjct: 268 GAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLN 327 Query: 1518 KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 1339 DE L+V EL+ S+VLDEVM W E KLDLHGMHL +YLI L+W + MR RF+SG Sbjct: 328 GDEALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKD- 386 Query: 1338 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 1159 ++P E+ V+CG GKHS+VRG+SPVK L+K+M+LRM+ P++IDRKNVGCF+AKG+ +DWL Sbjct: 387 VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAVKDWL 446 Query: 1158 C 1156 C Sbjct: 447 C 447 >gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis] Length = 517 Score = 418 bits (1075), Expect = e-114 Identities = 219/435 (50%), Positives = 291/435 (66%), Gaps = 1/435 (0%) Frame = -1 Query: 2457 SPCRRPGLSLRLR-ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALD 2281 SP R S ++ AL+KQGHRF +++ LI KFV SSPK ++L+ Sbjct: 89 SPTRSAAASSSIQCALTKQGHRFLSTLSINAG-----NASAANKLIGKFVASSPKSISLN 143 Query: 2280 XXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLIL 2101 ++ ++ LY I EASWF ++ KLVA + A++ K ++ EAE LI Sbjct: 144 ALSHLLSPDTTHTHLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIA 203 Query: 2100 ETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESM 1921 E + K+G ++R + FYC+L+ES +K K Y Y+ + SSS YVK RA+E+M Sbjct: 204 EAVSKLGHRQRELAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETM 263 Query: 1920 VRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGF 1741 V +LC + +P EAE LMEEMR GLK S FE R+LVY YG++GL EDM R V +++ +G Sbjct: 264 VGALCTMDRPCEAESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGL 323 Query: 1740 ELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIK 1561 +DT+C+NMVLSS G H EL +MV WLQ+M++ I FS+RTYNSVLN CPTI MLQD+K Sbjct: 324 VIDTICSNMVLSSYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLK 383 Query: 1560 SVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQW 1381 +P+SM L L DE L+V EL+GSSVL+EV+ W+S E+KLDLHGMHL +YLI L+W Sbjct: 384 DIPLSMYELNATLRGDEGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEW 443 Query: 1380 IDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNV 1201 ++ M RF+ GN +P E+ VVCG GKHS VRG SPVK L+KEM+++MK P+KIDRKN Sbjct: 444 MEEMTRRFNDGNHG-IPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNA 502 Query: 1200 GCFIAKGKVFRDWLC 1156 GCF+AKGK RDWLC Sbjct: 503 GCFLAKGKTVRDWLC 517 >ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Fragaria vesca subsp. vesca] Length = 448 Score = 417 bits (1071), Expect = e-113 Identities = 215/437 (49%), Positives = 294/437 (67%), Gaps = 1/437 (0%) Frame = -1 Query: 2463 RCSPCRRPGLSLRLR-ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVA 2287 R P + LSL+++ AL+KQG RF + + LI KF+++SPK A Sbjct: 18 RHDPPQHSKLSLQIQCALTKQGQRFLTKLAANAG-----NPSVANKLISKFLSTSPKSTA 72 Query: 2286 LDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETL 2107 L S++A+P+Y ITEASWF WN KLVA ++A++ K Q ++E L Sbjct: 73 LTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEAL 132 Query: 2106 ILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYE 1927 I ET+ K+G +ER + F+C L+ES +K K Y+ + SSS YVK+RA+E Sbjct: 133 ISETISKLGNKERELVQFHCQLVESHSKMSSKCGFDRACTYLHQLLQNSSSVYVKRRAFE 192 Query: 1926 SMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQ 1747 SMV LC + +P EA++L+EEMR GLK S FE R++VY YG++G+ E+M + V +++ Q Sbjct: 193 SMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQ 252 Query: 1746 GFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQD 1567 GF DT+C NMVLSS G H EL+ M +WL++MK + FSVRTYNSVLNSCPTI+ MLQ+ Sbjct: 253 GFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQE 312 Query: 1566 IKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFL 1387 K+VP S+ L L DE LVV EL+GS+V+DE M W+S+E KLDLHGMHL +YL+ L Sbjct: 313 PKAVPCSVGELSGVLDGDEALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVML 372 Query: 1386 QWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRK 1207 +W + M RF S + +VP E+ +VCGLGKHS+VRG+SPVK L+KEM+ +M+ P++IDRK Sbjct: 373 EWFEAMGNRFKSA-ECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRK 431 Query: 1206 NVGCFIAKGKVFRDWLC 1156 NVGCFIAKG+ +DWLC Sbjct: 432 NVGCFIAKGRAVKDWLC 448 >ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223539507|gb|EEF41095.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 460 Score = 409 bits (1050), Expect = e-111 Identities = 208/424 (49%), Positives = 287/424 (67%) Frame = -1 Query: 2427 RLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXX 2248 R ALSKQG RF D+ AT R LI+KFV +SPK +ALD Sbjct: 41 RCAALSKQGQRFLSSLAIATTKG--DTVATNR-LIKKFVAASPKSIALDALSHLLNPHSS 97 Query: 2247 XXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 2068 S++A LYL I EA WF WN KLVADV+A + K ++DE+ TL+ +++ K+ ++ER Sbjct: 98 HSHLSSLAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKER 157 Query: 2067 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPR 1888 ++ FYCNL+ES +K + + + S+S YVK++ Y+SMV LC++G+PR Sbjct: 158 DLARFYCNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMGRPR 217 Query: 1887 EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 1708 EAE L+EEM + G++ S FE + +VYAYG +G E+M + + +++ GF +DTVC+NM+L Sbjct: 218 EAETLIEEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMIL 277 Query: 1707 SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLK 1528 +S G H L EMV WLQ+MK L I FS+RT NS LNSCPTI+ M+Q+ PIS+ L+K Sbjct: 278 ASYGAHNALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMK 337 Query: 1527 NLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSG 1348 L++DE L+V E++ SSVLDE M+W+ +E KLDLHG HL +YLI L WI+ MR RF S Sbjct: 338 ILSEDEALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSV 397 Query: 1347 NQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFR 1168 N + PTEITVVCG G HS VRG+SPVK ++K+ ++R + P++IDR+N+GCFIAKGKV Sbjct: 398 N-YVNPTEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKGKVVE 456 Query: 1167 DWLC 1156 +WLC Sbjct: 457 EWLC 460 >ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus] Length = 1296 Score = 408 bits (1048), Expect = e-111 Identities = 214/436 (49%), Positives = 291/436 (66%), Gaps = 5/436 (1%) Frame = -1 Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263 P L ++ +L+KQ HRF D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTSLTKQTHRFLSTLSTTAATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152 Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723 + +P EAE+L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552 +NMVLSS G H +L +MV WLQRMK S SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332 Query: 1551 ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378 + +E L+ L D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198 MR F ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 1197 CFIAKGKVFRDWLCCL 1150 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus] Length = 1913 Score = 408 bits (1048), Expect = e-111 Identities = 214/436 (49%), Positives = 291/436 (66%), Gaps = 5/436 (1%) Frame = -1 Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263 P L ++ +L+KQ HRF D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTSLTKQTHRFLSTLSTTAATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152 Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723 + +P EAE+L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552 +NMVLSS G H +L +MV WLQRMK S SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332 Query: 1551 ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378 + +E L+ L D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198 MR F ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 1197 CFIAKGKVFRDWLCCL 1150 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo] Length = 488 Score = 407 bits (1045), Expect = e-110 Identities = 212/436 (48%), Positives = 290/436 (66%), Gaps = 5/436 (1%) Frame = -1 Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263 P L ++ L+KQ HRF D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTTLTKQTHRFLSTLSTTGATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152 Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723 + +P EAE L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552 +NMVLSS G H +L +M+ WLQRMK S + SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMLLWLQRMKTSSHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332 Query: 1551 ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378 + +E L+ L DE +LV L+GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198 MR F + ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFEDESN-VIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 1197 CFIAKGKVFRDWLCCL 1150 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >gb|AGH33847.1| PPR [Cucumis melo] Length = 488 Score = 406 bits (1044), Expect = e-110 Identities = 212/436 (48%), Positives = 289/436 (66%), Gaps = 5/436 (1%) Frame = -1 Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263 P L ++ L+KQ HRF D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTTLTKQTHRFLSTLSTTAATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152 Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723 + +P EAE L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552 +NMVLSS G H +L +M+ WLQRMK S + SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMLLWLQRMKTSPHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332 Query: 1551 ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378 + +E L+ L DE +LV L+GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198 MR F ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 1197 CFIAKGKVFRDWLCCL 1150 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] gi|482566151|gb|EOA30340.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] Length = 516 Score = 400 bits (1028), Expect = e-108 Identities = 207/420 (49%), Positives = 283/420 (67%) Frame = -1 Query: 2415 LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 2236 L KQGH+F D AT R LI+KFV +SPK VAL+ Sbjct: 99 LMKQGHQFLSSLSSPALAG--DPPATNR-LIKKFVAASPKSVALNVLSHLLSDNTSHPHL 155 Query: 2235 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 2056 S A LYL ITEASWF+WN KL+ ++++++ K E+F E+ETL+ + ++ ER+ Sbjct: 156 SYFAPQLYLEITEASWFDWNPKLIGELVSLLNKQERFVESETLLSTAVSRLESNERDFAL 215 Query: 2055 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAED 1876 F CNL+ES++K + SD + ++ I SSS YVK +AY+SMV LC++ QP +AE Sbjct: 216 FLCNLVESNSKQGSIQGFSDACSRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAER 275 Query: 1875 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 1696 ++EEMR +K FE ++++Y YG++GL +DM R V ++ QG ++DTVC+NMVLSS G Sbjct: 276 VIEEMRMETIKPGLFEYKSVLYGYGRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYG 335 Query: 1695 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 1516 H L +M SWLQ++K + S+RTYNSVLNSCPTI+ +L+D+ S P+S+ LL L + Sbjct: 336 AHDALPQMGSWLQKLKGYNVPLSIRTYNSVLNSCPTIISLLKDLDSCPLSLSELLPILNE 395 Query: 1515 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 1336 DE L+V EL S VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D R RFS + + Sbjct: 396 DEALLVRELTQSLVLDEAIEWNAVEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKKCV 455 Query: 1335 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156 VP EI VV G GKHS VRG+SPVK+++K++++R K P++IDRKNVG FIAKGK ++WLC Sbjct: 456 VPAEIVVVSGSGKHSNVRGESPVKAMVKKIMVRTKSPMRIDRKNVGSFIAKGKNVKEWLC 515 >ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] gi|557110519|gb|ESQ50810.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] Length = 469 Score = 395 bits (1015), Expect = e-107 Identities = 207/434 (47%), Positives = 290/434 (66%), Gaps = 4/434 (0%) Frame = -1 Query: 2445 RPGLSLRLRA----LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDX 2278 R + +R +A L KQGHRF D SAT R I+KFV +SPK V+L+ Sbjct: 39 RTSMEVRCKAGTVPLMKQGHRFLSSLSSPALAG--DPSATNRH-IKKFVAASPKSVSLNV 95 Query: 2277 XXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILE 2098 S A+ LY ITEASWF+WN KL+A+++A++ K E+ E+ETL+ Sbjct: 96 LSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLSN 155 Query: 2097 TMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMV 1918 + ++ ER++ FYCNL+ES++K + ++ ++ I S+S YVK +AY+SMV Sbjct: 156 AVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLREITRRSTSVYVKTQAYKSMV 215 Query: 1917 RSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFE 1738 LC++ QP +AE ++EEMR +K FE ++++Y YG++GL EDM R V ++ +G + Sbjct: 216 SGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEGHK 275 Query: 1737 LDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKS 1558 +DTVC+NMVLSS G H L +M SWLQ++K + S RTYNSVLNSCPTI+ +L+D+ S Sbjct: 276 IDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDLDS 335 Query: 1557 VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378 P+S+ LL L KDE ++V L SSVLDE +EW+S E KLDLHGMHLS SYLI +QW+ Sbjct: 336 CPVSLSELLTFLNKDEEVLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQWM 395 Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198 D MR RFS G + +VP EI +V G GKHS VRG+SPVK+L+K++++R P++IDRKN+G Sbjct: 396 DEMRIRFSEG-KCVVPAEIVLVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNIG 454 Query: 1197 CFIAKGKVFRDWLC 1156 FIAKGK ++WLC Sbjct: 455 SFIAKGKTVKEWLC 468 >ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 505 Score = 393 bits (1010), Expect = e-106 Identities = 202/399 (50%), Positives = 275/399 (68%) Frame = -1 Query: 2352 DSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNA 2173 D SA R I+KFV +SPK VAL+ S A+ LY ITEASWF+WN Sbjct: 108 DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNP 166 Query: 2172 KLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDL 1993 KL+A++IA++ K E+FDE+ETL+ + ++ ER+ F CNL+ES++K + S+ Sbjct: 167 KLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEA 226 Query: 1992 YNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALV 1813 ++ I SSS YVK +AY+SMV LC++ QP +AE ++EEMR +K FE ++++ Sbjct: 227 SFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVL 286 Query: 1812 YAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQ 1633 Y YG++GL +DM R V + +G ++DTVC+NMVLSS G H L +M SWLQ++K + Sbjct: 287 YGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVP 346 Query: 1632 FSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEW 1453 FS+RTYNSVLNSCPTI+ ML+D+ S P+S+ L L +DE L+V EL SSVLDE +EW Sbjct: 347 FSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEW 406 Query: 1452 NSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQS 1273 N+ E KLDLHGMHLS SYLI LQW+D R RFS + ++P EI VV G GKHS VRG+S Sbjct: 407 NAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCVIPAEIVVVSGSGKHSNVRGES 465 Query: 1272 PVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156 PVK+L+K++++R P++IDRKNVG FIAKGK ++WLC Sbjct: 466 PVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504 >dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] Length = 501 Score = 393 bits (1010), Expect = e-106 Identities = 202/399 (50%), Positives = 275/399 (68%) Frame = -1 Query: 2352 DSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNA 2173 D SA R I+KFV +SPK VAL+ S A+ LY ITEASWF+WN Sbjct: 104 DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNP 162 Query: 2172 KLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDL 1993 KL+A++IA++ K E+FDE+ETL+ + ++ ER+ F CNL+ES++K + S+ Sbjct: 163 KLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEA 222 Query: 1992 YNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALV 1813 ++ I SSS YVK +AY+SMV LC++ QP +AE ++EEMR +K FE ++++ Sbjct: 223 SFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVL 282 Query: 1812 YAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQ 1633 Y YG++GL +DM R V + +G ++DTVC+NMVLSS G H L +M SWLQ++K + Sbjct: 283 YGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVP 342 Query: 1632 FSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEW 1453 FS+RTYNSVLNSCPTI+ ML+D+ S P+S+ L L +DE L+V EL SSVLDE +EW Sbjct: 343 FSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEW 402 Query: 1452 NSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQS 1273 N+ E KLDLHGMHLS SYLI LQW+D R RFS + ++P EI VV G GKHS VRG+S Sbjct: 403 NAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCVIPAEIVVVSGSGKHSNVRGES 461 Query: 1272 PVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156 PVK+L+K++++R P++IDRKNVG FIAKGK ++WLC Sbjct: 462 PVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500 >ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown protein [Arabidopsis thaliana] gi|330251481|gb|AEC06575.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 393 bits (1010), Expect = e-106 Identities = 202/399 (50%), Positives = 275/399 (68%) Frame = -1 Query: 2352 DSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNA 2173 D SA R I+KFV +SPK VAL+ S A+ LY ITEASWF+WN Sbjct: 107 DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNP 165 Query: 2172 KLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDL 1993 KL+A++IA++ K E+FDE+ETL+ + ++ ER+ F CNL+ES++K + S+ Sbjct: 166 KLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEA 225 Query: 1992 YNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALV 1813 ++ I SSS YVK +AY+SMV LC++ QP +AE ++EEMR +K FE ++++ Sbjct: 226 SFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVL 285 Query: 1812 YAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQ 1633 Y YG++GL +DM R V + +G ++DTVC+NMVLSS G H L +M SWLQ++K + Sbjct: 286 YGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVP 345 Query: 1632 FSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEW 1453 FS+RTYNSVLNSCPTI+ ML+D+ S P+S+ L L +DE L+V EL SSVLDE +EW Sbjct: 346 FSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEW 405 Query: 1452 NSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQS 1273 N+ E KLDLHGMHLS SYLI LQW+D R RFS + ++P EI VV G GKHS VRG+S Sbjct: 406 NAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCVIPAEIVVVSGSGKHSNVRGES 464 Query: 1272 PVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156 PVK+L+K++++R P++IDRKNVG FIAKGK ++WLC Sbjct: 465 PVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503 >ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa] gi|550331693|gb|EEE86893.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa] Length = 473 Score = 389 bits (1000), Expect = e-105 Identities = 201/425 (47%), Positives = 287/425 (67%), Gaps = 2/425 (0%) Frame = -1 Query: 2424 LRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXX 2245 L A+SKQ RF D+SAT R LI+KFV SSPK +ALD Sbjct: 53 LAAISKQAQRFFSAVLPTVA--TSDTSATNR-LIKKFVASSPKSIALDALSNLLSPDSTH 109 Query: 2244 XXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 2068 + +PLYL I+EASWF+WN KLVA V+ ++ K E + L+ ET+ ++ +ER Sbjct: 110 HPLLYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKER 169 Query: 2067 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPR 1888 + FYCNLI ++KH D Y+ + + S+S YVKK+ Y++M+ LC++G+ R Sbjct: 170 ELVLFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSDSNSVYVKKQGYKAMISGLCEMGRAR 229 Query: 1887 EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 1708 EAEDL+ EMRE GLK FE R ++Y YG++GL +DM+R + ++++ E+DTVCANMVL Sbjct: 230 EAEDLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVL 289 Query: 1707 SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDI-KSVPISMEHLL 1531 +S G H L EM WL++MK+L I S+RT NSVLNSCPTI+ +++++ S P+S++ LL Sbjct: 290 ASYGAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELL 349 Query: 1530 KNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSS 1351 K L+++E ++V EL+ SSVL E +W++SE KLDLHGMHL +Y+I LQW++ R R S Sbjct: 350 KILSEEEAMLVKELIESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSD 409 Query: 1350 GNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVF 1171 G + ++P EITVVCG G HS VRG+SPVKS++ E++ + + P++IDRKN+GCF+AKG V Sbjct: 410 G-EHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCFVAKGNVV 468 Query: 1170 RDWLC 1156 + WLC Sbjct: 469 KKWLC 473 >ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297329872|gb|EFH60291.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 504 Score = 388 bits (997), Expect = e-105 Identities = 202/420 (48%), Positives = 281/420 (66%) Frame = -1 Query: 2415 LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 2236 L KQG RF D SAT R I+KFV +SPK V L+ Sbjct: 88 LMKQGDRFLSSLSSPALAG--DPSATHRH-IKKFVAASPKSVTLNVLSHLLSDQTSYPHL 144 Query: 2235 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 2056 S A+ LY ITEASWF+WN KL+A+++AV+ E+FDE+ETL+ + ++ ER+ Sbjct: 145 SFFALSLYSEITEASWFDWNPKLIAELVAVLNNQERFDESETLLSTAVSRLKSNERDFAL 204 Query: 2055 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAED 1876 F CNL+ES++K + ++ ++ SSS YVK +AY+SMV LC++ QP +AE Sbjct: 205 FLCNLVESNSKQGSIQGFNEACFRLRERIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAER 264 Query: 1875 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 1696 ++EEMR +K FE ++++Y YG++GL +DM R V ++ +G ++DTVC+NMVLSS G Sbjct: 265 VIEEMRVEKIKPGSFEHKSVLYGYGRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYG 324 Query: 1695 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 1516 H L +M SWLQ++K + FS+RTYNSVLNSCPTI+ +L+D+ S P+S+ L L + Sbjct: 325 AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNE 384 Query: 1515 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 1336 DE L+V EL S+VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D +R RF + + Sbjct: 385 DEALLVLELTQSTVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDEIRLRFRD-QKCV 443 Query: 1335 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156 +P EI VV G GKHS VRG+SPVK+L+K++++R + P++IDRKNVG FIAKGK ++WLC Sbjct: 444 IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTESPMRIDRKNVGSFIAKGKNVKEWLC 503