BLASTX nr result
ID: Catharanthus23_contig00006008
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00006008 (1615 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi... 470 e-130 ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi... 468 e-129 ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi... 456 e-125 gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein... 454 e-125 ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr... 441 e-121 gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus pe... 438 e-120 ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi... 426 e-116 gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis] 426 e-116 ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223... 421 e-115 ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204... 421 e-115 gb|AGH33847.1| PPR [Cucumis melo] 420 e-114 gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu... 420 e-114 ref|XP_002521239.1| pentatricopeptide repeat-containing protein,... 419 e-114 ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps... 410 e-111 ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr... 405 e-110 ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar... 403 e-109 dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] 403 e-109 ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar... 403 e-109 ref|XP_002884032.1| pentatricopeptide repeat-containing protein ... 398 e-108 ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu... 395 e-107 >ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Solanum tuberosum] Length = 459 Score = 470 bits (1210), Expect = e-130 Identities = 242/437 (55%), Positives = 322/437 (73%), Gaps = 1/437 (0%) Frame = -3 Query: 1430 RRCSPCRRPGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVA 1251 RR PC R +LSKQGHRFL++L D SAT R L+RKFV SS KHVA Sbjct: 23 RRPRPCPRC-------SLSKQGHRFLSTLIAADSE---DISAT-RHLLRKFVASSSKHVA 71 Query: 1250 LDXXXXXXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAET 1074 L ++A+PLYL I+EASWF+WN+KLVAD++A++YK E+FDEAET Sbjct: 72 LSTLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAET 131 Query: 1073 LILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAY 894 L+ ET+ K+G +ER++C+FY LI S +KH + V D +K + SSS Y+K+R Y Sbjct: 132 LVTETVSKLGSRERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGY 191 Query: 893 ESMVRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQN 714 SMV C IG P++AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E+++ Sbjct: 192 ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMES 251 Query: 713 QGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQ 534 GF+LDTV +NMVL+S G+H ELSE+VS LQ++++ + FS+RTYNSVLNSCPTI L+LQ Sbjct: 252 MGFQLDTVSSNMVLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQ 311 Query: 533 DIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIF 354 D+KSVP+S+E L+ NL ++E ++V L+GSSVL+E M+W SE+KLDLHGMHL+ +Y+I Sbjct: 312 DLKSVPLSLEELMGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVII 371 Query: 353 LQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDR 174 LQW ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDR Sbjct: 372 LQWFHQLQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDR 430 Query: 173 KNVGCFIAKGKVFRDWL 123 KN+GCFIAKGK F +WL Sbjct: 431 KNIGCFIAKGKSFMEWL 447 >ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Solanum lycopersicum] Length = 459 Score = 468 bits (1204), Expect = e-129 Identities = 242/431 (56%), Positives = 321/431 (74%), Gaps = 1/431 (0%) Frame = -3 Query: 1412 RRPGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXX 1233 RRP R +LSKQGHRFL++L T D SAT R L+RKFV SS KHVAL Sbjct: 23 RRPRPGPRC-SLSKQGHRFLSTLIATDSD---DISAT-RHLLRKFVGSSSKHVALSTLSH 77 Query: 1232 XXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETM 1056 ++A+PLYL I+EASWF+WN+KLVA+++A++YK E+FDEAETL+ E++ Sbjct: 78 LVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETLVTESV 137 Query: 1055 KKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRS 876 K+G +ER++C+FY LI S +KH + V D +K + SSS Y+K+R Y SMV Sbjct: 138 SKLGSRERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLLHSSSVYLKQRGYASMVEG 197 Query: 875 LCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELD 696 C IG P++AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E++ GF+LD Sbjct: 198 FCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMERMGFQLD 257 Query: 695 TVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVP 516 TV +NMVL+S G+H ELSE+VS LQ++++ + FS+RTYNSVLNSCPTI L+LQD+KSVP Sbjct: 258 TVGSNMVLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQDLKSVP 317 Query: 515 ISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDV 336 +S+E L+ NL ++E ++V L+GSSVL+E M+W E+KLDLHGMHL+ +YLI LQW Sbjct: 318 LSLEELMGNLDENEAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLIILQWFHQ 377 Query: 335 MRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCF 156 ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDRKNVGCF Sbjct: 378 LQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKNVGCF 436 Query: 155 IAKGKVFRDWL 123 IAKGKVF +WL Sbjct: 437 IAKGKVFMEWL 447 >ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed protein product [Vitis vinifera] Length = 435 Score = 456 bits (1173), Expect = e-125 Identities = 231/421 (54%), Positives = 314/421 (74%) Frame = -3 Query: 1382 ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 1203 ALSKQG FL+S+A RD SA+ R LI KF+ SS K +AL+ Sbjct: 24 ALSKQGQLFLSSVA-------RDPSASNR-LICKFIASSSKSIALNALSHLLSPTTTHPY 75 Query: 1202 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 1023 S++A+PLY I+EASWF+WN KL+ADVIA++YK Q EAETL+ ET+ K+G +ER++ Sbjct: 76 LSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLV 135 Query: 1022 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAE 843 +FYCNLI+S +KH + V D+ + + I + SSS YVK+RAY+SM+ SLC +G P EAE Sbjct: 136 SFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAE 195 Query: 842 DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 663 +L+EEMR GLK S FE R++VY YG++GL EDM+R ++++ N+GFELDTV +NMVLSS Sbjct: 196 NLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSY 255 Query: 662 GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 483 G + + SEMVSWLQRMK+ I FS+RTYNSVLNSCP I+ +LQD+K+ P +++ L++ L Sbjct: 256 GAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLK 315 Query: 482 KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 303 DE L+V EL+GS VL E+MEW+ SE KLDLHGMHL +YLI LQW + +R+R ++ + Sbjct: 316 GDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAA-EY 374 Query: 302 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 123 ++P EITVVCG GKHS+VRG+SPVK +++EM+ R + P+KIDRKN+GCF+AK KV ++WL Sbjct: 375 VMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKNWL 434 Query: 122 C 120 C Sbjct: 435 C 435 >gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative [Theobroma cacao] Length = 456 Score = 454 bits (1169), Expect = e-125 Identities = 226/420 (53%), Positives = 308/420 (73%), Gaps = 1/420 (0%) Frame = -3 Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200 L+KQGHRF +SLA T A + AT LI+KFV SSPK +AL+ Sbjct: 34 LTKQGHRFFSSLAAT---ADVNDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHL 90 Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020 SA+A PLY I+E SW+NWN KLVA++IA++ K ++DE+E LI + + K+ +ER++ Sbjct: 91 SALAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQ 150 Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840 FYCN IES +KH KE +D Y Y+ + SSS YVK++ Y+SMV SLC++ +P EAE+ Sbjct: 151 FYCNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAEN 210 Query: 839 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660 L+EEMR+ GL + FE R + Y YG++GL EDM+R V E++ +GFE+DT+C+NMVLSS G Sbjct: 211 LVEEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYG 270 Query: 659 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480 + S+MV WLQ+MK+L+I FS+RTYNSVLNSCP I+ ++Q + SVP+S+ L K L + Sbjct: 271 AYNAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILNE 330 Query: 479 DEVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 303 DE L+V EL+ SSVLDE MEWN SE KLDLHGMHL +YLI LQWI+ M+ RF + Sbjct: 331 DEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKV-EEC 389 Query: 302 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 123 ++P +IT+VCG GKHS+VRG+SPVK+LM++M+++MK P+KIDRKN+GCFIAKG+V ++WL Sbjct: 390 VIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449 >ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] gi|568866680|ref|XP_006486677.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Citrus sinensis] gi|557524456|gb|ESR35762.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] Length = 451 Score = 441 bits (1134), Expect = e-121 Identities = 232/441 (52%), Positives = 305/441 (69%), Gaps = 5/441 (1%) Frame = -3 Query: 1427 RCSPCRRPGLSL---RLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKH 1257 RC R+ L+L L+KQG RFL+SLA + RDS A R LI KFV SSP+ Sbjct: 16 RCCRLRQQRLTLVQCLTARLTKQGQRFLSSLAL---AVTRDSKAASR-LISKFVASSPQF 71 Query: 1256 VALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAE 1077 +AL+ S++A PLY+ ITE SWF WN KLVA++IA + K Q +EAE Sbjct: 72 IALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAE 131 Query: 1076 TLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRA 897 TLILET+ K+G +ER + FYCNLI+S KH K D Y + + SSS YVK++A Sbjct: 132 TLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVNSSSSVYVKRQA 191 Query: 896 YESMVRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQ 717 +SM+ LC++GQP EAE+L+EEMR GL+ S FE + ++Y YG++GL+EDM+R V +++ Sbjct: 192 LKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQME 251 Query: 716 NQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLML 537 + G +DTVC+NMVLSS G H ELS MV WLQ+MK I FSVRTYNSVLNSC TI+ ML Sbjct: 252 SDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSML 311 Query: 536 QDIKS--VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSY 363 QD+ S P+S+ L + L ++EV VV EL SSVLDE M+W+S E KLDLHGMHL +Y Sbjct: 312 QDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAY 371 Query: 362 LIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLK 183 I LQW+D MR RF++ + ++P EITVVCG GKHS VRG+S VK+++K+M++R P++ Sbjct: 372 FIILQWMDEMRNRFNN-EKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKMMVRTSSPMR 430 Query: 182 IDRKNVGCFIAKGKVFRDWLC 120 + R N+GCFIAKG V +DWLC Sbjct: 431 VHRNNIGCFIAKGHVVKDWLC 451 >gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica] Length = 447 Score = 438 bits (1127), Expect = e-120 Identities = 221/421 (52%), Positives = 296/421 (70%) Frame = -3 Query: 1382 ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 1203 A++KQG RFLT LA RD+ T + LI KF+ SS K +AL+ Sbjct: 33 AVTKQGQRFLTKLAANA----RDAKVTNK-LIAKFLTSSTKSIALNTLSYLLSPDTTLPH 87 Query: 1202 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 1023 S++A+P Y ITEASWF WN KLVA ++A++ K Q +EAE LI ET+ K+G +ER + Sbjct: 88 LSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELA 147 Query: 1022 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAE 843 F+C L+ES +K K Y+Y+ + SSS YVK RA+ESMV LC++ +P+EA+ Sbjct: 148 LFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGLCEMDRPREAD 207 Query: 842 DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 663 +L+EEMR GLK S FE R++VY YG++GL EDM + V +++NQG +DT+C+NMVLSS Sbjct: 208 NLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSY 267 Query: 662 GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 483 G H EL+ M+ WL++MKSL + FS+RTYNSVLNSC TI+ MLQ+ K P S+E L L Sbjct: 268 GAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLN 327 Query: 482 KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 303 DE L+V EL+ S+VLDEVM W E KLDLHGMHL +YLI L+W + MR RF+SG Sbjct: 328 GDEALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKD- 386 Query: 302 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 123 ++P E+ V+CG GKHS+VRG+SPVK L+K+M+LRM+ P++IDRKNVGCF+AKG+ +DWL Sbjct: 387 VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAVKDWL 446 Query: 122 C 120 C Sbjct: 447 C 447 >ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Fragaria vesca subsp. vesca] Length = 448 Score = 426 bits (1096), Expect = e-116 Identities = 219/437 (50%), Positives = 298/437 (68%), Gaps = 1/437 (0%) Frame = -3 Query: 1427 RCSPCRRPGLSLRLR-ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVA 1251 R P + LSL+++ AL+KQG RFLT LA + + LI KF+++SPK A Sbjct: 18 RHDPPQHSKLSLQIQCALTKQGQRFLTKLAANA-----GNPSVANKLISKFLSTSPKSTA 72 Query: 1250 LDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETL 1071 L S++A+P+Y ITEASWF WN KLVA ++A++ K Q ++E L Sbjct: 73 LTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEAL 132 Query: 1070 ILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYE 891 I ET+ K+G +ER + F+C L+ES +K K Y+ + SSS YVK+RA+E Sbjct: 133 ISETISKLGNKERELVQFHCQLVESHSKMSSKCGFDRACTYLHQLLQNSSSVYVKRRAFE 192 Query: 890 SMVRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQ 711 SMV LC + +P EA++L+EEMR GLK S FE R++VY YG++G+ E+M + V +++ Q Sbjct: 193 SMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQ 252 Query: 710 GFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQD 531 GF DT+C NMVLSS G H EL+ M +WL++MK + FSVRTYNSVLNSCPTI+ MLQ+ Sbjct: 253 GFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQE 312 Query: 530 IKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFL 351 K+VP S+ L L DE LVV EL+GS+V+DE M W+S+E KLDLHGMHL +YL+ L Sbjct: 313 PKAVPCSVGELSGVLDGDEALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVML 372 Query: 350 QWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRK 171 +W + M RF S + +VP E+ +VCGLGKHS+VRG+SPVK L+KEM+ +M+ P++IDRK Sbjct: 373 EWFEAMGNRFKSA-ECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRK 431 Query: 170 NVGCFIAKGKVFRDWLC 120 NVGCFIAKG+ +DWLC Sbjct: 432 NVGCFIAKGRAVKDWLC 448 >gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis] Length = 517 Score = 426 bits (1094), Expect = e-116 Identities = 222/435 (51%), Positives = 296/435 (68%), Gaps = 1/435 (0%) Frame = -3 Query: 1421 SPCRRPGLSLRLR-ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALD 1245 SP R S ++ AL+KQGHRFL++L+ +A + LI KFV SSPK ++L+ Sbjct: 89 SPTRSAAASSSIQCALTKQGHRFLSTLSINAGNA-----SAANKLIGKFVASSPKSISLN 143 Query: 1244 XXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLIL 1065 ++ ++ LY I EASWF ++ KLVA + A++ K ++ EAE LI Sbjct: 144 ALSHLLSPDTTHTHLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIA 203 Query: 1064 ETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESM 885 E + K+G ++R + FYC+L+ES +K K Y Y+ + SSS YVK RA+E+M Sbjct: 204 EAVSKLGHRQRELAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETM 263 Query: 884 VRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGF 705 V +LC + +P EAE LMEEMR GLK S FE R+LVY YG++GL EDM R V +++ +G Sbjct: 264 VGALCTMDRPCEAESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGL 323 Query: 704 ELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIK 525 +DT+C+NMVLSS G H EL +MV WLQ+M++ I FS+RTYNSVLN CPTI MLQD+K Sbjct: 324 VIDTICSNMVLSSYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLK 383 Query: 524 SVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQW 345 +P+SM L L DE L+V EL+GSSVL+EV+ W+S E+KLDLHGMHL +YLI L+W Sbjct: 384 DIPLSMYELNATLRGDEGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEW 443 Query: 344 IDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNV 165 ++ M RF+ GN +P E+ VVCG GKHS VRG SPVK L+KEM+++MK P+KIDRKN Sbjct: 444 MEEMTRRFNDGNHG-IPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNA 502 Query: 164 GCFIAKGKVFRDWLC 120 GCF+AKGK RDWLC Sbjct: 503 GCFLAKGKTVRDWLC 517 >ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus] Length = 1296 Score = 421 bits (1083), Expect = e-115 Identities = 219/436 (50%), Positives = 300/436 (68%), Gaps = 5/436 (1%) Frame = -3 Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227 P L ++ +L+KQ HRFL++L+TT +A D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTSLTKQTHRFLSTLSTT--AATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152 Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 866 IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687 + +P EAE+L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 686 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516 +NMVLSS G H +L +MV WLQRMK S SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332 Query: 515 ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342 + +E L+ L D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 341 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162 MR F ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 161 CFIAKGKVFRDWLCCL 114 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus] Length = 1913 Score = 421 bits (1083), Expect = e-115 Identities = 219/436 (50%), Positives = 300/436 (68%), Gaps = 5/436 (1%) Frame = -3 Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227 P L ++ +L+KQ HRFL++L+TT +A D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTSLTKQTHRFLSTLSTT--AATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152 Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 866 IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687 + +P EAE+L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 686 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516 +NMVLSS G H +L +MV WLQRMK S SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332 Query: 515 ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342 + +E L+ L D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 341 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162 MR F ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 161 CFIAKGKVFRDWLCCL 114 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >gb|AGH33847.1| PPR [Cucumis melo] Length = 488 Score = 420 bits (1079), Expect = e-114 Identities = 217/436 (49%), Positives = 298/436 (68%), Gaps = 5/436 (1%) Frame = -3 Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227 P L ++ L+KQ HRFL++L+TT +A D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTTLTKQTHRFLSTLSTT--AATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152 Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 866 IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687 + +P EAE L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 686 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516 +NMVLSS G H +L +M+ WLQRMK S + SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMLLWLQRMKTSPHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332 Query: 515 ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342 + +E L+ L DE +LV L+GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 341 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162 MR F ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 161 CFIAKGKVFRDWLCCL 114 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo] Length = 488 Score = 420 bits (1079), Expect = e-114 Identities = 217/436 (49%), Positives = 298/436 (68%), Gaps = 5/436 (1%) Frame = -3 Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227 P L ++ L+KQ HRFL++L+TT A D SAT R LIRKFV SSPK + L Sbjct: 36 PNLQVKCTTLTKQTHRFLSTLSTT--GATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92 Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047 + A+ LY ITEASWF WN+KLVAD++A + ++ + E+E LI E + K+ Sbjct: 93 STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152 Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867 G QER + NFY L+ES +KH + D Y+ + + S S YVK+RAYESMV LC Sbjct: 153 GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212 Query: 866 IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687 + +P EAE L++EMR G+ + +E R+++YAYG +GL E+MKR++ +++N ELDTVC Sbjct: 213 MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272 Query: 686 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516 +NMVLSS G H +L +M+ WLQRMK S + SVRTYNSVLNSCP I MLQD KS +P Sbjct: 273 SNMVLSSYGAHNKLGDMLLWLQRMKTSSHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332 Query: 515 ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342 + +E L+ L DE +LV L+GSSVL+E+M W++ E+KLDLHG H+ +Y+I LQWI Sbjct: 333 VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392 Query: 341 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162 MR F + ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G Sbjct: 393 KEMRLNFEDESN-VIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451 Query: 161 CFIAKGKVFRDWLCCL 114 CFI+KGK ++WLC L Sbjct: 452 CFISKGKAVKNWLCSL 467 >ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223539507|gb|EEF41095.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 460 Score = 419 bits (1076), Expect = e-114 Identities = 212/424 (50%), Positives = 293/424 (69%) Frame = -3 Query: 1391 RLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXX 1212 R ALSKQG RFL+SLA T D+ AT R LI+KFV +SPK +ALD Sbjct: 41 RCAALSKQGQRFLSSLAIATTKG--DTVATNR-LIKKFVAASPKSIALDALSHLLNPHSS 97 Query: 1211 XXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 1032 S++A LYL I EA WF WN KLVADV+A + K ++DE+ TL+ +++ K+ ++ER Sbjct: 98 HSHLSSLAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKER 157 Query: 1031 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQ 852 ++ FYCNL+ES +K + + + S+S YVK++ Y+SMV LC++G+P+ Sbjct: 158 DLARFYCNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMGRPR 217 Query: 851 EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 672 EAE L+EEM + G++ S FE + +VYAYG +G E+M + + +++ GF +DTVC+NM+L Sbjct: 218 EAETLIEEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMIL 277 Query: 671 SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLK 492 +S G H L EMV WLQ+MK L I FS+RT NS LNSCPTI+ M+Q+ PIS+ L+K Sbjct: 278 ASYGAHNALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMK 337 Query: 491 NLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSG 312 L++DE L+V E++ SSVLDE M+W+ +E KLDLHG HL +YLI L WI+ MR RF S Sbjct: 338 ILSEDEALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSV 397 Query: 311 NQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFR 132 N + PTEITVVCG G HS VRG+SPVK ++K+ ++R + P++IDR+N+GCFIAKGKV Sbjct: 398 N-YVNPTEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKGKVVE 456 Query: 131 DWLC 120 +WLC Sbjct: 457 EWLC 460 >ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] gi|482566151|gb|EOA30340.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] Length = 516 Score = 410 bits (1053), Expect = e-111 Identities = 210/420 (50%), Positives = 290/420 (69%) Frame = -3 Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200 L KQGH+FL+SL++ + D AT R LI+KFV +SPK VAL+ Sbjct: 99 LMKQGHQFLSSLSSPALAG--DPPATNR-LIKKFVAASPKSVALNVLSHLLSDNTSHPHL 155 Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020 S A LYL ITEASWF+WN KL+ ++++++ K E+F E+ETL+ + ++ ER+ Sbjct: 156 SYFAPQLYLEITEASWFDWNPKLIGELVSLLNKQERFVESETLLSTAVSRLESNERDFAL 215 Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840 F CNL+ES++K + SD + ++ I SSS YVK +AY+SMV LC++ QP +AE Sbjct: 216 FLCNLVESNSKQGSIQGFSDACSRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAER 275 Query: 839 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660 ++EEMR +K FE ++++Y YG++GL +DM R V ++ QG ++DTVC+NMVLSS G Sbjct: 276 VIEEMRMETIKPGLFEYKSVLYGYGRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYG 335 Query: 659 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480 H L +M SWLQ++K + S+RTYNSVLNSCPTI+ +L+D+ S P+S+ LL L + Sbjct: 336 AHDALPQMGSWLQKLKGYNVPLSIRTYNSVLNSCPTIISLLKDLDSCPLSLSELLPILNE 395 Query: 479 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300 DE L+V EL S VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D R RFS + + Sbjct: 396 DEALLVRELTQSLVLDEAIEWNAVEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKKCV 455 Query: 299 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120 VP EI VV G GKHS VRG+SPVK+++K++++R K P++IDRKNVG FIAKGK ++WLC Sbjct: 456 VPAEIVVVSGSGKHSNVRGESPVKAMVKKIMVRTKSPMRIDRKNVGSFIAKGKNVKEWLC 515 >ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] gi|557110519|gb|ESQ50810.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] Length = 469 Score = 405 bits (1040), Expect = e-110 Identities = 210/434 (48%), Positives = 297/434 (68%), Gaps = 4/434 (0%) Frame = -3 Query: 1409 RPGLSLRLRA----LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDX 1242 R + +R +A L KQGHRFL+SL++ + D SAT R I+KFV +SPK V+L+ Sbjct: 39 RTSMEVRCKAGTVPLMKQGHRFLSSLSSPALAG--DPSATNRH-IKKFVAASPKSVSLNV 95 Query: 1241 XXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILE 1062 S A+ LY ITEASWF+WN KL+A+++A++ K E+ E+ETL+ Sbjct: 96 LSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLSN 155 Query: 1061 TMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMV 882 + ++ ER++ FYCNL+ES++K + ++ ++ I S+S YVK +AY+SMV Sbjct: 156 AVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLREITRRSTSVYVKTQAYKSMV 215 Query: 881 RSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFE 702 LC++ QP +AE ++EEMR +K FE ++++Y YG++GL EDM R V ++ +G + Sbjct: 216 SGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEGHK 275 Query: 701 LDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKS 522 +DTVC+NMVLSS G H L +M SWLQ++K + S RTYNSVLNSCPTI+ +L+D+ S Sbjct: 276 IDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDLDS 335 Query: 521 VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342 P+S+ LL L KDE ++V L SSVLDE +EW+S E KLDLHGMHLS SYLI +QW+ Sbjct: 336 CPVSLSELLTFLNKDEEVLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQWM 395 Query: 341 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162 D MR RFS G + +VP EI +V G GKHS VRG+SPVK+L+K++++R P++IDRKN+G Sbjct: 396 DEMRIRFSEG-KCVVPAEIVLVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNIG 454 Query: 161 CFIAKGKVFRDWLC 120 FIAKGK ++WLC Sbjct: 455 SFIAKGKTVKEWLC 468 >ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 505 Score = 403 bits (1035), Expect = e-109 Identities = 210/420 (50%), Positives = 287/420 (68%) Frame = -3 Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200 L K G RFL+SL++ + D SA R I+KFV +SPK VAL+ Sbjct: 89 LMKHGDRFLSSLSSPALAG--DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHL 145 Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020 S A+ LY ITEASWF+WN KL+A++IA++ K E+FDE+ETL+ + ++ ER+ Sbjct: 146 SFFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTL 205 Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840 F CNL+ES++K + S+ ++ I SSS YVK +AY+SMV LC++ QP +AE Sbjct: 206 FLCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAER 265 Query: 839 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660 ++EEMR +K FE ++++Y YG++GL +DM R V + +G ++DTVC+NMVLSS G Sbjct: 266 VIEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYG 325 Query: 659 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480 H L +M SWLQ++K + FS+RTYNSVLNSCPTI+ ML+D+ S P+S+ L L + Sbjct: 326 AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNE 385 Query: 479 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300 DE L+V EL SSVLDE +EWN+ E KLDLHGMHLS SYLI LQW+D R RFS + + Sbjct: 386 DEALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCV 444 Query: 299 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120 +P EI VV G GKHS VRG+SPVK+L+K++++R P++IDRKNVG FIAKGK ++WLC Sbjct: 445 IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504 >dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] Length = 501 Score = 403 bits (1035), Expect = e-109 Identities = 210/420 (50%), Positives = 287/420 (68%) Frame = -3 Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200 L K G RFL+SL++ + D SA R I+KFV +SPK VAL+ Sbjct: 85 LMKHGDRFLSSLSSPALAG--DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHL 141 Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020 S A+ LY ITEASWF+WN KL+A++IA++ K E+FDE+ETL+ + ++ ER+ Sbjct: 142 SFFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTL 201 Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840 F CNL+ES++K + S+ ++ I SSS YVK +AY+SMV LC++ QP +AE Sbjct: 202 FLCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAER 261 Query: 839 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660 ++EEMR +K FE ++++Y YG++GL +DM R V + +G ++DTVC+NMVLSS G Sbjct: 262 VIEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYG 321 Query: 659 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480 H L +M SWLQ++K + FS+RTYNSVLNSCPTI+ ML+D+ S P+S+ L L + Sbjct: 322 AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNE 381 Query: 479 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300 DE L+V EL SSVLDE +EWN+ E KLDLHGMHLS SYLI LQW+D R RFS + + Sbjct: 382 DEALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCV 440 Query: 299 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120 +P EI VV G GKHS VRG+SPVK+L+K++++R P++IDRKNVG FIAKGK ++WLC Sbjct: 441 IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500 >ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown protein [Arabidopsis thaliana] gi|330251481|gb|AEC06575.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 403 bits (1035), Expect = e-109 Identities = 210/420 (50%), Positives = 287/420 (68%) Frame = -3 Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200 L K G RFL+SL++ + D SA R I+KFV +SPK VAL+ Sbjct: 88 LMKHGDRFLSSLSSPALAG--DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHL 144 Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020 S A+ LY ITEASWF+WN KL+A++IA++ K E+FDE+ETL+ + ++ ER+ Sbjct: 145 SFFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTL 204 Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840 F CNL+ES++K + S+ ++ I SSS YVK +AY+SMV LC++ QP +AE Sbjct: 205 FLCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAER 264 Query: 839 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660 ++EEMR +K FE ++++Y YG++GL +DM R V + +G ++DTVC+NMVLSS G Sbjct: 265 VIEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYG 324 Query: 659 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480 H L +M SWLQ++K + FS+RTYNSVLNSCPTI+ ML+D+ S P+S+ L L + Sbjct: 325 AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNE 384 Query: 479 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300 DE L+V EL SSVLDE +EWN+ E KLDLHGMHLS SYLI LQW+D R RFS + + Sbjct: 385 DEALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCV 443 Query: 299 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120 +P EI VV G GKHS VRG+SPVK+L+K++++R P++IDRKNVG FIAKGK ++WLC Sbjct: 444 IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503 >ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297329872|gb|EFH60291.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 504 Score = 398 bits (1022), Expect = e-108 Identities = 205/420 (48%), Positives = 288/420 (68%) Frame = -3 Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200 L KQG RFL+SL++ + D SAT R I+KFV +SPK V L+ Sbjct: 88 LMKQGDRFLSSLSSPALAG--DPSATHRH-IKKFVAASPKSVTLNVLSHLLSDQTSYPHL 144 Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020 S A+ LY ITEASWF+WN KL+A+++AV+ E+FDE+ETL+ + ++ ER+ Sbjct: 145 SFFALSLYSEITEASWFDWNPKLIAELVAVLNNQERFDESETLLSTAVSRLKSNERDFAL 204 Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840 F CNL+ES++K + ++ ++ SSS YVK +AY+SMV LC++ QP +AE Sbjct: 205 FLCNLVESNSKQGSIQGFNEACFRLRERIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAER 264 Query: 839 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660 ++EEMR +K FE ++++Y YG++GL +DM R V ++ +G ++DTVC+NMVLSS G Sbjct: 265 VIEEMRVEKIKPGSFEHKSVLYGYGRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYG 324 Query: 659 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480 H L +M SWLQ++K + FS+RTYNSVLNSCPTI+ +L+D+ S P+S+ L L + Sbjct: 325 AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNE 384 Query: 479 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300 DE L+V EL S+VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D +R RF + + Sbjct: 385 DEALLVLELTQSTVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDEIRLRFRD-QKCV 443 Query: 299 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120 +P EI VV G GKHS VRG+SPVK+L+K++++R + P++IDRKNVG FIAKGK ++WLC Sbjct: 444 IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTESPMRIDRKNVGSFIAKGKNVKEWLC 503 >ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa] gi|550331693|gb|EEE86893.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa] Length = 473 Score = 395 bits (1015), Expect = e-107 Identities = 202/425 (47%), Positives = 292/425 (68%), Gaps = 2/425 (0%) Frame = -3 Query: 1388 LRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXX 1209 L A+SKQ RF +++ T A D+SAT R LI+KFV SSPK +ALD Sbjct: 53 LAAISKQAQRFFSAVLPTV--ATSDTSATNR-LIKKFVASSPKSIALDALSNLLSPDSTH 109 Query: 1208 XXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 1032 + +PLYL I+EASWF+WN KLVA V+ ++ K E + L+ ET+ ++ +ER Sbjct: 110 HPLLYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKER 169 Query: 1031 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQ 852 + FYCNLI ++KH D Y+ + + S+S YVKK+ Y++M+ LC++G+ + Sbjct: 170 ELVLFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSDSNSVYVKKQGYKAMISGLCEMGRAR 229 Query: 851 EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 672 EAEDL+ EMRE GLK FE R ++Y YG++GL +DM+R + ++++ E+DTVCANMVL Sbjct: 230 EAEDLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVL 289 Query: 671 SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDI-KSVPISMEHLL 495 +S G H L EM WL++MK+L I S+RT NSVLNSCPTI+ +++++ S P+S++ LL Sbjct: 290 ASYGAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELL 349 Query: 494 KNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSS 315 K L+++E ++V EL+ SSVL E +W++SE KLDLHGMHL +Y+I LQW++ R R S Sbjct: 350 KILSEEEAMLVKELIESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSD 409 Query: 314 GNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVF 135 G + ++P EITVVCG G HS VRG+SPVKS++ E++ + + P++IDRKN+GCF+AKG V Sbjct: 410 G-EHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCFVAKGNVV 468 Query: 134 RDWLC 120 + WLC Sbjct: 469 KKWLC 473