BLASTX nr result
ID: Mentha22_contig00025828
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha22_contig00025828 (1313 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU38120.1| hypothetical protein MIMGU_mgv1a004917mg [Mimulus... 447 e-123 ref|XP_006360543.1| PREDICTED: pentatricopeptide repeat-containi... 382 e-103 ref|XP_004243426.1| PREDICTED: pentatricopeptide repeat-containi... 375 e-101 gb|EYU28635.1| hypothetical protein MIMGU_mgv1a020179mg, partial... 363 9e-98 gb|EPS69217.1| hypothetical protein M569_05548, partial [Genlise... 340 1e-90 ref|XP_002270788.1| PREDICTED: pentatricopeptide repeat-containi... 338 3e-90 ref|XP_002524769.1| pentatricopeptide repeat-containing protein,... 338 4e-90 ref|XP_007201156.1| hypothetical protein PRUPE_ppa004264mg [Prun... 333 1e-88 ref|XP_007050731.1| Pentatricopeptide repeat-containing protein,... 329 1e-87 gb|EXC34654.1| hypothetical protein L484_020422 [Morus notabilis] 324 6e-86 ref|XP_006444185.1| hypothetical protein CICLE_v10019759mg [Citr... 317 1e-83 ref|XP_006577688.1| PREDICTED: pentatricopeptide repeat-containi... 312 2e-82 ref|XP_007162371.1| hypothetical protein PHAVU_001G146400g [Phas... 305 4e-80 ref|XP_004147968.1| PREDICTED: pentatricopeptide repeat-containi... 303 1e-79 ref|XP_003553441.1| PREDICTED: pentatricopeptide repeat-containi... 302 2e-79 ref|XP_004292613.1| PREDICTED: pentatricopeptide repeat-containi... 300 7e-79 ref|XP_006294198.1| hypothetical protein CARUB_v10023194mg [Caps... 299 2e-78 ref|XP_004488316.1| PREDICTED: pentatricopeptide repeat-containi... 295 4e-77 ref|XP_002322993.2| hypothetical protein POPTR_0016s12710g [Popu... 293 1e-76 ref|NP_180636.1| pentatricopeptide repeat-containing protein [Ar... 287 8e-75 >gb|EYU38120.1| hypothetical protein MIMGU_mgv1a004917mg [Mimulus guttatus] Length = 504 Score = 447 bits (1151), Expect = e-123 Identities = 229/392 (58%), Positives = 289/392 (73%) Frame = +1 Query: 136 MKRASKLAEFRKFLVTSRFSHCNNERIAKPSALIRFLCQDTTQTHSGNEKKVTNFGSLAS 315 MKR +++EF+K + T I KPS ++F+ + TQ+ ++ K T + S Sbjct: 2 MKRVWRISEFQKCINTRNI-------ITKPSLPLQFI-RHVTQSPGIDDGKHTTLSDIVS 53 Query: 316 LFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYPDGSA 495 LF+DK S N ET K+ REKV+ L++E+L E+ EKIEK+LEENGVALFR Y DGSA Sbjct: 54 LFSDKWSRNPAETILKKNLREKVSKLKDELLTQNEDPEKIEKVLEENGVALFRIYSDGSA 113 Query: 496 IVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVELFKEA 675 +VELL QL++ P AME F WRRKQLDY++PMT+EEY+KGI +AG+L N+ LA+ELFKEA Sbjct: 114 VVELLFQLKSFPYLAMEVFSWRRKQLDYSAPMTVEEYAKGIAMAGRLKNIDLALELFKEA 173 Query: 676 SNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAGLMRI 855 SNKQL TSLFNALMS Y +GL +CQS+F+DLK + TC+P LM + Sbjct: 174 SNKQLMATSLFNALMSAYTYNGLAMRCQSLFRDLKMESTCSPSIVTYNILISAFGRLMLV 233 Query: 856 DNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDIHLLM 1035 D+MEAT +E+KDLN+ P ++TYKG+I+GYI+AW WEEMEKTY MK GPIKPD+DIHLLM Sbjct: 234 DHMEATLREVKDLNIPPTVNTYKGLIAGYITAWKWEEMEKTYRLMKAGPIKPDIDIHLLM 293 Query: 1036 LRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLGKIPK 1215 LRGYA + K+E+MEEIY MVRDHVD NE LIR MI AYCR+ RVE+V+ELL IP+ Sbjct: 294 LRGYAHSGKLERMEEIYDMVRDHVDQNEIPLIRTMICAYCRNSRVGRVEKVDELLRLIPE 353 Query: 1216 DDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 D+Y+P LNV+LICLYA E LL+QMENSI EAF Sbjct: 354 DEYRPWLNVILICLYAREDLLDQMENSIKEAF 385 >ref|XP_006360543.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Solanum tuberosum] Length = 504 Score = 382 bits (981), Expect = e-103 Identities = 202/393 (51%), Positives = 275/393 (69%), Gaps = 1/393 (0%) Frame = +1 Query: 136 MKRASKLAE-FRKFLVTSRFSHCNNERIAKPSALIRFLCQDTTQTHSGNEKKVTNFGSLA 312 MKR ++ + F+K + +FS C I + SA ++ + S ++ + + Sbjct: 2 MKRVWRIHDAFQKEAILQKFSSCYT--IGRGSANTLWIRGLAGKASSSSQPAA--WPHVF 57 Query: 313 SLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYPDGS 492 +LF D+ S + N+ D REKV+ L++E+LA+ +AEK EKIL + G LF RY DGS Sbjct: 58 TLFADRWSPADSLVNQ--DMREKVSHLKDELLAYSGDAEKFEKILADKGDLLFSRYADGS 115 Query: 493 AIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVELFKE 672 A+VELL+QL++SP A++AF WRR+QLDY +PMT+EEYSK I +AG+L N+ LA +LFKE Sbjct: 116 AVVELLQQLKSSPGLALQAFDWRRRQLDYQNPMTVEEYSKAIVVAGRLKNVDLAAKLFKE 175 Query: 673 ASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAGLMR 852 ASNKQLK TSL+NALM+ YM +GL KCQSVF+DLK++ TC P LM Sbjct: 176 ASNKQLKSTSLYNALMTAYMINGLAVKCQSVFRDLKREATCTPTIVTYNILISVFGRLML 235 Query: 853 IDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDIHLL 1032 ID+MEAT +E+ DL + PN+ TY +I+GYI+AWMW+++EK Y MK G IKPDL HLL Sbjct: 236 IDHMEATLREINDLGICPNVGTYNYLIAGYITAWMWDDVEKAYRIMKAGSIKPDLTTHLL 295 Query: 1033 MLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLGKIP 1212 MLRGYA + K++ MEEIY++V+ HVD + LIR MI AY +S +V+++EEL+ IP Sbjct: 296 MLRGYAHSGKLDNMEEIYELVKGHVDRHGIPLIRSMICAYSKSSDVNKVQKIEELMRLIP 355 Query: 1213 KDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 KDDY+P LNV+LICLYA+E LL++MENSINEAF Sbjct: 356 KDDYRPWLNVILICLYAKEDLLDEMENSINEAF 388 >ref|XP_004243426.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Solanum lycopersicum] Length = 504 Score = 375 bits (963), Expect = e-101 Identities = 190/333 (57%), Positives = 248/333 (74%) Frame = +1 Query: 313 SLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYPDGS 492 +LF D+ S + N+ D REKV+ L++E+LA+ +AE EKIL + GV+LF RY DGS Sbjct: 58 TLFADRRSPADSLVNQ--DMREKVSHLKDELLAYSGDAEMFEKILADKGVSLFSRYADGS 115 Query: 493 AIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVELFKE 672 A+VELL+QL++SP A++AF WRR+QLD SPMT+EEYSK I +AG+L N+ LA +LFKE Sbjct: 116 AVVELLQQLKSSPGLAVQAFDWRRRQLDCWSPMTVEEYSKAIVMAGRLKNIDLAAKLFKE 175 Query: 673 ASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAGLMR 852 ASNK+LK TSL+NALM+ YM +GL KCQSVF+DLK++ TC P LM Sbjct: 176 ASNKRLKSTSLYNALMTAYMINGLAVKCQSVFRDLKREATCTPTIVTYNILISVFGRLML 235 Query: 853 IDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDIHLL 1032 ID+M AT +E+ DL + PN+ TY +I+GYI+AWMW+++EKTY MK G IKPDL HLL Sbjct: 236 IDHMAATLREINDLGICPNVGTYNYLIAGYITAWMWDDVEKTYRIMKAGSIKPDLTTHLL 295 Query: 1033 MLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLGKIP 1212 MLRGYA + K+E MEE+Y++V+ HVD LIR MI AY +S +V+++EEL+ IP Sbjct: 296 MLRGYAHSGKLENMEEMYELVKGHVDRYGIPLIRSMICAYSKSSDVNKVQKIEELMRLIP 355 Query: 1213 KDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 KDDY+P LNV+LICLYA+E LL+QMENSINEAF Sbjct: 356 KDDYRPWLNVILICLYAKEDLLDQMENSINEAF 388 >gb|EYU28635.1| hypothetical protein MIMGU_mgv1a020179mg, partial [Mimulus guttatus] Length = 397 Score = 363 bits (932), Expect = 9e-98 Identities = 177/275 (64%), Positives = 216/275 (78%) Frame = +1 Query: 487 GSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVELF 666 GS +VELL QL++ P AME F WRRKQLDY++PMT+EEY+KGI +AG+L N+ LA+ELF Sbjct: 4 GSPVVELLFQLKSFPYLAMEVFSWRRKQLDYSAPMTVEEYAKGIAMAGRLKNIDLALELF 63 Query: 667 KEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAGL 846 KEASNKQL TSLFNALMS Y +GL +CQS+F+DLK + TC+P L Sbjct: 64 KEASNKQLMATSLFNALMSAYTYNGLAMRCQSLFRDLKMESTCSPSIVTYNILISAFGRL 123 Query: 847 MRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDIH 1026 M +D+MEAT +E+KDLN+ P ++TYKG+I+GYI+AW WEEMEKTY MK GPIKPD+DIH Sbjct: 124 MLVDHMEATLREVKDLNIPPTVNTYKGLIAGYITAWKWEEMEKTYRMMKAGPIKPDIDIH 183 Query: 1027 LLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLGK 1206 LLMLRGYA + K+E+MEEIY MVRDHVD NE LIR MI AYCR+ RVE+V+ELL Sbjct: 184 LLMLRGYAHSGKLERMEEIYDMVRDHVDQNEIPLIRTMICAYCRNSRVGRVEKVDELLRL 243 Query: 1207 IPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 IP+D+Y+P LNV+LICLYA E LL+QMENSI EAF Sbjct: 244 IPEDEYRPWLNVILICLYAREDLLDQMENSIKEAF 278 >gb|EPS69217.1| hypothetical protein M569_05548, partial [Genlisea aurea] Length = 496 Score = 340 bits (871), Expect = 1e-90 Identities = 187/392 (47%), Positives = 261/392 (66%), Gaps = 1/392 (0%) Frame = +1 Query: 136 MKRASKLAEFRKFLVTSRFSHCNNERIAKPSALIRFLCQDTTQTHSGNEKKVTNFGSLAS 315 MKRA ++ R+ L S A+PS LI+ + + + ++ + F +AS Sbjct: 1 MKRAWRIGFMRRCLNVSIAER------AQPSNLIKLIRRYAAEIPPKDDSQ-PGFVDIAS 53 Query: 316 LFTD-KLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYPDGS 492 LF++ K + RTE FR V+ L++EIL H E+A++I ++LEE GV L RRY +G Sbjct: 54 LFSEYKDAAARTE------FRNSVSELKDEILGHGEDADRIGQVLEEKGVDLLRRYANGC 107 Query: 493 AIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVELFKE 672 A++ELL QL++ P +AME F WRR Q ++PMT +EYSKGI +AGKL + LA ELF+E Sbjct: 108 AVIELLAQLKHFPRSAMEVFSWRRNQ---STPMTADEYSKGISLAGKLRYVDLAFELFEE 164 Query: 673 ASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAGLMR 852 A NKQL G L+N+LMS YM +GL +KCQ++F+DLK D CAP L+ Sbjct: 165 AKNKQLNGVCLYNSLMSTYMYNGLATKCQALFRDLKMDSACAPSIITYNVLISVFGRLVL 224 Query: 853 IDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDIHLL 1032 D+MEATF E+ +LNL P + TYK +I+GY++AWMW++MEK Y +K G + PD+ IHLL Sbjct: 225 TDHMEATFSEIANLNLLPTLRTYKVLIAGYVTAWMWDKMEKAYELLKAGDLIPDVSIHLL 284 Query: 1033 MLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLGKIP 1212 MLRGYA++ ++EKME + MVRD + + L+R MI AY RS RVE++ ELL +P Sbjct: 285 MLRGYAISGRLEKMEAFFDMVRDEISYKKFPLMRAMICAYSRSCDRRRVEKISELLRLLP 344 Query: 1213 KDDYQPCLNVLLICLYAEESLLEQMENSINEA 1308 D+Y+P LNVLLI LYA+E+LLE+ME+SI+EA Sbjct: 345 GDEYRPWLNVLLIRLYAQEALLEEMEDSIDEA 376 >ref|XP_002270788.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780 [Vitis vinifera] gi|296086664|emb|CBI32299.3| unnamed protein product [Vitis vinifera] Length = 494 Score = 338 bits (867), Expect = 3e-90 Identities = 163/336 (48%), Positives = 240/336 (71%) Frame = +1 Query: 304 SLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYP 483 S+ LF+DK ++ + ++ R KV+ LR+E++ ++++ + ++LEE G +LFR Y Sbjct: 42 SMIGLFSDK--HSELDLRAREELRGKVSQLRDELVPSGDDSDMVVRVLEEKGESLFRSYS 99 Query: 484 DGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVEL 663 +GSA VELL+QL + P A++ F WRR Q DY+ PMT EEY+KGI +AG+ N+ LAVEL Sbjct: 100 NGSAFVELLKQLSSWPYLALQVFNWRRNQTDYSIPMTSEEYAKGISVAGRTKNVDLAVEL 159 Query: 664 FKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAG 843 F EA+NKQ+K TS +NALM YM +G KCQ++F+DLK++ +C+P Sbjct: 160 FTEAANKQIKTTSTYNALMGAYMCNGHAEKCQALFRDLKREASCSPTIVTYNILISVFGR 219 Query: 844 LMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDI 1023 LM +D+MEATF+E+K+L L+PNI TY +I+GY++AWMW ME T+ +MK+ I+PD++ Sbjct: 220 LMLVDHMEATFREIKELELSPNISTYNNIIAGYVTAWMWNRMEDTFRTMKEDNIQPDINT 279 Query: 1024 HLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLG 1203 HLLMLRGYA + ++KMEE Y++++ HV+ E LIR MI AYC+S RVE++ L+ Sbjct: 280 HLLMLRGYAHSGNLQKMEETYELIKGHVNDKEIPLIRAMICAYCKSSITDRVEKIGALMK 339 Query: 1204 KIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 IP+++Y+P LNV+LI +YA+E +E+MENSINEAF Sbjct: 340 LIPENEYRPWLNVMLIRVYAQEDWVEEMENSINEAF 375 >ref|XP_002524769.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223535953|gb|EEF37612.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 509 Score = 338 bits (866), Expect = 4e-90 Identities = 176/395 (44%), Positives = 259/395 (65%), Gaps = 3/395 (0%) Frame = +1 Query: 136 MKRASKLAEFR---KFLVTSRFSHCNNERIAKPSALIRFLCQDTTQTHSGNEKKVTNFGS 306 MKR SK+++ + L ++F L + + H+ + V+ F + Sbjct: 1 MKRVSKISDLAVQAELLSLNKFPSITQTLTPYILTLTKSPIYKLARAHTS--EPVSFFPN 58 Query: 307 LASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYPD 486 + SLF+ + + +D + V+ LR+E++ H E+++K ++LEE G +LFR D Sbjct: 59 IISLFSRRFP---VDNKAIEDLSKTVSHLRDELVQHAEDSDKFFRVLEEQGDSLFRMRSD 115 Query: 487 GSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVELF 666 SA+VELLRQL + P A+E F WRRKQ ++++PMT EEY+KGI IAG+ N+ LA+E+F Sbjct: 116 RSALVELLRQLVSLPHLAVEVFNWRRKQTEWSTPMTHEEYAKGITIAGRAKNVDLAIEIF 175 Query: 667 KEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAGL 846 EA +K+ K T ++NALM YM +G KCQS+F D KK+ P Sbjct: 176 AEACSKRRKKTCIYNALMGAYMYNGHYDKCQSLFLDFKKEANIGPSVVTYNILISVFGRS 235 Query: 847 MRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDIH 1026 M +D+MEATF+EL +LN++PN+ TY +I+GY++AWMW++ME+ + MK+GPI P LD + Sbjct: 236 MLVDHMEATFRELMNLNISPNVSTYNNLIAGYVTAWMWDDMEQVFQLMKEGPIYPHLDTY 295 Query: 1027 LLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLGK 1206 LLMLRGYA + IEKMEE+YK+V+DHV+ NE LIR MI AYC+S R++++EELL Sbjct: 296 LLMLRGYAHSGNIEKMEEMYKLVQDHVNVNEVPLIRTMICAYCKSSITDRIKKIEELLRL 355 Query: 1207 IPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 IP+++Y+P LNVLLI +YA+++LLE MEN I+EAF Sbjct: 356 IPEEEYRPWLNVLLIKVYAQQNLLEAMENKIDEAF 390 >ref|XP_007201156.1| hypothetical protein PRUPE_ppa004264mg [Prunus persica] gi|462396556|gb|EMJ02355.1| hypothetical protein PRUPE_ppa004264mg [Prunus persica] Length = 519 Score = 333 bits (854), Expect = 1e-88 Identities = 180/402 (44%), Positives = 254/402 (63%), Gaps = 10/402 (2%) Frame = +1 Query: 136 MKRASKLAEFRKFLVTSRFSHCNNERIAKPSALI-------RFLCQDTTQTHSGNE---K 285 MKR KL++ + + R + + P L RF Q HS + Sbjct: 1 MKRVWKLSDAAQSELLCRHRCSSKTKTLPPYNLTLTNSPNYRFTRDVNAQNHSSSSPSSS 60 Query: 286 KVTNFGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVA 465 T+F S+ LF DK S + +D KV LR+E++ + ++++ ++LEE G + Sbjct: 61 SPTSFPSIIGLFIDKPSPQ--DLRAREDLVHKVTQLRDELVQNSGDSDEFVRVLEEKGSS 118 Query: 466 LFRRYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNL 645 F Y +G A+VEL+ QL++ P A+E F WRRKQ+ +PMT EEY+K I +AGK N+ Sbjct: 119 FFSSYGNGYAVVELMNQLRSWPHLAVEVFYWRRKQVAGGTPMTPEEYAKAITLAGKTKNV 178 Query: 646 VLAVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXX 825 LAVELF EA NK++K TS++NALMS YM +GL +KCQS+F+DLK++P C+P Sbjct: 179 ELAVELFTEALNKRIKTTSIYNALMSAYMFNGLAAKCQSLFRDLKREPDCSPTIVTYNIL 238 Query: 826 XXXXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPI 1005 LM +D+MEAT + L DLNL+PN+ TY +I+GYI+AWMW+ ME+T+ +MK GP+ Sbjct: 239 ISVFGRLMLVDHMEATLQGLNDLNLSPNLSTYNNLIAGYITAWMWDSMEETFQNMKVGPV 298 Query: 1006 KPDLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVER 1185 PD HLLMLRGYA A + KMEE Y++V+ HV+ E LIR MI AYC+S A RV++ Sbjct: 299 SPDTSTHLLMLRGYAHAGNLVKMEETYELVKHHVNDKEIPLIRAMICAYCKSSAADRVKK 358 Query: 1186 VEELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 + L+ IP+++Y+P LNVLLI +YA+E E ME SI+EAF Sbjct: 359 IHSLMKLIPENEYRPWLNVLLIRVYAQEDWFEAMEKSIDEAF 400 >ref|XP_007050731.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] gi|508702992|gb|EOX94888.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 529 Score = 329 bits (844), Expect = 1e-87 Identities = 168/346 (48%), Positives = 235/346 (67%), Gaps = 10/346 (2%) Frame = +1 Query: 304 SLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALF---R 474 ++ SLF +KLS + +D KV R+E++ + E++ K LEE G L + Sbjct: 65 NIISLFIEKLSQAEPQPKAREDLFRKVILFRDELVKSVDCLEEVNKALEEKGDWLLGSCK 124 Query: 475 RYP-------DGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGK 633 YP SA +ELL++L +S + A+E F WRRK+ + PMT EEY+ GI IAG+ Sbjct: 125 HYPALRTKNFHDSAFLELLKKLYSSGNLALEVFNWRRKKAEQGYPMTEEEYASGIVIAGR 184 Query: 634 LNNLVLAVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXX 813 + N+ LAVELF EA+NKQLK TS +NALMS YM SGL KCQ VF+D K++P C+P Sbjct: 185 IRNVDLAVELFAEAANKQLKSTSTYNALMSAYMYSGLAEKCQLVFRDFKREPDCSPSIVT 244 Query: 814 XXXXXXXXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMK 993 LM ID+MEATF+E+K+L+L+PN++TY +I+ Y++AWMW+ ME+T+ MK Sbjct: 245 YNILISVFGRLMLIDHMEATFQEIKNLDLSPNLNTYNNLIAAYLTAWMWDSMERTFHMMK 304 Query: 994 DGPIKPDLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAA 1173 GP+KPD+ H+LMLRGYA + K+E+ME Y+M++ HVD E LIR MI AYC+S Sbjct: 305 AGPVKPDIKTHMLMLRGYAHSGKLEQMERTYQMIKHHVDDKEIPLIRAMICAYCKSSVKG 364 Query: 1174 RVERVEELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 R +R++ELL IP+++Y+P LN+L+I LYA+E+ LE M+NSINEAF Sbjct: 365 RTKRIKELLRLIPENEYRPWLNLLMIRLYAQENCLESMDNSINEAF 410 >gb|EXC34654.1| hypothetical protein L484_020422 [Morus notabilis] Length = 489 Score = 324 bits (830), Expect = 6e-86 Identities = 165/360 (45%), Positives = 242/360 (67%), Gaps = 7/360 (1%) Frame = +1 Query: 253 DTTQTHSGNEKKVTNFGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEK 432 DT + S + F +++ LFT K +ET ++D ++VA LR+ + + ++++ Sbjct: 50 DTHSSSSSSFDSSALFSNISRLFTAKSPTTLSET-LTEDLAQEVAQLRDALARNLHDSDE 108 Query: 433 IEKILEENGVALFRRYPDGSAIVELLRQLQNSPSAAMEAFG-------WRRKQLDYASPM 591 + ++LEEN L RR+ GSAIV LL+QL + PS A+E F WRRKQ D PM Sbjct: 109 VVRVLEENCGPLLRRFAGGSAIVHLLKQLDSRPSLALEDFNYACKVLNWRRKQEDCEFPM 168 Query: 592 TIEEYSKGICIAGKLNNLVLAVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQ 771 T EEY+KGI +AGK+ + LA+ELF EA+NKQL+ TS +NALM+ YM +G KC+S+F Sbjct: 169 TEEEYAKGIALAGKVEKVGLAIELFSEAANKQLRTTSTYNALMTAYMFNGYAHKCESLFW 228 Query: 772 DLKKDPTCAPXXXXXXXXXXXXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISA 951 LK+DP C+P L+ +D+M+ TFKE+++LNL+PN+ TY +I+GYI A Sbjct: 229 ALKRDPNCSPTIATYNILILVFGRLLLLDHMKLTFKEIENLNLSPNLSTYNALITGYIRA 288 Query: 952 WMWEEMEKTYLSMKDGPIKPDLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLI 1131 WMW++ME+T+ MK+GP KP +LLMLRG+A + +EKME++Y++V+D++ LI Sbjct: 289 WMWDDMERTFQLMKEGPAKPSTSTYLLMLRGFAHSGNLEKMEQMYELVKDYMTQKHNPLI 348 Query: 1132 RVMINAYCRSKHAARVERVEELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 R MI AYC+S R++++EEL+ IP++DY+P LNVLLI +YAEE +E ME SI+EAF Sbjct: 349 RTMICAYCKSSGKERIKKIEELMRHIPEEDYRPWLNVLLIRVYAEEDWIEAMEKSIDEAF 408 >ref|XP_006444185.1| hypothetical protein CICLE_v10019759mg [Citrus clementina] gi|567903396|ref|XP_006444186.1| hypothetical protein CICLE_v10019759mg [Citrus clementina] gi|568852322|ref|XP_006479827.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like isoform X1 [Citrus sinensis] gi|557546447|gb|ESR57425.1| hypothetical protein CICLE_v10019759mg [Citrus clementina] gi|557546448|gb|ESR57426.1| hypothetical protein CICLE_v10019759mg [Citrus clementina] Length = 513 Score = 317 bits (811), Expect = 1e-83 Identities = 155/340 (45%), Positives = 232/340 (68%) Frame = +1 Query: 292 TNFGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALF 471 T F +L L ++ L+Y + KD + V+ LR+E+LA+ ++ +K+ ++L+E G LF Sbjct: 57 TVFPTLVRLLSETLTY--PDARVRKDLTQTVSALRDELLANVDDLDKVFRVLDEKGSCLF 114 Query: 472 RRYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVL 651 RR+ +G A VEL++QL + P A+E WRR+Q Y +PMT EEY+KGI AG++NN+ L Sbjct: 115 RRHSNGYAFVELMKQLGSRPRLALEVLNWRRRQAGYGTPMTKEEYTKGIKFAGRINNVDL 174 Query: 652 AVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXX 831 A +LF EA+NK LK +NAL+ YM +GL+ KCQS+F+DLKK+ +P Sbjct: 175 AADLFAEAANKHLKTIGTYNALLGAYMYNGLSDKCQSLFRDLKKEANISPSIVTYNTLIS 234 Query: 832 XXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKP 1011 L+ +D+MEA F+E+KD NL+PN+ TY +I+GY++AWMW ++E+ Y MK GP+ P Sbjct: 235 VFGRLLLVDHMEAAFQEIKDSNLSPNVFTYNYLIAGYMTAWMWGKVEEIYQMMKAGPVMP 294 Query: 1012 DLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVE 1191 D + +LL+LRGYA + + +ME+IY++V+ HVD E LIR MI AY + R++++E Sbjct: 295 DTNTYLLLLRGYAHSGNLPRMEKIYELVKHHVDGKEFPLIRAMICAYSKCSVTDRIKKIE 354 Query: 1192 ELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 L+ IP+ +Y+P LNVLLI +YA+E LE+ME SIN+AF Sbjct: 355 ALMRLIPEKEYRPWLNVLLIRVYAKEDCLEEMEKSINDAF 394 >ref|XP_006577688.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like, partial [Glycine max] Length = 482 Score = 312 bits (799), Expect = 2e-82 Identities = 164/353 (46%), Positives = 234/353 (66%), Gaps = 1/353 (0%) Frame = +1 Query: 256 TTQTHSGNEKKVTN-FGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEK 432 T +H N V N + + LFT K ++ +D KV L+ E++ ++ + Sbjct: 17 TALSHHRNVHTVPNIYSDITPLFTAKSC-----SSPKQDLINKVTILKNELIRDSCDSAR 71 Query: 433 IEKILEENGVALFRRYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSK 612 ++ IL+++ LFRR+P+GSA+++L+ QL ++PS A++ F WRRK+ + +PM EYSK Sbjct: 72 VQTILDDSFDTLFRRHPNGSALLKLMNQLNSNPSLALQVFSWRRKRSNAENPMDAYEYSK 131 Query: 613 GICIAGKLNNLVLAVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPT 792 GI AG+ N+ LAV+LFKEA+ K +K T +NALM +M +GL CQS+F DLK+D T Sbjct: 132 GIKAAGRSGNVDLAVKLFKEAAVKGIKTTGTYNALMGAFMFNGLPDNCQSLFCDLKRDLT 191 Query: 793 CAPXXXXXXXXXXXXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEME 972 C P LM +D+MEATF E++ LNLA NI TY +I+GYI+AWMW++ME Sbjct: 192 CDPSIATYNILLSVYGRLMLVDHMEATFSEIQRLNLAMNICTYNHLIAGYITAWMWDDME 251 Query: 973 KTYLSMKDGPIKPDLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAY 1152 K + +K ++P++ HLLMLRGYA + +EKMEE+Y +RDHV+ E LIR MI AY Sbjct: 252 KVFQMLKLSSVEPNMKTHLLMLRGYANSGNLEKMEEMYSFIRDHVNIKEISLIRCMICAY 311 Query: 1153 CRSKHAARVERVEELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 CRS HA R++++E LL IP+ +Y+P LNVLLI LYA+E L +MEN+INEAF Sbjct: 312 CRSSHADRLKKIELLLKFIPQKEYRPWLNVLLIKLYAKEDWLAKMENAINEAF 364 >ref|XP_007162371.1| hypothetical protein PHAVU_001G146400g [Phaseolus vulgaris] gi|593798668|ref|XP_007162372.1| hypothetical protein PHAVU_001G146400g [Phaseolus vulgaris] gi|561035835|gb|ESW34365.1| hypothetical protein PHAVU_001G146400g [Phaseolus vulgaris] gi|561035836|gb|ESW34366.1| hypothetical protein PHAVU_001G146400g [Phaseolus vulgaris] Length = 500 Score = 305 bits (780), Expect = 4e-80 Identities = 156/339 (46%), Positives = 220/339 (64%) Frame = +1 Query: 295 NFGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFR 474 N ++++LF+D + ++ S V LR E++ ++ +++ IL++ L R Sbjct: 43 NLNTMSNLFSDLVPLFAAKSYSSAKRDHHVTILRNELVRESSDSVRVQSILDDKYDRLNR 102 Query: 475 RYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLA 654 RYP+G +L+ QL ++PS A++ F WRRK+ + +PM EYSKGI AG+ N+ LA Sbjct: 103 RYPEGYVFFQLMNQLNSNPSLALQVFNWRRKRCNAENPMDSREYSKGITAAGRAGNINLA 162 Query: 655 VELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXX 834 V LF EA+ K LK TS +NAL+S + +GL C S+F DLK+DPTC P Sbjct: 163 VNLFNEAARKGLKTTSTYNALISAFSLNGLADNCWSLFSDLKRDPTCDPSIATFNILISS 222 Query: 835 XAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPD 1014 LM ID+MEA F E++ L+LA NI TY +I+GYI+AWMW++MEK + +K P++P Sbjct: 223 CGSLMLIDHMEAIFSEIRRLDLAVNISTYNHLIAGYITAWMWDDMEKVFQMLKSSPVQPK 282 Query: 1015 LDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEE 1194 +LLMLRGYA + +EKMEE+Y +VRDHV+ +E LIR MI AY RS ++ R+E Sbjct: 283 TKTYLLMLRGYANSGNLEKMEEMYSIVRDHVNEHEIRLIRCMICAYSRSSEPNKLMRIES 342 Query: 1195 LLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 LL IP+ +Y+P LNVLLI LYA+E LE+MENSINEAF Sbjct: 343 LLKFIPEKEYRPWLNVLLIKLYAKEDRLEKMENSINEAF 381 >ref|XP_004147968.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Cucumis sativus] gi|449494249|ref|XP_004159492.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Cucumis sativus] Length = 514 Score = 303 bits (775), Expect = 1e-79 Identities = 152/311 (48%), Positives = 216/311 (69%) Frame = +1 Query: 379 KVAGLREEILAHRENAEKIEKILEENGVALFRRYPDGSAIVELLRQLQNSPSAAMEAFGW 558 KV + E++ + E++ KI KILE++ L ++ DGSA VELL+QL + P+ A+E F W Sbjct: 84 KVLEVSHELILNSEDSNKIVKILEDSKDLLLWKHTDGSAFVELLKQLGSQPNLALEVFNW 143 Query: 559 RRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVELFKEASNKQLKGTSLFNALMSVYMRS 738 RR+Q + P+T+EEY+KGI +AGK ++ LAV LF EASNK++K TS +NALM V+M + Sbjct: 144 RRRQ-GGSFPLTVEEYAKGIAVAGKSKHIDLAVGLFNEASNKRVKATSTYNALMGVFMFN 202 Query: 739 GLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAGLMRIDNMEATFKELKDLNLAPNIHT 918 GL KC SVF+DLK+D C P LM +D+MEAT +E+ +LNL+PN++T Sbjct: 203 GLADKCNSVFRDLKRDAGCVPNIVTYNILISVFGRLMLVDHMEATMREIHNLNLSPNVNT 262 Query: 919 YKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDIHLLMLRGYALAHKIEKMEEIYKMVR 1098 Y +I+GYI+AWMW+ ME+ ++ MK I P+ + LLMLRGYA + +EKMEE++ ++ Sbjct: 263 YNSLIAGYITAWMWKRMEQAFMKMKASSITPNTETFLLMLRGYAHSDNLEKMEEMHHFLK 322 Query: 1099 DHVDANETHLIRVMINAYCRSKHAARVERVEELLGKIPKDDYQPCLNVLLICLYAEESLL 1278 DHV+ N LIR MI AY RS +V +++ LL IP+++Y+P LNV LI +YA+ L Sbjct: 323 DHVNKNNFPLIRAMIYAYSRSSITDKVHKIDALLKLIPEEEYRPWLNVKLIRVYAQADCL 382 Query: 1279 EQMENSINEAF 1311 E+MENSINEAF Sbjct: 383 ERMENSINEAF 393 >ref|XP_003553441.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like isoform X1 [Glycine max] gi|571557432|ref|XP_006604408.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like isoform X2 [Glycine max] gi|571557435|ref|XP_006604409.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like isoform X3 [Glycine max] Length = 504 Score = 302 bits (773), Expect = 2e-79 Identities = 162/353 (45%), Positives = 224/353 (63%), Gaps = 1/353 (0%) Frame = +1 Query: 256 TTQTHSGNEKKVTNFGS-LASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEK 432 T +H N V N S L LFT K + E D K + L+ E++ ++ + Sbjct: 40 TALSHHRNLNSVPNACSDLIPLFTAKSCSSAKE-----DLINKASILKNELIRESSDSAR 94 Query: 433 IEKILEENGVALFRRYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSK 612 + IL++N L +R+PDGS ++ L+ QL ++PS A++ F WRRK+ + +PM EYSK Sbjct: 95 VLSILDDNSDTLIQRHPDGSVLLRLMNQLNSNPSLALQVFSWRRKRSNVENPMDAYEYSK 154 Query: 613 GICIAGKLNNLVLAVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPT 792 GI AG+ N+ LAV+LFKEA+ K +K TS +NALM M +GL CQS+F DLK+DPT Sbjct: 155 GIKAAGRSGNVDLAVKLFKEAAVKGIKTTSTYNALMGACMSNGLADNCQSLFCDLKRDPT 214 Query: 793 CAPXXXXXXXXXXXXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEME 972 C P LM +D+MEATF+E++ L NI TY MI+GYI+AWMW++ME Sbjct: 215 CDPSIATYNILLSVFGRLMLVDHMEATFREIQKLTFTMNICTYNHMIAGYITAWMWDDME 274 Query: 973 KTYLSMKDGPIKPDLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAY 1152 + +K P++P++ ++LMLRGYA + +EKMEEIY + D VD E LIR MI AY Sbjct: 275 NVFQMLKRSPVEPNMKTYMLMLRGYANSGNLEKMEEIYSFITDRVDIKEISLIRCMICAY 334 Query: 1153 CRSKHAARVERVEELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 RS A R++++E LL IP +Y+P LNVLLI LYA+E LE+MEN+INEAF Sbjct: 335 SRSSDADRLKKIELLLKFIPGKEYRPWLNVLLIKLYAKEDWLEKMENAINEAF 387 >ref|XP_004292613.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Fragaria vesca subsp. vesca] Length = 521 Score = 300 bits (769), Expect = 7e-79 Identities = 155/340 (45%), Positives = 219/340 (64%), Gaps = 2/340 (0%) Frame = +1 Query: 298 FGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENG--VALF 471 F S+ +F D+ + E +D K+ L EE++ +EKI ++LE+ G V Sbjct: 65 FSSVIDIFVDRPTLQ--EIRAREDLGRKLNQLAEELVQTAGRSEKIARVLEDEGSQVLWS 122 Query: 472 RRYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVL 651 + D + IVEL+ QL P A+E F WRR Q D ++PMT EY+K I AGK+ N+ L Sbjct: 123 HGFRDVNVIVELMHQLGRWPHLALEVFNWRRNQADGSNPMTAAEYTKAITAAGKIKNVQL 182 Query: 652 AVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXX 831 A+ELF EA NK+LK T ++NALM Y+ G +KCQ +F+DLK++ C P Sbjct: 183 ALELFDEAINKRLKTTYIYNALMMAYVLKGHTAKCQYLFRDLKRERDCRPTIVTYNILIS 242 Query: 832 XXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKP 1011 LM +D+MEAT +ELK+L+L+P+++TY +I+GYI+AWMW+ ME+T+ MK GP+ P Sbjct: 243 VFGRLMLVDHMEATVRELKELDLSPSVYTYNNLIAGYITAWMWDRMERTFQKMKAGPVSP 302 Query: 1012 DLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVE 1191 + +LLMLRGYA + ++KMEE+Y++VR H + E LIR MI AYCRS RV+++ Sbjct: 303 AISTYLLMLRGYAHSGNLKKMEEMYELVRHHANDEEIPLIRAMICAYCRSSVKDRVQKIH 362 Query: 1192 ELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 L+ IP+D Y+P L+VLLI LYA+E E ME SINEAF Sbjct: 363 TLMNLIPEDQYRPWLHVLLIKLYAQEDCFEAMERSINEAF 402 >ref|XP_006294198.1| hypothetical protein CARUB_v10023194mg [Capsella rubella] gi|482562906|gb|EOA27096.1| hypothetical protein CARUB_v10023194mg [Capsella rubella] Length = 452 Score = 299 bits (766), Expect = 2e-78 Identities = 151/340 (44%), Positives = 218/340 (64%) Frame = +1 Query: 292 TNFGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALF 471 +NF L++ + K + + D +KV+ L+EE+L + EK +L++NG LF Sbjct: 3 SNFVKLSTQYICKFTRATSLQAHHTDMVQKVSVLKEELLTIGNSKEKFRDVLDQNGQWLF 62 Query: 472 RRYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVL 651 R Y DG+ I+EL+ QL A++ WRR Q DY P+T EEY+KGI IAG+ ++ L Sbjct: 63 RSYRDGAGIIELMDQLFPRHYLALQVLDWRRSQTDYCIPLTSEEYAKGIKIAGRARDVNL 122 Query: 652 AVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXX 831 AV LF EA+ K+++ S++NALMSVYM +G +CQS+F+D + CAP Sbjct: 123 AVYLFDEAAKKRIQTASVYNALMSVYMWNGFADECQSLFRDFTRQTHCAPTIVTYNILIS 182 Query: 832 XXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKP 1011 L+ + NMEA F+EL+ L L+PN TY +I+GY++AW W++ME T+ MK GP++P Sbjct: 183 VYGRLLMVKNMEAAFEELQKLKLSPNSVTYNFLIAGYVTAWNWDKMEATFREMKSGPVEP 242 Query: 1012 DLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVE 1191 D D + LMLRGYA + +EKMEE+Y ++D V N L+R MI+AYC+ RV+++E Sbjct: 243 DTDTYQLMLRGYANSGNLEKMEEMYGAIKDQVGLNSGPLVRAMISAYCKKSVEDRVQKIE 302 Query: 1192 ELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 LL +P ++Y P LNVLLI LYA+E ++E MEN INEAF Sbjct: 303 NLLSLLPGEEYLPWLNVLLIRLYAQEDIVEAMENKINEAF 342 >ref|XP_004488316.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30780-like [Cicer arietinum] Length = 506 Score = 295 bits (754), Expect = 4e-77 Identities = 157/340 (46%), Positives = 216/340 (63%), Gaps = 5/340 (1%) Frame = +1 Query: 307 LASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYPD 486 L SLFT + + T ++K+ LR E++ ++ +++ IL+ N L R + Sbjct: 55 LVSLFT----VSSSSTTPDNYLKKKITNLRRELVREASDSVRVKSILDHNSEHLIRSH-- 108 Query: 487 GSAIVELLRQLQNSPSAAMEAFGWRRK----QLDYA-SPMTIEEYSKGICIAGKLNNLVL 651 +ELL QL + PS A+E F WRRK + D + M EYSKGI AG+ N+ + Sbjct: 109 -LIFLELLNQLNSRPSLALEVFNWRRKRNVSEFDACRNSMNAHEYSKGIKAAGRSRNVDV 167 Query: 652 AVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXX 831 AVELFKEA +K TS +NALM +M + L +C S+F D+KKDPTC P Sbjct: 168 AVELFKEAEYNGVKITSTYNALMGAFMFNDLAERCYSLFIDMKKDPTCTPSIATYNIVIS 227 Query: 832 XXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKP 1011 LM ID+MEATFKE+ +L ++PNI TY +I GYIS WMW++MEK + +K GP++P Sbjct: 228 VFGRLMLIDHMEATFKEMNELQISPNILTYNYLIGGYISTWMWDDMEKVFQVLKSGPVEP 287 Query: 1012 DLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVE 1191 + +LLM+RGYA + +EKMEE+Y +VRDHV+ NE LIR MI AYC+S R++++E Sbjct: 288 NKKTYLLMIRGYAHSGNLEKMEEVYSLVRDHVNENEMELIRTMICAYCKSSDVDRMKKIE 347 Query: 1192 ELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 LL IP++DY+P LNVLLI LYA+E LE ME +INEAF Sbjct: 348 ALLKLIPEEDYRPWLNVLLIKLYADEGWLEAMEKAINEAF 387 >ref|XP_002322993.2| hypothetical protein POPTR_0016s12710g [Populus trichocarpa] gi|550321372|gb|EEF04754.2| hypothetical protein POPTR_0016s12710g [Populus trichocarpa] Length = 549 Score = 293 bits (749), Expect = 1e-76 Identities = 155/336 (46%), Positives = 220/336 (65%), Gaps = 6/336 (1%) Frame = +1 Query: 322 TDKLSYNRTETN-----RSKDFREKVAGLREEILAHRENAEKIEKILEENGVALFRRYPD 486 T+K+S R N + +D K++ LR+E+L +N + + +ILEE + Sbjct: 98 TNKISSLRNNNNDLKSKKREDLVNKISSLRDELL---QNVDMVFQILEETKKDDPSLLTN 154 Query: 487 GSAIVELLRQLQ-NSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVLAVEL 663 SA +ELL+ L +SP A++ F W+R Q + +PMT EY+KGI IAG N+ LAVE+ Sbjct: 155 SSAFLELLKLLLLSSPKVALKIFNWKRTQAENDTPMTAAEYAKGIMIAGTDKNVDLAVEI 214 Query: 664 FKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXXXXAG 843 F EA K +K TS++NALM+ M +GL KC+S+F+++K+D C P Sbjct: 215 FDEAIKKCIKTTSMYNALMTACMCNGLAGKCESLFREMKRDVKCRPSVVTYNILVSVFGR 274 Query: 844 LMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKPDLDI 1023 LM +D MEA FKE++D ++PN+ TY +ISGY++ WMW+ MEKT+ M GP+KPDL+ Sbjct: 275 LMLVDKMEAIFKEMEDSRISPNLTTYNNLISGYVTVWMWDSMEKTFQMMIAGPVKPDLNT 334 Query: 1024 HLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVEELLG 1203 HLLMLRGYA + +E+ME +Y++++DHV+A LIR MI AYC+S RV+++E L+ Sbjct: 335 HLLMLRGYAHSGHLEQMELVYELIKDHVNARRLTLIRAMICAYCKSSIPERVQKIEALMR 394 Query: 1204 KIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 IP+ +Y+P LNVLLI LYAEE LE MENSINEAF Sbjct: 395 LIPEKEYRPWLNVLLIKLYAEEDRLEGMENSINEAF 430 >ref|NP_180636.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75219588|sp|O49343.1|PP177_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g30780 gi|2880055|gb|AAC02749.1| hypothetical protein [Arabidopsis thaliana] gi|110736845|dbj|BAF00380.1| hypothetical protein [Arabidopsis thaliana] gi|330253346|gb|AEC08440.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 452 Score = 287 bits (734), Expect = 8e-75 Identities = 146/340 (42%), Positives = 217/340 (63%) Frame = +1 Query: 292 TNFGSLASLFTDKLSYNRTETNRSKDFREKVAGLREEILAHRENAEKIEKILEENGVALF 471 +NF L S F KL+ + D ++V+ L++E+L + EK + +L++ G LF Sbjct: 3 SNFVKLGSQFIGKLTRVTSLPAHHTDLVQRVSILKDELLTIGNSKEKFQNVLDQKGQWLF 62 Query: 472 RRYPDGSAIVELLRQLQNSPSAAMEAFGWRRKQLDYASPMTIEEYSKGICIAGKLNNLVL 651 R Y DG+ I+EL+ QL A++ WRR Q DY P+T EEY+KGI IAG+ ++ L Sbjct: 63 RTYRDGAGILELMDQLFPRHYLALQVLEWRRGQKDYCIPLTSEEYAKGIKIAGRARDINL 122 Query: 652 AVELFKEASNKQLKGTSLFNALMSVYMRSGLNSKCQSVFQDLKKDPTCAPXXXXXXXXXX 831 AV LF EA+ K+++ S++N+LMSVYM +GL +CQS+F+D ++ CAP Sbjct: 123 AVYLFDEAAKKRMQTASVYNSLMSVYMWNGLAEECQSLFKDFRRQTHCAPTVVTYNILVS 182 Query: 832 XXAGLMRIDNMEATFKELKDLNLAPNIHTYKGMISGYISAWMWEEMEKTYLSMKDGPIKP 1011 L+ + NMEA F+EL+ + L PN TY +I+GY++AW W++ME T+ MK GP++P Sbjct: 183 VYGRLLMVKNMEAAFEELQKVKLPPNSVTYNFLIAGYMTAWNWDKMEATFQEMKRGPVEP 242 Query: 1012 DLDIHLLMLRGYALAHKIEKMEEIYKMVRDHVDANETHLIRVMINAYCRSKHAARVERVE 1191 D D + LMLRGYA + + +MEE+Y++++D V N L+R MI AYC+ RV+++E Sbjct: 243 DTDTYQLMLRGYANSGNLNRMEEMYEVIKDQVGVNSGPLVRAMICAYCKKAVEDRVQKIE 302 Query: 1192 ELLGKIPKDDYQPCLNVLLICLYAEESLLEQMENSINEAF 1311 LL + ++Y P LNVLLI LYA+E +E ME+ INEAF Sbjct: 303 NLLSLLSGEEYLPWLNVLLIRLYAQEDFVEAMESKINEAF 342