BLASTX nr result
ID: Mentha25_contig00035281
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00035281 (754 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU31734.1| hypothetical protein MIMGU_mgv1a006094mg [Mimulus...  316     5e-84
gb|EPS63217.1| hypothetical protein M569_11569 [Genlisea aurea]      246     5e-63
ref|XP_002301563.2| hypothetical protein POPTR_0002s22630g, part...  225     1e-56
ref|XP_002512787.1| pentatricopeptide repeat-containing protein,...  223     4e-56
gb|EXB41428.1| hypothetical protein L484_007578 [Morus notabilis]    218     1e-54
ref|XP_006446854.1| hypothetical protein CICLE_v10018065mg [Citr...  216     5e-54
ref|XP_006468953.1| PREDICTED: pentatricopeptide repeat-containi...  214     3e-53
ref|XP_004159154.1| PREDICTED: uncharacterized protein LOC101226...  211     2e-52
ref|XP_004145727.1| PREDICTED: uncharacterized protein LOC101212...  211     2e-52
ref|XP_003548483.1| PREDICTED: pentatricopeptide repeat-containi...  209     1e-51
ref|XP_006844721.1| hypothetical protein AMTR_s00016p00252780 [A...  208     1e-51
ref|XP_006395538.1| hypothetical protein EUTSA_v10004197mg [Eutr...  207     4e-51
ref|NP_189297.1| pentatricopeptide repeat-containing protein [Ar...  204     3e-50
ref|XP_002876985.1| pentatricopeptide repeat-containing protein ...  202     1e-49
ref|XP_003553320.1| PREDICTED: pentatricopeptide repeat-containi...  201     2e-49
ref|XP_003624556.1| Pentatricopeptide repeat-containing protein ...  201     3e-49
ref|XP_006291116.1| hypothetical protein CARUB_v10017228mg [Caps...  197     2e-48
ref|XP_007161786.1| hypothetical protein PHAVU_001G097900g [Phas...  197     4e-48
ref|XP_007217099.1| hypothetical protein PRUPE_ppa022877mg, part...
159 1e-36 ref|XP_002875497.1| pentatricopeptide repeat-containing protein ... 155 1e-35 >gb|EYU31734.1| hypothetical protein MIMGU_mgv1a006094mg [Mimulus guttatus] Length = 458 Score = 316 bits (810), Expect = 5e-84 Identities = 152/217 (70%), Positives = 182/217 (83%) Frame = +3 Query: 102 SLSTLDVLPARNFQPTARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVAR 281 SL T+D+LPARN ++R SF+ Q+ALIFLQ+CA FKQMKQIHAKIIR SL +QV+VA+ Sbjct: 51 SLGTVDILPARNIPASSRPSFTSQDALIFLQRCATFKQMKQIHAKIIRNSLDQHQVVVAK 110 Query: 282 LIRLCSSYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVD 461 LIRLCSSY E DYAA VF+ +DNPSTF WNLLIR YTVN+C RAILL+N MI RGV VD Sbjct: 111 LIRLCSSYGEPDYAASVFEQIDNPSTFAWNLLIRAYTVNNCSNRAILLFNLMICRGVCVD 170 Query: 462 KFTFPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFD 641 KFTFPFVMK+CL V+KA E+YGFAV++GF GDVYLDNVL+D+Y KCG L+D LK+FD Sbjct: 171 KFTFPFVMKSCLDCCSVEKAKEVYGFAVRTGFVGDVYLDNVLLDLYFKCGVLDDGLKLFD 230 Query: 642 KMRHRSIVSWTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 KMR R++ SWTT+I+GLVLNGR+D AQ++FD MP RN Sbjct: 231 KMRFRTVFSWTTVISGLVLNGRIDAAQQLFDEMPDRN 267 Score = 91.3 bits (225), Expect = 3e-16 Identities = 44/147 (29%), Positives = 83/147 (56%) Frame = +3 Query: 312 LDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKA 491 +D A +F + + + W+ +I+ Y ++ P +A L+ M +++T ++KA Sbjct: 253 IDAAQQLFDEMPDRNVVSWSAMIKGYAESETPEKAFELFVEMQHDDAKPNEYTLVGLLKA 312 Query: 492 CLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSW 671 C+ + +K I+ FA+K+GF V+L LID+Y KCG+L AL+VF++M +++ +W Sbjct: 313 CIKLESLKLGCWIHDFAIKNGFEISVFLGTALIDMYSKCGSLEYALRVFEEMEVKNVATW 372 Query: 672 TTLIAGLVLNGRVDTAQKVFDTMPMRN 752 +I+ L ++G A +F+ M N Sbjct: 373 NVMISSLGVHGHSQEALALFEEMEKMN 399 >gb|EPS63217.1| hypothetical protein M569_11569 [Genlisea aurea] Length = 417 Score = 246 bits (629), Expect = 5e-63 Identities = 131/214 (61%), Positives = 161/214 (75%), Gaps = 4/214 (1%) Frame = +3 Query: 123 LPARNFQPTARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSL--HHNQVIVARLIRLC 296 L +RNF +FS EAL+ L +C +F QMKQIH +IIR+ L H NQ+IVARLIRLC 
Sbjct: 18 LRSRNFPSP---NFSSSEALLCLHRCTSFNQMKQIHGRIIRSGLQQHQNQLIVARLIRLC 74 Query: 297 SSYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFP 476 SS+ +LDYA LV + +++P++F WNLLIR +T N P +A+LLYN MI RGV DKFTFP Sbjct: 75 SSFGKLDYAYLVLERIEDPTSFSWNLLIRAHTENGRPVKAVLLYNLMIRRGVDADKFTFP 134 Query: 477 FVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKM-RH 653 F MKACLA V+KA E+YGFAVK G DVYL+N+L+ VY K GAL+DALKVFDKM RH Sbjct: 135 FAMKACLASGFVEKARELYGFAVKRGMWRDVYLNNLLMAVYFKYGALDDALKVFDKMPRH 194 Query: 654 -RSIVSWTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 ++ VSWTT IAGL+ GRVD A+K+FD MP RN Sbjct: 195 NKTAVSWTTAIAGLLRRGRVDHARKLFDEMPRRN 228 Score = 68.9 bits (167), Expect = 2e-09 Identities = 42/172 (24%), Positives = 82/172 (47%), Gaps = 3/172 (1%) Frame = +3 Query: 234 KIIRTSLHHNQVIVARLIRLCSSYRE--LDYAALVFQHVDNPSTFVWNLLIRTYTVNDCP 407 K+ HN+ V+ + R +D+A +F + + + +I Y ++ P Sbjct: 186 KVFDKMPRHNKTAVSWTTAIAGLLRRGRVDHARKLFDEMPRRNVVSFTAMINGYARSEKP 245 Query: 408 YRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVL 587 RA L+ M V +++T + +C + ++ + ++ +A +GF +L L Sbjct: 246 ERAFELFARMQREDVTPNEYTLVGLAMSCSRLGILELGHRVHEYATGNGFEIGPFLGTAL 305 Query: 588 IDVYMKCGALNDALKVFDKMRHR-SIVSWTTLIAGLVLNGRVDTAQKVFDTM 740 ID+Y KCG+ + A +VF+ M R S +W +I L ++G + A +F+ M Sbjct: 306 IDMYSKCGSPDHAKRVFEGMEERSSAATWNAMITSLGIHGNGEEALSLFEEM 357 >ref|XP_002301563.2| hypothetical protein POPTR_0002s22630g, partial [Populus trichocarpa] gi|550345613|gb|EEE80836.2| hypothetical protein POPTR_0002s22630g, partial [Populus trichocarpa] Length = 245 Score = 225 bits (573), Expect = 1e-56 Identities = 108/202 (53%), Positives = 139/202 (68%) Frame = +3 Query: 147 TARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAA 326 T F EAL+ LQ C +F +K +H KIIR +L NQ++V +LI LCSSY LDYAA Sbjct: 33 TCWPKFGSAEALLLLQNCTSFNHLKLVHGKIIRNALSANQLLVRKLIHLCSSYGRLDYAA 92 Query: 327 LVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAIS 506 L+F V P TF WN LIRTYT++ +A+LLYN MI RG DKFTFPFV+KACLA Sbjct: 93 
LLFHQVQEPHTFTWNFLIRTYTIHGYSMKALLLYNLMIRRGFPPDKFTFPFVVKACLASG 152 Query: 507 CVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIA 686 ++K E++G A+K+GF D++L N L+D+Y CG KVFDK+R R++VSWTT IA Sbjct: 153 SIRKGKEVHGLAIKTGFSKDMFLYNTLMDLYFSCGDEGYGRKVFDKLRVRNVVSWTTFIA 212 Query: 687 GLVLNGRVDTAQKVFDTMPMRN 752 GLV+ G +D A++ FD MP RN Sbjct: 213 GLVVCGDLDAARRAFDQMPTRN 234 >ref|XP_002512787.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223547798|gb|EEF49290.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 480 Score = 223 bits (569), Expect = 4e-56 Identities = 107/200 (53%), Positives = 140/200 (70%) Frame = +3 Query: 153 RSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALV 332 + F QEAL LQK +NF +K + AKIIR +L +Q++V +L+RLC SY+++DYA L+ Sbjct: 27 KPKFGSQEALNLLQKGSNFTHVKLVQAKIIRNNLSDDQLLVRKLLRLCFSYQKVDYATLI 86 Query: 333 FQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCV 512 F + NP TF WN +IR Y N +A+LLYN MI G DKFTFPFV+KACL S + Sbjct: 87 FDQIQNPHTFTWNFMIRAYNYNGNSQQALLLYNLMICEGFSPDKFTFPFVIKACLDHSAL 146 Query: 513 KKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGL 692 K E++GFA+K+GF D +L N L+D+Y KCG L+ A K+FDKM RS+VSWTT +AGL Sbjct: 147 DKGKEVHGFAIKTGFWKDTFLSNTLMDLYFKCGDLDYARKLFDKMAVRSVVSWTTFVAGL 206 Query: 693 VLNGRVDTAQKVFDTMPMRN 752 V G +DTA+ FD MPMRN Sbjct: 207 VACGELDTARAAFDEMPMRN 226 Score = 92.4 bits (228), Expect = 1e-16 Identities = 54/186 (29%), Positives = 93/186 (50%) Frame = +3 Query: 195 KCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQHVDNPSTFVWNL 374 KC + +++ K+ S+ VA L+ C ELD A F + + W Sbjct: 177 KCGDLDYARKLFDKMAVRSVVSWTTFVAGLVA-CG---ELDTARAAFDEMPMRNVVSWTA 232 Query: 375 LIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKSG 554 +I Y N P A L+ M V + FT +++AC + ++ I+ +A+++G Sbjct: 233 MINGYVKNQRPQEAFELFQRMQLANVRPNGFTLVGLLRACTELGSLELGRRIHEYALENG 292 Query: 555 FRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLNGRVDTAQKVFD 734 F+ V+L LID+Y KCG++ DA KVF++M+ +S+ +W ++I L ++G 
A +F Sbjct: 293 FKVGVFLGTALIDMYSKCGSIEDAKKVFEEMQKKSLATWNSMITSLGVHGFGKEALALFA 352 Query: 735 TMPMRN 752 M N Sbjct: 353 QMEEAN 358 >gb|EXB41428.1| hypothetical protein L484_007578 [Morus notabilis] Length = 428 Score = 218 bits (556), Expect = 1e-54 Identities = 99/204 (48%), Positives = 146/204 (71%), Gaps = 1/204 (0%) Frame = +3 Query: 144 PTARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYA 323 PT+++ F +EA FLQ C +F+Q+KQIHAKIIR+ L H+Q+++ ++++ CS+ +DYA Sbjct: 14 PTSKTKFGSEEAFTFLQNCTSFRQLKQIHAKIIRSGLSHDQLLLRKMLQFCSTSGNMDYA 73 Query: 324 ALVFQH-VDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLA 500 ALVF+H + P TF WNL+IR YT+N P +A+LL+ M RG DKFTFPFV+KAC A Sbjct: 74 ALVFRHQIPYPLTFTWNLMIRAYTLNASPRQALLLFTLMTSRGFPPDKFTFPFVIKACTA 133 Query: 501 ISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTL 680 S + + ++G A+K+ F GD+++ N L+D Y KCG + KVFDKMR R++VSWTT+ Sbjct: 134 SSAFRPGDAVHGLAIKARFSGDIFVQNTLMDFYFKCGDAHSGRKVFDKMRVRNLVSWTTM 193 Query: 681 IAGLVLNGRVDTAQKVFDTMPMRN 752 + GLV +G + A+ +F+ MP +N Sbjct: 194 VTGLVGSGDLRAARAIFEQMPAKN 217 Score = 97.8 bits (242), Expect = 4e-18 Identities = 64/222 (28%), Positives = 108/222 (48%), Gaps = 20/222 (9%) Frame = +3 Query: 147 TARSSFSPQEAL------------IFLQ--------KCANFKQMKQIHAKIIRTSLHHNQ 266 TA S+F P +A+ IF+Q KC + +++ K+ +L Sbjct: 132 TASSAFRPGDAVHGLAIKARFSGDIFVQNTLMDFYFKCGDAHSGRKVFDKMRVRNLVSWT 191 Query: 267 VIVARLIRLCSSYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFR 446 +V L+ +L A +F+ + + W ++I Y + P A L+ M Sbjct: 192 TMVTGLV----GSGDLRAARAIFEQMPAKNVVSWTIMIDGYVEDRQPEEAFKLFRRMQLD 247 Query: 447 GVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDA 626 V ++FT ++KAC + +K ++ FA+K+GF DV+ LID Y KCG+L DA Sbjct: 248 NVSPNEFTLVSLLKACTELGSLKLGRWVHDFALKNGFELDVFFGTALIDTYSKCGSLEDA 307 Query: 627 LKVFDKMRHRSIVSWTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 +VFDKM+ +SI +W ++I L ++G + A +F M +N Sbjct: 308 RRVFDKMQAKSIATWNSMITSLGVHGFGEEALALFAEMERQN 349 >ref|XP_006446854.1| hypothetical protein CICLE_v10018065mg [Citrus clementina] 
gi|557549465|gb|ESR60094.1| hypothetical protein CICLE_v10018065mg [Citrus clementina] Length = 438 Score = 216 bits (551), Expect = 5e-54 Identities = 103/200 (51%), Positives = 139/200 (69%) Frame = +3 Query: 153 RSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALV 332 R F QEAL+ L+KC NF Q+K IHAKIIR L ++Q++V +L+ LCS Y + D+A LV Sbjct: 25 RLKFGYQEALVLLRKCRNFGQLKLIHAKIIRHGLSNDQLLVRKLLDLCSFYGKTDHALLV 84 Query: 333 FQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCV 512 F + P F WNL+IR T+N +A+LLYN MI G DKFTFPFV KAC+ + Sbjct: 85 FSQIQCPHVFTWNLMIRALTINGSSRQALLLYNLMICNGFRPDKFTFPFVFKACITSLAI 144 Query: 513 KKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGL 692 +K E++G AVK+GF D+++ N L+D+Y KCG +N KVFDKMR RS+VSWTT+I+GL Sbjct: 145 EKGKEVHGLAVKAGFSRDMFVQNTLMDLYFKCGDVNGGRKVFDKMRVRSVVSWTTMISGL 204 Query: 693 VLNGRVDTAQKVFDTMPMRN 752 +G +D A++VF+ M RN Sbjct: 205 AASGDLDAARRVFEQMQTRN 224 Score = 102 bits (255), Expect = 1e-19 Identities = 52/148 (35%), Positives = 83/148 (56%) Frame = +3 Query: 309 ELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMK 488 +LD A VF+ + + W +I Y N+ + A L+ M+ V ++FT +++ Sbjct: 209 DLDAARRVFEQMQTRNVVSWTAMINAYVRNERAHEAFELFQRMLLDNVRPNEFTLVSLLQ 268 Query: 489 ACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVS 668 AC + +K N I+ FA+K+GF VYL LID+Y KCG+L DA KVFDKM +++ + Sbjct: 269 ACTELGSLKLGNWIHDFALKNGFVLGVYLGTALIDMYSKCGSLEDARKVFDKMEIKNLAT 328 Query: 669 WTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 W ++I L ++G + A +F M N Sbjct: 329 WNSMITSLGVHGHGEEALALFAQMENAN 356 >ref|XP_006468953.1| PREDICTED: pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X1 [Citrus sinensis] gi|568829286|ref|XP_006468954.1| PREDICTED: pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X2 [Citrus sinensis] gi|568829288|ref|XP_006468955.1| PREDICTED: pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X3 [Citrus sinensis] Length = 435 Score = 214 bits (545), Expect = 
3e-53 Identities = 102/200 (51%), Positives = 140/200 (70%) Frame = +3 Query: 153 RSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALV 332 R F QEAL+ L+KC NF Q+K IHAKIIR L ++Q++V +L+ LCS Y + D+A LV Sbjct: 25 RLKFGYQEALVLLRKCRNFGQLKLIHAKIIRHGLSNDQLLVRKLLDLCSFYGKTDHALLV 84 Query: 333 FQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCV 512 F + P F WNL+IR T+N +A+LLYN MI G DKFTFPFV+KAC+A + Sbjct: 85 FSQIQCPHVFTWNLMIRALTINGSSQQALLLYNLMICNGFRPDKFTFPFVIKACVASLAI 144 Query: 513 KKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGL 692 +K E++G AVK+GF D+++ N L+D+Y KCG ++ VFDKMR RS+VSWTT+I+GL Sbjct: 145 EKGKEVHGLAVKAGFSRDMFVQNTLMDLYFKCGDVDGGRMVFDKMRVRSVVSWTTMISGL 204 Query: 693 VLNGRVDTAQKVFDTMPMRN 752 +G +D A++VF+ M RN Sbjct: 205 AASGDLDAARRVFEQMQTRN 224 Score = 101 bits (251), Expect = 3e-19 Identities = 52/148 (35%), Positives = 82/148 (55%) Frame = +3 Query: 309 ELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMK 488 +LD A VF+ + + W +I Y N+ A L+ M+ V ++FT +++ Sbjct: 209 DLDAARRVFEQMQTRNVVSWTAMINAYVRNERAQEAFELFQRMLLDNVRPNEFTLVSLLQ 268 Query: 489 ACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVS 668 AC + +K N I+ FA+K+GF VYL LID+Y KCG+L DA KVFDKM +++ + Sbjct: 269 ACTELRSLKLGNWIHDFALKNGFVLGVYLGTALIDMYSKCGSLEDARKVFDKMEIKNLAT 328 Query: 669 WTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 W ++I L ++G + A +F M N Sbjct: 329 WNSMITSLGVHGHGEEALALFAQMENAN 356 >ref|XP_004159154.1| PREDICTED: uncharacterized protein LOC101226880 [Cucumis sativus] Length = 1725 Score = 211 bits (538), Expect = 2e-52 Identities = 98/214 (45%), Positives = 144/214 (67%) Frame = +3 Query: 111 TLDVLPARNFQPTARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIR 290 T DV P++N T R + ++AL LQ C NFK ++QIHAKIIR+ L ++Q++ +LI Sbjct: 8 THDVFPSKNIPLTPRGNIRAKKALFLLQNCKNFKHLRQIHAKIIRSGLSNDQLLTRKLIH 67 Query: 291 LCSSYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFT 470 L S++ + YA L+F + NP TF WNL+IR T+N +A++LY +M+ +G+ DKFT Sbjct: 68 LYSTHGRIAYAILLFYQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQGIAADKFT 
127 Query: 471 FPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMR 650 FPFV+KAC + ++G +K GF GDV++ N LID Y KCG ALKVF+KMR Sbjct: 128 FPFVIKACTNFLSIDLGKVVHGSLIKYGFSGDVFVQNNLIDFYFKCGHTRFALKVFEKMR 187 Query: 651 HRSIVSWTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 R++VSWTT+I+GL+ G + A+++FD +P +N Sbjct: 188 VRNVVSWTTVISGLISCGDLQEARRIFDEIPSKN 221 Score = 85.5 bits (210), Expect = 2e-14 Identities = 50/197 (25%), Positives = 95/197 (48%) Frame = +3 Query: 162 FSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQH 341 F + F KC + + ++ K+ ++ +++ LI S +L A +F Sbjct: 161 FVQNNLIDFYFKCGHTRFALKVFEKMRVRNVVSWTTVISGLI----SCGDLQEARRIFDE 216 Query: 342 VDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKA 521 + + + W +I Y N P A+ L+ M + +++T ++KAC + + Sbjct: 217 IPSKNVVSWTAMINGYIRNQQPEEALELFKRMQAENIFPNEYTMVSLIKACTEMGILTLG 276 Query: 522 NEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLN 701 I+ +A+K+ VYL LID+Y KCG++ DA++VF+ M +S+ +W ++I L ++ Sbjct: 277 RGIHDYAIKNCIEIGVYLGTALIDMYSKCGSIKDAIEVFETMPRKSLPTWNSMITSLGVH 336 Query: 702 GRVDTAQKVFDTMPMRN 752 G A +F M N Sbjct: 337 GLGQEALNLFSEMERVN 353 Score = 79.3 bits (194), Expect = 1e-12 Identities = 42/137 (30%), Positives = 73/137 (53%) Frame = +3 Query: 321 AALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLA 500 A +++VD + WN +I A+ ++S+ G+ + +FP +K+C A Sbjct: 1095 ATWFYKYVDKSNVHSWNSVIADLARGGDSVEALRAFSSLRKLGLIPTRSSFPCTIKSCSA 1154 Query: 501 ISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTL 680 + + + A GF D+++ + LID+Y KCG L DA +FD++ R++VSWT++ Sbjct: 1155 LCDLVSGRMSHQQAFVFGFETDLFVSSALIDMYSKCGQLKDARALFDEIPLRNVVSWTSM 1214 Query: 681 IAGLVLNGRVDTAQKVF 731 I G V N + D A +F Sbjct: 1215 ITGYVQNEQADNALLLF 1231 Score = 73.9 bits (180), Expect = 5e-11 Identities = 57/221 (25%), Positives = 91/221 (41%), Gaps = 8/221 (3%) Frame = +3 Query: 102 SLSTLDVLPARNFQPTARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVAR 281 SL L ++P R+ P S S L+ + H + + + + Sbjct: 1132 SLRKLGLIPTRSSFPCTIKSCSALCDLV---------SGRMSHQQAFVFGFETDLFVSSA 1182 
Query: 282 LIRLCSSYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFR----- 446 LI + S +L A +F + + W +I Y N+ A+LL+ + Sbjct: 1183 LIDMYSKCGQLKDARALFDEIPLRNVVSWTSMITGYVQNEQADNALLLFKDFLEEETEVE 1242 Query: 447 ---GVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGAL 617 V +D V+ AC +S ++GF VK GF G + + N L+D Y KCG Sbjct: 1243 DGNNVPLDSVVMVSVLSACSRVSGKGITEGVHGFVVKKGFDGSIGVGNTLMDAYAKCGQP 1302 Query: 618 NDALKVFDKMRHRSIVSWTTLIAGLVLNGRVDTAQKVFDTM 740 + KVFD M + +SW ++IA +G A +VF M Sbjct: 1303 LVSKKVFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGM 1343 Score = 72.0 bits (175), Expect = 2e-10 Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 1/138 (0%) Frame = +3 Query: 330 VFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFR-GVGVDKFTFPFVMKACLAIS 506 VF ++ WN +I Y + A+ +++ M+ GV + T V+ AC Sbjct: 1308 VFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVGVRYNAVTLSAVLLACAHAG 1367 Query: 507 CVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIA 686 ++ I+ +K +V + +ID+Y KCG + A K FD+M+ +++ SWT ++A Sbjct: 1368 ALRAGKCIHDQVIKMDLEYNVCVGTSIIDMYCKCGRVEMAKKTFDRMKEKNVKSWTAMVA 1427 Query: 687 GLVLNGRVDTAQKVFDTM 740 G ++GR A +F M Sbjct: 1428 GYGMHGRAKEALDIFYKM 1445 >ref|XP_004145727.1| PREDICTED: uncharacterized protein LOC101212001 [Cucumis sativus] Length = 2598 Score = 211 bits (538), Expect = 2e-52 Identities = 98/214 (45%), Positives = 144/214 (67%) Frame = +3 Query: 111 TLDVLPARNFQPTARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIR 290 T DV P++N T R + ++AL LQ C NFK ++QIHAKIIR+ L ++Q++ +LI Sbjct: 8 THDVFPSKNIPLTPRGNIRAKKALFLLQNCKNFKHLRQIHAKIIRSGLSNDQLLTRKLIH 67 Query: 291 LCSSYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFT 470 L S++ + YA L+F + NP TF WNL+IR T+N +A++LY +M+ +G+ DKFT Sbjct: 68 LYSTHGRIAYAILLFYQIQNPCTFTWNLIIRANTINGLSEQALMLYKNMVCQGIAADKFT 127 Query: 471 FPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMR 650 FPFV+KAC + ++G +K GF GDV++ N LID Y KCG ALKVF+KMR Sbjct: 128 FPFVIKACTNFLSIDLGKVVHGSLIKYGFSGDVFVQNNLIDFYFKCGHTRFALKVFEKMR 187 Query: 651 HRSIVSWTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 
R++VSWTT+I+GL+ G + A+++FD +P +N Sbjct: 188 VRNVVSWTTVISGLISCGDLQEARRIFDEIPSKN 221 Score = 85.5 bits (210), Expect = 2e-14 Identities = 50/197 (25%), Positives = 95/197 (48%) Frame = +3 Query: 162 FSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQH 341 F + F KC + + ++ K+ ++ +++ LI S +L A +F Sbjct: 161 FVQNNLIDFYFKCGHTRFALKVFEKMRVRNVVSWTTVISGLI----SCGDLQEARRIFDE 216 Query: 342 VDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKA 521 + + + W +I Y N P A+ L+ M + +++T ++KAC + + Sbjct: 217 IPSKNVVSWTAMINGYIRNQQPEEALELFKRMQAENIFPNEYTMVSLIKACTEMGILTLG 276 Query: 522 NEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLN 701 I+ +A+K+ VYL LID+Y KCG++ DA++VF+ M +S+ +W ++I L ++ Sbjct: 277 RGIHDYAIKNCIEIGVYLGTALIDMYSKCGSIKDAIEVFETMPRKSLPTWNSMITSLGVH 336 Query: 702 GRVDTAQKVFDTMPMRN 752 G A +F M N Sbjct: 337 GLGQEALNLFSEMERVN 353 Score = 79.3 bits (194), Expect = 1e-12 Identities = 42/137 (30%), Positives = 73/137 (53%) Frame = +3 Query: 321 AALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLA 500 A +++VD + WN +I A+ ++S+ G+ + +FP +K+C A Sbjct: 1968 ATWFYKYVDKSNVHSWNSVIADLARGGDSVEALRAFSSLRKLGLIPTRSSFPCTIKSCSA 2027 Query: 501 ISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTL 680 + + + A GF D+++ + LID+Y KCG L DA +FD++ R++VSWT++ Sbjct: 2028 LCDLVSGRMSHQQAFVFGFETDLFVSSALIDMYSKCGQLKDARALFDEIPLRNVVSWTSM 2087 Query: 681 IAGLVLNGRVDTAQKVF 731 I G V N + D A +F Sbjct: 2088 ITGYVQNEQADNALLLF 2104 Score = 73.9 bits (180), Expect = 5e-11 Identities = 57/221 (25%), Positives = 91/221 (41%), Gaps = 8/221 (3%) Frame = +3 Query: 102 SLSTLDVLPARNFQPTARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVAR 281 SL L ++P R+ P S S L+ + H + + + + Sbjct: 2005 SLRKLGLIPTRSSFPCTIKSCSALCDLV---------SGRMSHQQAFVFGFETDLFVSSA 2055 Query: 282 LIRLCSSYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFR----- 446 LI + S +L A +F + + W +I Y N+ A+LL+ + Sbjct: 2056 LIDMYSKCGQLKDARALFDEIPLRNVVSWTSMITGYVQNEQADNALLLFKDFLEEETEVE 2115 Query: 447 
---GVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGAL 617 V +D V+ AC +S ++GF VK GF G + + N L+D Y KCG Sbjct: 2116 DGNNVPLDSVVMVSVLSACSRVSGKGITEGVHGFVVKKGFDGSIGVGNTLMDAYAKCGQP 2175 Query: 618 NDALKVFDKMRHRSIVSWTTLIAGLVLNGRVDTAQKVFDTM 740 + KVFD M + +SW ++IA +G A +VF M Sbjct: 2176 LVSKKVFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGM 2216 Score = 72.0 bits (175), Expect = 2e-10 Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 1/138 (0%) Frame = +3 Query: 330 VFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFR-GVGVDKFTFPFVMKACLAIS 506 VF ++ WN +I Y + A+ +++ M+ GV + T V+ AC Sbjct: 2181 VFDWMEEKDDISWNSMIAVYAQSGLSGEALEVFHGMVRHVGVRYNAVTLSAVLLACAHAG 2240 Query: 507 CVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIA 686 ++ I+ +K +V + +ID+Y KCG + A K FD+M+ +++ SWT ++A Sbjct: 2241 ALRAGKCIHDQVIKMDLEYNVCVGTSIIDMYCKCGRVEMAKKTFDRMKEKNVKSWTAMVA 2300 Query: 687 GLVLNGRVDTAQKVFDTM 740 G ++GR A +F M Sbjct: 2301 GYGMHGRAKEALDIFYKM 2318 >ref|XP_003548483.1| PREDICTED: pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X1 [Glycine max] gi|571525647|ref|XP_006598987.1| PREDICTED: pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like isoform X2 [Glycine max] Length = 483 Score = 209 bits (531), Expect = 1e-51 Identities = 93/202 (46%), Positives = 145/202 (71%) Frame = +3 Query: 147 TARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAA 326 T R+ F +EAL+ LQKC+NFKQ+KQ+H KIIR L ++Q+++ +LI+L SSY ++ YA Sbjct: 18 TPRTRFGSEEALVLLQKCSNFKQLKQVHGKIIRFGLTYDQLLMRKLIQLSSSYGKMKYAT 77 Query: 327 LVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAIS 506 LVF ++ P F WN++IR +T+ P A+LL+ +M+ +G DKFT+PFV+ AC+A S Sbjct: 78 LVFDQLNAPDVFTWNVMIRAFTIGGSPKMALLLFKAMLCQGFAPDKFTYPFVINACMASS 137 Query: 507 CVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIA 686 + + A+K GF GD+Y+ N ++++Y KC ++D KVFDKMR R++ +WTT+I+ Sbjct: 138 ALDLGIVAHALAIKMGFWGDLYVQNTMMNLYFKCENVDDGRKVFDKMRVRNVFAWTTVIS 197 Query: 687 GLVLNGRVDTAQKVFDTMPMRN 752 GLV G++DTA+++F+ MP 
+N Sbjct: 198 GLVACGKLDTARELFEQMPSKN 219 Score = 91.3 bits (225), Expect = 3e-16 Identities = 54/187 (28%), Positives = 97/187 (51%), Gaps = 1/187 (0%) Frame = +3 Query: 195 KCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQHVDNPSTFVWNL 374 KC N +++ K+ ++ +++ L+ C +LD A +F+ + + + W Sbjct: 170 KCENVDDGRKVFDKMRVRNVFAWTTVISGLVA-CG---KLDTARELFEQMPSKNVVSWTA 225 Query: 375 LIRTYTVNDCPYRAILLYNSMI-FRGVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKS 551 +I Y + P A L+ M V +++T +++AC + +K ++ FA+K+ Sbjct: 226 MIDGYVKHKQPIEAFNLFERMQQVDNVRPNEYTLVSLVRACTEMGSLKLGRRVHDFALKN 285 Query: 552 GFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLNGRVDTAQKVF 731 GF + +L LID+Y KCG L+DA VFD M+ R++ +W T+I L ++G D A +F Sbjct: 286 GFELEPFLGTALIDMYSKCGYLDDARTVFDMMQVRTLATWNTMITSLGVHGYRDEALSLF 345 Query: 732 DTMPMRN 752 D M N Sbjct: 346 DEMEKAN 352 >ref|XP_006844721.1| hypothetical protein AMTR_s00016p00252780 [Amborella trichopoda] gi|548847192|gb|ERN06396.1| hypothetical protein AMTR_s00016p00252780 [Amborella trichopoda] Length = 428 Score = 208 bits (530), Expect = 1e-51 Identities = 96/202 (47%), Positives = 142/202 (70%) Frame = +3 Query: 147 TARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAA 326 ++ SFS +AL LQKC+ + QIHA + RT LH + +++ +LI LCS ++++D+A Sbjct: 24 SSNPSFSHYQALSLLQKCSTSNHLLQIHAHLFRTGLHRDYILITKLINLCSIHQKIDHAT 83 Query: 327 LVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAIS 506 LVF ++NP TF WN +IR Y ++ P AIL+YN M+ G DKFT+PFV+KAC+A S Sbjct: 84 LVFNQIENPLTFTWNTMIRAYFKSNYPEEAILMYNLMVIHGFLPDKFTYPFVIKACVAFS 143 Query: 507 CVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIA 686 ++K EI+G A+K+G D++L N L+++YMKC A K+FDKM +S+VSWTT++A Sbjct: 144 SLEKGKEIHGRAIKAGMVPDIFLQNTLMELYMKCNEKTLAHKLFDKMSVKSVVSWTTMVA 203 Query: 687 GLVLNGRVDTAQKVFDTMPMRN 752 GLV +G + +A++VFD MP RN Sbjct: 204 GLVSHGDMASARRVFDEMPERN 225 Score = 94.0 bits (232), Expect = 5e-17 Identities = 47/147 (31%), Positives = 79/147 (53%) Frame = +3 Query: 300 SYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPF 479 S+ 
++ A VF + + W +I Y N+ P+ A+ L+ M+ V ++FT Sbjct: 207 SHGDMASARRVFDEMPERNVVSWTAMIHGYVRNNQPHEALELFILMLRANVRPNEFTIVS 266 Query: 480 VMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRS 659 ++ C +++ ++ ++ F KSGF VYL LID+Y CG++NDA VFD M RS Sbjct: 267 LLLVCTSLNSLRLGRWVHEFMAKSGFELSVYLGTALIDMYSNCGSINDAKNVFDGMSERS 326 Query: 660 IVSWTTLIAGLVLNGRVDTAQKVFDTM 740 + +W ++I L ++G+ A VF M Sbjct: 327 VATWNSMITSLGVHGKGKEALNVFGAM 353 >ref|XP_006395538.1| hypothetical protein EUTSA_v10004197mg [Eutrema salsugineum] gi|557092177|gb|ESQ32824.1| hypothetical protein EUTSA_v10004197mg [Eutrema salsugineum] Length = 453 Score = 207 bits (526), Expect = 4e-51 Identities = 100/197 (50%), Positives = 142/197 (72%) Frame = +3 Query: 162 FSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQH 341 F +A FL+ C+NF Q+KQIHAKIIR +L ++Q++V +LI + SS E YA+LVF Sbjct: 18 FRSDKASFFLRNCSNFSQLKQIHAKIIRYNLTNDQLLVRQLISVSSSLGETRYASLVFSQ 77 Query: 342 VDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKA 521 + +PSTF WNL+IR+ +VND P A+LL+ M+ +DKFTFPFV+KACLA S ++ Sbjct: 78 LQSPSTFTWNLMIRSLSVNDKPREALLLFILMLSHQSQLDKFTFPFVIKACLASSSLRLG 137 Query: 522 NEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLN 701 +++G A+KSGF DV+ N L+D+Y+KCG + KVFDKM R+IVSWTT++ GLV N Sbjct: 138 TQVHGLAIKSGFFSDVFFQNTLMDLYLKCGKPDCGRKVFDKMPGRTIVSWTTMLYGLVSN 197 Query: 702 GRVDTAQKVFDTMPMRN 752 ++D+A+ +F+ MP RN Sbjct: 198 SQLDSAEIIFNQMPTRN 214 Score = 88.6 bits (218), Expect = 2e-15 Identities = 46/147 (31%), Positives = 78/147 (53%) Frame = +3 Query: 300 SYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPF 479 S +LD A ++F + + W +I Y N P A L+ M V ++FT Sbjct: 196 SNSQLDSAEIIFNQMPTRNVVSWTAMITAYVKNCRPDEAFQLFRRMQVDEVKPNEFTIVS 255 Query: 480 VMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRS 659 +++A + + ++ +A K+GF D +L LID+Y KCG+L DA KVFD M+ +S Sbjct: 256 MLQASTQLGSLSMGRWVHDYAHKNGFPLDCFLGTALIDMYSKCGSLQDAWKVFDAMQSKS 315 Query: 660 IVSWTTLIAGLVLNGRVDTAQKVFDTM 740 + +W ++I L ++G + A ++D M Sbjct: 316 
LATWNSMITSLGVHGCGEEALDLYDQM 342 >ref|NP_189297.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75275188|sp|Q38959.1|PP257_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g26630, chloroplastic; Flags: Precursor gi|1402883|emb|CAA66814.1| hypothetical protein [Arabidopsis thaliana] gi|1495263|emb|CAA66119.1| orf09 [Arabidopsis thaliana] gi|11994298|dbj|BAB01728.1| unnamed protein product [Arabidopsis thaliana] gi|20466384|gb|AAM20509.1| unknown protein [Arabidopsis thaliana] gi|23198064|gb|AAN15559.1| unknown protein [Arabidopsis thaliana] gi|332643668|gb|AEE77189.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 455 Score = 204 bits (518), Expect = 3e-50 Identities = 101/198 (51%), Positives = 140/198 (70%), Gaps = 1/198 (0%) Frame = +3 Query: 162 FSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQH 341 F EA FL+ C+NF Q+KQIH KII+ +L ++Q++V +LI + SS+ E YA+LVF Sbjct: 18 FRSPEASYFLRTCSNFSQLKQIHTKIIKHNLTNDQLLVRQLISVSSSFGETQYASLVFNQ 77 Query: 342 VDNPSTFVWNLLIRTYTVNDCPYRAILLYN-SMIFRGVGVDKFTFPFVMKACLAISCVKK 518 + +PSTF WNL+IR+ +VN P A+LL+ MI DKFTFPFV+KACLA S ++ Sbjct: 78 LQSPSTFTWNLMIRSLSVNHKPREALLLFILMMISHQSQFDKFTFPFVIKACLASSSIRL 137 Query: 519 ANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVL 698 +++G A+K+GF DV+ N L+D+Y KCG + KVFDKM RSIVSWTT++ GLV Sbjct: 138 GTQVHGLAIKAGFFNDVFFQNTLMDLYFKCGKPDSGRKVFDKMPGRSIVSWTTMLYGLVS 197 Query: 699 NGRVDTAQKVFDTMPMRN 752 N ++D+A+ VF+ MPMRN Sbjct: 198 NSQLDSAEIVFNQMPMRN 215 Score = 87.4 bits (215), Expect = 5e-15 Identities = 47/147 (31%), Positives = 78/147 (53%) Frame = +3 Query: 300 SYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPF 479 S +LD A +VF + + W +I Y N P A L+ M V ++FT Sbjct: 197 SNSQLDSAEIVFNQMPMRNVVSWTAMITAYVKNRRPDEAFQLFRRMQVDDVKPNEFTIVN 256 Query: 480 VMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRS 659 +++A + + ++ +A K+GF D +L LID+Y KCG+L DA KVFD M+ +S Sbjct: 257 
LLQASTQLGSLSMGRWVHDYAHKNGFVLDCFLGTALIDMYSKCGSLQDARKVFDVMQGKS 316 Query: 660 IVSWTTLIAGLVLNGRVDTAQKVFDTM 740 + +W ++I L ++G + A +F+ M Sbjct: 317 LATWNSMITSLGVHGCGEEALSLFEEM 343 >ref|XP_002876985.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322823|gb|EFH53244.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 451 Score = 202 bits (513), Expect = 1e-49 Identities = 98/197 (49%), Positives = 139/197 (70%) Frame = +3 Query: 162 FSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQH 341 F EA FL+ C+NF Q+KQIH KII+ +L ++Q++V +LI + SS+ E YA+LVF Sbjct: 18 FRSPEASYFLRTCSNFSQLKQIHTKIIKHNLTNDQLLVRQLISVSSSFGETQYASLVFNQ 77 Query: 342 VDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKA 521 + +PSTF WNL+IR+ ++N P A+LL+ M+ DKFTFPFV+KACLA S ++ Sbjct: 78 LQSPSTFTWNLMIRSLSLNHKPREALLLFILMLSHQPQFDKFTFPFVIKACLASSSLRLG 137 Query: 522 NEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLN 701 +++G A+K+GF DV+ N L+D+Y KCG + KVFDKM RSIVSWTT++ GLV N Sbjct: 138 TQVHGLAIKAGFFNDVFFQNTLMDLYFKCGKPDCGRKVFDKMPGRSIVSWTTMLYGLVSN 197 Query: 702 GRVDTAQKVFDTMPMRN 752 ++D+A+ VF+ MP RN Sbjct: 198 SQLDSAEIVFNQMPTRN 214 Score = 89.4 bits (220), Expect = 1e-15 Identities = 48/147 (32%), Positives = 78/147 (53%) Frame = +3 Query: 300 SYRELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPF 479 S +LD A +VF + + W +I Y N P A L+ M V ++FT Sbjct: 196 SNSQLDSAEIVFNQMPTRNVVSWTAMITAYVKNRRPDEAFQLFRRMQVDDVKPNEFTIVN 255 Query: 480 VMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRS 659 +++A + + ++ +A K+GF D YL LID+Y KCG+L DA KVFD M+ +S Sbjct: 256 LLQASTQLGSLSMGRWVHDYAHKNGFVLDCYLGTALIDMYSKCGSLQDARKVFDVMQSKS 315 Query: 660 IVSWTTLIAGLVLNGRVDTAQKVFDTM 740 + +W ++I L ++G + A +F+ M Sbjct: 316 LATWNSMITSLGVHGCGEEALYLFEEM 342 >ref|XP_003553320.1| PREDICTED: pentatricopeptide repeat-containing protein At3g26630, chloroplastic-like [Glycine max] Length = 474 Score = 201 bits (511), Expect = 2e-49 Identities = 89/202 
(44%), Positives = 140/202 (69%) Frame = +3 Query: 147 TARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAA 326 T R+ F +EAL+ L+KC+NFKQ+KQ+H KIIR L ++Q++V +LI+L SY ++ YA Sbjct: 17 TPRTRFGSEEALVLLKKCSNFKQLKQVHGKIIRYGLTYDQLLVRKLIQLSPSYGKMKYAT 76 Query: 327 LVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAIS 506 LVF ++ P F WN++IR YT+ P A LL+ +M+++G DKFT+P V+ AC+A + Sbjct: 77 LVFDQLNAPDVFTWNVMIRAYTIGGSPKMAFLLFKAMLYQGFAPDKFTYPCVINACMAYN 136 Query: 507 CVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIA 686 + + A+K GF GD+Y+ N ++++Y KC ++D VFDKM R++ +WTT+IA Sbjct: 137 ALDVGRVAHALAIKMGFWGDLYVQNTMMNLYFKCENVDDGWNVFDKMCVRNVFAWTTVIA 196 Query: 687 GLVLNGRVDTAQKVFDTMPMRN 752 G V G++DTA+++F+ MP +N Sbjct: 197 GFVACGKLDTARELFEQMPSKN 218 Score = 93.2 bits (230), Expect = 9e-17 Identities = 47/148 (31%), Positives = 81/148 (54%) Frame = +3 Query: 309 ELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMK 488 +LD A +F+ + + + W +I Y + P A L+ M V +++T +++ Sbjct: 203 KLDTARELFEQMPSKNVVSWTAIIDGYVKHKQPIEAFDLFERMQADNVRPNEYTLVSLVR 262 Query: 489 ACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVS 668 AC + +K ++ FA+K+GF + +L LID+Y KCG L+DA VFD M+ R++ + Sbjct: 263 ACTEMGSLKLGRRVHDFALKNGFELEPFLGTALIDMYSKCGNLDDARTVFDMMQMRTLAT 322 Query: 669 WTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 W T+I L ++G D A +F+ M N Sbjct: 323 WNTMITSLGVHGYRDEALSIFEEMEKAN 350 >ref|XP_003624556.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355499571|gb|AES80774.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 476 Score = 201 bits (510), Expect = 3e-49 Identities = 94/198 (47%), Positives = 140/198 (70%), Gaps = 1/198 (0%) Frame = +3 Query: 162 FSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQH 341 F +EA +FLQ C NFKQ+KQIHA+IIR L H+Q+++ +L ++ SSY ++DYA+LVF Sbjct: 18 FGSEEARLFLQNCNNFKQLKQIHARIIRFRLTHDQLLIRKLCQISSSYGKIDYASLVFDQ 77 Query: 342 VDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKA 521 +++P F WN++IR Y + P ++I L+ 
MI G DKFT+PFV+ AC+A + Sbjct: 78 LNDPDIFTWNVMIRAYNTSGLPQKSIFLFKDMICCGFLPDKFTYPFVINACIASGVIDFG 137 Query: 522 NEIYGFAVKSGFRGDVYLDNVLIDVYMKCGA-LNDALKVFDKMRHRSIVSWTTLIAGLVL 698 +G A+K GF DVY+ N ++++Y K G ++D KVFDKMR R++VSWTT+IAGLV Sbjct: 138 RLTHGLAIKMGFWSDVYVQNNMMNLYFKIGGDVDDGWKVFDKMRVRNVVSWTTVIAGLVA 197 Query: 699 NGRVDTAQKVFDTMPMRN 752 G++DTA++VF+ +P +N Sbjct: 198 CGKLDTAREVFERIPSKN 215 Score = 92.0 bits (227), Expect = 2e-16 Identities = 45/144 (31%), Positives = 79/144 (54%) Frame = +3 Query: 309 ELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMK 488 +LD A VF+ + + + W +I Y ND P +A L+ M+ V ++FT ++K Sbjct: 200 KLDTAREVFERIPSKNVVSWTAMINGYVKNDNPIKAFDLFERMLIDNVRPNEFTLVSLIK 259 Query: 489 ACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVS 668 AC + +K ++ FA+K+GF +L L+D+Y KCG+L+ A+KVF M R++ + Sbjct: 260 ACTDLGSLKLGRRMHDFALKNGFELGPFLGTALVDMYSKCGSLDAAVKVFGLMEVRNLAT 319 Query: 669 WTTLIAGLVLNGRVDTAQKVFDTM 740 W T++ ++G + +F M Sbjct: 320 WNTMLTSFGVHGFGNEVLDLFKEM 343 >ref|XP_006291116.1| hypothetical protein CARUB_v10017228mg [Capsella rubella] gi|482559823|gb|EOA24014.1| hypothetical protein CARUB_v10017228mg [Capsella rubella] Length = 454 Score = 197 bits (502), Expect = 2e-48 Identities = 98/197 (49%), Positives = 138/197 (70%) Frame = +3 Query: 162 FSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQH 341 F EA L+ C +F Q+KQIH +II+ +L ++Q++V +LI + SS+ E YA+LVF Sbjct: 18 FRSPEASYLLRTCLSFSQLKQIHTRIIKHNLTNDQLLVRQLISVSSSFGETKYASLVFNQ 77 Query: 342 VDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKA 521 + +PSTF WNL+IR+ +VN P A+LL+ M+ DKFTFPFV+KACLA S ++ Sbjct: 78 LQSPSTFTWNLMIRSLSVNHKPREALLLFILMLNHQSQFDKFTFPFVIKACLASSSLRLG 137 Query: 522 NEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLN 701 +++G A+K+GF DV+ N L+D+Y KCG L+ KVFDKM RSIVSWTTL+ GLV N Sbjct: 138 TQVHGLAIKAGFFNDVFFQNTLMDLYFKCGKLDLGRKVFDKMPGRSIVSWTTLLHGLVSN 197 Query: 702 GRVDTAQKVFDTMPMRN 752 ++D+A+ VF+ MP RN Sbjct: 198 CQLDSAKIVFNQMPTRN 214 Score = 90.1 bits (222), Expect 
= 7e-16
Identities = 53/182 (29%), Positives = 91/182 (50%)
Frame = +3

Query: 195 KCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQHVDNPSTFVWNL 374
KC +++ K+ S+ ++ L+ C +LD A +VF + + W
Sbjct: 165 KCGKLDLGRKVFDKMPGRSIVSWTTLLHGLVSNC----QLDSAKIVFNQMPTRNVVSWTA 220

Query: 375 LIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKSG 554
+I Y N P A L+ M V ++FT +++A + + ++ +A K+G
Sbjct: 221 MITAYVKNRRPDEAFQLFRRMQVDDVKPNEFTIVNLLQASTQLGSLSMGRWVHDYAHKNG 280

Query: 555 FRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIAGLVLNGRVDTAQKVFD 734
F D +L LID+Y KCG+L DA KVFD M+ +S+ +W ++I L ++G + A +FD
Sbjct: 281 FVLDCFLGTALIDMYSKCGSLQDARKVFDVMQSKSLATWNSMITSLGVHGCGEEALCLFD 340

Query: 735 TM 740
M
Sbjct: 341 DM 342

>ref|XP_007161786.1| hypothetical protein PHAVU_001G097900g [Phaseolus vulgaris]
gi|561035250|gb|ESW33780.1| hypothetical protein PHAVU_001G097900g [Phaseolus vulgaris]
Length = 482

Score = 197 bits (500), Expect = 4e-48
Identities = 90/202 (44%), Positives = 140/202 (69%)
Frame = +3

Query: 147 TARSSFSPQEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAA 326
T R+ F +EA + LQKC+NFKQ+KQ+H +IIR L ++Q+++ LI+L SSY +L+YA
Sbjct: 17 TPRTRFGSEEAHLILQKCSNFKQLKQVHGRIIRFGLIYDQLLMRNLIQLSSSYGKLNYAT 76

Query: 327 LVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAIS 506
LVF ++ P TF WN++IR T+ P A+LL+ +M+ +G DKFT+PFV+ AC+A +
Sbjct: 77 LVFDQLNAPDTFTWNVMIRANTIGGSPKMALLLFKAMLSQGFAPDKFTYPFVISACVASN 136

Query: 507 CVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSWTTLIA 686
+ + A+K GF D+Y+ ++++Y KC ++D KVFDKMR R++ +WT++IA
Sbjct: 137 ALDLGRLTHALAIKMGFWSDLYVQTNVMNLYFKCHEVDDGWKVFDKMRVRNVFAWTSVIA 196

Query: 687 GLVLNGRVDTAQKVFDTMPMRN 752
G V G +DTA+K+F+ MP +N
Sbjct: 197 GFVACGSLDTARKLFEKMPSKN 218

Score = 89.7 bits (221), Expect = 1e-15
Identities = 46/147 (31%), Positives = 81/147 (55%)
Frame = +3

Query: 312 LDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKA 491
LD A +F+ + + + W +I Y ++ P A L+ M+ V +++T +++A
Sbjct: 204 LDTARKLFEKMPSKNVVSWTAMIDGYVKHERPIEAFNLFERMLVDDVRPNEYTVVSLVRA 263
Query: 492 CLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSW 671 C + +K I+ F +K+ F +L LID+Y KCG+L++A KVFD M+ R++ +W Sbjct: 264 CTEMGSLKLGRRIHDFVLKNNFDLGPFLGTALIDMYSKCGSLDEARKVFDLMQVRTLATW 323 Query: 672 TTLIAGLVLNGRVDTAQKVFDTMPMRN 752 T+I L ++G D A +F+ M N Sbjct: 324 NTMITSLGVHGFRDEAISLFEEMVKAN 350 >ref|XP_007217099.1| hypothetical protein PRUPE_ppa022877mg, partial [Prunus persica] gi|462413249|gb|EMJ18298.1| hypothetical protein PRUPE_ppa022877mg, partial [Prunus persica] Length = 347 Score = 159 bits (401), Expect = 1e-36 Identities = 69/147 (46%), Positives = 103/147 (70%) Frame = +3 Query: 312 LDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKA 491 +DYA L+F + P TF WNL+I +YT+N C A+LLY+ MI +G DKFTFPFV+KA Sbjct: 1 MDYANLIFHQIQGPLTFTWNLMIMSYTINGCSQEALLLYSLMIHQGFPPDKFTFPFVIKA 60 Query: 492 CLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVSW 671 C+A S ++ ++G ++K+ F D+++ N L+D Y KCG ++ +VF+KMR R++VSW Sbjct: 61 CIASSAFEQGKVVHGLSIKNSFSRDMFVQNTLMDFYFKCGEIDCGCRVFEKMRVRNVVSW 120 Query: 672 TTLIAGLVLNGRVDTAQKVFDTMPMRN 752 TT+I+GLV G + A+ VF+ MP +N Sbjct: 121 TTMISGLVACGELHAARAVFERMPAKN 147 Score = 95.9 bits (237), Expect = 1e-17 Identities = 52/148 (35%), Positives = 80/148 (54%) Frame = +3 Query: 309 ELDYAALVFQHVDNPSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMK 488 EL A VF+ + + W ++ Y N P A L+ M V ++FT ++K Sbjct: 132 ELHAARAVFERMPAKNVVSWTAMMNGYVRNQQPEEAFELFWRMQVGDVRPNEFTLVSLLK 191 Query: 489 ACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGALNDALKVFDKMRHRSIVS 668 AC + +K I+ FA+K+GF+ DV+L LID Y KCG+L DA +VFD+MR +S+ + Sbjct: 192 ACTLLGSLKLGRWIHDFALKNGFKLDVFLGTALIDTYSKCGSLEDARRVFDEMRIKSLAT 251 Query: 669 WTTLIAGLVLNGRVDTAQKVFDTMPMRN 752 W +I L ++G + A +F M N Sbjct: 252 WNAMITSLGVHGFGEEALALFAEMEKVN 279 >ref|XP_002875497.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297321335|gb|EFH51756.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata]
Length = 629

Score = 155 bits (392), Expect = 1e-35
Identities = 79/196 (40%), Positives = 117/196 (59%), Gaps = 2/196 (1%)
Frame = +3

Query: 171 QEALIFLQKCANFKQMKQIHAKIIRTSLHHNQVIVARLIRLCSSYRELDYAALVFQHVDN 350
+E L L KCAN Q+KQ+HA+IIR +LH + I +LI S R+ + A VF V
Sbjct: 20 EERLQDLPKCANLNQVKQLHAQIIRRNLHQDLHIAPKLISALSLCRQTNLALRVFNQVQE 79

Query: 351 PSTFVWNLLIRTYTVNDCPYRAILLYNSMIFRGVGVDKFTFPFVMKACLAISCVKKANEI 530
P+ + N LIR + +N PY+A +++ M G+ D FT+PF++KAC +S + +
Sbjct: 80 PNVHLCNSLIRAHALNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGLSWLPVVKMM 139

Query: 531 YGFAVKSGFRGDVYLDNVLIDVYMKCGAL--NDALKVFDKMRHRSIVSWTTLIAGLVLNG 704
+ K G D+Y+ N LID Y +CG L DA+K+F+KM R VSW +++ GLV G
Sbjct: 140 HNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVSWNSMLGGLVKAG 199

Query: 705 RVDTAQKVFDTMPMRN 752
+ A+K+FD MP R+
Sbjct: 200 ELRDARKLFDEMPQRD 215

Score = 66.2 bits (160), Expect = 1e-08
Identities = 41/162 (25%), Positives = 74/162 (45%), Gaps = 2/162 (1%)
Frame = +3

Query: 261 NQVIVARLIRLCSSYRELDYAALVFQHVDNPSTFV--WNLLIRTYTVNDCPYRAILLYNS 434
N V + ++ S +++ A ++F + P+ V W ++I Y A L +
Sbjct: 246 NTVSWSTMVMGYSKAGDMEMARVMFDKMPFPAKNVVTWTIIIAGYAEKGLLKEADKLVDQ 305

Query: 435 MIFRGVGVDKFTFPFVMKACLAISCVKKANEIYGFAVKSGFRGDVYLDNVLIDVYMKCGA 614
M+ G+ D ++ AC + + KS + + N L+D+Y KCG+
Sbjct: 306 MVASGLRFDAAAAISILAACAESGLLSLGMRAHSIIKKSNLNSNASVLNALLDMYAKCGS 365

Query: 615 LNDALKVFDKMRHRSIVSWTTLIAGLVLNGRVDTAQKVFDTM 740
L A VF+ M + +VSW T++ GL ++G A ++F M
Sbjct: 366 LKKAFDVFNDMPKKDLVSWNTMLHGLGVHGHGKEAIELFSRM 407