BLASTX nr result
ID: Atropa21_contig00002631
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00002631
         (798 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containi...   515   e-144
ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containi...   512   e-143
gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus...   459   e-127
gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily p...   455   e-126
ref|XP_002324000.1| pentatricopeptide repeat-containing family p...   453   e-125
ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citr...   451   e-124
ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi...   449   e-124
gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlise...   444   e-122
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   444   e-122
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...   443   e-122
ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   443   e-122
gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]     438   e-120
gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus pe...   437   e-120
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   437   e-120
ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Caps...   436   e-120
ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutr...   436   e-120
ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containi...   435   e-120
ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [A...   417   e-114
ref|XP_003576898.1| PREDICTED: pentatricopeptide repeat-containi...
414 e-113 ref|XP_004962591.1| PREDICTED: pentatricopeptide repeat-containi... 411 e-112 >ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Solanum tuberosum] Length = 740 Score = 515 bits (1327), Expect = e-144 Identities = 254/265 (95%), Positives = 261/265 (98%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNHIIWLMG+AKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV Sbjct: 423 CNHIIWLMGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 482 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKPSSREWN+VLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL Sbjct: 483 RLLNKMEEKGLKPSSREWNAVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 542 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEALQVWKHMIK+GIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMV+TGVEPTV Sbjct: 543 EKGKLYDEALQVWKHMIKVGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVTTGVEPTV 602 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VTFNA+ISGCARNGM SVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELY R Sbjct: 603 VTFNAIISGCARNGMESVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYVR 662 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A+ +GLSLSTKAYDAVISST AYGA Sbjct: 663 ALTEGLSLSTKAYDAVISSTQAYGA 687 >ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Solanum lycopersicum] Length = 742 Score = 512 bits (1319), Expect = e-143 Identities = 252/265 (95%), Positives = 261/265 (98%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNHIIWLMG+AKKWWAALEIYEDLLDKGP+PNNMSYELIVSHFNILLSAARKRGIWRWGV Sbjct: 425 CNHIIWLMGKAKKWWAALEIYEDLLDKGPQPNNMSYELIVSHFNILLSAARKRGIWRWGV 484 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKPSSREWN+VLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL Sbjct: 485 RLLNKMEEKGLKPSSREWNAVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 544 Query: 363 
EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEALQVWKHMIK+GIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMV+TGVEPTV Sbjct: 545 EKGKLYDEALQVWKHMIKVGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVTTGVEPTV 604 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VTFNA+ISGCARNGM SVAYEWFQRMKTQNITPNEVSYE+LIEALANDGKPRLAYELY R Sbjct: 605 VTFNAIISGCARNGMESVAYEWFQRMKTQNITPNEVSYEVLIEALANDGKPRLAYELYVR 664 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A+ +GLSLSTKAYDAVISST AYGA Sbjct: 665 ALTEGLSLSTKAYDAVISSTQAYGA 689 >gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris] Length = 752 Score = 459 bits (1182), Expect = e-127 Identities = 216/265 (81%), Positives = 245/265 (92%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH IWLMG+AKKWWAALEIYEDLLDKGPKPNN+SYELIVSHFN LL+AA+++GIWRWGV Sbjct: 434 CNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELIVSHFNFLLNAAKRKGIWRWGV 493 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKP SREWN+VLVACSKASET+AAVQIF+RMVE GEKPTVISYGALLSAL Sbjct: 494 RLLNKMEEKGLKPGSREWNAVLVACSKASETTAAVQIFKRMVENGEKPTVISYGALLSAL 553 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYD+AL+VW HM+K+G+EPN YAYTIMASIYTAQG FN VD+I++EMV+ G+E TV Sbjct: 554 EKGKLYDDALRVWNHMVKVGVEPNAYAYTIMASIYTAQGNFNRVDAIVQEMVTIGIEVTV 613 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+ISGCARNGM S AYEWF RMK QNITPNE++YEMLIEALANDGKPRLAY+LY R Sbjct: 614 VTYNAIISGCARNGMSSAAYEWFHRMKVQNITPNEITYEMLIEALANDGKPRLAYQLYTR 673 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL+LS+KAYD V+ S+ A GA Sbjct: 674 AKNEGLTLSSKAYDVVVHSSQANGA 698 >gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative [Theobroma cacao] Length = 741 Score = 455 bits (1170), Expect = e-126 Identities = 210/265 (79%), Positives = 246/265 (92%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALE+YE+LLDKGP 
PNN+SYEL++SHFNILL+AARKRGIWRWGV Sbjct: 423 CNHLIWLMGKAKKWWAALEVYEELLDKGPSPNNLSYELVMSHFNILLTAARKRGIWRWGV 482 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKME+KGLKP SREWN+VLVACSKASET+AAVQIFRRMVE+GEKPT+ISYGALLSAL Sbjct: 483 RLLNKMEDKGLKPGSREWNAVLVACSKASETTAAVQIFRRMVEQGEKPTIISYGALLSAL 542 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEAL+VW HMIK+G++PNLYAYTIMASI T +G F +V+++ +EM S+G+EPTV Sbjct: 543 EKGKLYDEALRVWDHMIKVGVKPNLYAYTIMASIVTGKGNFRMVNAVFQEMASSGIEPTV 602 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+ISGCARNGM S AYEWF RMK QNI+PNE++Y+MLIEALA DGKPRLAYELY R Sbjct: 603 VTYNAIISGCARNGMSSAAYEWFHRMKVQNISPNEITYQMLIEALAKDGKPRLAYELYLR 662 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL+LS+KAYDAV+ S+ YGA Sbjct: 663 AHNEGLNLSSKAYDAVVQSSQVYGA 687 Score = 69.3 bits (168), Expect = 1e-09 Identities = 51/206 (24%), Positives = 98/206 (47%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LL M+ GLK S ++ ++ AC+ A +++ R+ E+ + ++ L+ Sbjct: 370 LKLLRDMDNAGLKLSKEDYERIIWACTCEEHYVVAKELYSRIRERHSEISLSVCNHLIWL 429 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAYTIMAS----IYTAQGKFNIVD---SIIKEMV 518 + K K + AL+V++ ++ G PN +Y ++ S + TA K I ++ +M Sbjct: 430 MGKAKKWWAALEVYEELLDKGPSPNNLSYELVMSHFNILLTAARKRGIWRWGVRLLNKME 489 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 G++P +NAV+ C++ + A + F+RM Q P +SY L+ AL Sbjct: 490 DKGLKPGSREWNAVLVACSKASETTAAVQIFRRMVEQGEKPTIISYGALLSALEKGKLYD 549 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A ++ I G+ + AY + S Sbjct: 550 EALRVWDHMIKVGVKPNLYAYTIMAS 575 >ref|XP_002324000.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222867002|gb|EEF04133.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 709 Score = 453 bits (1166), Expect = e-125 Identities = 210/265 (79%), Positives = 246/265 (92%) Frame = +3 Query: 3 
CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALE+YEDLLDKGPKPNN+SYELIVS+FN+LL+AA+KRGIWRWGV Sbjct: 393 CNHVIWLMGKAKKWWAALEVYEDLLDKGPKPNNLSYELIVSYFNVLLTAAKKRGIWRWGV 452 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKP S+EWN+VLVACSKASET+AAVQIFRRMVE+GEKPTVISYGALLSAL Sbjct: 453 RLLNKMEEKGLKPGSKEWNAVLVACSKASETAAAVQIFRRMVEQGEKPTVISYGALLSAL 512 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKG+LYDEA++VW+HM+K+G++PN+YAYTIMAS++T QG F +VD+II EMVSTG+EPTV Sbjct: 513 EKGRLYDEAVRVWEHMLKVGVKPNVYAYTIMASVFTRQGNFRLVDAIINEMVSTGIEPTV 572 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+ISGCARN + S AYEWF RMK QNI+PNE++Y+MLIEALA GKPRLAYELY R Sbjct: 573 VTYNAIISGCARNNLSSAAYEWFHRMKVQNISPNEITYDMLIEALAKSGKPRLAYELYLR 632 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+ L LS KAYDAV+ S+ AYGA Sbjct: 633 AQNEDLQLSPKAYDAVMHSSEAYGA 657 Score = 65.9 bits (159), Expect = 2e-08 Identities = 44/206 (21%), Positives = 98/206 (47%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LL M++ L+P ++ ++ AC++ A +++ R+ E+ ++ ++ Sbjct: 340 LKLLTDMDKAELQPGRSDYERLVWACTREEHYVVAKELYIRIRERCSDISLSVCNHVIWL 399 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTA-------QGKFNIVDSIIKEMV 518 + K K + AL+V++ ++ G +PN +Y ++ S + +G + ++ +M Sbjct: 400 MGKAKKWWAALEVYEDLLDKGPKPNNLSYELIVSYFNVLLTAAKKRGIWRWGVRLLNKME 459 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 G++P +NAV+ C++ + A + F+RM Q P +SY L+ AL Sbjct: 460 EKGLKPGSKEWNAVLVACSKASETAAAVQIFRRMVEQGEKPTVISYGALLSALEKGRLYD 519 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A ++ + G+ + AY + S Sbjct: 520 EAVRVWEHMLKVGVKPNVYAYTIMAS 545 >ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citrus clementina] gi|568831365|ref|XP_006469938.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Citrus sinensis] gi|557549828|gb|ESR60457.1| hypothetical 
protein CICLE_v10014357mg [Citrus clementina] Length = 768 Score = 451 bits (1161), Expect = e-124 Identities = 212/265 (80%), Positives = 242/265 (91%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALE+YEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV Sbjct: 449 CNHLIWLMGKAKKWWAALEVYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 508 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKP SREWN+VLVACSKASE +AAVQIF+RMVEKGEKPT+ISYGALLSAL Sbjct: 509 RLLNKMEEKGLKPGSREWNAVLVACSKASEYNAAVQIFKRMVEKGEKPTIISYGALLSAL 568 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA +VW+HM+ +G EPNLYAYTIMASI+TAQGKFN+V+ I +EM S+ +EPTV Sbjct: 569 EKGKLYDEASRVWQHMLNVGAEPNLYAYTIMASIFTAQGKFNLVELIFREMASSRIEPTV 628 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+IS C +NGM S AYEWF RMK QNI+PNE++YEMLIEALA DGKPRLAY+LY R Sbjct: 629 VTYNAIISACGQNGMSSAAYEWFHRMKVQNISPNEITYEMLIEALAKDGKPRLAYDLYLR 688 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+ L+LS+KAYDA++ + YGA Sbjct: 689 ARNEELNLSSKAYDAILEFSQVYGA 713 >ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Glycine max] Length = 808 Score = 449 bits (1154), Expect = e-124 Identities = 211/265 (79%), Positives = 243/265 (91%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH IWLMG+AKKWWAALEIYEDLLDKGPKPNN+SYELIVSHFN LLSAA+++GIWRWGV Sbjct: 490 CNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELIVSHFNFLLSAAKRKGIWRWGV 549 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 +LLNKME+KGLKP REWN+VLVACSKASET+AAVQIF+RMVE GEKPT+ISYGALLSAL Sbjct: 550 KLLNKMEDKGLKPGCREWNAVLVACSKASETTAAVQIFKRMVENGEKPTIISYGALLSAL 609 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYD+AL+VW HMIK+G+EPN YAYTIMASI+TAQG FN VD+II+EMV+ G+E TV Sbjct: 610 EKGKLYDDALRVWNHMIKVGVEPNAYAYTIMASIHTAQGNFNRVDAIIQEMVTLGIEVTV 669 Query: 543 
VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+I+GCA NGM SVAYEWF RMK QNI+PNE++YEMLI ALANDGKPRLAY+LY R Sbjct: 670 VTYNAIITGCAHNGMSSVAYEWFHRMKVQNISPNEITYEMLIVALANDGKPRLAYQLYTR 729 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL+LS+KAYDAV+ S+ A A Sbjct: 730 AKNEGLTLSSKAYDAVVQSSQANNA 754 Score = 56.6 bits (135), Expect = 1e-05 Identities = 35/128 (27%), Positives = 61/128 (47%), Gaps = 6/128 (4%) Frame = +3 Query: 309 EKGEKPTVISYGALLSALEKGKLYDEALQVWKHMIKMGIE------PNLYAYTIMASIYT 470 +KG+ P + + ++S K K D AL ++ M K IE PNL+ Y + + Sbjct: 242 DKGDLPLQV-FSTIISGFGKEKRMDSALILFNWMKKRKIETNGSFGPNLFIYNGLLGVVK 300 Query: 471 AQGKFNIVDSIIKEMVSTGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEV 650 G+F ++ I+ EM G+ VVT+N +++ G A + ++ +TP+ V Sbjct: 301 QSGQFAEMEVILNEMAEDGIAYNVVTYNTLMAIYIEKGECDKALNMLEEIRRNGLTPSPV 360 Query: 651 SYEMLIEA 674 SY + A Sbjct: 361 SYSQALLA 368 >gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlisea aurea] Length = 557 Score = 444 bits (1143), Expect = e-122 Identities = 210/265 (79%), Positives = 240/265 (90%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNHIIWLMG+AKKWWAALEIYE+LLD GPKPNNMSYELIVSHFNILL+AARK+GIWRWGV Sbjct: 277 CNHIIWLMGKAKKWWAALEIYEELLDTGPKPNNMSYELIVSHFNILLTAARKKGIWRWGV 336 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RL+NKM+EKGLKP SREWNSVLVACSKA ETS A++IF+RMVE G+KPT+ISYGALLSAL Sbjct: 337 RLINKMKEKGLKPGSREWNSVLVACSKAGETSTAIEIFKRMVENGDKPTIISYGALLSAL 396 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA+QVWKHM+K+G+E NLYAYTIMASI+ +QGK ++VD II+EMV GVEPTV Sbjct: 397 EKGKLYDEAIQVWKHMVKVGVEANLYAYTIMASIHASQGKIDLVDLIIREMVGAGVEPTV 456 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VTFNAVISG +N + S AYEWF+RMK QN+TPNE++YE LIEALA DGKPRLA EL+ R Sbjct: 457 VTFNAVISGFVKNNLSSAAYEWFRRMKLQNVTPNEITYETLIEALAKDGKPRLASELHLR 516 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL LSTKAYDA+I S+ AYGA Sbjct: 517 
AQNEGLMLSTKAYDAIIQSSDAYGA 541 >ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223533700|gb|EEF35435.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 671 Score = 444 bits (1143), Expect = e-122 Identities = 204/265 (76%), Positives = 240/265 (90%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALEIYEDLLDKGP PNNMSYELIVSHFNILL+AARKRGIWRWGV Sbjct: 353 CNHLIWLMGKAKKWWAALEIYEDLLDKGPNPNNMSYELIVSHFNILLTAARKRGIWRWGV 412 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKME+KGLKP SREWN+VLVACSKASET+AAVQIFRRM+E+GEKPT++SYGALLSAL Sbjct: 413 RLLNKMEDKGLKPGSREWNAVLVACSKASETTAAVQIFRRMIEQGEKPTIVSYGALLSAL 472 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA++VW+HM+K+ ++PNLYAYTIMAS++ QGKF VD+II++MVS+G+EPT+ Sbjct: 473 EKGKLYDEAVRVWEHMLKVDVKPNLYAYTIMASVFAGQGKFTYVDAIIQKMVSSGIEPTI 532 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 +T+NA+ISGC N + S AYEWF RMK QN+ PN+++YEMLIEALA DGKPRLAYELY R Sbjct: 533 ITYNAIISGCTHNNLSSAAYEWFHRMKVQNMPPNKITYEMLIEALAKDGKPRLAYELYLR 592 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A +GL LS K YDAV+ S+ YGA Sbjct: 593 AKYEGLDLSAKVYDAVLRSSQVYGA 617 Score = 71.2 bits (173), Expect = 4e-10 Identities = 46/173 (26%), Positives = 90/173 (52%), Gaps = 7/173 (4%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LL M++ GL+PS E+ ++ AC++ + +++ R+ E+ K ++ L+ Sbjct: 300 LKLLTDMDKAGLQPSQAEYERLVWACTREDHYAVGKELYIRIRERHSKISLSVCNHLIWL 359 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAYTIMAS----IYTAQGKFNIVD---SIIKEMV 518 + K K + AL++++ ++ G PN +Y ++ S + TA K I ++ +M Sbjct: 360 MGKAKKWWAALEIYEDLLDKGPNPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKME 419 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEAL 677 G++P +NAV+ C++ + A + F+RM Q P VSY L+ AL Sbjct: 420 DKGLKPGSREWNAVLVACSKASETTAAVQIFRRMIEQGEKPTIVSYGALLSAL 472 Score = 
64.3 bits (155), Expect = 5e-08 Identities = 59/277 (21%), Positives = 121/277 (43%), Gaps = 22/277 (7%) Frame = +3 Query: 12 IIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGVRLL 191 +I G K +AL + E L + +++ L + +N LLSA +K ++ ++L Sbjct: 119 MIKAFGWDNKMESALALVEWLKRRKEIGSSIGPNLFI--YNSLLSAVKKSKLFEEAEKIL 176 Query: 192 NKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSALE-- 365 N M ++G+ P+ +N+++ + + + A+ I +M EKG PT SY L A Sbjct: 177 NDMTQEGIAPNVVTYNTLMGIYVEKGQATKALNILEQMHEKGFIPTAASYSTALLAYRGM 236 Query: 366 ------------------KGKLYDEALQVWKH-MIKMGIEPNLYAYTIMASIYTAQGKFN 488 KGK+ + + W++ +K+ Y +M F+ Sbjct: 237 EDGHGALAFFVDIKDKYLKGKIGKNSDENWENEFVKLETFIIRICYQVMRRWLVRHDNFS 296 Query: 489 I-VDSIIKEMVSTGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEML 665 V ++ +M G++P+ + ++ C R +V E + R++ ++ + L Sbjct: 297 TDVLKLLTDMDKAGLQPSQAEYERLVWACTREDHYAVGKELYIRIRERHSKISLSVCNHL 356 Query: 666 IEALANDGKPRLAYELYARAINKGLSLSTKAYDAVIS 776 I + K A E+Y ++KG + + +Y+ ++S Sbjct: 357 IWLMGKAKKWWAALEIYEDLLDKGPNPNNMSYELIVS 393 >ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g46610 gi|6523064|emb|CAB62331.1| hypothetical protein [Arabidopsis thaliana] gi|332644660|gb|AEE78181.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 665 Score = 443 bits (1139), Expect = e-122 Identities = 205/265 (77%), Positives = 238/265 (89%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALEIYEDLLD+GP+PNN+SYEL+VSHFNILLSAA KRGIWRWGV Sbjct: 384 CNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGV 443 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKME+KGLKP R WN+VLVACSKASET+AA+QIF+ MV+ GEKPTVISYGALLSAL Sbjct: 444 RLLNKMEDKGLKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSAL 503 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA 
+VW HMIK+GIEPNLYAYT MAS+ T Q KFN++D+++KEM S G+EP+V Sbjct: 504 EKGKLYDEAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSV 563 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VTFNAVISGCARNG+ VAYEWF RMK++N+ PNE++YEMLIEALAND KPRLAYEL+ + Sbjct: 564 VTFNAVISGCARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVK 623 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL LS+K YDAV+ S YGA Sbjct: 624 AQNEGLKLSSKPYDAVVKSAETYGA 648 Score = 71.6 bits (174), Expect = 3e-10 Identities = 47/206 (22%), Positives = 99/206 (48%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LLN M+ G++PS E ++ AC++ ++++R+ E+ + ++ L+ Sbjct: 331 LKLLNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHLIWL 390 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAY-------TIMASIYTAQGKFNIVDSIIKEMV 518 + K K + AL++++ ++ G EPN +Y I+ S + +G + ++ +M Sbjct: 391 MGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVRLLNKME 450 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 G++P +NAV+ C++ + A + F+ M P +SY L+ AL Sbjct: 451 DKGLKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYD 510 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A+ ++ I G+ + AY + S Sbjct: 511 EAFRVWNHMIKVGIEPNLYAYTTMAS 536 >ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610 [Vitis vinifera] Length = 763 Score = 443 bits (1139), Expect = e-122 Identities = 208/265 (78%), Positives = 241/265 (90%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNHIIWLMG+AKKWWAALEIYEDLLDKGPKPNN+SYEL+VSHFNILL+AARK+GIWRWGV Sbjct: 445 CNHIIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELVVSHFNILLTAARKKGIWRWGV 504 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKME+KGLKP SREWN+VLVACSKA+ETSAAV+IFRRMVE+GEKPT+ISYGALLSAL Sbjct: 505 RLLNKMEDKGLKPGSREWNAVLVACSKAAETSAAVEIFRRMVEQGEKPTIISYGALLSAL 564 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA 
+VW+HM+KMG+EPNLYAYTIMASI QGK VDSI++EM + G++ TV Sbjct: 565 EKGKLYDEASRVWEHMVKMGVEPNLYAYTIMASICVGQGKLQRVDSILREMETLGIDATV 624 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+ISGCARNG+ S A+EWF RMK I PNE++YEMLIEALA DGKPRLA+ELY+R Sbjct: 625 VTYNAIISGCARNGLSSAAFEWFHRMKVGKIQPNEITYEMLIEALAKDGKPRLAFELYSR 684 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL+LSTKAYDAV+ S+ + A Sbjct: 685 AQNEGLNLSTKAYDAVVLSSQVHSA 709 >gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis] Length = 737 Score = 438 bits (1126), Expect = e-120 Identities = 204/265 (76%), Positives = 240/265 (90%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH IWLMG+AK+WW ALEIYEDLLDKGP+PNNMSYE+IVSHFNILL+AARKRGIW+WGV Sbjct: 452 CNHTIWLMGKAKRWWTALEIYEDLLDKGPQPNNMSYEIIVSHFNILLTAARKRGIWKWGV 511 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKP S+EWN+VL+ACSKASETSAAV+IF+RMVE+G+KPT +SYGALLSAL Sbjct: 512 RLLNKMEEKGLKPGSKEWNAVLIACSKASETSAAVKIFKRMVEQGQKPTFLSYGALLSAL 571 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA QVW+HM+K+GI PN+YAYTIMAS++ GKFN+VD++I EMVS+G+EPTV Sbjct: 572 EKGKLYDEARQVWEHMLKVGIRPNVYAYTIMASVFAGHGKFNMVDTVIHEMVSSGIEPTV 631 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+ISGCARN M +A+EWF RMK Q+ITPN V+YEMLIEALAND KPRLAYELY R Sbjct: 632 VTYNAIISGCARNDMIDMAFEWFHRMKAQSITPNNVTYEMLIEALANDCKPRLAYELYLR 691 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL L+ KAYD V+ S+ +GA Sbjct: 692 AQNEGLRLAPKAYDIVVESSQYHGA 716 Score = 57.4 bits (137), Expect = 6e-06 Identities = 42/164 (25%), Positives = 74/164 (45%), Gaps = 6/164 (3%) Frame = +3 Query: 201 EEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSALEKGKLY 380 EEKG K RE S L A + ++ +KGE P + + ++ L + KL Sbjct: 175 EEKGGKVDVRELASSLRFAKTADDVDEVLK------DKGELPPQV-FSTMIRGLGREKLL 227 Query: 381 DEALQVWKHMIKMG------IEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 D A + + + + I NL+ Y + +F ++ 
++ M GV P V Sbjct: 228 DPAFALLEWLKRKKEENNGLISLNLFIYNSLLGAVKQSEQFGEMEKVLNYMAQEGVVPNV 287 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEA 674 VT+N +++ NG G+ A + ++ + +TP+ VSY + A Sbjct: 288 VTYNTMMAIHLENGEGTKALSVLEEIRKKGLTPSPVSYSTALLA 331 >gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica] Length = 734 Score = 437 bits (1124), Expect = e-120 Identities = 203/264 (76%), Positives = 241/264 (91%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALEIYED+LD+GPKPNNMSYELIVSHFN+LL+AARKRGIWRWG+ Sbjct: 443 CNHVIWLMGKAKKWWAALEIYEDMLDRGPKPNNMSYELIVSHFNVLLTAARKRGIWRWGI 502 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKP S+EWN+VLVACSKA+ETSAAV+IF+RMVE+G+KPTV+SYGALLSAL Sbjct: 503 RLLNKMEEKGLKPRSKEWNAVLVACSKAAETSAAVKIFKRMVEQGQKPTVLSYGALLSAL 562 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA QVW+HM+K+G++PNLYAYTIMAS+++ GK N+VD+II EMVS+G+EPTV Sbjct: 563 EKGKLYDEARQVWEHMLKVGVKPNLYAYTIMASVFSGHGKLNMVDTIIHEMVSSGIEPTV 622 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+ISG ARNG + AYEWFQRMK QNI+PN V+YEM+IE LAN GKPRLAY+LY Sbjct: 623 VTYNAIISGFARNGSTNAAYEWFQRMKDQNISPNNVTYEMMIEGLANGGKPRLAYDLYLT 682 Query: 723 AINKGLSLSTKAYDAVISSTHAYG 794 A N+GL LS K+YD V+ S+ A G Sbjct: 683 AQNQGLDLSPKSYDIVVQSSLASG 706 >ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297319497|gb|EFH49919.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 674 Score = 437 bits (1123), Expect = e-120 Identities = 202/265 (76%), Positives = 236/265 (89%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALEIYEDLLD+GP+PNN+SYEL+VSHFNILLSAA +RGIWRWGV Sbjct: 393 CNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASRRGIWRWGV 452 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKME+KGLKP SR WN+VLVACSKASET+AA+QIF+ MV+ GEKPTVISYGALLSAL Sbjct: 453 RLLNKMEDKGLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSAL 512 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA +VW HMIK+GIEPNLYAYT MAS+ T Q KFN++D+++KEM S G+EP+V Sbjct: 513 EKGKLYDEAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSV 572 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NAVISGCARNG+ VAYEWF RM+ + + PNE++YEMLIEALAND KPRLAYEL+ + Sbjct: 573 VTYNAVISGCARNGLSGVAYEWFHRMRGEKVEPNEITYEMLIEALANDAKPRLAYELHLK 632 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N GL LS+K YDAV+ S YGA Sbjct: 633 AQNDGLKLSSKPYDAVVKSAETYGA 657 Score = 71.2 bits (173), Expect = 4e-10 Identities = 48/206 (23%), Positives = 98/206 (47%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LLN M+ G KPS E ++ AC++ ++++R+ E+ + ++ L+ Sbjct: 340 LKLLNAMDNAGPKPSREEHERLIWACTREEHYIVGKELYKRIRERFPEISLSVCNHLIWL 399 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAY-------TIMASIYTAQGKFNIVDSIIKEMV 518 + K K + AL++++ ++ G EPN +Y I+ S + +G + ++ +M Sbjct: 400 MGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASRRGIWRWGVRLLNKME 459 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 G++P +NAV+ C++ + A + F+ M P +SY L+ AL Sbjct: 460 DKGLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYD 519 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A+ ++ I G+ + AY + S Sbjct: 520 EAFRVWNHMIKVGIEPNLYAYTTMAS 545 >ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Capsella rubella] gi|482561642|gb|EOA25833.1| 
hypothetical protein CARUB_v10019206mg [Capsella rubella] Length = 673 Score = 436 bits (1122), Expect = e-120 Identities = 200/265 (75%), Positives = 238/265 (89%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALEIYEDLLD+GP+PNN+SYEL+VSHF+ILLSAA +RGIWRWGV Sbjct: 392 CNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFSILLSAASRRGIWRWGV 451 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKME+K LKP SR WN+VLVACSKASET+AA+QIF+ MV+ GEKPTVISYGALLSAL Sbjct: 452 RLLNKMEDKNLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSAL 511 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA +VW HM+K+GIEPNLYAYT MAS+ T Q KFN++D+++KEM S G+EP+V Sbjct: 512 EKGKLYDEAFRVWNHMVKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSV 571 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NAVISGCA+NG+ VAYEWF RMK++N+ PNE++YEMLIEALAND KPRLAYEL+ + Sbjct: 572 VTYNAVISGCAKNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHLK 631 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL LS+K YDAV+ S YGA Sbjct: 632 AQNEGLKLSSKPYDAVVKSAETYGA 656 Score = 71.6 bits (174), Expect = 3e-10 Identities = 47/206 (22%), Positives = 99/206 (48%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LLN M+ GLKPS E ++ AC++ ++++R+ E+ + ++ L+ Sbjct: 339 LKLLNAMDSAGLKPSREEHERLIWACTREEHYIVGKELYKRIRERFPEISLSVCNHLIWL 398 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAY-------TIMASIYTAQGKFNIVDSIIKEMV 518 + K K + AL++++ ++ G EPN +Y +I+ S + +G + ++ +M Sbjct: 399 MGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFSILLSAASRRGIWRWGVRLLNKME 458 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 ++P +NAV+ C++ + A + F+ M P +SY L+ AL Sbjct: 459 DKNLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYD 518 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A+ ++ + G+ + AY + S Sbjct: 519 EAFRVWNHMVKVGIEPNLYAYTTMAS 544 >ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg 
[Eutrema salsugineum] gi|557101036|gb|ESQ41399.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum] Length = 688 Score = 436 bits (1121), Expect = e-120 Identities = 201/265 (75%), Positives = 239/265 (90%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALEIYEDLLD+GP+PNN+SYEL+VSHFNILLSAA +RGIWRWGV Sbjct: 404 CNHLIWLMGKAKKWWAALEIYEDLLDQGPEPNNLSYELVVSHFNILLSAASRRGIWRWGV 463 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKME+KGLKP SR WN+VLVACSKASET+AA+QIF+ MVE GEKPTVISYGALLSAL Sbjct: 464 RLLNKMEDKGLKPQSRHWNAVLVACSKASETAAAIQIFKAMVENGEKPTVISYGALLSAL 523 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA +VW HMIK+GIEPN++AYTIMAS+ T Q KFN++D+++KEM S G+EP+V Sbjct: 524 EKGKLYDEAFRVWNHMIKVGIEPNVHAYTIMASVLTGQQKFNLLDTLLKEMSSKGIEPSV 583 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VT+NA+ISGCARN + VAYEWF RM+ +N+ PNE++YEMLIEALAND KPRLAYEL+ + Sbjct: 584 VTYNAIISGCARNELSGVAYEWFHRMRGENVEPNEITYEMLIEALANDAKPRLAYELHLK 643 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL LS+K YDAV+ S +YGA Sbjct: 644 AQNEGLKLSSKPYDAVVKSAESYGA 668 Score = 75.1 bits (183), Expect = 3e-11 Identities = 49/206 (23%), Positives = 99/206 (48%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LLN M+ GLKPS E ++ AC++ ++++R+ E+ + ++ L+ Sbjct: 351 LKLLNAMDNAGLKPSREEHERLIWACTREEHYVVGKELYKRIRERFPEISLSVCNHLIWL 410 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAY-------TIMASIYTAQGKFNIVDSIIKEMV 518 + K K + AL++++ ++ G EPN +Y I+ S + +G + ++ +M Sbjct: 411 MGKAKKWWAALEIYEDLLDQGPEPNNLSYELVVSHFNILLSAASRRGIWRWGVRLLNKME 470 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 G++P +NAV+ C++ + A + F+ M P +SY L+ AL Sbjct: 471 DKGLKPQSRHWNAVLVACSKASETAAAIQIFKAMVENGEKPTVISYGALLSALEKGKLYD 530 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A+ ++ I G+ + AY + S Sbjct: 531 EAFRVWNHMIKVGIEPNVHAYTIMAS 556 Score 
= 56.6 bits (135), Expect = 1e-05 Identities = 40/159 (25%), Positives = 78/159 (49%), Gaps = 6/159 (3%) Frame = +3 Query: 297 RRMVEKGEK--PTVISYGALLSALEKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYT 470 R+ +E G P + Y +LL A+++ + + E ++ M + GI PN+ Y + IY Sbjct: 191 RKKIESGGLIGPNLFIYNSLLGAMKESRGFGETEKILSDMEEEGIVPNIVTYNTLMVIYM 250 Query: 471 AQGKFNIVDSIIKEMVSTGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMK---TQNITP 641 +G+F+ I+ + G EP+ VT++ + R G A E+F ++ ++ Sbjct: 251 EEGEFHKALGILDLVKEKGFEPSPVTYSTALLVYRRLEDGMGALEFFAELREKYSKREIG 310 Query: 642 NEVSYEMLIEALANDG-KPRLAYELYARAINKGLSLSTK 755 N+ Y+ E + + R+ Y++ R + K +L+TK Sbjct: 311 NDADYDWEFEFVKLENFIGRICYQVMRRWLVKDENLTTK 349 >ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Fragaria vesca subsp. vesca] Length = 657 Score = 435 bits (1119), Expect = e-120 Identities = 200/265 (75%), Positives = 243/265 (91%), Gaps = 1/265 (0%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IW+MG+AKKWWAALEIYED+LDKGPKPNNMSYEL+VSHFN+LL+AARK+GIWRWGV Sbjct: 370 CNHVIWVMGKAKKWWAALEIYEDMLDKGPKPNNMSYELVVSHFNVLLTAARKKGIWRWGV 429 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKMEEKGLKP S+EWN+VLVACSKA+ETSAAV+IFRRMVE+G+KPT++SYGALLSAL Sbjct: 430 RLLNKMEEKGLKPRSKEWNAVLVACSKAAETSAAVKIFRRMVEQGQKPTILSYGALLSAL 489 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEA QVW+HMIK+G++PNLYAYTIMAS+++ GKFN+V++I++EMVS+G+EPTV Sbjct: 490 EKGKLYDEARQVWEHMIKVGVKPNLYAYTIMASVFSGHGKFNLVETILQEMVSSGIEPTV 549 Query: 543 VTFNAVISGCARNGMGSV-AYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYA 719 VT+NA+ISGCARN S AY+WF RMK NI PN V+YEM+IEALA +GKPRLAYELY Sbjct: 550 VTYNAIISGCARNDSSSADAYDWFDRMKANNIPPNNVTYEMMIEALAKEGKPRLAYELYL 609 Query: 720 RAINKGLSLSTKAYDAVISSTHAYG 794 RA N+G+ LS+KAYD ++ S+ +G Sbjct: 610 RAQNQGIHLSSKAYDILVQSSIDFG 634 >ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda] gi|548855838|gb|ERN13701.1| hypothetical protein 
AMTR_s00049p00149530 [Amborella trichopoda] Length = 754 Score = 417 bits (1071), Expect = e-114 Identities = 192/265 (72%), Positives = 231/265 (87%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWA+LE+YE++LDKGPKPNN+SYEL+VS FNILLSAA +RGIW W + Sbjct: 431 CNHVIWLMGKAKKWWASLEVYEEMLDKGPKPNNLSYELMVSQFNILLSAASRRGIWNWAI 490 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKM+EKG+KP +REWN+ LVACS+ASE +AAVQIF RMVE+GEKPT++SYGALLSAL Sbjct: 491 RLLNKMQEKGIKPRTREWNAALVACSRASEAAAAVQIFMRMVEQGEKPTILSYGALLSAL 550 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYD+A QVW+HMIK+G++PNLYAYT M SIY QG+ VD +I+EM S G+EPTV Sbjct: 551 EKGKLYDKAHQVWEHMIKVGVQPNLYAYTTMLSIYIKQGRLKAVDIVIREMNSLGIEPTV 610 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VTFNA+ISGCA GMG A+EWF RMK +NI PNE++YEMLIEALANDGKPRLAYE+Y R Sbjct: 611 VTFNAIISGCAYKGMGGAAFEWFHRMKAKNIEPNEITYEMLIEALANDGKPRLAYEVYLR 670 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+ L LS KAYD+V+ S++ Y A Sbjct: 671 ARNEDLLLSPKAYDSVLRSSYQYKA 695 Score = 78.2 bits (191), Expect = 3e-12 Identities = 48/206 (23%), Positives = 102/206 (49%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 ++LL ++++ GLKP + ++ AC+ A ++++R+ E + ++ ++ Sbjct: 378 LKLLIELDKAGLKPGRAIYERLIWACTNEGHYIVAKELYQRIRENNTEISLSVCNHVIWL 437 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIY-------TAQGKFNIVDSIIKEMV 518 + K K + +L+V++ M+ G +PN +Y +M S + + +G +N ++ +M Sbjct: 438 MGKAKKWWASLEVYEEMLDKGPKPNNLSYELMVSQFNILLSAASRRGIWNWAIRLLNKMQ 497 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 G++P +NA + C+R + A + F RM Q P +SY L+ AL Sbjct: 498 EKGIKPRTREWNAALVACSRASEAAAAVQIFMRMVEQGEKPTILSYGALLSALEKGKLYD 557 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A++++ I G+ + AY ++S Sbjct: 558 KAHQVWEHMIKVGVQPNLYAYTTMLS 583 >ref|XP_003576898.1| PREDICTED: pentatricopeptide repeat-containing 
protein At3g46610-like [Brachypodium distachyon] Length = 623 Score = 414 bits (1064), Expect = e-113 Identities = 187/265 (70%), Positives = 235/265 (88%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG+AKKWWAALEIYEDLL+KGP+PNN+SYELI+SHFNILL+AA++RGIWRWGV Sbjct: 340 CNHLIWLMGKAKKWWAALEIYEDLLEKGPQPNNLSYELIMSHFNILLNAAKRRGIWRWGV 399 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLL+KM+EKGLKP SREWNSVL+ACS+A+ETSAAV IF+RM+++G KP V+SYGALLSAL Sbjct: 400 RLLDKMQEKGLKPGSREWNSVLLACSRAAETSAAVNIFKRMIDEGLKPDVVSYGALLSAL 459 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEAL+VWKHM+K+GI+PNL+AYTI+ SIY +G ++VD+++++M+ +EPTV Sbjct: 460 EKGKLYDEALRVWKHMLKVGIDPNLHAYTILVSIYIGKGNHDMVDTVLRDMLYAKIEPTV 519 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VTFNA+IS CARN G A+EWF RMK QNI P+E++Y++LIEAL DGKP+LAYE+Y R Sbjct: 520 VTFNAIISACARNKKGGAAFEWFHRMKMQNIEPDEITYQVLIEALVQDGKPKLAYEMYIR 579 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A N+GL LS K+YD VI + AYG+ Sbjct: 580 ACNQGLKLSAKSYDTVIDACQAYGS 604 Score = 67.8 bits (164), Expect = 4e-09 Identities = 52/239 (21%), Positives = 111/239 (46%), Gaps = 10/239 (4%) Frame = +3 Query: 90 KPNNMSYELI---VSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPSSREWNSVLVACS 260 +P + YE + V H + S A +++L M+E G++P + ++ AC Sbjct: 254 EPEFVKYEKLAVRVCHIAMRRSLAGGNNPATAALKVLLAMDEAGVRPDRSYYERLVWACM 313 Query: 261 KASETSAAVQIFRRMVEKGEKPTVISYGALLSALEKGKLYDEALQVWKHMIKMGIEPNLY 440 + A ++++R+ E ++ L+ + K K + AL++++ +++ G +PN Sbjct: 314 GEEHYTIAKELYQRIRECDGGISLSVCNHLIWLMGKAKKWWAALEIYEDLLEKGPQPNNL 373 Query: 441 AYTIMASIYT-------AQGKFNIVDSIIKEMVSTGVEPTVVTFNAVISGCARNGMGSVA 599 +Y ++ S + +G + ++ +M G++P +N+V+ C+R S A Sbjct: 374 SYELIMSHFNILLNAAKRRGIWRWGVRLLDKMQEKGLKPGSREWNSVLLACSRAAETSAA 433 Query: 600 YEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYARAINKGLSLSTKAYDAVIS 776 F+RM + + P+ VSY L+ AL A ++ + G+ + AY ++S Sbjct: 434 
VNIFKRMIDEGLKPDVVSYGALLSALEKGKLYDEALRVWKHMLKVGIDPNLHAYTILVS 492 >ref|XP_004962591.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Setaria italica] Length = 671 Score = 411 bits (1056), Expect = e-112 Identities = 185/265 (69%), Positives = 232/265 (87%) Frame = +3 Query: 3 CNHIIWLMGRAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFNILLSAARKRGIWRWGV 182 CNH+IWLMG++KKWWAALEIYEDLLDKGPKPNN+SYELI+SHFNILL+AA++RGIWRWGV Sbjct: 351 CNHLIWLMGKSKKWWAALEIYEDLLDKGPKPNNLSYELIMSHFNILLNAAKRRGIWRWGV 410 Query: 183 RLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSAL 362 RLLNKM+EKGLKP S+EWN+VLVACS+ASETSAAV +F++M+E+G KP V+SYGALLSAL Sbjct: 411 RLLNKMQEKGLKPGSKEWNAVLVACSRASETSAAVDVFKKMIEEGLKPDVVSYGALLSAL 470 Query: 363 EKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYTAQGKFNIVDSIIKEMVSTGVEPTV 542 EKGKLYDEAL+VW+HM K+G++PNLYAYTI+ SIY +G +VD+++ +M+S +EPTV Sbjct: 471 EKGKLYDEALRVWEHMCKVGVKPNLYAYTILVSIYIGKGNHAMVDAVLHDMLSKQIEPTV 530 Query: 543 VTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPRLAYELYAR 722 VTFNA+IS C +N MG A+EWF RMK ++I PNE++Y+MLIEAL DGKPRLAYE+Y R Sbjct: 531 VTFNAIISACVKNKMGGTAFEWFHRMKMRSIEPNEITYQMLIEALVQDGKPRLAYEMYMR 590 Query: 723 AINKGLSLSTKAYDAVISSTHAYGA 797 A ++GL L K+YD V+ + AYG+ Sbjct: 591 ACSQGLELPAKSYDTVMEACKAYGS 615 Score = 68.9 bits (167), Expect = 2e-09 Identities = 45/206 (21%), Positives = 101/206 (49%), Gaps = 7/206 (3%) Frame = +3 Query: 180 VRLLNKMEEKGLKPSSREWNSVLVACSKASETSAAVQIFRRMVEKGEKPTVISYGALLSA 359 +++L M+E G+KP ++ ++ AC+ + ++++R+ E + ++ L+ Sbjct: 298 LKVLLAMDEAGVKPERSDYERLVWACTGEEHYTIGKELYQRIRELNGEISLSVCNHLIWL 357 Query: 360 LEKGKLYDEALQVWKHMIKMGIEPNLYAYTIMASIYT-------AQGKFNIVDSIIKEMV 518 + K K + AL++++ ++ G +PN +Y ++ S + +G + ++ +M Sbjct: 358 MGKSKKWWAALEIYEDLLDKGPKPNNLSYELIMSHFNILLNAAKRRGIWRWGVRLLNKMQ 417 Query: 519 STGVEPTVVTFNAVISGCARNGMGSVAYEWFQRMKTQNITPNEVSYEMLIEALANDGKPR 698 G++P +NAV+ C+R S A + F++M + + P+ VSY L+ AL Sbjct: 418 EKGLKPGSKEWNAVLVACSRASETSAAVDVFKKMIEEGLKPDVVSYGALLSALEKGKLYD 477 Query: 699 LAYELYARAINKGLSLSTKAYDAVIS 776 A ++ 
G+ + AY ++S Sbjct: 478 EALRVWEHMCKVGVKPNLYAYTILVS 503