BLASTX nr result
ID: Sinomenium21_contig00028275
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00028275 (989 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_003631274.1| PREDICTED: pentatricopeptide repeat-containi...   273   8e-71
emb|CBI28554.3| unnamed protein product [Vitis vinifera]              270   5e-70
emb|CAN65318.1| hypothetical protein VITISV_006411 [Vitis vinifera]   266   1e-68
ref|XP_006483887.1| PREDICTED: pentatricopeptide repeat-containi...   260   7e-67
ref|XP_006438343.1| hypothetical protein CICLE_v100305772mg, par...   259   9e-67
ref|XP_004298478.1| PREDICTED: putative pentatricopeptide repeat...   259   9e-67
ref|XP_006380228.1| pentatricopeptide repeat-containing family p...   253   6e-65
ref|XP_007224509.1| hypothetical protein PRUPE_ppa022727mg, part...   253   8e-65
gb|EXB82575.1| hypothetical protein L484_027752 [Morus notabilis]     249   1e-63
ref|XP_006350274.1| PREDICTED: pentatricopeptide repeat-containi...   241   3e-61
ref|XP_007044687.1| F28C11.9, putative [Theobroma cacao] gi|5087...   240   6e-61
gb|EYU22504.1| hypothetical protein MIMGU_mgv1a018311mg, partial...   240   7e-61
gb|AAC98003.1| This gene may be continued from BAC T23E23 [Arabi...   238   2e-60
ref|NP_173759.2| pentatricopeptide repeat-containing protein [Ar...   238   2e-60
ref|XP_004237297.1| PREDICTED: pentatricopeptide repeat-containi...   238   3e-60
ref|XP_006585945.1| PREDICTED: pentatricopeptide repeat-containi...   236   8e-60
ref|XP_006416082.1| hypothetical protein EUTSA_v10006630mg [Eutr...   231   4e-58
gb|AAF79584.1|AC007945_4 F28C11.9 [Arabidopsis thaliana]              231   4e-58
ref|XP_006305992.1| hypothetical protein CARUB_v10011268mg [Caps...
230 8e-58 ref|XP_004161088.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope... 230 8e-58 >ref|XP_003631274.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic-like [Vitis vinifera] Length = 667 Score = 273 bits (698), Expect = 8e-71 Identities = 135/259 (52%), Positives = 172/259 (66%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+G+D+V C L+HCC IKSGFE D VSCSL+DAYS+ GH Sbjct: 409 DEGIGVDEVTLSTSLKASLVSAFGSLACCRLLHCCAIKSGFEFDTAVSCSLIDAYSKYGH 468 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++LS ++F+Q+ PN ICFTSII+GYARN + ++G+ D +TFL LTG Sbjct: 469 VELSHQIFEQLHSPNAICFTSIINGYARNGMGREGLEMLEIMAKQGLKPDRVTFLCALTG 528 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+EG VF SM++ HGI PD++HYSCMVD ++K AP K DSV Sbjct: 529 CSHSGLVEEGRSVFNSMKSIHGIHPDQQHYSCMVDLLGRAGLLDEAEGLLKHAPAKSDSV 588 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SCR+H + VGRRAA+ LMEL+P+DPA YLQAS FYSEIG+ S +IREI Sbjct: 589 MWSSLLQSCRIHGDTIVGRRAAKVLMELEPEDPATYLQASSFYSEIGELEISMQIREIAI 648 Query: 269 IMDIKRDFGYSSIEINSQS 213 +KR+ G+S IE+NS S Sbjct: 649 ARRMKREMGHSLIEVNSHS 667 >emb|CBI28554.3| unnamed protein product [Vitis vinifera] Length = 1125 Score = 270 bits (691), Expect = 5e-70 Identities = 133/257 (51%), Positives = 171/257 (66%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+G+D+V C L+HCC IKSGFE D VSCSL+DAYS+ GH Sbjct: 409 DEGIGVDEVTLSTSLKASLVSAFGSLACCRLLHCCAIKSGFEFDTAVSCSLIDAYSKYGH 468 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++LS ++F+Q+ PN ICFTSII+GYARN + ++G+ D +TFL LTG Sbjct: 469 VELSHQIFEQLHSPNAICFTSIINGYARNGMGREGLEMLEIMAKQGLKPDRVTFLCALTG 528 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+EG VF SM++ HGI PD++HYSCMVD ++K AP K DSV Sbjct: 529 CSHSGLVEEGRSVFNSMKSIHGIHPDQQHYSCMVDLLGRAGLLDEAEGLLKHAPAKSDSV 588 Query: 449 
MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SCR+H + VGRRAA+ LMEL+P+DPA YLQAS FYSEIG+ S +IREI Sbjct: 589 MWSSLLQSCRIHGDTIVGRRAAKVLMELEPEDPATYLQASSFYSEIGELEISMQIREIAI 648 Query: 269 IMDIKRDFGYSSIEINS 219 +KR+ G+S IE+N+ Sbjct: 649 ARRMKREMGHSLIEVNT 665 >emb|CAN65318.1| hypothetical protein VITISV_006411 [Vitis vinifera] Length = 1052 Score = 266 bits (679), Expect = 1e-68 Identities = 132/260 (50%), Positives = 171/260 (65%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+G+D+V C L+HC IKSGFE D VSCSL+DAYS+ GH Sbjct: 398 DEGIGVDEVTLSTSLKASLVSAFGSLAXCRLLHCXAIKSGFEFDTAVSCSLIDAYSKYGH 457 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++LS ++F+Q+ PN ICFTSII+GYARN + ++G+ D +TFL LTG Sbjct: 458 VELSHQIFEQLHSPNAICFTSIINGYARNGMGREGLEMLEIMAKQGLKPDRVTFLCALTG 517 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+EG VF SM++ HGI PD++HYSCMVD ++K AP K DSV Sbjct: 518 CSHSGLVEEGRSVFNSMKSIHGIHPDQQHYSCMVDLLGRAGLLDEAEGLLKHAPAKSDSV 577 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SCR+H + VGRRAA+ LMEL+ +DPA YLQAS FYSEIG+ S +IREI Sbjct: 578 MWSSLLQSCRIHGDTIVGRRAAKVLMELEXEDPATYLQASSFYSEIGELEISMQIREIAI 637 Query: 269 IMDIKRDFGYSSIEINSQSY 210 +KR+ G+S IE+NS ++ Sbjct: 638 ARRMKREMGHSLIEVNSHTH 657 >ref|XP_006483887.1| PREDICTED: pentatricopeptide repeat-containing protein At3g09040, mitochondrial-like [Citrus sinensis] Length = 666 Score = 260 bits (664), Expect = 7e-67 Identities = 132/257 (51%), Positives = 168/257 (65%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+GLD+V SC L+HCC IKSGFES+I VSCSL+DAYSR GH Sbjct: 408 DEGIGLDEVTLSTTLKALSVSASANVGSCRLLHCCAIKSGFESNIAVSCSLMDAYSRCGH 467 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 I+LS +VF +I PN++CFTSI++GY+RN + QRG++ D +TFL VL G Sbjct: 468 IELSHQVFKKIPSPNVVCFTSIMNGYSRNGMGREALDMLEVMIQRGLIPDKVTFLCVLAG 527 Query: 
629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 C+HSG+VKEG LVF SM++ +GI DR+HYSCM+D E+++Q P GD + Sbjct: 528 CNHSGMVKEGQLVFNSMKSVYGIDADRQHYSCMIDMLGRAGILDKAEELLQQTPGGGDCM 587 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SCRVH N +GRR A LMEL+P D A Y Q S FYSEIG+F S +IRE Sbjct: 588 MWSSLLRSCRVHGNEIIGRRVANILMELEPVDFAVYSQVSNFYSEIGEFEVSMQIRETAL 647 Query: 269 IMDIKRDFGYSSIEINS 219 + RD G+S IE+NS Sbjct: 648 ARKLTRDIGHSLIEVNS 664 >ref|XP_006438343.1| hypothetical protein CICLE_v100305772mg, partial [Citrus clementina] gi|557540539|gb|ESR51583.1| hypothetical protein CICLE_v100305772mg, partial [Citrus clementina] Length = 928 Score = 259 bits (663), Expect = 9e-67 Identities = 132/256 (51%), Positives = 167/256 (65%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+GLD+V SC L+HCC IKSGFES+I VSCSL+DAYSR GH Sbjct: 408 DEGIGLDEVTLSTTLKALSVSASANVGSCRLLHCCAIKSGFESNIAVSCSLMDAYSRCGH 467 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 I+LS +VF +I PN++CFTSI++GY+RN + QRG++ D +TFL VL G Sbjct: 468 IELSHQVFKKIPSPNVVCFTSIMNGYSRNGMGREALDMLEVMIQRGLIPDKVTFLCVLAG 527 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 C+HSG+VKEG LVF SM++ +GI DR+HYSCM+D E+++Q P GD V Sbjct: 528 CNHSGMVKEGQLVFNSMKSVYGIDADRQHYSCMIDMLGRAGILDKAEELLQQTPGGGDCV 587 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SCRVH N +GRR A LMEL+P D A Y Q S FYSEIG+F S +IRE Sbjct: 588 MWSSLLRSCRVHGNEIIGRRVANILMELEPVDFAVYSQVSNFYSEIGEFEVSMQIRETAL 647 Query: 269 IMDIKRDFGYSSIEIN 222 + RD G+S IE+N Sbjct: 648 ARKLTRDIGHSLIEVN 663 >ref|XP_004298478.1| PREDICTED: putative pentatricopeptide repeat-containing protein At2g01510-like [Fragaria vesca subsp. 
vesca] Length = 658 Score = 259 bits (663), Expect = 9e-67 Identities = 128/260 (49%), Positives = 169/260 (65%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+G D+V SC L+HCC +KSGFESDI VSCSL++AY R GH Sbjct: 398 DEGIGFDEVTLSTTLKALSLSVLASLGSCRLLHCCAVKSGFESDIAVSCSLIEAYGRCGH 457 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 +KLS +VFD++ PN+ICFTSII GYARN + ++G+ D +T L VL+G Sbjct: 458 VKLSRQVFDELPSPNVICFTSIIHGYARNGMGSECLQLVQDMVEKGLKPDKVTVLCVLSG 517 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+E L+F SME +G+ PDR+HYSCMVD E+++QAP +GD V Sbjct: 518 CSHSGLVEEAKLLFNSMEILYGVSPDRKHYSCMVDLLGRAGLLEEAEELLQQAPGEGDCV 577 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SC VH+N VGRR + L++L+ +DPA LQAS FYSEI +F T+ +IRE Sbjct: 578 MWSSLLRSCMVHKNETVGRRTVKTLLKLEGEDPAILLQASNFYSEIREFDTAMQIREFAI 637 Query: 269 IMDIKRDFGYSSIEINSQSY 210 + RD G+S +E+ S S+ Sbjct: 638 AQQVTRDIGHSLVEVYSHSH 657 >ref|XP_006380228.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550333749|gb|ERP58025.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 702 Score = 253 bits (647), Expect = 6e-65 Identities = 125/257 (48%), Positives = 170/257 (66%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+GLD+V SC LVHCC +K GF SDI VSCSL+DAYSR GH Sbjct: 426 DEGIGLDEVTFSTTLKALSVSEFASMDSCRLVHCCAMKLGFGSDIAVSCSLIDAYSRCGH 485 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++LS +VF+Q+ PN+ICFTSII+G A+N + ++G+ D +TFL VLTG Sbjct: 486 VQLSKKVFEQLPSPNVICFTSIINGLAQNGLGRECLQTFEAMIRKGLEPDKVTFLCVLTG 545 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGL +EG L+F SM+ +GICP + H+SCMVD E+ ++AP +GD V Sbjct: 546 CSHSGLFEEGRLIFYSMKAQYGICPAKEHFSCMVDILGRAGLLDEAEELTQKAPGRGDCV 605 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 
270 MW+SLL SCR+++N VGRRAA+AL+EL P+D + YLQ S FYS+IG++ +S IRE+ Sbjct: 606 MWTSLLRSCRIYRNEIVGRRAAKALLELDPEDFSVYLQVSNFYSDIGEYESSMHIRELAI 665 Query: 269 IMDIKRDFGYSSIEINS 219 + R+ G S IE+N+ Sbjct: 666 ARKLTREIGRSFIEVNN 682 >ref|XP_007224509.1| hypothetical protein PRUPE_ppa022727mg, partial [Prunus persica] gi|462421445|gb|EMJ25708.1| hypothetical protein PRUPE_ppa022727mg, partial [Prunus persica] Length = 682 Score = 253 bits (646), Expect = 8e-65 Identities = 125/257 (48%), Positives = 171/257 (66%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+GLD+V SC+LVHC IKSGFESDIVVSCSL+DAY+R GH Sbjct: 425 DEGIGLDEVTLSTTLKALSASAMASLGSCKLVHCSAIKSGFESDIVVSCSLIDAYARCGH 484 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 +KLS +VF+++ PN +CFTSII GYARN + ++G+ D +T L VL+G Sbjct: 485 VKLSRQVFEELPSPNAVCFTSIIHGYARNGMGSEGLHLLQAMIRKGLKPDKVTILGVLSG 544 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 C+HSGLV+E ++F+SM+ +GI PDR+H+SCMVD E+++QAP G V Sbjct: 545 CNHSGLVEEARVLFDSMKNLYGISPDRKHFSCMVDLLGRAGLLDEAEELLQQAPGNGHCV 604 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SCRVH+N VGRR + L+EL +DP +LQAS FYSEIG+F + ++REI+ Sbjct: 605 MWSSLLRSCRVHKNELVGRRTVKTLLELDVEDPDIWLQASNFYSEIGEFDIAMQMREIET 664 Query: 269 IMDIKRDFGYSSIEINS 219 +K + G+S +E+N+ Sbjct: 665 ARKVKWEMGHSLVELNT 681 >gb|EXB82575.1| hypothetical protein L484_027752 [Morus notabilis] Length = 674 Score = 249 bits (636), Expect = 1e-63 Identities = 121/257 (47%), Positives = 175/257 (68%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+GLD+V S +L+H +KSG ESDI VSCSL+DAYS+ G+ Sbjct: 416 DEGIGLDEVTLSTTLKALSISSLASSSSFKLLHSRAVKSGLESDIAVSCSLIDAYSKHGY 475 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 IKLS +VF+ + PN+ CFTSII+GYA+N + ++G+ D +TFL VLTG Sbjct: 476 IKLSRQVFENLSSPNVKCFTSIINGYAQNGMGRESLDLFHAMIRKGLKPDKVTFLCVLTG 535 Query: 629 
CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 C+HSGLV+EG ++F+++++++G+ PDR+HYSCMVD ++++QAP +GD+ Sbjct: 536 CNHSGLVREGKVLFDALKSSYGVSPDRQHYSCMVDLLGRAGLLEEAKKLLEQAPGRGDAT 595 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 MWSSLL SCR+H+N +GRRAAE L+EL+P++PA YLQ S FYSE +F+TS +IRE+ Sbjct: 596 MWSSLLRSCRMHKNETLGRRAAEVLLELEPENPAIYLQVSSFYSEFEEFSTSLQIRELAV 655 Query: 269 IMDIKRDFGYSSIEINS 219 + R+ G+S IE +S Sbjct: 656 ARKVTREIGHSLIETSS 672 >ref|XP_006350274.1| PREDICTED: pentatricopeptide repeat-containing protein At3g03580-like [Solanum tuberosum] Length = 662 Score = 241 bits (616), Expect = 3e-61 Identities = 117/228 (51%), Positives = 156/228 (68%) Frame = -1 Query: 905 CELVHCCVIKSGFESDIVVSCSLVDAYSRAGHIKLSCRVFDQIVEPNLICFTSIISGYAR 726 C L+HCC IKSGF+SD +V CSL+DAYSR+G I+ S ++F+ + PN+ FTSII+ YA+ Sbjct: 432 CCLLHCCAIKSGFDSDSMVLCSLIDAYSRSGQIRFSQQIFEALPSPNIFGFTSIINAYAQ 491 Query: 725 NXXXXXXXXXXXXLYQRGMVADSITFLSVLTGCSHSGLVKEGLLVFESMETNHGICPDRR 546 + QRG+ D +TFL +L GC+HSGLV+EG ++F+SM T H + PDRR Sbjct: 492 KGMGYECLAVFEEMIQRGLKPDKVTFLCILLGCNHSGLVEEGKMIFDSMRTIHDVYPDRR 551 Query: 545 HYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSVMWSSLLWSCRVHQNVEVGRRAAEALMEL 366 H SCMVD E++ A +GDSVMWSSLL SCR+HQN VGRRAA+ LM+L Sbjct: 552 HCSCMVDLLGRAGLVNEAEELLMHASSEGDSVMWSSLLRSCRIHQNEHVGRRAAKRLMDL 611 Query: 365 KPDDPAAYLQASGFYSEIGDFATSAEIREIKAIMDIKRDFGYSSIEIN 222 +P+DP+ +LQAS FYSEIG+F T+ IRE+ + RD GYS I ++ Sbjct: 612 EPEDPSFWLQASNFYSEIGEFETAIYIREVAVARKMSRDIGYSLIRVH 659 >ref|XP_007044687.1| F28C11.9, putative [Theobroma cacao] gi|508708622|gb|EOY00519.1| F28C11.9, putative [Theobroma cacao] Length = 1063 Score = 240 bits (613), Expect = 6e-61 Identities = 114/239 (47%), Positives = 161/239 (67%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G+G+D+V SC+L+HCC IKSG+ESD+ VSCSL++ YSR GH Sbjct: 400 DEGIGIDEVTLSTTLKALSVTTYASLGSCKLLHCCAIKSGYESDMAVSCSLINGYSRCGH 459 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 
630 +LSC+VF + PN+ CFTSII+GYARN + Q+G++ D +TFL VL+G Sbjct: 460 FELSCQVFKTLPSPNVFCFTSIINGYARNGMGKEGVSLLEAMIQKGLIPDKVTFLCVLSG 519 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 C H+GLV+EG LVF M++ +GICP+R+H+SCM+D ++++QAP GD V Sbjct: 520 CDHAGLVEEGKLVFNLMKSFYGICPERQHFSCMIDLLGRAGLLSEAEKLLQQAPGGGDPV 579 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIK 273 MWSSLL SC +++N VG+RAA+ LM+L +D A+ LQ S FYSE+G+F + +IREI+ Sbjct: 580 MWSSLLRSCSIYKNEIVGKRAAKVLMDLGQEDFASCLQVSNFYSEVGEFEAALQIREIE 638 >gb|EYU22504.1| hypothetical protein MIMGU_mgv1a018311mg, partial [Mimulus guttatus] Length = 621 Score = 240 bits (612), Expect = 7e-61 Identities = 119/228 (52%), Positives = 156/228 (68%) Frame = -1 Query: 905 CELVHCCVIKSGFESDIVVSCSLVDAYSRAGHIKLSCRVFDQIVEPNLICFTSIISGYAR 726 C L+H C IKSGFESD V+CSL+DAYS++G I+ S ++F+++ PN+ CFTSIIS AR Sbjct: 391 CPLLHSCTIKSGFESDNAVACSLIDAYSKSGEIEFSRQIFNELPSPNVFCFTSIISALAR 450 Query: 725 NXXXXXXXXXXXXLYQRGMVADSITFLSVLTGCSHSGLVKEGLLVFESMETNHGICPDRR 546 N +++ G D +TFLSVL GC+HSGLV+E L+F SM+T G+ PDR+ Sbjct: 451 NGMGNECLEMLNGMFRNGREPDRVTFLSVLMGCNHSGLVEEARLLFHSMQTRFGVYPDRQ 510 Query: 545 HYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSVMWSSLLWSCRVHQNVEVGRRAAEALMEL 366 HYSCMVD E++K P +GDSV+WSS+L SCRVHQN +VG+RAAE LM L Sbjct: 511 HYSCMVDLLGRAGLLEEAEELVKDTPWEGDSVIWSSVLRSCRVHQNEQVGKRAAEILMGL 570 Query: 365 KPDDPAAYLQASGFYSEIGDFATSAEIREIKAIMDIKRDFGYSSIEIN 222 + D+P+ LQAS FYS+IGDF REI A I+R+ G SSIE++ Sbjct: 571 ETDNPSGLLQASNFYSDIGDFERMKYCREIGAARRIRREIGQSSIEVH 618 >gb|AAC98003.1| This gene may be continued from BAC T23E23 [Arabidopsis thaliana] Length = 278 Score = 238 bits (608), Expect = 2e-60 Identities = 117/257 (45%), Positives = 164/257 (63%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G G+D+V SC LVHCC IKSG+ +D+ VSCSL+DAY+++G Sbjct: 22 DEGTGIDEVTLSTVLKALSLSLPESLHSCTLVHCCAIKSGYAADVAVSCSLIDAYTKSGQ 81 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++S +VFD++ PN+ C 
TSII+GYARN + + ++ D +T LSVL+G Sbjct: 82 NEVSRKVFDELDTPNIFCLTSIINGYARNGMGTDCVKMLREMDRMNLIPDEVTILSVLSG 141 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+EG L+F+S+E+ +GI P R+ Y+CMVD ++ QA D V Sbjct: 142 CSHSGLVEEGELIFDSLESKYGISPGRKLYACMVDLLGRAGLVEKAERLLLQARGDADCV 201 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 WSSLL SCR+H+N +GRRAAE LM L+P++ A Y+Q S FY EIGDF S +IREI A Sbjct: 202 AWSSLLQSCRIHRNETIGRRAAEVLMNLEPENFAVYIQVSKFYFEIGDFEISRQIREIAA 261 Query: 269 IMDIKRDFGYSSIEINS 219 ++ R+ GYSS+ + + Sbjct: 262 SRELMREIGYSSVVVKN 278 >ref|NP_173759.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332192267|gb|AEE30388.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 666 Score = 238 bits (608), Expect = 2e-60 Identities = 117/257 (45%), Positives = 164/257 (63%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G G+D+V SC LVHCC IKSG+ +D+ VSCSL+DAY+++G Sbjct: 410 DEGTGIDEVTLSTVLKALSLSLPESLHSCTLVHCCAIKSGYAADVAVSCSLIDAYTKSGQ 469 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++S +VFD++ PN+ C TSII+GYARN + + ++ D +T LSVL+G Sbjct: 470 NEVSRKVFDELDTPNIFCLTSIINGYARNGMGTDCVKMLREMDRMNLIPDEVTILSVLSG 529 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+EG L+F+S+E+ +GI P R+ Y+CMVD ++ QA D V Sbjct: 530 CSHSGLVEEGELIFDSLESKYGISPGRKLYACMVDLLGRAGLVEKAERLLLQARGDADCV 589 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 WSSLL SCR+H+N +GRRAAE LM L+P++ A Y+Q S FY EIGDF S +IREI A Sbjct: 590 AWSSLLQSCRIHRNETIGRRAAEVLMNLEPENFAVYIQVSKFYFEIGDFEISRQIREIAA 649 Query: 269 IMDIKRDFGYSSIEINS 219 ++ R+ GYSS+ + + Sbjct: 650 SRELMREIGYSSVVVKN 666 >ref|XP_004237297.1| PREDICTED: pentatricopeptide repeat-containing protein At4g32430, mitochondrial-like [Solanum lycopersicum] Length = 509 Score = 238 bits (607), Expect = 3e-60 Identities = 115/228 (50%), Positives = 
153/228 (67%) Frame = -1 Query: 905 CELVHCCVIKSGFESDIVVSCSLVDAYSRAGHIKLSCRVFDQIVEPNLICFTSIISGYAR 726 C L HCC IKSGF+SD +V CSL+DAYSR+G I+ S ++F+ + PN+ FTSII+ YA+ Sbjct: 279 CCLFHCCAIKSGFDSDSMVLCSLIDAYSRSGQIRFSQQIFEALRSPNIFGFTSIINAYAQ 338 Query: 725 NXXXXXXXXXXXXLYQRGMVADSITFLSVLTGCSHSGLVKEGLLVFESMETNHGICPDRR 546 + Q+G+ D +TFL +L GC+HSGLV+EG +F+SM T H + PDRR Sbjct: 339 KGMGYECLAVFEEMIQKGLKPDKVTFLCILLGCNHSGLVEEGKRIFDSMRTLHDVYPDRR 398 Query: 545 HYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSVMWSSLLWSCRVHQNVEVGRRAAEALMEL 366 HYSCMVD E++ +GDSVMWSSLL SCR+HQN VGRRAA+ LM+L Sbjct: 399 HYSCMVDLLGRAGLVNEAEELLMHGSSEGDSVMWSSLLRSCRIHQNEHVGRRAAKRLMDL 458 Query: 365 KPDDPAAYLQASGFYSEIGDFATSAEIREIKAIMDIKRDFGYSSIEIN 222 +P+DP+ +LQAS FYSEIG+F + IRE+ + RD GYS I ++ Sbjct: 459 EPEDPSFWLQASNFYSEIGEFEAAVYIREVAVARKMSRDIGYSLIRVH 506 >ref|XP_006585945.1| PREDICTED: pentatricopeptide repeat-containing protein At3g03580-like [Glycine max] Length = 668 Score = 236 bits (603), Expect = 8e-60 Identities = 117/226 (51%), Positives = 153/226 (67%) Frame = -1 Query: 902 ELVHCCVIKSGFESDIVVSCSLVDAYSRAGHIKLSCRVFDQIVEPNLICFTSIISGYARN 723 +L+HC +KSG D V+CSLVD+YSR GH++LS R+F+ + PN ICFTS+I+ YARN Sbjct: 441 QLLHCYALKSGLGGDAAVACSLVDSYSRWGHVELSRRIFESLPSPNAICFTSMINAYARN 500 Query: 722 XXXXXXXXXXXXLYQRGMVADSITFLSVLTGCSHSGLVKEGLLVFESMETNHGICPDRRH 543 + +RG+ D +T L L GC+H+GLV+EG LVFESM++ HG+ PD RH Sbjct: 501 GAGKEGIAVLQAMIERGLKPDDVTLLCALNGCNHTGLVEEGRLVFESMKSLHGVDPDHRH 560 Query: 542 YSCMVDXXXXXXXXXXXXEVIKQAPVKGDSVMWSSLLWSCRVHQNVEVGRRAAEALMELK 363 +SCMVD E++ QAP KGD MWSSLL SCRVH+N EVG RAA+ L+EL Sbjct: 561 FSCMVDLFCRAGLLHEAEELLLQAPGKGDCFMWSSLLRSCRVHKNEEVGTRAAQVLVELD 620 Query: 362 PDDPAAYLQASGFYSEIGDFATSAEIREIKAIMDIKRDFGYSSIEI 225 PDDPA +LQAS FY+EIG+F S +IRE+ + R+ G+S EI Sbjct: 621 PDDPAVWLQASIFYAEIGNFDASRQIREVALSRKMTREIGHSLTEI 666 >ref|XP_006416082.1| hypothetical protein EUTSA_v10006630mg [Eutrema salsugineum] gi|557093853|gb|ESQ34435.1| hypothetical protein EUTSA_v10006630mg [Eutrema salsugineum] Length = 1105 Score = 231 bits 
(588), Expect = 4e-58 Identities = 114/252 (45%), Positives = 161/252 (63%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G G+D+V S LVHCC IKSG+ SD+ VSCSL+DAYS++G Sbjct: 410 DEGTGIDEVTLSTVLKALSLSVPPSLHSFALVHCCAIKSGYASDVAVSCSLIDAYSKSGQ 469 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++S +VFD++ PN+ C+TSII+GYARN + Q+ +V D +T LSVL+G Sbjct: 470 NEVSRKVFDELDSPNIFCWTSIINGYARNGMGRDCVEMLREMDQKNLVPDEVTILSVLSG 529 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+EG L+F+S+E+ +GI P R+ Y+CMVD ++ +A D + Sbjct: 530 CSHSGLVEEGELIFDSLESKYGISPGRKLYACMVDLLGRAGLVEKAERLLLRARGDADCI 589 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 WSSLL SCR+H N +GRRAAE LM+L+P++ + Y+Q S FY EIGDF S +IRE A Sbjct: 590 AWSSLLQSCRIHGNESIGRRAAEVLMDLEPENFSVYIQISKFYFEIGDFEISRQIRETAA 649 Query: 269 IMDIKRDFGYSS 234 ++ R+ GY++ Sbjct: 650 SRELMREIGYTA 661 >gb|AAF79584.1|AC007945_4 F28C11.9 [Arabidopsis thaliana] Length = 1161 Score = 231 bits (588), Expect = 4e-58 Identities = 114/249 (45%), Positives = 158/249 (63%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+G G+D+V SC LVHCC IKSG+ +D+ VSCSL+DAY+++G Sbjct: 410 DEGTGIDEVTLSTVLKALSLSLPESLHSCTLVHCCAIKSGYAADVAVSCSLIDAYTKSGQ 469 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++S +VFD++ PN+ C TSII+GYARN + + ++ D +T LSVL+G Sbjct: 470 NEVSRKVFDELDTPNIFCLTSIINGYARNGMGTDCVKMLREMDRMNLIPDEVTILSVLSG 529 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 CSHSGLV+EG L+F+S+E+ +GI P R+ Y+CMVD ++ QA D V Sbjct: 530 CSHSGLVEEGELIFDSLESKYGISPGRKLYACMVDLLGRAGLVEKAERLLLQARGDADCV 589 Query: 449 MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKA 270 WSSLL SCR+H+N +GRRAAE LM L+P++ A Y+Q S FY EIGDF S +IREI A Sbjct: 590 AWSSLLQSCRIHRNETIGRRAAEVLMNLEPENFAVYIQVSKFYFEIGDFEISRQIREIAA 649 Query: 269 IMDIKRDFG 243 ++ R+ G Sbjct: 650 SRELMREIG 658 
>ref|XP_006305992.1| hypothetical protein CARUB_v10011268mg [Capsella rubella] gi|482574703|gb|EOA38890.1| hypothetical protein CARUB_v10011268mg [Capsella rubella] Length = 1077 Score = 230 bits (586), Expect = 8e-58 Identities = 114/250 (45%), Positives = 159/250 (63%) Frame = -1 Query: 986 DGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGHI 807 +G G+D+V SC L+HC +KSG+ SD+ VSCSL+DAYS++G Sbjct: 383 EGTGIDEVTLSTVLKAFSLSVPANLHSCALIHCWSMKSGYASDVAVSCSLIDAYSKSGQN 442 Query: 806 KLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTGC 627 ++S +VF ++ PN+ C+TSII+GYARN + Q+ ++ D +T LSVL+GC Sbjct: 443 EVSRKVFIELESPNIFCWTSIINGYARNGMGRECLEMLREMDQKNLMPDEVTILSVLSGC 502 Query: 626 SHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSVM 447 SHSGLV+EG L+F+SME+ +GI P R+ Y+CMVD ++ QA D + Sbjct: 503 SHSGLVEEGELIFDSMESRYGISPGRKLYACMVDLLGRAGLVEKAERLLLQAHGDADCIA 562 Query: 446 WSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREIKAI 267 WSSLL SCR+H+N +GRRAAE LM+L+P+D Y+Q S FY EIGDF S +IREI A Sbjct: 563 WSSLLQSCRIHKNKSIGRRAAEVLMDLEPEDFTVYIQVSKFYFEIGDFEFSRQIREIAAS 622 Query: 266 MDIKRDFGYS 237 ++ R+ GYS Sbjct: 623 RELMREIGYS 632 >ref|XP_004161088.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis sativus] Length = 618 Score = 230 bits (586), Expect = 8e-58 Identities = 116/238 (48%), Positives = 155/238 (65%) Frame = -1 Query: 989 DDGVGLDDVXXXXXXXXXXXXXXXXXXSCELVHCCVIKSGFESDIVVSCSLVDAYSRAGH 810 D+ +GLD+V C L+HCC IKSGFE VVSCSL+D+YSR GH Sbjct: 251 DERIGLDEVTLSTTLKAISICSPDLSR-CRLLHCCAIKSGFEFSSVVSCSLMDSYSRCGH 309 Query: 809 IKLSCRVFDQIVEPNLICFTSIISGYARNXXXXXXXXXXXXLYQRGMVADSITFLSVLTG 630 ++LS +VF++I P +ICFTSII+GYARN + QRG+ D +TFL L G Sbjct: 310 VELSWKVFEEIYSPGVICFTSIINGYARNGLGKEGVEILKMMIQRGLKPDKVTFLCALIG 369 Query: 629 CSHSGLVKEGLLVFESMETNHGICPDRRHYSCMVDXXXXXXXXXXXXEVIKQAPVKGDSV 450 C+HSGLV+EG VFE M+T +GI PD +HYSCMVD ++I++ P K + V Sbjct: 370 CNHSGLVEEGRFVFELMKTLYGIEPDWKHYSCMVDLLGRAGLLDEAEKLIQKVPEKVNGV 429 Query: 449 
MWSSLLWSCRVHQNVEVGRRAAEALMELKPDDPAAYLQASGFYSEIGDFATSAEIREI 276 +WSS+L SCRVH+N +GR+ + L++L+ +DPA LQAS FYS+IGDF TS ++REI Sbjct: 430 IWSSMLRSCRVHRNETIGRKIVKWLVDLEAEDPAILLQASNFYSDIGDFETSKQLREI 487