BLASTX nr result
ID: Mentha29_contig00041209
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00041209 (309 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

gb|EYU45383.1| hypothetical protein MIMGU_mgv1a023657mg [Mimulus...     163   3e-38
ref|XP_007206426.1| hypothetical protein PRUPE_ppa001946mg [Prun...     130   2e-28
ref|XP_002525630.1| pentatricopeptide repeat-containing protein,...     129   4e-28
ref|XP_007016122.1| Tetratricopeptide repeat (TPR)-like superfam...     128   7e-28
ref|XP_004241167.1| PREDICTED: pentatricopeptide repeat-containi...     120   1e-25
ref|XP_004145320.1| PREDICTED: pentatricopeptide repeat-containi...     118   7e-25
ref|XP_002279360.1| PREDICTED: pentatricopeptide repeat-containi...     118   7e-25
gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis]       117   1e-24
emb|CAN72716.1| hypothetical protein VITISV_032470 [Vitis vinifera]     117   1e-24
ref|XP_006350917.1| PREDICTED: pentatricopeptide repeat-containi...     117   2e-24
ref|XP_002314110.1| pentatricopeptide repeat-containing family p...     117   2e-24
ref|XP_006480615.1| PREDICTED: pentatricopeptide repeat-containi...     116   3e-24
ref|XP_006428806.1| hypothetical protein CICLE_v10011151mg [Citr...     116   3e-24
ref|XP_002879234.1| predicted protein [Arabidopsis lyrata subsp....     116   3e-24
ref|XP_006293749.1| hypothetical protein CARUB_v10022711mg [Caps...     114   1e-23
ref|XP_002314675.1| pentatricopeptide repeat-containing family p...     114   1e-23
ref|NP_180537.1| RNA editing factor OTP81 [Arabidopsis thaliana]...     114   1e-23
ref|XP_004295750.1| PREDICTED: pentatricopeptide repeat-containi...     113   3e-23
ref|XP_006410036.1| hypothetical protein EUTSA_v10016305mg [Eutr...
111 9e-23 gb|EPS69071.1| hypothetical protein M569_05691, partial [Genlise... 111 9e-23 >gb|EYU45383.1| hypothetical protein MIMGU_mgv1a023657mg [Mimulus guttatus] Length = 701 Score = 163 bits (412), Expect = 3e-38 Identities = 74/103 (71%), Positives = 89/103 (86%) Frame = -1 Query: 309 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNG 130 GIHGM IKEGFSSDL+V NCLI+FY+ECGCLDMA R FS+M ERDV+SWN+M++GLA NG Sbjct: 144 GIHGMVIKEGFSSDLFVSNCLIYFYSECGCLDMARRVFSSMSERDVVSWNTMVNGLAQNG 203 Query: 129 CLDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 +DEA+E F ME EG++PNDVTMVGVL+AC KK D+KFG+ V Sbjct: 204 YVDEAVECFHRMEEEGLKPNDVTMVGVLSACGKKSDVKFGRWV 246 Score = 64.3 bits (155), Expect = 2e-08 Identities = 33/90 (36%), Positives = 52/90 (57%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH KEG + ++ L+ Y++CG L A F+++ +RDV W++MI GL +G Sbjct: 350 IHVYMKKEGMRLNRHLVTALVDMYSKCGDLHKALEIFNSVDKRDVFVWSAMIAGLGMHGR 409 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +AI++F M+ VRP+ VT +L AC Sbjct: 410 GGDAIKLFLKMQEAKVRPSSVTFTNLLAAC 439 >ref|XP_007206426.1| hypothetical protein PRUPE_ppa001946mg [Prunus persica] gi|462402068|gb|EMJ07625.1| hypothetical protein PRUPE_ppa001946mg [Prunus persica] Length = 738 Score = 130 bits (327), Expect = 2e-28 Identities = 60/103 (58%), Positives = 74/103 (71%) Frame = -1 Query: 309 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNG 130 G HGMAIK SD+Y+ N L+HFY CG LD+A R F P++DV+SWNSMI A Sbjct: 152 GFHGMAIKASLGSDIYILNSLVHFYGSCGDLDLARRVFMKTPKKDVVSWNSMITVFAQGN 211 Query: 129 CLDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 C EA+E+F MEAE V+PNDVTMV VL+AC KK+DL+FG+ V Sbjct: 212 CPQEALELFKEMEAENVKPNDVTMVSVLSACAKKVDLEFGRWV 254 Score = 66.2 bits (160), Expect = 4e-09 Identities = 36/90 (40%), Positives = 50/90 (55%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+ + ++ LI YA+CG LD A F+++ RDV W++MI GLA +G Sbjct: 387 IHVYIKKQVMKLNCHLTTSLIDMYAKCGDLDKALEVFNSVERRDVFVWSAMIAGLAMHGQ 
446 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +A+E F M V+PN VT VL AC Sbjct: 447 GRDALEFFSKMLEAKVKPNAVTFTNVLCAC 476 Score = 61.2 bits (147), Expect = 1e-07 Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 32/110 (29%) Frame = -1 Query: 270 DLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCLDEAIEIFGGME 91 +L + N ++ Y +CG +D A R F MPE+D++SW +M+DG A G +EA +F M Sbjct: 266 NLTLNNAMLDMYVKCGSVDDAKRLFDRMPEKDIVSWTTMLDGYAQLGNYEEAWRVFAAMP 325 Query: 90 AEGV--------------------------------RPNDVTMVGVLTAC 37 ++ + +P++VT+V L AC Sbjct: 326 SQDIAAWNVLISSYEQSGKPKEALAVFNELQKSKSPKPDEVTLVSTLAAC 375 >ref|XP_002525630.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223535066|gb|EEF36748.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 765 Score = 129 bits (324), Expect = 4e-28 Identities = 63/102 (61%), Positives = 75/102 (73%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IHGMAIK SDL++ N LIH YA CG LD A F + E+DV+SWNSMI G GC Sbjct: 154 IHGMAIKASLGSDLFILNSLIHCYASCGDLDSAYSVFVKIEEKDVVSWNSMIKGFVLGGC 213 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 D+A+E+F M+AE VRPNDVTMVGVL+AC KKMDL+FG+ V Sbjct: 214 PDKALELFQLMKAENVRPNDVTMVGVLSACAKKMDLEFGRRV 255 Score = 60.5 bits (145), Expect = 2e-07 Identities = 33/90 (36%), Positives = 50/90 (55%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+ + ++ LI Y++CG ++ A F ++ RDV W++MI GLA +G Sbjct: 388 IHVYIKKQDIKLNCHLTTSLIDMYSKCGEVEKALDIFYSVDRRDVFVWSAMIAGLAMHGR 447 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 AI++F M+ VRPN VT +L AC Sbjct: 448 GRAAIDLFFEMQETKVRPNAVTFTNLLCAC 477 >ref|XP_007016122.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] gi|508786485|gb|EOY33741.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] Length = 733 Score = 128 bits (322), Expect = 7e-28 Identities = 57/102 (55%), Positives = 76/102 (74%) Frame = -1 Query: 
306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 +HGM IK +D+++ N LIH Y CG LD A R F + E+DV+SWNS+I GLA GC Sbjct: 148 LHGMVIKASLGADVFISNSLIHLYLSCGDLDSAYRVFMMIGEKDVVSWNSLITGLAQKGC 207 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 ++A+E+F M+AE V+PNDVTMVGVL+AC KK+DL+FG+ V Sbjct: 208 AEKALELFRRMDAESVKPNDVTMVGVLSACTKKLDLEFGRWV 249 Score = 68.9 bits (167), Expect = 7e-10 Identities = 36/101 (35%), Positives = 56/101 (55%) Frame = -1 Query: 309 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNG 130 GIH ++G + ++ LI Y++CG ++ A F ++ RDV W++MI GLA +G Sbjct: 381 GIHAYVKEQGIQLNCHLTTSLIDMYSKCGDVNKALEVFYSVERRDVFVWSAMIAGLAMHG 440 Query: 129 CLDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGK 7 AI++F M+ ++PN VT VL AC +K GK Sbjct: 441 HGRAAIDLFSRMQEATMKPNSVTFTNVLCACSHAGLVKEGK 481 >ref|XP_004241167.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Solanum lycopersicum] Length = 744 Score = 120 bits (302), Expect = 1e-25 Identities = 59/104 (56%), Positives = 73/104 (70%), Gaps = 1/104 (0%) Frame = -1 Query: 309 GIHGMAIK-EGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHN 133 G+HGM +K D++V N LIHFYA+CGCLD A F M RDV+SWN+MI G A Sbjct: 157 GLHGMVVKGRDVGLDIFVLNSLIHFYADCGCLDEAYLIFENMQTRDVVSWNTMILGFAEG 216 Query: 132 GCLDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 G DEA++IF M E VRPNDVTM+ VL+AC KK+DL+FG+ V Sbjct: 217 GYADEALKIFHRMGEENVRPNDVTMMAVLSACAKKLDLEFGRWV 260 Score = 68.6 bits (166), Expect = 9e-10 Identities = 35/90 (38%), Positives = 53/90 (58%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+G + ++ LI Y++CG ++ A F ++ RDV W++MI GLA +G Sbjct: 393 IHVYIKKQGIKFNCHLTTALIDMYSKCGDVEKALEMFDSVNIRDVFVWSAMIAGLAMHGR 452 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 EAI +F M+ V+PN VT++ VL AC Sbjct: 453 GKEAISLFLKMQEHKVKPNSVTLINVLCAC 482 >ref|XP_004145320.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Cucumis sativus] gi|449470513|ref|XP_004152961.1| 
PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Cucumis sativus] gi|449523079|ref|XP_004168552.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Cucumis sativus] Length = 733 Score = 118 bits (296), Expect = 7e-25 Identities = 56/102 (54%), Positives = 70/102 (68%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 +HGMAIK F DLY+ N L+ FY CG L MA R F + +DV+SWNSMI A C Sbjct: 148 VHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGISCKDVVSWNSMISAFAQGNC 207 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 ++A+E+F ME E V PN VTMVGVL+AC KK+DL+FG+ V Sbjct: 208 PEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWV 249 Score = 67.4 bits (163), Expect = 2e-09 Identities = 35/90 (38%), Positives = 52/90 (57%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH +EG + ++ + L+ YA+CG L+ A F ++ ERDV W++MI GL +G Sbjct: 382 IHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGR 441 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 AI++F M+ V+PN VT VL AC Sbjct: 442 GKAAIDLFFEMQEAKVKPNSVTFTNVLCAC 471 Score = 60.1 bits (144), Expect = 3e-07 Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 32/116 (27%) Frame = -1 Query: 288 KEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCLDEAIE 109 ++G DL + N ++ Y +CG +D A + F MPERDV SW M+DG A G D A Sbjct: 255 RKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARL 314 Query: 108 IFGGMEAEGV--------------------------------RPNDVTMVGVLTAC 37 +F M + + +P++VT+V L+AC Sbjct: 315 VFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSAC 370 >ref|XP_002279360.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Vitis vinifera] Length = 743 Score = 118 bits (296), Expect = 7e-25 Identities = 54/101 (53%), Positives = 70/101 (69%) Frame = -1 Query: 303 HGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCL 124 HGM IK SD+++ N LIHFYA+CG L + R F +P RDV+SWNSMI GC Sbjct: 159 
HGMVIKVLLGSDVFILNSLIHFYAKCGELGLGYRVFVNIPRRDVVSWNSMITAFVQGGCP 218 Query: 123 DEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 +EA+E+F ME + V+PN +TMVGVL+AC KK D +FG+ V Sbjct: 219 EEALELFQEMETQNVKPNGITMVGVLSACAKKSDFEFGRWV 259 Score = 65.9 bits (159), Expect = 6e-09 Identities = 33/90 (36%), Positives = 50/90 (55%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+G + ++ LI Y +CG L A F ++ +DV W++MI GLA +G Sbjct: 392 IHVYIKKQGMKLNCHLTTSLIDMYCKCGDLQKALMVFHSVERKDVFVWSAMIAGLAMHGH 451 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +AI +F M+ + V+PN VT +L AC Sbjct: 452 GKDAIALFSKMQEDKVKPNAVTFTNILCAC 481 >gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis] Length = 739 Score = 117 bits (294), Expect = 1e-24 Identities = 51/101 (50%), Positives = 68/101 (67%) Frame = -1 Query: 309 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNG 130 G HGM +K +SD+++ N L+HFY C LD A R F +P +DV+SWNSMI Sbjct: 153 GFHGMVMKSSLASDVFILNSLVHFYGSCDDLDSAYRVFLNIPSKDVVSWNSMIKAFVEGD 212 Query: 129 CLDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGK 7 C DEA ++F ME E ++PND+TMVGVL AC KK D++FG+ Sbjct: 213 CPDEAFQLFREMEMENLKPNDITMVGVLCACGKKADIEFGR 253 Score = 67.0 bits (162), Expect = 3e-09 Identities = 34/90 (37%), Positives = 51/90 (56%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH ++G + ++ LI YA+CG L+ A F ++ +DV W++MI GLA +GC Sbjct: 388 IHIYIKRQGIKLNCHLTTSLIDMYAKCGDLEKALEVFDSVERKDVYVWSAMIAGLAMHGC 447 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 AI++F M V+PN VT +L AC Sbjct: 448 GRAAIDLFYEMLKAKVKPNAVTFTNILCAC 477 Score = 62.4 bits (150), Expect = 6e-08 Identities = 33/128 (25%), Positives = 58/128 (45%), Gaps = 32/128 (25%) Frame = -1 Query: 288 KEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCLDEAIE 109 + G + +L + N ++ Y +CG ++ A F MPERDV+SW +M+DG G DEA+ Sbjct: 261 RNGIAVNLTLNNAMLDMYVKCGSVEDAKELFDKMPERDVVSWTTMLDGYTRMGKYDEALR 320 Query: 108 IFGGME--------------------------------AEGVRPNDVTMVGVLTACRKKM 25 +F M ++ 
+P++VT+V L+AC + Sbjct: 321 VFEAMPNQDIAAWNVLISSYEQNGMPKEALSVFHKLQVSKSAKPDEVTLVSSLSACSQLG 380 Query: 24 DLKFGKLV 1 + G+ + Sbjct: 381 SIDPGRWI 388 >emb|CAN72716.1| hypothetical protein VITISV_032470 [Vitis vinifera] Length = 694 Score = 117 bits (294), Expect = 1e-24 Identities = 54/101 (53%), Positives = 69/101 (68%) Frame = -1 Query: 303 HGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCL 124 HGM IK SD+++ N LIHFYA+CG L + R F P RDV+SWNSMI GC Sbjct: 159 HGMVIKVLLGSDVFILNSLIHFYAKCGELGLGYRVFVNXPRRDVVSWNSMITAFVQGGCP 218 Query: 123 DEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 +EA+E+F ME + V+PN +TMVGVL+AC KK D +FG+ V Sbjct: 219 EEALELFQEMETQNVKPNGITMVGVLSACAKKSDFEFGRWV 259 Score = 66.2 bits (160), Expect = 4e-09 Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 12/102 (11%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMI-------- 151 +H + L + N ++ Y +CG ++ A R F MPE+D++SW +M+ Sbjct: 259 VHSYIERNRIXESLTLSNAMLDMYTKCGSVEDAKRLFDKMPEKDIVSWTTMLVGYAKIGE 318 Query: 150 ----DGLAHNGCLDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 GLA +G +AI +F M+ + V+PN VT +L AC Sbjct: 319 YDAAQGLAMHGHGKDAIALFSKMQEDKVKPNAVTFTNILCAC 360 >ref|XP_006350917.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Solanum tuberosum] Length = 744 Score = 117 bits (292), Expect = 2e-24 Identities = 57/104 (54%), Positives = 72/104 (69%), Gaps = 1/104 (0%) Frame = -1 Query: 309 GIHGMAIK-EGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHN 133 G+HGM +K D++V N LIHFYA+CGCLD A F M RDV+SWN+MI G A Sbjct: 157 GLHGMVVKGRDVGLDIFVLNSLIHFYADCGCLDEAYLVFENMQTRDVVSWNTMILGFAEG 216 Query: 132 GCLDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 G DEA+++F M E VRPN VTM+ VL+AC KK+DL+FG+ V Sbjct: 217 GYADEALKMFHRMGEENVRPNGVTMMAVLSACGKKLDLEFGRWV 260 Score = 68.2 bits (165), Expect = 1e-09 Identities = 34/90 (37%), Positives = 53/90 (58%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+G + ++ LI Y++CG ++ A F ++ RDV W++M+ GLA +G 
Sbjct: 393 IHVYIKKQGIKLNCHLTTALIDMYSKCGDVEKALEMFDSVNIRDVFVWSAMVAGLAMHGR 452 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 EAI +F M+ V+PN VT++ VL AC Sbjct: 453 GKEAISLFLKMQEHKVKPNSVTLINVLCAC 482 >ref|XP_002314110.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222850518|gb|EEE88065.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 738 Score = 117 bits (292), Expect = 2e-24 Identities = 53/100 (53%), Positives = 71/100 (71%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IHGM +K F SDL++ N LIHFY+ G LD A FS + E+D++SWNSMI G G Sbjct: 153 IHGMVMKASFGSDLFISNSLIHFYSSLGDLDSAYLVFSKIVEKDIVSWNSMISGFVQGGS 212 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGK 7 +EA+++F M+ E RPN VTMVGVL+AC K++DL+FG+ Sbjct: 213 PEEALQLFKRMKMENARPNRVTMVGVLSACAKRIDLEFGR 252 Score = 67.0 bits (162), Expect = 3e-09 Identities = 34/90 (37%), Positives = 51/90 (56%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+G + ++ LI Y++CG L+ A F ++ RDV W++MI GLA +G Sbjct: 387 IHVYIKKQGIKLNFHITTSLIDMYSKCGHLEKALEVFYSVERRDVFVWSAMIAGLAMHGH 446 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 AI++F M+ V+PN VT +L AC Sbjct: 447 GRAAIDLFSKMQETKVKPNAVTFTNLLCAC 476 Score = 59.3 bits (142), Expect = 5e-07 Identities = 36/124 (29%), Positives = 53/124 (42%), Gaps = 34/124 (27%) Frame = -1 Query: 288 KEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCLDEAIE 109 + G +L + N ++ Y +CG L+ A R F M E+D++SW +MIDG A G D A Sbjct: 260 RNGIDINLILSNAMLDMYVKCGSLEDARRLFDKMEEKDIVSWTTMIDGYAKVGDYDAARR 319 Query: 108 IFGGMEAEGV--------------------------------RPNDVTMVGVLTACRK-- 31 +F M E + +PN+VT+ L AC + Sbjct: 320 VFDVMPREDITAWNALISSYQQNGKPKEALAIFRELQLNKNTKPNEVTLASTLAACAQLG 379 Query: 30 KMDL 19 MDL Sbjct: 380 AMDL 383 >ref|XP_006480615.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Citrus sinensis] Length = 746 Score = 116 bits (291), Expect = 3e-24 
Identities = 55/102 (53%), Positives = 71/102 (69%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IHGM IK F DL++ N LIHFYA CG L MA F + ++DV+SWNSMI G G Sbjct: 162 IHGMVIKSSFEDDLFISNSLIHFYAICGDLAMAYCVFVMIGKKDVVSWNSMISGFVQGGF 221 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 ++AIE++ ME E V+P++VTMV VL+AC KK DL+FG+ V Sbjct: 222 FEKAIELYREMEMENVKPDEVTMVAVLSACAKKRDLEFGRWV 263 Score = 70.9 bits (172), Expect = 2e-10 Identities = 36/90 (40%), Positives = 49/90 (54%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+G + Y+ LI Y +CG LD A F T+ RDV W++MI G A G Sbjct: 395 IHAKMKKQGIKLNCYLTTSLIDMYTKCGNLDKALEVFHTVKSRDVFVWSTMIAGFAMYGR 454 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +A+++F M+ V+PN VT VL AC Sbjct: 455 GRDALDLFSRMQEAKVKPNAVTFTNVLCAC 484 Score = 61.6 bits (148), Expect = 1e-07 Identities = 35/115 (30%), Positives = 51/115 (44%), Gaps = 31/115 (26%) Frame = -1 Query: 288 KEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLA---------- 139 K G DL + N ++ Y +CG L+ A F M E+D++SW +MIDG A Sbjct: 269 KNGIKMDLTLSNAMLDMYVKCGSLEDAKSLFDKMEEKDIVSWTTMIDGYAKLGEFDAAMS 328 Query: 138 ---------------------HNGCLDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 NG +EA+ IF ++ V P++ T V VL+AC Sbjct: 329 VLAAVPIQQIATWNALISAYEQNGKPNEALSIFHEQLSKNVNPDEFTFVSVLSAC 383 >ref|XP_006428806.1| hypothetical protein CICLE_v10011151mg [Citrus clementina] gi|557530863|gb|ESR42046.1| hypothetical protein CICLE_v10011151mg [Citrus clementina] Length = 737 Score = 116 bits (291), Expect = 3e-24 Identities = 55/102 (53%), Positives = 71/102 (69%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IHGM IK F DL++ N LIHFYA CG L MA F + ++DV+SWNSMI G G Sbjct: 153 IHGMVIKSSFEDDLFISNSLIHFYAICGDLAMAYCVFVMIGKKDVVSWNSMISGFVQGGF 212 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 ++AIE++ ME E V+P++VTMV VL+AC KK DL+FG+ V Sbjct: 213 FEKAIELYREMEMENVKPDEVTMVAVLSACAKKRDLEFGRWV 254 Score = 70.9 bits (172), 
Expect = 2e-10 Identities = 36/90 (40%), Positives = 49/90 (54%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+G + Y+ LI Y +CG LD A F T+ RDV W++MI G A G Sbjct: 386 IHAKMKKQGIKLNCYLTTSLIDMYTKCGNLDKALEVFHTVKSRDVFVWSTMIAGFAMYGR 445 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +A+++F M+ V+PN VT VL AC Sbjct: 446 GRDALDLFSRMQEAKVKPNAVTFTNVLCAC 475 Score = 61.6 bits (148), Expect = 1e-07 Identities = 35/115 (30%), Positives = 51/115 (44%), Gaps = 31/115 (26%) Frame = -1 Query: 288 KEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLA---------- 139 K G DL + N ++ Y +CG L+ A F M E+D++SW +MIDG A Sbjct: 260 KNGIKMDLTLSNAMLDMYVKCGSLEDAKSLFDKMEEKDIVSWTTMIDGYAKLGEFDAAMS 319 Query: 138 ---------------------HNGCLDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 NG +EA+ IF ++ V P++ T V VL+AC Sbjct: 320 VLAAVPIQQIATWNALISAYEQNGKPNEALSIFHEQLSKNVNPDEFTFVSVLSAC 374 >ref|XP_002879234.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297325073|gb|EFH55493.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 740 Score = 116 bits (291), Expect = 3e-24 Identities = 55/102 (53%), Positives = 73/102 (71%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 +HGMAIK SD++V N LIH Y CG LD A + F+T+ E+DV+SWNSMI+G G Sbjct: 155 LHGMAIKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGS 214 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 D+A+E+F ME+E V+ + VTMVGVL+AC K DL+FG+ V Sbjct: 215 PDKALELFKKMESEDVKASHVTMVGVLSACAKIRDLEFGRRV 256 Score = 83.2 bits (204), Expect = 3e-14 Identities = 38/90 (42%), Positives = 55/90 (61%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K G + YV + LIH Y++CG L+ A F+++ +RDV W++MI GLA +GC Sbjct: 389 IHSYIKKNGIKMNFYVTSALIHMYSKCGDLEKAREVFNSVEKRDVFVWSAMIGGLAMHGC 448 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 EA+++F M+ V+PN VT V AC Sbjct: 449 GSEAVDMFYKMQEANVKPNGVTFTNVFCAC 478 >ref|XP_006293749.1| hypothetical protein CARUB_v10022711mg [Capsella rubella] gi|482562457|gb|EOA26647.1| hypothetical protein CARUB_v10022711mg [Capsella rubella] Length = 739 Score = 114 bits (285), Expect = 1e-23 Identities = 54/102 (52%), Positives = 72/102 (70%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 +HGMAIK DL+V N LIH Y CG LD A + F+T+ E+DV+SWNSMI+G G Sbjct: 154 LHGMAIKSAVGCDLFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGS 213 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 D+A+E+F ME+E V+ + VTMVGVL+AC K +L+FG+ V Sbjct: 214 PDKALELFKKMESEDVKASHVTMVGVLSACTKLRNLEFGRQV 255 Score = 83.2 bits (204), Expect = 3e-14 Identities = 37/90 (41%), Positives = 56/90 (62%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K G + Y+ + LIH Y++CG L+ A F+ + +RDV W++MI GLA +GC Sbjct: 388 IHSYIKKHGIRMNFYITSALIHMYSKCGDLEKAREVFNCVEKRDVFVWSAMIGGLAMHGC 447 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +EA+++F M+ E V+PN VT + AC Sbjct: 448 GNEAVDMFYKMQEENVKPNGVTFTNLFCAC 477 >ref|XP_002314675.1| 
pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222863715|gb|EEF00846.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 845 Score = 114 bits (285), Expect = 1e-23 Identities = 52/102 (50%), Positives = 67/102 (65%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 +HG +K GF D++V N LIHFY ECG +D R F M ER+V+SW S+I G A GC Sbjct: 161 VHGAIVKMGFERDMFVENSLIHFYGECGEIDCMRRVFDKMSERNVVSWTSLIGGYAKRGC 220 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 EA+ +F M G+RPN VTMVGV++AC K DL+ G+ V Sbjct: 221 YKEAVSLFFEMVEVGIRPNSVTMVGVISACAKLQDLQLGEQV 262 Score = 67.8 bits (164), Expect = 2e-09 Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 31/120 (25%) Frame = -1 Query: 303 HGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF------------------------ 196 HG ++ G V N +I+ Y +CG +MA R F Sbjct: 364 HGYVLRNGLEGWDNVCNAIINMYMKCGKQEMACRVFDRMLNKTRVSWNSLIAGFVRNGDM 423 Query: 195 -------STMPERDVISWNSMIDGLAHNGCLDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 S MP+ D++SWN+MI L EAIE+F M++EG+ + VTMVGV +AC Sbjct: 424 ESAWKIFSAMPDSDLVSWNTMIGALVQESMFKEAIELFRVMQSEGITADKVTMVGVASAC 483 Score = 64.7 bits (156), Expect = 1e-08 Identities = 32/89 (35%), Positives = 50/89 (56%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IHG K+ D+++G L+ +A CG A + F+ M +RDV +W + I +A G Sbjct: 495 IHGYIKKKDIHFDMHLGTALVDMFARCGDPQSAMQVFNKMVKRDVSAWTAAIGAMAMEGN 554 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTA 40 AIE+F M +G++P+ V V +LTA Sbjct: 555 GTGAIELFDEMLQQGIKPDGVVFVALLTA 583 Score = 56.6 bits (135), Expect = 3e-06 Identities = 23/83 (27%), Positives = 45/83 (54%) Frame = -1 Query: 255 NCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCLDEAIEIFGGMEAEGVR 76 N L+ Y +CG +D A + F +++++ +N+++ G E + + G M G R Sbjct: 279 NALVDMYMKCGAIDKARKIFDECVDKNLVLYNTIMSNYVRQGLAREVLAVLGEMLKHGPR 338 Query: 75 PNDVTMVGVLTACRKKMDLKFGK 7 P+ +TM+ ++AC + D+ GK Sbjct: 339 PDRITMLSAVSACSELDDVSCGK 361 >ref|NP_180537.1| RNA editing factor 
OTP81 [Arabidopsis thaliana] gi|75100656|sp|O82380.1|PP175_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g29760, chloroplastic; Flags: Precursor gi|3582328|gb|AAC35225.1| hypothetical protein [Arabidopsis thaliana] gi|330253207|gb|AEC08301.1| RNA editing factor OTP81 [Arabidopsis thaliana] Length = 738 Score = 114 bits (285), Expect = 1e-23 Identities = 53/102 (51%), Positives = 73/102 (71%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 +HGMA+K SD++V N LIH Y CG LD A + F+T+ E+DV+SWNSMI+G G Sbjct: 153 LHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGS 212 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 D+A+E+F ME+E V+ + VTMVGVL+AC K +L+FG+ V Sbjct: 213 PDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQV 254 Score = 80.1 bits (196), Expect = 3e-13 Identities = 36/90 (40%), Positives = 56/90 (62%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K G + +V + LIH Y++CG L+ + F+++ +RDV W++MI GLA +GC Sbjct: 387 IHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGC 446 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +EA+++F M+ V+PN VT V AC Sbjct: 447 GNEAVDMFYKMQEANVKPNGVTFTNVFCAC 476 >ref|XP_004295750.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Fragaria vesca subsp. 
vesca] Length = 1049 Score = 113 bits (282), Expect = 3e-23 Identities = 52/103 (50%), Positives = 69/103 (66%) Frame = -1 Query: 309 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNG 130 G HGM +K SD+Y+ N LIHFY CG LD+A F ++DV+SWNS+I A Sbjct: 208 GFHGMVVKAELGSDVYIVNSLIHFYGSCGELDLARLVFLKSYKKDVVSWNSVITAFAQGN 267 Query: 129 CLDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 C + A+E+F MEAE ++PNDVT+V VL+AC K DL+FG+ V Sbjct: 268 CPEVALELFKEMEAENMKPNDVTLVSVLSACAKMADLEFGRWV 310 Score = 68.6 bits (166), Expect = 9e-10 Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 32/116 (27%) Frame = -1 Query: 288 KEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGCLDEAIE 109 + G +L + N ++ YA+CG ++ A R F MPE+DV+SW +M+DG A G DEA Sbjct: 316 RHGVEENLTLNNAMLDMYAKCGSVEDAERLFGRMPEKDVVSWTTMLDGYARMGNYDEARR 375 Query: 108 IFGGMEAE--------------------------------GVRPNDVTMVGVLTAC 37 +FG M ++ G +P++VT+V L AC Sbjct: 376 VFGTMPSQDIATWNVLISSYEQNGKPKEALAVFHELQKNKGPKPDEVTLVSTLAAC 431 Score = 62.8 bits (151), Expect = 5e-08 Identities = 33/90 (36%), Positives = 48/90 (53%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K+G + ++ LI YA+CG L+ A F++ RDV W++MI LA +G Sbjct: 443 IHVYVKKQGMKLNCHLTTSLIDMYAKCGNLEKALEVFNSAETRDVFVWSAMIAALAMHGQ 502 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 +A+ F M V+PN VT +L AC Sbjct: 503 GRDALHFFSKMLEAKVKPNAVTFTNILCAC 532 >ref|XP_006410036.1| hypothetical protein EUTSA_v10016305mg [Eutrema salsugineum] gi|557111205|gb|ESQ51489.1| hypothetical protein EUTSA_v10016305mg [Eutrema salsugineum] Length = 739 Score = 111 bits (278), Expect = 9e-23 Identities = 52/102 (50%), Positives = 71/102 (69%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 +HGMA+K D++V N LIH Y CG LD A + F+T+ E+DV+SWNSMI G G Sbjct: 154 LHGMAVKSSVGCDVFVANSLIHCYFSCGDLDSACKVFTTIQEKDVVSWNSMITGFVQKGS 213 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTACRKKMDLKFGKLV 1 D+A+E+F ME+E V+ + VTMVGVL+AC K +L+FG+ V Sbjct: 214 
PDKALELFKKMESEEVKASHVTMVGVLSACAKLRNLEFGRQV 255 Score = 83.2 bits (204), Expect = 3e-14 Identities = 39/90 (43%), Positives = 56/90 (62%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMPERDVISWNSMIDGLAHNGC 127 IH K G S+ YV + LIH Y++CG L A F+T+ +RDV W++MI GLA +GC Sbjct: 388 IHSYIKKHGIRSNFYVTSALIHMYSKCGDLVKAREVFNTVEKRDVFVWSAMIGGLAMHGC 447 Query: 126 LDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 ++A+++F M+ V+PN VT V AC Sbjct: 448 GNDALDMFYKMQEANVKPNGVTFTNVFCAC 477 >gb|EPS69071.1| hypothetical protein M569_05691, partial [Genlisea aurea] Length = 726 Score = 111 bits (278), Expect = 9e-23 Identities = 59/107 (55%), Positives = 71/107 (66%), Gaps = 6/107 (5%) Frame = -1 Query: 309 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAFSTMP--ERDVISWNSMIDGLAH 136 GIHGMA+K +SD++V N LI FY+EC CL A R F TMP RDV+SWNSMI+GL Sbjct: 132 GIHGMAVKGNHASDVFVSNSLIRFYSECRCLVAAYRIFETMPRTRRDVVSWNSMINGLVQ 191 Query: 135 NGCLDEAIEIFGGM----EAEGVRPNDVTMVGVLTACRKKMDLKFGK 7 N D+A+E+F M E EGV PN VTM+ VL C K DL+ GK Sbjct: 192 NKWHDDAMELFHRMVAEEEEEGVEPNGVTMLSVLGICGTKSDLELGK 238 Score = 66.6 bits (161), Expect = 3e-09 Identities = 34/91 (37%), Positives = 54/91 (59%), Gaps = 1/91 (1%) Frame = -1 Query: 306 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF-STMPERDVISWNSMIDGLAHNG 130 IH K G S + ++ LI Y++CG L+ A++ F S+ ERDV W++MI +G Sbjct: 374 IHNYVKKRGMSLNCHLVTSLIDMYSKCGDLEKAAQVFRSSSHERDVFVWSAMIAAYGMHG 433 Query: 129 CLDEAIEIFGGMEAEGVRPNDVTMVGVLTAC 37 C +A+E+F M+ V+P+ VT +L+AC Sbjct: 434 CGHDAVELFKKMQEAKVKPSFVTFTNLLSAC 464
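
The per-alignment statistics in the report above (Score, Expect, Identities, Positives, optional Gaps, Frame) follow a fixed textual pattern, so they can be pulled out even when the report's line breaks have been collapsed. The following is a minimal sketch, not part of the BLAST distribution: the names `HSP_STATS`, `Hsp`, and `parse_hsps` are hypothetical, and the pattern assumes the legacy BLASTX 2.2.x text layout shown in this report.

```python
import re
from typing import List, NamedTuple

# Pattern for one HSP statistics block as printed by legacy BLASTX text output.
# Whitespace is matched loosely so it works on flattened and unflattened text.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]?\d)"
)


class Hsp(NamedTuple):
    """Statistics of one high-scoring segment pair (hypothetical container)."""
    bits: float
    evalue: float
    identities: int
    align_len: int
    positives: int
    gaps: int
    frame: int


def parse_hsps(report: str) -> List[Hsp]:
    """Return the statistics of every HSP found in a BLASTX text report."""
    hsps = []
    for m in HSP_STATS.finditer(report):
        evalue = m.group("evalue")
        # Assumption: some legacy reports abbreviate 1e-100 as "e-100".
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hsps.append(Hsp(
            bits=float(m.group("bits")),
            evalue=float(evalue),
            identities=int(m.group("ident")),
            align_len=int(m.group("length")),
            positives=int(m.group("pos")),
            gaps=int(m.group("gaps") or 0),  # Gaps field is absent when zero
            frame=int(m.group("frame")),
        ))
    return hsps
```

Calling `parse_hsps` on the full report text yields one `Hsp` per alignment block, which can then be filtered, for example by keeping only entries whose `evalue` is below a chosen cutoff. The sketch does not associate each HSP with its subject header (the `>` lines); that would need an additional pass.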