BLASTX nr result
ID: Forsythia23_contig00033635
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia23_contig00033635 (810 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011084698.1| PREDICTED: pentatricopeptide repeat-containi... 244 5e-62 ref|XP_012834852.1| PREDICTED: pentatricopeptide repeat-containi... 239 2e-60 emb|CDP04141.1| unnamed protein product [Coffea canephora] 224 5e-56 ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containi... 205 3e-50 ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containi... 202 2e-49 ref|XP_009604735.1| PREDICTED: pentatricopeptide repeat-containi... 202 3e-49 ref|XP_009762033.1| PREDICTED: pentatricopeptide repeat-containi... 196 1e-47 gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlise... 166 2e-38 ref|XP_012480399.1| PREDICTED: pentatricopeptide repeat-containi... 157 8e-36 gb|EYU39754.1| hypothetical protein MIMGU_mgv1a001778mg [Erythra... 154 8e-35 ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containi... 153 1e-34 ref|XP_010456145.1| PREDICTED: pentatricopeptide repeat-containi... 152 2e-34 ref|XP_010456144.1| PREDICTED: pentatricopeptide repeat-containi... 152 2e-34 ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Popu... 152 2e-34 ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Caps... 152 3e-34 gb|AAC62783.1| F11O4.7 [Arabidopsis thaliana] 151 4e-34 ref|NP_192066.2| pentatricopeptide repeat-containing protein [Ar... 151 4e-34 ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutr... 151 4e-34 ref|XP_010422707.1| PREDICTED: pentatricopeptide repeat-containi... 150 1e-33 ref|XP_006386676.1| pentatricopeptide repeat-containing family p... 150 1e-33 >ref|XP_011084698.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Sesamum indicum] Length = 831 Score = 244 bits (623), Expect = 5e-62 Identities = 129/196 (65%), Positives = 148/196 (75%), Gaps = 7/196 (3%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRST----ASAGT---AASKLGNLLLVASIAKAL 409 MRHG+ G+ AL H S +FL P +R + AS G ASKLGNLL+VASIAKAL Sbjct: 1 MRHGQSGRRIALLHHSVAFLAPFCSRISFSTAVKASGGAEVGTASKLGNLLVVASIAKAL 60 Query: 408 SKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWCSLRPNYKHSADTYSQI 229 S+PGGI SLEK SIPLSE+LVLQ+LRR SLDA KKLDFF WCS+RPNYKH+A TYSQ+ Sbjct: 61 SRPGGIHSLEKYGDSIPLSEDLVLQVLRRGSLDASKKLDFFRWCSVRPNYKHTAGTYSQM 120 Query: 228 FRTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKD 49 F+TIC PH H DDI L+AS + DGVVLDSST KLILD I+SGKFDSALE+ Y+E+D Sbjct: 121 FKTICFLPHQHQDDILELVASTRRDGVVLDSSTLKLILDGLIRSGKFDSALEVLGYIERD 180 Query: 48 LTRTGCLSPDIYSSVL 1 L CLSPDIYS VL Sbjct: 181 LISISCLSPDIYSPVL 196 >ref|XP_012834852.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Erythranthe guttatus] Length = 833 Score = 239 bits (609), Expect = 2e-60 Identities = 127/200 (63%), Positives = 150/200 (75%), Gaps = 10/200 (5%) Frame = -1 Query: 570 IMRHGRGGKTTALFHSSASFLM-PIHTRHRSTASAGTA-------ASKLGNLLLVASIAK 415 IMRHGR GKT ALFH SASFL P+ + R T +A + AS+LGNLL+VA+IAK Sbjct: 12 IMRHGRRGKTMALFHHSASFLRRPLSPKSRFTTAAKSTNGAVSGTASELGNLLIVAAIAK 71 Query: 414 ALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWCSLRPNYKHSADTYS 235 LS PGGI SLEKD SIPLSE LVLQ+LRR SLDA +KLDFF WCSLRPN+KHSA TY Sbjct: 72 TLSNPGGIHSLEKDADSIPLSENLVLQVLRRGSLDAARKLDFFRWCSLRPNFKHSAGTYH 131 Query: 234 QIFRTICNYPHHHHDDIFHLLASMK--GDGVVLDSSTFKLILDAFIKSGKFDSALEIFDY 61 Q+F++IC P HHH DI L+ASM GD LDS T KLIL++FI+SGK+DSALE+ D Sbjct: 132 QMFKSICISPRHHHSDILELVASMASGGDAAALDSPTLKLILNSFIRSGKYDSALEVLDC 191 Query: 60 LEKDLTRTGCLSPDIYSSVL 1 +E+DL +T LSPDIYS V+ Sbjct: 192 VERDLIQTTSLSPDIYSPVI 211 >emb|CDP04141.1| unnamed protein product [Coffea canephora] Length = 820 Score = 224 bits (571), Expect = 5e-56 Identities = 116/195 (59%), Positives = 153/195 (78%), Gaps = 6/195 (3%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTA------ASKLGNLLLVASIAKALS 406 M HG+ G +FHSS +F++ +++R + T ASK+G+L++VASIAKALS Sbjct: 1 MHHGQSG-AGVMFHSSLAFVVIFGSKNRFITTLTTGLRAKPLASKVGSLIVVASIAKALS 59 Query: 405 KPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWCSLRPNYKHSADTYSQIF 226 +PGG ++L+K++ S+ LSE+LVLQ+LRRNSL A KKLDFF WCSLRPNYKHS TYSQ+F Sbjct: 60 EPGGTRNLDKNMASVNLSEDLVLQVLRRNSLAASKKLDFFHWCSLRPNYKHSVGTYSQMF 119 Query: 225 RTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDL 46 TIC+ P +HD+IF+LL S+K DG+VLDS+TFKLILDAFI+SG+FDSALEI D++EKDL Sbjct: 120 HTICHCP-QYHDEIFNLLTSLKRDGLVLDSTTFKLILDAFIRSGRFDSALEILDHVEKDL 178 Query: 45 TRTGCLSPDIYSSVL 1 T L+ D+YSS+L Sbjct: 179 CMTVSLNADLYSSIL 193 >ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like isoform X1 [Solanum tuberosum] Length = 816 Score = 205 bits (521), Expect = 3e-50 Identities = 102/158 (64%), Positives = 132/158 (83%) Frame = -1 Query: 474 SAGTAASKLGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKL 295 S+ AASK+GNLL+VASIAKAL KPGG ++LE+ SIPLSE LVLQ+LRRN+LDA KKL Sbjct: 29 SSTAAASKVGNLLVVASIAKALIKPGGTRNLEQYGDSIPLSESLVLQVLRRNNLDAEKKL 88 Query: 294 DFFCWCSLRPNYKHSADTYSQIFRTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFKLIL 115 DFF WCSLRP++KHS +TYSQ+F++IC Y H+H + IF LL SMK D V+L+++TFKL+L Sbjct: 89 DFFKWCSLRPSFKHSTETYSQMFKSIC-YSHNHREAIFVLLNSMKDDKVLLNAATFKLLL 147 Query: 114 DAFIKSGKFDSALEIFDYLEKDLTRTGCLSPDIYSSVL 1 D+F ++G FDSALEI +++E DL + CLSPD+Y+SVL Sbjct: 148 DSFTRTGNFDSALEILEFVEGDLDNSSCLSPDVYNSVL 185 >ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Solanum lycopersicum] Length = 819 Score = 202 bits (515), Expect = 2e-49 Identities = 101/161 (62%), Positives = 131/161 (81%) Frame = -1 Query: 483 STASAGTAASKLGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAG 304 ++A+ AASK+GNL++VASIAKAL K GG ++LEK IPLSE LVLQ+LRRN+LDA Sbjct: 29 TSAAKTAAASKVGNLIVVASIAKALIKRGGTRNLEKYGDLIPLSESLVLQVLRRNNLDAE 88 Query: 303 KKLDFFCWCSLRPNYKHSADTYSQIFRTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFK 124 KKLDFF WCSLRPN+KHS +TYSQ+F+ IC Y +H +D+F LL SMK D V+L+S+TFK Sbjct: 89 KKLDFFKWCSLRPNFKHSTETYSQMFKCIC-YSRNHREDVFVLLNSMKDDEVLLNSATFK 147 Query: 123 LILDAFIKSGKFDSALEIFDYLEKDLTRTGCLSPDIYSSVL 1 L+LD+F ++G FDSALEI +++E DL + CLSPD+Y+SVL Sbjct: 148 LLLDSFTRTGNFDSALEILEFVEGDLANSSCLSPDVYNSVL 188 >ref|XP_009604735.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Nicotiana tomentosiformis] gi|697191337|ref|XP_009604736.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Nicotiana tomentosiformis] Length = 816 Score = 202 bits (513), Expect = 3e-49 Identities = 105/163 (64%), Positives = 132/163 (80%), Gaps = 6/163 (3%) Frame = -1 Query: 471 AGT---AASKLGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGK 301 AGT AASK+GNLL+VASI KAL+ PGG ++LEK SIPLSE LVLQ+LRRN+LDA Sbjct: 26 AGTRSPAASKIGNLLVVASITKALTAPGGTRNLEKYNDSIPLSETLVLQILRRNNLDAAT 85 Query: 300 KLDFFCWCSLRPNYKHSADTYSQIFRTICNYP-HHHHDDIFHLLASMKGDGVVLDSSTFK 124 KLDFF WCSLRPN+KHS +TYSQ+FR+IC Y H+H +DIF LL SMK DGV L+S+TFK Sbjct: 86 KLDFFKWCSLRPNFKHSTETYSQMFRSICYYYFHNHREDIFVLLNSMKHDGVSLNSATFK 145 Query: 123 LILDAFIKSGKFDSALEIFDYLEKDL--TRTGCLSPDIYSSVL 1 L+LD+F ++G F+SALEI +++E +L + CLSPD+Y+SVL Sbjct: 146 LLLDSFTRAGNFNSALEILEFMESNLKNSNINCLSPDVYNSVL 188 >ref|XP_009762033.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Nicotiana sylvestris] gi|698441696|ref|XP_009762044.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Nicotiana sylvestris] gi|698441708|ref|XP_009762056.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Nicotiana sylvestris] gi|698441714|ref|XP_009762062.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Nicotiana sylvestris] gi|698441723|ref|XP_009762073.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Nicotiana sylvestris] Length = 816 Score = 196 bits (498), Expect = 1e-47 Identities = 104/173 (60%), Positives = 134/173 (77%), Gaps = 9/173 (5%) Frame = -1 Query: 492 RHRSTASA---GT---AASKLGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQL 331 RH + A A GT A+SK+ NLL+VASI KAL+ PGG ++LEK SI +SE LVLQ+ Sbjct: 16 RHFTVAGAKVAGTRSPASSKIENLLVVASITKALTAPGGTRNLEKYNDSIAVSENLVLQI 75 Query: 330 LRRNSLDAGKKLDFFCWCSLRPNYKHSADTYSQIFRTICNY-PHHHHDDIFHLLASMKGD 154 LRRN+LDA KLDFF WCSLRPN+KHS +TYSQ+FR+IC Y H+H +DIF LL SMK D Sbjct: 76 LRRNNLDAATKLDFFKWCSLRPNFKHSIETYSQMFRSICYYNSHNHREDIFVLLNSMKHD 135 Query: 153 GVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDL--TRTGCLSPDIYSSVL 1 GV L+S+TFKL+LD+F ++G F+SALE+ +++E DL + CLSPD+Y+SVL Sbjct: 136 GVSLNSATFKLLLDSFTRAGNFNSALELLEFMESDLENSNNNCLSPDVYNSVL 188 >gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlisea aurea] Length = 770 Score = 166 bits (419), Expect = 2e-38 Identities = 80/148 (54%), Positives = 108/148 (72%) Frame = -1 Query: 444 NLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWCSLRP 265 N+L+VASI K LSK G ++ LEK+ SIPLSE++VLQ++ SL KKL+FF WCS RP Sbjct: 1 NILVVASITKILSKFGALQYLEKNADSIPLSEDVVLQIVHHRSLVISKKLEFFRWCSSRP 60 Query: 264 NYKHSADTYSQIFRTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFD 85 +Y H+A+ YS++ R I +P+ HH+++ LLA MK DGV+LDS T K IL+ I++ KFD Sbjct: 61 DYNHTANAYSEMLRAIFRFPNQHHNNVIELLALMKRDGVILDSDTLKRILNGLIRAQKFD 120 Query: 84 SALEIFDYLEKDLTRTGCLSPDIYSSVL 1 AL++ DY+EKD G LSPD+YS VL Sbjct: 121 YALDVLDYIEKDSVIAGNLSPDVYSPVL 148 >ref|XP_012480399.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Gossypium raimondii] gi|763742056|gb|KJB09555.1| hypothetical protein B456_001G149600 [Gossypium raimondii] Length = 808 Score = 157 bits (397), Expect = 8e-36 Identities = 86/152 (56%), Positives = 112/152 (73%), Gaps = 1/152 (0%) Frame = -1 Query: 453 KLGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC- 277 +LGN+LL+AS+ K L + G + D +SIPLSE LVLQ+LR+NSL+ KKLDFF WC Sbjct: 22 QLGNILLIASLTKTLLESG---TRNLDPNSIPLSEPLVLQILRKNSLEPSKKLDFFNWCR 78 Query: 276 SLRPNYKHSADTYSQIFRTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKS 97 S +PN+KHSA TYS IFRT+C +++ +LL MK DGV++DSSTFKL+LDAFI+S Sbjct: 79 SFKPNFKHSAVTYSHIFRTLCR--SGFVEEVPNLLFLMKEDGVLVDSSTFKLLLDAFIRS 136 Query: 96 GKFDSALEIFDYLEKDLTRTGCLSPDIYSSVL 1 GKFD+ALEI DY+E+ CL+ +Y SVL Sbjct: 137 GKFDTALEILDYMEES---GACLNASVYDSVL 165 >gb|EYU39754.1| hypothetical protein MIMGU_mgv1a001778mg [Erythranthe guttata] Length = 760 Score = 154 bits (388), Expect = 8e-35 Identities = 97/189 (51%), Positives = 116/189 (61%), Gaps = 10/189 (5%) Frame = -1 Query: 537 ALFHSSASFLM-PIHTRHRSTASAGTA-------ASKLGNLLLVASIAKALSKPGGIKSL 382 ALFH SASFL P+ + R T +A + AS+LGNLL+VA+IAK LS PGGI SL Sbjct: 2 ALFHHSASFLRRPLSPKSRFTTAAKSTNGAVSGTASELGNLLIVAAIAKTLSNPGGIHSL 61 Query: 381 EKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWCSLRPNYKHSADTYSQIFRTICNYPH 202 EKD SIPLSE LVLQ+LRR SLDA +KLDFF Sbjct: 62 EKDADSIPLSENLVLQVLRRGSLDAARKLDFF---------------------------- 93 Query: 201 HHHDDIFHLLASMK--GDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGCL 28 DI L+ASM GD LDS T KLIL++FI+SGK+DSALE+ D +E+DL +T L Sbjct: 94 --RCDILELVASMASGGDAAALDSPTLKLILNSFIRSGKYDSALEVLDCVERDLIQTTSL 151 Query: 27 SPDIYSSVL 1 SPDIYS V+ Sbjct: 152 SPDIYSPVI 160 >ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Vitis vinifera] Length = 792 Score = 153 bits (387), Expect = 1e-34 Identities = 85/166 (51%), Positives = 114/166 (68%) Frame = -1 Query: 498 HTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRN 319 H R S+++A A KLG++LLVASI+K LS+ G + DL SIP+SE LV+Q+L RN Sbjct: 3 HGRTLSSSAAAGAGVKLGDMLLVASISKTLSERG---TRSPDLESIPISESLVVQILGRN 59 Query: 318 SLDAGKKLDFFCWCSLRPNYKHSADTYSQIFRTICNYPHHHHDDIFHLLASMKGDGVVLD 139 S+D +K++FF WCS R NYKHS YS IFR +C D + L++SMK DGVV+ Sbjct: 60 SIDVFRKVEFFRWCSFRHNYKHSVGAYSHIFRIVCRAGAEFLDQVPLLMSSMKDDGVVVG 119 Query: 138 SSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGCLSPDIYSSVL 1 TFKL+LD+ I++GKFDSALEI D++E+ TG L+ +Y SVL Sbjct: 120 QETFKLLLDSLIRAGKFDSALEILDHIEE--LGTG-LNSYVYDSVL 162 >ref|XP_010456145.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like isoform X2 [Camelina sativa] Length = 797 Score = 152 bits (384), Expect = 2e-34 Identities = 93/190 (48%), Positives = 123/190 (64%), Gaps = 1/190 (0%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIK 388 MRHGRG +A S L P + ++ +L N+LLVAS++K LS+ G + Sbjct: 1 MRHGRGSAVSA----GISGLSP---------AISSSLPQLCNVLLVASLSKTLSQ-SGTR 46 Query: 387 SLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC-SLRPNYKHSADTYSQIFRTICN 211 SL D +SIP+SE +VLQ+LRR+S+D KKLDFF WC +LRP YKHSA YSQIFRT+C Sbjct: 47 SL--DANSIPISEPVVLQILRRSSIDPSKKLDFFRWCFTLRPGYKHSASAYSQIFRTVCR 104 Query: 210 YPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGC 31 +++ LL SMK DGV LD K++LD+ I+SGKFDSAL + DY+E+ C Sbjct: 105 --RGLLEEVPDLLGSMKDDGVNLDQKMAKVLLDSLIRSGKFDSALGVLDYMEE---LGDC 159 Query: 30 LSPDIYSSVL 1 L+P +Y SVL Sbjct: 160 LNPSLYDSVL 169 >ref|XP_010456144.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like isoform X1 [Camelina sativa] Length = 797 Score = 152 bits (384), Expect = 2e-34 Identities = 93/190 (48%), Positives = 123/190 (64%), Gaps = 1/190 (0%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIK 388 MRHGRG +A S L P + ++ +L N+LLVAS++K LS+ G + Sbjct: 1 MRHGRGSAVSA----GISGLSP---------AISSSLPQLCNVLLVASLSKTLSQ-SGTR 46 Query: 387 SLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC-SLRPNYKHSADTYSQIFRTICN 211 SL D +SIP+SE +VLQ+LRR+S+D KKLDFF WC +LRP YKHSA YSQIFRT+C Sbjct: 47 SL--DANSIPISEPVVLQILRRSSIDPSKKLDFFRWCFTLRPGYKHSASAYSQIFRTVCR 104 Query: 210 YPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGC 31 +++ LL SMK DGV LD K++LD+ I+SGKFDSAL + DY+E+ C Sbjct: 105 --RGLLEEVPDLLGSMKDDGVNLDQKMAKVLLDSLIRSGKFDSALGVLDYMEE---LGDC 159 Query: 30 LSPDIYSSVL 1 L+P +Y SVL Sbjct: 160 LNPSLYDSVL 169 >ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Populus trichocarpa] gi|550345304|gb|EEE81962.2| hypothetical protein POPTR_0002s18390g [Populus trichocarpa] Length = 776 Score = 152 bits (384), Expect = 2e-34 Identities = 83/150 (55%), Positives = 111/150 (74%) Frame = -1 Query: 450 LGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWCSL 271 +GN+LLVA + K LS+ G +SL+ D SIPLSE LVLQ+LRRNSLD+ KK++FF WCS+ Sbjct: 1 MGNILLVAYLTKTLSE-SGTRSLDPD--SIPLSESLVLQILRRNSLDSSKKMEFFKWCSV 57 Query: 270 RPNYKHSADTYSQIFRTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGK 91 R YKHS TYSQ+F T+C + D++ LL SMK DGVV+ S TFKL+LDAFI+SGK Sbjct: 58 RHIYKHSVSTYSQMFSTLCR--SGYLDEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGK 115 Query: 90 FDSALEIFDYLEKDLTRTGCLSPDIYSSVL 1 FDSAL+I D++E+ + +P +Y S++ Sbjct: 116 FDSALDILDHMEELGSNP---NPHMYDSII 142 >ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Capsella rubella] gi|482558640|gb|EOA22832.1| hypothetical protein CARUB_v10003556mg [Capsella rubella] Length = 802 Score = 152 bits (383), Expect = 3e-34 Identities = 94/190 (49%), Positives = 124/190 (65%), Gaps = 1/190 (0%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIK 388 MRHGRG +A + S L P + + +L N+LLVAS++K LS+ G + Sbjct: 1 MRHGRGSAVSA----AISGLSP---------AKNSPFPQLCNVLLVASLSKTLSQ-SGTR 46 Query: 387 SLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC-SLRPNYKHSADTYSQIFRTICN 211 SL D +SIP+SE +VLQ+LRR+S+D+ KKLDFF WC SLRP YKHSA YSQIFRT+C Sbjct: 47 SL--DANSIPISESVVLQILRRSSIDSSKKLDFFRWCFSLRPGYKHSASAYSQIFRTVCR 104 Query: 210 YPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGC 31 ++ LL SMK DGV LD + K++LD+ I+SGKFDSAL + DY+E+ C Sbjct: 105 TGLI--GEVPDLLGSMKDDGVNLDQTMAKVLLDSLIRSGKFDSALGVLDYMEE---LGDC 159 Query: 30 LSPDIYSSVL 1 L+P +Y SVL Sbjct: 160 LNPGLYDSVL 169 >gb|AAC62783.1| F11O4.7 [Arabidopsis thaliana] Length = 508 Score = 151 bits (382), Expect = 4e-34 Identities = 94/190 (49%), Positives = 123/190 (64%), Gaps = 1/190 (0%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIK 388 MRHGRG +A + S L P + + +L N+LLVAS++K LS+ G + Sbjct: 1 MRHGRGSAVSA----AISGLSP---------AKNSPFPQLCNVLLVASLSKTLSQ-SGTR 46 Query: 387 SLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC-SLRPNYKHSADTYSQIFRTICN 211 SL D +SIP+SE +VLQ+LRRNS+D KKLDFF WC SLRP YKHSA YSQIFRT+C Sbjct: 47 SL--DANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLRPGYKHSATAYSQIFRTVCR 104 Query: 210 YPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGC 31 ++ LL SMK DGV LD + K++LD+ I+SGKF+SAL + DY+E+ C Sbjct: 105 --TGLLGEVPDLLGSMKEDGVNLDQTMAKILLDSLIRSGKFESALGVLDYMEE---LGDC 159 Query: 30 LSPDIYSSVL 1 L+P +Y SVL Sbjct: 160 LNPSVYDSVL 169 >ref|NP_192066.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75161629|sp|Q8VZE4.1|PP299_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g01570 gi|18086402|gb|AAL57659.1| AT4g01570/T15B16_21 [Arabidopsis thaliana] gi|24797024|gb|AAN64524.1| At4g01570/T15B16_21 [Arabidopsis thaliana] gi|332656643|gb|AEE82043.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 805 Score = 151 bits (382), Expect = 4e-34 Identities = 94/190 (49%), Positives = 123/190 (64%), Gaps = 1/190 (0%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIK 388 MRHGRG +A + S L P + + +L N+LLVAS++K LS+ G + Sbjct: 1 MRHGRGSAVSA----AISGLSP---------AKNSPFPQLCNVLLVASLSKTLSQ-SGTR 46 Query: 387 SLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC-SLRPNYKHSADTYSQIFRTICN 211 SL D +SIP+SE +VLQ+LRRNS+D KKLDFF WC SLRP YKHSA YSQIFRT+C Sbjct: 47 SL--DANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLRPGYKHSATAYSQIFRTVCR 104 Query: 210 YPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGC 31 ++ LL SMK DGV LD + K++LD+ I+SGKF+SAL + DY+E+ C Sbjct: 105 --TGLLGEVPDLLGSMKEDGVNLDQTMAKILLDSLIRSGKFESALGVLDYMEE---LGDC 159 Query: 30 LSPDIYSSVL 1 L+P +Y SVL Sbjct: 160 LNPSVYDSVL 169 >ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutrema salsugineum] gi|557097371|gb|ESQ37807.1| hypothetical protein EUTSA_v10028437mg [Eutrema salsugineum] Length = 801 Score = 151 bits (382), Expect = 4e-34 Identities = 92/190 (48%), Positives = 118/190 (62%), Gaps = 1/190 (0%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIK 388 MRHGR +A + +P +L N+L+VAS++K LS G Sbjct: 1 MRHGRASAVSAAIAGLSPAKIP-------------PFPQLCNVLVVASLSKTLSHSG--- 44 Query: 387 SLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC-SLRPNYKHSADTYSQIFRTICN 211 + D +S P+SE +VLQ+LRRNSLD KKLDFF WC SLRP YKHSA YSQIFRT+C Sbjct: 45 TRNLDANSTPISEPIVLQILRRNSLDPSKKLDFFRWCFSLRPGYKHSASAYSQIFRTVCR 104 Query: 210 YPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGC 31 +I +LL SMK DGV LD +T KL+LD+ I+SGK+DSAL + DY+E+ GC Sbjct: 105 TGLL--GEIPNLLGSMKEDGVNLDQTTSKLLLDSLIRSGKYDSALGVLDYMEE---LGGC 159 Query: 30 LSPDIYSSVL 1 L+P +Y SVL Sbjct: 160 LNPRLYDSVL 169 >ref|XP_010422707.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570 [Camelina sativa] Length = 797 Score = 150 bits (378), Expect = 1e-33 Identities = 90/190 (47%), Positives = 122/190 (64%), Gaps = 1/190 (0%) Frame = -1 Query: 567 MRHGRGGKTTALFHSSASFLMPIHTRHRSTASAGTAASKLGNLLLVASIAKALSKPGGIK 388 MRHGRG +A + S L P + ++ +L N+L+VAS++K LS+ G Sbjct: 1 MRHGRGTAVSA----AVSGLSP---------AINSSLPQLCNVLIVASLSKTLSQSG--- 44 Query: 387 SLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWC-SLRPNYKHSADTYSQIFRTICN 211 + D +SIP+SE +VLQ+LRR+S+D KKLDFF WC +LRP YKHSA YSQIFRT+C Sbjct: 45 TRRLDANSIPISEPVVLQILRRSSIDPSKKLDFFRWCFTLRPGYKHSASAYSQIFRTVCR 104 Query: 210 YPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGKFDSALEIFDYLEKDLTRTGC 31 ++ LL SMK DGV LD + K++LD+ I+SGKFDSAL + DY+E+ C Sbjct: 105 --RGLLGEVPDLLGSMKDDGVNLDQTMAKVLLDSLIRSGKFDSALGVLDYMEE---LGDC 159 Query: 30 LSPDIYSSVL 1 L+P +Y SVL Sbjct: 160 LNPSLYDSVL 169 >ref|XP_006386676.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550345301|gb|ERP64473.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 776 Score = 150 bits (378), Expect = 1e-33 Identities = 82/150 (54%), Positives = 111/150 (74%) Frame = -1 Query: 450 LGNLLLVASIAKALSKPGGIKSLEKDLHSIPLSEELVLQLLRRNSLDAGKKLDFFCWCSL 271 +GN+LLVA + K LS+ G +SL+ D SIPLSE LVLQ+LRRNSLD+ KK++FF WCS+ Sbjct: 1 MGNILLVAYLTKTLSE-SGTRSLDPD--SIPLSEYLVLQILRRNSLDSSKKMEFFKWCSV 57 Query: 270 RPNYKHSADTYSQIFRTICNYPHHHHDDIFHLLASMKGDGVVLDSSTFKLILDAFIKSGK 91 R YKHS TYSQ+F T+C + +++ LL SMK DGVV+ S TFKL+LDAFI+SGK Sbjct: 58 RHIYKHSVSTYSQMFSTLCR--SGYLEEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGK 115 Query: 90 FDSALEIFDYLEKDLTRTGCLSPDIYSSVL 1 FDSAL+I D++E+ + +P +Y S++ Sbjct: 116 FDSALDILDHMEELGSNP---NPHMYDSII 142