BLASTX nr result
ID: Rehmannia24_contig00022078
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00022078
         (754 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citr...  272   8e-71
ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containi...  260   4e-67
ref|XP_006386200.1| pentatricopeptide repeat-containing family p...  259   5e-67
ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containi...  256   6e-66
gb|EOY14874.1| Pentatricopeptide repeat (PPR-like) superfamily p...  252   8e-65
ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containi...  248   2e-63
gb|ESW14194.1| hypothetical protein PHAVU_008G260600g [Phaseolus...  241   2e-61
gb|EMJ28416.1| hypothetical protein PRUPE_ppa019183mg [Prunus pe...  236   8e-60
ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containi...  235   1e-59
ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containi...  234   2e-59
ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutr...  233   7e-59
ref|XP_003615696.1| Pentatricopeptide repeat-containing protein ...  232   9e-59
ref|XP_004490605.1| PREDICTED: pentatricopeptide repeat-containi...  231   3e-58
gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis]    220   3e-55
emb|CAA06829.1| DYW7 protein [Arabidopsis thaliana]                  219   8e-55
ref|NP_173402.2| pentatricopeptide repeat-containing protein [Ar...  219   8e-55
ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containi...  212   1e-52
gb|ABK26521.1| unknown [Picea sitchensis]                            194   2e-47
ref|XP_003546945.2| PREDICTED: putative pentatricopeptide repeat...  191   2e-46
gb|ESW22630.1| hypothetical protein PHAVU_005G169000g [Phaseolus...  189   1e-45

>ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citrus clementina] gi|557537195|gb|ESR48313.1| hypothetical protein CICLE_v10000229mg [Citrus clementina] Length = 889 Score = 272 bits (696), Expect = 8e-71 Identities = 135/249 (54%), Positives = 177/249 (71%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ VF ++TE +QI+P ++HY AM++LYGRSGK++EA +FI M I+PD SIW ALLTAC Sbjct: 641 GKQVFCSITECYQIIPMIEHYSAMIDLYGRSGKLEEAMEFIEDMPIEPDSSIWEALLTAC 700 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVL--YDLRGISKDSLKIKRAGIRKDSSESLG 401 H + LAV A E+L +LEP + I++L+L Y + G +D+LK+++ S G Sbjct: 701 RIHGNIDLAVLAIERLFDLEPGDVLIQRLILQIYAICGKPEDALKVRKLEKENTRRNSFG 760 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 SWIE KN+V+TFV+G + + L+SW++ + N H L +EEEKEE +GIH Sbjct: 761 QSWIEVKNLVYTFVTGGWSESYSDLLYSWLQNVPENVTARSCHSGLCIEEEEKEEISGIH 820 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 SEKLALAFALI S A TIRIVKN+RMC HCHK AK VSK H CEI+++DSKCLHHFK+ Sbjct: 821 SEKLALAFALIGSSQAPHTIRIVKNIRMCVHCHKTAKYVSKMHHCEIFLADSKCLHHFKN 880 Query: 40 GICSCGDYW 14 G CSCGDYW Sbjct: 881 GQCSCGDYW 889
>ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Solanum tuberosum] Length = 884 Score = 260 bits (664), Expect = 4e-67 Identities = 132/249 (53%), Positives = 173/249 (69%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ +FS+M EE++I+P L+HYVAMV LYGRSGK++EA DFI M ++ D+SIW ALLTA Sbjct: 638 GKRMFSSMYEEYRIVPGLEHYVAMVTLYGRSGKLEEAIDFIDNMTMEHDISIWGALLTAS 697 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVLY--DLRGISKDSLKIKRAGIRKDSSESLG 401 H + LA+HAGE+LL+L+P N I +L+L LRGIS++S+ + R R E L Sbjct: 698 RVHGNLNLAIHAGEQLLKLDPGNVVIHQLLLQLNVLRGISEESVTVMRPRKRNHHEEPLS 757 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH
221 SW E NVVH F SG +V SWI+R E+ + S + L +EEE E+ +H Sbjct: 758 WSWTEINNVVHAFASGQQSNSEVPD--SWIKRKEVKMEGSSSCNRLCIKEEENEDITRVH 815 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 SEKLAL+FALI S ++R IRIVKN+RMC+ CH+ AK VS+K+ EIY+ DSKCLHHFK Sbjct: 816 SEKLALSFALINSPQSSRVIRIVKNLRMCEDCHRIAKLVSQKYEREIYIHDSKCLHHFKD 875 Query: 40 GICSCGDYW 14 G CSCG+YW Sbjct: 876 GYCSCGNYW 884 >ref|XP_006386200.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550344175|gb|ERP63997.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 810 Score = 259 bits (663), Expect = 5e-67 Identities = 134/255 (52%), Positives = 171/255 (67%), Gaps = 8/255 (3%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 GR VFS+MTE+FQI+P +HY AMV+LYGRSG++ EA + I M I P S+W ALLTAC Sbjct: 560 GRQVFSSMTEDFQIIPASEHYAAMVDLYGRSGRLKEAIELIDNMPIKPQSSVWYALLTAC 619 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVL--YDLRGISKDSLKIKRAGIRKDSSESLG 401 H LA+ A E LL+LEP N I + +L Y + G +D+ K+K+ R + + G Sbjct: 620 RNHGNSDLAIRARENLLDLEPWNSSIHQSILQSYAMHGKYEDAPKVKKLEKRNEVQKPKG 679 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHD------ILSFQEEEKE 239 SWIE N VH+FV+GD + + L SW+ER+ + E+K HD I +EEEKE Sbjct: 680 QSWIEVNNTVHSFVAGD-QSTSYSDLFSWVERISM---EAKVHDLHCGCCIEEEEEEEKE 735 Query: 238 ETAGIHSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKC 59 E GIHSEKLALAFA+I+S A ++IRIVKN+R C CH+ AK +S KHGCEIY+SDS Sbjct: 736 EIVGIHSEKLALAFAIIRSPSAPQSIRIVKNLRTCADCHRMAKYISAKHGCEIYLSDSNF 795 Query: 58 LHHFKHGICSCGDYW 14 HHFK G CSCGDYW Sbjct: 796 FHHFKSGCCSCGDYW 810 >ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Solanum lycopersicum] Length = 884 Score = 256 bits (654), Expect = 6e-66 Identities = 130/249 (52%), Positives = 174/249 (69%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ +FS+M+E+++I+P L+H VAMVNLYGRSGK++EA +FI M ++ D+SIW 
ALLTA Sbjct: 638 GKRMFSSMSEKYRIVPGLEHCVAMVNLYGRSGKLEEAINFIDNMTMEHDISIWGALLTAS 697 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVL--YDLRGISKDSLKIKRAGIRKDSSESLG 401 H + LA+HAGE+L +L+P N I +L+L Y LRGIS++S + R R E L Sbjct: 698 RVHGNLNLAIHAGEQLFKLDPGNVVIHQLLLQLYVLRGISEESETVMRPRKRNHHEEPLS 757 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 SW E NVVH F SG +Q + SWI+R E+ + S + L +EEE E+ +H Sbjct: 758 WSWTEINNVVHAFASG--QQCNSEVPDSWIKRKEVKMEGSSSCNRLCIKEEENEDITRVH 815 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 SEKLAL+FALI S ++R IRIVKN+RMC+ CH+ AK VS+K+ EIY+ DSKCLHHFK Sbjct: 816 SEKLALSFALINSPQSSRVIRIVKNLRMCEDCHRIAKLVSQKYEREIYIHDSKCLHHFKD 875 Query: 40 GICSCGDYW 14 G CSCG+YW Sbjct: 876 GYCSCGNYW 884 >gb|EOY14874.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 1 [Theobroma cacao] gi|508722978|gb|EOY14875.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 1 [Theobroma cacao] Length = 890 Score = 252 bits (644), Expect = 8e-65 Identities = 122/249 (48%), Positives = 170/249 (68%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ +FS++++ ++I+P ++HY AM+++YGRSG++ EA +FI M I+PD S+W++LLTA Sbjct: 642 GKQIFSSISDNYEIIPAVEHYAAMIDVYGRSGRLGEAVEFIEDMPIEPDSSVWTSLLTAS 701 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLRGISKDSLKIKRAGIRKDSSESLG 401 H + LAV AGE+LL+LEP N I +++ +Y L G D LK+++ SLG Sbjct: 702 RIHRDIALAVLAGERLLDLEPANILINRVMFQIYVLSGKLDDPLKVRKLEKENILRRSLG 761 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 SWIE +N VH FV+GD + + L+SW++ + +H +EEEKEET G+H Sbjct: 762 HSWIEVRNTVHKFVTGDQSKPCADLLYSWVKSIAREVNIHDHHGRFFLEEEEKEETGGVH 821 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 SEKL LAFALI + R+IRIVKN RMC +CH AK +S K GCEIY+SD KC HHFK+ Sbjct: 822 SEKLTLAFALIGLPYSPRSIRIVKNTRMCSNCHLTAKYISLKFGCEIYLSDRKCFHHFKN 881 Query: 40 GICSCGDYW 14 G CSCGDYW Sbjct: 882 GQCSCGDYW 890 >ref|XP_002280968.2| 
PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Vitis vinifera] Length = 1545 Score = 248 bits (633), Expect = 2e-63 Identities = 127/236 (53%), Positives = 164/236 (69%), Gaps = 2/236 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ VFS+M E++QILP L+H+ AM++L GRSGK+ EA +FI MAI+PD IW+ALLTA Sbjct: 648 GKQVFSSMMEDYQILPGLEHHSAMIDLLGRSGKLGEAIEFIEDMAIEPDSCIWAALLTAS 707 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVL--YDLRGISKDSLKIKRAGIRKDSSESLG 401 H + LA+ AGE LLELEP N I + +L Y L G +D K++++ R ++ + LG Sbjct: 708 KIHGNIGLAIRAGECLLELEPSNFSIHQQILQMYALSGKFEDVSKLRKSEKRSETKQPLG 767 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 CSWIE KN+VHTFV+ D + + LHSWIE + K HD L +EEEKEE G+H Sbjct: 768 CSWIEAKNIVHTFVADDRSRPYFDFLHSWIENVARKVKAPDQHDRLFIEEEEKEEIGGVH 827 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLH 53 SEKLALAFALI A R++RIVKN+RMC CH AK +S + CEIY+SDSKCLH Sbjct: 828 SEKLALAFALIDPSCAPRSVRIVKNLRMCGDCHGTAKFLSMLYSCEIYLSDSKCLH 883 >gb|ESW14194.1| hypothetical protein PHAVU_008G260600g [Phaseolus vulgaris] Length = 893 Score = 241 bits (614), Expect = 2e-61 Identities = 123/249 (49%), Positives = 168/249 (67%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+H FSNM+E+F+I+ L+HY AMV L GRSGK+ EA +FI M I+P++S+W+A LTAC Sbjct: 648 GKHAFSNMSEDFKIILDLEHYSAMVYLLGRSGKLAEAQEFILNMPIEPNISVWTAFLTAC 707 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLRGISKDSLKIKRAGIRKDSSESLG 401 H +A+ AGE+LLEL+P+N + L+ Y L G ++ K+ + K +G Sbjct: 708 RIHRNFGMAIFAGERLLELDPENIITQHLLSQAYSLCGKYWEAPKMTKLEKEKIP---VG 764 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 SWIE N+VHTFV GD + ++ LHSW++R+ +N K + L +EEEKE+ +H Sbjct: 765 QSWIEMNNMVHTFVVGDQSKPYLDKLHSWLKRVHVNVKAHISDNGLCIEEEEKEDINSVH 824 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 SEKLA+AFALI S + +RIVKN+R+C CH AK +S +GCEIY+SDS CLHHFK Sbjct: 825 
SEKLAIAFALIDSHHRPQILRIVKNLRVCKDCHDTAKYISLAYGCEIYLSDSNCLHHFKD 884 Query: 40 GICSCGDYW 14 G CSC DYW Sbjct: 885 GHCSCRDYW 893 >gb|EMJ28416.1| hypothetical protein PRUPE_ppa019183mg [Prunus persica] Length = 882 Score = 236 bits (601), Expect = 8e-60 Identities = 123/249 (49%), Positives = 163/249 (65%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G F ++TE++QI+P L+HY AMV+LYGRSG++ EA +FI GM I+PD S+W AL TAC Sbjct: 635 GTQAFHSITEDYQIIPGLEHYSAMVDLYGRSGRLQEAMEFIEGMPIEPDSSVWGALFTAC 694 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVL--YDLRGISKDSLKIKRAGIRKDSSESLG 401 + + LAV AGE LL EP N I++L+L Y L G S+D K+++ G + LG Sbjct: 695 RIYGNLALAVRAGEHLLVSEPGNVLIQQLMLQAYALCGKSEDISKLRKFGKDYPKKKFLG 754 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 WIE KN +HTF+SGD +L L+ W++ +E K + L +EEE EE IH Sbjct: 755 QCWIEVKNSLHTFISGDRLKLCSIFLNLWLQNIEEKAKTPDLCNELCVEEEE-EEIGWIH 813 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 SEKLA AFAL S ++IRI+KN+RMC CH+ AK +S GC+IY+SD K HHF + Sbjct: 814 SEKLAFAFALSGSPSVPQSIRIMKNLRMCGDCHRIAKYISVAFGCDIYLSDVKSFHHFSN 873 Query: 40 GICSCGDYW 14 G CSCGDYW Sbjct: 874 GRCSCGDYW 882 >ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like isoform X1 [Glycine max] gi|571441335|ref|XP_006575413.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like isoform X2 [Glycine max] Length = 896 Score = 235 bits (600), Expect = 1e-59 Identities = 119/250 (47%), Positives = 166/250 (66%), Gaps = 3/250 (1%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+H FSN++EE+QI L+HY AMV L GRSGK+ +A +FI+ M ++P+ S+W+AL+TAC Sbjct: 647 GKHAFSNISEEYQIRLDLEHYSAMVYLLGRSGKLAKALEFIQNMPVEPNSSVWAALMTAC 706 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLRGISKDSLKIKRAGIRKDSSESLG 401 H +A+ AGE++ EL+P+N + L+ Y + G S ++ K+ + K + +G Sbjct: 707 RIHKNFGMAIFAGERMHELDPENIITQHLLSQAYSVCGKSLEAPKMTKLEKEKFVNIPVG 766 Query: 
400 CSWIEDKNVVHTFVSGDLRQLD-VNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGI 224 SWIE N+VHTFV GD + ++ LHSW++R+ N K + L +EEEKE + + Sbjct: 767 QSWIEMNNMVHTFVVGDDQSTPYLDKLHSWLKRVGANVKAHISDNGLCIEEEEKENISSV 826 Query: 223 HSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFK 44 HSEKLA AF LI S + +RIVKN+RMC CH AK +S +GCEIY+SDS CLHHFK Sbjct: 827 HSEKLAFAFGLIDSHHTPQILRIVKNLRMCRDCHDSAKYISLAYGCEIYLSDSNCLHHFK 886 Query: 43 HGICSCGDYW 14 G CSC DYW Sbjct: 887 DGHCSCRDYW 896 >ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Glycine max] Length = 896 Score = 234 bits (598), Expect = 2e-59 Identities = 119/250 (47%), Positives = 164/250 (65%), Gaps = 3/250 (1%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+H FSN++EE+QI L+HY AMV L GRSGK+ +A +FI+ M ++P+ S+W+ALLTAC Sbjct: 647 GKHAFSNISEEYQIRLDLEHYSAMVYLLGRSGKLAKALEFIQNMPVEPNSSVWAALLTAC 706 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLRGISKDSLKIKRAGIRKDSSESLG 401 H +A+ AGE +LEL+P+N + L+ Y + G S ++ K+ + K +G Sbjct: 707 RIHKNFGMAIFAGEHMLELDPENIITQHLLSQAYSVCGKSWEAQKMTKLEKEKFVKMPVG 766 Query: 400 CSWIEDKNVVHTFVSGDLRQLD-VNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGI 224 SWIE N+VHTFV GD + + ++ +HSW++R+ N K + L +EEEKE + Sbjct: 767 QSWIEMNNMVHTFVVGDDQSIPYLDKIHSWLKRVGENVKAHISDNGLRIEEEEKENIGSV 826 Query: 223 HSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFK 44 HSEKLA AF LI + +RIVKN+RMC CH AK +S +GCEIY+SDS CLHHFK Sbjct: 827 HSEKLAFAFGLIDFHHTPQILRIVKNLRMCRDCHDTAKYISLAYGCEIYLSDSNCLHHFK 886 Query: 43 HGICSCGDYW 14 G CSC DYW Sbjct: 887 DGHCSCRDYW 896 >ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutrema salsugineum] gi|557094240|gb|ESQ34822.1| hypothetical protein EUTSA_v10006756mg [Eutrema salsugineum] Length = 893 Score = 233 bits (593), Expect = 7e-59 Identities = 113/252 (44%), Positives = 168/252 (66%), Gaps = 5/252 (1%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ VFS++ +++ I+P L+H AM++LYGRS +++EA FI+ M 
+ + IW + LT C Sbjct: 644 GKKVFSSIADDYNIIPALEHCSAMISLYGRSNRLEEAVQFIQEMNVQSETPIWESFLTGC 703 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLRGISKDSLKIKRAGIRKDS--SES 407 H + LA+HA E L LEP+N +V +Y L SL+ K+ R+D+ + Sbjct: 704 RIHGDIDLAIHAAEHLFSLEPENPITENVVSQIYALGAKLGRSLEGKKP--RRDNLLKKP 761 Query: 406 LGCSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERM-ELNTKESKYHDILSFQEEEKEETA 230 LG SWIE +N +HTF +GD QL + L+ W+E++ L+ + +Y+ L +EE +EET Sbjct: 762 LGHSWIEVRNSIHTFTTGDKSQLCTDVLYPWVEKLCRLDDRNDQYNGELLIEEEGREETC 821 Query: 229 GIHSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHH 50 GIHSEK A+AF LI S A++TIRI+KN+RMC CH AK +S+++GC+I + D++CLHH Sbjct: 822 GIHSEKFAMAFGLISSSRAHKTIRILKNLRMCRDCHNTAKYISRRYGCDILLEDTRCLHH 881 Query: 49 FKHGICSCGDYW 14 FK+G CSC DYW Sbjct: 882 FKNGDCSCKDYW 893 >ref|XP_003615696.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355517031|gb|AES98654.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 887 Score = 232 bits (592), Expect = 9e-59 Identities = 122/249 (48%), Positives = 160/249 (64%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ VFS +T+++ + ++HY AMV L GRSGK+ EA DFI+ M I+P+ S+W ALLTAC Sbjct: 646 GKSVFSCITKDYLVRQGMEHYSAMVYLLGRSGKLAEALDFIQSMPIEPNSSVWGALLTAC 705 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLRGISKDSLKIKRAGIRKDSSESLG 401 H +AV AG+++LE EP N R L+ Y L G K + G K ++ +G Sbjct: 706 RIHRNFGVAVLAGKRMLEFEPGNNITRHLLSQAYSLCG------KFEPEG-EKAVNKPIG 758 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 SWIE NVVHTFV GD ++ LHSW++R+ +N K + L +EEEKE T+ +H Sbjct: 759 QSWIERNNVVHTFVVGDQSNPYLDKLHSWLKRVAVNVKTHVSDNELYIEEEEKENTSSVH 818 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 SEKLA AFALI + +RIVK +RMC CH AK +S +GCEIY+SDS CLHHFK Sbjct: 819 SEKLAFAFALIDPHNKPQILRIVKKLRMCRDCHDTAKYISMAYGCEIYLSDSNCLHHFKG 878 Query: 40 GICSCGDYW 14 G CSC DYW Sbjct: 879 GHCSCRDYW 887 >ref|XP_004490605.1| PREDICTED: 
pentatricopeptide repeat-containing protein At1g19720-like [Cicer arietinum] Length = 888 Score = 231 bits (588), Expect = 3e-58 Identities = 118/247 (47%), Positives = 157/247 (63%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+HVFS MT E+ I P ++HY AMV + GRSGK+ EA +FI+ M I+P+ +W ALLTAC Sbjct: 647 GKHVFSCMTNEYLIRPGMEHYSAMVYMLGRSGKLAEALEFIQNMPIEPNSLVWDALLTAC 706 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVLYDLRGISKDSLKIKRAGIRKDSSESLGCS 395 H +AV AG++LLELEP N R L+ K +L+ ++A ++ +G Sbjct: 707 KIHRNFGMAVLAGKRLLELEPGNNITRYLLSQAYSLCGKFTLEEEKA-----VNKPVGQC 761 Query: 394 WIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIHSE 215 WIE N VHTFV GD ++ L SW++R+ +N K + + L +EEE+E + +HSE Sbjct: 762 WIERNNTVHTFVVGDQSYTYLDKLRSWLKRVAVNVKTHVFDNGLCIEEEERENNSIVHSE 821 Query: 214 KLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKHGI 35 KLA AFA I R + IVKN+RMC CH AK +S +GCEIY+SDS CLHHFK G Sbjct: 822 KLAFAFAFIDPHNTPRILHIVKNLRMCRDCHDTAKYISLAYGCEIYLSDSNCLHHFKGGH 881 Query: 34 CSCGDYW 14 CSC DYW Sbjct: 882 CSCRDYW 888 >gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis] Length = 880 Score = 220 bits (561), Expect = 3e-55 Identities = 112/249 (44%), Positives = 161/249 (64%), Gaps = 2/249 (0%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 GR FS++TE++ I+P L+HY A+V+LYGR G++ EA +FI M ++PD S+W+ALLTA Sbjct: 635 GRLAFSSITEDYNIVPGLEHYAAVVDLYGRPGRLGEAMEFIENMPVEPDSSVWAALLTAS 694 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVLYDLRGISKDSLKIKRAGIRKDSSES--LG 401 H + V A +K+L+LEP N I++L ++K K + K+++ LG Sbjct: 695 RNHRNIGFTVRALDKILDLEPGNYLIQRLRAQADALVAKSENDPKMRKLEKENATKRHLG 754 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEETAGIH 221 WIE +N V+TFV+GD + L+ WI + + +H+ L +EEEKEE +H Sbjct: 755 RCWIELQNRVYTFVNGDQSE---PYLYPWIHDIAGKASKYGFHEGLCIEEEEKEEVGRVH 811 Query: 220 SEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCLHHFKH 41 EK+A+AFALI + IRIVK++RMC +CH+ AK +SK +GCEIYV+DSKCLH F + Sbjct: 812 
CEKIAIAFALIGFPRKAQCIRIVKSLRMCGNCHETAKYISKTYGCEIYVTDSKCLHRFSN 871 Query: 40 GICSCGDYW 14 G CSC DYW Sbjct: 872 GHCSCKDYW 880 >emb|CAA06829.1| DYW7 protein [Arabidopsis thaliana] Length = 406 Score = 219 bits (558), Expect = 8e-55 Identities = 113/254 (44%), Positives = 164/254 (64%), Gaps = 7/254 (2%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ VF ++ ++ I+P L+H AMV LYGR+ +++EA FI+ M I + IW + LT C Sbjct: 156 GKKVFYSIANDYHIIPALEHCSAMVYLYGRANRLEEALQFIQEMNIQSETPIWESFLTGC 215 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLR---GISKDSLKIKRAGIRKDSSE 410 H + +A+HA E L LEP+N +V +Y L G S + K +R + K + Sbjct: 216 RIHGDIDMAIHAAENLFSLEPENTATESIVSQIYALGAKLGRSLEGNKPRRDNLLK---K 272 Query: 409 SLGCSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERME-LNTKESKYHDILSFQEEEKEET 233 LG SWIE +N++HTF +GD +L + L+ +E+M L+ + +Y+ L +EE +EET Sbjct: 273 PLGQSWIEVRNLIHTFTTGDQSKLCTDVLYPLVEKMSRLDNRSDQYNGELWIEEEGREET 332 Query: 232 AGIHSEKLALAFALIKSRPANRT-IRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCL 56 GIHSEK A+AF LI S A++T IRI+KN+RMC CH AK VSK++GC+I + D++CL Sbjct: 333 CGIHSEKFAMAFGLISSSGASKTTIRILKNLRMCRDCHDTAKYVSKRYGCDILLEDTRCL 392 Query: 55 HHFKHGICSCGDYW 14 HHFK+G CSC DYW Sbjct: 393 HHFKNGDCSCKDYW 406 >ref|NP_173402.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75263158|sp|Q9FXH1.1|PPR52_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g19720; AltName: Full=Protein DYW7 gi|10086495|gb|AAG12555.1|AC007797_15 Unknown Protein [Arabidopsis thaliana] gi|332191770|gb|AEE29891.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 894 Score = 219 bits (558), Expect = 8e-55 Identities = 113/254 (44%), Positives = 164/254 (64%), Gaps = 7/254 (2%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 G+ VF ++ ++ I+P L+H AMV LYGR+ +++EA FI+ M I + IW + LT C Sbjct: 644 GKKVFYSIANDYHIIPALEHCSAMVYLYGRANRLEEALQFIQEMNIQSETPIWESFLTGC 703 Query: 574 
LRHDKVKLAVHAGEKLLELEPDNGFIRKLV--LYDLR---GISKDSLKIKRAGIRKDSSE 410 H + +A+HA E L LEP+N +V +Y L G S + K +R + K + Sbjct: 704 RIHGDIDMAIHAAENLFSLEPENTATESIVSQIYALGAKLGRSLEGNKPRRDNLLK---K 760 Query: 409 SLGCSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERME-LNTKESKYHDILSFQEEEKEET 233 LG SWIE +N++HTF +GD +L + L+ +E+M L+ + +Y+ L +EE +EET Sbjct: 761 PLGQSWIEVRNLIHTFTTGDQSKLCTDVLYPLVEKMSRLDNRSDQYNGELWIEEEGREET 820 Query: 232 AGIHSEKLALAFALIKSRPANRT-IRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDSKCL 56 GIHSEK A+AF LI S A++T IRI+KN+RMC CH AK VSK++GC+I + D++CL Sbjct: 821 CGIHSEKFAMAFGLISSSGASKTTIRILKNLRMCRDCHDTAKYVSKRYGCDILLEDTRCL 880 Query: 55 HHFKHGICSCGDYW 14 HHFK+G CSC DYW Sbjct: 881 HHFKNGDCSCKDYW 894 >ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Cucumis sativus] Length = 1463 Score = 212 bits (539), Expect = 1e-52 Identities = 112/226 (49%), Positives = 152/226 (67%), Gaps = 3/226 (1%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 GRHVFS++TEE QILP LDHY+AMV+LYGRSG++ +A +FI M I+PDVSIW++LLTAC Sbjct: 643 GRHVFSSITEEHQILPTLDHYLAMVDLYGRSGRLADAIEFIEDMPIEPDVSIWTSLLTAC 702 Query: 574 LRHDKVKLAVHAGEKLLELEPDNGFIRKLVL--YDLRGISKDSLKIKRAGIRKDSSESLG 401 H + LAV A ++L ELEPDN I +L++ Y L G + +LK+++ G + Sbjct: 703 RFHGNLNLAVLAAKRLHELEPDNHVIYRLLVQAYALYGKFEQTLKVRKLGKESAMKKCTA 762 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKYHDILSFQEEEKEE-TAGI 224 W+E +N VH FV+GD +LDV L++WI+ +E K+ H LS +EEEKEE G Sbjct: 763 QCWVEVRNKVHLFVTGDQSKLDV--LNTWIKSIEGKVKKFNNHHQLSIEEEEKEEKIGGF 820 Query: 223 HSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGC 86 H EK A AF LI S ++I+IVKN+RMC CH+ AK +S + C Sbjct: 821 HCEKFAFAFGLIGSSHTRKSIKIVKNLRMCVDCHQMAKYISAAYEC 866 >gb|ABK26521.1| unknown [Picea sitchensis] Length = 370 Score = 194 bits (494), Expect = 2e-47 Identities = 103/257 (40%), Positives = 151/257 (58%), Gaps = 10/257 (3%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 GR+ F +MT + I P +HY MV+L+GR+G +DEA +FI M ++P+ 
S+W +LL AC Sbjct: 118 GRNYFDSMTRDHGISPKAEHYSCMVDLFGRAGCLDEALNFINQMPVEPNASVWGSLLGAC 177 Query: 574 LRHDKVKLAVHAGEKLLELEPDN--GFIRKLVLYDLRGISKDSLKIKRAGIRKDSSESLG 401 H ++LA A E+L+EL P+N ++ +Y G D+ K+++ + + G Sbjct: 178 RVHGNIELAERAVEQLIELTPENPGTYVLLSNIYAAAGRWDDAGKVRKMMKDRSVKKEPG 237 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKY--------HDILSFQEEE 245 CSWIE +N VH F+ GD + ++ +E + L K + Y HD+ +EE+ Sbjct: 238 CSWIEVQNKVHPFIVGDSSHPQIEEIYETLETLTLQMKAAGYIPNTNFVLHDV---EEEQ 294 Query: 244 KEETAGIHSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDS 65 KE G HSEKLA+AF +I S P TIR+VKN+R+C CH K +S+ EI + D+ Sbjct: 295 KEWILGHHSEKLAIAFGII-STPPGTTIRVVKNLRVCGDCHTATKFISRIVSREIVLRDT 353 Query: 64 KCLHHFKHGICSCGDYW 14 HHFK G CSCGDYW Sbjct: 354 HRFHHFKDGQCSCGDYW 370 >ref|XP_003546945.2| PREDICTED: putative pentatricopeptide repeat-containing protein At3g23330-like [Glycine max] Length = 640 Score = 191 bits (486), Expect = 2e-46 Identities = 101/257 (39%), Positives = 156/257 (60%), Gaps = 10/257 (3%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 GR +++ M + +I P ++HY MV+L G G++DEA+D IR M + PD +W ALL +C Sbjct: 388 GRALYNLMVRDCRINPTVEHYTCMVDLLGHCGQLDEAYDLIRQMDVMPDSGVWGALLNSC 447 Query: 574 LRHDKVKLAVHAGEKLLELEPDN--GFIRKLVLYDLRGISKDSLKIKRAGIRKDSSESLG 401 H V+LA A EKL+ELEPD+ ++ +Y G + ++++ I K +++ Sbjct: 448 KTHGNVELAEVALEKLIELEPDDSGNYVILANMYAQSGKWEGVARLRQLMIDKGIKKNIA 507 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKY--------HDILSFQEEE 245 CSWIE KN V+ F+SGD+ + ++++ ++R+E +E+ Y HD+ +E+E Sbjct: 508 CSWIEVKNKVYAFLSGDVSHPNSGAIYAELKRLEGLMREAGYVPDTGSVFHDV---EEDE 564 Query: 244 KEETAGIHSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDS 65 K + HSE+LA+AF LI + P R + I KN+R+C+ CH K +SK EI V D Sbjct: 565 KTDMVCSHSERLAIAFGLISTLPGTRLL-ITKNLRICEDCHVAIKFISKITEREITVRDV 623 Query: 64 KCLHHFKHGICSCGDYW 14 HHF+HG+CSCGDYW Sbjct: 624 NRYHHFRHGLCSCGDYW 640 >gb|ESW22630.1| hypothetical protein PHAVU_005G169000g [Phaseolus vulgaris] Length = 631 Score = 189 bits 
(479), Expect = 1e-45 Identities = 101/257 (39%), Positives = 152/257 (59%), Gaps = 10/257 (3%) Frame = -1 Query: 754 GRHVFSNMTEEFQILPCLDHYVAMVNLYGRSGKIDEAFDFIRGMAIDPDVSIWSALLTAC 575 GR +++ M ++ I P + HY MV+L G GK+DEA+D IR M + D +W ALL +C Sbjct: 379 GRALYNLMVRDYCINPTVQHYTCMVDLLGHCGKLDEAYDLIRQMDVTADSGVWGALLNSC 438 Query: 574 LRHDKVKLAVHAGEKLLELEPDN--GFIRKLVLYDLRGISKDSLKIKRAGIRKDSSESLG 401 H V+L A EKL+ELEPD+ ++ +Y G + ++++ I K +++ Sbjct: 439 KTHGNVELGEVALEKLIELEPDDSGNYVILANMYAQSGKWEGVARLRQLMIDKGIKKNIA 498 Query: 400 CSWIEDKNVVHTFVSGDLRQLDVNSLHSWIERMELNTKESKY--------HDILSFQEEE 245 CSWIE KN V+ F+SGD+ + +++S ++R+E +E+ Y HD+ +E+E Sbjct: 499 CSWIEVKNKVYAFLSGDVSHPNSGAIYSELKRLEGLMREAGYVPHTGSVFHDV---EEDE 555 Query: 244 KEETAGIHSEKLALAFALIKSRPANRTIRIVKNMRMCDHCHKFAKTVSKKHGCEIYVSDS 65 K HSE+LA+AF LI + P R + I KN+R+C+ CH K +S+ EI V D Sbjct: 556 KANMVCSHSERLAIAFGLISTLPGTRLL-ITKNLRICEDCHVAIKFISEITEREITVRDV 614 Query: 64 KCLHHFKHGICSCGDYW 14 HHFKHG+CSCGDYW Sbjct: 615 NRYHHFKHGLCSCGDYW 631