BLASTX nr result
ID: Atropa21_contig00010746
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atropa21_contig00010746 (645 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containi... 403 e-110 ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containi... 399 e-109 ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citr... 216 3e-54 gb|ESW14194.1| hypothetical protein PHAVU_008G260600g [Phaseolus... 209 7e-52 ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containi... 204 2e-50 gb|EOY14874.1| Pentatricopeptide repeat (PPR-like) superfamily p... 204 2e-50 ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containi... 198 1e-48 ref|XP_003615696.1| Pentatricopeptide repeat-containing protein ... 198 1e-48 ref|XP_006386200.1| pentatricopeptide repeat-containing family p... 195 8e-48 ref|XP_004490605.1| PREDICTED: pentatricopeptide repeat-containi... 194 1e-47 ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containi... 194 2e-47 gb|EMJ28416.1| hypothetical protein PRUPE_ppa019183mg [Prunus pe... 189 5e-46 gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis] 183 4e-44 ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutr... 181 1e-43 emb|CAA06829.1| DYW7 protein [Arabidopsis thaliana] 171 1e-40 ref|NP_173402.2| pentatricopeptide repeat-containing protein [Ar... 171 1e-40 ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containi... 170 4e-40 ref|XP_002443755.1| hypothetical protein SORBIDRAFT_07g001380 [S... 159 5e-37 gb|AFW74323.1| hypothetical protein ZEAMMB73_642674 [Zea mays] 155 9e-36 gb|AFW74322.1| hypothetical protein ZEAMMB73_642674 [Zea mays] 155 9e-36 >ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Solanum tuberosum] Length = 884 Score = 403 bits (1036), Expect = e-110 Identities = 196/214 (91%), Positives = 204/214 (95%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+EEAIDFIDNMTMEHDIS+W ALLTASRVHGNLN+AIHAG+QLLKLDPGNVVI+QL Sbjct: 667 RSGKLEEAIDFIDNMTMEHDISIWGALLTASRVHGNLNLAIHAGEQLLKLDPGNVVIHQL 726 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPESWIK 285 L QL VLRG SEESVTVMRPRKRNHHEE LSWSWTEINNVVHAFASGQQSNSEVP+SWIK Sbjct: 727 LLQLNVLRGISEESVTVMRPRKRNHHEEPLSWSWTEINNVVHAFASGQQSNSEVPDSWIK 786 Query: 284 RKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMCED 105 RK+ K EGSSSCNRLCI EEE+EDITRVHSEKLALSFALINSPQS RVIRIVKNLRMCED Sbjct: 787 RKEVKMEGSSSCNRLCIKEEENEDITRVHSEKLALSFALINSPQSSRVIRIVKNLRMCED 846 Query: 104 CHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 CHR AKLVSQKYEREIYIHDSKCLHHFKDGYCSC Sbjct: 847 CHRIAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 880 >ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Solanum lycopersicum] Length = 884 Score = 399 bits (1025), Expect = e-109 Identities = 193/214 (90%), Positives = 202/214 (94%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+EEAI+FIDNMTMEHDIS+W ALLTASRVHGNLN+AIHAG+QL KLDPGNVVI+QL Sbjct: 667 RSGKLEEAINFIDNMTMEHDISIWGALLTASRVHGNLNLAIHAGEQLFKLDPGNVVIHQL 726 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPESWIK 285 L QLYVLRG SEES TVMRPRKRNHHEE LSWSWTEINNVVHAFASGQQ NSEVP+SWIK Sbjct: 727 LLQLYVLRGISEESETVMRPRKRNHHEEPLSWSWTEINNVVHAFASGQQCNSEVPDSWIK 786 Query: 284 RKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMCED 105 RK+ K EGSSSCNRLCI EEE+EDITRVHSEKLALSFALINSPQS RVIRIVKNLRMCED Sbjct: 787 RKEVKMEGSSSCNRLCIKEEENEDITRVHSEKLALSFALINSPQSSRVIRIVKNLRMCED 846 Query: 104 CHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 CHR AKLVSQKYEREIYIHDSKCLHHFKDGYCSC Sbjct: 847 CHRIAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 880 >ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citrus clementina] gi|557537195|gb|ESR48313.1| hypothetical protein CICLE_v10000229mg [Citrus clementina] Length = 889 Score = 216 bits (551), Expect = 3e-54 Identities = 104/216 (48%), Positives = 152/216 (70%), Gaps = 2/216 (0%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+EEA++FI++M +E D S+W+ALLTA R+HGN+++A+ A ++L L+PG+V+I +L Sbjct: 670 RSGKLEEAMEFIEDMPIEPDSSIWEALLTACRIHGNIDLAVLAIERLFDLEPGDVLIQRL 729 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASG--QQSNSEVPESW 291 + Q+Y + G E+++ V + K N S SW E+ N+V+ F +G +S S++ SW Sbjct: 730 ILQIYAICGKPEDALKVRKLEKENTRRNSFGQSWIEVKNLVYTFVTGGWSESYSDLLYSW 789 Query: 290 IKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMC 111 ++ S + LCI EEE E+I+ +HSEKLAL+FALI S Q+ IRIVKN+RMC Sbjct: 790 LQNVPENVTARSCHSGLCIEEEEKEEISGIHSEKLALAFALIGSSQAPHTIRIVKNIRMC 849 Query: 110 EDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 CH+TAK VS+ + EI++ DSKCLHHFK+G CSC Sbjct: 850 VHCHKTAKYVSKMHHCEIFLADSKCLHHFKNGQCSC 885 >gb|ESW14194.1| hypothetical protein PHAVU_008G260600g [Phaseolus vulgaris] Length = 893 Score = 209 bits (531), Expect = 7e-52 Identities = 109/216 (50%), Positives = 143/216 (66%), Gaps = 2/216 (0%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+ EA +FI NM +E +IS+W A LTA R+H N +AI AG++LL+LDP N++ L Sbjct: 677 RSGKLAEAQEFILNMPIEPNISVWTAFLTACRIHRNFGMAIFAGERLLELDPENIITQHL 736 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPE--SW 291 L Q Y L G E+ + + K + + SW E+NN+VH F G QS + + SW Sbjct: 737 LSQAYSLCGKYWEAPKMTKLEKE---KIPVGQSWIEMNNMVHTFVVGDQSKPYLDKLHSW 793 Query: 290 IKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMC 111 +KR + S N LCI EEE EDI VHSEKLA++FALI+S +++RIVKNLR+C Sbjct: 794 LKRVHVNVKAHISDNGLCIEEEEKEDINSVHSEKLAIAFALIDSHHRPQILRIVKNLRVC 853 Query: 110 EDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 +DCH TAK +S Y EIY+ DS CLHHFKDG+CSC Sbjct: 854 KDCHDTAKYISLAYGCEIYLSDSNCLHHFKDGHCSC 889 >ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like isoform X1 [Glycine max] gi|571441335|ref|XP_006575413.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like isoform X2 [Glycine max] Length = 896 Score = 204 bits (519), Expect = 2e-50 Identities = 102/217 (47%), Positives = 142/217 (65%), Gaps = 3/217 (1%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+ +A++FI NM +E + S+W AL+TA R+H N +AI AG+++ +LDP N++ L Sbjct: 676 RSGKLAKALEFIQNMPVEPNSSVWAALMTACRIHKNFGMAIFAGERMHELDPENIITQHL 735 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPE---S 294 L Q Y + G S E+ + + K + SW E+NN+VH F G ++ + S Sbjct: 736 LSQAYSVCGKSLEAPKMTKLEKEKFVNIPVGQSWIEMNNMVHTFVVGDDQSTPYLDKLHS 795 Query: 293 WIKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRM 114 W+KR A + S N LCI EEE E+I+ VHSEKLA +F LI+S + +++RIVKNLRM Sbjct: 796 WLKRVGANVKAHISDNGLCIEEEEKENISSVHSEKLAFAFGLIDSHHTPQILRIVKNLRM 855 Query: 113 CEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 C DCH +AK +S Y EIY+ DS CLHHFKDG+CSC Sbjct: 856 CRDCHDSAKYISLAYGCEIYLSDSNCLHHFKDGHCSC 892 >gb|EOY14874.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 1 [Theobroma cacao] gi|508722978|gb|EOY14875.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 1 [Theobroma cacao] Length = 890 Score = 204 bits (518), Expect = 2e-50 Identities = 101/216 (46%), Positives = 143/216 (66%), Gaps = 2/216 (0%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSG++ EA++FI++M +E D S+W +LLTASR+H ++ +A+ AG++LL L+P N++I ++ Sbjct: 671 RSGRLGEAVEFIEDMPIEPDSSVWTSLLTASRIHRDIALAVLAGERLLDLEPANILINRV 730 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSN--SEVPESW 291 +FQ+YVL G ++ + V + K N SL SW E+ N VH F +G QS +++ SW Sbjct: 731 MFQIYVLSGKLDDPLKVRKLEKENILRRSLGHSWIEVRNTVHKFVTGDQSKPCADLLYSW 790 Query: 290 IKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMC 111 +K + R + EEE E+ VHSEKL L+FALI P S R IRIVKN RMC Sbjct: 791 VKSIAREVNIHDHHGRFFLEEEEKEETGGVHSEKLTLAFALIGLPYSPRSIRIVKNTRMC 850 Query: 110 EDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 +CH TAK +S K+ EIY+ D KC HHFK+G CSC Sbjct: 851 SNCHLTAKYISLKFGCEIYLSDRKCFHHFKNGQCSC 886 >ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Glycine max] Length = 896 Score = 198 bits (504), Expect = 1e-48 Identities = 102/217 (47%), Positives = 138/217 (63%), Gaps = 3/217 (1%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+ +A++FI NM +E + S+W ALLTA R+H N +AI AG+ +L+LDP N++ L Sbjct: 676 RSGKLAKALEFIQNMPVEPNSSVWAALLTACRIHKNFGMAIFAGEHMLELDPENIITQHL 735 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPE---S 294 L Q Y + G S E+ + + K + + SW E+NN+VH F G + + S Sbjct: 736 LSQAYSVCGKSWEAQKMTKLEKEKFVKMPVGQSWIEMNNMVHTFVVGDDQSIPYLDKIHS 795 Query: 293 WIKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRM 114 W+KR + S N L I EEE E+I VHSEKLA +F LI+ + +++RIVKNLRM Sbjct: 796 WLKRVGENVKAHISDNGLRIEEEEKENIGSVHSEKLAFAFGLIDFHHTPQILRIVKNLRM 855 Query: 113 CEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 C DCH TAK +S Y EIY+ DS CLHHFKDG+CSC Sbjct: 856 CRDCHDTAKYISLAYGCEIYLSDSNCLHHFKDGHCSC 892 >ref|XP_003615696.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355517031|gb|AES98654.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 887 Score = 198 bits (504), Expect = 1e-48 Identities = 105/216 (48%), Positives = 134/216 (62%), Gaps = 2/216 (0%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+ EA+DFI +M +E + S+W ALLTA R+H N VA+ AGK++L+ +PGN + L Sbjct: 675 RSGKLAEALDFIQSMPIEPNSSVWGALLTACRIHRNFGVAVLAGKRMLEFEPGNNITRHL 734 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPE--SW 291 L Q Y L G E P + + SW E NNVVH F G QSN + + SW Sbjct: 735 LSQAYSLCGKFE-------PEGEKAVNKPIGQSWIERNNVVHTFVVGDQSNPYLDKLHSW 787 Query: 290 IKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMC 111 +KR + S N L I EEE E+ + VHSEKLA +FALI+ +++RIVK LRMC Sbjct: 788 LKRVAVNVKTHVSDNELYIEEEEKENTSSVHSEKLAFAFALIDPHNKPQILRIVKKLRMC 847 Query: 110 EDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 DCH TAK +S Y EIY+ DS CLHHFK G+CSC Sbjct: 848 RDCHDTAKYISMAYGCEIYLSDSNCLHHFKGGHCSC 883 >ref|XP_006386200.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550344175|gb|ERP63997.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 810 Score = 195 bits (496), Expect = 8e-48 Identities = 104/218 (47%), Positives = 139/218 (63%), Gaps = 4/218 (1%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSG+++EAI+ IDNM ++ S+W ALLTA R HGN ++AI A + LL L+P N I+Q Sbjct: 589 RSGRLKEAIELIDNMPIKPQSSVWYALLTACRNHGNSDLAIRARENLLDLEPWNSSIHQS 648 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVP-ESWI 288 + Q Y + G E++ V + KRN ++ SW E+NN VH+F +G QS S SW+ Sbjct: 649 ILQSYAMHGKYEDAPKVKKLEKRNEVQKPKGQSWIEVNNTVHSFVAGDQSTSYSDLFSWV 708 Query: 287 KRKKAKAEGSSSCNRLCITEEEH---EDITRVHSEKLALSFALINSPQSYRVIRIVKNLR 117 +R +A+ CI EEE E+I +HSEKLAL+FA+I SP + + IRIVKNLR Sbjct: 709 ERISMEAKVHDLHCGCCIEEEEEEEKEEIVGIHSEKLALAFAIIRSPSAPQSIRIVKNLR 768 Query: 116 MCEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 C DCHR AK +S K+ EIY+ DS HHFK G CSC Sbjct: 769 TCADCHRMAKYISAKHGCEIYLSDSNFFHHFKSGCCSC 806 >ref|XP_004490605.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Cicer arietinum] Length = 888 Score = 194 bits (494), Expect = 1e-47 Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 4/218 (1%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+ EA++FI NM +E + +WDALLTA ++H N +A+ AGK+LL+L+PGN + L Sbjct: 676 RSGKLAEALEFIQNMPIEPNSLVWDALLTACKIHRNFGMAVLAGKRLLELEPGNNITRYL 735 Query: 464 LFQLYVLRG--TSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPE-- 297 L Q Y L G T EE V +P + W E NN VH F G QS + + + Sbjct: 736 LSQAYSLCGKFTLEEEKAVNKP---------VGQCWIERNNTVHTFVVGDQSYTYLDKLR 786 Query: 296 SWIKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLR 117 SW+KR + N LCI EEE E+ + VHSEKLA +FA I+ + R++ IVKNLR Sbjct: 787 SWLKRVAVNVKTHVFDNGLCIEEEERENNSIVHSEKLAFAFAFIDPHNTPRILHIVKNLR 846 Query: 116 MCEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 MC DCH TAK +S Y EIY+ DS CLHHFK G+CSC Sbjct: 847 MCRDCHDTAKYISLAYGCEIYLSDSNCLHHFKGGHCSC 884 >ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Vitis vinifera] Length = 1545 Score = 194 bits (493), Expect = 2e-47 Identities = 100/207 (48%), Positives = 136/207 (65%), Gaps = 2/207 (0%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSGK+ EAI+FI++M +E D +W ALLTAS++HGN+ +AI AG+ LL+L+P N I+Q Sbjct: 677 RSGKLGEAIEFIEDMAIEPDSCIWAALLTASKIHGNIGLAIRAGECLLELEPSNFSIHQQ 736 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNS--EVPESW 291 + Q+Y L G E+ + + KR+ ++ L SW E N+VH F + +S + SW Sbjct: 737 ILQMYALSGKFEDVSKLRKSEKRSETKQPLGCSWIEAKNIVHTFVADDRSRPYFDFLHSW 796 Query: 290 IKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMC 111 I+ K + +RL I EEE E+I VHSEKLAL+FALI+ + R +RIVKNLRMC Sbjct: 797 IENVARKVKAPDQHDRLFIEEEEKEEIGGVHSEKLALAFALIDPSCAPRSVRIVKNLRMC 856 Query: 110 EDCHRTAKLVSQKYEREIYIHDSKCLH 30 DCH TAK +S Y EIY+ DSKCLH Sbjct: 857 GDCHGTAKFLSMLYSCEIYLSDSKCLH 883 >gb|EMJ28416.1| hypothetical protein PRUPE_ppa019183mg [Prunus persica] Length = 882 Score = 189 bits (481), Expect = 5e-46 Identities = 98/216 (45%), Positives = 137/216 (63%), Gaps = 2/216 (0%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSG+++EA++FI+ M +E D S+W AL TA R++GNL +A+ AG+ LL +PGNV+I QL Sbjct: 664 RSGRLQEAMEFIEGMPIEPDSSVWGALFTACRIYGNLALAVRAGEHLLVSEPGNVLIQQL 723 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSN--SEVPESW 291 + Q Y L G SE+ + + K ++ L W E+ N +H F SG + S W Sbjct: 724 MLQAYALCGKSEDISKLRKFGKDYPKKKFLGQCWIEVKNSLHTFISGDRLKLCSIFLNLW 783 Query: 290 IKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMC 111 ++ + KA+ CN LC+ EEE E+I +HSEKLA +FAL SP + IRI+KNLRMC Sbjct: 784 LQNIEEKAKTPDLCNELCV-EEEEEEIGWIHSEKLAFAFALSGSPSVPQSIRIMKNLRMC 842 Query: 110 EDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 DCHR AK +S + +IY+ D K HHF +G CSC Sbjct: 843 GDCHRIAKYISVAFGCDIYLSDVKSFHHFSNGRCSC 878 >gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis] Length = 880 Score = 183 bits (464), Expect = 4e-44 Identities = 96/214 (44%), Positives = 131/214 (61%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 R G++ EA++FI+NM +E D S+W ALLTASR H N+ + A ++L L+PGN +I +L Sbjct: 664 RPGRLGEAMEFIENMPVEPDSSVWAALLTASRNHRNIGFTVRALDKILDLEPGNYLIQRL 723 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPESWIK 285 Q L SE + + K N + L W E+ N V+ F +G QS + WI Sbjct: 724 RAQADALVAKSENDPKMRKLEKENATKRHLGRCWIELQNRVYTFVNGDQSEPYL-YPWIH 782 Query: 284 RKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMCED 105 KA LCI EEE E++ RVH EK+A++FALI P+ + IRIVK+LRMC + Sbjct: 783 DIAGKASKYGFHEGLCIEEEEKEEVGRVHCEKIAIAFALIGFPRKAQCIRIVKSLRMCGN 842 Query: 104 CHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 CH TAK +S+ Y EIY+ DSKCLH F +G+CSC Sbjct: 843 CHETAKYISKTYGCEIYVTDSKCLHRFSNGHCSC 876 >ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutrema salsugineum] gi|557094240|gb|ESQ34822.1| hypothetical protein EUTSA_v10006756mg [Eutrema salsugineum] Length = 893 Score = 181 bits (460), Expect = 1e-43 Identities = 84/217 (38%), Positives = 138/217 (63%), Gaps = 3/217 (1%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RS ++EEA+ FI M ++ + +W++ LT R+HG++++AIHA + L L+P N + + Sbjct: 673 RSNRLEEAVQFIQEMNVQSETPIWESFLTGCRIHGDIDLAIHAAEHLFSLEPENPITENV 732 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSN--SEVPESW 291 + Q+Y L S+ +PR+ N ++ L SW E+ N +H F +G +S ++V W Sbjct: 733 VSQIYALGAKLGRSLEGKKPRRDNLLKKPLGHSWIEVRNSIHTFTTGDKSQLCTDVLYPW 792 Query: 290 IKRKKAKAEGSSSCN-RLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRM 114 +++ + + N L I EE E+ +HSEK A++F LI+S ++++ IRI+KNLRM Sbjct: 793 VEKLCRLDDRNDQYNGELLIEEEGREETCGIHSEKFAMAFGLISSSRAHKTIRILKNLRM 852 Query: 113 CEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 C DCH TAK +S++Y +I + D++CLHHFK+G CSC Sbjct: 853 CRDCHNTAKYISRRYGCDILLEDTRCLHHFKNGDCSC 889 >emb|CAA06829.1| DYW7 protein [Arabidopsis thaliana] Length = 406 Score = 171 bits (434), Expect = 1e-40 Identities = 86/218 (39%), Positives = 133/218 (61%), Gaps = 4/218 (1%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 R+ ++EEA+ FI M ++ + +W++ LT R+HG++++AIHA + L L+P N + Sbjct: 185 RANRLEEALQFIQEMNIQSETPIWESFLTGCRIHGDIDMAIHAAENLFSLEPENTATESI 244 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSN--SEVPESW 291 + Q+Y L S+ +PR+ N ++ L SW E+ N++H F +G QS ++V Sbjct: 245 VSQIYALGAKLGRSLEGNKPRRDNLLKKPLGQSWIEVRNLIHTFTTGDQSKLCTDVLYPL 304 Query: 290 IKRKKAKAEGSSSCN-RLCITEEEHEDITRVHSEKLALSFALINSP-QSYRVIRIVKNLR 117 +++ S N L I EE E+ +HSEK A++F LI+S S IRI+KNLR Sbjct: 305 VEKMSRLDNRSDQYNGELWIEEEGREETCGIHSEKFAMAFGLISSSGASKTTIRILKNLR 364 Query: 116 MCEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 MC DCH TAK VS++Y +I + D++CLHHFK+G CSC Sbjct: 365 MCRDCHDTAKYVSKRYGCDILLEDTRCLHHFKNGDCSC 402 >ref|NP_173402.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75263158|sp|Q9FXH1.1|PPR52_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g19720; AltName: Full=Protein DYW7 gi|10086495|gb|AAG12555.1|AC007797_15 Unknown Protein [Arabidopsis thaliana] gi|332191770|gb|AEE29891.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 894 Score = 171 bits (434), Expect = 1e-40 Identities = 86/218 (39%), Positives = 133/218 (61%), Gaps = 4/218 (1%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 R+ ++EEA+ FI M ++ + +W++ LT R+HG++++AIHA + L L+P N + Sbjct: 673 RANRLEEALQFIQEMNIQSETPIWESFLTGCRIHGDIDMAIHAAENLFSLEPENTATESI 732 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSN--SEVPESW 291 + Q+Y L S+ +PR+ N ++ L SW E+ N++H F +G QS ++V Sbjct: 733 VSQIYALGAKLGRSLEGNKPRRDNLLKKPLGQSWIEVRNLIHTFTTGDQSKLCTDVLYPL 792 Query: 290 IKRKKAKAEGSSSCN-RLCITEEEHEDITRVHSEKLALSFALINSP-QSYRVIRIVKNLR 117 +++ S N L I EE E+ +HSEK A++F LI+S S IRI+KNLR Sbjct: 793 VEKMSRLDNRSDQYNGELWIEEEGREETCGIHSEKFAMAFGLISSSGASKTTIRILKNLR 852 Query: 116 MCEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 MC DCH TAK VS++Y +I + D++CLHHFK+G CSC Sbjct: 853 MCRDCHDTAKYVSKRYGCDILLEDTRCLHHFKNGDCSC 890 >ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Cucumis sativus] Length = 1463 Score = 170 bits (430), Expect = 4e-40 Identities = 87/194 (44%), Positives = 128/194 (65%), Gaps = 1/194 (0%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSG++ +AI+FI++M +E D+S+W +LLTA R HGNLN+A+ A K+L +L+P N VIY+L Sbjct: 672 RSGRLADAIEFIEDMPIEPDVSIWTSLLTACRFHGNLNLAVLAAKRLHELEPDNHVIYRL 731 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPESWIK 285 L Q Y L G E+++ V + K + ++ + W E+ N VH F +G QS +V +WIK Sbjct: 732 LVQAYALYGKFEQTLKVRKLGKESAMKKCTAQCWVEVRNKVHLFVTGDQSKLDVLNTWIK 791 Query: 284 RKKAKAEGSSSCNRLCITEEEHED-ITRVHSEKLALSFALINSPQSYRVIRIVKNLRMCE 108 + K + ++ ++L I EEE E+ I H EK A +F LI S + + I+IVKNLRMC Sbjct: 792 SIEGKVKKFNNHHQLSIEEEEKEEKIGGFHCEKFAFAFGLIGSSHTRKSIKIVKNLRMCV 851 Query: 107 DCHRTAKLVSQKYE 66 DCH+ AK +S YE Sbjct: 852 DCHQMAKYISAAYE 865 >ref|XP_002443755.1| hypothetical protein SORBIDRAFT_07g001380 [Sorghum bicolor] gi|241940105|gb|EES13250.1| hypothetical protein SORBIDRAFT_07g001380 [Sorghum bicolor] Length = 871 Score = 159 bits (403), Expect = 5e-37 Identities = 91/214 (42%), Positives = 131/214 (61%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSG ++EA +FIDNM + ++++W+ALLTA+ +HGN +A A ++L LDP + I +L Sbjct: 657 RSGSLQEAYEFIDNMPLIPNLAVWEALLTAASIHGNARLANLAARELSLLDPSDPRIQRL 716 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASGQQSNSEVPESWIK 285 +F + L G S + V +M + E + EI N V+ F++ E + +K Sbjct: 717 VFNYWDLTGKSAD-VPLMTVYNKGRELEDVDSCSVEIKNKVYLFSTSDNLALENTIAELK 775 Query: 284 RKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNLRMCED 105 + S CN EEE E+++ +H EKLA++FA+ NSP +R IRI+K LRMC Sbjct: 776 LIMIQIRMSLLCNGTD-AEEEKEELSGIHCEKLAIAFAVSNSPP-FRNIRIIKTLRMCSL 833 Query: 104 CHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 CH AKLVS+KYER+I I DS CLH FK+G CSC Sbjct: 834 CHVFAKLVSEKYERQILIKDSNCLHKFKNGKCSC 867 >gb|AFW74323.1| hypothetical protein ZEAMMB73_642674 [Zea mays] Length = 876 Score = 155 bits (392), Expect = 9e-36 Identities = 89/219 (40%), Positives = 133/219 (60%), Gaps = 5/219 (2%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSG ++EA +FI NM + ++++W+ALLTA+ +HGN +A ++L LDP + I +L Sbjct: 660 RSGSLQEAYEFIGNMPLIPNLAVWEALLTAATIHGNARLANLTARELSSLDPSDPRIQRL 719 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASG-----QQSNSEVP 300 +F + L G S + V +M E + EI N V+ F++G + + +E+ Sbjct: 720 VFNYWGLTGKSVD-VPLMTVYNGGRELEDVDSCSVEIKNNVYLFSTGDNLALESTVAELK 778 Query: 299 ESWIKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNL 120 I+ + + S+ N EEE E+++ +H EKLA++FA+ NSP +R IRI+K L Sbjct: 779 LIMIQIRMSLLNISNETN----AEEEKEELSGIHCEKLAIAFAISNSPP-FRSIRIIKTL 833 Query: 119 RMCEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 RMC CH AKLVS+KYER+I I DS CLH F+DG CSC Sbjct: 834 RMCSHCHIFAKLVSEKYERQILIKDSNCLHKFEDGKCSC 872 >gb|AFW74322.1| hypothetical protein ZEAMMB73_642674 [Zea mays] Length = 1028 Score = 155 bits (392), Expect = 9e-36 Identities = 89/219 (40%), Positives = 133/219 (60%), Gaps = 5/219 (2%) Frame = -2 Query: 644 RSGKIEEAIDFIDNMTMEHDISLWDALLTASRVHGNLNVAIHAGKQLLKLDPGNVVIYQL 465 RSG ++EA +FI NM + ++++W+ALLTA+ +HGN +A ++L LDP + I +L Sbjct: 660 RSGSLQEAYEFIGNMPLIPNLAVWEALLTAATIHGNARLANLTARELSSLDPSDPRIQRL 719 Query: 464 LFQLYVLRGTSEESVTVMRPRKRNHHEESLSWSWTEINNVVHAFASG-----QQSNSEVP 300 +F + L G S + V +M E + EI N V+ F++G + + +E+ Sbjct: 720 VFNYWGLTGKSVD-VPLMTVYNGGRELEDVDSCSVEIKNNVYLFSTGDNLALESTVAELK 778 Query: 299 ESWIKRKKAKAEGSSSCNRLCITEEEHEDITRVHSEKLALSFALINSPQSYRVIRIVKNL 120 I+ + + S+ N EEE E+++ +H EKLA++FA+ NSP +R IRI+K L Sbjct: 779 LIMIQIRMSLLNISNETN----AEEEKEELSGIHCEKLAIAFAISNSPP-FRSIRIIKTL 833 Query: 119 RMCEDCHRTAKLVSQKYEREIYIHDSKCLHHFKDGYCSC 3 RMC CH AKLVS+KYER+I I DS CLH F+DG CSC Sbjct: 834 RMCSHCHIFAKLVSEKYERQILIKDSNCLHKFEDGKCSC 872