BLASTX nr result
ID: Mentha26_contig00041167
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00041167
         (587 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU41456.1| hypothetical protein MIMGU_mgv1a001700mg [Mimulus...    168   1e-39
ref|XP_004247673.1| PREDICTED: pentatricopeptide repeat-containi...    156   3e-36
gb|EXB51133.1| hypothetical protein L484_009097 [Morus notabilis]      151   1e-34
ref|XP_002314675.1| pentatricopeptide repeat-containing family p...    151   1e-34
ref|XP_007225289.1| hypothetical protein PRUPE_ppa001360mg [Prun...    150   2e-34
ref|XP_004296686.1| PREDICTED: pentatricopeptide repeat-containi...    147   2e-33
ref|XP_006489434.1| PREDICTED: pentatricopeptide repeat-containi...    145   8e-33
emb|CBI26289.3| unnamed protein product [Vitis vinifera]               144   1e-32
ref|XP_002279134.1| PREDICTED: pentatricopeptide repeat-containi...    144   1e-32
ref|NP_001189950.1| uncharacterized protein [Arabidopsis thalian...    144   2e-32
ref|NP_188908.2| uncharacterized protein [Arabidopsis thaliana] ...    144   2e-32
ref|XP_006419998.1| hypothetical protein CICLE_v10004307mg [Citr...    140   2e-31
ref|XP_002885518.1| pentatricopeptide repeat-containing protein ...    138   1e-30
ref|XP_007034824.1| Regulation of chlorophyll biosynthetic proce...    138   1e-30
ref|XP_006406136.1| hypothetical protein EUTSA_v10020066mg [Eutr...    135   6e-30
ref|XP_003538647.2| PREDICTED: pentatricopeptide repeat-containi...    134   2e-29
ref|XP_007157109.1| hypothetical protein PHAVU_002G043500g [Phas...    127   2e-27
ref|XP_006299530.1| hypothetical protein CARUB_v10015702mg [Caps...    124   2e-26
ref|XP_006856643.1| hypothetical protein AMTR_s01859p00006880, p...
119 8e-25 gb|EXC26223.1| hypothetical protein L484_022794 [Morus notabilis] 114 2e-23 >gb|EYU41456.1| hypothetical protein MIMGU_mgv1a001700mg [Mimulus guttatus] Length = 770 Score = 168 bits (425), Expect = 1e-39 Identities = 79/118 (66%), Positives = 94/118 (79%) Frame = -2 Query: 355 MGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNF 176 MGS+ESLE+A KAF +F+N R+D + + +YLYNSLIRGNS G +AI +YV+ML D Sbjct: 1 MGSAESLEYALKAFTIFRNSREDCSGSKTYLYNSLIRGNSIAGDSREAISLYVNMLIDGV 60 Query: 175 KPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 +PDNYTFPFVL+AC K LFEG Q+H AVK GYH DVFVSNSLVYCYGECG+TDSA Sbjct: 61 EPDNYTFPFVLSACTKRLSLFEGLQVHASAVKMGYHEDVFVSNSLVYCYGECGETDSA 118 Score = 58.9 bits (141), Expect = 1e-06 Identities = 39/145 (26%), Positives = 67/145 (46%) Frame = -2 Query: 436 QIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYN 257 Q+HA K G D + L+ Y E G ++S A K F R + Sbjct: 85 QVHASAVKMGYHEDVFVSNSLVYCYGECGETDS---ARKVFDGMSERNV-------VSWT 134 Query: 256 SLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKT 77 SLI G + H +A+ ++ +M+ + +P+ T V+++CAK+ + G ++ + Sbjct: 135 SLICGYATKDWHQEAVSLFFEMVAEGIEPNEVTMTSVISSCAKSGDVDLGERVLDYLTGS 194 Query: 76 GYHSDVFVSNSLVYCYGECGDTDSA 2 G S+ + N+LV Y +CG D A Sbjct: 195 GLTSNAVMVNALVDMYMKCGAADKA 219 >ref|XP_004247673.1| PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like [Solanum lycopersicum] Length = 837 Score = 156 bits (395), Expect = 3e-36 Identities = 77/156 (49%), Positives = 106/156 (67%) Frame = -2 Query: 469 LKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRK 290 +KS KN+ EIKQ+HA +TK G DP L KLIAK SE+GS S+E+A+ AF F + + Sbjct: 30 IKSSKNLNEIKQLHAHFTKQGFNQDPGFLGKLIAKCSELGSYNSMEYAQIAFDSFCSGNE 89 Query: 289 DRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFE 110 + N +Y +NSLI+G S G A+L+YV M+ + +PD YTFP +L+ACAK+ R F Sbjct: 90 EGYDN-TYKFNSLIKGYSLAGLFHDAVLIYVRMVVECVEPDGYTFPLILSACAKDGRFFT 148 Query: 109 GSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 G Q+ G A+K G+ DVFV NS+++ YGECG+ D A 
Sbjct: 149 GIQVMGLALKWGFGDDVFVLNSVIHLYGECGEVDKA 184 >gb|EXB51133.1| hypothetical protein L484_009097 [Morus notabilis] Length = 845 Score = 151 bits (381), Expect = 1e-34 Identities = 76/159 (47%), Positives = 104/159 (65%) Frame = -2 Query: 478 NGHLKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKN 299 NG +CK ++E+KQ+H TK GL S +T+LIAK +EMG+SESL++A +AF+LFK Sbjct: 37 NGSFGNCKTMDELKQLHCDITKKGLNHRISSMTELIAKGAEMGTSESLDYARRAFELFKE 96 Query: 298 RRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSR 119 + + ++YNSL+RG S G +AI +YV ML PD YTFPFVL+ CAK Sbjct: 97 --DEASIGTLFMYNSLMRGYSSAGLGFEAISVYVQMLVLGITPDKYTFPFVLSGCAKAEA 154 Query: 118 LFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 EG Q+HG V+ G D+F+ NSL++ Y ECG+ DSA Sbjct: 155 FREGIQLHGAVVRMGLERDLFIGNSLIHFYAECGELDSA 193 >ref|XP_002314675.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222863715|gb|EEF00846.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 845 Score = 151 bits (381), Expect = 1e-34 Identities = 84/194 (43%), Positives = 120/194 (61%), Gaps = 1/194 (0%) Frame = -2 Query: 586 AAVIVSLLPTAVPSAVKTPIPN-LKFQLQSEPYPLKQNGHLKSCKNVEEIKQIHAQYTKH 410 A + +S L A P++V P N LK + P G K CK + E+KQ+H+Q TK+ Sbjct: 3 ATLHLSTLIPATPTSVALPNQNELKILTKHRSSP---TGSFKKCKTMTELKQLHSQITKN 59 Query: 409 GLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFV 230 GL P LT LI+ +EMG+ ESLE+A+KA +LF + Y+++SLIRG S Sbjct: 60 GLNHHPLSLTNLISSCTEMGTFESLEYAQKALELFIE--DNGIMGTHYMFSSLIRGFSAC 117 Query: 229 GSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVS 50 G +AI+++ ++ PDN+TFPFVL+AC K++ L EG Q+HG VK G+ D+FV Sbjct: 118 GLGYKAIVVFRQLMCMGAVPDNFTFPFVLSACTKSAALTEGFQVHGAIVKMGFERDMFVE 177 Query: 49 NSLVYCYGECGDTD 8 NSL++ YGECG+ D Sbjct: 178 NSLIHFYGECGEID 191 >ref|XP_007225289.1| hypothetical protein PRUPE_ppa001360mg [Prunus persica] gi|462422225|gb|EMJ26488.1| hypothetical protein PRUPE_ppa001360mg [Prunus persica] Length = 845 Score = 150 bits (380), Expect = 2e-34 Identities 
= 81/193 (41%), Positives = 116/193 (60%) Frame = -2 Query: 586 AAVIVSLLPTAVPSAVKTPIPNLKFQLQSEPYPLKQNGHLKSCKNVEEIKQIHAQYTKHG 407 A + +S L +A PS V P + + ++ G L++CK + E+KQ+H Q +K G Sbjct: 3 ATLQLSPLVSATPSFVA---PTNQRESKAMAKDTSPTGLLRNCKTMNEVKQLHCQISKKG 59 Query: 406 LIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVG 227 L PS +T LI +EMG+ ESL++A KAF LF + + +I ++YNSLIRG S G Sbjct: 60 LRNRPSTVTNLITTCAEMGTFESLDYARKAFNLFLEDEETK-GHILFMYNSLIRGYSSAG 118 Query: 226 SHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSN 47 D+A+L+YV M+ PD +TFPFVL+AC+K EG Q+HG VK G D F+ N Sbjct: 119 LSDEAVLLYVQMVVKGILPDKFTFPFVLSACSKVVAFSEGVQLHGALVKMGLEEDAFIEN 178 Query: 46 SLVYCYGECGDTD 8 SL++ Y E G+ D Sbjct: 179 SLIHFYAESGELD 191 Score = 56.6 bits (135), Expect = 5e-06 Identities = 47/159 (29%), Positives = 74/159 (46%), Gaps = 3/159 (1%) Frame = -2 Query: 469 LKSCKNV---EEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKN 299 L +C V E Q+H K GL D + LI Y+E G L+++ K F Sbjct: 146 LSACSKVVAFSEGVQLHGALVKMGLEEDAFIENSLIHFYAESGE---LDYSRKVFDGMAE 202 Query: 298 RRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSR 119 R NI + SLI G + +A+ ++ +M+ KP++ T V++ACAK Sbjct: 203 R------NI-VSWTSLICGYARRQFPKEAVSLFFEMVAAGIKPNSVTMVCVISACAKLKD 255 Query: 118 LFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 L ++ ++G + V N+LV Y +CG TD+A Sbjct: 256 LELSERVCAYIGESGVKVNTLVVNALVDMYMKCGATDAA 294 >ref|XP_004296686.1| PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like [Fragaria vesca subsp. 
vesca] Length = 843 Score = 147 bits (371), Expect = 2e-33 Identities = 74/166 (44%), Positives = 104/166 (62%) Frame = -2 Query: 511 QLQSEPYPLKQNGHLKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLE 332 Q +S+P LK+CK + ++KQ+H Q TK G PS +TKLI +E+G+ +SL+ Sbjct: 21 QNESKPINPSPTESLKNCKTINQVKQLHCQITKKGHSHRPSTVTKLIITCAEIGTLQSLD 80 Query: 331 FAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFP 152 +A KA LF +++ R + ++YNSLIRG S G D+AI +YV M+ PD +TFP Sbjct: 81 YARKALDLFLEQQETR--GVLFMYNSLIRGYSSAGLGDEAIGLYVQMVVQGVSPDKFTFP 138 Query: 151 FVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGD 14 F L+AC+K EG Q+HG VK G DVFV NSL++ Y ECG+ Sbjct: 139 FALSACSKVVAFCEGVQLHGSIVKMGLEGDVFVGNSLIHFYAECGE 184 Score = 57.4 bits (137), Expect = 3e-06 Identities = 44/159 (27%), Positives = 75/159 (47%), Gaps = 3/159 (1%) Frame = -2 Query: 469 LKSCKNVE---EIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKN 299 L +C V E Q+H K GL D + LI Y+E G + +A K F ++ Sbjct: 141 LSACSKVVAFCEGVQLHGSIVKMGLEGDVFVGNSLIHFYAECGE---MGYARKVFDEMRD 197 Query: 298 RRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSR 119 R + + SLI G +A+ ++ M+ + +P++ T V++ACAK Sbjct: 198 RN-------TVSWTSLICGYGRRSMPKEAVSLFFQMVGNGIEPNSVTMVCVISACAKLKD 250 Query: 118 LFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 + ++ ++G S++ + NSLV Y +CGDT +A Sbjct: 251 VGLSERVCDYIGESGMKSNMLMVNSLVDMYMKCGDTGTA 289 >ref|XP_006489434.1| PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like [Citrus sinensis] Length = 844 Score = 145 bits (366), Expect = 8e-33 Identities = 76/190 (40%), Positives = 115/190 (60%) Frame = -2 Query: 583 AVIVSLLPTAVPSAVKTPIPNLKFQLQSEPYPLKQNGHLKSCKNVEEIKQIHAQYTKHGL 404 A+ ++ P + + T + N + + ++ P G LK+CK + E+KQ+H K GL Sbjct: 2 ALTLNPSPLVLATPTVTTLTN-QHEAKTTPKDSPSIGSLKNCKTLNELKQLHCHILKQGL 60 Query: 403 IADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVGS 224 PS ++K+++ ++MG+ ESL +A+KAF + + + TS ++YNSLIRG S +G Sbjct: 61 GHKPSYISKVVSTCAQMGTFESLTYAQKAFDYYI--KDNETSATLFMYNSLIRGYSCIGL 118 Query: 223 
HDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNS 44 +AI +YV++ PD +TFPFVL AC K+S EG Q+HG VK G+ DVFV N Sbjct: 119 GVEAISLYVELAGFGILPDKFTFPFVLNACTKSSAFGEGVQVHGAIVKMGFDRDVFVENC 178 Query: 43 LVYCYGECGD 14 L+ YGECGD Sbjct: 179 LINFYGECGD 188 >emb|CBI26289.3| unnamed protein product [Vitis vinifera] Length = 668 Score = 144 bits (364), Expect = 1e-32 Identities = 74/157 (47%), Positives = 100/157 (63%) Frame = -2 Query: 478 NGHLKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKN 299 N L+ CK + ++KQ+H Q TK+GL PS LTKL+ +E+ S ESL++A KAF+LFK Sbjct: 29 NESLRCCKTLNQLKQLHCQITKNGLDQIPSTLTKLVNAGAEIASPESLDYARKAFELFKE 88 Query: 298 RRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSR 119 R+ + ++ NSLIRG S G +AIL+YV ML P++YTFPFVL+ C K + Sbjct: 89 --DVRSDDALFMLNSLIRGYSSAGLGREAILLYVRMLVLGVTPNHYTFPFVLSGCTKIAA 146 Query: 118 LFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTD 8 EG Q+HG VK G DVF+ N L++ Y ECG D Sbjct: 147 FCEGIQVHGSVVKMGLEEDVFIQNCLIHFYAECGHMD 183 >ref|XP_002279134.1| PREDICTED: pentatricopeptide repeat-containing protein At3g22690 [Vitis vinifera] Length = 836 Score = 144 bits (364), Expect = 1e-32 Identities = 74/157 (47%), Positives = 100/157 (63%) Frame = -2 Query: 478 NGHLKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKN 299 N L+ CK + ++KQ+H Q TK+GL PS LTKL+ +E+ S ESL++A KAF+LFK Sbjct: 29 NESLRCCKTLNQLKQLHCQITKNGLDQIPSTLTKLVNAGAEIASPESLDYARKAFELFKE 88 Query: 298 RRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSR 119 R+ + ++ NSLIRG S G +AIL+YV ML P++YTFPFVL+ C K + Sbjct: 89 --DVRSDDALFMLNSLIRGYSSAGLGREAILLYVRMLVLGVTPNHYTFPFVLSGCTKIAA 146 Query: 118 LFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTD 8 EG Q+HG VK G DVF+ N L++ Y ECG D Sbjct: 147 FCEGIQVHGSVVKMGLEEDVFIQNCLIHFYAECGHMD 183 >ref|NP_001189950.1| uncharacterized protein [Arabidopsis thaliana] gi|75274240|sp|Q9LUJ2.1|PP249_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g22690 gi|9279687|dbj|BAB01244.1| unnamed protein product [Arabidopsis thaliana] gi|332643145|gb|AEE76666.1| 
uncharacterized protein AT3G22690 [Arabidopsis thaliana] Length = 842 Score = 144 bits (362), Expect = 2e-32 Identities = 72/156 (46%), Positives = 99/156 (63%) Frame = -2 Query: 469 LKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRK 290 LK+CK ++E+K H TK GL D S +TKL+A+ E+G+ ESL FA++ F+ Sbjct: 39 LKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAKEVFE------N 92 Query: 289 DRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFE 110 + ++YNSLIRG + G ++AIL+++ M+ PD YTFPF L+ACAK+ Sbjct: 93 SESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGN 152 Query: 109 GSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 G QIHG VK GY D+FV NSLV+ Y ECG+ DSA Sbjct: 153 GIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSA 188 >ref|NP_188908.2| uncharacterized protein [Arabidopsis thaliana] gi|332643144|gb|AEE76665.1| uncharacterized protein AT3G22690 [Arabidopsis thaliana] Length = 938 Score = 144 bits (362), Expect = 2e-32 Identities = 72/156 (46%), Positives = 99/156 (63%) Frame = -2 Query: 469 LKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRK 290 LK+CK ++E+K H TK GL D S +TKL+A+ E+G+ ESL FA++ F+ Sbjct: 39 LKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAKEVFE------N 92 Query: 289 DRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFE 110 + ++YNSLIRG + G ++AIL+++ M+ PD YTFPF L+ACAK+ Sbjct: 93 SESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGN 152 Query: 109 GSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 G QIHG VK GY D+FV NSLV+ Y ECG+ DSA Sbjct: 153 GIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSA 188 >ref|XP_006419998.1| hypothetical protein CICLE_v10004307mg [Citrus clementina] gi|557521871|gb|ESR33238.1| hypothetical protein CICLE_v10004307mg [Citrus clementina] Length = 844 Score = 140 bits (354), Expect = 2e-31 Identities = 74/190 (38%), Positives = 115/190 (60%) Frame = -2 Query: 583 AVIVSLLPTAVPSAVKTPIPNLKFQLQSEPYPLKQNGHLKSCKNVEEIKQIHAQYTKHGL 404 A+ ++ P + + T + N + + ++ P G LK+ K + E+KQ+H K GL Sbjct: 2 ALTLNPSPLVLATPTVTTLTN-QHKAKTTPKDSPSIGSLKNYKTLNELKQLHCHILKQGL 60 Query: 403 
IADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVGS 224 PS ++K+++ ++MG+ ESL +A+KAF + + + TS ++YNSLIRG S +G Sbjct: 61 GHKPSYISKVVSTCAQMGTFESLTYAQKAFDYYI--KDNETSATLFMYNSLIRGYSCIGL 118 Query: 223 HDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNS 44 +AI +YV+++ PD +TFPFVL AC K+S E Q+HG VK G+ DVFV N Sbjct: 119 GVEAISLYVELVGFGILPDKFTFPFVLNACTKSSAFGEAVQVHGAIVKMGFDRDVFVENC 178 Query: 43 LVYCYGECGD 14 L++ YGECGD Sbjct: 179 LIHFYGECGD 188 >ref|XP_002885518.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297331358|gb|EFH61777.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 904 Score = 138 bits (348), Expect = 1e-30 Identities = 73/182 (40%), Positives = 104/182 (57%) Frame = -2 Query: 547 SAVKTPIPNLKFQLQSEPYPLKQNGHLKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIA 368 S K +PN + ++ P LK+CK ++E+K H TK GL D S +TKL+A Sbjct: 18 STSKPSLPNQSKRTKATP------SSLKNCKTIDELKMFHLSLTKQGLDDDVSAITKLVA 71 Query: 367 KYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDML 188 + E+G+ ESL FA++ F+ + ++YNSLIRG + G +AIL+++ M+ Sbjct: 72 RSCELGTRESLSFAKEVFE------NGESYGTCFMYNSLIRGYASSGLCKEAILLFIRMM 125 Query: 187 TDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTD 8 PD YTFPF L+ CAK+ G QIHG +K Y D+FV NSLV+ Y ECG+ D Sbjct: 126 NSGISPDKYTFPFGLSVCAKSRDKGNGIQIHGLIIKMDYAKDLFVQNSLVHFYAECGELD 185 Query: 7 SA 2 A Sbjct: 186 CA 187 >ref|XP_007034824.1| Regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage [Theobroma cacao] gi|508713853|gb|EOY05750.1| Regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage [Theobroma cacao] Length = 841 Score = 138 bits (347), Expect = 1e-30 Identities = 
68/154 (44%), Positives = 100/154 (64%) Frame = -2 Query: 475 GHLKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNR 296 G L SC ++ E+K++H Q TK GLI PS +TKLI+ ++MG+ +S+ +A K F R Sbjct: 35 GSLYSCNHLTELKKLHCQITKQGLIHHPSSITKLISTCTQMGTFDSVIYARKILNQF--R 92 Query: 295 RKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRL 116 + ++ ++YNSLIRG S + ++AI +Y++ML PD YTFPF+L+AC K S Sbjct: 93 QDNQNDGTLFMYNSLIRGYSSIDLGNEAIWVYLEMLELGISPDKYTFPFLLSACTKISAR 152 Query: 115 FEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGD 14 EG Q+HG VK G+ D+FV NSL++ ECG+ Sbjct: 153 AEGLQVHGSVVKMGFQGDIFVLNSLIHFSSECGE 186 >ref|XP_006406136.1| hypothetical protein EUTSA_v10020066mg [Eutrema salsugineum] gi|557107282|gb|ESQ47589.1| hypothetical protein EUTSA_v10020066mg [Eutrema salsugineum] Length = 836 Score = 135 bits (341), Expect = 6e-30 Identities = 68/156 (43%), Positives = 98/156 (62%) Frame = -2 Query: 469 LKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRK 290 LK+CK V+++K H K GL D S +TKL+A+ E+G+ ESL FA + LF ++ Sbjct: 34 LKNCKTVDQLKMFHRSLAKQGLENDVSSITKLVARSCELGTRESLSFARE---LFDSKGN 90 Query: 289 DRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFE 110 + ++YNSLIRG + G ++A+ +++ M+ D PD YTFPF L+ACAK+ + Sbjct: 91 GESYGSRFMYNSLIRGYASSGLCEEALSLFLRMMVDGISPDKYTFPFGLSACAKSRANRD 150 Query: 109 GSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 G QIHG VK Y D+FV NSL++ Y ECG+ D A Sbjct: 151 GIQIHGLIVKMDYAKDMFVQNSLLHFYAECGELDLA 186 >ref|XP_003538647.2| PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like [Glycine max] Length = 854 Score = 134 bits (337), Expect = 2e-29 Identities = 75/171 (43%), Positives = 102/171 (59%), Gaps = 7/171 (4%) Frame = -2 Query: 499 EPYPLKQNGHLK---SCKNVEEIKQIHAQYTKHGLIADP--SLLTKLIAKYSEMGSSESL 335 E P+ +N K +CK ++E+KQ+H K GL+ S L KLIA ++G+ ESL Sbjct: 37 EANPITRNSSSKLLVNCKTLKELKQLHCDMMKKGLLCHKPASNLNKLIASSVQIGTLESL 96 Query: 334 EFAEKAFKLFKNRRKDRTSNIS--YLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNY 161 ++A AF D N++ ++YN LIRG + G DQAIL+YV ML PD Y Sbjct: 97 
DYARNAFG-------DDDGNMASLFMYNCLIRGYASAGLGDQAILLYVQMLVMGIVPDKY 149 Query: 160 TFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTD 8 TFPF+L+AC+K L EG Q+HG +K G D+FVSNSL++ Y ECG D Sbjct: 150 TFPFLLSACSKILALSEGVQVHGAVLKMGLEGDIFVSNSLIHFYAECGKVD 200 >ref|XP_007157109.1| hypothetical protein PHAVU_002G043500g [Phaseolus vulgaris] gi|561030524|gb|ESW29103.1| hypothetical protein PHAVU_002G043500g [Phaseolus vulgaris] Length = 838 Score = 127 bits (319), Expect = 2e-27 Identities = 77/196 (39%), Positives = 109/196 (55%), Gaps = 7/196 (3%) Frame = -2 Query: 574 VSLLPTAVPSAVKTPIPNLKFQLQSEPYPLKQNGHLK---SCKNVEEIKQIHAQYTKHGL 404 +++ T PS++ +LK E PL N K +CK + E+KQ+H K GL Sbjct: 1 MAMATTLHPSSIVLVPTSLK-----EAKPLTTNSSQKLLANCKTLNELKQLHCDMMKKGL 55 Query: 403 IADPS--LLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLY--NSLIRGNS 236 P + KLIA ++G+ ESL++A AF+ D I +Y N LIRG + Sbjct: 56 CHKPGGDHINKLIAACVQIGTLESLDYAGNAFQ-------DDDDGIPSVYVCNCLIRGYA 108 Query: 235 FVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVF 56 G ++AIL+Y+ M+ PDNYTFPF+L+AC+K + L EG Q+HG VK G D+F Sbjct: 109 SAGLCEKAILLYIQMVGMGIVPDNYTFPFLLSACSKTTALSEGVQVHGVVVKMGLDGDIF 168 Query: 55 VSNSLVYCYGECGDTD 8 VSNS ++ Y ECG D Sbjct: 169 VSNSFIHFYAECGKVD 184 >ref|XP_006299530.1| hypothetical protein CARUB_v10015702mg [Capsella rubella] gi|482568239|gb|EOA32428.1| hypothetical protein CARUB_v10015702mg [Capsella rubella] Length = 844 Score = 124 bits (311), Expect = 2e-26 Identities = 65/156 (41%), Positives = 92/156 (58%) Frame = -2 Query: 469 LKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRK 290 LK+CK ++E++ H LTKL+A+ ++G+ ESL FA++ F + + Sbjct: 49 LKNCKTIDELRMFHR------------CLTKLVARSCDLGTRESLSFAKEVFDYSEGNGE 96 Query: 289 DRTSNISYLYNSLIRGNSFVGSHDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFE 110 S ++YNS+IRG + G D+AIL+++ M+ PD YTFPF L+ACAK Sbjct: 97 SYGS--CFMYNSMIRGYASAGLCDEAILLFLRMMNSGISPDKYTFPFGLSACAKRRAKGN 154 Query: 109 GSQIHGCAVKTGYHSDVFVSNSLVYCYGECGDTDSA 2 G QIHG VK Y D+FV NSLV+ Y ECG+ DSA Sbjct: 155 
GIQIHGLIVKMDYAKDLFVQNSLVHFYAECGELDSA 190 >ref|XP_006856643.1| hypothetical protein AMTR_s01859p00006880, partial [Amborella trichopoda] gi|548860532|gb|ERN18110.1| hypothetical protein AMTR_s01859p00006880, partial [Amborella trichopoda] Length = 190 Score = 119 bits (297), Expect = 8e-25 Identities = 66/190 (34%), Positives = 98/190 (51%), Gaps = 12/190 (6%) Frame = -2 Query: 547 SAVKTPIPNLKFQLQSEPYPLKQNGH------------LKSCKNVEEIKQIHAQYTKHGL 404 +A+ TP P L Q+ P P N L+ CKN +++ QIHA + GL Sbjct: 2 AAMATPQPKLSLSTQTNPKPNNSNSSSKQFSDHPSLILLERCKNTKQLPQIHAHLIRLGL 61 Query: 403 IADPSLLTKLIAKYSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVGS 224 I P L++L+ + S +L +A K F+ Y+YN++IR ++ S Sbjct: 62 IFHPYPLSRLLTISALSNSENALSYALKIFEQIPQPNL-------YMYNTIIRAHASSRS 114 Query: 223 HDQAILMYVDMLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNS 44 + A+L+Y +ML N P+ +TFPF+L A AK L EG +HG +K G SD FV NS Sbjct: 115 PENALLLYTEMLHQNIDPNKFTFPFLLKAIAKIPALLEGKTVHGMVLKAGLSSDAFVQNS 174 Query: 43 LVYCYGECGD 14 L++ Y CG+ Sbjct: 175 LIHFYANCGN 184 >gb|EXC26223.1| hypothetical protein L484_022794 [Morus notabilis] Length = 605 Score = 114 bits (286), Expect = 2e-23 Identities = 70/184 (38%), Positives = 104/184 (56%), Gaps = 5/184 (2%) Frame = -2 Query: 538 KTPIPNLKFQLQSEPYPLKQN---GHLKSCKNVEEIKQIHAQYTKHGLIADPSLLTKLIA 368 K PI + +F L LK+ LK CK+V E+KQIH Q K GL+ D L+A Sbjct: 17 KEPIQSPEFHLS-----LKEQECLSLLKRCKSVRELKQIHVQILKIGLLGDSFCAGNLVA 71 Query: 367 K--YSEMGSSESLEFAEKAFKLFKNRRKDRTSNISYLYNSLIRGNSFVGSHDQAILMYVD 194 S+ GS + A +F++ ++ +T +L+N+++RG+ G+ QA+++Y D Sbjct: 72 TCALSDWGSMDY------ACSIFRHVKEPQT----FLFNTMMRGHVKDGNWGQALILYFD 121 Query: 193 MLTDNFKPDNYTFPFVLTACAKNSRLFEGSQIHGCAVKTGYHSDVFVSNSLVYCYGECGD 14 ML +PDN+T+P +L ACA+ S EG QIHG K G D+FV NSL+ YG+CG Sbjct: 122 MLKSGVEPDNFTYPVLLKACARLSATEEGMQIHGHTSKLGLQGDLFVQNSLINMYGKCGK 181 Query: 13 TDSA 2 + A Sbjct: 182 IELA 185