BLASTX nr result
ID: Sinomenium22_contig00011122
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium22_contig00011122 (1340 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EXC20898.1| Clp protease-related protein [Morus notabilis] 310 1e-81 ref|XP_007038471.1| Double Clp-N motif protein [Theobroma cacao]... 304 5e-80 ref|XP_002268037.2| PREDICTED: clp protease-related protein At4g... 303 9e-80 emb|CAN60931.1| hypothetical protein VITISV_006813 [Vitis vinifera] 303 9e-80 gb|EYU25008.1| hypothetical protein MIMGU_mgv1a012964mg [Mimulus... 300 7e-79 ref|XP_007218298.1| hypothetical protein PRUPE_ppa010759mg [Prun... 299 2e-78 ref|XP_006421758.1| hypothetical protein CICLE_v10005783mg [Citr... 293 1e-76 ref|XP_006490254.1| PREDICTED: clp protease-related protein At4g... 293 2e-76 ref|XP_004306484.1| PREDICTED: clp protease-related protein At4g... 289 2e-75 ref|XP_006856238.1| hypothetical protein AMTR_s00059p00213590 [A... 288 4e-75 ref|XP_004148125.1| PREDICTED: clp protease-related protein At4g... 283 1e-73 ref|XP_007040016.1| Double Clp-N motif protein [Theobroma cacao]... 272 2e-70 ref|XP_006348058.1| PREDICTED: clp protease-related protein At4g... 271 4e-70 ref|NP_001242487.1| uncharacterized protein LOC100786582 [Glycin... 271 4e-70 ref|XP_006413313.1| hypothetical protein EUTSA_v10026111mg [Eutr... 271 5e-70 ref|XP_003534040.1| PREDICTED: clp protease-related protein At4g... 271 6e-70 ref|XP_002510906.1| ATP-dependent clp protease, putative [Ricinu... 268 4e-69 ref|XP_006285475.1| hypothetical protein CARUB_v10006894mg [Caps... 267 9e-69 ref|XP_006587382.1| PREDICTED: clp protease-related protein At4g... 266 2e-68 gb|EPS73881.1| hypothetical protein M569_00876 [Genlisea aurea] 265 3e-68 >gb|EXC20898.1| Clp protease-related protein [Morus notabilis] Length = 257 Score = 310 bits (793), Expect = 1e-81 Identities = 163/240 (67%), Positives = 190/240 (79%), Gaps = 1/240 (0%) Frame = +3 Query: 222 MAARNLSVFTI-LASSSHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVP 398 MA+ LS I L++ S G+ + SP +LPW + L S G +L ++ TN Sbjct: 1 MASHTLSAIVIPLSALQPSQGNYSSASSP--LLPWRLHPTNLSTSCVGRQLLIRPTNFKN 58 Query: 399 VIRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEA 578 + K+R + TV FSLPTAK +RAS +K P WSARAIKSFAMAELEARKLKYPNTGTEA Sbjct: 59 LASKRR-PAIATVLFSLPTAKTDRASNEKIPNWSARAIKSFAMAELEARKLKYPNTGTEA 117 Query: 579 FLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDW 758 LMGILVEGTS AAKFLRANGITLFKVREET+NLLGKSD+YFFSPEHPPLTEPAQRALDW Sbjct: 118 LLMGILVEGTSVAAKFLRANGITLFKVREETVNLLGKSDLYFFSPEHPPLTEPAQRALDW 177 Query: 759 AVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSY 938 AVD+KLKSGESGEITT+HLLLGIWSEKE AGHKI+A+LGF+D KA ELAKS ++ ++S+ Sbjct: 178 AVDQKLKSGESGEITTAHLLLGIWSEKESAGHKILASLGFNDEKAKELAKSETKERIISF 237 >ref|XP_007038471.1| Double Clp-N motif protein [Theobroma cacao] gi|508775716|gb|EOY22972.1| Double Clp-N motif protein [Theobroma cacao] Length = 230 Score = 304 bits (779), Expect = 5e-80 Identities = 166/240 (69%), Positives = 185/240 (77%) Frame = +3 Query: 222 MAARNLSVFTILASSSHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVPV 401 MA LS I S+S S S R SL L +SL GN+L L+ ++ Sbjct: 1 MATYRLSFLPISISASQSLPSKRHDFRLSLPL----------SSLYGNKLLLKTSDLSLF 50 Query: 402 IRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAF 581 + K ST TVSFSLPTAKPERA ++K PKWSAR+IKSFAMAELEARKLKYPNTGTEA Sbjct: 51 VTKHHSSTTATVSFSLPTAKPERAPSEKSPKWSARSIKSFAMAELEARKLKYPNTGTEAL 110 Query: 582 LMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDWA 761 LMGILVEGTS AAKFLR NGITLFKVREET+NLLGKSDMYFFSPEHPPLTE AQRALDWA Sbjct: 111 LMGILVEGTSQAAKFLRDNGITLFKVREETVNLLGKSDMYFFSPEHPPLTEQAQRALDWA 170 Query: 762 VDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSYR 941 VDEKLKSGESGEITT++LLLGIWSEKE AG+KI+A LGF+D KA EL K +ED VL+Y+ Sbjct: 171 VDEKLKSGESGEITTTYLLLGIWSEKESAGYKILATLGFNDEKAKELTKYINEDIVLNYK 230 >ref|XP_002268037.2| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like [Vitis vinifera] gi|296084123|emb|CBI24511.3| unnamed protein product [Vitis vinifera] Length = 231 Score = 303 bits (777), Expect = 9e-80 Identities = 155/217 (71%), Positives = 181/217 (83%) Frame = +3 Query: 288 RAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVPVIRKQRCSTVMTVSFSLPTAKPE 467 R P+ L ++ GL +SL G +L++Q ++S V+ + STV TV SLPTAKPE Sbjct: 15 RNPNDPASPLLLSLHKNGLFSSLIGQKLSIQSSHSRLVVSTRYRSTVATVLLSLPTAKPE 74 Query: 468 RASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGIT 647 R S++K PKWSARAIKSF+MAELEARKLKYPNTGTEA LMGILVEGTS AAKFLRANGIT Sbjct: 75 RTSSEKVPKWSARAIKSFSMAELEARKLKYPNTGTEALLMGILVEGTSLAAKFLRANGIT 134 Query: 648 LFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGI 827 LFKVREET+NLLGKSD+YFFSPEHPPLTEPAQRALDWAVDEK+KSGE GEITTSHLLLGI Sbjct: 135 LFKVREETVNLLGKSDLYFFSPEHPPLTEPAQRALDWAVDEKIKSGEEGEITTSHLLLGI 194 Query: 828 WSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSY 938 W+E+E AGHKI+A LGF+D +A ELAKS +++ LS+ Sbjct: 195 WAEEESAGHKILATLGFNDDQAKELAKSINKETDLSF 231 >emb|CAN60931.1| hypothetical protein VITISV_006813 [Vitis vinifera] Length = 231 Score = 303 bits (777), Expect = 9e-80 Identities = 155/217 (71%), Positives = 181/217 (83%) Frame = +3 Query: 288 RAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVPVIRKQRCSTVMTVSFSLPTAKPE 467 R P+ L ++ GL +SL G +L++Q ++S V+ + STV TV SLPTAKPE Sbjct: 15 RNPNDPASPLLLSLHKNGLXSSLIGQKLSIQSSHSRLVVSTRYRSTVATVLLSLPTAKPE 74 Query: 468 RASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGIT 647 R S++K PKWSARAIKSF+MAELEARKLKYPNTGTEA LMGILVEGTS AAKFLRANGIT Sbjct: 75 RTSSEKVPKWSARAIKSFSMAELEARKLKYPNTGTEALLMGILVEGTSLAAKFLRANGIT 134 Query: 648 LFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGI 827 LFKVREET+NLLGKSD+YFFSPEHPPLTEPAQRALDWAVDEK+KSGE GEITTSHLLLGI Sbjct: 135 LFKVREETVNLLGKSDLYFFSPEHPPLTEPAQRALDWAVDEKIKSGEEGEITTSHLLLGI 194 Query: 828 WSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSY 938 W+E+E AGHKI+A LGF+D +A ELAKS +++ LS+ Sbjct: 195 WAEEESAGHKILATLGFNDDQAKELAKSINKETDLSF 231 >gb|EYU25008.1| hypothetical protein MIMGU_mgv1a012964mg [Mimulus guttatus] Length = 234 Score = 300 bits (769), Expect = 7e-79 Identities = 157/239 (65%), Positives = 187/239 (78%) Frame = +3 Query: 225 AARNLSVFTILASSSHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVPVI 404 AA+ LSVF I SSS + S R K L+L +N T S G +L++Q + + Sbjct: 3 AAQGLSVFPITPSSSATNQSCR--KPFELVLSFNPST-----SFTGTKLSVQPLSFNQMA 55 Query: 405 RKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAFL 584 K+R S V T+SFSLPT E ++DK PKWSAR+IKSFAM ELEARKLKYP+TGTEA L Sbjct: 56 SKRRSSAVATISFSLPTTNKEGIASDKMPKWSARSIKSFAMGELEARKLKYPSTGTEALL 115 Query: 585 MGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDWAV 764 MG+LVEGTS AAKFLR NGITLFKVR+E ++LLGKSDMYFFSPEHPPLTEPAQRALDWAV Sbjct: 116 MGVLVEGTSFAAKFLRENGITLFKVRDEIVSLLGKSDMYFFSPEHPPLTEPAQRALDWAV 175 Query: 765 DEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSYR 941 +EKLKSG+SGEIT++HL+LGIWS+KE AGHKIMA+ GFDD KA ELAK+ D+D + S+R Sbjct: 176 EEKLKSGDSGEITSAHLVLGIWSQKESAGHKIMASFGFDDEKAEELAKNMDKDVIFSFR 234 >ref|XP_007218298.1| hypothetical protein PRUPE_ppa010759mg [Prunus persica] gi|462414760|gb|EMJ19497.1| hypothetical protein PRUPE_ppa010759mg [Prunus persica] Length = 237 Score = 299 bits (766), Expect = 2e-78 Identities = 160/243 (65%), Positives = 187/243 (76%), Gaps = 2/243 (0%) Frame = +3 Query: 216 VAMAARNLSVFTILASSS--HSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTN 389 +A + LS +I S+S H S + P + P+N+ T + G +L+++ N Sbjct: 1 MASTSITLSALSISPSTSQLHRNPSASSPSLPCHLPPYNLST-----TFMGKKLSIRVPN 55 Query: 390 SVPVIRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTG 569 + K R + V TV FSLPTAKP+R ST K PKWSARAIKSFAM ELEARKLKYPNTG Sbjct: 56 LNHLASKHR-TAVATVLFSLPTAKPDRNSTGKSPKWSARAIKSFAMGELEARKLKYPNTG 114 Query: 570 TEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRA 749 TEA LMGILVEGTS AAKFLRANGITLFKVR+ET+NLLGKSD+YFFSPEHPPLTEPAQRA Sbjct: 115 TEALLMGILVEGTSLAAKFLRANGITLFKVRDETVNLLGKSDLYFFSPEHPPLTEPAQRA 174 Query: 750 LDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAV 929 LDWAVD+KLKSGE+GEIT +HLLLGIWSEKE AGHKI+A+LGFD+ KA EL+KS D D V Sbjct: 175 LDWAVDQKLKSGENGEITVTHLLLGIWSEKESAGHKILASLGFDEEKAKELSKSMDSDYV 234 Query: 930 LSY 938 S+ Sbjct: 235 PSF 237 >ref|XP_006421758.1| hypothetical protein CICLE_v10005783mg [Citrus clementina] gi|557523631|gb|ESR34998.1| hypothetical protein CICLE_v10005783mg [Citrus clementina] Length = 233 Score = 293 bits (750), Expect = 1e-76 Identities = 151/201 (75%), Positives = 172/201 (85%), Gaps = 3/201 (1%) Frame = +3 Query: 348 NSLPGNRLAL--QFTNSVPVIRKQRCSTVMTVSFSLPTA-KPERASTDKFPKWSARAIKS 518 +S+ GN+L + Q ++S V + R S TVSFSLPT KPE AS DK PKWSARAI+S Sbjct: 33 SSMFGNKLLIRPQLSSSRFVTKYHRSSATATVSFSLPTTVKPETASPDKIPKWSARAIRS 92 Query: 519 FAMAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDM 698 FAMAELEARKLKYPNTGTEAFLMGILVEGTS+ AKFLRANGITLFKVREET+NLLGKSD+ Sbjct: 93 FAMAELEARKLKYPNTGTEAFLMGILVEGTSTTAKFLRANGITLFKVREETLNLLGKSDL 152 Query: 699 YFFSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGF 878 +FFSPE PPLTE AQRALDWA +EKLKSGESGEITT+HLLLGIWSEKE AGHKI+A LGF Sbjct: 153 FFFSPERPPLTEQAQRALDWAFNEKLKSGESGEITTNHLLLGIWSEKESAGHKILATLGF 212 Query: 879 DDTKAAELAKSADEDAVLSYR 941 +D KA E+AKS +ED +LS++ Sbjct: 213 NDEKAKEIAKSINEDTILSFK 233 >ref|XP_006490254.1| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like [Citrus sinensis] Length = 233 Score = 293 bits (749), Expect = 2e-76 Identities = 151/201 (75%), Positives = 171/201 (85%), Gaps = 3/201 (1%) Frame = +3 Query: 348 NSLPGNRLAL--QFTNSVPVIRKQRCSTVMTVSFSLPTA-KPERASTDKFPKWSARAIKS 518 +S+ GN+L + Q +S V + R S TVSFSLPT KPE AS DK PKWSARAI+S Sbjct: 33 SSMFGNKLLIRPQLNSSRFVTKYHRSSATATVSFSLPTTVKPETASPDKIPKWSARAIRS 92 Query: 519 FAMAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDM 698 FAMAELEARKLKYPNTGTEAFLMGILVEGTS+ AKFLRANGITLFKVREET+NLLGKSD+ Sbjct: 93 FAMAELEARKLKYPNTGTEAFLMGILVEGTSTTAKFLRANGITLFKVREETLNLLGKSDL 152 Query: 699 YFFSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGF 878 +FFSPE PPLTE AQRALDWA +EKLKSGESGEITT+HLLLGIWSEKE AGHKI+A LGF Sbjct: 153 FFFSPERPPLTEQAQRALDWAFNEKLKSGESGEITTNHLLLGIWSEKESAGHKILATLGF 212 Query: 879 DDTKAAELAKSADEDAVLSYR 941 +D KA E+AKS +ED +LS++ Sbjct: 213 NDEKAKEIAKSINEDTILSFK 233 >ref|XP_004306484.1| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 225 Score = 289 bits (740), Expect = 2e-75 Identities = 152/215 (70%), Positives = 171/215 (79%) Frame = +3 Query: 297 KSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVPVIRKQRCSTVMTVSFSLPTAKPERAS 476 + PS P N+ T L G +L+++ +S K + V TV FSLPT KPER S Sbjct: 17 RKPSNSTPCNLSTSFL-----GRKLSIEIPHSNKFASKHP-TPVATVLFSLPTGKPERIS 70 Query: 477 TDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFK 656 + K +WSARAIKSFAM ELEARKLKYPNTGTEA LMGILVEGTS AAKFLRANGITLFK Sbjct: 71 SGKTSQWSARAIKSFAMGELEARKLKYPNTGTEALLMGILVEGTSIAAKFLRANGITLFK 130 Query: 657 VREETINLLGKSDMYFFSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGIWSE 836 VREET+ LLGKSDMYFFSPEHPPLTEPAQRALDWAVD+KLKSG+SGEIT SHLLLGIWSE Sbjct: 131 VREETVKLLGKSDMYFFSPEHPPLTEPAQRALDWAVDQKLKSGDSGEITVSHLLLGIWSE 190 Query: 837 KEPAGHKIMAALGFDDTKAAELAKSADEDAVLSYR 941 KE AGHKI+ +LGFDD KA EL+ S D+D VLS++ Sbjct: 191 KESAGHKILVSLGFDDEKAKELSVSMDKDYVLSFK 225 >ref|XP_006856238.1| hypothetical protein AMTR_s00059p00213590 [Amborella trichopoda] gi|548860097|gb|ERN17705.1| hypothetical protein AMTR_s00059p00213590 [Amborella trichopoda] Length = 241 Score = 288 bits (737), Expect = 4e-75 Identities = 157/242 (64%), Positives = 183/242 (75%), Gaps = 4/242 (1%) Frame = +3 Query: 222 MAARNLSVFTILASSSHSGGSTRAGKSPSLILPWNVKTE----GLVNSLPGNRLALQFTN 389 MAA+ L+ ++L GS + + P L +K+E GL SL L + Sbjct: 1 MAAQALTSSSLLTHCWLISGSNKT-RIPFLSRHGELKSEALGLGLGLSLSTQSLVYRSRV 59 Query: 390 SVPVIRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTG 569 SVP + K+RCS + TV LPTAKPERAS+ K P+WSARAIKSF MAELEARKLKYP TG Sbjct: 60 SVPYV-KRRCS-ITTVFMMLPTAKPERASSGKVPRWSARAIKSFGMAELEARKLKYPKTG 117 Query: 570 TEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRA 749 TE LMGILVEGTS AAKFLR+NGITLFK+R+ET+ LLGKS+MYFFSPEHPPLTEPAQRA Sbjct: 118 TETLLMGILVEGTSLAAKFLRSNGITLFKMRDETVKLLGKSEMYFFSPEHPPLTEPAQRA 177 Query: 750 LDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAV 929 LDWAVDEK+KSGE GE+T +HLLLGIWS+KE AGHKIMA L FDD KA ELAKS D+D + Sbjct: 178 LDWAVDEKMKSGEDGEVTNTHLLLGIWSQKESAGHKIMATLAFDDKKAEELAKSMDKDVI 237 Query: 930 LS 935 L+ Sbjct: 238 LT 239 >ref|XP_004148125.1| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like [Cucumis sativus] gi|449499662|ref|XP_004160878.1| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like [Cucumis sativus] Length = 234 Score = 283 bits (724), Expect = 1e-73 Identities = 143/193 (74%), Positives = 164/193 (84%), Gaps = 1/193 (0%) Frame = +3 Query: 366 RLALQFTNSV-PVIRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEA 542 +LA++ +N+ PV++ R +T TVSFSLP +KPE +K PKWSARAIKSFAM ELEA Sbjct: 43 KLAIKRSNATHPVLKFSRRATTATVSFSLPASKPEGVPPEKLPKWSARAIKSFAMGELEA 102 Query: 543 RKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHP 722 RKLKYPNTGTEA LMGIL+EGTS+AAKFLRANGITLFKVREET+ LLGK+DMYF SPEHP Sbjct: 103 RKLKYPNTGTEALLMGILIEGTSTAAKFLRANGITLFKVREETVKLLGKADMYFCSPEHP 162 Query: 723 PLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAEL 902 PLTEPAQ+ALDWAV EKLKSG+SGEITT HLLLGIWSE E AG KI+A LGFDD KA E+ Sbjct: 163 PLTEPAQKALDWAVAEKLKSGQSGEITTGHLLLGIWSE-ESAGRKILATLGFDDEKAKEI 221 Query: 903 AKSADEDAVLSYR 941 AK+ D+DA SY+ Sbjct: 222 AKTVDKDATFSYK 234 >ref|XP_007040016.1| Double Clp-N motif protein [Theobroma cacao] gi|508777261|gb|EOY24517.1| Double Clp-N motif protein [Theobroma cacao] Length = 233 Score = 272 bits (696), Expect = 2e-70 Identities = 142/199 (71%), Positives = 161/199 (80%), Gaps = 1/199 (0%) Frame = +3 Query: 327 VKTEGLVNSLPGNRLALQFTNSVPVIRKQRCSTVMTVSFSLPTAKPER-ASTDKFPKWSA 503 +K GL + G +L+L+ + P + R T TVSFSLPTAKP+R AST+K PKWS Sbjct: 33 LKPHGLQSPWLGIKLSLRSSKPRPHLPNHRPITA-TVSFSLPTAKPDRVASTEKVPKWSR 91 Query: 504 RAIKSFAMAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLL 683 RAIKSF MAELEARKLKYP TGTEAFLMGIL+EGTS AAKFLRANGITLFKVREET+ +L Sbjct: 92 RAIKSFVMAELEARKLKYPTTGTEAFLMGILIEGTSLAAKFLRANGITLFKVREETVKVL 151 Query: 684 GKSDMYFFSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIM 863 GK+DMY+FSPEHPPLTE AQRALDWAVD+KLKSG+ GE+TT+HLLLGIWSE E GHKIM Sbjct: 152 GKADMYYFSPEHPPLTEAAQRALDWAVDQKLKSGDDGEVTTTHLLLGIWSEVESPGHKIM 211 Query: 864 AALGFDDTKAAELAKSADE 920 ALGF D KA ELA + E Sbjct: 212 TALGFIDVKAKELASLSSE 230 >ref|XP_006348058.1| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like [Solanum tuberosum] Length = 235 Score = 271 bits (694), Expect = 4e-70 Identities = 142/240 (59%), Positives = 176/240 (73%) Frame = +3 Query: 222 MAARNLSVFTILASSSHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVPV 401 MA + S+ +I + +S+S + L + + L + G +L ++ N Sbjct: 1 MATHSFSLLSIQSLTSNSSNKQSENTNTFLTHKY---CKALATTFTGGKLLIRPQNLNNF 57 Query: 402 IRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAF 581 K+R STV TV+FSLP +PE S++K PKWS+RAI++F MAELEARKLKYPNTGTEA Sbjct: 58 TLKRRRSTVATVAFSLPITRPE--SSEKQPKWSSRAIQAFVMAELEARKLKYPNTGTEAL 115 Query: 582 LMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDWA 761 LMGILVEGTS AAKFLRANG+T FKV EET+ LLG+SDMY+FSPEHPPLT+PAQ+ALDWA Sbjct: 116 LMGILVEGTSLAAKFLRANGVTFFKVSEETLKLLGRSDMYYFSPEHPPLTKPAQKALDWA 175 Query: 762 VDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSYR 941 V+EKLKSGE GEIT +H+ LGIWS KE AGHKIM+ GFDD KA ELAK D+D L+Y+ Sbjct: 176 VNEKLKSGEDGEITVTHIALGIWSVKESAGHKIMSTFGFDDEKAKELAKFMDKDIELTYK 235 >ref|NP_001242487.1| uncharacterized protein LOC100786582 [Glycine max] gi|255639105|gb|ACU19852.1| unknown [Glycine max] Length = 260 Score = 271 bits (694), Expect = 4e-70 Identities = 146/229 (63%), Positives = 170/229 (74%), Gaps = 2/229 (0%) Frame = +3 Query: 261 SSSHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLAL-QFTNSVPVIRKQRC-STVMT 434 SS HS + SP+ SL G R+ L + T+S + C +T T Sbjct: 43 SSPHSNPNNHCTLSPT--------------SLFGTRITLLRATSSSRSLPNTNCRATSAT 88 Query: 435 VSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAFLMGILVEGTSS 614 VSFSLPT KP + +K PKWSARAIKS+AM ELEARKLKYPNTGTEA LMGILVEGTS Sbjct: 89 VSFSLPTPKPLSDTPEKTPKWSARAIKSYAMGELEARKLKYPNTGTEALLMGILVEGTSK 148 Query: 615 AAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDWAVDEKLKSGESG 794 AAKFLRANGITLFKVREET+ LLGKSD+YFFSPEHPPLTEPAQ+ALDWA++EKLKSGE G Sbjct: 149 AAKFLRANGITLFKVREETVELLGKSDLYFFSPEHPPLTEPAQKALDWAIEEKLKSGEGG 208 Query: 795 EITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSYR 941 EI+ +HLLLGIWS+KE AG +I+ LGF+D KA ELAK+ D D LS++ Sbjct: 209 EISVTHLLLGIWSQKESAGQQILDTLGFNDEKAKELAKTIDGDVDLSFK 257 >ref|XP_006413313.1| hypothetical protein EUTSA_v10026111mg [Eutrema salsugineum] gi|557114483|gb|ESQ54766.1| hypothetical protein EUTSA_v10026111mg [Eutrema salsugineum] Length = 241 Score = 271 bits (693), Expect = 5e-70 Identities = 141/246 (57%), Positives = 178/246 (72%), Gaps = 6/246 (2%) Frame = +3 Query: 222 MAARNLSVFTILAS------SSHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQF 383 MA+ LS + S SS G + + S S I P ++ + L+ + P R Sbjct: 1 MASYTLSFIPLTVSNRRIFVSSQKGSPSSSSSSSSPIPPTSLLGKKLLVTKPSRRC---- 56 Query: 384 TNSVPVIRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPN 563 + K RC T + FS+PTA+PE S+DK PKWSAR+IKS AM ELEARKLKYP+ Sbjct: 57 -----FVSKHRCLTSASTVFSVPTAQPENGSSDKLPKWSARSIKSLAMGELEARKLKYPS 111 Query: 564 TGTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQ 743 TGTEA LMGILVEGTS+AAKFLR NG+TLFKVR+ETINLLGKSDMYFFSPEHPPLTEPA+ Sbjct: 112 TGTEAILMGILVEGTSTAAKFLRGNGVTLFKVRDETINLLGKSDMYFFSPEHPPLTEPAR 171 Query: 744 RALDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADED 923 +A++WA+DEK KSG GE+TT++LLLGIWS+K+ AG +I+ LGF++ KA E+AKS +ED Sbjct: 172 KAIEWAIDEKKKSGVDGELTTAYLLLGIWSQKDSAGRQILETLGFNEDKAKEVAKSMNED 231 Query: 924 AVLSYR 941 LS++ Sbjct: 232 VDLSFK 237 >ref|XP_003534040.1| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like isoform X1 [Glycine max] Length = 252 Score = 271 bits (692), Expect = 6e-70 Identities = 140/199 (70%), Positives = 161/199 (80%), Gaps = 2/199 (1%) Frame = +3 Query: 351 SLPGNRLAL-QFTNSVPVIRKQRC-STVMTVSFSLPTAKPERASTDKFPKWSARAIKSFA 524 SL G R+ L + T+S + C +T TVSFSLPT KP + +K PKWSARAIKS+A Sbjct: 51 SLFGTRITLLRATSSSRSLPNTNCRATSATVSFSLPTPKPLSDTPEKTPKWSARAIKSYA 110 Query: 525 MAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYF 704 M ELEARKLKYPNTGTEA LMGILVEGTS AAKF RANGITLFKVREET+ LLGKSD+YF Sbjct: 111 MGELEARKLKYPNTGTEALLMGILVEGTSKAAKFSRANGITLFKVREETVELLGKSDLYF 170 Query: 705 FSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDD 884 FSPEHPPLTEPAQ+ALDWA++EKLKSGE GEI +HLLLGIWS+KE AG +I+A LGF+D Sbjct: 171 FSPEHPPLTEPAQKALDWAIEEKLKSGEGGEINVTHLLLGIWSQKESAGQQILATLGFND 230 Query: 885 TKAAELAKSADEDAVLSYR 941 KA EL+KS D D LS++ Sbjct: 231 EKAKELSKSIDGDVDLSFK 249 >ref|XP_002510906.1| ATP-dependent clp protease, putative [Ricinus communis] gi|223550021|gb|EEF51508.1| ATP-dependent clp protease, putative [Ricinus communis] Length = 227 Score = 268 bits (685), Expect = 4e-69 Identities = 139/210 (66%), Positives = 163/210 (77%) Frame = +3 Query: 306 SLILPWNVKTEGLVNSLPGNRLALQFTNSVPVIRKQRCSTVMTVSFSLPTAKPERASTDK 485 SL+LP ++S GN+L ++ +N + K ST TV SLPT +R + K Sbjct: 27 SLLLP--------LSSFHGNKLLIKQSNFSNFVLKSHGSTAATVLSSLPT---KRHPSGK 75 Query: 486 FPKWSARAIKSFAMAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFKVRE 665 PKWSARAI+SF + ELEARKLKYPNTGTEA LMGIL+EGTS AAKFLRANGIT F+VRE Sbjct: 76 IPKWSARAIRSFGLGELEARKLKYPNTGTEALLMGILIEGTSPAAKFLRANGITFFEVRE 135 Query: 666 ETINLLGKSDMYFFSPEHPPLTEPAQRALDWAVDEKLKSGESGEITTSHLLLGIWSEKEP 845 ET+NLLGKSD+Y+FSPEHPPLTE AQRALDWA+DEKLKSG+ GEITT+H+LLGIWSE E Sbjct: 136 ETVNLLGKSDLYYFSPEHPPLTEQAQRALDWAIDEKLKSGDDGEITTTHILLGIWSEIES 195 Query: 846 AGHKIMAALGFDDTKAAELAKSADEDAVLS 935 AGHK+M LGF+D KA ELAKS + D VLS Sbjct: 196 AGHKVMETLGFNDEKAKELAKSMNGDVVLS 225 >ref|XP_006285475.1| hypothetical protein CARUB_v10006894mg [Capsella rubella] gi|482554180|gb|EOA18373.1| hypothetical protein CARUB_v10006894mg [Capsella rubella] Length = 239 Score = 267 bits (682), Expect = 9e-69 Identities = 140/245 (57%), Positives = 175/245 (71%), Gaps = 5/245 (2%) Frame = +3 Query: 222 MAARNLSVFTILASS-----SHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQFT 386 MA+ LS + S+ S GS+ + SP L L +SL G +L + Sbjct: 1 MASYTLSYIPLTLSNPRILVSRQNGSSLSSSSPLL----------LTSSLLGKKLLVTPP 50 Query: 387 NSVPVIRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNT 566 + + K RC T + ++PTA+PE S+DK PKWSARAIKS AM ELEARKLKYP+T Sbjct: 51 SRRCFVSKNRCLTSASTVLNVPTAQPENGSSDKIPKWSARAIKSLAMGELEARKLKYPST 110 Query: 567 GTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQR 746 GTEA LMGILVEGTS+ AKFLR NG+TLFKVR+ETI+LLGKSDMYFFSPEHPPLTEPAQ+ Sbjct: 111 GTEAILMGILVEGTSTVAKFLRGNGVTLFKVRDETISLLGKSDMYFFSPEHPPLTEPAQK 170 Query: 747 ALDWAVDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDA 926 A+ WA+DEK KS GE+TT++LLLGIWS+K+ AGH+I+ LGFD+ KA E+ KS +ED Sbjct: 171 AIAWAIDEKNKSAVDGELTTAYLLLGIWSQKDSAGHQILEKLGFDEDKAKEVEKSMNEDV 230 Query: 927 VLSYR 941 LS++ Sbjct: 231 DLSFK 235 >ref|XP_006587382.1| PREDICTED: clp protease-related protein At4g12060, chloroplastic-like isoform X2 [Glycine max] Length = 253 Score = 266 bits (680), Expect = 2e-68 Identities = 140/200 (70%), Positives = 161/200 (80%), Gaps = 3/200 (1%) Frame = +3 Query: 351 SLPGNRLAL-QFTNSVPVIRKQRC-STVMTVSFSLPTAKPERASTDKFPKWSARAIKSFA 524 SL G R+ L + T+S + C +T TVSFSLPT KP + +K PKWSARAIKS+A Sbjct: 51 SLFGTRITLLRATSSSRSLPNTNCRATSATVSFSLPTPKPLSDTPEKTPKWSARAIKSYA 110 Query: 525 MAELEARKLKYPNTGTEAFLMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYF 704 M ELEARKLKYPNTGTEA LMGILVEGTS AAKF RANGITLFKVREET+ LLGKSD+YF Sbjct: 111 MGELEARKLKYPNTGTEALLMGILVEGTSKAAKFSRANGITLFKVREETVELLGKSDLYF 170 Query: 705 FSPEHPPLTEPAQRALDWAVDEKLKS-GESGEITTSHLLLGIWSEKEPAGHKIMAALGFD 881 FSPEHPPLTEPAQ+ALDWA++EKLKS GE GEI +HLLLGIWS+KE AG +I+A LGF+ Sbjct: 171 FSPEHPPLTEPAQKALDWAIEEKLKSAGEGGEINVTHLLLGIWSQKESAGQQILATLGFN 230 Query: 882 DTKAAELAKSADEDAVLSYR 941 D KA EL+KS D D LS++ Sbjct: 231 DEKAKELSKSIDGDVDLSFK 250 >gb|EPS73881.1| hypothetical protein M569_00876 [Genlisea aurea] Length = 233 Score = 265 bits (678), Expect = 3e-68 Identities = 142/240 (59%), Positives = 177/240 (73%) Frame = +3 Query: 222 MAARNLSVFTILASSSHSGGSTRAGKSPSLILPWNVKTEGLVNSLPGNRLALQFTNSVPV 401 MAA+ LSV + S + + R LI ++ L NS G ++++ + Sbjct: 1 MAAQGLSVISKTPYFSTNREAFRKPVKQQLI------SQNLSNSFFGTKVSIPPVGFSVI 54 Query: 402 IRKQRCSTVMTVSFSLPTAKPERASTDKFPKWSARAIKSFAMAELEARKLKYPNTGTEAF 581 + CSTV ++ SLPT K E S DK KWS+R+IKSFAMAELEARKLK+PNTGTEA Sbjct: 55 GSIRSCSTVAAITLSLPTTKTEIVS-DKNLKWSSRSIKSFAMAELEARKLKFPNTGTEAL 113 Query: 582 LMGILVEGTSSAAKFLRANGITLFKVREETINLLGKSDMYFFSPEHPPLTEPAQRALDWA 761 LMGIL+EGTS AA+FLR NG+TLFKVREET+NLLGKSD++FFSPEHPPLTEPAQ ALD+A Sbjct: 114 LMGILIEGTSLAARFLRENGVTLFKVREETVNLLGKSDLFFFSPEHPPLTEPAQNALDYA 173 Query: 762 VDEKLKSGESGEITTSHLLLGIWSEKEPAGHKIMAALGFDDTKAAELAKSADEDAVLSYR 941 V+EKLKSGE GEITT+HLLLGIWS+ E AG+KIM LG +D K +ELAK+ D+D +LS++ Sbjct: 174 VEEKLKSGEDGEITTAHLLLGIWSQNESAGYKIMVTLGINDDKLSELAKNKDKDIILSFK 233