BLASTX nr result
ID: Catharanthus23_contig00009481
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009481
         (1871 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004238590.1| PREDICTED: uncharacterized protein LOC101263...   431   e-118
ref|XP_006354937.1| PREDICTED: thylakoidal processing peptidase ...   431   e-118
ref|XP_002284120.1| PREDICTED: probable thylakoidal processing p...   421   e-115
ref|XP_006434872.1| hypothetical protein CICLE_v10001591mg [Citr...   396   e-107
ref|XP_006473394.1| PREDICTED: probable thylakoidal processing p...   395   e-107
gb|EXB38625.1| putative thylakoidal processing peptidase 2 [Moru...   390   e-105
gb|EOY14603.1| Peptidase S24/S26A/S26B/S26C family protein, puta...   389   e-105
gb|EMJ24518.1| hypothetical protein PRUPE_ppa007329mg [Prunus pe...   386   e-104
gb|EOY14609.1| Peptidase S24/S26A/S26B/S26C family protein, puta...   385   e-104
ref|XP_002326914.1| predicted protein [Populus trichocarpa] gi|5...   382   e-103
ref|XP_003602967.1| Thylakoidal processing peptidase [Medicago t...   371   e-100
ref|XP_004160620.1| PREDICTED: uncharacterized protein LOC101229...   370   e-100
ref|XP_004141368.1| PREDICTED: uncharacterized protein LOC101221...   363   2e-97
ref|XP_002510285.1| signal peptidase I, putative [Ricinus commun...   357   8e-96
ref|XP_004501604.1| PREDICTED: probable thylakoidal processing p...   356   2e-95
ref|XP_006581229.1| PREDICTED: probable thylakoidal processing p...   353   1e-94
gb|ESW08643.1| hypothetical protein PHAVU_009G062100g [Phaseolus...   349   2e-93
ref|XP_003523894.1| PREDICTED: probable thylakoidal processing p...
349 2e-93 ref|XP_006417872.1| hypothetical protein EUTSA_v10008025mg [Eutr... 349 3e-93 gb|EPS69411.1| hypothetical protein M569_05352, partial [Genlise... 348 5e-93 >ref|XP_004238590.1| PREDICTED: uncharacterized protein LOC101263904 [Solanum lycopersicum] Length = 853 Score = 431 bits (1108), Expect = e-118 Identities = 240/399 (60%), Positives = 279/399 (69%), Gaps = 8/399 (2%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQT---KPDHNYSSDFR--- 466 MAIRFTVTYSGY+AQNLASSA+SK CRFFHE + RSRIF KP+ N SDFR Sbjct: 1 MAIRFTVTYSGYLAQNLASSASSKVVGCRFFHECTVRSRIFHPPAQKPESN-CSDFRRTK 59 Query: 467 PRQHPXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXX 646 P+ P + A E+FG S N Sbjct: 60 PKPRPVSNTYSSRSFSSSSACS-SFASELFGGSSNSPLVVGLISLMRSSSGSCT------ 112 Query: 647 MMGVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSA-SSLLDKGGTAVTADQPSVTSRAK 823 M G+SPLK SS +PFL QGSKWLPC++PSIGS+ SS +DKGGT Sbjct: 113 -MNALGISPLKASSFLPFL-QGSKWLPCNEPSIGSSGSSEVDKGGT-------------- 156 Query: 824 SVVQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAE 1003 E R + S+ SE +++ +K+S++ W+SKLLN+CSDDAKAAFTALSVSI+F+SSLAE Sbjct: 157 ----ETRCSESSVRSEPLSNEMKVSKSRWVSKLLNICSDDAKAAFTALSVSIMFKSSLAE 212 Query: 1004 PRSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEI-GYGSSDVFIKR 1180 PRSIPSASMSPTLD GDRI+AEKVSY FR+P++SDIVIFKAPPILQ I G + DVFIKR Sbjct: 213 PRSIPSASMSPTLDKGDRIMAEKVSYFFRQPDISDIVIFKAPPILQHIFGCSAGDVFIKR 272 Query: 1181 VVAKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSH 1360 VVA GDY+EVR+GKL +NG+AQDEDFILEP+AYEMEPVLVPEG VFVMGDNRNNS+DSH Sbjct: 273 VVALAGDYIEVREGKLFLNGVAQDEDFILEPIAYEMEPVLVPEGCVFVMGDNRNNSYDSH 332 Query: 1361 NWGPLPIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVA 1477 NWGPLP+ NIVGRSVFRYWPPSRVSDTL + VA Sbjct: 333 NWGPLPVANIVGRSVFRYWPPSRVSDTLHGSVMEKRVVA 371 >ref|XP_006354937.1| PREDICTED: thylakoidal processing peptidase 1, chloroplastic-like [Solanum tuberosum] Length = 373 Score = 431 bits (1107), Expect = e-118 Identities = 243/401 (60%), Positives = 278/401 (69%), Gaps = 8/401 (1%) Frame = +2 Query: 305 
MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQT---KPDHNYSSDFR--- 466 MAIRFTVTYSGY+AQNLASSA+SK CRFFHE + RSRIF KP+ N SDFR Sbjct: 1 MAIRFTVTYSGYLAQNLASSASSKVVGCRFFHECTVRSRIFHPPAQKPESN-CSDFRRTK 59 Query: 467 PRQHPXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXX 646 P+ P + A E+ G S N Sbjct: 60 PKPRPVSNTYSSRSFSSSSVCS-SFASELLGGSSNSPLVVGLISLMRSSSGSCT------ 112 Query: 647 MMGVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGS-ASSLLDKGGTAVTADQPSVTSRAK 823 M G+SPLK SS +PF QGSKWLPC++PSIGS ASS +DKGGT Sbjct: 113 -MNTLGISPLKASSFLPFF-QGSKWLPCNEPSIGSSASSEVDKGGT-------------- 156 Query: 824 SVVQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAE 1003 E R + S SE +++ +K+S++ W+SKLLN+CSDDAKAAFTALSVSI+F+SSLAE Sbjct: 157 ----ETRCSESFVRSEPLSNEMKVSKSRWVSKLLNICSDDAKAAFTALSVSIMFKSSLAE 212 Query: 1004 PRSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEI-GYGSSDVFIKR 1180 PRSIPSASMSPTLD GDRI+AEKVSY FR+P++SDIVIFKAPPILQ I G + DVFIKR Sbjct: 213 PRSIPSASMSPTLDKGDRIMAEKVSYFFRQPDISDIVIFKAPPILQHIFGCSAGDVFIKR 272 Query: 1181 VVAKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSH 1360 VVA GDYVEVR+GKL +NG+AQDEDFILEPLAYEMEPVLVPEG VFVMGDNRNNSFDSH Sbjct: 273 VVALAGDYVEVREGKLFLNGVAQDEDFILEPLAYEMEPVLVPEGYVFVMGDNRNNSFDSH 332 Query: 1361 NWGPLPIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVALS 1483 NWGPLP+ NIVGRSVFRYWPPSRVSDTL + VA+S Sbjct: 333 NWGPLPVANIVGRSVFRYWPPSRVSDTLHGSVMEKRVVAVS 373 >ref|XP_002284120.1| PREDICTED: probable thylakoidal processing peptidase 2, chloroplastic [Vitis vinifera] gi|147810057|emb|CAN78280.1| hypothetical protein VITISV_021649 [Vitis vinifera] Length = 368 Score = 421 bits (1081), Expect = e-115 Identities = 231/396 (58%), Positives = 263/396 (66%), Gaps = 3/396 (0%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIF--QTKPDHNYSSDFRPRQH 478 MAI+ TVTYSGYVAQNLASSA + G CR HE RSR F KP+ + R Q Sbjct: 1 MAIKLTVTYSGYVAQNLASSAGIRVGNCRSIHECWVRSRFFCPSQKPEVDSPVPSRAYQA 60 Query: 479 PXXXXXXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXXXXXXXXMMG 655 TLAGEVFG+S +NP +G Sbjct: 61 
DYRRPKANCWAKVSTSAYSTLAGEVFGDSCRNPLIVGLISLMKSSTGVSESS------VG 114 Query: 656 VFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSVVQ 835 VFGVSPLK +SI+PFL GSKWLPC++P GS +DKGGT Sbjct: 115 VFGVSPLKATSILPFL-PGSKWLPCNEPIQGSVGDEVDKGGT------------------ 155 Query: 836 ERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSI 1015 C D L R+NWLSKLLN CS+DA+A FTA++VS+LFRS LAEPRSI Sbjct: 156 ---QCCDVEVISKPLDRKVLERSNWLSKLLNCCSEDARAVFTAVTVSLLFRSPLAEPRSI 212 Query: 1016 PSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKG 1195 PSASM PTLDVGDRILAEKVSYVFR PEVSDIVIFK PPILQEIGY + DVFIKR+VAK Sbjct: 213 PSASMYPTLDVGDRILAEKVSYVFRNPEVSDIVIFKVPPILQEIGYSAGDVFIKRIVAKA 272 Query: 1196 GDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGPL 1375 GDYVEV +GKL+VNG+AQ+EDFILEPLAY M+PVLVPEG VFV+GDNRNNSFDSHNWGPL Sbjct: 273 GDYVEVSEGKLMVNGVAQEEDFILEPLAYNMDPVLVPEGYVFVLGDNRNNSFDSHNWGPL 332 Query: 1376 PIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVALS 1483 PI+NIVGRSV RYWPPS+VSDT+ P A + +A+S Sbjct: 333 PIKNIVGRSVLRYWPPSKVSDTIYEPEARKTAMAIS 368 >ref|XP_006434872.1| hypothetical protein CICLE_v10001591mg [Citrus clementina] gi|557536994|gb|ESR48112.1| hypothetical protein CICLE_v10001591mg [Citrus clementina] Length = 365 Score = 396 bits (1017), Expect = e-107 Identities = 227/394 (57%), Positives = 261/394 (66%), Gaps = 6/394 (1%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAG----TCRFFHEFSARSRIF--QTKPDHNYSSDFR 466 MA+R TV +SGYVAQNLA SA + G + R FHE R R+F K D + +++ Sbjct: 1 MALRVTVNFSGYVAQNLAHSAGIRFGFSTTSTRSFHECLFRPRVFCHSKKTDLDPPPNYQ 60 Query: 467 PRQHPXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXX 646 P+ + TLA E+FG+ Sbjct: 61 PKAN---------------YRCNTLAAEIFGDGA-CNSPILMGLVSLMKSTAGMPGPSAT 104 Query: 647 MMGVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKS 826 MGVFG+SP K +SIIPF LQGSKWLPC++P S +DKGGT T + + Sbjct: 105 SMGVFGISPFKAASIIPF-LQGSKWLPCNEPGTVPESDYVDKGGT---------TDKIQF 154 Query: 827 VVQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEP 1006 E N S LK S 
+WLSKLLNVCSDDAKAAFTAL+VS LF+S LAEP Sbjct: 155 SGSENLNGVSL--------QLKTS-GSWLSKLLNVCSDDAKAAFTALTVSFLFKSFLAEP 205 Query: 1007 RSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVV 1186 RSIPSASM+PTLDVGDRILAEKVSY F++PEVSDIVIF+APPILQEIG+ S DVFIKR+V Sbjct: 206 RSIPSASMNPTLDVGDRILAEKVSYFFKRPEVSDIVIFRAPPILQEIGFSSGDVFIKRIV 265 Query: 1187 AKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNW 1366 A GD VEV GKLLVNG+AQDEDFILEPLAYEM+PV+VPEG VFV+GDNRNNSFDSHNW Sbjct: 266 ATAGDCVEVHGGKLLVNGVAQDEDFILEPLAYEMDPVVVPEGYVFVLGDNRNNSFDSHNW 325 Query: 1367 GPLPIENIVGRSVFRYWPPSRVSDTLPSPTATQN 1468 GPLPIENIVGRSVFRYWPPSRVSD L P A +N Sbjct: 326 GPLPIENIVGRSVFRYWPPSRVSDMLDDPYAMKN 359 >ref|XP_006473394.1| PREDICTED: probable thylakoidal processing peptidase 2, chloroplastic-like [Citrus sinensis] Length = 365 Score = 395 bits (1016), Expect = e-107 Identities = 227/392 (57%), Positives = 259/392 (66%), Gaps = 4/392 (1%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAG----TCRFFHEFSARSRIFQTKPDHNYSSDFRPR 472 MA+R TV +SGYVAQNLA SA + G + R FHE R R+F HN +D P Sbjct: 1 MALRVTVNFSGYVAQNLAHSAGIRFGFSTTSTRSFHECLFRPRVF----CHNKKTDLDPA 56 Query: 473 QHPXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXXMM 652 + TLA E+FG+ M Sbjct: 57 PN---------YQPKANYRCNTLAAEIFGDGA-CNSPILMGLVSLMKSTAGMPGSSATSM 106 Query: 653 GVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSVV 832 GVFG+SP K +SIIPF LQGSKWLPC++P S +DKGGT T + + Sbjct: 107 GVFGISPFKAASIIPF-LQGSKWLPCNEPGTVPESDYVDKGGT---------TDKIQFSG 156 Query: 833 QERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRS 1012 E N S LK S +WLSKLLNVCSDDAKAAFTAL+VS+LF+S LAEPRS Sbjct: 157 SENLNGVSL--------QLKTS-GSWLSKLLNVCSDDAKAAFTALTVSLLFKSFLAEPRS 207 Query: 1013 IPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAK 1192 IPSASM+PTLDVGDRILAEKVSY F++PEVSDIVIF+APPILQEIG+ S DVFIKR+VA Sbjct: 208 IPSASMNPTLDVGDRILAEKVSYFFKRPEVSDIVIFRAPPILQEIGFSSGDVFIKRIVAT 267 Query: 1193 GGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGP 1372 GD VEV 
GKLLVNG+AQDEDFILEPLAYEM+PV+VPEG VFV+GDNRNNSFDSHNWGP Sbjct: 268 AGDCVEVHGGKLLVNGVAQDEDFILEPLAYEMDPVVVPEGYVFVLGDNRNNSFDSHNWGP 327 Query: 1373 LPIENIVGRSVFRYWPPSRVSDTLPSPTATQN 1468 LPIENIVGRSVFRYWPPSRVS+ L P A +N Sbjct: 328 LPIENIVGRSVFRYWPPSRVSNMLDDPYAMKN 359 >gb|EXB38625.1| putative thylakoidal processing peptidase 2 [Morus notabilis] Length = 787 Score = 390 bits (1002), Expect = e-105 Identities = 229/406 (56%), Positives = 267/406 (65%), Gaps = 13/406 (3%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQT--KPDH--------NYS 454 MAIR T ++SGYVAQNLASSA + G CR FHE R+R+F T KP NY Sbjct: 1 MAIRVTFSFSGYVAQNLASSAGLRVGNCRAFHECWVRNRVFGTSQKPAELDPALSARNYR 60 Query: 455 SDF-RPRQHPXXXXXXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXX 628 SDF RP+ + TLAGEV GE+ K+P Sbjct: 61 SDFDRPKPN---------CWAKNSSSYSTLAGEVLGENCKSPILLTLISIMKSTAGVSAS 111 Query: 629 XXXXXXMMGVFGVSPLKPSSIIPFLLQGSKWLPCSQP-SIGSASSLLDKGGTAVTADQPS 805 G FG+SP+K +SIIPFL QGSKWLPC++ I S + +DKGGT Sbjct: 112 SATST---GTFGISPIKATSIIPFL-QGSKWLPCNESVQISSVNHEVDKGGTL------- 160 Query: 806 VTSRAKSVVQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILF 985 CS EA +D+ + WL++LLN CS+DAKA FTA++VS+LF Sbjct: 161 ---------------CSVG--EATSDDHLQKGSGWLTRLLNSCSEDAKAVFTAVTVSLLF 203 Query: 986 RSSLAEPRSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSD 1165 RSSLAEPRSIPS+SM PTLDVGDRILAEKVSYVFRKPEVSDIVIFKAP ILQEIGY SSD Sbjct: 204 RSSLAEPRSIPSSSMYPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPKILQEIGYSSSD 263 Query: 1166 VFIKRVVAKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNN 1345 VFIKR+VAK G+ V+VRDGKLLVNG+AQDE+F+LE L YEM+PVLVPEG VFVMGDNRNN Sbjct: 264 VFIKRIVAKAGECVQVRDGKLLVNGVAQDEEFVLESLDYEMDPVLVPEGYVFVMGDNRNN 323 Query: 1346 SFDSHNWGPLPIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVALS 1483 SFDSHNWGPLP++NIVGRSV+RYWPPS+ A +N V LS Sbjct: 324 SFDSHNWGPLPVKNIVGRSVYRYWPPSK---------AGKNAVTLS 360 >gb|EOY14603.1| Peptidase S24/S26A/S26B/S26C family protein, putative isoform 1 [Theobroma cacao] Length = 365 Score = 389 bits (1000), Expect = e-105 Identities = 221/396 
(55%), Positives = 260/396 (65%), Gaps = 3/396 (0%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTC--RFFHEFSARSRIFQTKPDHNYSSDFRPRQH 478 MAIR TVTYSGYVAQNLAS+A + G+C R HE RSR N SD P Sbjct: 1 MAIRVTVTYSGYVAQNLASNAGFRLGSCSSRSVHECWLRSRFLSP----NKKSDIDPS-- 54 Query: 479 PXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXXMMGV 658 P TLA E+ + N +G+ Sbjct: 55 PARNYHAADLRHPRSSMSSTLAAEILKDGCN--NPIIVGLISLMKSTAYGSCSSSTTVGL 112 Query: 659 FGVSPLKPSSIIPFLLQGSKWLPCSQP-SIGSASSLLDKGGTAVTADQPSVTSRAKSVVQ 835 G+SP K +SII FL Q SKWLPC++P S+G SS +D+GGT+ S+ K V Sbjct: 113 CGISPFKATSIISFL-QASKWLPCNEPASVGPESSEVDRGGTSNEDRSLSLELDPKGFV- 170 Query: 836 ERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSI 1015 +++W+S+LLNVCS+DAKAA TA++VSILFRS +AEPRSI Sbjct: 171 ---------------------KSSWISRLLNVCSEDAKAALTAVTVSILFRSFMAEPRSI 209 Query: 1016 PSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKG 1195 PS SM PTLDVGDR+LAEKVSY FRKPEVSDIVIF+APPILQEIG+ S DVFIKR+VAK Sbjct: 210 PSTSMYPTLDVGDRVLAEKVSYFFRKPEVSDIVIFRAPPILQEIGFSSGDVFIKRIVAKA 269 Query: 1196 GDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGPL 1375 GD VEVRDGKLL+NG+AQDEDF+LEPLAYEM+PV+VPEG VFV+GDNRNNSFDSHNWGPL Sbjct: 270 GDCVEVRDGKLLINGVAQDEDFVLEPLAYEMDPVVVPEGYVFVLGDNRNNSFDSHNWGPL 329 Query: 1376 PIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVALS 1483 PIENIVGRSVFRYWPPS+VSDT+ P + VA+S Sbjct: 330 PIENIVGRSVFRYWPPSKVSDTIHDPHVGKIAVAVS 365 >gb|EMJ24518.1| hypothetical protein PRUPE_ppa007329mg [Prunus persica] Length = 372 Score = 386 bits (992), Expect = e-104 Identities = 218/398 (54%), Positives = 255/398 (64%), Gaps = 5/398 (1%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIF--QTKPDHNYSSDFRPRQH 478 MAIR T+++SGYVAQNLASSA + G CR FHE RSR+F KP+ + S R Sbjct: 1 MAIRVTLSFSGYVAQNLASSANLRVGNCRGFHECWVRSRVFGSNQKPEFDPSVPVRKYHQ 60 Query: 479 P--XXXXXXXXXXXXXXXXXXTLAGEVFGE-SKNPXXXXXXXXXXXXXXXXXXXXXXXXM 649 LA E+ GE SK+P M Sbjct: 61 TQFSRSKPSSLAAKTLPSLYTALAEEIVGESSKSPIVLGLISLLKSTAFVAGVSSAPSAM 120 Query: 650 
MGVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSV 829 G+SP KP SI+PF LQ SKWLPC++ S +DKGGT + V K Sbjct: 121 ----GISPFKPGSIMPF-LQVSKWLPCNETVPVSILKEVDKGGTLCVDEVAEVPRLTKK- 174 Query: 830 VQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPR 1009 +L R+ +LS+LLN CS+DAKA FTA++VS+LF+S LAEPR Sbjct: 175 --------------------ELGRSGFLSRLLNSCSEDAKAVFTAVTVSVLFKSFLAEPR 214 Query: 1010 SIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVA 1189 SIPS SM PTLDVGDR+LAEKVSY F+KPEVSDIVIFKAPPILQEIGY S DVFIKR+VA Sbjct: 215 SIPSTSMYPTLDVGDRVLAEKVSYFFKKPEVSDIVIFKAPPILQEIGYSSGDVFIKRIVA 274 Query: 1190 KGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWG 1369 K GD VEVR+GKLLVNG+ QDE +ILEPLAYEM+PVL+PEG VFVMGDNRNNSFDSHNWG Sbjct: 275 KAGDCVEVRNGKLLVNGLVQDEHYILEPLAYEMDPVLIPEGYVFVMGDNRNNSFDSHNWG 334 Query: 1370 PLPIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVALS 1483 PLP++NI+GRSVFRYWPPS+VSDT P N VA+S Sbjct: 335 PLPVKNILGRSVFRYWPPSKVSDTTYEPQVADNAVAIS 372 >gb|EOY14609.1| Peptidase S24/S26A/S26B/S26C family protein, putative isoform 7 [Theobroma cacao] Length = 366 Score = 385 bits (988), Expect = e-104 Identities = 221/397 (55%), Positives = 260/397 (65%), Gaps = 4/397 (1%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTC--RFFHEFSARSRIFQTKPDHNYSSDFRPRQH 478 MAIR TVTYSGYVAQNLAS+A + G+C R HE RSR N SD P Sbjct: 1 MAIRVTVTYSGYVAQNLASNAGFRLGSCSSRSVHECWLRSRFLSP----NKKSDIDPS-- 54 Query: 479 PXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXXMMGV 658 P TLA E+ + N +G+ Sbjct: 55 PARNYHAADLRHPRSSMSSTLAAEILKDGCN--NPIIVGLISLMKSTAYGSCSSSTTVGL 112 Query: 659 FGVSPLKPSSIIPFLLQGSKWLPCSQP-SIGSASSLLDKGGTAVTADQPSVTSRAKSVVQ 835 G+SP K +SII FL Q SKWLPC++P S+G SS +D+GGT+ S+ K V Sbjct: 113 CGISPFKATSIISFL-QASKWLPCNEPASVGPESSEVDRGGTSNEDRSLSLELDPKGFV- 170 Query: 836 ERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSI 1015 +++W+S+LLNVCS+DAKAA TA++VSILFRS +AEPRSI Sbjct: 171 ---------------------KSSWISRLLNVCSEDAKAALTAVTVSILFRSFMAEPRSI 209 Query: 1016 
PSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKG 1195 PS SM PTLDVGDR+LAEKVSY FRKPEVSDIVIF+APPILQEIG+ S DVFIKR+VAK Sbjct: 210 PSTSMYPTLDVGDRVLAEKVSYFFRKPEVSDIVIFRAPPILQEIGFSSGDVFIKRIVAKA 269 Query: 1196 GDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEP-VLVPEGAVFVMGDNRNNSFDSHNWGP 1372 GD VEVRDGKLL+NG+AQDEDF+LEPLAYEM+P V+VPEG VFV+GDNRNNSFDSHNWGP Sbjct: 270 GDCVEVRDGKLLINGVAQDEDFVLEPLAYEMDPVVVVPEGYVFVLGDNRNNSFDSHNWGP 329 Query: 1373 LPIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVALS 1483 LPIENIVGRSVFRYWPPS+VSDT+ P + VA+S Sbjct: 330 LPIENIVGRSVFRYWPPSKVSDTIHDPHVGKIAVAVS 366 >ref|XP_002326914.1| predicted protein [Populus trichocarpa] gi|566202277|ref|XP_006375012.1| hypothetical protein POPTR_0014s03570g [Populus trichocarpa] gi|550323326|gb|ERP52809.1| hypothetical protein POPTR_0014s03570g [Populus trichocarpa] Length = 362 Score = 382 bits (981), Expect = e-103 Identities = 217/404 (53%), Positives = 255/404 (63%), Gaps = 13/404 (3%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQ----TKPDHNYS------ 454 MAIR T ++SGYVAQNL + G CR+ +E RSRIF T HN Sbjct: 1 MAIRVTFSFSGYVAQNLGV----RVGNCRYLNECFIRSRIFASPATTTTTHNSDIEPPGP 56 Query: 455 ---SDFRPRQHPXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXX 625 +DFR R T+AGE+FG++ Sbjct: 57 RTGTDFRRRN-------LKRNYSNSAAMYSTMAGEIFGDN----CKGSAIAVGLVSLMKS 105 Query: 626 XXXXXXXMMGVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPS 805 MG G+SP K SI+PFL QGS+WLPC++ +GS S +D+GGT Sbjct: 106 TAGVSCSNMGACGISPFKAVSILPFL-QGSRWLPCNEAVLGSRSPEVDRGGTGTVKSVEK 164 Query: 806 VTSRAKSVVQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILF 985 V S +KS +W S++ NVCS+DAKA FTA +VS+LF Sbjct: 165 V-SESKS-------------------------RSWFSRVFNVCSEDAKAMFTAATVSLLF 198 Query: 986 RSSLAEPRSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSD 1165 RS+LAEPRSIPS+SMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQE G+ S D Sbjct: 199 RSTLAEPRSIPSSSMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEFGFSSGD 258 Query: 1166 VFIKRVVAKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNN 1345 VFIKR+VAK GDYVEVR+GKL VNG+ 
QDE+FI EPLAYEME VLVPEG VFVMGDNRNN Sbjct: 259 VFIKRIVAKAGDYVEVREGKLYVNGVVQDEEFIKEPLAYEMELVLVPEGYVFVMGDNRNN 318 Query: 1346 SFDSHNWGPLPIENIVGRSVFRYWPPSRVSDTLPSPTATQNTVA 1477 SFDSHNWGPLPI+NIVGRSVFRYWPPS+VSDT+ P +N ++ Sbjct: 319 SFDSHNWGPLPIKNIVGRSVFRYWPPSKVSDTIYDPHVAKNAIS 362 >ref|XP_003602967.1| Thylakoidal processing peptidase [Medicago truncatula] gi|355492015|gb|AES73218.1| Thylakoidal processing peptidase [Medicago truncatula] Length = 375 Score = 371 bits (953), Expect = e-100 Identities = 208/382 (54%), Positives = 250/382 (65%), Gaps = 2/382 (0%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQT--KPDHNYSSDFRPRQH 478 MAIR T ++SGYVAQNL SSA + R E SR+F + KPD S FR R Sbjct: 1 MAIRVTFSFSGYVAQNLVSSAGVRVANSRCVQECCILSRLFGSNPKPDLERSGGFRNRN- 59 Query: 479 PXXXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXXMMGV 658 TLAGE+ ES N MG Sbjct: 60 --LYSDFTKPRNSPVSVYSTLAGEILSESCN---NPIILGLISMMKSTAISGSTSAAMGA 114 Query: 659 FGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSVVQE 838 G+SP K SSIIPFL QGSKWLPC++ + + +DKGGT + + SV+S +S + Sbjct: 115 MGISPFKTSSIIPFL-QGSKWLPCNESVPTATTWEVDKGGTRIQSQPVSVSSDKESRLDL 173 Query: 839 RRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSIP 1018 N K + N W+SKLLNVCS+DAKA FTA++VS+LF+S LAEP+SIP Sbjct: 174 ---------------NQKENTNGWISKLLNVCSEDAKAVFTAVTVSLLFKSFLAEPKSIP 218 Query: 1019 SASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKGG 1198 SASM PTL+VGDR+L EK S+ FRKP+VSDIVIFKAP L+ G+ SSDVFIKRVVAK G Sbjct: 219 SASMYPTLEVGDRVLTEKFSFFFRKPDVSDIVIFKAPSWLKAYGFSSSDVFIKRVVAKAG 278 Query: 1199 DYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGPLP 1378 D VEVRDGKLLVNG+A+DE+F+LEPLAYE+ P++VP+G VFVMGDNRN SFDSHNWGPLP Sbjct: 279 DVVEVRDGKLLVNGVAEDEEFVLEPLAYELAPMVVPKGHVFVMGDNRNKSFDSHNWGPLP 338 Query: 1379 IENIVGRSVFRYWPPSRVSDTL 1444 IENIVGRS+FRYWPPS+VSDT+ Sbjct: 339 IENIVGRSMFRYWPPSKVSDTV 360 >ref|XP_004160620.1| PREDICTED: uncharacterized protein LOC101229456 [Cucumis sativus] Length = 763 Score = 370 bits (951), Expect = e-100 
Identities = 215/398 (54%), Positives = 254/398 (63%), Gaps = 14/398 (3%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQT--KPDHNYSSDFRPRQH 478 MAIR TV++SGYVAQNLASSA + G CR HE RSR+F + KP+ + S R Sbjct: 1 MAIRVTVSFSGYVAQNLASSAGIRVGNCRAVHECWIRSRLFGSNQKPEFDPSGSVRNYHS 60 Query: 479 PXXXXXXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXXXXXXXXMMG 655 T+AGE+ ES +NP MG Sbjct: 61 AVLPSNSRCWVKNSASALGTIAGEIVDESCRNPIVLGLISLMKSAVGTSVSSPMA---MG 117 Query: 656 VFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSVVQ 835 VFGVS + SSIIPFL QGSK + ++ GS ++ G Sbjct: 118 VFGVSSFEASSIIPFL-QGSKTVTGNESVSGSTGDEIESYGVF----------------- 159 Query: 836 ERRNACSAASSEAMA---DNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEP 1006 E M+ D KL +++W+S+ LN CS+DAKA TAL+VS+LFRSSLAEP Sbjct: 160 ------DCVMDEGMSQPPDPSKLEKSSWISRFLNNCSEDAKAIATALTVSVLFRSSLAEP 213 Query: 1007 RSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVV 1186 RSIPS+SM PTLDVGDRILAEKVSY FR+P VSDIVIFKAPPILQ+IGY S+DVFIKR+V Sbjct: 214 RSIPSSSMYPTLDVGDRILAEKVSYFFRRPSVSDIVIFKAPPILQKIGYKSNDVFIKRIV 273 Query: 1187 AKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNW 1366 AK GD VEVRDGKLLVNG+AQ+E FILEPL+Y M+PVLVPEG VFV+GDNRNNSFDSHNW Sbjct: 274 AKAGDCVEVRDGKLLVNGVAQNEKFILEPLSYNMDPVLVPEGYVFVLGDNRNNSFDSHNW 333 Query: 1367 GPLPIENIVGRSVFRYWPPSRVSD--------TLPSPT 1456 GPLP+ENIVGRSVFRYWPPS+VSD +P+PT Sbjct: 334 GPLPVENIVGRSVFRYWPPSKVSDKDQNAEKEVIPNPT 371 >ref|XP_004141368.1| PREDICTED: uncharacterized protein LOC101221060, partial [Cucumis sativus] Length = 761 Score = 363 bits (931), Expect = 2e-97 Identities = 208/379 (54%), Positives = 245/379 (64%), Gaps = 6/379 (1%) Frame = +2 Query: 320 TVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQT--KPDHNYSSDFRPRQHPXXXX 493 TV++SGYVAQNLASSA + G CR HE RSR+F + KP+ + S R Sbjct: 1 TVSFSGYVAQNLASSAGIRVGNCRAVHECWIRSRLFGSNQKPEFDPSGSVRNYHSAVLPS 60 Query: 494 XXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXXXXXXXXMMGVFGVS 670 T+AGE+ ES +NP MGVFGVS Sbjct: 61 
NSRCWVKNSASALGTIAGEIVDESCRNPIVLGLISLMKSAVGTSVSSPMA---MGVFGVS 117 Query: 671 PLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSVVQERRNA 850 + SSIIPFL QGSK + ++ GS ++ G Sbjct: 118 SFEASSIIPFL-QGSKTVTGNESVSGSTGDEIESYGVF---------------------- 154 Query: 851 CSAASSEAMA---DNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSIPS 1021 E M+ D KL +++W+S+ LN CS+DAKA TAL+VS+LFRSSLAEPRSIPS Sbjct: 155 -DCVMDEGMSQPPDPSKLEKSSWISRFLNNCSEDAKAIATALTVSVLFRSSLAEPRSIPS 213 Query: 1022 ASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKGGD 1201 +SM PTLDVGDRILAEKVSY FR+P VSDIVIFKAPPILQ+IGY S+DVFIKR+VAK GD Sbjct: 214 SSMYPTLDVGDRILAEKVSYFFRRPSVSDIVIFKAPPILQKIGYKSNDVFIKRIVAKAGD 273 Query: 1202 YVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGPLPI 1381 VEVRDGKLLVNG+AQ+E FILEPL+Y M+PVLVPEG VFV+GDNRNNSFDSHNWGPLP+ Sbjct: 274 CVEVRDGKLLVNGVAQNEKFILEPLSYNMDPVLVPEGYVFVLGDNRNNSFDSHNWGPLPV 333 Query: 1382 ENIVGRSVFRYWPPSRVSD 1438 ENIVGRSVFRYWPPS+VSD Sbjct: 334 ENIVGRSVFRYWPPSKVSD 352 >ref|XP_002510285.1| signal peptidase I, putative [Ricinus communis] gi|223550986|gb|EEF52472.1| signal peptidase I, putative [Ricinus communis] Length = 831 Score = 357 bits (917), Expect = 8e-96 Identities = 204/375 (54%), Positives = 243/375 (64%), Gaps = 10/375 (2%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQTKPDHNYSSDFRPRQHPX 484 MAIR T TYSGYVAQ++AS A + G CR HE RSRIF + + N + P P Sbjct: 1 MAIRVTFTYSGYVAQSIASCAGIRVGNCRSLHECFVRSRIFASPTNQNVDLE-PPAPRPS 59 Query: 485 XXXXXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXXXXXXXXMMGVF 661 T+AGE+FG + K+P GVF Sbjct: 60 RVFQSGGYRKSSTSLYSTIAGEIFGNNCKSPIAVGLIELMKSTAGVGVSGST-----GVF 114 Query: 662 GVSPLKPSSIIPFLLQGSKWLPCSQPSIGSA---------SSLLDKGGTAVTADQPSVTS 814 G+SPLK SSI+P +LQGS+WLPC++PS G SS +D+GGT S +S Sbjct: 115 GISPLKASSILP-VLQGSRWLPCNEPSPGQKNNEPSTRQNSSDVDRGGTVKCVKNGSSSS 173 Query: 815 RAKSVVQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSS 994 + A + + E + L +WLS++L+ S+DAKA FTA +V+ LFRS+ Sbjct: 174 
CCTT-------ATTTVTLEINGNELDKG-GSWLSRVLSSFSEDAKAIFTAATVNFLFRSA 225 Query: 995 LAEPRSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFI 1174 LAEPRSIPS SM PTLDVGDR+LAEKVS++FR+PEVSDIVIFKAPPILQEIGY S DVFI Sbjct: 226 LAEPRSIPSTSMCPTLDVGDRVLAEKVSFIFRQPEVSDIVIFKAPPILQEIGYSSGDVFI 285 Query: 1175 KRVVAKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFD 1354 KR+VA GD VEVR+GKL VNG+ Q EDFILEPLAYEMEPVLVPEG VFVMGDNRNNSFD Sbjct: 286 KRIVATAGDIVEVREGKLYVNGVIQHEDFILEPLAYEMEPVLVPEGYVFVMGDNRNNSFD 345 Query: 1355 SHNWGPLPIENIVGR 1399 SHNWGPLPI+NIVGR Sbjct: 346 SHNWGPLPIKNIVGR 360 >ref|XP_004501604.1| PREDICTED: probable thylakoidal processing peptidase 2, chloroplastic-like isoform X2 [Cicer arietinum] Length = 367 Score = 356 bits (914), Expect = 2e-95 Identities = 205/398 (51%), Positives = 251/398 (63%), Gaps = 5/398 (1%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIF--QTKPDHNYSSDFRPRQH 478 MAIR T ++SGYVAQNL SSA + R E SR F K D + S R Sbjct: 1 MAIRVTFSFSGYVAQNLVSSAGVRVANSRCVQECCILSRFFGHNQKRDRDRSGGGGVRNF 60 Query: 479 PXXXXXXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXXXXXXXXMMG 655 TLAGE+ E KNP MG Sbjct: 61 ----YPGRPKNSTSISAYSTLAGEILNEGCKNPIILGLISVMKSTACVSGSSTAA---MG 113 Query: 656 VFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSVVQ 835 + G+SP K SSIIPFL QGSKWLPC++ + +DKGGT Q S+ +S + Sbjct: 114 IMGISPFKTSSIIPFL-QGSKWLPCNESVPDPTTWEVDKGGT-----QCVQISKKESSLN 167 Query: 836 ERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSI 1015 +R + W+S+LLNVC++DAKA FTA++VS+LF+S LAEP+SI Sbjct: 168 QRETS------------------GWISRLLNVCTEDAKAVFTAVTVSLLFKSFLAEPKSI 209 Query: 1016 PSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKG 1195 PS+SM PTL+VGDR+L EK S+ FRKP+VSDIVIFKAPP LQE G+ +SDVFIKRVVAK Sbjct: 210 PSSSMYPTLEVGDRVLTEKFSFFFRKPDVSDIVIFKAPPWLQEFGFSASDVFIKRVVAKA 269 Query: 1196 GDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGPL 1375 GD VEVRDGKLLVN +A++E+F+LEPLAYEM P++VPEG VFVMGDNRN SFDSHNWGPL Sbjct: 270 
GDVVEVRDGKLLVNAVAEEEEFVLEPLAYEMAPMVVPEGHVFVMGDNRNKSFDSHNWGPL 329 Query: 1376 PIENIVGRSVFRYWPPSRVSDTLP--SPTATQNTVALS 1483 PIENIVGRS+FRYWPPS+ +DT+ +P N+VA+S Sbjct: 330 PIENIVGRSMFRYWPPSKAADTVTVHNPPPRNNSVAVS 367 >ref|XP_006581229.1| PREDICTED: probable thylakoidal processing peptidase 2, chloroplastic-like [Glycine max] Length = 362 Score = 353 bits (907), Expect = 1e-94 Identities = 200/386 (51%), Positives = 248/386 (64%), Gaps = 7/386 (1%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIF-----QTKPDHNYSSDFRP 469 MAIR T ++SGYVAQ+LASSA + R E R+R+F +T D + R Sbjct: 1 MAIRVTFSFSGYVAQSLASSAGVRVANSRCVQECWIRTRLFGGATQKTDLDSSAGGGVRN 60 Query: 470 RQHPXXXXXXXXXXXXXXXXXXTLAGEVFGE--SKNPXXXXXXXXXXXXXXXXXXXXXXX 643 P +LAGE G+ SK+P Sbjct: 61 FARPNCWAQSTYS---------SLAGEFLGDGCSKSPIILGLISIMKSTVGVSGSSAAAA 111 Query: 644 XMMGVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAK 823 G+FG+SP K +SIIPFL GSKWLPC++ S +DKGGT + Sbjct: 112 ---GIFGISPFKTTSIIPFL-PGSKWLPCNESVPDPTSWEVDKGGT-------------R 154 Query: 824 SVVQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAE 1003 VV E + ++ +WLS+L+NVCS+DAKAAFTAL+VS+LF+SSLAE Sbjct: 155 RVVSETES--------------NFAKISWLSRLMNVCSEDAKAAFTALTVSLLFKSSLAE 200 Query: 1004 PRSIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRV 1183 PRSIPS+SM PTL+VGDR+L EKVS+ FRKP+VSDIVIFKAPP L+E G+ SSDVFIKR+ Sbjct: 201 PRSIPSSSMYPTLEVGDRVLTEKVSFFFRKPDVSDIVIFKAPPWLEEFGFSSSDVFIKRI 260 Query: 1184 VAKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHN 1363 VAK GD VEVRDGKLL+NG A++++F+LE LAYEM+P++VPEG VFVMGDNRN SFDSHN Sbjct: 261 VAKAGDTVEVRDGKLLINGAAEEQEFVLEALAYEMDPMVVPEGYVFVMGDNRNKSFDSHN 320 Query: 1364 WGPLPIENIVGRSVFRYWPPSRVSDT 1441 WGPLP+ENIVGRS+FRYWPPS+ SDT Sbjct: 321 WGPLPVENIVGRSMFRYWPPSKASDT 346 >gb|ESW08643.1| hypothetical protein PHAVU_009G062100g [Phaseolus vulgaris] Length = 359 Score = 349 bits (896), Expect = 2e-93 Identities = 199/382 (52%), Positives = 240/382 (62%), Gaps = 3/382 (0%) Frame = +2 Query: 305 
MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQTKPDHNYSSDFRPRQHPX 484 MAIR T ++SGYVAQNL SSA ++ R E R+R+F S Sbjct: 1 MAIRVTFSFSGYVAQNLVSSAGARVANSRCVQECWIRTRLFGATQKTELDSS------AG 54 Query: 485 XXXXXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXXXXXXXXMMGVF 661 TLA E G+ K+P G+F Sbjct: 55 GVRNFARPNCWAQSTYSTLAEEFIGDGCKSPIILGLISIMKSTAGVSGSSAAAA---GIF 111 Query: 662 GVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGT--AVTADQPSVTSRAKSVVQ 835 G+SP K SSIIPFL GSKWLPC++ S +DKGGT AV D PS Sbjct: 112 GISPFKTSSIIPFL-PGSKWLPCNESVPNPTSWEVDKGGTKRAVENDVPS---------- 160 Query: 836 ERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSI 1015 ++ +WLS+LLNV SDDA+AAFTA++VS+LF+SSLAEPRSI Sbjct: 161 -------------------FAKTSWLSRLLNVSSDDARAAFTAITVSLLFKSSLAEPRSI 201 Query: 1016 PSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKG 1195 PS SM PTL+VGDR+L EKVS+ FRKP+VSDIVIF AP L++ G+ SSDVFIKR+VAK Sbjct: 202 PSLSMYPTLEVGDRVLTEKVSFFFRKPDVSDIVIFTAPRCLEKFGFTSSDVFIKRIVAKA 261 Query: 1196 GDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGPL 1375 GD VEVRDGKLLVNG+A++++F+LEPLAYEM+P++VPEG VFVMGDNRNNS DSHNWGPL Sbjct: 262 GDCVEVRDGKLLVNGVAEEQEFVLEPLAYEMDPMVVPEGYVFVMGDNRNNSLDSHNWGPL 321 Query: 1376 PIENIVGRSVFRYWPPSRVSDT 1441 PIENIVGRS+FRYWPPS+VSDT Sbjct: 322 PIENIVGRSMFRYWPPSKVSDT 343 >ref|XP_003523894.1| PREDICTED: probable thylakoidal processing peptidase 2, chloroplastic-like isoform X1 [Glycine max] Length = 362 Score = 349 bits (896), Expect = 2e-93 Identities = 195/381 (51%), Positives = 243/381 (63%), Gaps = 2/381 (0%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQTKPDHNYSSDFRPRQHPX 484 MAIR T ++SGYVAQ+LASSA + R E R+R+ +D Sbjct: 1 MAIRVTFSFSGYVAQSLASSAGVRVANSRCVQECWIRTRLSGA----TQKTDLDSSAGGV 56 Query: 485 XXXXXXXXXXXXXXXXXTLAGEVFGES-KNPXXXXXXXXXXXXXXXXXXXXXXXXMMGVF 661 TL GE G+ K+P G+F Sbjct: 57 RNFAGPKPNCWAQSTYSTLTGEFLGDGCKSPIILGLISIMKSTAGVSGSSAAAA---GIF 113 Query: 662 GVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSL-LDKGGTAVTADQPSVTSRAKSVVQE 838 G+SP K +SI+PFL GSKWLPC++ +S +DKGGT + VV + Sbjct: 114 
GISPFKTTSIVPFL-PGSKWLPCNESVPDPTTSWEVDKGGT-------------RRVVSD 159 Query: 839 RRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPRSIP 1018 + ++ +WLS+L+NVCS+DAKAAFTA++VS+LF+SSLAEPRSIP Sbjct: 160 TES--------------NFAKTSWLSRLMNVCSEDAKAAFTAVTVSLLFKSSLAEPRSIP 205 Query: 1019 SASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVAKGG 1198 S+SM PTL+VGDR+L EKVS+ FRKP+VSDIVIFKAPP L+E G+ SSDVFIKR+VAK G Sbjct: 206 SSSMYPTLEVGDRVLTEKVSFFFRKPDVSDIVIFKAPPCLEEFGFSSSDVFIKRIVAKAG 265 Query: 1199 DYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWGPLP 1378 D VEVRDGKLLVNG A++ F++EPLAYEM+P++VPEG VFVMGDNRNNSFDSHNWGPLP Sbjct: 266 DTVEVRDGKLLVNGAAEERQFVVEPLAYEMDPMVVPEGYVFVMGDNRNNSFDSHNWGPLP 325 Query: 1379 IENIVGRSVFRYWPPSRVSDT 1441 +ENIVGRS+FRYWPPS+VSDT Sbjct: 326 VENIVGRSMFRYWPPSKVSDT 346 >ref|XP_006417872.1| hypothetical protein EUTSA_v10008025mg [Eutrema salsugineum] gi|557095643|gb|ESQ36225.1| hypothetical protein EUTSA_v10008025mg [Eutrema salsugineum] Length = 358 Score = 349 bits (895), Expect = 3e-93 Identities = 199/385 (51%), Positives = 243/385 (63%), Gaps = 5/385 (1%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGT--CRFFHEFSARSRIF--QTKPDHNYSSDFRPR 472 MA+R T TYS YVA+N+ASSA ++ GT R E R R F KPD + SS Sbjct: 1 MALRVTFTYSSYVARNIASSAATRVGTGDVRSCFECWVRPRFFGHNQKPDMDKSSGSNTL 60 Query: 473 QHPXXXXXXXXXXXXXXXXXXTLAGEVFGE-SKNPXXXXXXXXXXXXXXXXXXXXXXXXM 649 P T+A E+ E S++P Sbjct: 61 ARPASMYS-------------TIAREILEEGSQSPLVLGMISIIKLTAPPELLG------ 101 Query: 650 MGVFGVSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSV 829 M V G+SP K SS+IPFL +GSKW+PCS P+ ++ D V K Sbjct: 102 MNVLGISPFKTSSVIPFL-RGSKWMPCSIPA-------------TLSTDFADVDRGVK-- 145 Query: 830 VQERRNACSAASSEAMADNLKLSRNNWLSKLLNVCSDDAKAAFTALSVSILFRSSLAEPR 1009 AC A +++ N W++KLLN+CS+DAKAAFTA++VS+LFRS+LAEP+ Sbjct: 146 ------ACDAKVKLGLSNKGSNVGNGWVNKLLNICSEDAKAAFTAVTVSLLFRSALAEPK 199 Query: 1010 SIPSASMSPTLDVGDRILAEKVSYVFRKPEVSDIVIFKAPPILQEIGYGSSDVFIKRVVA 1189 SIPS SM PTLDVGDR++AEKVSY+FRKPEVSDIVIFKAPP+L E GY +DVFIKR+VA Sbjct: 200 
SIPSTSMYPTLDVGDRVIAEKVSYIFRKPEVSDIVIFKAPPVLVEHGYNCTDVFIKRIVA 259 Query: 1190 KGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAVFVMGDNRNNSFDSHNWG 1369 GD+VEV DGKLLVN Q EDF+LEP+ YEMEP+ VPEG VFV+GDNRN SFDSHNWG Sbjct: 260 SEGDWVEVCDGKLLVNDTVQAEDFVLEPIDYEMEPMFVPEGYVFVLGDNRNKSFDSHNWG 319 Query: 1370 PLPIENIVGRSVFRYWPPSRVSDTL 1444 PLPI+NI+GRSVFRYWPPS+VSDT+ Sbjct: 320 PLPIKNIIGRSVFRYWPPSKVSDTI 344 >gb|EPS69411.1| hypothetical protein M569_05352, partial [Genlisea aurea] Length = 391 Score = 348 bits (893), Expect = 5e-93 Identities = 204/402 (50%), Positives = 248/402 (61%), Gaps = 22/402 (5%) Frame = +2 Query: 305 MAIRFTVTYSGYVAQNLASSATSKAGTCRFFHEFSARSRIFQTKPDHNYSSDFRPRQHPX 484 MA+RF V +S VA NLA+ + K R+ +E R R+FQ P DFR HP Sbjct: 1 MAVRFAVGFSASVASNLAN-CSGKCAASRYLNECLTRPRVFQHTPSRK-RDDFR---HPV 55 Query: 485 XXXXXXXXXXXXXXXXXTLAGEVFGESKNPXXXXXXXXXXXXXXXXXXXXXXXXMMGVFG 664 LA + GE + V G Sbjct: 56 SSPDSFLPDSSFAS---VLARGILGEGDQSSVITGLMSLVKHSN-----------ISVLG 101 Query: 665 VSPLKPSSIIPFLLQGSKWLPCSQPSIGSASSLLDKGGTAVTADQPSVTSRAKSVVQERR 844 VSP+K SSI+PF GSKWLPC+QP+ ++ +D+GGT+ + S + V Sbjct: 102 VSPVKVSSILPFF-PGSKWLPCNQPT----ATEVDRGGTSSQSKGDSTGEQTTETVSVGV 156 Query: 845 NACSAASSEAMADNLKL-------------------SRNNWLSKLLNVC--SDDAKAAFT 961 N + + AM N + S ++W+ KL+N+C S+DAKA FT Sbjct: 157 NESKCSEAFAMLKNAQAGSFEVLPQSMKEEDSPRSSSGSSWMLKLMNLCFSSEDAKAIFT 216 Query: 962 ALSVSILFRSSLAEPRSIPSASMSPTLDVGDRILAEKV-SYVFRKPEVSDIVIFKAPPIL 1138 A SVSIL++S+LAEPRSIPS SM PTLDVGDRILAEKV SY+FR PEVSDIVIFKAP L Sbjct: 217 AFSVSILYKSTLAEPRSIPSRSMYPTLDVGDRILAEKVISYIFRSPEVSDIVIFKAPSFL 276 Query: 1139 QEIGYGSSDVFIKRVVAKGGDYVEVRDGKLLVNGIAQDEDFILEPLAYEMEPVLVPEGAV 1318 QE G+ SDVF+KRVVAK GDYVEV DGKL+VNGIAQDEDF+LEP+ YEM+PVLVPEG V Sbjct: 277 QEFGFSPSDVFVKRVVAKAGDYVEVCDGKLMVNGIAQDEDFVLEPVEYEMDPVLVPEGYV 336 Query: 1319 FVMGDNRNNSFDSHNWGPLPIENIVGRSVFRYWPPSRVSDTL 1444 FV+GDNRNNSFDSHNWGPLPI++IVGRSVFRYWPP++VSDTL Sbjct: 337 FVLGDNRNNSFDSHNWGPLPIDDIVGRSVFRYWPPTKVSDTL 378