BLASTX nr result
ID: Mentha28_contig00010878
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00010878 (911 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

gb|EYU32504.1| hypothetical protein MIMGU_mgv1a001590mg [Mimulus...  290  4e-76
gb|EPS67387.1| hypothetical protein M569_07378 [Genlisea aurea]  191  3e-46
ref|XP_004249845.1| PREDICTED: LOW QUALITY PROTEIN: protein tran...  171  5e-40
ref|XP_006351128.1| PREDICTED: protein translocase subunit SECA2...  167  7e-39
ref|XP_002520315.1| F-box and wd40 domain protein, putative [Ric...  157  7e-36
ref|XP_006371362.1| hypothetical protein POPTR_0019s09450g [Popu...  151  3e-34
ref|XP_006416281.1| hypothetical protein EUTSA_v10006535mg [Eutr...  150  9e-34
ref|XP_007225286.1| hypothetical protein PRUPE_ppa001361mg [Prun...  149  1e-33
gb|AAD41418.1|AC007727_7 Contains PF|00097 Zinc finger C3HC4 typ...  147  7e-33
ref|NP_001185058.1| protein translocase subunit SECA2 [Arabidops...  147  7e-33
ref|NP_001117325.1| zinc ion binding protein [Arabidopsis thalia...  147  7e-33
ref|XP_006306193.1| hypothetical protein CARUB_v10011823mg [Caps...  145  2e-32
ref|XP_002284600.2| PREDICTED: uncharacterized protein LOC100248...  143  8e-32
ref|XP_002893179.1| preprotein translocase secA family protein [...  143  1e-31
ref|XP_006472845.1| PREDICTED: protein translocase subunit SECA2...  138  3e-30
ref|XP_006434275.1| hypothetical protein CICLE_v10000294mg [Citr...  136  1e-29
gb|EXB28435.1| Myosin heavy chain kinase B [Morus notabilis]  135  3e-29
ref|XP_007019190.1| Preprotein translocase SecA family protein, ...  133  1e-28
ref|XP_007019188.1| Preprotein translocase SecA family protein, ...  133  1e-28
ref|XP_007019187.1| Preprotein translocase SecA family protein, ...  133  1e-28

>gb|EYU32504.1| hypothetical protein MIMGU_mgv1a001590mg [Mimulus guttatus]
Length = 789

Score = 290 bits (743), Expect = 4e-76
Identities = 151/289 (52%), Positives = 192/289 (66%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQPYDAV+AIPRVLTCGHTTCEACLK LP PF NTIRCTVCT
Sbjct: 9   CPVCLQPYDAVSAIPRVLTCGHTTCEACLKQLPNPFPNTIRCTVCTLLVKFLNCPSSLPK 68

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L+FSS LQ +EK V PS P G + P + W+YE Y +W++WILP
Sbjct: 69  NLDLLHFSSALQNRHRTKEKIVNSPS-PHPPGTKHF---PPTVNSWSYEVYRKWKKWILP 124

Query: 507 EDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSKLI 328
           EDC+ I E ++D G + G VL+ FESD V+G VL+E E VGL +G+FVE +A+SK
Sbjct: 125 EDCISIVEFGSESDGGGVCGTVLKYFESDHVIGSVLKEGETVGLFVIGVFVEDQANSKYF 184

Query: 327 KPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKLAS 148
           SYESRI VL MKE+++ +L V + A+ RV+NVGKAYGFW + +D CVYIV EK S
Sbjct: 185 NSSYESRIAAVLCRMKEEDKTQLEVILCASLRVNNVGKAYGFWYNEDDKCVYIVFEKFKS 244

Query: 147 SNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVGY 1
           N ++CV K ++ E+ LS DE+ + M+G+E CEIL RL+ EGL +G+
Sbjct: 245 PN-LNCVLKQKESEEGDLSTDEIRGMAMLGLEACEILSRLNSEGLIIGF 292

>gb|EPS67387.1| hypothetical protein M569_07378 [Genlisea aurea]
Length = 1757

Score = 191 bits (485), Expect = 3e-46
Identities = 106/290 (36%), Positives = 168/290 (57%), Gaps = 2/290 (0%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCL+PYDAV+ +PRV+ CGHT C+ CL +P PF +TIRC +CT
Sbjct: 9   CPVCLEPYDAVSIVPRVIACGHTVCQVCLGKIPNPFPDTIRCPICTALVRCPSPPTSLPK 68

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L+FS L+ E+++V + + P LK W+ + Y +W++WI+
Sbjct: 69  NLDLLHFSIGLRNRRSVEDEKVASTRALRV----NEVSFPFALKSWSDDLYRKWKKWIIS 124

Query: 507 EDCVLIGETDPDNDNGV-LGGKVLRSFESD-RVMGCVLREKEDVGLIKVGIFVESEADSK 334
           D V + + D + + GK L S + D + CVLR+++++ L+++G+ + +S
Sbjct: 125 RDFVSVEKASDRCDYEIAVSGKFLGSCDGDYGPIFCVLRDEQELSLVRIGVLSQGGLNS- 183

Query: 333 LIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKL 154
           + SYESRIL L M+E+ER+KL ++AT +V+N+ KA G W + + VY+V KL
Sbjct: 184 -FRRSYESRILMFLSSMEEEERNKLVKLLNATLKVNNIVKACGLWYNEDGNGVYVVFPKL 242

Query: 153 ASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           S+ I+ V + KE L A+++++L ++GME+CEIL LH EGL +G
Sbjct: 243 DSAKLIEYVCR----HKEKLKAEDVTWLALLGMEMCEILCSLHSEGLILG 288

>ref|XP_004249845.1| PREDICTED: LOW QUALITY PROTEIN: protein translocase subunit SECA2, chloroplastic-like [Solanum lycopersicum]
Length = 1855

Score = 171 bits (432), Expect = 5e-40
Identities = 101/297 (34%), Positives = 156/297 (52%), Gaps = 8/297 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ Y V+ IPRVL CGH+ CE CL + PF TIRC CTQ
Sbjct: 14  CPVCLQQYGDVSTIPRVLPCGHSACEDCLSQIQNPFPGTIRCPACTQLVKLPNPISSLPK 73

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKP--WTYEFYCEWRRWI 514
           L F +L + + K + ++ P +KP W++EFY W+ W+
Sbjct: 74  NIDLLRFFTLTHHNSNDNSK-------GSHVSTQKYDKDPIFIKPPLWSHEFYSNWKTWV 126

Query: 513 LPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSK 334
           LPED ++I +++ V GKVL+ S MGCVL+E E V L+++G F + K
Sbjct: 127 LPEDTIII-----ESNASVSYGKVLKVSTSVSSMGCVLKEGEKVSLLEIGYFAKGSCSCK 181

Query: 333 LIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKL 154
           + SYE ++++VL+G+ E ER +L I A+ + + K YGFW + ++ VY+V E
Sbjct: 182 -FEYSYEVKLMSVLYGLSEGERTELESIIKASLALHVMCKVYGFWYNTDNHYVYMVSEAF 240

Query: 153 ASS------NFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVGY 1
           + S + V + +EK +A+ + +VG++IC+++ LHL GL +G+
Sbjct: 241 SGSLLGKMGVLRNAVVEKNAEEKICNAAEFV----IVGLDICQMVSDLHLRGLVLGF 293

>ref|XP_006351128.1| PREDICTED: protein translocase subunit SECA2, chloroplastic-like isoform X3 [Solanum tuberosum]
Length = 1844

Score = 167 bits (422), Expect = 7e-39
Identities = 101/296 (34%), Positives = 154/296 (52%), Gaps = 8/296 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ Y V+ IPRVL CGH+ CE CL L PF TIRC CTQ
Sbjct: 14  CPVCLQQYGDVSTIPRVLPCGHSACEDCLAQLQNPFPGTIRCPACTQLVKLPNPISSLPK 73

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKP--WTYEFYCEWRRWI 514
           L FS+L + N + S + ++ P +KP W++EFY W+ W+
Sbjct: 74  NIDLLRFSTLPHHN--NNDN-----SKGSHVSTQKYDKDPIFIKPPLWSHEFYSNWKTWV 126

Query: 513 LPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSK 334
           LPED ++I +++ V GKVL+ S MGC L+E E V L+++G F + K
Sbjct: 127 LPEDTIII-----ESNGSVCYGKVLKVSTSVSSMGCALKEGEKVSLLEIGYFAKGSCSYK 181

Query: 333 LIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKL 154
           + SYE ++++VL+G+ E R +L I A+ + + K YGFW + ++ CVY+V E
Sbjct: 182 -FEYSYEVKLMSVLYGLSEGGRTELESIIKASLALHVMCKVYGFWYNMDNHCVYMVSEAF 240

Query: 153 ASS------NFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           + S + V + +EK +A+ + +V ++IC+++ L L GL +G
Sbjct: 241 SGSLLGKMGVLRNAVLEKNAEEKISNAAEFV----IVSLDICQMVSDLQLRGLVLG 292

>ref|XP_002520315.1| F-box and wd40 domain protein, putative [Ricinus communis]
 gi|223540534|gb|EEF42101.1| F-box and wd40 domain protein, putative [Ricinus communis]
Length = 1794

Score = 157 bits (396), Expect = 7e-36
Identities = 105/296 (35%), Positives = 141/296 (47%), Gaps = 8/296 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD AIPRVLTCGHTTCE+CLK LP+ + TIRC C Q
Sbjct: 6   CPVCLQNYDGEYAIPRVLTCGHTTCESCLKSLPQKYPQTIRCPACVQLVK---------- 55

Query: 687 XXXXLYFSSLLQRSCPN--EEKRVIPPS----SPAEFGGRESAASPSVLKPWTYEFYCEW 526
           F SL S P + R+IP + P S W+ +F+ W
Sbjct: 56  ------FPSLGPSSLPKNIDLLRLIPTNHKKKQPINHSRSSDHQVDSASFLWSDDFFVTW 109

Query: 525 RRWILPEDCVLIGETDPDNDNGVL--GGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVE 352
           + W+L +D VL+ E+ + D GVL G K LR F K GL+ V
Sbjct: 110 KNWVLEKDAVLVDES--EKDCGVLKDGNKKLRLF------------KVADGLLDV----- 150

Query: 351 SEADSKLIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVY 172
           + K SY SRI+ L+G+ R++L + + +GK YGFWC +++ +Y
Sbjct: 151 -NGSGFIFKLSYASRIMNCLYGLGNVVREELSLILGICLEHYRIGKFYGFWCDSQNGFLY 209

Query: 171 IVCEKLASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           +VCE+ F V K G S D ++ + GMEICE + LHLEGL +G
Sbjct: 210 LVCER-----FNVGVMDHSGCSKNGSSKDGLASFAVTGMEICEAIIGLHLEGLFMG 260

>ref|XP_006371362.1| hypothetical protein POPTR_0019s09450g [Populus trichocarpa]
 gi|550317115|gb|ERP49159.1| hypothetical protein POPTR_0019s09450g [Populus trichocarpa]
Length = 833

Score = 151 bits (382), Expect = 3e-34
Identities = 100/295 (33%), Positives = 140/295 (47%), Gaps = 7/295 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCL YD IPRVL CGHTTCE+CLK +P+ + TIRC CTQ
Sbjct: 9   CPVCLSTYDGEYTIPRVLACGHTTCESCLKNIPQKYPLTIRCPACTQLVKYPSQQGPSSL 68

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKP--WTYEFYCEWRRWI 514
           + Q N +K P++ ++ A P W+ EFY W+ W+
Sbjct: 69  PKNIDLLRLVQQLQDHNPQK----PNNKSQIDKPVLAQDFDFFVPPSWSDEFYTSWKNWV 124

Query: 513 LPEDCVLIGETDPDNDNGVL--GGKVLRSFESDRVMGCVLREKEDVGLIKVGI---FVES 349
           L D V + D + G+L G K K V L KVG +
Sbjct: 125 LDRDDVFV--EDKERGYGLLKEGNK-----------------KVKVRLFKVGNDGGLLSG 165

Query: 348 EADSKLIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYI 169
           + + K SY ++++ +L GMKE++RD+LG + + + K G WC ED +Y
Sbjct: 166 KVKGCVFKLSYVAKVMNLLNGMKEEKRDELGFILRICAKQGRICKGCGLWCDLEDGVLYF 225

Query: 168 VCEKLASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           VCE+L + N +D + D + GLS D +S M+GME+ E + LHLEGL VG
Sbjct: 226 VCERL-NGNVLDML----GDFENGLSKDGLSSFAMIGMEMYEAVIGLHLEGLIVG 275

>ref|XP_006416281.1| hypothetical protein EUTSA_v10006535mg [Eutrema salsugineum]
 gi|557094052|gb|ESQ34634.1| hypothetical protein EUTSA_v10006535mg [Eutrema salsugineum]
Length = 1804

Score = 150 bits (378), Expect = 9e-34
Identities = 97/292 (33%), Positives = 138/292 (47%), Gaps = 4/292 (1%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD +PRVL+CGHT CE CLK LP+ F NTIRC CT
Sbjct: 6   CPVCLQSYDGECTVPRVLSCGHTACEECLKNLPKKFPNTIRCPACTVLVKFPPQGPSALP 65

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSS----PAEFGGRESAASPSVLKPWTYEFYCEWRR 520
           L R P+ + + P P EF V + W+ +FY W+
Sbjct: 66  KNID------LLRLFPSVSRITLEPGKNLKKPIEF----------VTRSWSDDFYTTWKD 109

Query: 519 WILPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEAD 340
           IL D V + + + + F S R + L++ V L++V F+ + D
Sbjct: 110 RILLHDAVSVENVESEGSD----------FGSSRRLCGWLKDDSRVSLLRVASFLNDDCD 159

Query: 339 SKLIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCE 160
           S L+K SY R+++ LW M+E+ERD+L IS R + K +G W ++ +Y+V E
Sbjct: 160 S-LLKYSYVQRMMSCLWEMREEERDELDTIISVKQR--GISKVFGLWGDLKNGVLYLVGE 216

Query: 159 KLASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           KL + +E + L DE S ++GM+ICE L LH EG+ G
Sbjct: 217 KLTGYSC---------EEFDYLDEDETSCFAVIGMQICEALLNLHKEGVITG 259

>ref|XP_007225286.1| hypothetical protein PRUPE_ppa001361mg [Prunus persica]
 gi|462422222|gb|EMJ26485.1| hypothetical protein PRUPE_ppa001361mg [Prunus persica]
Length = 845

Score = 149 bits (377), Expect = 1e-33
Identities = 98/294 (33%), Positives = 139/294 (47%), Gaps = 6/294 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD IPRVL CGH+ CEACL LP + TIRC CTQ
Sbjct: 9   CPVCLQNYDGEYTIPRVLACGHSACEACLVRLPERYPETIRCPACTQLVKYPPLGPTALP 68

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L SL PN P + + + W+ EFY W+ W+LP
Sbjct: 69  KNIDLLSFSLSLNPNPNSRSSQNPQKQSTD------GVCKFLPRIWSDEFYDTWKEWVLP 122

Query: 507 EDCVL----IGETDPDNDNGVLGGKVLRSFESDRVMGCV-LREKEDVGLIKVGIFVESEA 343
           D + +G+ D VL G+ G V RE + V ++VG
Sbjct: 123 SDALSVETEVGDVTRDGLCTVLKGRTGSGSGFGLGSGRVWFREDQSVSFVQVGSL--PNL 180

Query: 342 DSKLIKPSYESRILTVLWGMKEDERDKLGVFISATFR-VSNVGKAYGFWCHNEDVCVYIV 166
           S + SY +R++ L GM+E ER++LG+ + A+ R VGK YG W ++ED +Y+V
Sbjct: 181 GSSGFEFSYIARVMKCLSGMREGERNELGLLLRASVRQCRKVGKVYGLWGNSEDGFLYVV 240

Query: 165 CEKLASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           CE+ + +F + + +E + +G D +S M+ ME+CE + LH EG G
Sbjct: 241 CER-RNGSFSEKL--NELRDGDGFGKDGLSAFAMIAMEVCEAVTGLHSEGFASG 291

>gb|AAD41418.1|AC007727_7 Contains PF|00097 Zinc finger C3HC4 type and 4 WD40 PF|00400 (G beta) domains [Arabidopsis thaliana]
Length = 860

Score = 147 bits (370), Expect = 7e-33
Identities = 98/288 (34%), Positives = 136/288 (47%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD + +PRVL CGHT CE CL LP+ F +TIRC CT
Sbjct: 6   CPVCLQSYDGESTVPRVLACGHTACEECLTNLPKKFPDTIRCPACTVLVKFPPQGPSALP 65

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L R P+ K + P G V + W+ +FY W+ IL
Sbjct: 66  KNID------LLRLFPSISKLKLEP------GRNFEKVVEFVTRSWSDDFYATWKDRILV 113

Query: 507 EDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSKLI 328
           D V + + ++ + F+S + LR+ V L++V F + DS ++
Sbjct: 114 HDAVSVEIRESESSD----------FDSSSRLCGSLRDDSKVSLLRVASFEHGDCDS-VL 162

Query: 327 KPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKLAS 148
           K SY R+++ LWGM+E+ERD+L IS R V K +G W ++ +Y+V EKL
Sbjct: 163 KYSYVQRMMSCLWGMREEERDELDAIISVKQR--GVSKVFGLWGDLKNGVLYLVGEKLIG 220

Query: 147 SNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           + E D E DE LG++GM+ICE L LH EGL G
Sbjct: 221 FSL------EEFDSLE----DETLRLGIIGMQICEALLNLHKEGLITG 258

>ref|NP_001185058.1| protein translocase subunit SECA2 [Arabidopsis thaliana]
 gi|332192012|gb|AEE30133.1| protein translocase subunit SECA2 [Arabidopsis thaliana]
Length = 1805

Score = 147 bits (370), Expect = 7e-33
Identities = 98/288 (34%), Positives = 136/288 (47%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD + +PRVL CGHT CE CL LP+ F +TIRC CT
Sbjct: 6   CPVCLQSYDGESTVPRVLACGHTACEECLTNLPKKFPDTIRCPACTVLVKFPPQGPSALP 65

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L R P+ K + P G V + W+ +FY W+ IL
Sbjct: 66  KNID------LLRLFPSISKLKLEP------GRNFEKVVEFVTRSWSDDFYATWKDRILV 113

Query: 507 EDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSKLI 328
           D V + + ++ + F+S + LR+ V L++V F + DS ++
Sbjct: 114 HDAVSVEIRESESSD----------FDSSSRLCGSLRDDSKVSLLRVASFEHGDCDS-VL 162

Query: 327 KPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKLAS 148
           K SY R+++ LWGM+E+ERD+L IS R V K +G W ++ +Y+V EKL
Sbjct: 163 KYSYVQRMMSCLWGMREEERDELDAIISVKQR--GVSKVFGLWGDLKNGVLYLVGEKLIG 220

Query: 147 SNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           + E D E DE LG++GM+ICE L LH EGL G
Sbjct: 221 FSL------EEFDSLE----DETLRLGIIGMQICEALLNLHKEGLITG 258

>ref|NP_001117325.1| zinc ion binding protein [Arabidopsis thaliana]
 gi|17529236|gb|AAL38845.1| putative SecA-type chloroplast protein transport factor [Arabidopsis thaliana]
 gi|20465933|gb|AAM20152.1| putative SecA-type chloroplast transport factor protein [Arabidopsis thaliana]
 gi|110739333|dbj|BAF01579.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192014|gb|AEE30135.1| zinc ion binding protein [Arabidopsis thaliana]
Length = 811

Score = 147 bits (370), Expect = 7e-33
Identities = 98/288 (34%), Positives = 136/288 (47%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD + +PRVL CGHT CE CL LP+ F +TIRC CT
Sbjct: 6   CPVCLQSYDGESTVPRVLACGHTACEECLTNLPKKFPDTIRCPACTVLVKFPPQGPSALP 65

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L R P+ K + P G V + W+ +FY W+ IL
Sbjct: 66  KNID------LLRLFPSISKLKLEP------GRNFEKVVEFVTRSWSDDFYATWKDRILV 113

Query: 507 EDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSKLI 328
           D V + + ++ + F+S + LR+ V L++V F + DS ++
Sbjct: 114 HDAVSVEIRESESSD----------FDSSSRLCGSLRDDSKVSLLRVASFEHGDCDS-VL 162

Query: 327 KPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKLAS 148
           K SY R+++ LWGM+E+ERD+L IS R V K +G W ++ +Y+V EKL
Sbjct: 163 KYSYVQRMMSCLWGMREEERDELDAIISVKQR--GVSKVFGLWGDLKNGVLYLVGEKLIG 220

Query: 147 SNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           + E D E DE LG++GM+ICE L LH EGL G
Sbjct: 221 FSL------EEFDSLE----DETLRLGIIGMQICEALLNLHKEGLITG 258

>ref|XP_006306193.1| hypothetical protein CARUB_v10011823mg [Capsella rubella]
 gi|482574904|gb|EOA39091.1| hypothetical protein CARUB_v10011823mg [Capsella rubella]
Length = 1799

Score = 145 bits (366), Expect = 2e-32
Identities = 99/290 (34%), Positives = 144/290 (49%), Gaps = 2/290 (0%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD ++PRVL CGHT CE CL LP+ F +TIRC CT
Sbjct: 6   CPVCLQSYDGECSVPRVLACGHTACEECLTNLPKKFPDTIRCPACTVLVKFPLQGP---- 61

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPS--VLKPWTYEFYCEWRRWI 514
           S L ++ + R+ P S + + P V + W+ +FY W+ I
Sbjct: 62  --------SALPKNI--DLLRLFPSVSQIKLESGRNFKKPVEFVTRSWSDDFYATWKDRI 111

Query: 513 LPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSK 334
           L D V + +NG G++ S R+ G L+ V L++V F + DS
Sbjct: 112 LVHDAVSV-------ENGE--GEISDLASSSRLFGS-LKNDSKVSLLRVASFELDDCDS- 160

Query: 333 LIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKL 154
           ++K SY R+++ LWG+K++ERD+L IS R V K +G W ++ +Y+V EKL
Sbjct: 161 VLKYSYVQRMMSCLWGLKDEERDELEKIISIMQR--GVSKVFGLWGDLKNGVLYLVGEKL 218

Query: 153 ASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           + +E + L+ D+ S LG+VGM+ICE L LH EG+ G
Sbjct: 219 IEFPW---------EEFDSLTDDDASRLGIVGMQICEALLNLHKEGVISG 259

>ref|XP_002284600.2| PREDICTED: uncharacterized protein LOC100248990 [Vitis vinifera]
Length = 1817

Score = 143 bits (361), Expect = 8e-32
Identities = 94/289 (32%), Positives = 140/289 (48%), Gaps = 1/289 (0%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD AIPRVL CGHT CEAC+ LP+ F +TIRC CTQ
Sbjct: 8   CPVCLQTYDTDQAIPRVLACGHTACEACITHLPQRFLDTIRCPACTQLVKFSHLQGPSAL 67

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L ++ + P +S EF + + W+ +FY W+ W+LP
Sbjct: 68  PKNIDLLRLCLSEDSDYQKPQKRPITSHYEF----------LPRLWSDQFYSVWKDWVLP 117

Query: 507 EDCVLIGETDPDNDNGVLGGKVLRSFESD-RVMGCVLREKEDVGLIKVGIFVESEADSKL 331
           D V + + V+ G++ S S V+ ++E ++V L+++ S + +
Sbjct: 118 NDAVSVEPRGGKDFCDVIHGRIASSSSSSPSVIRWWMKENQNVSLVRIASL--SFVNDSV 175

Query: 330 IKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKLA 151
           I SY +RI+ L GMKE++R +LG+ + R YG W +D +Y+VCE+
Sbjct: 176 ISFSYMARIMNCLNGMKEEKRYELGLIL----RQRKTCGVYGLWYDLDDQWMYLVCERWE 231

Query: 150 SSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           D V K + + E + D + M+GMEIC+ + LH EGL G
Sbjct: 232 G----DLVEKISELKNE-VVEDGIFCFAMMGMEICKAIIGLHSEGLVSG 275

>ref|XP_002893179.1| preprotein translocase secA family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339021|gb|EFH69438.1| preprotein translocase secA family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1579

Score = 143 bits (360), Expect = 1e-31
Identities = 94/288 (32%), Positives = 137/288 (47%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ +D + +PRVL CGHT CE CL LP+ F +TIRC CT
Sbjct: 6   CPVCLQSFDGESTVPRVLACGHTACEECLTNLPKKFPDTIRCPACTVLVKFPPQGPSALP 65

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVLKPWTYEFYCEWRRWILP 508
           L R P+ K + P G A V++ W+ +FY W+ IL
Sbjct: 66  KNID------LLRLFPSISKIKLEP------GRNFKKAVEFVIRSWSDDFYATWKDRILV 113

Query: 507 EDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSKLI 328
           D V + + ++ + F S ++ LR+ V L++V F + DS ++
Sbjct: 114 HDAVSVEIRESESSD----------FASASLLCGSLRDDLKVSLLRVASFEHDDCDS-VL 162

Query: 327 KPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCEKLAS 148
           K SY R+++ LWGM+E+E D+L IS R V K +G W ++ +Y+V EKL
Sbjct: 163 KYSYVLRMMSCLWGMREEEIDELDAIISVKLR--GVSKVFGLWGDLKNGVLYLVGEKLTG 220

Query: 147 SNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           E + L+ D+ S + ++GM+ICE L LH EGL G
Sbjct: 221 FLL----------EFDSLTEDDTSRVAIIGMQICEALLNLHKEGLITG 258

>ref|XP_006472845.1| PREDICTED: protein translocase subunit SECA2, chloroplastic-like isoform X4 [Citrus sinensis]
Length = 1812

Score = 138 bits (348), Expect = 3e-30
Identities = 97/292 (33%), Positives = 140/292 (47%), Gaps = 4/292 (1%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD IPRVLTCGHT CE+CL LP+ F TIRC CT
Sbjct: 6   CPVCLQSYDGECTIPRVLTCGHTACESCLSNLPQKFPLTIRCPACTVLVKYPPQGP---- 61

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAAS----PSVLKPWTYEFYCEWRR 520
           + L ++ + R+I P+SP ++ + + + W+ EFY W++
Sbjct: 62  --------TFLPKNI--DLLRLIDPASPKPLKNPKNFENVLEFDFIPRTWSNEFYTFWKQ 111

Query: 519 WILPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEAD 340
           ++LP+D VL ET + D G G LR S RV ++K+G + + D
Sbjct: 112 YVLPKDSVLF-ETKAEEDCGFRFG-CLRENLSQRV-----------SVVKLGSLCDDDDD 158

Query: 339 SKLIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCE 160
           S + K SY R++ L GM + RD+L + + R + G W ED + +VCE
Sbjct: 159 S-VFKYSYLMRVMNCLSGMIVEVRDQLDLILRTASRQIKCCRVLGLWGDMEDGFLCLVCE 217

Query: 159 KLASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           +L +D + +GL D +S M+GMEICE L L+ +G T G
Sbjct: 218 RLNEIERLDFLRNG-----DGLCNDGLSSFAMMGMEICEALIGLNKQGFTAG 264

>ref|XP_006434275.1| hypothetical protein CICLE_v10000294mg [Citrus clementina]
 gi|557536397|gb|ESR47515.1| hypothetical protein CICLE_v10000294mg [Citrus clementina]
Length = 821

Score = 136 bits (342), Expect = 1e-29
Identities = 95/292 (32%), Positives = 138/292 (47%), Gaps = 4/292 (1%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD IPRVLTCGHT CE+CL LP+ F TIRC CT
Sbjct: 6   CPVCLQSYDGECTIPRVLTCGHTACESCLLNLPQKFPLTIRCPACTVLVKYPPQGP---- 61

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAAS----PSVLKPWTYEFYCEWRR 520
           + L ++ + R+I P+SP ++ + + + W+ EFY W++
Sbjct: 62  --------TFLPKNI--DLLRLIDPASPKPLKNPKNFENVLEFDFIPRTWSNEFYTFWKQ 111

Query: 519 WILPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEAD 340
           ++LP+D VL E + D G G LR +S RV ++K+G + D
Sbjct: 112 YVLPKDSVLF-EAKAEEDCGFRFG-CLRENQSQRV-----------SVVKLGSLCDD--D 156

Query: 339 SKLIKPSYESRILTVLWGMKEDERDKLGVFISATFRVSNVGKAYGFWCHNEDVCVYIVCE 160
           + K SY R++ L GM + RD+L + + R + G W ED + +VCE
Sbjct: 157 DSVFKYSYLMRVMNCLSGMIVEVRDQLDLILRTASRQIKCCRVLGLWGDMEDGFLCLVCE 216

Query: 159 KLASSNFIDCVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           +L +D + +GL D +S M+GMEICE L L+ +G T G
Sbjct: 217 RLNEIERLDFLRNG-----DGLCNDGLSSFAMMGMEICEALISLNKQGFTAG 263

>gb|EXB28435.1| Myosin heavy chain kinase B [Morus notabilis]
Length = 838

Score = 135 bits (339), Expect = 3e-29
Identities = 96/296 (32%), Positives = 143/296 (48%), Gaps = 7/296 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQ YD + +PRVL+CGH+ CE+CL LP F TIRC CTQ
Sbjct: 8   CPVCLQNYDGDSTVPRVLSCGHSACESCLSKLPERFPLTIRCPACTQLVKFPPQGPSVLP 67

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRESAASPSVL-KPWTYEFYCEWRRWIL 511
           L SL PN S+ + R+ L + W+ EFY W+ W+L
Sbjct: 68  KNIDLLSFSLPPNPNPNS-------STSEDKRSRKLGRFYDFLPRFWSDEFYAAWKDWVL 120

Query: 510 PEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADSKL 331
           P D V + E G K F D+ V L +V E + S
Sbjct: 121 PNDAVWVEER---------GAKARVWFGEDK----------KVSLGRVVSLPELKDSS-- 159

Query: 330 IKPSYESRILTVLWGMKEDERDKLGVFI-SATFRVS-NVGKAYGFWCHNEDVCVYIVCEK 157
           + SY R++ L GMKE+ER++LG+ + S + R S +G+ YG W + +D +Y+VCE+
Sbjct: 160 FEFSYVVRVMKCLSGMKEEERNELGLILRSGSMRNSRKIGRVYGLWGNLDDGFLYMVCER 219

Query: 156 LASSNFIDCV--FKSE--KDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVGY 1
           + + ++ + K+E +E+EGLS + ++G+E+ E + LH EG G+
Sbjct: 220 MDGGSLLEKISDLKNEFCGEEEEGLSKIGVFSFALIGLEMIEAVMGLHSEGFISGF 275

>ref|XP_007019190.1| Preprotein translocase SecA family protein, putative isoform 10 [Theobroma cacao]
 gi|508724518|gb|EOY16415.1| Preprotein translocase SecA family protein, putative isoform 10 [Theobroma cacao]
Length = 645

Score = 133 bits (334), Expect = 1e-28
Identities = 99/295 (33%), Positives = 127/295 (43%), Gaps = 7/295 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQPYD V AIPRVL CGHT CE CL LP+ IRC CT
Sbjct: 9   CPVCLQPYDGVCAIPRVLACGHTVCETCLVNLPQKLPGAIRCPACTVLVKYPPEGP---- 64

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRES---AASPSVLKPWTYEFYCEWRRW 517
           S L ++ E R+IP S +S + P + + W+ EFY W+ +
Sbjct: 65  --------STLPKNI--ELLRLIPGSGSTRKHVNKSPHDSRVPFLPRSWSDEFYSNWKIY 114

Query: 516 ILPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADS 337
           ILP D V E++ V L+ VG F
Sbjct: 115 ILPSDAV---------------------------------ERQKVSLLAVGSFSTGGEGG 141

Query: 336 KLIKPSYESRILTVLWGMKEDERDKLGVFISA-TFRVSNVGKAYGFWCHNEDVCVYIVCE 160
           Y R++ L GMKE ER++LG+ +SA + S + + G W D +YIV E
Sbjct: 142 SGFTAGYFVRVMDCLSGMKEGEREELGLVLSAFNKQSSRICRVLGLWGDPGDGILYIVSE 201

Query: 159 KLASSNFID---CVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           K NF+D C F EK+G M+GMEICE + LH EGL G
Sbjct: 202 KQEYGNFLDKNLCGF-----EKDGFFN-----FAMIGMEICEAVIALHKEGLIAG 246

>ref|XP_007019188.1| Preprotein translocase SecA family protein, putative isoform 8 [Theobroma cacao]
 gi|508724516|gb|EOY16413.1| Preprotein translocase SecA family protein, putative isoform 8 [Theobroma cacao]
Length = 746

Score = 133 bits (334), Expect = 1e-28
Identities = 99/295 (33%), Positives = 127/295 (43%), Gaps = 7/295 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQPYD V AIPRVL CGHT CE CL LP+ IRC CT
Sbjct: 9   CPVCLQPYDGVCAIPRVLACGHTVCETCLVNLPQKLPGAIRCPACTVLVKYPPEGP---- 64

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRES---AASPSVLKPWTYEFYCEWRRW 517
           S L ++ E R+IP S +S + P + + W+ EFY W+ +
Sbjct: 65  --------STLPKNI--ELLRLIPGSGSTRKHVNKSPHDSRVPFLPRSWSDEFYSNWKIY 114

Query: 516 ILPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADS 337
           ILP D V E++ V L+ VG F
Sbjct: 115 ILPSDAV---------------------------------ERQKVSLLAVGSFSTGGEGG 141

Query: 336 KLIKPSYESRILTVLWGMKEDERDKLGVFISA-TFRVSNVGKAYGFWCHNEDVCVYIVCE 160
           Y R++ L GMKE ER++LG+ +SA + S + + G W D +YIV E
Sbjct: 142 SGFTAGYFVRVMDCLSGMKEGEREELGLVLSAFNKQSSRICRVLGLWGDPGDGILYIVSE 201

Query: 159 KLASSNFID---CVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           K NF+D C F EK+G M+GMEICE + LH EGL G
Sbjct: 202 KQEYGNFLDKNLCGF-----EKDGFFN-----FAMIGMEICEAVIALHKEGLIAG 246

>ref|XP_007019187.1| Preprotein translocase SecA family protein, putative isoform 7, partial [Theobroma cacao]
 gi|590599466|ref|XP_007019189.1| Preprotein translocase SecA family protein, putative isoform 7, partial [Theobroma cacao]
 gi|508724515|gb|EOY16412.1| Preprotein translocase SecA family protein, putative isoform 7, partial [Theobroma cacao]
 gi|508724517|gb|EOY16414.1| Preprotein translocase SecA family protein, putative isoform 7, partial [Theobroma cacao]
Length = 682

Score = 133 bits (334), Expect = 1e-28
Identities = 99/295 (33%), Positives = 127/295 (43%), Gaps = 7/295 (2%)
Frame = -3

Query: 867 CPVCLQPYDAVAAIPRVLTCGHTTCEACLKLLPRPFANTIRCTVCTQXXXXXXXXXXXXX 688
           CPVCLQPYD V AIPRVL CGHT CE CL LP+ IRC CT
Sbjct: 9   CPVCLQPYDGVCAIPRVLACGHTVCETCLVNLPQKLPGAIRCPACTVLVKYPPEGP---- 64

Query: 687 XXXXLYFSSLLQRSCPNEEKRVIPPSSPAEFGGRES---AASPSVLKPWTYEFYCEWRRW 517
           S L ++ E R+IP S +S + P + + W+ EFY W+ +
Sbjct: 65  --------STLPKNI--ELLRLIPGSGSTRKHVNKSPHDSRVPFLPRSWSDEFYSNWKIY 114

Query: 516 ILPEDCVLIGETDPDNDNGVLGGKVLRSFESDRVMGCVLREKEDVGLIKVGIFVESEADS 337
           ILP D V E++ V L+ VG F
Sbjct: 115 ILPSDAV---------------------------------ERQKVSLLAVGSFSTGGEGG 141

Query: 336 KLIKPSYESRILTVLWGMKEDERDKLGVFISA-TFRVSNVGKAYGFWCHNEDVCVYIVCE 160
           Y R++ L GMKE ER++LG+ +SA + S + + G W D +YIV E
Sbjct: 142 SGFTAGYFVRVMDCLSGMKEGEREELGLVLSAFNKQSSRICRVLGLWGDPGDGILYIVSE 201

Query: 159 KLASSNFID---CVFKSEKDEKEGLSADEMSFLGMVGMEICEILRRLHLEGLTVG 4
           K NF+D C F EK+G M+GMEICE + LH EGL G
Sbjct: 202 KQEYGNFLDKNLCGF-----EKDGFFN-----FAMIGMEICEAVIALHKEGLIAG 246