BLASTX nr result
ID: Mentha28_contig00000813
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00000813
         (2564 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus...   540   e-150
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   460   e-126
ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   460   e-126
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   441   e-120
ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family prot...   402   e-109
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   394   e-106
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   392   e-106
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...   391   e-106
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                       389   e-105
ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family prot...   387   e-104
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   387   e-104
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      385   e-104
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   385   e-104
ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phas...   380   e-102
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   365   4e-98
ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092...   362   6e-97
ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu...   360   1e-96
ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutr...   357   2e-95
ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab...
355 6e-95 ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu... 355 8e-95 >gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus guttatus] Length = 430 Score = 540 bits (1392), Expect = e-150 Identities = 304/454 (66%), Positives = 337/454 (74%), Gaps = 8/454 (1%) Frame = -2 Query: 1624 MENLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSN-PTPSLSVVS 1448 MENLGLVSSPKIVLGVSPNPRNS IS SK VGLPNLL++TG GR + T SL V+S Sbjct: 1 MENLGLVSSPKIVLGVSPNPRNSVIS--SKPLVGLPNLLKKTGNYGRHTTIHTSSLQVLS 58 Query: 1447 LFGARKATKTIVEEKVAEDGFASIASSGQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGV 1268 LF + K TKTIV EK A+D FA+I+SSGQ+TSSVG LFWIGVGV Sbjct: 59 LFRSPKPTKTIVSEKAAKDRFATISSSGQETSSVGVNPQLSVPPSSQVGSP-LFWIGVGV 117 Query: 1267 GLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXX 1088 GLSALFS+VAGR+KKYAMEQAFKTFTQQMNTQN+PFGN AF+ Sbjct: 118 GLSALFSFVAGRLKKYAMEQAFKTFTQQMNTQNSPFGNAAFSP----------------- 160 Query: 1087 XXFKTGAP-SFQTTTS---SPFKSGA--ASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAP 929 G+P F TS PF++ ASQ +T DVP +KVEDPPS SVKD+VE E P Sbjct: 161 -----GSPFPFPPATSPALDPFRTSTPLASQPITVDVPASKVEDPPSISVKDEVEQETGP 215 Query: 928 KKYAFKDVSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXX 749 KKYAF DVSPEET+QKNAFE +YKES +QTDSP + QN Sbjct: 216 KKYAFVDVSPEETLQKNAFE-NYKES-IQTDSPKD-PQSSQSVSQNGTAWNQGAGGSEGP 272 Query: 748 XXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNN 569 PLMSV+ALEKMMEDPTVQQMV+PYLPEEMRNPTTFKWMLQNP YRQQLQDMLNN Sbjct: 273 TTSKTAPLMSVEALEKMMEDPTVQQMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNN 332 Query: 568 MGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQA 389 MGG PEWDNRMMD+LKNFD+SSPE+KQQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQA Sbjct: 333 MGGTPEWDNRMMDSLKNFDISSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQA 392 Query: 388 AILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 AI+DCSQNP+SIAKYQNDKEVMDVFNKI+ELFPG Sbjct: 393 AIMDCSQNPMSIAKYQNDKEVMDVFNKISELFPG 426 >ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum] Length = 443 Score = 460 bits (1183), Expect = e-126 Identities 
= 256/449 (57%), Positives = 298/449 (66%), Gaps = 1/449 (0%) Frame = -2 Query: 1624 MENLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSL 1445 MEN+ +VSSPK+VLG+S NP +S+K GLP+L +R K+GR PT VVS Sbjct: 1 MENICIVSSPKMVLGLSSNP-----VISNKPLFGLPHLPKRPFKNGRIVRPTTCFEVVSC 55 Query: 1444 FGARKATKTIVEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGV 1268 F + + TK IV K FAS +SG QQTSSVG PLFWIGVGV Sbjct: 56 FQSPRLTKKIVLGKSGRGSFASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVGV 115 Query: 1267 GLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXX 1088 GLSALF+WVA +KKYAM+QA KT QMN QN+ F N AF+ Sbjct: 116 GLSALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVSG 175 Query: 1087 XXFKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKD 908 + P +T+S+P S A+ DV TKVE+PP+ +VK+ E PKK AF D Sbjct: 176 PASSSPPPPTASTSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFVD 235 Query: 907 VSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXP 728 +SP+ET QK AFE ++K+S T++ + P Sbjct: 236 ISPDETFQKGAFE-NFKDS---TETASVTVDQVTQNGAASQLGFGPNTSDSTSSTGKSNP 291 Query: 727 LMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEW 548 LMSVDALEKMMEDPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQLQDM+NNMGGNPEW Sbjct: 292 LMSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEW 351 Query: 547 DNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQ 368 DNRMMD+LKNFDLSSPEIKQQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQAAI+DCSQ Sbjct: 352 DNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQ 411 Query: 367 NPLSIAKYQNDKEVMDVFNKITELFPGPS 281 NPLSIAKYQNDKEVMDVFNKI+ELFPG S Sbjct: 412 NPLSIAKYQNDKEVMDVFNKISELFPGVS 440 >ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum] Length = 443 Score = 460 bits (1183), Expect = e-126 Identities = 256/453 (56%), Positives = 298/453 (65%), Gaps = 5/453 (1%) Frame = -2 Query: 1624 MENLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSL 1445 MEN+G+VSSPK+VLG+S N +SSK F GLP+L +R K+GR PT VVS Sbjct: 1 
MENIGIVSSPKMVLGLSSNS-----VISSKPFFGLPHLPKRPFKNGRTVRPTTCFEVVSC 55 Query: 1444 FGARKATKTIVEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGV 1268 F + TK IV K FAS +SG +QTSSVG PLFWIGVGV Sbjct: 56 FQGPRLTKKIVLGKSGRGSFASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVGV 115 Query: 1267 GLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXX 1088 G SALF+WVA +KKYAM+QA KT QMN QN+ F NTAF+ Sbjct: 116 GFSALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVSG 175 Query: 1087 XXFKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKD 908 + P +++S+P S A+ DV TKVE+PP+ +VK+ E E PKK AF D Sbjct: 176 PASSSPPPPTASSSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFVD 235 Query: 907 VSPEETVQKNAFEEDYKESS----VQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXX 740 +SP+ET QK AFE ++K+S+ V D + Sbjct: 236 ISPDETFQKGAFE-NFKDSAETAAVTVDQVTQNGAASQSGFGSNTSDSTSSTGKSNP--- 291 Query: 739 XXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGG 560 L+SVDALEKMMEDPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQLQDM+NNMGG Sbjct: 292 ----LLSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGG 347 Query: 559 NPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAIL 380 NPEWDNRMMD+LKNFDLSSPEIKQQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQAAI+ Sbjct: 348 NPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIM 407 Query: 379 DCSQNPLSIAKYQNDKEVMDVFNKITELFPGPS 281 DCSQNPLSIAKYQNDKEVMDVFNKI+ELFPG S Sbjct: 408 DCSQNPLSIAKYQNDKEVMDVFNKISELFPGVS 440 >ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera] gi|296089465|emb|CBI39284.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 441 bits (1133), Expect = e-120 Identities = 251/451 (55%), Positives = 291/451 (64%), Gaps = 3/451 (0%) Frame = -2 Query: 1624 MENLGLVSSPKIVLGVSP-NPRNSPISVSSKQFVGLPNLLRRTGK--SGRRSNPTPSLSV 1454 M++L LVSSPK+VLG SP NPR+ IS + F LP L R+ K + +S +P Sbjct: 1 MDSLTLVSSPKLVLGHSPSNPRH--ISCAHSSF-SLPLLFRKPRKFIAASQSGASP---- 53 Query: 1453 VSLFGARKATKTIVEEKVAEDGFASIASSGQQTSSVGAXXXXXXXXXXXXXXXPLFWIGV 1274 + + +VE K+ + FASI+SS Q TSSVG PLFWIGV Sbjct: 54 
-------RTPRHVVETKLGTECFASISSSSQGTSSVGVNPQFSPPPPSSNIGSPLFWIGV 106 Query: 1273 GVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXX 1094 GVGLSALFSWVA +KKYAM+QAFKT QM++QNN F T F+ Sbjct: 107 GVGLSALFSWVASNLKKYAMQQAFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPS 166 Query: 1093 XXXXFKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAF 914 T +PS TT SP A S DVP TKVE PP+T VKD +E ++ KYAF Sbjct: 167 TSHSGPTTSPSGPTT--SPSTVAAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAF 224 Query: 913 KDVSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXX 734 DVSPEET+Q++ FE E S +T S + Sbjct: 225 VDVSPEETLQESPFENF--EESTETSSSKDAQFSAGVSQNGTPPRPGMGVSEDSQSTRNA 282 Query: 733 XPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNP 554 P +SVDALEKMMEDPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQLQDMLNNMGG Sbjct: 283 NPFLSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGA 342 Query: 553 EWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDC 374 EWDNRMMD LKNFDLSSPE+KQQFDQIGLTP+EV+ KIM NPDVA+AFQNPR+QAAI+DC Sbjct: 343 EWDNRMMDNLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDC 402 Query: 373 SQNPLSIAKYQNDKEVMDVFNKITELFPGPS 281 SQNPLSIAKYQNDKEVMDVFNKI+ELFPG S Sbjct: 403 SQNPLSIAKYQNDKEVMDVFNKISELFPGVS 433 >ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508712012|gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 433 Score = 402 bits (1032), Expect = e-109 Identities = 242/455 (53%), Positives = 276/455 (60%), Gaps = 11/455 (2%) Frame = -2 Query: 1618 NLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSLFG 1439 NL LVSS L + N P F LP SN P S +S+F Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNPFKTLPF---------PSSNLAPRRSRISIFA 55 Query: 1438 ARKATKT-------IVEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFW 1283 + T IV K+ ++ FASI+SS QQTSSVG PLFW Sbjct: 56 HSHSQPTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFW 115 Query: 1282 IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXX 1103 IGVGVGLSALF+WVA 
+KKYAM+QAFKT QMNTQNN F N AF Sbjct: 116 IGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFP----- 170 Query: 1102 XXXXXXXFKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDA--- 932 AP +SP S + +V DVP TKVE P+T+ +V+ E Sbjct: 171 -----------APPSPGPVTSPSPSSQTAVTV-DVPATKVEAAPATAPATEVKSETETAE 218 Query: 931 PKKYAFKDVSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXX 752 PKKYAF DVSPEETVQK+AFE+ + S NN Sbjct: 219 PKKYAFVDVSPEETVQKSAFED-----AAGISSSNNTQFPKDVSDNGAASKQDAGAFGGS 273 Query: 751 XXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLN 572 P +SVDALEKMMEDPTVQ+MV+PYLPEEMRNP TFKWMLQNP YRQQLQDMLN Sbjct: 274 QSTGSADPALSVDALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLN 333 Query: 571 NMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQ 392 NMGG+ EWDNRMMD+LKNFDL+SP++KQQFDQIGLTP+EV+ KIM NP+VAMAFQNPRVQ Sbjct: 334 NMGGSTEWDNRMMDSLKNFDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQ 393 Query: 391 AAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 AAI+DCSQNPLSIAKYQNDKEVMDVFNKI+ELFPG Sbjct: 394 AAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 428 >ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 432 Score = 394 bits (1011), Expect = e-106 Identities = 242/467 (51%), Positives = 296/467 (63%), Gaps = 20/467 (4%) Frame = -2 Query: 1618 NLGLVSSPK-IVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPS---LSVV 1451 NL LVSSPK ++LG P I +S+ ++ RR S R P V Sbjct: 5 NLALVSSPKPLMLGHVP-----AIDATSR------DVFRRKHFSFGRVLIAPHRCRFRVS 53 Query: 1450 SLFGARKATKTIVEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGV 1274 +L + + K+ V+EK+ FASI+SS Q+ +S G LFWIGV Sbjct: 54 ALSSSHRNPKS-VQEKLIVKHFASISSSNTQEATSTGVNPQLSPSSTIGSP---LFWIGV 109 Query: 1273 GVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXX 1094 GVGLSALFS VA R+KKYAM+QAFKT QMN+QNN FGN AF+ Sbjct: 110 GVGLSALFSVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPT 169 Query: 1093 XXXXFKTGAPSFQTTTSS--PFKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKK 923 AP+ TT S P S A+ ++T D+P KVE P+T+VKD+VE ++ PKK Sbjct: 170 
--------APASSATTQSRAPSASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKK 221 Query: 922 YAFKDVSPEETVQKNAFE--EDYKESSV----------QTDSPNNYXXXXXXXXQNXXXX 779 AF DVSPEETVQ++ FE +D + SSV Q +P+N Q+ Sbjct: 222 IAFVDVSPEETVQESPFESFKDDESSSVKEARVPDEVSQNGAPSNQGFGDFPGSQSTKKS 281 Query: 778 XXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVY 599 ++SVDALEKMMEDPTVQ+MV+PYLPEEMRNPTTFKWMLQNP Y Sbjct: 282 -----------------VLSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQY 324 Query: 598 RQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVA 419 RQQL++MLNNMGG+ EWD+RMMDTLKNFDL+SPE+KQQFDQIGL+P+EV+ KIM NP+VA Sbjct: 325 RQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVA 384 Query: 418 MAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPGPSS 278 MAFQNPRVQAAI+DCSQNP++I KYQNDKEVMDVFNKI+ELFPG S Sbjct: 385 MAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMDVFNKISELFPGVGS 431 >ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 429 Score = 392 bits (1006), Expect = e-106 Identities = 222/395 (56%), Positives = 266/395 (67%), Gaps = 16/395 (4%) Frame = -2 Query: 1414 VEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVA 1238 V+EK+ FASI+SS Q+T+S+G LFWIGVGVGLSALFS VA Sbjct: 60 VQEKLIVKHFASISSSNTQETTSIGVKPQLSPSPSSTIGSP-LFWIGVGVGLSALFSVVA 118 Query: 1237 GRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSF 1058 R+KKYAM+QAFKT QMN+QNN FGN AF+ AP+ Sbjct: 119 SRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPT--------APAS 170 Query: 1057 QTTTSS--PFKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETV 887 TT S P S A+ ++T D+P KVE P+T+VKD+VE ++ PKK AF DVSPEETV Sbjct: 171 SATTQSRAPSASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETV 230 Query: 886 QKNAFE--EDYKESSV----------QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXX 743 +++ FE +D + SSV Q +P+N Q+ Sbjct: 231 RESPFESFKDDESSSVKEAWVPDEVSQNGAPSNLGFGDFPGSQSTKKSA----------- 279 Query: 742 XXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMG 563 +SVDALEKMMEDPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQL++MLNNMG Sbjct: 280 
------LSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMG 333 Query: 562 GNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAI 383 G+ EWDNRMMDTLKNFDL+SPE+KQQFDQIGL+P+EV+ KIM NP+VAMAFQNPRVQAAI Sbjct: 334 GSTEWDNRMMDTLKNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAI 393 Query: 382 LDCSQNPLSIAKYQNDKEVMDVFNKITELFPGPSS 278 +DCSQNP++I KYQNDKEVMDVFNKI+ELFPG S Sbjct: 394 MDCSQNPMNITKYQNDKEVMDVFNKISELFPGVGS 428 >ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis] gi|223528427|gb|EEF30461.1| conserved hypothetical protein [Ricinus communis] Length = 465 Score = 391 bits (1005), Expect = e-106 Identities = 238/467 (50%), Positives = 285/467 (61%), Gaps = 23/467 (4%) Frame = -2 Query: 1618 NLGLVSS----PKIVLGVS-PNPRNSPISVSSKQFV-------GLPNLLRRTGKSGRRSN 1475 N+GL+SS PK+V+G PN +P ++KQF LP LR R S Sbjct: 5 NMGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSR 64 Query: 1474 PTPSLSVVSLFGARKATKTIVEEKVAEDGFASIASSGQQTSSVGAXXXXXXXXXXXXXXX 1295 S+ +L + + + ++ + FASI SS QQTSSVG Sbjct: 65 ----FSISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFG 119 Query: 1294 P-LFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXX 1118 LFWIGVGVGLSA+FS VA RVK YAM+QAFK+ QMNTQN+ F N AF+ Sbjct: 120 SPLFWIGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFP 179 Query: 1117 XXXXXXXXXXXXFKTGA-------PSFQTTTSSPFKSGAASQSVT-DVPVTKVEDPPSTS 962 F T + PS+ T+++S S A+ +VT DV TKVE T Sbjct: 180 TPPASVPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTD 239 Query: 961 VKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKESSVQTDSPNNYXXXXXXXXQNX 788 KD+ E PKKYAF DVSPEET K+ F+ ED E+S D+ N N Sbjct: 240 AKDEAEITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQ 299 Query: 787 XXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQN 608 +SV+ALEKMMEDPTVQ+MV+PYLPEEMRNP+TFKWMLQN Sbjct: 300 GAADFTGSQSTRKAGSG----LSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQN 355 Query: 607 PVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENP 428 P YRQQL++MLNNM G EWDNRMMD+LKNFDLSSPE+KQQFDQIGLTP+EV+ KIM NP Sbjct: 356 
PQYRQQLEEMLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANP 415 Query: 427 DVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 ++AMAFQNPRVQ AI+DCSQNPLSIAKYQNDKEVMDVFNKI+ELFPG Sbjct: 416 EIAMAFQNPRVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 462 >gb|ABF19057.1| plastid Tic40 [Ricinus communis] Length = 460 Score = 389 bits (999), Expect = e-105 Identities = 237/466 (50%), Positives = 284/466 (60%), Gaps = 23/466 (4%) Frame = -2 Query: 1615 LGLVSS----PKIVLGVS-PNPRNSPISVSSKQFV-------GLPNLLRRTGKSGRRSNP 1472 +GL+SS PK+V+G PN +P ++KQF LP LR R S Sbjct: 1 MGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSR- 59 Query: 1471 TPSLSVVSLFGARKATKTIVEEKVAEDGFASIASSGQQTSSVGAXXXXXXXXXXXXXXXP 1292 S+ +L + + + ++ + FASI SS QQTSSVG Sbjct: 60 ---FSISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFGS 115 Query: 1291 -LFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXX 1115 LFWIGVGVGLSA+FS VA RVK YAM+QAFK+ QMNTQN+ F N AF+ Sbjct: 116 PLFWIGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPT 175 Query: 1114 XXXXXXXXXXXFKTGA-------PSFQTTTSSPFKSGAASQSVT-DVPVTKVEDPPSTSV 959 F T + PS+ T+++S S A+ +VT DV TKVE T Sbjct: 176 PPASVPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDA 235 Query: 958 KDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKESSVQTDSPNNYXXXXXXXXQNXX 785 KD+ E PKKYAF DVSPEET K+ F+ ED E+S D+ N N Sbjct: 236 KDEAEITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQG 295 Query: 784 XXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNP 605 +SV+ALEKMMEDPTVQ+MV+PYLPEEMRNP+TFKWMLQNP Sbjct: 296 AADFTGSQSTRKAGSG----LSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNP 351 Query: 604 VYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPD 425 YRQQL++MLNNM G EWDNRMMD+LKNFDLSSPE+KQQFDQIGLTP+EV+ KIM NP+ Sbjct: 352 QYRQQLEEMLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPE 411 Query: 424 VAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 +AMAFQNPRVQ AI+DCSQNPLSIAKYQNDKEVMDVFNKI+ELFPG Sbjct: 412 IAMAFQNPRVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 457 >ref|XP_007032985.1| Hydroxyproline-rich 
glycoprotein family protein isoform 4, partial [Theobroma cacao] gi|508712014|gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial [Theobroma cacao] Length = 412 Score = 387 bits (993), Expect = e-104 Identities = 238/450 (52%), Positives = 271/450 (60%), Gaps = 12/450 (2%) Frame = -2 Query: 1618 NLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSLFG 1439 NL LVSS L + N P F LP SN P S +S+F Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNPFKTLPF---------PSSNLAPRRSRISIFA 55 Query: 1438 ARKATKT-------IVEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFW 1283 + T IV K+ ++ FASI+SS QQTSSVG PLFW Sbjct: 56 HSHSQPTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFW 115 Query: 1282 IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXX 1103 IGVGVGLSALF+WVA +KKYAM+QAFKT QMNTQNN F N AF Sbjct: 116 IGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFP----- 170 Query: 1102 XXXXXXXFKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDA--- 932 AP +SP S + +V DVP TKVE P+T+ +V+ E Sbjct: 171 -----------APPSPGPVTSPSPSSQTAVTV-DVPATKVEAAPATAPATEVKSETETAE 218 Query: 931 PKKYAFKDVSPEETVQKNAFEEDYK-ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXX 755 PKKYAF DVSPEETVQK+AFE+ SS T P + Sbjct: 219 PKKYAFVDVSPEETVQKSAFEDAAGISSSNNTQFPKD----------------DAGAFGG 262 Query: 754 XXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDML 575 P +SVDALEKMMEDPTVQ+MV+PYLPEEMRNP TFKWMLQNP YRQQLQDML Sbjct: 263 SQSTGSADPALSVDALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDML 322 Query: 574 NNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRV 395 NNMGG+ EWDNRMMD+LKNFDL+SP++KQQFDQIGLTP+EV+ KIM NP+VAMAFQNPRV Sbjct: 323 NNMGGSTEWDNRMMDSLKNFDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRV 382 Query: 394 QAAILDCSQNPLSIAKYQNDKEVMDVFNKI 305 QAAI+DCSQNPLSIAKYQNDKEVMDVFNKI Sbjct: 383 QAAIMDCSQNPLSIAKYQNDKEVMDVFNKI 412 >ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum] Length = 433 Score = 387 bits (993), Expect = e-104 Identities = 237/468 (50%), Positives = 287/468 (61%), Gaps = 
22/468 (4%) Frame = -2 Query: 1618 NLGLVSSPK-IVLGVSPN----PRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSV 1454 NL LVSSPK ++LG S + R P + K FV + ++ +S+ P Sbjct: 5 NLALVSSPKPLLLGHSSSRNVFTRRKPFTFG-KFFVSANSSSSHVTRAAPKSHQNPKS-- 61 Query: 1453 VSLFGARKATKTIVEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIG 1277 V+ K+ FASI+SS Q+T+SVG LFWIG Sbjct: 62 -------------VQGKLIVHNFASISSSNSQETTSVGVSPQLSPPPSSTVGSP-LFWIG 107 Query: 1276 VGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXX 1097 VGVG SALFS VA R+KKYAM+QAFKT QMNTQNNPF + AF+ Sbjct: 108 VGVGFSALFSIVASRLKKYAMQQAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGP 167 Query: 1096 XXXXXFKTGAPSFQTTTSSPFKSG-AASQSVT--DVPVTKVEDPPSTSVKDKVEPEDAPK 926 AP+ T S S ASQS D+P TKVE PST+ KD+VE ++ PK Sbjct: 168 --------AAPASSAGTQSQSTSARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPK 219 Query: 925 KYAFKDVSPEETVQKNAFE--EDYKESS-----------VQTDSPNNYXXXXXXXXQNXX 785 K F DVSPEE+VQK+ FE +D ESS Q +P+N Q+ Sbjct: 220 KIGFVDVSPEESVQKSPFESFKDVDESSSFKEARAPAEAFQNGAPSNQGFGNSPGSQSGG 279 Query: 784 XXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNP 605 ++SV+ALEKMMEDPTVQ+MV+PYLPEEMRNP+TFKWMLQNP Sbjct: 280 KS-----------------VLSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNP 322 Query: 604 VYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPD 425 YRQQL++MLNNMGG+ EWD+RMMDTLKNFDL+SP++KQQFDQIGL+P+EV+ KIM NP+ Sbjct: 323 QYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSPDVKQQFDQIGLSPEEVISKIMANPE 382 Query: 424 VAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPGPS 281 VAMAFQNPRVQAAI+DCS NPL+IAKYQNDKEVMDVFNKI+ELFPG S Sbjct: 383 VAMAFQNPRVQAAIMDCSSNPLNIAKYQNDKEVMDVFNKISELFPGVS 430 >emb|CAB50925.1| translocon Tic40 [Pisum sativum] Length = 436 Score = 385 bits (989), Expect = e-104 Identities = 235/458 (51%), Positives = 278/458 (60%), Gaps = 12/458 (2%) Frame = -2 Query: 1618 NLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSLFG 1439 NL LVSSPK +L + +N + K F T + R S + S V Sbjct: 5 NLALVSSPKPLLLGHSSSKN--VFSRRKSF---------TFGTFRVSANSSSSHVTRAAS 53 Query: 1438 
ARKATKTIVEEKVAEDGFASIASS-GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGL 1262 V+ KV FASI+SS GQ+T+SVG LFWIG+GVG Sbjct: 54 KSHQNLKSVQGKVNAHSFASISSSNGQETTSVGVSPQLSPPPPSTVGSP-LFWIGIGVGF 112 Query: 1261 SALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXX 1082 SALFS VA RVKKYAM+QAFK+ QMNTQNNPF + AF+ Sbjct: 113 SALFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAG 172 Query: 1081 FKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVE---DPPSTSVKDKVEPEDAPKKYAFK 911 F G S T+T +S + S D+P TKVE P +VK++VE ++ PKK AF Sbjct: 173 F-AGNQSQATST----RSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFV 227 Query: 910 DVSPEETVQKNAFEE--------DYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXX 755 DVSPEETVQKNAFE +KE+ ++ N + Sbjct: 228 DVSPEETVQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPGSPSERKSA-- 285 Query: 754 XXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDML 575 +SVDALEKMMEDPTVQQMV+PYLPEEMRNP+TFKWM+QNP YRQQL+ ML Sbjct: 286 ----------LSVDALEKMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAML 335 Query: 574 NNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRV 395 NNMGG EWD+RMMDTLKNFDL+SP++KQQFDQIGL+PQEV+ KIM NPDVAMAFQNPRV Sbjct: 336 NNMGGGTEWDSRMMDTLKNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRV 395 Query: 394 QAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPGPS 281 QAAI+DCSQNP+SI KYQNDKEVMDVFNKI+ELFPG S Sbjct: 396 QAAIMDCSQNPMSIVKYQNDKEVMDVFNKISELFPGVS 433 >sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=PsTIC40; Flags: Precursor gi|26000725|gb|AAN75219.1| chloroplast protein translocon component Tic40 precursor [Pisum sativum] Length = 436 Score = 385 bits (989), Expect = e-104 Identities = 235/458 (51%), Positives = 278/458 (60%), Gaps = 12/458 (2%) Frame = -2 Query: 1618 NLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSLFG 1439 NL LVSSPK +L + +N + K F T + R S + S V Sbjct: 5 NLALVSSPKPLLLGHSSSKN--VFSGRKSF---------TFGTFRVSANSSSSHVTRAAS 53 Query: 1438 ARKATKTIVEEKVAEDGFASIASS-GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGL 1262 V+ KV 
FASI+SS GQ+T+SVG LFWIG+GVG Sbjct: 54 KSHQNLKSVQGKVNAHDFASISSSNGQETTSVGVSPQLSPPPPSTVGSP-LFWIGIGVGF 112 Query: 1261 SALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXX 1082 SALFS VA RVKKYAM+QAFK+ QMNTQNNPF + AF+ Sbjct: 113 SALFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAG 172 Query: 1081 FKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVE---DPPSTSVKDKVEPEDAPKKYAFK 911 F G S T+T +S + S D+P TKVE P +VK++VE ++ PKK AF Sbjct: 173 F-AGNQSQATST----RSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFV 227 Query: 910 DVSPEETVQKNAFEE--------DYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXX 755 DVSPEETVQKNAFE +KE+ ++ N + Sbjct: 228 DVSPEETVQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPSSPSERKSA-- 285 Query: 754 XXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDML 575 +SVDALEKMMEDPTVQQMV+PYLPEEMRNP+TFKWM+QNP YRQQL+ ML Sbjct: 286 ----------LSVDALEKMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAML 335 Query: 574 NNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRV 395 NNMGG EWD+RMMDTLKNFDL+SP++KQQFDQIGL+PQEV+ KIM NPDVAMAFQNPRV Sbjct: 336 NNMGGGTEWDSRMMDTLKNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRV 395 Query: 394 QAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPGPS 281 QAAI+DCSQNP+SI KYQNDKEVMDVFNKI+ELFPG S Sbjct: 396 QAAIMDCSQNPMSIVKYQNDKEVMDVFNKISELFPGVS 433 >ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris] gi|561020160|gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris] Length = 430 Score = 380 bits (977), Expect = e-102 Identities = 228/451 (50%), Positives = 280/451 (62%), Gaps = 4/451 (0%) Frame = -2 Query: 1618 NLGLVSSPK-IVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSLF 1442 NL LVSS K ++LG P + V ++ L +L + R VS Sbjct: 5 NLALVSSSKPLMLGHVPARDATDRDVLRRKPFSLGRVLIAPHRFRYR---------VSAL 55 Query: 1441 GARKATKTIVEEKVAEDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVG 1265 + + V++K+ FASI+SS Q+T+S+G LFWIGVGVG Sbjct: 56 SSSHHSPKSVQDKLIVKHFASISSSNTQETTSIGVNPQLSPPPSSTIGSP-LFWIGVGVG 114 Query: 1264 LSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXX 1085 
LSALFS VA R+KKYAM+QAFKT QMN+ NN FGN AF+ Sbjct: 115 LSALFSMVASRLKKYAMQQAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATA 174 Query: 1084 XFKTGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDV 905 + GAPS SG+ S D+P TKVE +T +KD+VE ++ PKK AF DV Sbjct: 175 --QYGAPSTS--------SGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDV 224 Query: 904 SPEETVQKNAFE--EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXX 731 SPEETVQK+ FE +D + SSV+ ++ N Sbjct: 225 SPEETVQKSPFESVKDNESSSVKEEARVPDEVSQNGAPFNQGFGGFPGSQSTKKSA---- 280 Query: 730 PLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPE 551 +SVDALEKMMEDPTVQ+MV+P+LPEEMRNP TFKWMLQNP YRQQL+ ML+NMGG+ E Sbjct: 281 --LSVDALEKMMEDPTVQKMVYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTE 338 Query: 550 WDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCS 371 WDNRMMDTLKNFDL+SPE+KQQFDQIGL+P+EV+ KIM NP+VAMAFQNPRVQAAI+DCS Sbjct: 339 WDNRMMDTLKNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCS 398 Query: 370 QNPLSIAKYQNDKEVMDVFNKITELFPGPSS 278 QNP++I KYQNDKEVM+VFNKI+ELFPG S Sbjct: 399 QNPMNITKYQNDKEVMNVFNKISELFPGMGS 429 >ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus] Length = 419 Score = 365 bits (938), Expect = 4e-98 Identities = 214/452 (47%), Positives = 272/452 (60%), Gaps = 6/452 (1%) Frame = -2 Query: 1618 NLGLVSSPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNP---TPSLSVVS 1448 NL L S PK + ++ SS + P+ + + R S+ +P + S Sbjct: 5 NLALTSPPKFLF----------LTYSSTSSISTPSRTNQLCATRRLSSSRIKSPPRILAS 54 Query: 1447 LFGARKATKTIVEEKVAEDGFASIASS--GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGV 1274 R + +V E+ FA+++SS +SSVG LFW+GV Sbjct: 55 ALNRRPNHRILVAER-----FATVSSSTTSNDSSSVGVPSVSIPPPSSYVGSP-LFWVGV 108 Query: 1273 GVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXX 1094 GVGLSALF+WVA +KKYAM+QAFKT QMN+QN+P N + Sbjct: 109 GVGLSALFTWVASYLKKYAMQQAFKTMMSQMNSQNSPMSNPTLSSGSPFPIP-------- 160 Query: 1093 XXXXFKTGAPSFQT-TTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYA 917 P+F T TT SP S A DV TKVE+ P T+VK + E +A KK+A Sbjct: 161 
---------PTFATGTTISPSVSEPAVS--IDVTATKVEEEPVTNVKSRTENMEA-KKFA 208 Query: 916 FKDVSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXX 737 F DVSPEET QK+ F+ED ++ V + Sbjct: 209 FVDVSPEETDQKSPFKEDATDADVSKSAQPTQELPQNGAASKQAYNGSDGSQFSRKPGS- 267 Query: 736 XXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGN 557 ++SV+A+EKMMEDPTVQ+M++P+LPEEMRNP TFKWM+QNP+YRQQL++MLNNM G+ Sbjct: 268 ---VLSVEAVEKMMEDPTVQKMIYPHLPEEMRNPETFKWMMQNPLYRQQLEEMLNNMSGS 324 Query: 556 PEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILD 377 P+WD R+MD+LKNFDLSSPE+KQQFDQIGLTP+EV+ KIM NP++AMAFQNPRVQAAI+D Sbjct: 325 PQWDGRLMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQAAIMD 384 Query: 376 CSQNPLSIAKYQNDKEVMDVFNKITELFPGPS 281 CSQNPLSI KYQNDKEVMDVFNKI+ELFPG S Sbjct: 385 CSQNPLSITKYQNDKEVMDVFNKISELFPGVS 416 >ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Protein PIGMENT DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=AtTIC40; Flags: Precursor gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6 [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|20260222|gb|AAM13009.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1| At5g16620 [Arabidopsis thaliana] gi|332004935|gb|AED92318.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 447 Score = 362 bits (928), Expect = 6e-97 Identities = 222/471 (47%), Positives = 271/471 (57%), Gaps = 25/471 (5%) Frame = -2 Query: 1624 MENLGLVS----SPKIVLGVS-PNPRNSPISVSSKQFVGLPNLLRRTGK-SGRRSNPTPS 1463 MENL LVS SPK+++G + + +P S + PN++ R K S + +PS Sbjct: 1 MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRRT----PNIVLRCSKISASAQSQSPS 56 Query: 1462 LSVVSLFGARKATKTIVEEKVAEDGFASIASSG--QQTSSVGAXXXXXXXXXXXXXXXPL 1289 + T IV K FASI SS QQT+SV + PL Sbjct: 57 
-------SRPENTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPL 109 Query: 1288 FWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXX 1109 FWIGVGVGLSALFS+V +KKYAM+ A KT QMNTQN+ F N+ F Sbjct: 110 FWIGVGVGLSALFSYVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPF-- 167 Query: 1108 XXXXXXXXXFKTGAPSFQTTTSSPFKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPE-- 938 P + SSPF+S + S T DV TKVE PPST K + Sbjct: 168 --------------PPQTSPASSPFQSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIE 213 Query: 937 -DAP-------------KKYAFKDVSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXX 800 D P K YAF+D+SPEET +++ F + S + Sbjct: 214 VDKPSVVLEASKEKKEEKNYAFEDISPEETTKESPFSNYAEVSETNSPKETRLFEDVLQN 273 Query: 799 XQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKW 620 P +SV+ALEKMMEDPTVQ+MV+PYLPEEMRNP TFKW Sbjct: 274 GAGPANGATASEVFQSLGGGKGGPGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKW 333 Query: 619 MLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKI 440 ML+NP YRQQLQDMLNNM G+ EWD RM DTLKNFDL+SPE+KQQF+QIGLTP+EV+ KI Sbjct: 334 MLKNPQYRQQLQDMLNNMSGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKI 393 Query: 439 MENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 MENPDVAMAFQNPRVQAA+++CS+NP++I KYQNDKEVMDVFNKI++LFPG Sbjct: 394 MENPDVAMAFQNPRVQAALMECSENPMNIMKYQNDKEVMDVFNKISQLFPG 444 >ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa] gi|222848840|gb|EEE86387.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa] Length = 429 Score = 360 bits (925), Expect = 1e-96 Identities = 218/453 (48%), Positives = 269/453 (59%), Gaps = 14/453 (3%) Frame = -2 Query: 1603 SSPKIVLGVSP---NPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSLFGAR 1433 SSPK+V+G NP S+S+ + LP LR S P S+ S+ Sbjct: 12 SSPKLVMGYPTSLKNPTTPKFSISTTR-PSLPFSLRI-------SKTAPHASIFSISALA 63 Query: 1432 KATKTIVEEKVAEDGFASIASS-GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSA 1256 + K+ + FASI+SS G+QT+SVG LFW+GVGVGLSA Sbjct: 64 NS-----HGKLGSEYFASISSSSGKQTASVGVNPQPVSPPPSQIGSP-LFWVGVGVGLSA 117 Query: 1255 LFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFK 1076 +FSWVA RVK 
YAM+QAFK+ T+QMNTQNN F N AF+ Sbjct: 118 IFSWVATRVKNYAMQQAFKSLTEQMNTQNNQF-NPAFSARPPFPF--------------- 161 Query: 1075 TGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPE--------DAPKKY 920 P ++SP + + D+P TKVE P+T V + E + + KKY Sbjct: 162 -SPPPASHPSTSPSPAASQPAITVDIPATKVEAAPTTDVGKEKETDFLEERKIKEETKKY 220 Query: 919 AFKDVSPEETVQKNAFE--EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXX 746 AF D+SPEET F ED E+S D Sbjct: 221 AFVDISPEETSLNTPFSSVEDDNETSSSKD-------VEFAKKVFQNGAAFKQGPGAAEG 273 Query: 745 XXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNM 566 P +SV+ALEKMMEDPT+Q+MV+PYLPEEMRNPTTFKWMLQNP YRQQL+DMLNNM Sbjct: 274 SQSTRPFLSVEALEKMMEDPTMQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNM 333 Query: 565 GGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAA 386 GG+ +WD++MMD+LK+FDL+S E+KQQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQ A Sbjct: 334 GGSGKWDSQMMDSLKDFDLNSAEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQQA 393 Query: 385 ILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 I++CSQNP++I KYQNDKEVMDVFNKI+ELFPG Sbjct: 394 IMECSQNPINITKYQNDKEVMDVFNKISELFPG 426 >ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum] gi|557101290|gb|ESQ41653.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum] Length = 449 Score = 357 bits (915), Expect = 2e-95 Identities = 222/470 (47%), Positives = 267/470 (56%), Gaps = 24/470 (5%) Frame = -2 Query: 1624 MENLGLVS----SPKIVLGVSPNPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLS 1457 MENL LVS SPK+++G + + S K VG RRT K R + + + Sbjct: 1 MENLTLVSCSASSPKLLIGCN-------FTSSLKNPVGFS---RRTPKVVFRCSKISASA 50 Query: 1456 VVSLFGARKATK-TIVEEKVAEDGFASIASSG--QQTSSVGAXXXXXXXXXXXXXXXPLF 1286 +R IV K FASI SS QQT+SV PLF Sbjct: 51 KSQSHSSRPENAGEIVVVKHRSRDFASIFSSNRDQQTTSVAYPNAAVPPPSSSTIGSPLF 110 Query: 1285 WIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXX 1106 WIGVGVGLSALFSWV +KKYAM+ A KT QMNTQN+ F N F Sbjct: 111 WIGVGVGLSALFSWVTSSLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPAGSASPFPFPF- 169 Query: 1105 XXXXXXXXFKTGAPSFQTTTSSPFKSGAASQSVT-DVPVTKVEDPPSTS----------- 962 P + TSSPF+S + S T DV 
TKV+ PPS Sbjct: 170 -------------PPQTSPTSSPFQSQSQSSGATVDVTATKVDTPPSAKPQPTPAKKTEV 216 Query: 961 -----VKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXX 797 V ++ + + K YAF+DVSPEET +++ F + S Sbjct: 217 DKPSVVLEENKAKKEEKNYAFEDVSPEETTKESPFSNYAEVSETSAPKEARLFEDVMQNG 276 Query: 796 QNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWM 617 P +SV+ALEKMMEDPTVQ+MV+P+LPEEMRNP TFKWM Sbjct: 277 AAPANGATASEVFQSLGAGKGGPGLSVEALEKMMEDPTVQKMVYPHLPEEMRNPETFKWM 336 Query: 616 LQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIM 437 L+NP YRQQLQDMLNNM G+ EWD RMMDTLKNFDL+SPE+KQQFDQIGLTP+EV+ KIM Sbjct: 337 LKNPHYRQQLQDMLNNMSGSGEWDKRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIM 396 Query: 436 ENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 ENPDVAMAFQNPRVQAA+++CS+NP++I KYQNDKEVMDVFNKI++LFPG Sbjct: 397 ENPDVAMAFQNPRVQAALMECSENPMNIMKYQNDKEVMDVFNKISQLFPG 446 >ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. 
lyrata] Length = 447 Score = 355 bits (911), Expect = 6e-95 Identities = 219/471 (46%), Positives = 266/471 (56%), Gaps = 25/471 (5%) Frame = -2 Query: 1624 MENLGLVS----SPKIVLGVS-PNPRNSPISVSSKQFVGLPNLLRRTGK-SGRRSNPTPS 1463 MENL LVS SPK+++G + + +P S + P ++ R K S + +PS Sbjct: 1 MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRRT----PRIVLRCSKISASAQSQSPS 56 Query: 1462 LSVVSLFGARKATKTIVEEKVAEDGFASIASSG--QQTSSVGAXXXXXXXXXXXXXXXPL 1289 T IV K FASI SS QQT+SV + PL Sbjct: 57 -------SRPDNTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPL 109 Query: 1288 FWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXX 1109 FWIGVGVGLSALFS V +KKYAM+ A KT QMNTQN+ F N F Sbjct: 110 FWIGVGVGLSALFSLVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPF-- 167 Query: 1108 XXXXXXXXXFKTGAPSFQTTTSSPFKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPE-- 938 P + SSPF+S + S T DV TKV+ PPST K + Sbjct: 168 --------------PPQTSPASSPFQSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIE 213 Query: 937 -DAP-------------KKYAFKDVSPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXX 800 D P K YAF+D+SPEET +++ F + S + Sbjct: 214 VDKPSVVLEASKEKKEEKNYAFEDISPEETTKESPFSNYAEVSETSSPKETRLFEDVLQN 273 Query: 799 XQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKW 620 +SV+ALEKMMEDPTVQ+MV+PYLPEEMRNP TFKW Sbjct: 274 GAGPANGATASEVFQSLGGGKGGAGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKW 333 Query: 619 MLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKI 440 ML+NP YRQQLQDMLNNM G+ EWD RM DTLKNFDL+SPE+KQQF+QIGLTP+EV+ KI Sbjct: 334 MLKNPQYRQQLQDMLNNMSGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKI 393 Query: 439 MENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNKITELFPG 287 MENPDVAMAFQNPRVQAA+++CS+NP++I KYQNDKEVMDVFNKI++LFPG Sbjct: 394 MENPDVAMAFQNPRVQAALMECSENPMNIMKYQNDKEVMDVFNKISQLFPG 444 >ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa] gi|550319201|gb|ERP50369.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa] Length = 435 Score = 355 bits (910), Expect = 8e-95 Identities = 212/450 (47%), Positives = 266/450 (59%), Gaps = 14/450 (3%) Frame = -2 Query: 1603 
SSPKIVLGVSP---NPRNSPISVSSKQFVGLPNLLRRTGKSGRRSNPTPSLSVVSLFGAR 1433 SS K+V G NP S+S+ + LP RT K+ ++ ++ G R Sbjct: 12 SSLKLVSGYPTSLKNPTTPKFSISTTR-PSLP-FPHRTSKTVTHTSRISISALSQSHGPR 69 Query: 1432 KATKTIVEEKVAEDGFASIAS-SGQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSA 1256 + +K + FASI+S SGQQT+SVG LFW+GVGV LSA Sbjct: 70 RTSKN------GSEYFASISSLSGQQTASVGVNPQSVSPPPSQIGSP-LFWVGVGVALSA 122 Query: 1255 LFSWVAGRVKKYAMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFK 1076 +FSWVA R+K YAM+QAFK+ T+QMN QNN F N AF+ Sbjct: 123 IFSWVATRLKNYAMQQAFKSLTEQMNAQNNQF-NPAFSARSPFPF--------------- 166 Query: 1075 TGAPSFQTTTSSPFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPE--------DAPKKY 920 P +SPF++ + D+P TKVE P T + + E + + P+K+ Sbjct: 167 -SPPPASQPATSPFQTASQPAVTVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKF 225 Query: 919 AFKDVSPEETVQKNAFE--EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXX 746 AF DVSPEET F ED ++S D + Sbjct: 226 AFVDVSPEETSLNTPFSSVEDVIDTSSSKDV--QFAKEASQNGATFKQGPSASEPSEGSQ 283 Query: 745 XXXXXPLMSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNM 566 +SV+ALEKMM+DPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQL++MLNNM Sbjct: 284 SSQKAGSLSVEALEKMMDDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNM 343 Query: 565 GGNPEWDNRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAA 386 G+ EWD+RM+D+LKNFDLSSPE+KQQFDQIGLTP+EV+ KIM NPDVA+AFQNPRVQ A Sbjct: 344 SGSSEWDSRMVDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRVQQA 403 Query: 385 ILDCSQNPLSIAKYQNDKEVMDVFNKITEL 296 I++CSQNPLSIAKYQNDKEVMDVFNKI+E+ Sbjct: 404 IMECSQNPLSIAKYQNDKEVMDVFNKISEI 433