BLASTX nr result
ID: Catharanthus23_contig00002341
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00002341
         (1949 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   426   e-116
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   426   e-116
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   408   e-111
gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein i...   401   e-109
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   401   e-109
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   400   e-108
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   390   e-105
gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein i...   384   e-104
gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus...   383   e-103
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...   379   e-102
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                       379   e-102
ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu...   366   2e-98
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   365   5e-98
ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092...   361   7e-97
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      360   1e-96
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   359   2e-96
ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-lik...   359   3e-96
ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab...   358   4e-96
ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu...
348 6e-93 ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutr... 347 1e-92 >ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum] Length = 443 Score = 426 bits (1095), Expect = e-116 Identities = 243/444 (54%), Positives = 274/444 (61%), Gaps = 29/444 (6%) Frame = -2 Query: 1846 MENLSLVSSPKIVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVF---SSLQG 1676 MEN+ +VSSPK+VLGLS N S +KPF G F S QG Sbjct: 1 MENIGIVSSPKMVLGLSSNSVIS--SKPFFGLPHLPKRPFKNGRTVRPTTCFEVVSCFQG 58 Query: 1675 PKSNK--ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLS 1502 P+ K +L K R FAS VNPQ SP S +GSPLFWIGVGVG S Sbjct: 59 PRLTKKIVLGKSGRGSFASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVGVGFS 118 Query: 1501 AIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXX 1322 A+F+ VA+ LK YAMQQA+KT+MGQ+ QN+QFSN FSP Sbjct: 119 ALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVSGPAS 178 Query: 1321 XXXXXXXV-----------------DVSASKVEESTATETKDSSEQNQQPKKYAFVDVTP 1193 DVSA+KVEE K+ E ++PKK AFVD++P Sbjct: 179 SSPPPPTASSSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFVDISP 238 Query: 1192 EETFQXXXXXXXXXXXXXXXXKVFQ-------NXXXXXXXXXXXXXXXXXTNPQLSVEAL 1034 +ETFQ V Q + +NP LSV+AL Sbjct: 239 DETFQKGAFENFKDSAETAAVTVDQVTQNGAASQSGFGSNTSDSTSSTGKSNPLLSVDAL 298 Query: 1033 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDT 854 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNP YRQQLQDM+NN+GG PEWDNRMMD+ Sbjct: 299 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDS 358 Query: 853 LKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 674 LKNFDL+SPE+KQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK Sbjct: 359 LKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 418 Query: 673 YQNDKEVMDVFNKISELFPGVTGS 602 YQNDKEVMDVFNKISELFPGV+G+ Sbjct: 419 YQNDKEVMDVFNKISELFPGVSGA 442 >ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum] Length = 443 Score = 426 bits (1094), Expect = e-116 Identities = 244/444 (54%), Positives = 274/444 (61%), Gaps = 29/444 (6%) 
Frame = -2 Query: 1846 MENLSLVSSPKIVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVF---SSLQG 1676 MEN+ +VSSPK+VLGLS NP S NKP G F S Q Sbjct: 1 MENICIVSSPKMVLGLSSNPVIS--NKPLFGLPHLPKRPFKNGRIVRPTTCFEVVSCFQS 58 Query: 1675 PKSNK--ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLS 1502 P+ K +L K R FAS VNPQ S S VGSPLFWIGVGVGLS Sbjct: 59 PRLTKKIVLGKSGRGSFASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVGVGLS 118 Query: 1501 AIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXX 1322 A+F+ VA+ LK YAMQQA+KT+MGQ+ QN+QFSN FSP Sbjct: 119 ALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVSGPAS 178 Query: 1321 XXXXXXXV-----------------DVSASKVEESTATETKDSSEQNQQPKKYAFVDVTP 1193 DVSA+KVEE K+ +E ++PKK AFVD++P Sbjct: 179 SSPPPPTASTSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFVDISP 238 Query: 1192 EETFQXXXXXXXXXXXXXXXXKVFQ-------NXXXXXXXXXXXXXXXXXTNPQLSVEAL 1034 +ETFQ V Q + +NP +SV+AL Sbjct: 239 DETFQKGAFENFKDSTETASVTVDQVTQNGAASQLGFGPNTSDSTSSTGKSNPLMSVDAL 298 Query: 1033 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDT 854 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNP YRQQLQDM+NN+GG PEWDNRMMD+ Sbjct: 299 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDS 358 Query: 853 LKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 674 LKNFDL+SPE+KQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK Sbjct: 359 LKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 418 Query: 673 YQNDKEVMDVFNKISELFPGVTGS 602 YQNDKEVMDVFNKISELFPGV+GS Sbjct: 419 YQNDKEVMDVFNKISELFPGVSGS 442 >ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera] gi|296089465|emb|CBI39284.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 408 bits (1048), Expect = e-111 Identities = 243/446 (54%), Positives = 273/446 (61%), Gaps = 32/446 (7%) Frame = -2 Query: 1846 MENLSLVSSPKIVLGLSP-NPRY-----SIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSS 1685 M++L+LVSSPK+VLG SP NPR+ S F+ P L + S Sbjct: 1 MDSLTLVSSPKLVLGHSPSNPRHISCAHSSFSLPLL-----------FRKPRKFIAASQS 49 Query: 1684 
LQGPKSNK--ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGV 1511 P++ + + KL + FASI NPQ PSSN+GSPLFWIGVGV Sbjct: 50 GASPRTPRHVVETKLGTECFASISSSSQGTSSVGV-NPQFSPPPPSSNIGSPLFWIGVGV 108 Query: 1510 GLSAIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXX 1331 GLSA+FS VA+ LK YAMQQA KT+MGQ+ +QNNQF+ FSP Sbjct: 109 GLSALFSWVASNLKKYAMQQAFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTS 168 Query: 1330 XXXXXXXXXXV---------------DVSASKVEESTATETKDSSEQNQQPKKYAFVDVT 1196 DV A+KVE AT+ KD E+ + KYAFVDV+ Sbjct: 169 HSGPTTSPSGPTTSPSTVAAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVS 228 Query: 1195 PEETFQXXXXXXXXXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXTN--PQLSV 1043 PEET Q K V QN N P LSV Sbjct: 229 PEETLQESPFENFEESTETSSSKDAQFSAGVSQNGTPPRPGMGVSEDSQSTRNANPFLSV 288 Query: 1042 EALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRM 863 +ALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNP YRQQLQDMLNN+GG EWDNRM Sbjct: 289 DALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRM 348 Query: 862 MDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLS 683 MD LKNFDL+SPEVKQQFDQIGLTPEEVISKIMANPDVA+AFQNPR+QAAIMDCSQNPLS Sbjct: 349 MDNLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDCSQNPLS 408 Query: 682 IAKYQNDKEVMDVFNKISELFPGVTG 605 IAKYQNDKEVMDVFNKISELFPGV+G Sbjct: 409 IAKYQNDKEVMDVFNKISELFPGVSG 434 >gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 433 Score = 401 bits (1031), Expect = e-109 Identities = 223/363 (61%), Positives = 247/363 (68%), Gaps = 10/363 (2%) Frame = -2 Query: 1660 ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVA 1481 +L KL + FASI VNP V PSS +GSPLFWIGVGVGLSA+F+ VA Sbjct: 71 VLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGVGLSALFTWVA 130 Query: 1480 AKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXXX 1301 + LK YAMQQA KT+MGQ+ TQNNQFSNA F Sbjct: 131 SSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTSPSPSSQTAVT 190 Query: 1300 VDVSASKVEESTAT----ETKDSSEQNQQPKKYAFVDVTPEETFQXXXXXXXXXXXXXXX 1133 VDV A+KVE + AT E K +E 
+PKKYAFVDV+PEET Q Sbjct: 191 VDVPATKVEAAPATAPATEVKSETE-TAEPKKYAFVDVSPEETVQKSAFEDAAGISSSNN 249 Query: 1132 XKVFQNXXXXXXXXXXXXXXXXXT------NPQLSVEALEKMMEDPTVQKMVYPYLPEEM 971 + ++ + +P LSV+ALEKMMEDPTVQKMVYPYLPEEM Sbjct: 250 TQFPKDVSDNGAASKQDAGAFGGSQSTGSADPALSVDALEKMMEDPTVQKMVYPYLPEEM 309 Query: 970 RNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSPEVKQQFDQIGLT 791 RNP TFKWMLQNP YRQQLQDMLNN+GG+ EWDNRMMD+LKNFDLNSP+VKQQFDQIGLT Sbjct: 310 RNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNFDLNSPDVKQQFDQIGLT 369 Query: 790 PEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGV 611 PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGV Sbjct: 370 PEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGV 429 Query: 610 TGS 602 TGS Sbjct: 430 TGS 432 >ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 429 Score = 401 bits (1030), Expect = e-109 Identities = 233/428 (54%), Positives = 266/428 (62%), Gaps = 18/428 (4%) Frame = -2 Query: 1840 NLSLVSSPKIVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQGPKSNK 1661 NL+LVSSPK ++ L P +F + F +L SS PKS Sbjct: 5 NLALVSSPKPLM-LGHVPARDVFRRKHFSFGRVLIAPHRCRFRVSALS--SSHHNPKS-- 59 Query: 1660 ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVA 1481 + EKL +FASI V PQ+ SPSS +GSPLFWIGVGVGLSA+FS+VA Sbjct: 60 VQEKLIVKHFASISSSNTQETTSIGVKPQLS-PSPSSTIGSPLFWIGVGVGLSALFSVVA 118 Query: 1480 AKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXXX 1301 ++LK YAMQQA KT+MGQ+ +QNNQF NA FSP Sbjct: 119 SRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSATTQSRA 178 Query: 1300 V------------DVSASKVEESTATETKDSSEQNQQPKKYAFVDVTPEETFQXXXXXXX 1157 D+ A+KVE + T KD E +PKK AFVDV+PEET + Sbjct: 179 PSASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVRESPFESF 238 Query: 1156 XXXXXXXXXK------VFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPTVQKMV 995 + V QN LSV+ALEKMMEDPTVQKMV Sbjct: 239 KDDESSSVKEAWVPDEVSQNGAPSNLGFGDFPGSQSTKKSALSVDALEKMMEDPTVQKMV 298 Query: 994 YPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSPEVKQ 815 
YPYLPEEMRNPTTFKWMLQNP YRQQL++MLNN+GG+ EWDNRMMDTLKNFDLNSPEVKQ Sbjct: 299 YPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRMMDTLKNFDLNSPEVKQ 358 Query: 814 QFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNK 635 QFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVMDVFNK Sbjct: 359 QFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMDVFNK 418 Query: 634 ISELFPGV 611 ISELFPGV Sbjct: 419 ISELFPGV 426 >ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 432 Score = 400 bits (1028), Expect = e-108 Identities = 235/432 (54%), Positives = 269/432 (62%), Gaps = 22/432 (5%) Frame = -2 Query: 1840 NLSLVSSPK-IVLGLSPN---PRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQGP 1673 NL+LVSSPK ++LG P +F + F +L SS + P Sbjct: 5 NLALVSSPKPLMLGHVPAIDATSRDVFRRKHFSFGRVLIAPHRCRFRVSALS--SSHRNP 62 Query: 1672 KSNKILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIF 1493 KS + EKL +FASI VNPQ+ SPSS +GSPLFWIGVGVGLSA+F Sbjct: 63 KS--VQEKLIVKHFASISSSNTQEATSTGVNPQL---SPSSTIGSPLFWIGVGVGLSALF 117 Query: 1492 SLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXX 1313 S+VA++LK YAMQQA KT+MGQ+ +QNNQF NA FSP Sbjct: 118 SVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSATT 177 Query: 1312 XXXXV------------DVSASKVEESTATETKDSSEQNQQPKKYAFVDVTPEETFQXXX 1169 D+ A+KVE + T KD E +PKK AFVDV+PEET Q Sbjct: 178 QSRAPSASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQESP 237 Query: 1168 XXXXXXXXXXXXXK------VFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPTV 1007 + V QN LSV+ALEKMMEDPTV Sbjct: 238 FESFKDDESSSVKEARVPDEVSQNGAPSNQGFGDFPGSQSTKKSVLSVDALEKMMEDPTV 297 Query: 1006 QKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSP 827 QKMVYPYLPEEMRNPTTFKWMLQNP YRQQL++MLNN+GG+ EWD+RMMDTLKNFDLNSP Sbjct: 298 QKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSP 357 Query: 826 EVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMD 647 EVKQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVMD Sbjct: 358 EVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMD 417 
Query: 646 VFNKISELFPGV 611 VFNKISELFPGV Sbjct: 418 VFNKISELFPGV 429 >ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum] Length = 433 Score = 390 bits (1002), Expect = e-105 Identities = 231/432 (53%), Positives = 264/432 (61%), Gaps = 20/432 (4%) Frame = -2 Query: 1840 NLSLVSSPK-IVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQGPKSN 1664 NL+LVSSPK ++LG S + KPF + S Q PKS Sbjct: 5 NLALVSSPKPLLLGHSSSRNVFTRRKPFT--FGKFFVSANSSSSHVTRAAPKSHQNPKS- 61 Query: 1663 KILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLV 1484 + KL FASI V+PQ+ PSS VGSPLFWIGVGVG SA+FS+V Sbjct: 62 -VQGKLIVHNFASISSSNSQETTSVGVSPQLS-PPPSSTVGSPLFWIGVGVGFSALFSIV 119 Query: 1483 AAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXX 1304 A++LK YAMQQA KT+MGQ+ TQNN F +A FSP Sbjct: 120 ASRLKKYAMQQAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGPAAPASSAGTQSQ 179 Query: 1303 XV------------DVSASKVEESTATETKDSSEQNQQPKKYAFVDVTPEETFQXXXXXX 1160 D+ A+KVE + +T KD E +PKK FVDV+PEE+ Q Sbjct: 180 STSARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQKSPFES 239 Query: 1159 XXXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPTVQK 1001 K FQN LSVEALEKMMEDPTVQK Sbjct: 240 FKDVDESSSFKEARAPAEAFQNGAPSNQGFGNSPGSQSGGKSVLSVEALEKMMEDPTVQK 299 Query: 1000 MVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSPEV 821 MVYPYLPEEMRNP+TFKWMLQNP YRQQL++MLNN+GG+ EWD+RMMDTLKNFDLNSP+V Sbjct: 300 MVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSPDV 359 Query: 820 KQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVF 641 KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCS NPL+IAKYQNDKEVMDVF Sbjct: 360 KQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSSNPLNIAKYQNDKEVMDVF 419 Query: 640 NKISELFPGVTG 605 NKISELFPGV+G Sbjct: 420 NKISELFPGVSG 431 >gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial [Theobroma cacao] Length = 412 Score = 384 bits (986), Expect = e-104 Identities = 213/347 (61%), Positives = 236/347 (68%), Gaps = 4/347 (1%) Frame = -2 Query: 1660 
ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVA 1481 +L KL + FASI VNP V PSS +GSPLFWIGVGVGLSA+F+ VA Sbjct: 71 VLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGVGLSALFTWVA 130 Query: 1480 AKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXXX 1301 + LK YAMQQA KT+MGQ+ TQNNQFSNA F Sbjct: 131 SSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTSPSPSSQTAVT 190 Query: 1300 VDVSASKVEESTAT----ETKDSSEQNQQPKKYAFVDVTPEETFQXXXXXXXXXXXXXXX 1133 VDV A+KVE + AT E K +E +PKKYAFVDV+PEET Q Sbjct: 191 VDVPATKVEAAPATAPATEVKSETE-TAEPKKYAFVDVSPEETVQKSAFEDAAGISSSNN 249 Query: 1132 XKVFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPTVQKMVYPYLPEEMRNPTTF 953 + ++ +P LSV+ALEKMMEDPTVQKMVYPYLPEEMRNP TF Sbjct: 250 TQFPKDDAGAFGGSQSTGSA----DPALSVDALEKMMEDPTVQKMVYPYLPEEMRNPETF 305 Query: 952 KWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVIS 773 KWMLQNP YRQQLQDMLNN+GG+ EWDNRMMD+LKNFDLNSP+VKQQFDQIGLTPEEVIS Sbjct: 306 KWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNFDLNSPDVKQQFDQIGLTPEEVIS 365 Query: 772 KIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKI 632 KIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKI Sbjct: 366 KIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKI 412 >gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris] Length = 430 Score = 383 bits (983), Expect = e-103 Identities = 227/429 (52%), Positives = 263/429 (61%), Gaps = 19/429 (4%) Frame = -2 Query: 1840 NLSLVSSPK-IVLGLSP----NPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQG 1676 NL+LVSS K ++LG P R + KPF + SS Sbjct: 5 NLALVSSSKPLMLGHVPARDATDRDVLRRKPF---SLGRVLIAPHRFRYRVSALSSSHHS 61 Query: 1675 PKSNKILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAI 1496 PKS + +KL +FASI VNPQ+ PSS +GSPLFWIGVGVGLSA+ Sbjct: 62 PKS--VQDKLIVKHFASISSSNTQETTSIGVNPQLS-PPPSSTIGSPLFWIGVGVGLSAL 118 Query: 1495 FSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXX 1316 FS+VA++LK YAMQQA KT+MGQ+ + NN F NA FSP Sbjct: 119 FSMVASRLKKYAMQQAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATAQYGA 178 Query: 1315 
XXXXXV-------DVSASKVEESTATETKDSSEQNQQPKKYAFVDVTPEETFQXXXXXXX 1157 D+ A+KVE + T+ KD E +PKK AFVDV+PEET Q Sbjct: 179 PSTSSGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPFESV 238 Query: 1156 XXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPTVQKM 998 + V QN LSV+ALEKMMEDPTVQKM Sbjct: 239 KDNESSSVKEEARVPDEVSQNGAPFNQGFGGFPGSQSTKKSALSVDALEKMMEDPTVQKM 298 Query: 997 VYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSPEVK 818 VYP+LPEEMRNP TFKWMLQNP YRQQL+ ML+N+GG+ EWDNRMMDTLKNFDLNSPEVK Sbjct: 299 VYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTLKNFDLNSPEVK 358 Query: 817 QQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFN 638 QQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVM+VFN Sbjct: 359 QQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMNVFN 418 Query: 637 KISELFPGV 611 KISELFPG+ Sbjct: 419 KISELFPGM 427 >ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis] gi|223528427|gb|EEF30461.1| conserved hypothetical protein [Ricinus communis] Length = 465 Score = 379 bits (973), Expect = e-102 Identities = 214/381 (56%), Positives = 243/381 (63%), Gaps = 34/381 (8%) Frame = -2 Query: 1651 KLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVAAKL 1472 +L ++FASI P +P S SS GSPLFWIGVGVGLSAIFSLVA ++ Sbjct: 84 RLGAEHFASISSRQQTSSVGVNPQP-LPPPSSSSQFGSPLFWIGVGVGLSAIFSLVATRV 142 Query: 1471 KAYAMQQAIKTVMGQIPTQNNQFSNAGFSP-------------------------NXXXX 1367 K YAMQQA K++M Q+ TQN+QF+N FSP + Sbjct: 143 KNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPASVPASSPPFPTSSTSRPAT 202 Query: 1366 XXXXXXXXXXXXXXXXXXXXXXVDVSASKVEESTATETKDSSEQNQQPKKYAFVDVTPEE 1187 VDVSA+KVE ++ T+ KD +E ++PKKYAFVDV+PEE Sbjct: 203 SPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDEAEITKEPKKYAFVDVSPEE 262 Query: 1186 TF-------QXXXXXXXXXXXXXXXXKVFQNXXXXXXXXXXXXXXXXXTNPQ--LSVEAL 1034 TF +V QN LSVEAL Sbjct: 263 TFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAADFTGSQSTRKAGSGLSVEAL 322 Query: 1033 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDT 854 EKMMEDPTVQKMVYPYLPEEMRNP+TFKWMLQNP 
YRQQL++MLNN+ GT EWDNRMMD+ Sbjct: 323 EKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMSGTGEWDNRMMDS 382 Query: 853 LKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 674 LKNFDL+SPEVKQQFDQIGLTPEEVISKIMANP++AMAFQNPRVQ AIMDCSQNPLSIAK Sbjct: 383 LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQQAIMDCSQNPLSIAK 442 Query: 673 YQNDKEVMDVFNKISELFPGV 611 YQNDKEVMDVFNKISELFPGV Sbjct: 443 YQNDKEVMDVFNKISELFPGV 463 >gb|ABF19057.1| plastid Tic40 [Ricinus communis] Length = 460 Score = 379 bits (973), Expect = e-102 Identities = 214/381 (56%), Positives = 243/381 (63%), Gaps = 34/381 (8%) Frame = -2 Query: 1651 KLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVAAKL 1472 +L ++FASI P +P S SS GSPLFWIGVGVGLSAIFSLVA ++ Sbjct: 79 RLGAEHFASISSRQQTSSVGVNPQP-LPPPSSSSQFGSPLFWIGVGVGLSAIFSLVATRV 137 Query: 1471 KAYAMQQAIKTVMGQIPTQNNQFSNAGFSP-------------------------NXXXX 1367 K YAMQQA K++M Q+ TQN+QF+N FSP + Sbjct: 138 KNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPASVPASSPPFPTSSTSRPAT 197 Query: 1366 XXXXXXXXXXXXXXXXXXXXXXVDVSASKVEESTATETKDSSEQNQQPKKYAFVDVTPEE 1187 VDVSA+KVE ++ T+ KD +E ++PKKYAFVDV+PEE Sbjct: 198 SPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDEAEITKEPKKYAFVDVSPEE 257 Query: 1186 TF-------QXXXXXXXXXXXXXXXXKVFQNXXXXXXXXXXXXXXXXXTNPQ--LSVEAL 1034 TF +V QN LSVEAL Sbjct: 258 TFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAADFTGSQSTRKAGSGLSVEAL 317 Query: 1033 EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDT 854 EKMMEDPTVQKMVYPYLPEEMRNP+TFKWMLQNP YRQQL++MLNN+ GT EWDNRMMD+ Sbjct: 318 EKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMSGTGEWDNRMMDS 377 Query: 853 LKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 674 LKNFDL+SPEVKQQFDQIGLTPEEVISKIMANP++AMAFQNPRVQ AIMDCSQNPLSIAK Sbjct: 378 LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQQAIMDCSQNPLSIAK 437 Query: 673 YQNDKEVMDVFNKISELFPGV 611 YQNDKEVMDVFNKISELFPGV Sbjct: 438 YQNDKEVMDVFNKISELFPGV 458 >ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa] gi|222848840|gb|EEE86387.1| hypothetical protein 
POPTR_0004s08560g [Populus trichocarpa] Length = 429 Score = 366 bits (940), Expect = 2e-98 Identities = 206/364 (56%), Positives = 238/364 (65%), Gaps = 15/364 (4%) Frame = -2 Query: 1651 KLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVAAKL 1472 KL +YFASI VNPQ PV P S +GSPLFW+GVGVGLSAIFS VA ++ Sbjct: 68 KLGSEYFASISSSSGKQTASVGVNPQ-PVSPPPSQIGSPLFWVGVGVGLSAIFSWVATRV 126 Query: 1471 KAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXXXVDV 1292 K YAMQQA K++ Q+ TQNNQF+ A + VD+ Sbjct: 127 KNYAMQQAFKSLTEQMNTQNNQFNPAFSARPPFPFSPPPASHPSTSPSPAASQPAITVDI 186 Query: 1291 SASKVEESTATETKDSSEQN--------QQPKKYAFVDVTPEETFQXXXXXXXXXXXXXX 1136 A+KVE + T+ E + ++ KKYAFVD++PEET Sbjct: 187 PATKVEAAPTTDVGKEKETDFLEERKIKEETKKYAFVDISPEETSLNTPFSSVEDDNETS 246 Query: 1135 XXK-------VFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPTVQKMVYPYLPE 977 K VFQN P LSVEALEKMMEDPT+QKMVYPYLPE Sbjct: 247 SSKDVEFAKKVFQNGAAFKQGPGAAEGSQST-RPFLSVEALEKMMEDPTMQKMVYPYLPE 305 Query: 976 EMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSPEVKQQFDQIG 797 EMRNPTTFKWMLQNP YRQQL+DMLNN+GG+ +WD++MMD+LK+FDLNS EVKQQFDQIG Sbjct: 306 EMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSGKWDSQMMDSLKDFDLNSAEVKQQFDQIG 365 Query: 796 LTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFP 617 LTPEEVISKIMANPDVAMAFQNPRVQ AIM+CSQNP++I KYQNDKEVMDVFNKISELFP Sbjct: 366 LTPEEVISKIMANPDVAMAFQNPRVQQAIMECSQNPINITKYQNDKEVMDVFNKISELFP 425 Query: 616 GVTG 605 G+TG Sbjct: 426 GMTG 429 >ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus] Length = 419 Score = 365 bits (936), Expect = 5e-98 Identities = 194/333 (58%), Positives = 232/333 (69%), Gaps = 7/333 (2%) Frame = -2 Query: 1579 PQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFS 1400 P + + PSS VGSPLFW+GVGVGLSA+F+ VA+ LK YAMQQA KT+M Q+ +QN+ S Sbjct: 88 PSVSIPPPSSYVGSPLFWVGVGVGLSALFTWVASYLKKYAMQQAFKTMMSQMNSQNSPMS 147 Query: 1399 NAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXXXVDVSASKVEESTATETKDSSEQNQQPK 1220 N S + +DV+A+KVEE T K +E N + K Sbjct: 148 
NPTLS-SGSPFPIPPTFATGTTISPSVSEPAVSIDVTATKVEEEPVTNVKSRTE-NMEAK 205 Query: 1219 KYAFVDVTPEET-----FQXXXXXXXXXXXXXXXXKVFQNXXXXXXXXXXXXXXXXXTNP 1055 K+AFVDV+PEET F+ ++ QN P Sbjct: 206 KFAFVDVSPEETDQKSPFKEDATDADVSKSAQPTQELPQNGAASKQAYNGSDGSQFSRKP 265 Query: 1054 Q--LSVEALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTP 881 LSVEA+EKMMEDPTVQKM+YP+LPEEMRNP TFKWM+QNP+YRQQL++MLNN+ G+P Sbjct: 266 GSVLSVEAVEKMMEDPTVQKMIYPHLPEEMRNPETFKWMMQNPLYRQQLEEMLNNMSGSP 325 Query: 880 EWDNRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDC 701 +WD R+MD+LKNFDL+SPEVKQQFDQIGLTPEEVISKIMANP++AMAFQNPRVQAAIMDC Sbjct: 326 QWDGRLMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQAAIMDC 385 Query: 700 SQNPLSIAKYQNDKEVMDVFNKISELFPGVTGS 602 SQNPLSI KYQNDKEVMDVFNKISELFPGV+G+ Sbjct: 386 SQNPLSITKYQNDKEVMDVFNKISELFPGVSGA 418 >ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Protein PIGMENT DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=AtTIC40; Flags: Precursor gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6 [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|20260222|gb|AAM13009.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1| At5g16620 [Arabidopsis thaliana] gi|332004935|gb|AED92318.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 447 Score = 361 bits (926), Expect = 7e-97 Identities = 217/454 (47%), Positives = 262/454 (57%), Gaps = 40/454 (8%) Frame = -2 Query: 1846 MENLSLVS----SPKIVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQ 1679 MENL+LVS SPK+++G + F + + Q Sbjct: 1 MENLTLVSCSASSPKLLIGCN-------FTSSLKNPTGFSRRTPNIVLRCSKISASAQSQ 53 Query: 1678 GPKSNK------ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSN-VGSPLFWIG 1520 P S ++ K R FASI +P +PV PSS+ +GSPLFWIG Sbjct: 54 SPSSRPENTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFWIG 113 
Query: 1519 VGVGLSAIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXX 1340 VGVGLSA+FS V + LK YAMQ A+KT+M Q+ TQN+QF+N+GF Sbjct: 114 VGVGLSALFSYVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPFPPQTSP 173 Query: 1339 XXXXXXXXXXXXXV--DVSASKVEESTATETK----------------DSSEQNQQPKKY 1214 DV+A+KVE +T+ K ++S++ ++ K Y Sbjct: 174 ASSPFQSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEKNY 233 Query: 1213 AFVDVTPEETFQXXXXXXXXXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXTN- 1058 AF D++PEET + K V QN Sbjct: 234 AFEDISPEETTKESPFSNYAEVSETNSPKETRLFEDVLQNGAGPANGATASEVFQSLGGG 293 Query: 1057 ---PQLSVEALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGG 887 P LSVEALEKMMEDPTVQKMVYPYLPEEMRNP TFKWML+NP YRQQLQDMLNN+ G Sbjct: 294 KGGPGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNMSG 353 Query: 886 TPEWDNRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIM 707 + EWD RM DTLKNFDLNSPEVKQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA+M Sbjct: 354 SGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAALM 413 Query: 706 DCSQNPLSIAKYQNDKEVMDVFNKISELFPGVTG 605 +CS+NP++I KYQNDKEVMDVFNKIS+LFPG+TG Sbjct: 414 ECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447 >emb|CAB50925.1| translocon Tic40 [Pisum sativum] Length = 436 Score = 360 bits (924), Expect = 1e-96 Identities = 216/434 (49%), Positives = 257/434 (59%), Gaps = 22/434 (5%) Frame = -2 Query: 1840 NLSLVSSPKIVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQGPKSNK 1661 NL+LVSSPK +L L + ++F++ + S Q KS Sbjct: 5 NLALVSSPKPLL-LGHSSSKNVFSRRKSFTFGTFRVSANSSSSHVTRAASKSHQNLKS-- 61 Query: 1660 ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLVA 1481 + K+ FASI V+PQ+ PS+ VGSPLFWIG+GVG SA+FS+VA Sbjct: 62 VQGKVNAHSFASISSSNGQETTSVGVSPQLSPPPPST-VGSPLFWIGIGVGFSALFSVVA 120 Query: 1480 AKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXXX 1301 +++K YAMQQA K++MGQ+ TQNN F + FS Sbjct: 121 SRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFAGNQSQA 180 Query: 1300 V------------DVSASKVEESTAT---ETKDSSEQNQQPKKYAFVDVTPEETFQXXXX 1166 D+ A+KVE + K+ E +PKK AFVDV+PEET Q Sbjct: 181 
TSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNAF 240 Query: 1165 XXXXXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPTV 1007 K QN LSV+ALEKMMEDPTV Sbjct: 241 ERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPGSPSERKSALSVDALEKMMEDPTV 300 Query: 1006 QKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSP 827 Q+MVYPYLPEEMRNP+TFKWM+QNP YRQQL+ MLNN+GG EWD+RMMDTLKNFDLNSP Sbjct: 301 QQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLKNFDLNSP 360 Query: 826 EVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMD 647 +VKQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQNDKEVMD Sbjct: 361 DVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQNDKEVMD 420 Query: 646 VFNKISELFPGVTG 605 VFNKISELFPGV+G Sbjct: 421 VFNKISELFPGVSG 434 >sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=PsTIC40; Flags: Precursor gi|26000725|gb|AAN75219.1| chloroplast protein translocon component Tic40 precursor [Pisum sativum] Length = 436 Score = 359 bits (922), Expect = 2e-96 Identities = 218/435 (50%), Positives = 256/435 (58%), Gaps = 23/435 (5%) Frame = -2 Query: 1840 NLSLVSSPK-IVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQGPKSN 1664 NL+LVSSPK ++LG S + K F + S Q KS Sbjct: 5 NLALVSSPKPLLLGHSSSKNVFSGRKSFT--FGTFRVSANSSSSHVTRAASKSHQNLKS- 61 Query: 1663 KILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLV 1484 + K+ FASI V+PQ+ PS+ VGSPLFWIG+GVG SA+FS+V Sbjct: 62 -VQGKVNAHDFASISSSNGQETTSVGVSPQLSPPPPST-VGSPLFWIGIGVGFSALFSVV 119 Query: 1483 AAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXX 1304 A+++K YAMQQA K++MGQ+ TQNN F + FS Sbjct: 120 ASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFAGNQSQ 179 Query: 1303 XV------------DVSASKVEESTAT---ETKDSSEQNQQPKKYAFVDVTPEETFQXXX 1169 D+ A+KVE + K+ E +PKK AFVDV+PEET Q Sbjct: 180 ATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNA 239 Query: 1168 XXXXXXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXTNPQLSVEALEKMMEDPT 1010 K QN 
LSV+ALEKMMEDPT Sbjct: 240 FERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPSSPSERKSALSVDALEKMMEDPT 299 Query: 1009 VQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNS 830 VQ+MVYPYLPEEMRNP+TFKWM+QNP YRQQL+ MLNN+GG EWD+RMMDTLKNFDLNS Sbjct: 300 VQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLKNFDLNS 359 Query: 829 PEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVM 650 P+VKQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQNDKEVM Sbjct: 360 PDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQNDKEVM 419 Query: 649 DVFNKISELFPGVTG 605 DVFNKISELFPGV+G Sbjct: 420 DVFNKISELFPGVSG 434 >ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 448 Score = 359 bits (921), Expect = 3e-96 Identities = 205/398 (51%), Positives = 237/398 (59%), Gaps = 37/398 (9%) Frame = -2 Query: 1684 LQGPKSNKILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGL 1505 L + + KL+ + FASI +NPQ P S +GSPLFWIGVGV Sbjct: 51 LSAAANQPVTSKLQTERFASISSTNSQETSSVGINPQFSAPPPPSTIGSPLFWIGVGVAF 110 Query: 1504 SAIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXX 1325 SA+FS A KL+ Y +QQA K VMGQ+ TQN+QFSNA FSP Sbjct: 111 SAVFSWAAGKLQKYVVQQAFKNVMGQMNTQNDQFSNAAFSPGSPFPFPSAPASPSASPFS 170 Query: 1324 XXXXXXXXVDVSASKVEESTATETKD-------SSEQNQQPKKY---------------- 1214 DVSA++V+ ++ T S EQ + ++ Sbjct: 171 APSQPSFT-DVSATEVDSPASSATPSTPAADVKSEEQQMKENRFGNSFEIERNNVIQFSR 229 Query: 1213 -----AFVDVTPEET-FQXXXXXXXXXXXXXXXXKVFQNXXXXXXXXXXXXXXXXXTNPQ 1052 AFVDV PEET + ++ N Q Sbjct: 230 QLSDRAFVDVNPEETELKSPFASSLNDTEPGSSKEINSNVEGSQNGAAFKQAKDASMGSQ 289 Query: 1051 --------LSVEALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNN 896 LSVEALEKM+EDPTVQKMVYPYLPEEMRNPTTFKWMLQNP YRQQL+DML N Sbjct: 290 TTGKENSVLSVEALEKMLEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLRN 349 Query: 895 LGGTPEWDNRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQA 716 + G+ EWDNRMMD+LKNFDL+SPEVK+QFDQIGLTPE+VISKIMANPDVAMAFQNPRVQA Sbjct: 350 MTGSNEWDNRMMDSLKNFDLSSPEVKEQFDQIGLTPEQVISKIMANPDVAMAFQNPRVQA 409 Query: 715 
AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGVTGS 602 AIMDCSQNP+SI KYQNDKEVMDVFNKISELFPGV+GS Sbjct: 410 AIMDCSQNPMSITKYQNDKEVMDVFNKISELFPGVSGS 447 >ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata] Length = 447 Score = 358 bits (920), Expect = 4e-96 Identities = 216/454 (47%), Positives = 262/454 (57%), Gaps = 40/454 (8%) Frame = -2 Query: 1846 MENLSLVS----SPKIVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQ 1679 MENL+LVS SPK+++G + F + + Q Sbjct: 1 MENLTLVSCSASSPKLLIGCN-------FTSSLKNPTGFSRRTPRIVLRCSKISASAQSQ 53 Query: 1678 GPKSNK------ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSN-VGSPLFWIG 1520 P S ++ K R FASI +P +PV PSS+ +GSPLFWIG Sbjct: 54 SPSSRPDNTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFWIG 113 Query: 1519 VGVGLSAIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXX 1340 VGVGLSA+FSLV + LK YAMQ A+KT+M Q+ TQN+QF+N GF Sbjct: 114 VGVGLSALFSLVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPFPPQTSP 173 Query: 1339 XXXXXXXXXXXXXV--DVSASKVEESTATETK----------------DSSEQNQQPKKY 1214 DV+A+KV+ +T+ K ++S++ ++ K Y Sbjct: 174 ASSPFQSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEKNY 233 Query: 1213 AFVDVTPEETFQXXXXXXXXXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXTNP 1055 AF D++PEET + K V QN Sbjct: 234 AFEDISPEETTKESPFSNYAEVSETSSPKETRLFEDVLQNGAGPANGATASEVFQSLGGG 293 Query: 1054 Q----LSVEALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGG 887 + LSVEALEKMMEDPTVQKMVYPYLPEEMRNP TFKWML+NP YRQQLQDMLNN+ G Sbjct: 294 KGGAGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNMSG 353 Query: 886 TPEWDNRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIM 707 + EWD RM DTLKNFDLNSPEVKQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA+M Sbjct: 354 SGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAALM 413 Query: 706 DCSQNPLSIAKYQNDKEVMDVFNKISELFPGVTG 605 +CS+NP++I KYQNDKEVMDVFNKIS+LFPG+TG Sbjct: 414 ECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447 >ref|XP_006372572.1| hypothetical protein 
POPTR_0017s02900g [Populus trichocarpa] gi|550319201|gb|ERP50369.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa] Length = 435 Score = 348 bits (892), Expect = 6e-93 Identities = 211/426 (49%), Positives = 250/426 (58%), Gaps = 20/426 (4%) Frame = -2 Query: 1840 NLSLVSSPKIVLGLSPNPRYSIFN-KPFLGFXXXXXXXXXXXXXXXSLLVFSSLQGPKSN 1664 +L LVS L P++SI +P L F + S GP+ Sbjct: 13 SLKLVSGYPTSLKNPTTPKFSISTTRPSLPFPHRTSKTVTHTSRIS-ISALSQSHGPRRT 71 Query: 1663 KILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSNVGSPLFWIGVGVGLSAIFSLV 1484 K +YFASI VNPQ V P S +GSPLFW+GVGV LSAIFS V Sbjct: 72 S---KNGSEYFASISSLSGQQTASVGVNPQ-SVSPPPSQIGSPLFWVGVGVALSAIFSWV 127 Query: 1483 AAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXXXXXXXXXXX 1304 A +LK YAMQQA K++ Q+ QNNQF+ A + + Sbjct: 128 ATRLKNYAMQQAFKSLTEQMNAQNNQFNPAFSARSPFPFSPPPASQPATSPFQTASQPAV 187 Query: 1303 XVDVSASKVE--------ESTATETKDSSEQNQQPKKYAFVDVTPEETFQXXXXXXXXXX 1148 VD+ A+KVE + T+T + E ++P+K+AFVDV+PEET Sbjct: 188 TVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKFAFVDVSPEETSLNTPFSSVEDV 247 Query: 1147 XXXXXXKVFQNXXXXXXXXXXXXXXXXXTNPQ-----------LSVEALEKMMEDPTVQK 1001 K Q + P LSVEALEKMM+DPTVQK Sbjct: 248 IDTSSSKDVQFAKEASQNGATFKQGPSASEPSEGSQSSQKAGSLSVEALEKMMDDPTVQK 307 Query: 1000 MVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGTPEWDNRMMDTLKNFDLNSPEV 821 MVYPYLPEEMRNPTTFKWMLQNP YRQQL++MLNN+ G+ EWD+RM+D+LKNFDL+SPEV Sbjct: 308 MVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMSGSSEWDSRMVDSLKNFDLSSPEV 367 Query: 820 KQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVF 641 KQQFDQIGLTPEEVISKIMANPDVA+AFQNPRVQ AIM+CSQNPLSIAKYQNDKEVMDVF Sbjct: 368 KQQFDQIGLTPEEVISKIMANPDVALAFQNPRVQQAIMECSQNPLSIAKYQNDKEVMDVF 427 Query: 640 NKISEL 623 NKISE+ Sbjct: 428 NKISEI 433 >ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum] gi|557101290|gb|ESQ41653.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum] Length = 449 Score = 347 bits (889), Expect = 1e-92 Identities = 215/453 (47%), Positives = 259/453 (57%), Gaps = 39/453 (8%) Frame = -2 Query: 1846 
MENLSLVS----SPKIVLGLSPNPRYSIFNKPFLGFXXXXXXXXXXXXXXXSLLVFSSLQ 1679 MENL+LVS SPK+++G + ++ K +GF + S Sbjct: 1 MENLTLVSCSASSPKLLIGCN----FTSSLKNPVGFSRRTPKVVFRCSKISASAKSQSHS 56 Query: 1678 GPKSNK---ILEKLRRDYFASIXXXXXXXXXXXXVNPQIPVQSPSSN-VGSPLFWIGVGV 1511 N ++ K R FASI P V PSS+ +GSPLFWIGVGV Sbjct: 57 SRPENAGEIVVVKHRSRDFASIFSSNRDQQTTSVAYPNAAVPPPSSSTIGSPLFWIGVGV 116 Query: 1510 GLSAIFSLVAAKLKAYAMQQAIKTVMGQIPTQNNQFSNAGFSPNXXXXXXXXXXXXXXXX 1331 GLSA+FS V + LK YAMQ A+KT+M Q+ TQN+QF+N GF Sbjct: 117 GLSALFSWVTSSLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPAGSASPFPFPFPPQTSPT 176 Query: 1330 XXXXXXXXXXV----DVSASKVEESTATETK----------------DSSEQNQQPKKYA 1211 DV+A+KV+ + + + + ++ ++ K YA Sbjct: 177 SSPFQSQSQSSGATVDVTATKVDTPPSAKPQPTPAKKTEVDKPSVVLEENKAKKEEKNYA 236 Query: 1210 FVDVTPEETFQXXXXXXXXXXXXXXXXK-------VFQNXXXXXXXXXXXXXXXXXT--- 1061 F DV+PEET + K V QN Sbjct: 237 FEDVSPEETTKESPFSNYAEVSETSAPKEARLFEDVMQNGAAPANGATASEVFQSLGAGK 296 Query: 1060 -NPQLSVEALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPMYRQQLQDMLNNLGGT 884 P LSVEALEKMMEDPTVQKMVYP+LPEEMRNP TFKWML+NP YRQQLQDMLNN+ G+ Sbjct: 297 GGPGLSVEALEKMMEDPTVQKMVYPHLPEEMRNPETFKWMLKNPHYRQQLQDMLNNMSGS 356 Query: 883 PEWDNRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMD 704 EWD RMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIM NPDVAMAFQNPRVQAA+M+ Sbjct: 357 GEWDKRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMENPDVAMAFQNPRVQAALME 416 Query: 703 CSQNPLSIAKYQNDKEVMDVFNKISELFPGVTG 605 CS+NP++I KYQNDKEVMDVFNKIS+LFPG+TG Sbjct: 417 CSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 449