BLASTX nr result
ID: Akebia24_contig00005441
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia24_contig00005441 (1514 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi... 501 e-139 ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family prot... 477 e-132 gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus... 477 e-132 ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik... 475 e-131 ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik... 468 e-129 ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik... 457 e-126 ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik... 456 e-126 ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik... 451 e-124 ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phas... 444 e-122 ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu... 443 e-121 ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu... 440 e-121 ref|XP_006858564.1| hypothetical protein AMTR_s00071p00175860 [A... 438 e-120 ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family prot... 435 e-119 ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik... 434 e-119 emb|CAB50925.1| translocon Tic40 [Pisum sativum] 429 e-117 ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-lik... 426 e-116 sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti... 426 e-116 ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092... 411 e-112 ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab... 405 e-110 ref|XP_007032982.1| Hydroxyproline-rich glycoprotein family prot... 400 e-109 >ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera] gi|296089465|emb|CBI39284.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 501 bits (1290), Expect = e-139 Identities = 275/447 (61%), Positives = 307/447 (68%), Gaps = 14/447 (3%) Frame = -1 Query: 1379 MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSD 1200 M++ +L+SSPK +LG SP+ I S +LP L RK R +++ G S Sbjct: 1 MDSLTLVSSPKLVLGHSPSNPRHISCAHSSFSLPLLFRK-------PRKFIAASQSGASP 53 Query: 1199 KAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGFSA 1020 + + G E FA VGVNPQ S PPPSS+IGSPLFWIGVGVG SA Sbjct: 54 RTPRHVVETKLGTECFASISSSSQGTSS-VGVNPQFSPPPPSSNIGSPLFWIGVGVGLSA 112 Query: 1019 LFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPG--------------XXXXXX 882 LFSWVA+NLKKYAMQQAFKTLMGQM +N+QFN FSPG Sbjct: 113 LFSWVASNLKKYAMQQAFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTSHSGP 172 Query: 881 XXXXXXXXXXXXXXXXXXXXXXXXPATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEM 702 PATKVE+ P T+VKD+ E KNE +YAFVDVSPEE Sbjct: 173 TTSPSGPTTSPSTVAAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVSPEET 232 Query: 701 SQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVE 522 Q+ PFEN ES ETS SKD QF VSQNG + G +E SQSTR P +SV+ Sbjct: 233 LQESPFENFEESTETSSSKDAQFSAGVSQNGTPPRPGMGV-SEDSQSTRNA--NPFLSVD 289 Query: 521 ALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMM 342 ALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DMLNNMGG EWDNRMM Sbjct: 290 ALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRMM 349 Query: 341 DSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSI 162 D+LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANP+VA+AFQNPR+QAAIMDCSQNP+SI Sbjct: 350 DNLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDCSQNPLSI 409 Query: 161 AKYQNDKEVMDVFNKISELFPGVTGPP 81 AKYQNDKEVMDVFNKISELFPGV+GPP Sbjct: 410 AKYQNDKEVMDVFNKISELFPGVSGPP 436 >ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508712012|gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 433 Score = 477 bits (1228), Expect = e-132 Identities = 275/441 (62%), Positives = 305/441 (69%), Gaps = 10/441 (2%) Frame = -1 Query: 1373 NFSLISSPK-----FLLGSS-PNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 1212 N +L+SS +LLG + PN P F LP S +R S SQ Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNP----FKTLPFPSSNLAPRRSRISIFAHSHSQ 60 Query: 1211 GTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGV 1032 T + P L G E FA SVGVNP +VPPPSS IGSPLFWIGVGV Sbjct: 61 PTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGV 120 Query: 1031 GFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXX 852 G SALF+WVA++LKKYAMQQAFKT+MGQM+ +N+QF+NAAF G Sbjct: 121 GLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTS 180 Query: 851 XXXXXXXXXXXXXXPATKVESAPTT----EVKDNTETKNEPKRYAFVDVSPEEMSQKDPF 684 ATKVE+AP T EVK TET EPK+YAFVDVSPEE QK F Sbjct: 181 PSPSSQTAVTVDVP-ATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAF 238 Query: 683 ENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMM 504 E++ S S + QF ++VS NGAA+KQD GA+ GSQST G P +SV+ALEKMM Sbjct: 239 EDAAG---ISSSNNTQFPKDVSDNGAASKQDAGAFG-GSQST--GSADPALSVDALEKMM 292 Query: 503 EDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNF 324 EDPTVQ MVYPYLPEEMRNP TFKWMLQNPQYRQQL+DMLNNMGGS EWDNRMMDSLKNF Sbjct: 293 EDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNF 352 Query: 323 DLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQND 144 DL+SP+VKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNP+SIAKYQND Sbjct: 353 DLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQND 412 Query: 143 KEVMDVFNKISELFPGVTGPP 81 KEVMDVFNKISELFPGVTG P Sbjct: 413 KEVMDVFNKISELFPGVTGSP 433 >gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus guttatus] Length = 430 Score = 477 bits (1227), Expect = e-132 Identities = 270/446 (60%), Positives = 303/446 (67%), Gaps = 15/446 (3%) Frame = -1 Query: 1379 MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKR----RYDPTRTRNL-VSSLS 1215 MEN L+SSPK +LG SPN + + LP+L +K R+ T +L V SL Sbjct: 1 MENLGLVSSPKIVLGVSPNPRNSVISSKPLVGLPNLLKKTGNYGRHTTIHTSSLQVLSLF 60 Query: 1214 QG-------TSDKAHPSR--RLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIG 1062 + S+KA R +S +G E + VGVNPQ+SVPP SS +G Sbjct: 61 RSPKPTKTIVSEKAAKDRFATISSSGQETSS------------VGVNPQLSVPP-SSQVG 107 Query: 1061 SPLFWIGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXX 882 SPLFWIGVGVG SALFS+VA LKKYAM+QAFKT QM+ +NS F NAAFSPG Sbjct: 108 SPLFWIGVGVGLSALFSFVAGRLKKYAMEQAFKTFTQQMNTQNSPFGNAAFSPGSPFPFP 167 Query: 881 XXXXXXXXXXXXXXXXXXXXXXXXP-ATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEE 705 A+KVE P+ VKD E + PK+YAFVDVSPEE Sbjct: 168 PATSPALDPFRTSTPLASQPITVDVPASKVEDPPSISVKDEVEQETGPKKYAFVDVSPEE 227 Query: 704 MSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSV 525 QK+ FEN ES +T KD Q + VSQNG A Q G GS+ T + P+MSV Sbjct: 228 TLQKNAFENYKESIQTDSPKDPQSSQSVSQNGTAWNQGAG----GSEGPTTSKTAPLMSV 283 Query: 524 EALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRM 345 EALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DMLNNMGG+ EWDNRM Sbjct: 284 EALEKMMEDPTVQQMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGTPEWDNRM 343 Query: 344 MDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMS 165 MDSLKNFD+SSPEVKQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPMS Sbjct: 344 MDSLKNFDISSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMS 403 Query: 164 IAKYQNDKEVMDVFNKISELFPGVTG 87 IAKYQNDKEVMDVFNKISELFPGV G Sbjct: 404 IAKYQNDKEVMDVFNKISELFPGVAG 429 >ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum] Length = 443 Score = 475 bits (1222), Expect = e-131 Identities = 263/454 (57%), Positives = 306/454 (67%), Gaps = 21/454 (4%) Frame = -1 Query: 1379 MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRK-----RRYDPTRTRNLVSSLS 1215 MEN ++SSPK +LG S N + + F LPHL ++ R PT +VS Sbjct: 1 MENIGIVSSPKMVLGLSSNS---VISSKPFFGLPHLPKRPFKNGRTVRPTTCFEVVSCFQ 57 Query: 1214 QGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVG 1035 K L +G +FA SVGVNPQ S P P S +GSPLFWIGVG Sbjct: 58 GPRLTKKIV---LGKSGRGSFASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVG 114 Query: 1034 VGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXX 855 VGFSALF+WVA+ LKKYAMQQA KT+MGQM+ +NSQF+N AFSPG Sbjct: 115 VGFSALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVS 174 Query: 854 XXXXXXXXXXXXXXXP----------------ATKVESAPTTEVKDNTETKNEPKRYAFV 723 ATKVE PT VK++ E + EPK+ AFV Sbjct: 175 GPASSSPPPPTASSSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFV 234 Query: 722 DVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQP 543 D+SP+E QK FEN +S ET+ V++V+QNGAA++ G+ T S S+ TG+ Sbjct: 235 DISPDETFQKGAFENFKDSAETAAVT----VDQVTQNGAASQSGFGSNTSDSTSS-TGKS 289 Query: 542 GPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSN 363 P++SV+ALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DM+NNMGG+ Sbjct: 290 NPLLSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNP 349 Query: 362 EWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDC 183 EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDC Sbjct: 350 EWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDC 409 Query: 182 SQNPMSIAKYQNDKEVMDVFNKISELFPGVTGPP 81 SQNP+SIAKYQNDKEVMDVFNKISELFPGV+G P Sbjct: 410 SQNPLSIAKYQNDKEVMDVFNKISELFPGVSGAP 443 >ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum] Length = 443 Score = 468 bits (1205), Expect = e-129 Identities = 262/454 (57%), Positives = 303/454 (66%), Gaps = 21/454 (4%) Frame = -1 Query: 1379 MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRK-----RRYDPTRTRNLVSSLS 1215 MEN ++SSPK +LG S N + + LPHL ++ R PT +VS Sbjct: 1 MENICIVSSPKMVLGLSSNP---VISNKPLFGLPHLPKRPFKNGRIVRPTTCFEVVSCFQ 57 Query: 1214 QGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVG 1035 S + L +G +FA SVGVNPQ S P S +GSPLFWIGVG Sbjct: 58 ---SPRLTKKIVLGKSGRGSFASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVG 114 Query: 1034 VGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXX 855 VG SALF+WVA+ LKKYAMQQA KT+MGQM+ +NSQF+N AFSPG Sbjct: 115 VGLSALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVS 174 Query: 854 XXXXXXXXXXXXXXXP----------------ATKVESAPTTEVKDNTETKNEPKRYAFV 723 ATKVE PT VK++TE EPK+ AFV Sbjct: 175 GPASSSPPPPTASTSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFV 234 Query: 722 DVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQP 543 D+SP+E QK FEN +S ET+ V++V+QNGAA++ G T S S+ TG+ Sbjct: 235 DISPDETFQKGAFENFKDSTETASVT----VDQVTQNGAASQLGFGPNTSDSTSS-TGKS 289 Query: 542 GPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSN 363 P+MSV+ALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DM+NNMGG+ Sbjct: 290 NPLMSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNP 349 Query: 362 EWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDC 183 EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDC Sbjct: 350 EWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDC 409 Query: 182 SQNPMSIAKYQNDKEVMDVFNKISELFPGVTGPP 81 SQNP+SIAKYQNDKEVMDVFNKISELFPGV+G P Sbjct: 410 SQNPLSIAKYQNDKEVMDVFNKISELFPGVSGSP 443 >ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 432 Score = 457 bits (1175), Expect = e-126 Identities = 265/443 (59%), Positives = 299/443 (67%), Gaps = 13/443 (2%) Frame = -1 Query: 1373 NFSLISSPK-FLLGSSPNLGGLIPIRRSFSALPHLSRKR-RYDPTRTRNLVSSLSQGTSD 1200 N +L+SSPK +LG P + R H S R P R R VS+LS + Sbjct: 5 NLALVSSPKPLMLGHVPAIDAT---SRDVFRRKHFSFGRVLIAPHRCRFRVSALSSSHRN 61 Query: 1199 KAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGFSA 1020 +L V+ FA S GVNPQ+S PSS IGSPLFWIGVGVG SA Sbjct: 62 PKSVQEKLI---VKHFASISSSNTQEATSTGVNPQLS---PSSTIGSPLFWIGVGVGLSA 115 Query: 1019 LFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXX 840 LFS VA+ LKKYAMQQAFKT+MGQM+ +N+QF NAAFSPG Sbjct: 116 LFSVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSA 175 Query: 839 XXXXXXXXXXPAT-----------KVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQK 693 A+ KVE APTT VKD E KNEPK+ AFVDVSPEE Q+ Sbjct: 176 TTQSRAPSASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQE 235 Query: 692 DPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALE 513 PFE+ + E+S K+ + +EVSQNGA + Q G + GSQST+ V+SV+ALE Sbjct: 236 SPFESFKDD-ESSSVKEARVPDEVSQNGAPSNQGFGDFP-GSQSTKKS----VLSVDALE 289 Query: 512 KMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSL 333 KMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQLE+MLNNMGGS EWD+RMMD+L Sbjct: 290 KMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTL 349 Query: 332 KNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKY 153 KNFDL+SPEVKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM+I KY Sbjct: 350 KNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKY 409 Query: 152 QNDKEVMDVFNKISELFPGVTGP 84 QNDKEVMDVFNKISELFPGV P Sbjct: 410 QNDKEVMDVFNKISELFPGVGSP 432 >ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 429 Score = 456 bits (1174), Expect = e-126 Identities = 261/441 (59%), Positives = 297/441 (67%), Gaps = 11/441 (2%) Frame = -1 Query: 1373 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSDKA 1194 N +L+SSPK P + G +P R F + P R R VS+LS + Sbjct: 5 NLALVSSPK------PLMLGHVPARDVFRRKHFSFGRVLIAPHRCRFRVSALSSSHHNPK 58 Query: 1193 HPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGFSALF 1014 +L V+ FA S+GV PQ+S P PSS IGSPLFWIGVGVG SALF Sbjct: 59 SVQEKLI---VKHFASISSSNTQETTSIGVKPQLS-PSPSSTIGSPLFWIGVGVGLSALF 114 Query: 1013 SWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXX 834 S VA+ LKKYAMQQAFKT+MGQM+ +N+QF NAAFSPG Sbjct: 115 SVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSATT 174 Query: 833 XXXXXXXXPAT-----------KVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQKDP 687 A+ KVE+APTT VKD E KNEPK+ AFVDVSPEE ++ P Sbjct: 175 QSRAPSASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVRESP 234 Query: 686 FENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKM 507 FE+ + E+S K+ +EVSQNGA + G + GSQST+ +SV+ALEKM Sbjct: 235 FESFKDD-ESSSVKEAWVPDEVSQNGAPSNLGFGDFP-GSQSTKKS----ALSVDALEKM 288 Query: 506 MEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKN 327 MEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQLE+MLNNMGGS EWDNRMMD+LKN Sbjct: 289 MEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRMMDTLKN 348 Query: 326 FDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQN 147 FDL+SPEVKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM+I KYQN Sbjct: 349 FDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQN 408 Query: 146 DKEVMDVFNKISELFPGVTGP 84 DKEVMDVFNKISELFPGV P Sbjct: 409 DKEVMDVFNKISELFPGVGSP 429 >ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum] Length = 433 Score = 451 bits (1160), Expect = e-124 Identities = 257/445 (57%), Positives = 300/445 (67%), Gaps = 14/445 (3%) Frame = -1 Query: 1373 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSDKA 1194 N +L+SSPK LL + + R+ F+ + N SS + K+ Sbjct: 5 NLALVSSPKPLLLGHSSSRNVFTRRKPFTFGKFFV---------SANSSSSHVTRAAPKS 55 Query: 1193 HPSRRLSING---VEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGFS 1023 H + + S+ G V FA SVGV+PQ+S PPPSS +GSPLFWIGVGVGFS Sbjct: 56 HQNPK-SVQGKLIVHNFASISSSNSQETTSVGVSPQLS-PPPSSTVGSPLFWIGVGVGFS 113 Query: 1022 ALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXX 843 ALFS VA+ LKKYAMQQAFKT+MGQM+ +N+ F++AAFSPG Sbjct: 114 ALFSIVASRLKKYAMQQAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGPAAPASS 173 Query: 842 XXXXXXXXXXXPA-----------TKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQ 696 A TKVE+AP+T KD E KNEPK+ FVDVSPEE Q Sbjct: 174 AGTQSQSTSARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQ 233 Query: 695 KDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEAL 516 K PFE+ + E+S K+ + E QNGA + Q G + GSQS V+SVEAL Sbjct: 234 KSPFESFKDVDESSSFKEARAPAEAFQNGAPSNQGFGN-SPGSQSGGKS----VLSVEAL 288 Query: 515 EKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDS 336 EKMMEDPTVQ MVYPYLPEEMRNP+TFKWMLQNPQYRQQLE+MLNNMGGS EWD+RMMD+ Sbjct: 289 EKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDT 348 Query: 335 LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAK 156 LKNFDL+SP+VKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCS NP++IAK Sbjct: 349 LKNFDLNSPDVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSSNPLNIAK 408 Query: 155 YQNDKEVMDVFNKISELFPGVTGPP 81 YQNDKEVMDVFNKISELFPGV+GPP Sbjct: 409 YQNDKEVMDVFNKISELFPGVSGPP 433 >ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris] gi|561020160|gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris] Length = 430 Score = 444 bits (1142), Expect = e-122 Identities = 254/443 (57%), Positives = 298/443 (67%), Gaps = 13/443 (2%) Frame = -1 Query: 1373 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYD-------PTRTRNLVSSLS 1215 N +L+SS K P + G +P R + + R++ + P R R VS+LS Sbjct: 5 NLALVSSSK------PLMLGHVPARDATDR--DVLRRKPFSLGRVLIAPHRFRYRVSALS 56 Query: 1214 QGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVG 1035 +L V+ FA S+GVNPQ+S PPPSS IGSPLFWIGVG Sbjct: 57 SSHHSPKSVQDKLI---VKHFASISSSNTQETTSIGVNPQLS-PPPSSTIGSPLFWIGVG 112 Query: 1034 VGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPG------XXXXXXXXX 873 VG SALFS VA+ LKKYAMQQAFKT+MGQM+ N+ F NAAFSPG Sbjct: 113 VGLSALFSMVASRLKKYAMQQAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTA 172 Query: 872 XXXXXXXXXXXXXXXXXXXXXPATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQK 693 PATKVE+ TT++KD E +N+PK+ AFVDVSPEE QK Sbjct: 173 TAQYGAPSTSSGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQK 232 Query: 692 DPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALE 513 PFE+ ++ +S ++ + +EVSQNGA Q G + GSQST+ +SV+ALE Sbjct: 233 SPFESVKDNESSSVKEEARVPDEVSQNGAPFNQGFGGF-PGSQSTKKS----ALSVDALE 287 Query: 512 KMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSL 333 KMMEDPTVQ MVYP+LPEEMRNP TFKWMLQNPQYRQQLE ML+NMGGS EWDNRMMD+L Sbjct: 288 KMMEDPTVQKMVYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTL 347 Query: 332 KNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKY 153 KNFDL+SPEVKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM+I KY Sbjct: 348 KNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKY 407 Query: 152 QNDKEVMDVFNKISELFPGVTGP 84 QNDKEVM+VFNKISELFPG+ P Sbjct: 408 QNDKEVMNVFNKISELFPGMGSP 430 >ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa] gi|222848840|gb|EEE86387.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa] Length = 429 Score = 443 bits (1139), Expect = e-121 Identities = 256/433 (59%), Positives = 292/433 (67%), Gaps = 9/433 (2%) Frame = -1 Query: 1358 SSPKFLLGSSPNLGGLIPIRRSFSAL-PHLSRKRRYDPTRTRNLVSSLSQGTSDKAHPSR 1182 SSPK ++G +L + S S P L R T + S+S A+ Sbjct: 12 SSPKLVMGYPTSLKNPTTPKFSISTTRPSLPFSLRISKTAPHASIFSISA----LANSHG 67 Query: 1181 RLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWVA 1002 +L G E FA SVGVNPQ V PP S IGSPLFW+GVGVG SA+FSWVA Sbjct: 68 KL---GSEYFASISSSSGKQTASVGVNPQ-PVSPPPSQIGSPLFWVGVGVGLSAIFSWVA 123 Query: 1001 TNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXXX 822 T +K YAMQQAFK+L QM+ +N+QFN AFS Sbjct: 124 TRVKNYAMQQAFKSLTEQMNTQNNQFN-PAFSARPPFPFSPPPASHPSTSPSPAASQPAI 182 Query: 821 XXXXPATKVESAPTTEVKDNTET--------KNEPKRYAFVDVSPEEMSQKDPFENSTES 666 PATKVE+APTT+V ET K E K+YAFVD+SPEE S PF + + Sbjct: 183 TVDIPATKVEAAPTTDVGKEKETDFLEERKIKEETKKYAFVDISPEETSLNTPFSSVEDD 242 Query: 665 FETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQ 486 ETS SKDV+F ++V QNGAA KQ GA EGSQSTR P +SVEALEKMMEDPT+Q Sbjct: 243 NETSSSKDVEFAKKVFQNGAAFKQGPGA-AEGSQSTR-----PFLSVEALEKMMEDPTMQ 296 Query: 485 NMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPE 306 MVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGS +WD++MMDSLK+FDL+S E Sbjct: 297 KMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSGKWDSQMMDSLKDFDLNSAE 356 Query: 305 VKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDV 126 VKQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQ AIM+CSQNP++I KYQNDKEVMDV Sbjct: 357 VKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQQAIMECSQNPINITKYQNDKEVMDV 416 Query: 125 FNKISELFPGVTG 87 FNKISELFPG+TG Sbjct: 417 FNKISELFPGMTG 429 >ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa] gi|550319201|gb|ERP50369.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa] Length = 435 Score = 440 bits (1131), Expect = e-121 Identities = 243/402 (60%), Positives = 281/402 (69%), Gaps = 8/402 (1%) Frame = -1 Query: 1286 ALPHLSRKRRYDPTRTRNLVSSLSQGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXSVG 1107 +LP R + +R +S+LSQ +H RR S NG E FA SVG Sbjct: 40 SLPFPHRTSKTVTHTSRISISALSQ-----SHGPRRTSKNGSEYFASISSLSGQQTASVG 94 Query: 1106 VNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQ 927 VNPQ SV PP S IGSPLFW+GVGV SA+FSWVAT LK YAMQQAFK+L QM+ +N+Q Sbjct: 95 VNPQ-SVSPPPSQIGSPLFWVGVGVALSAIFSWVATRLKNYAMQQAFKSLTEQMNAQNNQ 153 Query: 926 FNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPATKVESAPTTEVKDNTET-- 753 FN AFS PATKVE+AP T+ + ET Sbjct: 154 FN-PAFSARSPFPFSPPPASQPATSPFQTASQPAVTVDIPATKVEAAPETDARKEKETDT 212 Query: 752 ------KNEPKRYAFVDVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQD 591 K EP+++AFVDVSPEE S PF + + +TS SKDVQF +E SQNGA KQ Sbjct: 213 LEEREIKEEPRKFAFVDVSPEETSLNTPFSSVEDVIDTSSSKDVQFAKEASQNGATFKQG 272 Query: 590 KGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQ 411 A +E S+ +++ Q +SVEALEKMM+DPTVQ MVYPYLPEEMRNPTTFKWMLQNPQ Sbjct: 273 PSA-SEPSEGSQSSQKAGSLSVEALEKMMDDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQ 331 Query: 410 YRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEV 231 YRQQLE+MLNNM GS+EWD+RM+DSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANP+V Sbjct: 332 YRQQLEEMLNNMSGSSEWDSRMVDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDV 391 Query: 230 AMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVFNKISEL 105 A+AFQNPRVQ AIM+CSQNP+SIAKYQNDKEVMDVFNKISE+ Sbjct: 392 ALAFQNPRVQQAIMECSQNPLSIAKYQNDKEVMDVFNKISEI 433 >ref|XP_006858564.1| hypothetical protein AMTR_s00071p00175860 [Amborella trichopoda] gi|548862673|gb|ERN20031.1| hypothetical protein AMTR_s00071p00175860 [Amborella trichopoda] Length = 416 Score = 438 bits (1127), Expect = e-120 Identities = 245/419 (58%), Positives = 280/419 (66%), Gaps = 6/419 (1%) Frame = -1 Query: 1355 SPKFLLGSSPNLGGLI--PIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSDKAHPSR 1182 SPKF LG S + P S+L L KRR RTR +V +L G P + Sbjct: 6 SPKFFLGFSSTSRRVSDNPFLIQRSSLLALCGKRRVTGCRTRVIVGALGHGNGGSRKPYK 65 Query: 1181 RLSINGVEAFAXXXXXXXXXXXS-VGVNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWV 1005 +++FA + +GVNP + PPP S++GSPLFWIGVGVG SALFSWV Sbjct: 66 FK----MDSFASISSSSTREEATSIGVNPPFTAPPPPSYVGSPLFWIGVGVGISALFSWV 121 Query: 1004 ATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXX 825 ATNLKKYAMQQAFKT+MGQMS NSQF+ A F PG Sbjct: 122 ATNLKKYAMQQAFKTMMGQMSSNNSQFSGAGFPPG-PPFPFPPTSPSGTPAAPPTPFASK 180 Query: 824 XXXXXPATKVESAP---TTEVKDNTETKNEPKRYAFVDVSPEEMSQKDPFENSTESFETS 654 T + AP T EVK++T+TK + K + FVD+SPEE+ Q P E ES + S Sbjct: 181 SAVTVDVTASDVAPASSTVEVKEDTKTKKQTKTFEFVDISPEEVMQNRPSEQPKESTDGS 240 Query: 653 PSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQNMVY 474 P+KDV F EVSQNGA + +K TE QS+R V+SVEALEKMMEDPTVQ MVY Sbjct: 241 PAKDVHFA-EVSQNGALPQTEKSVSTENVQSSRPAD--SVLSVEALEKMMEDPTVQKMVY 297 Query: 473 PYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPEVKQQ 294 PYLPEEMRNP TFKWMLQNPQYRQQLEDMLNNMGGS++WDNRMMDSLKNFDLS PEVKQQ Sbjct: 298 PYLPEEMRNPATFKWMLQNPQYRQQLEDMLNNMGGSSDWDNRMMDSLKNFDLSKPEVKQQ 357 Query: 293 FDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVFNK 117 FDQIGLTPEEVISKIMANP+VAMAFQNP+VQAAIMDCSQNP+SI KYQNDKEV + K Sbjct: 358 FDQIGLTPEEVISKIMANPDVAMAFQNPKVQAAIMDCSQNPLSITKYQNDKEVRLLLRK 416 >ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial [Theobroma cacao] gi|508712014|gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial [Theobroma cacao] Length = 412 Score = 435 bits (1118), Expect = e-119 Identities = 258/430 (60%), Positives = 285/430 (66%), Gaps = 10/430 (2%) Frame = -1 Query: 1373 NFSLISSPK-----FLLGSS-PNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 1212 N +L+SS +LLG + PN P F LP S +R S SQ Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNP----FKTLPFPSSNLAPRRSRISIFAHSHSQ 60 Query: 1211 GTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGV 1032 T + P L G E FA SVGVNP +VPPPSS IGSPLFWIGVGV Sbjct: 61 PTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGV 120 Query: 1031 GFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXX 852 G SALF+WVA++LKKYAMQQAFKT+MGQM+ +N+QF+NAAF G Sbjct: 121 GLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTS 180 Query: 851 XXXXXXXXXXXXXXPATKVESAPTT----EVKDNTETKNEPKRYAFVDVSPEEMSQKDPF 684 ATKVE+AP T EVK TET EPK+YAFVDVSPEE QK F Sbjct: 181 PSPSSQTAVTVDVP-ATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAF 238 Query: 683 ENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMM 504 E++ S S + QF K D GA+ GSQST G P +SV+ALEKMM Sbjct: 239 EDAAG---ISSSNNTQF----------PKDDAGAFG-GSQST--GSADPALSVDALEKMM 282 Query: 503 EDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNF 324 EDPTVQ MVYPYLPEEMRNP TFKWMLQNPQYRQQL+DMLNNMGGS EWDNRMMDSLKNF Sbjct: 283 EDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNF 342 Query: 323 DLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQND 144 DL+SP+VKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNP+SIAKYQND Sbjct: 343 DLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQND 402 Query: 143 KEVMDVFNKI 114 KEVMDVFNKI Sbjct: 403 KEVMDVFNKI 412 >ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus] Length = 419 Score = 434 bits (1117), Expect = e-119 Identities = 247/434 (56%), Positives = 293/434 (67%), Gaps = 3/434 (0%) Frame = -1 Query: 1373 NFSLISSPKFLLGSSPNLGGLIPIRRS--FSALPHLSRKRRYDPTRTRNLVSSLSQGTSD 1200 N +L S PKFL + + + R+ A LS R P R L S+L++ Sbjct: 5 NLALTSPPKFLFLTYSSTSSISTPSRTNQLCATRRLSSSRIKSPPRI--LASALNR---- 58 Query: 1199 KAHPSRRLSINGVEAFAXXXXXXXXXXXS-VGVNPQISVPPPSSHIGSPLFWIGVGVGFS 1023 P+ R+ + E FA S VGV P +S+PPPSS++GSPLFW+GVGVG S Sbjct: 59 --RPNHRILV--AERFATVSSSTTSNDSSSVGV-PSVSIPPPSSYVGSPLFWVGVGVGLS 113 Query: 1022 ALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXX 843 ALF+WVA+ LKKYAMQQAFKT+M QM+ +NS +N S G Sbjct: 114 ALFTWVASYLKKYAMQQAFKTMMSQMNSQNSPMSNPTLSSGSPFPIPPTFATGTTISPSV 173 Query: 842 XXXXXXXXXXXPATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQKDPFENSTESF 663 ATKVE P T VK TE E K++AFVDVSPEE QK PF+ ++ Sbjct: 174 SEPAVSIDVT--ATKVEEEPVTNVKSRTENM-EAKKFAFVDVSPEETDQKSPFKE--DAT 228 Query: 662 ETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQN 483 + SK Q +E+ QNGAA+KQ ++GSQ +R +PG V+SVEA+EKMMEDPTVQ Sbjct: 229 DADVSKSAQPTQELPQNGAASKQAYNG-SDGSQFSR--KPGSVLSVEAVEKMMEDPTVQK 285 Query: 482 MVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPEV 303 M+YP+LPEEMRNP TFKWM+QNP YRQQLE+MLNNM GS +WD R+MDSLKNFDLSSPEV Sbjct: 286 MIYPHLPEEMRNPETFKWMMQNPLYRQQLEEMLNNMSGSPQWDGRLMDSLKNFDLSSPEV 345 Query: 302 KQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVF 123 KQQFDQIGLTPEEVISKIMANPE+AMAFQNPRVQAAIMDCSQNP+SI KYQNDKEVMDVF Sbjct: 346 KQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQAAIMDCSQNPLSITKYQNDKEVMDVF 405 Query: 122 NKISELFPGVTGPP 81 NKISELFPGV+G P Sbjct: 406 NKISELFPGVSGAP 419 >emb|CAB50925.1| translocon Tic40 [Pisum sativum] Length = 436 Score = 429 bits (1103), Expect = e-117 Identities = 247/449 (55%), Positives = 297/449 (66%), Gaps = 18/449 (4%) Frame = -1 Query: 1373 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTR-TRNLVSSLSQGTSDK 1197 N +L+SSPK LL + + R+SF+ + R + N SS + K Sbjct: 5 NLALVSSPKPLLLGHSSSKNVFSRRKSFT----------FGTFRVSANSSSSHVTRAASK 54 Query: 1196 AHPSRRLSING---VEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGF 1026 +H + + S+ G +FA SVGV+PQ+S PPPS+ +GSPLFWIG+GVGF Sbjct: 55 SHQNLK-SVQGKVNAHSFASISSSNGQETTSVGVSPQLSPPPPST-VGSPLFWIGIGVGF 112 Query: 1025 SALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXX 846 SALFS VA+ +KKYAMQQAFK++MGQM+ +N+ F++ AFS G Sbjct: 113 SALFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAG 172 Query: 845 XXXXXXXXXXXXPA-----------TKVESA---PTTEVKDNTETKNEPKRYAFVDVSPE 708 A TKVE+A P VK+ E KNEPK+ AFVDVSPE Sbjct: 173 FAGNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPE 232 Query: 707 EMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMS 528 E QK+ FE + E+S K+ + E SQNG KQ G + GS S R +S Sbjct: 233 ETVQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGD-SPGSPSERKS----ALS 287 Query: 527 VEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNR 348 V+ALEKMMEDPTVQ MVYPYLPEEMRNP+TFKWM+QNP+YRQQLE MLNNMGG EWD+R Sbjct: 288 VDALEKMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSR 347 Query: 347 MMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM 168 MMD+LKNFDL+SP+VKQQFDQIGL+P+EVISKIMANP+VAMAFQNPRVQAAIMDCSQNPM Sbjct: 348 MMDTLKNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPM 407 Query: 167 SIAKYQNDKEVMDVFNKISELFPGVTGPP 81 SI KYQNDKEVMDVFNKISELFPGV+GPP Sbjct: 408 SIVKYQNDKEVMDVFNKISELFPGVSGPP 436 >ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 448 Score = 426 bits (1094), Expect = e-116 Identities = 235/388 (60%), Positives = 261/388 (67%), Gaps = 28/388 (7%) Frame = -1 Query: 1160 EAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWVATNLKKYA 981 E FA SVG+NPQ S PPP S IGSPLFWIGVGV FSA+FSW A L+KY Sbjct: 66 ERFASISSTNSQETSSVGINPQFSAPPPPSTIGSPLFWIGVGVAFSAVFSWAAGKLQKYV 125 Query: 980 MQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPAT 801 +QQAFK +MGQM+ +N QF+NAAFSPG AT Sbjct: 126 VQQAFKNVMGQMNTQNDQFSNAAFSPGSPFPFPSAPASPSASPFSAPSQPSFTDVS--AT 183 Query: 800 KVES--------APTTEVKDNTETKNEPK--------------------RYAFVDVSPEE 705 +V+S P +VK + E + AFVDV+PEE Sbjct: 184 EVDSPASSATPSTPAADVKSEEQQMKENRFGNSFEIERNNVIQFSRQLSDRAFVDVNPEE 243 Query: 704 MSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSV 525 K PF +S E SK++ E SQNGAA KQ K A + GSQ+T G+ V+SV Sbjct: 244 TELKSPFASSLNDTEPGSSKEINSNVEGSQNGAAFKQAKDA-SMGSQTT--GKENSVLSV 300 Query: 524 EALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRM 345 EALEKM+EDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDML NM GSNEWDNRM Sbjct: 301 EALEKMLEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLRNMTGSNEWDNRM 360 Query: 344 MDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMS 165 MDSLKNFDLSSPEVK+QFDQIGLTPE+VISKIMANP+VAMAFQNPRVQAAIMDCSQNPMS Sbjct: 361 MDSLKNFDLSSPEVKEQFDQIGLTPEQVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMS 420 Query: 164 IAKYQNDKEVMDVFNKISELFPGVTGPP 81 I KYQNDKEVMDVFNKISELFPGV+G P Sbjct: 421 ITKYQNDKEVMDVFNKISELFPGVSGSP 448 >sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=PsTIC40; Flags: Precursor gi|26000725|gb|AAN75219.1| chloroplast protein translocon component Tic40 precursor [Pisum sativum] Length = 436 Score = 426 bits (1094), Expect = e-116 Identities = 246/448 (54%), Positives = 294/448 (65%), Gaps = 17/448 (3%) Frame = -1 Query: 1373 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTR-TRNLVSSLSQGTSDK 1197 N +L+SSPK LL + + R+SF+ + R + N SS + K Sbjct: 5 NLALVSSPKPLLLGHSSSKNVFSGRKSFT----------FGTFRVSANSSSSHVTRAASK 54 Query: 1196 AHPSRRLSINGVEA--FAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLFWIGVGVGFS 1023 +H + + V A FA SVGV+PQ+S PPPS+ +GSPLFWIG+GVGFS Sbjct: 55 SHQNLKSVQGKVNAHDFASISSSNGQETTSVGVSPQLSPPPPST-VGSPLFWIGIGVGFS 113 Query: 1022 ALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXX 843 ALFS VA+ +KKYAMQQAFK++MGQM+ +N+ F++ AFS G Sbjct: 114 ALFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGF 173 Query: 842 XXXXXXXXXXXPA-----------TKVESA---PTTEVKDNTETKNEPKRYAFVDVSPEE 705 A TKVE+A P VK+ E KNEPK+ AFVDVSPEE Sbjct: 174 AGNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEE 233 Query: 704 MSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSV 525 QK+ FE + E+S K+ + E SQNG KQ G + S S R +SV Sbjct: 234 TVQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGD-SPSSPSERKS----ALSV 288 Query: 524 EALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRM 345 +ALEKMMEDPTVQ MVYPYLPEEMRNP+TFKWM+QNP+YRQQLE MLNNMGG EWD+RM Sbjct: 289 DALEKMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRM 348 Query: 344 MDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMS 165 MD+LKNFDL+SP+VKQQFDQIGL+P+EVISKIMANP+VAMAFQNPRVQAAIMDCSQNPMS Sbjct: 349 MDTLKNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMS 408 Query: 164 IAKYQNDKEVMDVFNKISELFPGVTGPP 81 I KYQNDKEVMDVFNKISELFPGV+GPP Sbjct: 409 IVKYQNDKEVMDVFNKISELFPGVSGPP 436 >ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Protein PIGMENT DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=AtTIC40; Flags: Precursor gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6 [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|20260222|gb|AAM13009.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1| At5g16620 [Arabidopsis thaliana] gi|332004935|gb|AED92318.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 447 Score = 411 bits (1057), Expect = e-112 Identities = 238/457 (52%), Positives = 288/457 (63%), Gaps = 26/457 (5%) Frame = -1 Query: 1379 MENFSLIS----SPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 1212 MEN +L+S SPK L+G N + FS +R + + +S+ +Q Sbjct: 1 MENLTLVSCSASSPKLLIGC--NFTSSLKNPTGFS-------RRTPNIVLRCSKISASAQ 51 Query: 1211 GTSDKAHPSRRLSINGVE----AFAXXXXXXXXXXXSVGVNPQISVPPPSSH-IGSPLFW 1047 S + P I V+ AFA + +P + VPPPSS IGSPLFW Sbjct: 52 SQSPSSRPENTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111 Query: 1046 IGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXX 867 IGVGVG SALFS+V +NLKKYAMQ A KT+M QM+ +NSQFNN+ F G Sbjct: 112 IGVGVGLSALFSYVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPFPPQT 171 Query: 866 XXXXXXXXXXXXXXXXXXXP-ATKVESAPTTEVK----------------DNTETKNEPK 738 ATKVE+ P+T+ K + ++ K E K Sbjct: 172 SPASSPFQSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231 Query: 737 RYAFVDVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQST 558 YAF D+SPEE +++ PF N E ET+ K+ + E+V QNGA A +E QS Sbjct: 232 NYAFEDISPEETTKESPFSNYAEVSETNSPKETRLFEDVLQNGAGPANGATA-SEVFQSL 290 Query: 557 RTGQPGPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNN 378 G+ GP +SVEALEKMMEDPTVQ MVYPYLPEEMRNP TFKWML+NPQYRQQL+DMLNN Sbjct: 291 GGGKGGPGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNN 350 Query: 377 MGGSNEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQA 198 M GS EWD RM D+LKNFDL+SPEVKQQF+QIGLTPEEVISKIM NP+VAMAFQNPRVQA Sbjct: 351 MSGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQA 410 Query: 197 AIMDCSQNPMSIAKYQNDKEVMDVFNKISELFPGVTG 87 A+M+CS+NPM+I KYQNDKEVMDVFNKIS+LFPG+TG Sbjct: 411 ALMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447 >ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata] Length = 447 Score = 405 bits (1040), Expect = e-110 Identities = 237/457 (51%), Positives = 283/457 (61%), Gaps = 26/457 (5%) Frame = -1 Query: 1379 MENFSLIS----SPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 1212 MEN +L+S SPK L+G + S SR+ R +S+ +Q Sbjct: 1 MENLTLVSCSASSPKLLIGCN--------FTSSLKNPTGFSRRTPRIVLRCSK-ISASAQ 51 Query: 1211 GTSDKAHPSRRLSINGVE----AFAXXXXXXXXXXXSVGVNPQISVPPPSSH-IGSPLFW 1047 S + P I V+ AFA + +P + VPPPSS IGSPLFW Sbjct: 52 SQSPSSRPDNTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111 Query: 1046 IGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXX 867 IGVGVG SALFS V +NLKKYAMQ A KT+M QM+ +NSQFNN F G Sbjct: 112 IGVGVGLSALFSLVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPFPPQT 171 Query: 866 XXXXXXXXXXXXXXXXXXXP-ATKVESAPTTEVK----------------DNTETKNEPK 738 ATKV++ P+T+ K + ++ K E K Sbjct: 172 SPASSPFQSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231 Query: 737 RYAFVDVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQST 558 YAF D+SPEE +++ PF N E ETS K+ + E+V QNGA A +E QS Sbjct: 232 NYAFEDISPEETTKESPFSNYAEVSETSSPKETRLFEDVLQNGAGPANGATA-SEVFQSL 290 Query: 557 RTGQPGPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNN 378 G+ G +SVEALEKMMEDPTVQ MVYPYLPEEMRNP TFKWML+NPQYRQQL+DMLNN Sbjct: 291 GGGKGGAGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNN 350 Query: 377 MGGSNEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQA 198 M GS EWD RM D+LKNFDL+SPEVKQQF+QIGLTPEEVISKIM NP+VAMAFQNPRVQA Sbjct: 351 MSGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQA 410 Query: 197 AIMDCSQNPMSIAKYQNDKEVMDVFNKISELFPGVTG 87 A+M+CS+NPM+I KYQNDKEVMDVFNKIS+LFPG+TG Sbjct: 411 ALMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447 >ref|XP_007032982.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508712011|gb|EOY03908.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 531 Score = 400 bits (1029), Expect = e-109 Identities = 263/537 (48%), Positives = 294/537 (54%), Gaps = 106/537 (19%) Frame = -1 Query: 1373 NFSLISSPK-----FLLGSS-PNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 1212 N +L+SS +LLG + PN P F LP S +R S SQ Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNP----FKTLPFPSSNLAPRRSRISIFAHSHSQ 60 Query: 1211 GTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXSVGVNPQISVPPPSSHIGSPLF------ 1050 T + P L G E FA SVGVNP +VPPPSS IGSPLF Sbjct: 61 PTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGV 120 Query: 1049 -------WIGVGVGFSALFS---------------------------------------- 1011 W+ V S F Sbjct: 121 GLSALFTWVSTKVASSLKFGSSIGYCNEAMKTDYGKMNSCMGPHRVGGLDFGVDDKVVVN 180 Query: 1010 ------WVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXX 849 + + +KYAMQQAFKT+MGQM+ +N+QF+NAAF G Sbjct: 181 GIMDPGYKIASQQKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTSP 240 Query: 848 XXXXXXXXXXXXXPATKVESAPTT----EVKDNTETKNEPKRYAFVDVSPEEMSQKDPFE 681 ATKVE+AP T EVK TET EPK+YAFVDVSPEE QK FE Sbjct: 241 SPSSQTAVTVDVP-ATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAFE 298 Query: 680 NSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQST---------RTGQPGPVMS 528 ++ S S + QF ++VS NGAA+KQD GA+ GSQST G P +S Sbjct: 299 DAAG---ISSSNNTQFPKDVSDNGAASKQDAGAFG-GSQSTVKLNKHPIALAGSADPALS 354 Query: 527 VEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNR 348 V+ALEKMMEDPTVQ MVYPYLPEEMRNP TFKWMLQNPQYRQQL+DMLNNMGGS EWDNR Sbjct: 355 VDALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNR 414 Query: 347 MMDSLKNFDLSSPEVKQQF----------------------------DQIGLTPEEVISK 252 MMDSLKNFDL+SP+VKQQF DQIGLTPEEVISK Sbjct: 415 MMDSLKNFDLNSPDVKQQFVSRWSVSVVLECSLVPEEGSYISLSPFADQIGLTPEEVISK 474 Query: 251 IMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVFNKISELFPGVTGPP 81 IMANPEVAMAFQNPRVQAAIMDCSQNP+SIAKYQNDKEVMDVFNKISELFPGVTG P Sbjct: 475 IMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGVTGSP 531