BLASTX nr result
ID: Akebia25_contig00006000
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00006000
         (1479 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   501   e-139
ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family prot...   477   e-132
gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus...   477   e-132
ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   475   e-131
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   468   e-129
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   457   e-126
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   456   e-126
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   451   e-124
ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phas...   444   e-122
ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu...   443   e-121
ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu...   440   e-121
ref|XP_006858564.1| hypothetical protein AMTR_s00071p00175860 [A...   438   e-120
ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family prot...   435   e-119
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   434   e-119
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      429   e-117
ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-lik...   426   e-116
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   426   e-116
ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092...   411   e-112
ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab...   405   e-110
ref|XP_007032982.1| Hydroxyproline-rich glycoprotein family prot...   400   e-109

>ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera]
 gi|296089465|emb|CBI39284.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score = 501 bits (1290), Expect = e-139
 Identities = 274/447 (61%), Positives = 306/447 (68%), Gaps = 14/447 (3%)
 Frame = +2

Query: 101  MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSD 280
            M++ +L+SSPK +LG SP+    I    S  +LP L RK        R  +++   G S
Sbjct: 1    MDSLTLVSSPKLVLGHSPSNPRHISCAHSSFSLPLLFRK-------PRKFIAASQSGASP 53

Query: 281  KAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGFSA 460
            +       +  G E FA            VGVNPQ S PPPSS+IGSPLFWIGVGVG SA
Sbjct: 54   RTPRHVVETKLGTECFASISSSSQGTSS-VGVNPQFSPPPPSSNIGSPLFWIGVGVGLSA 112

Query: 461  LFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPG--------------XXXXXX 598
            LFSWVA+NLKKYAMQQAFKTLMGQM  +N+QFN   FSPG
Sbjct: 113  LFSWVASNLKKYAMQQAFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTSHSGP 172

Query: 599  XXXXXXXXXXXXXXXXXXXXXXXXXATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEM 778
                                     ATKVE+ P T+VKD+ E KNE  +YAFVDVSPEE
Sbjct: 173  TTSPSGPTTSPSTVAAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVSPEET 232

Query: 779  SQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVE 958
             Q+ PFEN  ES ETS SKD QF   VSQNG   +   G  +E SQSTR     P +SV+
Sbjct: 233  LQESPFENFEESTETSSSKDAQFSAGVSQNGTPPRPGMGV-SEDSQSTRNA--NPFLSVD 289

Query: 959  ALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMM 1138
            ALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DMLNNMGG  EWDNRMM
Sbjct: 290  ALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRMM 349

Query: 1139 DSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSI 1318
            D+LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANP+VA+AFQNPR+QAAIMDCSQNP+SI
Sbjct: 350  DNLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDCSQNPLSI 409

Query: 1319 AKYQNDKEVMDVFNKISELFPGVTGPP 1399
            AKYQNDKEVMDVFNKISELFPGV+GPP
Sbjct: 410  AKYQNDKEVMDVFNKISELFPGVSGPP 436

>ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] 
gi|508712012|gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 433 Score = 477 bits (1228), Expect = e-132 Identities = 274/441 (62%), Positives = 304/441 (68%), Gaps = 10/441 (2%) Frame = +2 Query: 107 NFSLISSPK-----FLLGSS-PNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 268 N +L+SS +LLG + PN P F LP S +R S SQ Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNP----FKTLPFPSSNLAPRRSRISIFAHSHSQ 60 Query: 269 GTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGV 448 T + P L G E FA VGVNP +VPPPSS IGSPLFWIGVGV Sbjct: 61 PTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGV 120 Query: 449 GFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXX 628 G SALF+WVA++LKKYAMQQAFKT+MGQM+ +N+QF+NAAF G Sbjct: 121 GLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTS 180 Query: 629 XXXXXXXXXXXXXXXATKVESAPTT----EVKDNTETKNEPKRYAFVDVSPEEMSQKDPF 796 ATKVE+AP T EVK TET EPK+YAFVDVSPEE QK F Sbjct: 181 PSPSSQTAVTVDVP-ATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAF 238 Query: 797 ENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMM 976 E++ S S + QF ++VS NGAA+KQD GA+ GSQST G P +SV+ALEKMM Sbjct: 239 EDAAG---ISSSNNTQFPKDVSDNGAASKQDAGAFG-GSQST--GSADPALSVDALEKMM 292 Query: 977 EDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNF 1156 EDPTVQ MVYPYLPEEMRNP TFKWMLQNPQYRQQL+DMLNNMGGS EWDNRMMDSLKNF Sbjct: 293 EDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNF 352 Query: 1157 DLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQND 1336 DL+SP+VKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNP+SIAKYQND Sbjct: 353 DLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQND 412 Query: 1337 KEVMDVFNKISELFPGVTGPP 1399 KEVMDVFNKISELFPGVTG P Sbjct: 413 KEVMDVFNKISELFPGVTGSP 433 >gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus guttatus] Length = 430 Score = 477 bits (1227), Expect = e-132 Identities = 270/446 (60%), Positives = 303/446 (67%), Gaps = 15/446 (3%) Frame = +2 Query: 101 
MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKR----RYDPTRTRNL-VSSLS 265 MEN L+SSPK +LG SPN + + LP+L +K R+ T +L V SL Sbjct: 1 MENLGLVSSPKIVLGVSPNPRNSVISSKPLVGLPNLLKKTGNYGRHTTIHTSSLQVLSLF 60 Query: 266 QG-------TSDKAHPSR--RLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIG 418 + S+KA R +S +G E + VGVNPQ+SVPP SS +G Sbjct: 61 RSPKPTKTIVSEKAAKDRFATISSSGQETSS------------VGVNPQLSVPP-SSQVG 107 Query: 419 SPLFWIGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXX 598 SPLFWIGVGVG SALFS+VA LKKYAM+QAFKT QM+ +NS F NAAFSPG Sbjct: 108 SPLFWIGVGVGLSALFSFVAGRLKKYAMEQAFKTFTQQMNTQNSPFGNAAFSPGSPFPFP 167 Query: 599 XXXXXXXXXXXXXXXXXXXXXXXXX-ATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEE 775 A+KVE P+ VKD E + PK+YAFVDVSPEE Sbjct: 168 PATSPALDPFRTSTPLASQPITVDVPASKVEDPPSISVKDEVEQETGPKKYAFVDVSPEE 227 Query: 776 MSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSV 955 QK+ FEN ES +T KD Q + VSQNG A Q G GS+ T + P+MSV Sbjct: 228 TLQKNAFENYKESIQTDSPKDPQSSQSVSQNGTAWNQGAG----GSEGPTTSKTAPLMSV 283 Query: 956 EALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRM 1135 EALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DMLNNMGG+ EWDNRM Sbjct: 284 EALEKMMEDPTVQQMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGTPEWDNRM 343 Query: 1136 MDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMS 1315 MDSLKNFD+SSPEVKQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPMS Sbjct: 344 MDSLKNFDISSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMS 403 Query: 1316 IAKYQNDKEVMDVFNKISELFPGVTG 1393 IAKYQNDKEVMDVFNKISELFPGV G Sbjct: 404 IAKYQNDKEVMDVFNKISELFPGVAG 429 >ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum] Length = 443 Score = 475 bits (1222), Expect = e-131 Identities = 262/454 (57%), Positives = 305/454 (67%), Gaps = 21/454 (4%) Frame = +2 Query: 101 MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRK-----RRYDPTRTRNLVSSLS 265 MEN ++SSPK +LG S N + + F LPHL ++ R PT +VS Sbjct: 1 MENIGIVSSPKMVLGLSSNS---VISSKPFFGLPHLPKRPFKNGRTVRPTTCFEVVSCFQ 57 Query: 266 QGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVG 445 K L +G 
+FA VGVNPQ S P P S +GSPLFWIGVG Sbjct: 58 GPRLTKKIV---LGKSGRGSFASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVG 114 Query: 446 VGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXX 625 VGFSALF+WVA+ LKKYAMQQA KT+MGQM+ +NSQF+N AFSPG Sbjct: 115 VGFSALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVS 174 Query: 626 XXXXXXXXXXXXXXXX----------------ATKVESAPTTEVKDNTETKNEPKRYAFV 757 ATKVE PT VK++ E + EPK+ AFV Sbjct: 175 GPASSSPPPPTASSSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFV 234 Query: 758 DVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQP 937 D+SP+E QK FEN +S ET+ V++V+QNGAA++ G+ T S S+ TG+ Sbjct: 235 DISPDETFQKGAFENFKDSAETAAVT----VDQVTQNGAASQSGFGSNTSDSTSS-TGKS 289 Query: 938 GPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSN 1117 P++SV+ALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DM+NNMGG+ Sbjct: 290 NPLLSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNP 349 Query: 1118 EWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDC 1297 EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDC Sbjct: 350 EWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDC 409 Query: 1298 SQNPMSIAKYQNDKEVMDVFNKISELFPGVTGPP 1399 SQNP+SIAKYQNDKEVMDVFNKISELFPGV+G P Sbjct: 410 SQNPLSIAKYQNDKEVMDVFNKISELFPGVSGAP 443 >ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum] Length = 443 Score = 468 bits (1205), Expect = e-129 Identities = 261/454 (57%), Positives = 302/454 (66%), Gaps = 21/454 (4%) Frame = +2 Query: 101 MENFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRK-----RRYDPTRTRNLVSSLS 265 MEN ++SSPK +LG S N + + LPHL ++ R PT +VS Sbjct: 1 MENICIVSSPKMVLGLSSNP---VISNKPLFGLPHLPKRPFKNGRIVRPTTCFEVVSCFQ 57 Query: 266 QGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVG 445 S + L +G +FA VGVNPQ S P S +GSPLFWIGVG Sbjct: 58 ---SPRLTKKIVLGKSGRGSFASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVG 114 Query: 446 VGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXX 625 VG SALF+WVA+ LKKYAMQQA KT+MGQM+ +NSQF+N AFSPG Sbjct: 115 
VGLSALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVS 174 Query: 626 XXXXXXXXXXXXXXXX----------------ATKVESAPTTEVKDNTETKNEPKRYAFV 757 ATKVE PT VK++TE EPK+ AFV Sbjct: 175 GPASSSPPPPTASTSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFV 234 Query: 758 DVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQP 937 D+SP+E QK FEN +S ET+ V++V+QNGAA++ G T S S+ TG+ Sbjct: 235 DISPDETFQKGAFENFKDSTETASVT----VDQVTQNGAASQLGFGPNTSDSTSS-TGKS 289 Query: 938 GPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSN 1117 P+MSV+ALEKMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQL+DM+NNMGG+ Sbjct: 290 NPLMSVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNP 349 Query: 1118 EWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDC 1297 EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDC Sbjct: 350 EWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDC 409 Query: 1298 SQNPMSIAKYQNDKEVMDVFNKISELFPGVTGPP 1399 SQNP+SIAKYQNDKEVMDVFNKISELFPGV+G P Sbjct: 410 SQNPLSIAKYQNDKEVMDVFNKISELFPGVSGSP 443 >ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 432 Score = 457 bits (1175), Expect = e-126 Identities = 264/443 (59%), Positives = 298/443 (67%), Gaps = 13/443 (2%) Frame = +2 Query: 107 NFSLISSPK-FLLGSSPNLGGLIPIRRSFSALPHLSRKR-RYDPTRTRNLVSSLSQGTSD 280 N +L+SSPK +LG P + R H S R P R R VS+LS + Sbjct: 5 NLALVSSPKPLMLGHVPAIDAT---SRDVFRRKHFSFGRVLIAPHRCRFRVSALSSSHRN 61 Query: 281 KAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGFSA 460 +L V+ FA GVNPQ+S PSS IGSPLFWIGVGVG SA Sbjct: 62 PKSVQEKLI---VKHFASISSSNTQEATSTGVNPQLS---PSSTIGSPLFWIGVGVGLSA 115 Query: 461 LFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXX 640 LFS VA+ LKKYAMQQAFKT+MGQM+ +N+QF NAAFSPG Sbjct: 116 LFSVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSA 175 Query: 641 XXXXXXXXXXXAT-----------KVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQK 787 A+ KVE APTT VKD E KNEPK+ AFVDVSPEE Q+ Sbjct: 176 TTQSRAPSASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQE 235 Query: 788 
DPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALE 967 PFE+ + E+S K+ + +EVSQNGA + Q G + GSQST+ V+SV+ALE Sbjct: 236 SPFESFKDD-ESSSVKEARVPDEVSQNGAPSNQGFGDFP-GSQSTKKS----VLSVDALE 289 Query: 968 KMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSL 1147 KMMEDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQLE+MLNNMGGS EWD+RMMD+L Sbjct: 290 KMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTL 349 Query: 1148 KNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKY 1327 KNFDL+SPEVKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM+I KY Sbjct: 350 KNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKY 409 Query: 1328 QNDKEVMDVFNKISELFPGVTGP 1396 QNDKEVMDVFNKISELFPGV P Sbjct: 410 QNDKEVMDVFNKISELFPGVGSP 432 >ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max] Length = 429 Score = 456 bits (1174), Expect = e-126 Identities = 260/441 (58%), Positives = 296/441 (67%), Gaps = 11/441 (2%) Frame = +2 Query: 107 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSDKA 286 N +L+SSPK P + G +P R F + P R R VS+LS + Sbjct: 5 NLALVSSPK------PLMLGHVPARDVFRRKHFSFGRVLIAPHRCRFRVSALSSSHHNPK 58 Query: 287 HPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGFSALF 466 +L V+ FA +GV PQ+S P PSS IGSPLFWIGVGVG SALF Sbjct: 59 SVQEKLI---VKHFASISSSNTQETTSIGVKPQLS-PSPSSTIGSPLFWIGVGVGLSALF 114 Query: 467 SWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXX 646 S VA+ LKKYAMQQAFKT+MGQM+ +N+QF NAAFSPG Sbjct: 115 SVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSATT 174 Query: 647 XXXXXXXXXAT-----------KVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQKDP 793 A+ KVE+APTT VKD E KNEPK+ AFVDVSPEE ++ P Sbjct: 175 QSRAPSASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVRESP 234 Query: 794 FENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKM 973 FE+ + E+S K+ +EVSQNGA + G + GSQST+ +SV+ALEKM Sbjct: 235 FESFKDD-ESSSVKEAWVPDEVSQNGAPSNLGFGDFP-GSQSTKKS----ALSVDALEKM 288 Query: 974 MEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKN 1153 MEDPTVQ 
MVYPYLPEEMRNPTTFKWMLQNPQYRQQLE+MLNNMGGS EWDNRMMD+LKN Sbjct: 289 MEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRMMDTLKN 348 Query: 1154 FDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQN 1333 FDL+SPEVKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM+I KYQN Sbjct: 349 FDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQN 408 Query: 1334 DKEVMDVFNKISELFPGVTGP 1396 DKEVMDVFNKISELFPGV P Sbjct: 409 DKEVMDVFNKISELFPGVGSP 429 >ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum] Length = 433 Score = 451 bits (1160), Expect = e-124 Identities = 256/445 (57%), Positives = 299/445 (67%), Gaps = 14/445 (3%) Frame = +2 Query: 107 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSDKA 286 N +L+SSPK LL + + R+ F+ + N SS + K+ Sbjct: 5 NLALVSSPKPLLLGHSSSRNVFTRRKPFTFGKFFV---------SANSSSSHVTRAAPKS 55 Query: 287 HPSRRLSING---VEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGFS 457 H + + S+ G V FA VGV+PQ+S PPPSS +GSPLFWIGVGVGFS Sbjct: 56 HQNPK-SVQGKLIVHNFASISSSNSQETTSVGVSPQLS-PPPSSTVGSPLFWIGVGVGFS 113 Query: 458 ALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXX 637 ALFS VA+ LKKYAMQQAFKT+MGQM+ +N+ F++AAFSPG Sbjct: 114 ALFSIVASRLKKYAMQQAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGPAAPASS 173 Query: 638 XXXXXXXXXXXXA-----------TKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQ 784 A TKVE+AP+T KD E KNEPK+ FVDVSPEE Q Sbjct: 174 AGTQSQSTSARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQ 233 Query: 785 KDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEAL 964 K PFE+ + E+S K+ + E QNGA + Q G + GSQS V+SVEAL Sbjct: 234 KSPFESFKDVDESSSFKEARAPAEAFQNGAPSNQGFGN-SPGSQSGGKS----VLSVEAL 288 Query: 965 EKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDS 1144 EKMMEDPTVQ MVYPYLPEEMRNP+TFKWMLQNPQYRQQLE+MLNNMGGS EWD+RMMD+ Sbjct: 289 EKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDT 348 Query: 1145 LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAK 1324 LKNFDL+SP+VKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCS NP++IAK Sbjct: 349 
LKNFDLNSPDVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSSNPLNIAK 408 Query: 1325 YQNDKEVMDVFNKISELFPGVTGPP 1399 YQNDKEVMDVFNKISELFPGV+GPP Sbjct: 409 YQNDKEVMDVFNKISELFPGVSGPP 433 >ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris] gi|561020160|gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris] Length = 430 Score = 444 bits (1142), Expect = e-122 Identities = 252/443 (56%), Positives = 296/443 (66%), Gaps = 13/443 (2%) Frame = +2 Query: 107 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYD-------PTRTRNLVSSLS 265 N +L+SS K P + G +P R + + R++ + P R R VS+LS Sbjct: 5 NLALVSSSK------PLMLGHVPARDATDR--DVLRRKPFSLGRVLIAPHRFRYRVSALS 56 Query: 266 QGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVG 445 +L V+ FA +GVNPQ+S PPPSS IGSPLFWIGVG Sbjct: 57 SSHHSPKSVQDKLI---VKHFASISSSNTQETTSIGVNPQLS-PPPSSTIGSPLFWIGVG 112 Query: 446 VGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPG------XXXXXXXXX 607 VG SALFS VA+ LKKYAMQQAFKT+MGQM+ N+ F NAAFSPG Sbjct: 113 VGLSALFSMVASRLKKYAMQQAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTA 172 Query: 608 XXXXXXXXXXXXXXXXXXXXXXATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQK 787 ATKVE+ TT++KD E +N+PK+ AFVDVSPEE QK Sbjct: 173 TAQYGAPSTSSGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQK 232 Query: 788 DPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALE 967 PFE+ ++ +S ++ + +EVSQNGA Q G + GSQST+ +SV+ALE Sbjct: 233 SPFESVKDNESSSVKEEARVPDEVSQNGAPFNQGFGGF-PGSQSTKKS----ALSVDALE 287 Query: 968 KMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSL 1147 KMMEDPTVQ MVYP+LPEEMRNP TFKWMLQNPQYRQQLE ML+NMGGS EWDNRMMD+L Sbjct: 288 KMMEDPTVQKMVYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTL 347 Query: 1148 KNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKY 1327 KNFDL+SPEVKQQFDQIGL+PEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM+I KY Sbjct: 348 KNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKY 407 Query: 1328 QNDKEVMDVFNKISELFPGVTGP 1396 QNDKEVM+VFNKISELFPG+ P Sbjct: 408 QNDKEVMNVFNKISELFPGMGSP 430 >ref|XP_002305876.1| 
hypothetical protein POPTR_0004s08560g [Populus trichocarpa] gi|222848840|gb|EEE86387.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa] Length = 429 Score = 443 bits (1139), Expect = e-121 Identities = 254/433 (58%), Positives = 290/433 (66%), Gaps = 9/433 (2%) Frame = +2 Query: 122 SSPKFLLGSSPNLGGLIPIRRSFSAL-PHLSRKRRYDPTRTRNLVSSLSQGTSDKAHPSR 298 SSPK ++G +L + S S P L R T + S+S A+ Sbjct: 12 SSPKLVMGYPTSLKNPTTPKFSISTTRPSLPFSLRISKTAPHASIFSISA----LANSHG 67 Query: 299 RLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWVA 478 +L G E FA VGVNPQ V PP S IGSPLFW+GVGVG SA+FSWVA Sbjct: 68 KL---GSEYFASISSSSGKQTASVGVNPQ-PVSPPPSQIGSPLFWVGVGVGLSAIFSWVA 123 Query: 479 TNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXXX 658 T +K YAMQQAFK+L QM+ +N+QFN AFS Sbjct: 124 TRVKNYAMQQAFKSLTEQMNTQNNQFN-PAFSARPPFPFSPPPASHPSTSPSPAASQPAI 182 Query: 659 XXXXXATKVESAPTTEVKDNTET--------KNEPKRYAFVDVSPEEMSQKDPFENSTES 814 ATKVE+APTT+V ET K E K+YAFVD+SPEE S PF + + Sbjct: 183 TVDIPATKVEAAPTTDVGKEKETDFLEERKIKEETKKYAFVDISPEETSLNTPFSSVEDD 242 Query: 815 FETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQ 994 ETS SKDV+F ++V QNGAA KQ GA EGSQSTR P +SVEALEKMMEDPT+Q Sbjct: 243 NETSSSKDVEFAKKVFQNGAAFKQGPGA-AEGSQSTR-----PFLSVEALEKMMEDPTMQ 296 Query: 995 NMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPE 1174 MVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGS +WD++MMDSLK+FDL+S E Sbjct: 297 KMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSGKWDSQMMDSLKDFDLNSAE 356 Query: 1175 VKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDV 1354 VKQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQ AIM+CSQNP++I KYQNDKEVMDV Sbjct: 357 VKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQQAIMECSQNPINITKYQNDKEVMDV 416 Query: 1355 FNKISELFPGVTG 1393 FNKISELFPG+TG Sbjct: 417 FNKISELFPGMTG 429 >ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa] gi|550319201|gb|ERP50369.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa] Length = 435 Score = 440 bits (1131), Expect = e-121 Identities = 241/402 
(59%), Positives = 279/402 (69%), Gaps = 8/402 (1%) Frame = +2 Query: 194 ALPHLSRKRRYDPTRTRNLVSSLSQGTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXXVG 373 +LP R + +R +S+LSQ +H RR S NG E FA VG Sbjct: 40 SLPFPHRTSKTVTHTSRISISALSQ-----SHGPRRTSKNGSEYFASISSLSGQQTASVG 94 Query: 374 VNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQ 553 VNPQ SV PP S IGSPLFW+GVGV SA+FSWVAT LK YAMQQAFK+L QM+ +N+Q Sbjct: 95 VNPQ-SVSPPPSQIGSPLFWVGVGVALSAIFSWVATRLKNYAMQQAFKSLTEQMNAQNNQ 153 Query: 554 FNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXATKVESAPTTEVKDNTET-- 727 FN AFS ATKVE+AP T+ + ET Sbjct: 154 FN-PAFSARSPFPFSPPPASQPATSPFQTASQPAVTVDIPATKVEAAPETDARKEKETDT 212 Query: 728 ------KNEPKRYAFVDVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQD 889 K EP+++AFVDVSPEE S PF + + +TS SKDVQF +E SQNGA KQ Sbjct: 213 LEEREIKEEPRKFAFVDVSPEETSLNTPFSSVEDVIDTSSSKDVQFAKEASQNGATFKQG 272 Query: 890 KGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQ 1069 A +E S+ +++ Q +SVEALEKMM+DPTVQ MVYPYLPEEMRNPTTFKWMLQNPQ Sbjct: 273 PSA-SEPSEGSQSSQKAGSLSVEALEKMMDDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQ 331 Query: 1070 YRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEV 1249 YRQQLE+MLNNM GS+EWD+RM+DSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANP+V Sbjct: 332 YRQQLEEMLNNMSGSSEWDSRMVDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDV 391 Query: 1250 AMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVFNKISEL 1375 A+AFQNPRVQ AIM+CSQNP+SIAKYQNDKEVMDVFNKISE+ Sbjct: 392 ALAFQNPRVQQAIMECSQNPLSIAKYQNDKEVMDVFNKISEI 433 >ref|XP_006858564.1| hypothetical protein AMTR_s00071p00175860 [Amborella trichopoda] gi|548862673|gb|ERN20031.1| hypothetical protein AMTR_s00071p00175860 [Amborella trichopoda] Length = 416 Score = 438 bits (1127), Expect = e-120 Identities = 245/419 (58%), Positives = 279/419 (66%), Gaps = 6/419 (1%) Frame = +2 Query: 125 SPKFLLGSSPNLGGLI--PIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQGTSDKAHPSR 298 SPKF LG S + P S+L L KRR RTR +V +L G P + Sbjct: 6 SPKFFLGFSSTSRRVSDNPFLIQRSSLLALCGKRRVTGCRTRVIVGALGHGNGGSRKPYK 65 Query: 299 RLSINGVEAFAXXXXXXXXXXXX-VGVNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWV 
475 +++FA +GVNP + PPP S++GSPLFWIGVGVG SALFSWV Sbjct: 66 FK----MDSFASISSSSTREEATSIGVNPPFTAPPPPSYVGSPLFWIGVGVGISALFSWV 121 Query: 476 ATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXX 655 ATNLKKYAMQQAFKT+MGQMS NSQF+ A F PG Sbjct: 122 ATNLKKYAMQQAFKTMMGQMSSNNSQFSGAGFPPG-PPFPFPPTSPSGTPAAPPTPFASK 180 Query: 656 XXXXXXATKVESAP---TTEVKDNTETKNEPKRYAFVDVSPEEMSQKDPFENSTESFETS 826 T + AP T EVK++T+TK + K + FVD+SPEE+ Q P E ES + S Sbjct: 181 SAVTVDVTASDVAPASSTVEVKEDTKTKKQTKTFEFVDISPEEVMQNRPSEQPKESTDGS 240 Query: 827 PSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQNMVY 1006 P+KDV F EVSQNGA + +K TE QS+R V+SVEALEKMMEDPTVQ MVY Sbjct: 241 PAKDVHFA-EVSQNGALPQTEKSVSTENVQSSRPAD--SVLSVEALEKMMEDPTVQKMVY 297 Query: 1007 PYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPEVKQQ 1186 PYLPEEMRNP TFKWMLQNPQYRQQLEDMLNNMGGS++WDNRMMDSLKNFDLS PEVKQQ Sbjct: 298 PYLPEEMRNPATFKWMLQNPQYRQQLEDMLNNMGGSSDWDNRMMDSLKNFDLSKPEVKQQ 357 Query: 1187 FDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVFNK 1363 FDQIGLTPEEVISKIMANP+VAMAFQNP+VQAAIMDCSQNP+SI KYQNDKEV + K Sbjct: 358 FDQIGLTPEEVISKIMANPDVAMAFQNPKVQAAIMDCSQNPLSITKYQNDKEVRLLLRK 416 >ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial [Theobroma cacao] gi|508712014|gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial [Theobroma cacao] Length = 412 Score = 435 bits (1118), Expect = e-119 Identities = 257/430 (59%), Positives = 284/430 (66%), Gaps = 10/430 (2%) Frame = +2 Query: 107 NFSLISSPK-----FLLGSS-PNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 268 N +L+SS +LLG + PN P F LP S +R S SQ Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNP----FKTLPFPSSNLAPRRSRISIFAHSHSQ 60 Query: 269 GTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGV 448 T + P L G E FA VGVNP +VPPPSS IGSPLFWIGVGV Sbjct: 61 PTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGV 120 Query: 449 GFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXX 628 G SALF+WVA++LKKYAMQQAFKT+MGQM+ +N+QF+NAAF G Sbjct: 
121 GLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTS 180 Query: 629 XXXXXXXXXXXXXXXATKVESAPTT----EVKDNTETKNEPKRYAFVDVSPEEMSQKDPF 796 ATKVE+AP T EVK TET EPK+YAFVDVSPEE QK F Sbjct: 181 PSPSSQTAVTVDVP-ATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAF 238 Query: 797 ENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMM 976 E++ S S + QF K D GA+ GSQST G P +SV+ALEKMM Sbjct: 239 EDAAG---ISSSNNTQF----------PKDDAGAFG-GSQST--GSADPALSVDALEKMM 282 Query: 977 EDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNF 1156 EDPTVQ MVYPYLPEEMRNP TFKWMLQNPQYRQQL+DMLNNMGGS EWDNRMMDSLKNF Sbjct: 283 EDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNF 342 Query: 1157 DLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQND 1336 DL+SP+VKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNP+SIAKYQND Sbjct: 343 DLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQND 402 Query: 1337 KEVMDVFNKI 1366 KEVMDVFNKI Sbjct: 403 KEVMDVFNKI 412 >ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus] Length = 419 Score = 434 bits (1117), Expect = e-119 Identities = 246/434 (56%), Positives = 292/434 (67%), Gaps = 3/434 (0%) Frame = +2 Query: 107 NFSLISSPKFLLGSSPNLGGLIPIRRS--FSALPHLSRKRRYDPTRTRNLVSSLSQGTSD 280 N +L S PKFL + + + R+ A LS R P R L S+L++ Sbjct: 5 NLALTSPPKFLFLTYSSTSSISTPSRTNQLCATRRLSSSRIKSPPRI--LASALNR---- 58 Query: 281 KAHPSRRLSINGVEAFAXXXXXXXXXXXX-VGVNPQISVPPPSSHIGSPLFWIGVGVGFS 457 P+ R+ + E FA VGV P +S+PPPSS++GSPLFW+GVGVG S Sbjct: 59 --RPNHRILV--AERFATVSSSTTSNDSSSVGV-PSVSIPPPSSYVGSPLFWVGVGVGLS 113 Query: 458 ALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXX 637 ALF+WVA+ LKKYAMQQAFKT+M QM+ +NS +N S G Sbjct: 114 ALFTWVASYLKKYAMQQAFKTMMSQMNSQNSPMSNPTLSSGSPFPIPPTFATGTTISPSV 173 Query: 638 XXXXXXXXXXXXATKVESAPTTEVKDNTETKNEPKRYAFVDVSPEEMSQKDPFENSTESF 817 ATKVE P T VK TE E K++AFVDVSPEE QK PF+ ++ Sbjct: 174 SEPAVSIDVT--ATKVEEEPVTNVKSRTENM-EAKKFAFVDVSPEETDQKSPFKE--DAT 228 Query: 818 
ETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSVEALEKMMEDPTVQN 997 + SK Q +E+ QNGAA+KQ ++GSQ +R +PG V+SVEA+EKMMEDPTVQ Sbjct: 229 DADVSKSAQPTQELPQNGAASKQAYNG-SDGSQFSR--KPGSVLSVEAVEKMMEDPTVQK 285 Query: 998 MVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRMMDSLKNFDLSSPEV 1177 M+YP+LPEEMRNP TFKWM+QNP YRQQLE+MLNNM GS +WD R+MDSLKNFDLSSPEV Sbjct: 286 MIYPHLPEEMRNPETFKWMMQNPLYRQQLEEMLNNMSGSPQWDGRLMDSLKNFDLSSPEV 345 Query: 1178 KQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVF 1357 KQQFDQIGLTPEEVISKIMANPE+AMAFQNPRVQAAIMDCSQNP+SI KYQNDKEVMDVF Sbjct: 346 KQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQAAIMDCSQNPLSITKYQNDKEVMDVF 405 Query: 1358 NKISELFPGVTGPP 1399 NKISELFPGV+G P Sbjct: 406 NKISELFPGVSGAP 419 >emb|CAB50925.1| translocon Tic40 [Pisum sativum] Length = 436 Score = 429 bits (1103), Expect = e-117 Identities = 246/449 (54%), Positives = 296/449 (65%), Gaps = 18/449 (4%) Frame = +2 Query: 107 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTR-TRNLVSSLSQGTSDK 283 N +L+SSPK LL + + R+SF+ + R + N SS + K Sbjct: 5 NLALVSSPKPLLLGHSSSKNVFSRRKSFT----------FGTFRVSANSSSSHVTRAASK 54 Query: 284 AHPSRRLSING---VEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGF 454 +H + + S+ G +FA VGV+PQ+S PPPS+ +GSPLFWIG+GVGF Sbjct: 55 SHQNLK-SVQGKVNAHSFASISSSNGQETTSVGVSPQLSPPPPST-VGSPLFWIGIGVGF 112 Query: 455 SALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXX 634 SALFS VA+ +KKYAMQQAFK++MGQM+ +N+ F++ AFS G Sbjct: 113 SALFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAG 172 Query: 635 XXXXXXXXXXXXXA-----------TKVESA---PTTEVKDNTETKNEPKRYAFVDVSPE 772 A TKVE+A P VK+ E KNEPK+ AFVDVSPE Sbjct: 173 FAGNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPE 232 Query: 773 EMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMS 952 E QK+ FE + E+S K+ + E SQNG KQ G + GS S R +S Sbjct: 233 ETVQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGD-SPGSPSERKS----ALS 287 Query: 953 VEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNR 1132 V+ALEKMMEDPTVQ MVYPYLPEEMRNP+TFKWM+QNP+YRQQLE MLNNMGG EWD+R Sbjct: 288 
VDALEKMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSR 347 Query: 1133 MMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPM 1312 MMD+LKNFDL+SP+VKQQFDQIGL+P+EVISKIMANP+VAMAFQNPRVQAAIMDCSQNPM Sbjct: 348 MMDTLKNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPM 407 Query: 1313 SIAKYQNDKEVMDVFNKISELFPGVTGPP 1399 SI KYQNDKEVMDVFNKISELFPGV+GPP Sbjct: 408 SIVKYQNDKEVMDVFNKISELFPGVSGPP 436 >ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 448 Score = 426 bits (1094), Expect = e-116 Identities = 234/388 (60%), Positives = 260/388 (67%), Gaps = 28/388 (7%) Frame = +2 Query: 320 EAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGFSALFSWVATNLKKYA 499 E FA VG+NPQ S PPP S IGSPLFWIGVGV FSA+FSW A L+KY Sbjct: 66 ERFASISSTNSQETSSVGINPQFSAPPPPSTIGSPLFWIGVGVAFSAVFSWAAGKLQKYV 125 Query: 500 MQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAT 679 +QQAFK +MGQM+ +N QF+NAAFSPG AT Sbjct: 126 VQQAFKNVMGQMNTQNDQFSNAAFSPGSPFPFPSAPASPSASPFSAPSQPSFTDVS--AT 183 Query: 680 KVES--------APTTEVKDNTETKNEPK--------------------RYAFVDVSPEE 775 +V+S P +VK + E + AFVDV+PEE Sbjct: 184 EVDSPASSATPSTPAADVKSEEQQMKENRFGNSFEIERNNVIQFSRQLSDRAFVDVNPEE 243 Query: 776 MSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSV 955 K PF +S E SK++ E SQNGAA KQ K A + GSQ+T G+ V+SV Sbjct: 244 TELKSPFASSLNDTEPGSSKEINSNVEGSQNGAAFKQAKDA-SMGSQTT--GKENSVLSV 300 Query: 956 EALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRM 1135 EALEKM+EDPTVQ MVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDML NM GSNEWDNRM Sbjct: 301 EALEKMLEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLRNMTGSNEWDNRM 360 Query: 1136 MDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMS 1315 MDSLKNFDLSSPEVK+QFDQIGLTPE+VISKIMANP+VAMAFQNPRVQAAIMDCSQNPMS Sbjct: 361 MDSLKNFDLSSPEVKEQFDQIGLTPEQVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMS 420 Query: 1316 IAKYQNDKEVMDVFNKISELFPGVTGPP 1399 I KYQNDKEVMDVFNKISELFPGV+G P Sbjct: 421 ITKYQNDKEVMDVFNKISELFPGVSGSP 448 >sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, 
chloroplastic; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=PsTIC40; Flags: Precursor gi|26000725|gb|AAN75219.1| chloroplast protein translocon component Tic40 precursor [Pisum sativum] Length = 436 Score = 426 bits (1094), Expect = e-116 Identities = 245/448 (54%), Positives = 293/448 (65%), Gaps = 17/448 (3%) Frame = +2 Query: 107 NFSLISSPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTR-TRNLVSSLSQGTSDK 283 N +L+SSPK LL + + R+SF+ + R + N SS + K Sbjct: 5 NLALVSSPKPLLLGHSSSKNVFSGRKSFT----------FGTFRVSANSSSSHVTRAASK 54 Query: 284 AHPSRRLSINGVEA--FAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLFWIGVGVGFS 457 +H + + V A FA VGV+PQ+S PPPS+ +GSPLFWIG+GVGFS Sbjct: 55 SHQNLKSVQGKVNAHDFASISSSNGQETTSVGVSPQLSPPPPST-VGSPLFWIGIGVGFS 113 Query: 458 ALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXXXX 637 ALFS VA+ +KKYAMQQAFK++MGQM+ +N+ F++ AFS G Sbjct: 114 ALFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGF 173 Query: 638 XXXXXXXXXXXXA-----------TKVESA---PTTEVKDNTETKNEPKRYAFVDVSPEE 775 A TKVE+A P VK+ E KNEPK+ AFVDVSPEE Sbjct: 174 AGNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEE 233 Query: 776 MSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQSTRTGQPGPVMSV 955 QK+ FE + E+S K+ + E SQNG KQ G + S S R +SV Sbjct: 234 TVQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGD-SPSSPSERKS----ALSV 288 Query: 956 EALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNRM 1135 +ALEKMMEDPTVQ MVYPYLPEEMRNP+TFKWM+QNP+YRQQLE MLNNMGG EWD+RM Sbjct: 289 DALEKMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRM 348 Query: 1136 MDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMS 1315 MD+LKNFDL+SP+VKQQFDQIGL+P+EVISKIMANP+VAMAFQNPRVQAAIMDCSQNPMS Sbjct: 349 MDTLKNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMS 408 Query: 1316 IAKYQNDKEVMDVFNKISELFPGVTGPP 1399 I KYQNDKEVMDVFNKISELFPGV+GPP Sbjct: 409 IVKYQNDKEVMDVFNKISELFPGVSGPP 436 >ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein TIC 40, 
chloroplastic; AltName: Full=Protein PIGMENT DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the inner envelope membrane of chloroplasts 40; Short=AtTIC40; Flags: Precursor gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6 [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|20260222|gb|AAM13009.1| translocon Tic40-like protein [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1| At5g16620 [Arabidopsis thaliana] gi|332004935|gb|AED92318.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana]
Length = 447

Score = 411 bits (1057), Expect = e-112
Identities = 238/457 (52%), Positives = 287/457 (62%), Gaps = 26/457 (5%)
Frame = +2

Query: 101 MENFSLIS----SPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 268
MEN +L+S SPK L+G N + FS +R + + +S+ +Q
Sbjct: 1 MENLTLVSCSASSPKLLIGC--NFTSSLKNPTGFS-------RRTPNIVLRCSKISASAQ 51

Query: 269 GTSDKAHPSRRLSINGVE----AFAXXXXXXXXXXXXVGVNPQISVPPPSSH-IGSPLFW 433
S + P I V+ AFA +P + VPPPSS IGSPLFW
Sbjct: 52 SQSPSSRPENTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 434 IGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXX 613
IGVGVG SALFS+V +NLKKYAMQ A KT+M QM+ +NSQFNN+ F G
Sbjct: 112 IGVGVGLSALFSYVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPFPPQT 171

Query: 614 XXXXXXXXXXXXXXXXXXXX-ATKVESAPTTEVK----------------DNTETKNEPK 742
ATKVE+ P+T+ K + ++ K E K
Sbjct: 172 SPASSPFQSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 743 RYAFVDVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQST 922
YAF D+SPEE +++ PF N E ET+ K+ + E+V QNGA A +E QS
Sbjct: 232 NYAFEDISPEETTKESPFSNYAEVSETNSPKETRLFEDVLQNGAGPANGATA-SEVFQSL 290

Query: 923 RTGQPGPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNN 1102
G+ GP +SVEALEKMMEDPTVQ MVYPYLPEEMRNP TFKWML+NPQYRQQL+DMLNN
Sbjct: 291 GGGKGGPGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNN 350

Query: 1103 MGGSNEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQA 1282
M GS EWD RM D+LKNFDL+SPEVKQQF+QIGLTPEEVISKIM NP+VAMAFQNPRVQA
Sbjct: 351 MSGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQA 410

Query: 1283 AIMDCSQNPMSIAKYQNDKEVMDVFNKISELFPGVTG 1393
A+M+CS+NPM+I KYQNDKEVMDVFNKIS+LFPG+TG
Sbjct: 411 ALMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447

>ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata]
Length = 447

Score = 405 bits (1040), Expect = e-110
Identities = 237/457 (51%), Positives = 282/457 (61%), Gaps = 26/457 (5%)
Frame = +2

Query: 101 MENFSLIS----SPKFLLGSSPNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 268
MEN +L+S SPK L+G + S SR+ R +S+ +Q
Sbjct: 1 MENLTLVSCSASSPKLLIGCN--------FTSSLKNPTGFSRRTPRIVLRCSK-ISASAQ 51

Query: 269 GTSDKAHPSRRLSINGVE----AFAXXXXXXXXXXXXVGVNPQISVPPPSSH-IGSPLFW 433
S + P I V+ AFA +P + VPPPSS IGSPLFW
Sbjct: 52 SQSPSSRPDNTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 434 IGVGVGFSALFSWVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXX 613
IGVGVG SALFS V +NLKKYAMQ A KT+M QM+ +NSQFNN F G
Sbjct: 112 IGVGVGLSALFSLVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPFPPQT 171

Query: 614 XXXXXXXXXXXXXXXXXXXX-ATKVESAPTTEVK----------------DNTETKNEPK 742
ATKV++ P+T+ K + ++ K E K
Sbjct: 172 SPASSPFQSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 743 RYAFVDVSPEEMSQKDPFENSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQST 922
YAF D+SPEE +++ PF N E ETS K+ + E+V QNGA A +E QS
Sbjct: 232 NYAFEDISPEETTKESPFSNYAEVSETSSPKETRLFEDVLQNGAGPANGATA-SEVFQSL 290

Query: 923 RTGQPGPVMSVEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNN 1102
G+ G +SVEALEKMMEDPTVQ MVYPYLPEEMRNP TFKWML+NPQYRQQL+DMLNN
Sbjct: 291 GGGKGGAGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNN 350

Query: 1103 MGGSNEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQA 1282
M GS EWD RM D+LKNFDL+SPEVKQQF+QIGLTPEEVISKIM NP+VAMAFQNPRVQA
Sbjct: 351 MSGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQA 410

Query: 1283 AIMDCSQNPMSIAKYQNDKEVMDVFNKISELFPGVTG 1393
A+M+CS+NPM+I KYQNDKEVMDVFNKIS+LFPG+TG
Sbjct: 411 ALMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447

>ref|XP_007032982.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508712011|gb|EOY03908.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao]
Length = 531

Score = 400 bits (1029), Expect = e-109
Identities = 262/537 (48%), Positives = 293/537 (54%), Gaps = 106/537 (19%)
Frame = +2

Query: 107 NFSLISSPK-----FLLGSS-PNLGGLIPIRRSFSALPHLSRKRRYDPTRTRNLVSSLSQ 268
N +L+SS +LLG + PN P F LP S +R S SQ
Sbjct: 5 NLALVSSSSPPLKLYLLGCNHPNYTPKNP----FKTLPFPSSNLAPRRSRISIFAHSHSQ 60

Query: 269 GTSDKAHPSRRLSINGVEAFAXXXXXXXXXXXXVGVNPQISVPPPSSHIGSPLF------ 430
T + P L G E FA VGVNP +VPPPSS IGSPLF
Sbjct: 61 PTPPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGV 120

Query: 431 -------WIGVGVGFSALFS---------------------------------------- 469
W+ V S F
Sbjct: 121 GLSALFTWVSTKVASSLKFGSSIGYCNEAMKTDYGKMNSCMGPHRVGGLDFGVDDKVVVN 180

Query: 470 ------WVATNLKKYAMQQAFKTLMGQMSPENSQFNNAAFSPGXXXXXXXXXXXXXXXXX 631
+ + +KYAMQQAFKT+MGQM+ +N+QF+NAAF G
Sbjct: 181 GIMDPGYKIASQQKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPGPVTSP 240

Query: 632 XXXXXXXXXXXXXXATKVESAPTT----EVKDNTETKNEPKRYAFVDVSPEEMSQKDPFE 799
ATKVE+AP T EVK TET EPK+YAFVDVSPEE QK FE
Sbjct: 241 SPSSQTAVTVDVP-ATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAFE 298

Query: 800 NSTESFETSPSKDVQFVEEVSQNGAAAKQDKGAYTEGSQST---------RTGQPGPVMS 952
++ S S + QF ++VS NGAA+KQD GA+ GSQST G P +S
Sbjct: 299 DAAG---ISSSNNTQFPKDVSDNGAASKQDAGAFG-GSQSTVKLNKHPIALAGSADPALS 354

Query: 953 VEALEKMMEDPTVQNMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSNEWDNR 1132
V+ALEKMMEDPTVQ MVYPYLPEEMRNP TFKWMLQNPQYRQQL+DMLNNMGGS EWDNR
Sbjct: 355 VDALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNR 414

Query: 1133 MMDSLKNFDLSSPEVKQQF----------------------------DQIGLTPEEVISK 1228
MMDSLKNFDL+SP+VKQQF DQIGLTPEEVISK
Sbjct: 415 MMDSLKNFDLNSPDVKQQFVSRWSVSVVLECSLVPEEGSYISLSPFADQIGLTPEEVISK 474

Query: 1229 IMANPEVAMAFQNPRVQAAIMDCSQNPMSIAKYQNDKEVMDVFNKISELFPGVTGPP 1399
IMANPEVAMAFQNPRVQAAIMDCSQNP+SIAKYQNDKEVMDVFNKISELFPGVTG P
Sbjct: 475 IMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGVTGSP 531