BLASTX nr result

ID: Mentha27_contig00006742 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00006742
         (1410 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus...   451   e-124
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   413   e-113
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   406   e-110
ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   402   e-109
ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family prot...   393   e-107
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   387   e-105
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   385   e-104
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   379   e-102
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      379   e-102
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   379   e-102
ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family prot...   378   e-102
ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phas...   376   e-101
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...   376   e-101
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                       376   e-101
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   360   8e-97
ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu...   352   3e-94
ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu...   351   5e-94
ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutr...   350   1e-93
ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092...   349   1e-93
ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab...   346   2e-92

>gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus guttatus]
          Length = 430

 Score =  451 bits (1161), Expect = e-124
 Identities = 250/377 (66%), Positives = 278/377 (73%), Gaps = 7/377 (1%)
 Frame = -3

Query: 1408 EDGFASIASSGQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYA 1229
            +D FA+I+SSGQ+TSSVG                 LFWIGVGVGLSALFS+VAGR+KKYA
Sbjct: 76   KDRFATISSSGQETSSVGVNPQLSVPPSSQVGSP-LFWIGVGVGLSALFSFVAGRLKKYA 134

Query: 1228 MEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAP-SFQTTTS- 1055
            MEQAFKTFTQQMNTQN+PFGN AF+                       G+P  F   TS 
Sbjct: 135  MEQAFKTFTQQMNTQNSPFGNAAFSP----------------------GSPFPFPPATSP 172

Query: 1054 --SPFKSGA--ASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKN 890
               PF++    ASQ +T DVP +KVEDPPS SVKD+VE E  PKKYAF DVSPEET+QKN
Sbjct: 173  ALDPFRTSTPLASQPITVDVPASKVEDPPSISVKDEVEQETGPKKYAFVDVSPEETLQKN 232

Query: 889  AFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKM 710
            AFE +YKES +QTDSP +         QN                    PLMSV+ALEKM
Sbjct: 233  AFE-NYKES-IQTDSPKD-PQSSQSVSQNGTAWNQGAGGSEGPTTSKTAPLMSVEALEKM 289

Query: 709  MEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKN 530
            MEDPTVQQMV+PYLPEEMRNPTTFKWMLQNP YRQQLQDMLNNMGG PEWDNRMMD+LKN
Sbjct: 290  MEDPTVQQMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGTPEWDNRMMDSLKN 349

Query: 529  FDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQN 350
            FD+SSPE+KQQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQAAI+DCSQNP+SIAKYQN
Sbjct: 350  FDISSPEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIAKYQN 409

Query: 349  DKEVMDVFNKITELFPG 299
            DKEVMDVFNKI+ELFPG
Sbjct: 410  DKEVMDVFNKISELFPG 426


>ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera]
            gi|296089465|emb|CBI39284.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  413 bits (1062), Expect = e-113
 Identities = 222/369 (60%), Positives = 248/369 (67%)
 Frame = -3

Query: 1399 FASIASSGQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAMEQ 1220
            FASI+SS Q TSSVG                PLFWIGVGVGLSALFSWVA  +KKYAM+Q
Sbjct: 69   FASISSSSQGTSSVGVNPQFSPPPPSSNIGSPLFWIGVGVGLSALFSWVASNLKKYAMQQ 128

Query: 1219 AFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFKS 1040
            AFKT   QM++QNN F  T F+                      T +PS  TT  SP   
Sbjct: 129  AFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTSHSGPTTSPSGPTT--SPSTV 186

Query: 1039 GAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYKESS 860
             A S    DVP TKVE PP+T VKD +E ++   KYAF DVSPEET+Q++ FE    E S
Sbjct: 187  AAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVSPEETLQESPFENF--EES 244

Query: 859  VQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQMV 680
             +T S  +                               P +SVDALEKMMEDPTVQ+MV
Sbjct: 245  TETSSSKDAQFSAGVSQNGTPPRPGMGVSEDSQSTRNANPFLSVDALEKMMEDPTVQKMV 304

Query: 679  FPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIKQ 500
            +PYLPEEMRNPTTFKWMLQNP YRQQLQDMLNNMGG  EWDNRMMD LKNFDLSSPE+KQ
Sbjct: 305  YPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRMMDNLKNFDLSSPEVKQ 364

Query: 499  QFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFNK 320
            QFDQIGLTP+EV+ KIM NPDVA+AFQNPR+QAAI+DCSQNPLSIAKYQNDKEVMDVFNK
Sbjct: 365  QFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDCSQNPLSIAKYQNDKEVMDVFNK 424

Query: 319  ITELFPGPS 293
            I+ELFPG S
Sbjct: 425  ISELFPGVS 433


>ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum]
          Length = 443

 Score =  406 bits (1044), Expect = e-110
 Identities = 221/370 (59%), Positives = 252/370 (68%), Gaps = 1/370 (0%)
 Frame = -3

Query: 1399 FASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FAS  +SG QQTSSVG                PLFWIGVGVGLSALF+WVA  +KKYAM+
Sbjct: 75   FASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVGVGLSALFAWVASYLKKYAMQ 134

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QA KT   QMN QN+ F N AF+                      +  P   +T+S+P  
Sbjct: 135  QALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVSGPASSSPPPPTASTSSTPSA 194

Query: 1042 SGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYKES 863
            S A+     DV  TKVE+PP+ +VK+  E    PKK AF D+SP+ET QK AFE ++K+S
Sbjct: 195  SFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFVDISPDETFQKGAFE-NFKDS 253

Query: 862  SVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQQM 683
               T++ +                                PLMSVDALEKMMEDPTVQ+M
Sbjct: 254  ---TETASVTVDQVTQNGAASQLGFGPNTSDSTSSTGKSNPLMSVDALEKMMEDPTVQKM 310

Query: 682  VFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPEIK 503
            V+PYLPEEMRNPTTFKWMLQNP YRQQLQDM+NNMGGNPEWDNRMMD+LKNFDLSSPEIK
Sbjct: 311  VYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIK 370

Query: 502  QQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDVFN 323
            QQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQAAI+DCSQNPLSIAKYQNDKEVMDVFN
Sbjct: 371  QQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFN 430

Query: 322  KITELFPGPS 293
            KI+ELFPG S
Sbjct: 431  KISELFPGVS 440


>ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum]
          Length = 443

 Score =  402 bits (1033), Expect = e-109
 Identities = 219/374 (58%), Positives = 252/374 (67%), Gaps = 5/374 (1%)
 Frame = -3

Query: 1399 FASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FAS  +SG +QTSSVG                PLFWIGVGVG SALF+WVA  +KKYAM+
Sbjct: 75   FASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVGVGFSALFAWVASYLKKYAMQ 134

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QA KT   QMN QN+ F NTAF+                      +  P   +++S+P  
Sbjct: 135  QALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVSGPASSSPPPPTASSSSTPSA 194

Query: 1042 SGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYKES 863
            S A+     DV  TKVE+PP+ +VK+  E E  PKK AF D+SP+ET QK AFE ++K+S
Sbjct: 195  SFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFVDISPDETFQKGAFE-NFKDS 253

Query: 862  S----VQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 695
            +    V  D              +                     L+SVDALEKMMEDPT
Sbjct: 254  AETAAVTVDQVTQNGAASQSGFGSNTSDSTSSTGKSNP-------LLSVDALEKMMEDPT 306

Query: 694  VQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSS 515
            VQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQLQDM+NNMGGNPEWDNRMMD+LKNFDLSS
Sbjct: 307  VQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSS 366

Query: 514  PEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVM 335
            PEIKQQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQAAI+DCSQNPLSIAKYQNDKEVM
Sbjct: 367  PEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVM 426

Query: 334  DVFNKITELFPGPS 293
            DVFNKI+ELFPG S
Sbjct: 427  DVFNKISELFPGVS 440


>ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508712012|gb|EOY03909.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 433

 Score =  393 bits (1010), Expect = e-107
 Identities = 220/374 (58%), Positives = 249/374 (66%), Gaps = 4/374 (1%)
 Frame = -3

Query: 1408 EDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKY 1232
            ++ FASI+SS  QQTSSVG                PLFWIGVGVGLSALF+WVA  +KKY
Sbjct: 77   DERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGVGLSALFTWVASSLKKY 136

Query: 1231 AMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSS 1052
            AM+QAFKT   QMNTQNN F N AF                         AP      +S
Sbjct: 137  AMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFP----------------APPSPGPVTS 180

Query: 1051 PFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDA---PKKYAFKDVSPEETVQKNAFE 881
            P  S   + +V DVP TKVE  P+T+   +V+ E     PKKYAF DVSPEETVQK+AFE
Sbjct: 181  PSPSSQTAVTV-DVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQKSAFE 239

Query: 880  EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMED 701
            +     +    S NN                               P +SVDALEKMMED
Sbjct: 240  D-----AAGISSSNNTQFPKDVSDNGAASKQDAGAFGGSQSTGSADPALSVDALEKMMED 294

Query: 700  PTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDL 521
            PTVQ+MV+PYLPEEMRNP TFKWMLQNP YRQQLQDMLNNMGG+ EWDNRMMD+LKNFDL
Sbjct: 295  PTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNFDL 354

Query: 520  SSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKE 341
            +SP++KQQFDQIGLTP+EV+ KIM NP+VAMAFQNPRVQAAI+DCSQNPLSIAKYQNDKE
Sbjct: 355  NSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKE 414

Query: 340  VMDVFNKITELFPG 299
            VMDVFNKI+ELFPG
Sbjct: 415  VMDVFNKISELFPG 428


>ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 429

 Score =  387 bits (995), Expect = e-105
 Identities = 219/386 (56%), Positives = 261/386 (67%), Gaps = 16/386 (4%)
 Frame = -3

Query: 1399 FASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+SS  Q+T+S+G                 LFWIGVGVGLSALFS VA R+KKYAM+
Sbjct: 69   FASISSSNTQETTSIGVKPQLSPSPSSTIGSP-LFWIGVGVGLSALFSVVASRLKKYAMQ 127

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSS--P 1049
            QAFKT   QMN+QNN FGN AF+                        AP+   TT S  P
Sbjct: 128  QAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPT--------APASSATTQSRAP 179

Query: 1048 FKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--E 878
              S A+  ++T D+P  KVE  P+T+VKD+VE ++ PKK AF DVSPEETV+++ FE  +
Sbjct: 180  SASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVRESPFESFK 239

Query: 877  DYKESSV----------QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSV 728
            D + SSV          Q  +P+N         Q+                      +SV
Sbjct: 240  DDESSSVKEAWVPDEVSQNGAPSNLGFGDFPGSQSTKKSA-----------------LSV 282

Query: 727  DALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRM 548
            DALEKMMEDPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQL++MLNNMGG+ EWDNRM
Sbjct: 283  DALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRM 342

Query: 547  MDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLS 368
            MDTLKNFDL+SPE+KQQFDQIGL+P+EV+ KIM NP+VAMAFQNPRVQAAI+DCSQNP++
Sbjct: 343  MDTLKNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMN 402

Query: 367  IAKYQNDKEVMDVFNKITELFPGPSS 290
            I KYQNDKEVMDVFNKI+ELFPG  S
Sbjct: 403  ITKYQNDKEVMDVFNKISELFPGVGS 428


>ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 432

 Score =  385 bits (990), Expect = e-104
 Identities = 218/386 (56%), Positives = 260/386 (67%), Gaps = 16/386 (4%)
 Frame = -3

Query: 1399 FASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+SS  Q+ +S G                 LFWIGVGVGLSALFS VA R+KKYAM+
Sbjct: 74   FASISSSNTQEATSTGVNPQLSPSSTIGSP---LFWIGVGVGLSALFSVVASRLKKYAMQ 130

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSS--P 1049
            QAFKT   QMN+QNN FGN AF+                        AP+   TT S  P
Sbjct: 131  QAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPT--------APASSATTQSRAP 182

Query: 1048 FKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--E 878
              S A+  ++T D+P  KVE  P+T+VKD+VE ++ PKK AF DVSPEETVQ++ FE  +
Sbjct: 183  SASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQESPFESFK 242

Query: 877  DYKESSV----------QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSV 728
            D + SSV          Q  +P+N         Q+                     ++SV
Sbjct: 243  DDESSSVKEARVPDEVSQNGAPSNQGFGDFPGSQSTKKS-----------------VLSV 285

Query: 727  DALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRM 548
            DALEKMMEDPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQL++MLNNMGG+ EWD+RM
Sbjct: 286  DALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRM 345

Query: 547  MDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLS 368
            MDTLKNFDL+SPE+KQQFDQIGL+P+EV+ KIM NP+VAMAFQNPRVQAAI+DCSQNP++
Sbjct: 346  MDTLKNFDLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMN 405

Query: 367  IAKYQNDKEVMDVFNKITELFPGPSS 290
            I KYQNDKEVMDVFNKI+ELFPG  S
Sbjct: 406  ITKYQNDKEVMDVFNKISELFPGVGS 431


>ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum]
          Length = 433

 Score =  379 bits (974), Expect = e-102
 Identities = 217/386 (56%), Positives = 256/386 (66%), Gaps = 17/386 (4%)
 Frame = -3

Query: 1399 FASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+SS  Q+T+SVG                 LFWIGVGVG SALFS VA R+KKYAM+
Sbjct: 71   FASISSSNSQETTSVGVSPQLSPPPSSTVGSP-LFWIGVGVGFSALFSIVASRLKKYAMQ 129

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QAFKT   QMNTQNNPF + AF+                        AP+    T S   
Sbjct: 130  QAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGP--------AAPASSAGTQSQST 181

Query: 1042 SG-AASQSVT--DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--E 878
            S   ASQS    D+P TKVE  PST+ KD+VE ++ PKK  F DVSPEE+VQK+ FE  +
Sbjct: 182  SARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQKSPFESFK 241

Query: 877  DYKESS-----------VQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMS 731
            D  ESS            Q  +P+N         Q+                     ++S
Sbjct: 242  DVDESSSFKEARAPAEAFQNGAPSNQGFGNSPGSQSGGKS-----------------VLS 284

Query: 730  VDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNR 551
            V+ALEKMMEDPTVQ+MV+PYLPEEMRNP+TFKWMLQNP YRQQL++MLNNMGG+ EWD+R
Sbjct: 285  VEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSR 344

Query: 550  MMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPL 371
            MMDTLKNFDL+SP++KQQFDQIGL+P+EV+ KIM NP+VAMAFQNPRVQAAI+DCS NPL
Sbjct: 345  MMDTLKNFDLNSPDVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSSNPL 404

Query: 370  SIAKYQNDKEVMDVFNKITELFPGPS 293
            +IAKYQNDKEVMDVFNKI+ELFPG S
Sbjct: 405  NIAKYQNDKEVMDVFNKISELFPGVS 430


>emb|CAB50925.1| translocon Tic40 [Pisum sativum]
          Length = 436

 Score =  379 bits (974), Expect = e-102
 Identities = 215/381 (56%), Positives = 251/381 (65%), Gaps = 12/381 (3%)
 Frame = -3

Query: 1399 FASIASS-GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+SS GQ+T+SVG                 LFWIG+GVG SALFS VA RVKKYAM+
Sbjct: 71   FASISSSNGQETTSVGVSPQLSPPPPSTVGSP-LFWIGIGVGFSALFSVVASRVKKYAMQ 129

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QAFK+   QMNTQNNPF + AF+                    F  G  S  T+T    +
Sbjct: 130  QAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGF-AGNQSQATST----R 184

Query: 1042 SGAASQSVTDVPVTKVE---DPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEE-- 878
            S + S    D+P TKVE     P  +VK++VE ++ PKK AF DVSPEETVQKNAFE   
Sbjct: 185  SASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNAFERFK 244

Query: 877  ------DYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALE 716
                   +KE+    ++  N          +                      +SVDALE
Sbjct: 245  DVDESSSFKEARAPAEASQNGTPFKQGFGDSPGSPSERKSA------------LSVDALE 292

Query: 715  KMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTL 536
            KMMEDPTVQQMV+PYLPEEMRNP+TFKWM+QNP YRQQL+ MLNNMGG  EWD+RMMDTL
Sbjct: 293  KMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTL 352

Query: 535  KNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKY 356
            KNFDL+SP++KQQFDQIGL+PQEV+ KIM NPDVAMAFQNPRVQAAI+DCSQNP+SI KY
Sbjct: 353  KNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKY 412

Query: 355  QNDKEVMDVFNKITELFPGPS 293
            QNDKEVMDVFNKI+ELFPG S
Sbjct: 413  QNDKEVMDVFNKISELFPGVS 433


>sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Translocon
            at the inner envelope membrane of chloroplasts 40;
            Short=PsTIC40; Flags: Precursor
            gi|26000725|gb|AAN75219.1| chloroplast protein translocon
            component Tic40 precursor [Pisum sativum]
          Length = 436

 Score =  379 bits (974), Expect = e-102
 Identities = 215/381 (56%), Positives = 251/381 (65%), Gaps = 12/381 (3%)
 Frame = -3

Query: 1399 FASIASS-GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+SS GQ+T+SVG                 LFWIG+GVG SALFS VA RVKKYAM+
Sbjct: 71   FASISSSNGQETTSVGVSPQLSPPPPSTVGSP-LFWIGIGVGFSALFSVVASRVKKYAMQ 129

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QAFK+   QMNTQNNPF + AF+                    F  G  S  T+T    +
Sbjct: 130  QAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGF-AGNQSQATST----R 184

Query: 1042 SGAASQSVTDVPVTKVE---DPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEE-- 878
            S + S    D+P TKVE     P  +VK++VE ++ PKK AF DVSPEETVQKNAFE   
Sbjct: 185  SASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNAFERFK 244

Query: 877  ------DYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALE 716
                   +KE+    ++  N          +                      +SVDALE
Sbjct: 245  DVDESSSFKEARAPAEASQNGTPFKQGFGDSPSSPSERKSA------------LSVDALE 292

Query: 715  KMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTL 536
            KMMEDPTVQQMV+PYLPEEMRNP+TFKWM+QNP YRQQL+ MLNNMGG  EWD+RMMDTL
Sbjct: 293  KMMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTL 352

Query: 535  KNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKY 356
            KNFDL+SP++KQQFDQIGL+PQEV+ KIM NPDVAMAFQNPRVQAAI+DCSQNP+SI KY
Sbjct: 353  KNFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKY 412

Query: 355  QNDKEVMDVFNKITELFPGPS 293
            QNDKEVMDVFNKI+ELFPG S
Sbjct: 413  QNDKEVMDVFNKISELFPGVS 433


>ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial
            [Theobroma cacao] gi|508712014|gb|EOY03911.1|
            Hydroxyproline-rich glycoprotein family protein isoform
            4, partial [Theobroma cacao]
          Length = 412

 Score =  378 bits (971), Expect = e-102
 Identities = 216/369 (58%), Positives = 244/369 (66%), Gaps = 5/369 (1%)
 Frame = -3

Query: 1408 EDGFASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKY 1232
            ++ FASI+SS  QQTSSVG                PLFWIGVGVGLSALF+WVA  +KKY
Sbjct: 77   DERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWIGVGVGLSALFTWVASSLKKY 136

Query: 1231 AMEQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSS 1052
            AM+QAFKT   QMNTQNN F N AF                         AP      +S
Sbjct: 137  AMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFP----------------APPSPGPVTS 180

Query: 1051 PFKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDA---PKKYAFKDVSPEETVQKNAFE 881
            P  S   + +V DVP TKVE  P+T+   +V+ E     PKKYAF DVSPEETVQK+AFE
Sbjct: 181  PSPSSQTAVTV-DVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQKSAFE 239

Query: 880  EDYK-ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMME 704
            +     SS  T  P +                               P +SVDALEKMME
Sbjct: 240  DAAGISSSNNTQFPKD----------------DAGAFGGSQSTGSADPALSVDALEKMME 283

Query: 703  DPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFD 524
            DPTVQ+MV+PYLPEEMRNP TFKWMLQNP YRQQLQDMLNNMGG+ EWDNRMMD+LKNFD
Sbjct: 284  DPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNFD 343

Query: 523  LSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDK 344
            L+SP++KQQFDQIGLTP+EV+ KIM NP+VAMAFQNPRVQAAI+DCSQNPLSIAKYQNDK
Sbjct: 344  LNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDK 403

Query: 343  EVMDVFNKI 317
            EVMDVFNKI
Sbjct: 404  EVMDVFNKI 412


>ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
            gi|561020160|gb|ESW18931.1| hypothetical protein
            PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 430

 Score =  376 bits (966), Expect = e-101
 Identities = 210/373 (56%), Positives = 250/373 (67%), Gaps = 3/373 (0%)
 Frame = -3

Query: 1399 FASIASSG-QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+SS  Q+T+S+G                 LFWIGVGVGLSALFS VA R+KKYAM+
Sbjct: 74   FASISSSNTQETTSIGVNPQLSPPPSSTIGSP-LFWIGVGVGLSALFSMVASRLKKYAMQ 132

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QAFKT   QMN+ NN FGN AF+                     + GAPS          
Sbjct: 133  QAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATA--QYGAPSTS-------- 182

Query: 1042 SGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYK 869
            SG+ S    D+P TKVE   +T +KD+VE ++ PKK AF DVSPEETVQK+ FE  +D +
Sbjct: 183  SGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPFESVKDNE 242

Query: 868  ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQ 689
             SSV+ ++             N                      +SVDALEKMMEDPTVQ
Sbjct: 243  SSSVKEEARVPDEVSQNGAPFNQGFGGFPGSQSTKKSA------LSVDALEKMMEDPTVQ 296

Query: 688  QMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPE 509
            +MV+P+LPEEMRNP TFKWMLQNP YRQQL+ ML+NMGG+ EWDNRMMDTLKNFDL+SPE
Sbjct: 297  KMVYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTLKNFDLNSPE 356

Query: 508  IKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDV 329
            +KQQFDQIGL+P+EV+ KIM NP+VAMAFQNPRVQAAI+DCSQNP++I KYQNDKEVM+V
Sbjct: 357  VKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMNV 416

Query: 328  FNKITELFPGPSS 290
            FNKI+ELFPG  S
Sbjct: 417  FNKISELFPGMGS 429


>ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis]
            gi|223528427|gb|EEF30461.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 465

 Score =  376 bits (966), Expect = e-101
 Identities = 215/378 (56%), Positives = 247/378 (65%), Gaps = 11/378 (2%)
 Frame = -3

Query: 1399 FASIASSGQQTSSVGAXXXXXXXXXXXXXXXP-LFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI SS QQTSSVG                  LFWIGVGVGLSA+FS VA RVK YAM+
Sbjct: 90   FASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLFWIGVGVGLSAIFSLVATRVKNYAMQ 148

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGA-------PSFQT 1064
            QAFK+   QMNTQN+ F N AF+                    F T +       PS+ T
Sbjct: 149  QAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPASVPASSPPFPTSSTSRPATSPSYPT 208

Query: 1063 TTSSPFKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNA 887
            +++S   S A+  +VT DV  TKVE    T  KD+ E    PKKYAF DVSPEET  K+ 
Sbjct: 209  SSASTSPSVASQPAVTVDVSATKVEAASVTDAKDEAEITKEPKKYAFVDVSPEETFPKSP 268

Query: 886  FE--EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEK 713
            F+  ED  E+S   D+  N          N                      +SV+ALEK
Sbjct: 269  FKSNEDILETSTSKDTQFNPEVLQNGAASNQGAADFTGSQSTRKAGSG----LSVEALEK 324

Query: 712  MMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLK 533
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWMLQNP YRQQL++MLNNM G  EWDNRMMD+LK
Sbjct: 325  MMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMSGTGEWDNRMMDSLK 384

Query: 532  NFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQ 353
            NFDLSSPE+KQQFDQIGLTP+EV+ KIM NP++AMAFQNPRVQ AI+DCSQNPLSIAKYQ
Sbjct: 385  NFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQQAIMDCSQNPLSIAKYQ 444

Query: 352  NDKEVMDVFNKITELFPG 299
            NDKEVMDVFNKI+ELFPG
Sbjct: 445  NDKEVMDVFNKISELFPG 462


>gb|ABF19057.1| plastid Tic40 [Ricinus communis]
          Length = 460

 Score =  376 bits (966), Expect = e-101
 Identities = 215/378 (56%), Positives = 247/378 (65%), Gaps = 11/378 (2%)
 Frame = -3

Query: 1399 FASIASSGQQTSSVGAXXXXXXXXXXXXXXXP-LFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI SS QQTSSVG                  LFWIGVGVGLSA+FS VA RVK YAM+
Sbjct: 85   FASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLFWIGVGVGLSAIFSLVATRVKNYAMQ 143

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGA-------PSFQT 1064
            QAFK+   QMNTQN+ F N AF+                    F T +       PS+ T
Sbjct: 144  QAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPASVPASSPPFPTSSTSRPATSPSYPT 203

Query: 1063 TTSSPFKSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNA 887
            +++S   S A+  +VT DV  TKVE    T  KD+ E    PKKYAF DVSPEET  K+ 
Sbjct: 204  SSASTSPSVASQPAVTVDVSATKVEAASVTDAKDEAEITKEPKKYAFVDVSPEETFPKSP 263

Query: 886  FE--EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEK 713
            F+  ED  E+S   D+  N          N                      +SV+ALEK
Sbjct: 264  FKSNEDILETSTSKDTQFNPEVLQNGAASNQGAADFTGSQSTRKAGSG----LSVEALEK 319

Query: 712  MMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLK 533
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWMLQNP YRQQL++MLNNM G  EWDNRMMD+LK
Sbjct: 320  MMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMSGTGEWDNRMMDSLK 379

Query: 532  NFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQ 353
            NFDLSSPE+KQQFDQIGLTP+EV+ KIM NP++AMAFQNPRVQ AI+DCSQNPLSIAKYQ
Sbjct: 380  NFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQQAIMDCSQNPLSIAKYQ 439

Query: 352  NDKEVMDVFNKITELFPG 299
            NDKEVMDVFNKI+ELFPG
Sbjct: 440  NDKEVMDVFNKISELFPG 457


>ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus]
          Length = 419

 Score =  360 bits (924), Expect = 8e-97
 Identities = 198/372 (53%), Positives = 243/372 (65%), Gaps = 3/372 (0%)
 Frame = -3

Query: 1399 FASIASS--GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAM 1226
            FA+++SS     +SSVG                 LFW+GVGVGLSALF+WVA  +KKYAM
Sbjct: 70   FATVSSSTTSNDSSSVGVPSVSIPPPSSYVGSP-LFWVGVGVGLSALFTWVASYLKKYAM 128

Query: 1225 EQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQT-TTSSP 1049
            +QAFKT   QMN+QN+P  N   +                         P+F T TT SP
Sbjct: 129  QQAFKTMMSQMNSQNSPMSNPTLSSGSPFPIP-----------------PTFATGTTISP 171

Query: 1048 FKSGAASQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYK 869
              S  A     DV  TKVE+ P T+VK + E  +A KK+AF DVSPEET QK+ F+ED  
Sbjct: 172  SVSEPAVS--IDVTATKVEEEPVTNVKSRTENMEA-KKFAFVDVSPEETDQKSPFKEDAT 228

Query: 868  ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPTVQ 689
            ++ V   +                                   ++SV+A+EKMMEDPTVQ
Sbjct: 229  DADVSKSAQPTQELPQNGAASKQAYNGSDGSQFSRKPGS----VLSVEAVEKMMEDPTVQ 284

Query: 688  QMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLKNFDLSSPE 509
            +M++P+LPEEMRNP TFKWM+QNP+YRQQL++MLNNM G+P+WD R+MD+LKNFDLSSPE
Sbjct: 285  KMIYPHLPEEMRNPETFKWMMQNPLYRQQLEEMLNNMSGSPQWDGRLMDSLKNFDLSSPE 344

Query: 508  IKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQNDKEVMDV 329
            +KQQFDQIGLTP+EV+ KIM NP++AMAFQNPRVQAAI+DCSQNPLSI KYQNDKEVMDV
Sbjct: 345  VKQQFDQIGLTPEEVISKIMANPEIAMAFQNPRVQAAIMDCSQNPLSITKYQNDKEVMDV 404

Query: 328  FNKITELFPGPS 293
            FNKI+ELFPG S
Sbjct: 405  FNKISELFPGVS 416


>ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa]
            gi|222848840|gb|EEE86387.1| hypothetical protein
            POPTR_0004s08560g [Populus trichocarpa]
          Length = 429

 Score =  352 bits (902), Expect = 3e-94
 Identities = 199/378 (52%), Positives = 240/378 (63%), Gaps = 11/378 (2%)
 Frame = -3

Query: 1399 FASIASS-GQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+SS G+QT+SVG                 LFW+GVGVGLSA+FSWVA RVK YAM+
Sbjct: 74   FASISSSSGKQTASVGVNPQPVSPPPSQIGSP-LFWVGVGVGLSAIFSWVATRVKNYAMQ 132

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QAFK+ T+QMNTQNN F N AF+                         P     ++SP  
Sbjct: 133  QAFKSLTEQMNTQNNQF-NPAFSARPPFPF----------------SPPPASHPSTSPSP 175

Query: 1042 SGAASQSVTDVPVTKVEDPPSTSVKDKVEPE--------DAPKKYAFKDVSPEETVQKNA 887
            + +      D+P TKVE  P+T V  + E +        +  KKYAF D+SPEET     
Sbjct: 176  AASQPAITVDIPATKVEAAPTTDVGKEKETDFLEERKIKEETKKYAFVDISPEETSLNTP 235

Query: 886  FE--EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEK 713
            F   ED  E+S   D                                   P +SV+ALEK
Sbjct: 236  FSSVEDDNETSSSKD-------VEFAKKVFQNGAAFKQGPGAAEGSQSTRPFLSVEALEK 288

Query: 712  MMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLK 533
            MMEDPT+Q+MV+PYLPEEMRNPTTFKWMLQNP YRQQL+DMLNNMGG+ +WD++MMD+LK
Sbjct: 289  MMEDPTMQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSGKWDSQMMDSLK 348

Query: 532  NFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQ 353
            +FDL+S E+KQQFDQIGLTP+EV+ KIM NPDVAMAFQNPRVQ AI++CSQNP++I KYQ
Sbjct: 349  DFDLNSAEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQQAIMECSQNPINITKYQ 408

Query: 352  NDKEVMDVFNKITELFPG 299
            NDKEVMDVFNKI+ELFPG
Sbjct: 409  NDKEVMDVFNKISELFPG 426


>ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa]
            gi|550319201|gb|ERP50369.1| hypothetical protein
            POPTR_0017s02900g [Populus trichocarpa]
          Length = 435

 Score =  351 bits (900), Expect = 5e-94
 Identities = 195/375 (52%), Positives = 237/375 (63%), Gaps = 11/375 (2%)
 Frame = -3

Query: 1399 FASIAS-SGQQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAME 1223
            FASI+S SGQQT+SVG                 LFW+GVGV LSA+FSWVA R+K YAM+
Sbjct: 79   FASISSLSGQQTASVGVNPQSVSPPPSQIGSP-LFWVGVGVALSAIFSWVATRLKNYAMQ 137

Query: 1222 QAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPFK 1043
            QAFK+ T+QMN QNN F N AF+                         P      +SPF+
Sbjct: 138  QAFKSLTEQMNAQNNQF-NPAFSARSPFPF----------------SPPPASQPATSPFQ 180

Query: 1042 SGAASQSVTDVPVTKVEDPPSTSVKDKVEPE--------DAPKKYAFKDVSPEETVQKNA 887
            + +      D+P TKVE  P T  + + E +        + P+K+AF DVSPEET     
Sbjct: 181  TASQPAVTVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKFAFVDVSPEETSLNTP 240

Query: 886  FE--EDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEK 713
            F   ED  ++S   D    +                                +SV+ALEK
Sbjct: 241  FSSVEDVIDTSSSKDV--QFAKEASQNGATFKQGPSASEPSEGSQSSQKAGSLSVEALEK 298

Query: 712  MMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWDNRMMDTLK 533
            MM+DPTVQ+MV+PYLPEEMRNPTTFKWMLQNP YRQQL++MLNNM G+ EWD+RM+D+LK
Sbjct: 299  MMDDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMSGSSEWDSRMVDSLK 358

Query: 532  NFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQNPLSIAKYQ 353
            NFDLSSPE+KQQFDQIGLTP+EV+ KIM NPDVA+AFQNPRVQ AI++CSQNPLSIAKYQ
Sbjct: 359  NFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRVQQAIMECSQNPLSIAKYQ 418

Query: 352  NDKEVMDVFNKITEL 308
            NDKEVMDVFNKI+E+
Sbjct: 419  NDKEVMDVFNKISEI 433


>ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum]
            gi|557101290|gb|ESQ41653.1| hypothetical protein
            EUTSA_v10013528mg [Eutrema salsugineum]
          Length = 449

 Score =  350 bits (897), Expect = 1e-93
 Identities = 198/386 (51%), Positives = 234/386 (60%), Gaps = 19/386 (4%)
 Frame = -3

Query: 1399 FASIASSG--QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAM 1226
            FASI SS   QQT+SV                 PLFWIGVGVGLSALFSWV   +KKYAM
Sbjct: 75   FASIFSSNRDQQTTSVAYPNAAVPPPSSSTIGSPLFWIGVGVGLSALFSWVTSSLKKYAM 134

Query: 1225 EQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPF 1046
            + A KT   QMNTQN+ F N  F                          P   + TSSPF
Sbjct: 135  QTAMKTMMNQMNTQNSQFNNPGFPAGSASPFPFPF--------------PPQTSPTSSPF 180

Query: 1045 KSGAASQSVT-DVPVTKVEDPPSTS----------------VKDKVEPEDAPKKYAFKDV 917
            +S + S   T DV  TKV+ PPS                  V ++ + +   K YAF+DV
Sbjct: 181  QSQSQSSGATVDVTATKVDTPPSAKPQPTPAKKTEVDKPSVVLEENKAKKEEKNYAFEDV 240

Query: 916  SPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPL 737
            SPEET +++ F    + S                                        P 
Sbjct: 241  SPEETTKESPFSNYAEVSETSAPKEARLFEDVMQNGAAPANGATASEVFQSLGAGKGGPG 300

Query: 736  MSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWD 557
            +SV+ALEKMMEDPTVQ+MV+P+LPEEMRNP TFKWML+NP YRQQLQDMLNNM G+ EWD
Sbjct: 301  LSVEALEKMMEDPTVQKMVYPHLPEEMRNPETFKWMLKNPHYRQQLQDMLNNMSGSGEWD 360

Query: 556  NRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQN 377
             RMMDTLKNFDL+SPE+KQQFDQIGLTP+EV+ KIMENPDVAMAFQNPRVQAA+++CS+N
Sbjct: 361  KRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMENPDVAMAFQNPRVQAALMECSEN 420

Query: 376  PLSIAKYQNDKEVMDVFNKITELFPG 299
            P++I KYQNDKEVMDVFNKI++LFPG
Sbjct: 421  PMNIMKYQNDKEVMDVFNKISQLFPG 446


>ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana]
            gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein
            TIC 40, chloroplastic; AltName: Full=Protein PIGMENT
            DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the
            inner envelope membrane of chloroplasts 40;
            Short=AtTIC40; Flags: Precursor
            gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6
            [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1|
            translocon Tic40-like protein [Arabidopsis thaliana]
            gi|20260222|gb|AAM13009.1| translocon Tic40-like protein
            [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1|
            At5g16620 [Arabidopsis thaliana]
            gi|332004935|gb|AED92318.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 447

 Score =  349 bits (896), Expect = 1e-93
 Identities = 198/386 (51%), Positives = 235/386 (60%), Gaps = 19/386 (4%)
 Frame = -3

Query: 1399 FASIASSG--QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAM 1226
            FASI SS   QQT+SV +               PLFWIGVGVGLSALFS+V   +KKYAM
Sbjct: 75   FASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFWIGVGVGLSALFSYVTSNLKKYAM 134

Query: 1225 EQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPF 1046
            + A KT   QMNTQN+ F N+ F                          P   +  SSPF
Sbjct: 135  QTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPF----------------PPQTSPASSPF 178

Query: 1045 KSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPE---DAP-------------KKYAFKDV 917
            +S + S   T DV  TKVE PPST  K     +   D P             K YAF+D+
Sbjct: 179  QSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEKNYAFEDI 238

Query: 916  SPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPL 737
            SPEET +++ F    + S   +                                    P 
Sbjct: 239  SPEETTKESPFSNYAEVSETNSPKETRLFEDVLQNGAGPANGATASEVFQSLGGGKGGPG 298

Query: 736  MSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWD 557
            +SV+ALEKMMEDPTVQ+MV+PYLPEEMRNP TFKWML+NP YRQQLQDMLNNM G+ EWD
Sbjct: 299  LSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNMSGSGEWD 358

Query: 556  NRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQN 377
             RM DTLKNFDL+SPE+KQQF+QIGLTP+EV+ KIMENPDVAMAFQNPRVQAA+++CS+N
Sbjct: 359  KRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAALMECSEN 418

Query: 376  PLSIAKYQNDKEVMDVFNKITELFPG 299
            P++I KYQNDKEVMDVFNKI++LFPG
Sbjct: 419  PMNIMKYQNDKEVMDVFNKISQLFPG 444


>ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp.
            lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein
            ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  346 bits (887), Expect = 2e-92
 Identities = 196/386 (50%), Positives = 232/386 (60%), Gaps = 19/386 (4%)
 Frame = -3

Query: 1399 FASIASSG--QQTSSVGAXXXXXXXXXXXXXXXPLFWIGVGVGLSALFSWVAGRVKKYAM 1226
            FASI SS   QQT+SV +               PLFWIGVGVGLSALFS V   +KKYAM
Sbjct: 75   FASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFWIGVGVGLSALFSLVTSNLKKYAM 134

Query: 1225 EQAFKTFTQQMNTQNNPFGNTAFAXXXXXXXXXXXXXXXXXXXXFKTGAPSFQTTTSSPF 1046
            + A KT   QMNTQN+ F N  F                          P   +  SSPF
Sbjct: 135  QTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPF----------------PPQTSPASSPF 178

Query: 1045 KSGAASQSVT-DVPVTKVEDPPSTSVKDKVEPE---DAP-------------KKYAFKDV 917
            +S + S   T DV  TKV+ PPST  K     +   D P             K YAF+D+
Sbjct: 179  QSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEKNYAFEDI 238

Query: 916  SPEETVQKNAFEEDYKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPL 737
            SPEET +++ F    + S   +                                      
Sbjct: 239  SPEETTKESPFSNYAEVSETSSPKETRLFEDVLQNGAGPANGATASEVFQSLGGGKGGAG 298

Query: 736  MSVDALEKMMEDPTVQQMVFPYLPEEMRNPTTFKWMLQNPVYRQQLQDMLNNMGGNPEWD 557
            +SV+ALEKMMEDPTVQ+MV+PYLPEEMRNP TFKWML+NP YRQQLQDMLNNM G+ EWD
Sbjct: 299  LSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNMSGSGEWD 358

Query: 556  NRMMDTLKNFDLSSPEIKQQFDQIGLTPQEVVGKIMENPDVAMAFQNPRVQAAILDCSQN 377
             RM DTLKNFDL+SPE+KQQF+QIGLTP+EV+ KIMENPDVAMAFQNPRVQAA+++CS+N
Sbjct: 359  KRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAALMECSEN 418

Query: 376  PLSIAKYQNDKEVMDVFNKITELFPG 299
            P++I KYQNDKEVMDVFNKI++LFPG
Sbjct: 419  PMNIMKYQNDKEVMDVFNKISQLFPG 444


Top