BLASTX nr result

ID: Rehmannia24_contig00000293 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00000293
         (2509 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   525   e-146
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   520   e-144
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   484   e-133
gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein i...   444   e-122
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   429   e-117
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   427   e-116
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      426   e-116
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   424   e-116
ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu...   423   e-115
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   421   e-115
ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu...   421   e-115
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...   417   e-114
gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus...   416   e-113
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                       415   e-113
gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein i...   407   e-110
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   400   e-108
ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092...   395   e-107
ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutr...   390   e-105
ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-lik...   386   e-104
ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab...   385   e-104

>ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum]
          Length = 443

 Score =  525 bits (1352), Expect = e-146
 Identities = 281/448 (62%), Positives = 324/448 (72%), Gaps = 19/448 (4%)
 Frame = +1

Query: 124  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 303
            MEN+G++SS K+VLG+S    NS+ SSKPF GLP+L ++  KNGR + P++ F V+S F+
Sbjct: 1    MENIGIVSSPKMVLGLS---SNSVISSKPFFGLPHLPKRPFKNGRTVRPTTCFEVVSCFQ 57

Query: 304  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 480
             P+ TK IV  K  R  FAS T+SG ++TSSVGVN                FWIGVGVG 
Sbjct: 58   GPRLTKKIVLGKSGRGSFASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVGVGF 117

Query: 481  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 660
            SALF+WVA  +KKYAM+QA KT   QMN QN+ F N A                      
Sbjct: 118  SALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVSGPA 177

Query: 661  XXH---------------VASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVS 795
                               ASQPVTVDV ATKVE+PP+++VK   E E  PKK AFVD+S
Sbjct: 178  SSSPPPPTASSSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFVDIS 237

Query: 796  PEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTS---KTSPLL 966
            P+ET QK AFEN+K+S +T +    Q    V+QNG AS+ G G++   STS   K++PLL
Sbjct: 238  PDETFQKGAFENFKDSAETAAVTVDQ----VTQNGAASQSGFGSNTSDSTSSTGKSNPLL 293

Query: 967  SVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDN 1146
            SV+ALEKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDM+NNMGG+PEWDN
Sbjct: 294  SVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDN 353

Query: 1147 RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 1326
            RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP
Sbjct: 354  RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 413

Query: 1327 LSIAKYQNDKEVMDVFNKISELFPGATG 1410
            LSIAKYQNDKEVMDVFNKISELFPG +G
Sbjct: 414  LSIAKYQNDKEVMDVFNKISELFPGVSG 441


>ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum]
          Length = 443

 Score =  520 bits (1338), Expect = e-144
 Identities = 279/448 (62%), Positives = 323/448 (72%), Gaps = 19/448 (4%)
 Frame = +1

Query: 124  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 303
            MEN+ ++SS K+VLG+S NP   + S+KP  GLP+L ++  KNGR++ P++ F V+S F+
Sbjct: 1    MENICIVSSPKMVLGLSSNP---VISNKPLFGLPHLPKRPFKNGRIVRPTTCFEVVSCFQ 57

Query: 304  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 480
            +P+ TK IV  K  R  FAS T+SG Q+TSSVGVN                FWIGVGVGL
Sbjct: 58   SPRLTKKIVLGKSGRGSFASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVGVGL 117

Query: 481  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 660
            SALF+WVA  +KKYAM+QA KT   QMN QN+ F N A                      
Sbjct: 118  SALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVSGPA 177

Query: 661  XXH---------------VASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVS 795
                               ASQPVTVDV ATKVE+PP+++VK   E    PKK AFVD+S
Sbjct: 178  SSSPPPPTASTSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFVDIS 237

Query: 796  PEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTS---KTSPLL 966
            P+ET QK AFEN+K+S +T S    Q    V+QNG AS+ G G +   STS   K++PL+
Sbjct: 238  PDETFQKGAFENFKDSTETASVTVDQ----VTQNGAASQLGFGPNTSDSTSSTGKSNPLM 293

Query: 967  SVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDN 1146
            SV+ALEKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDM+NNMGG+PEWDN
Sbjct: 294  SVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDN 353

Query: 1147 RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 1326
            RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP
Sbjct: 354  RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 413

Query: 1327 LSIAKYQNDKEVMDVFNKISELFPGATG 1410
            LSIAKYQNDKEVMDVFNKISELFPG +G
Sbjct: 414  LSIAKYQNDKEVMDVFNKISELFPGVSG 441


>ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera]
            gi|296089465|emb|CBI39284.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  484 bits (1245), Expect = e-133
 Identities = 266/443 (60%), Positives = 301/443 (67%), Gaps = 14/443 (3%)
 Frame = +1

Query: 124  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 303
            M++L L+SS K+VLG SP+    I  +     LP L RK  K    I  S S        
Sbjct: 1    MDSLTLVSSPKLVLGHSPSNPRHISCAHSSFSLPLLFRKPRK---FIAASQSGA------ 51

Query: 304  APKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLS 483
            +P+  + +V  K   +CFASI+SS Q TSSVGVN                FWIGVGVGLS
Sbjct: 52   SPRTPRHVVETKLGTECFASISSSSQGTSSVGVNPQFSPPPPSSNIGSPLFWIGVGVGLS 111

Query: 484  ALFSWVAGRVKKYAMEQAFKTFTQQMNAQNN-------------PFGNAAXXXXXXXXXX 624
            ALFSWVA  +KKYAM+QAFKT   QM++QNN             PF              
Sbjct: 112  ALFSWVASNLKKYAMQQAFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTSHSG 171

Query: 625  XXXXXXXXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEE 804
                            A   VTVDVPATKVE PP+  VK+ +E ++   KYAFVDVSPEE
Sbjct: 172  PTTSPSGPTTSPSTVAAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVSPEE 231

Query: 805  TLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE-GPSTSKTSPLLSVEAL 981
            TLQ++ FEN++ES +T S  D Q S  VSQNGT  + G G SE   ST   +P LSV+AL
Sbjct: 232  TLQESPFENFEESTETSSSKDAQFSAGVSQNGTPPRPGMGVSEDSQSTRNANPFLSVDAL 291

Query: 982  EKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDS 1161
            EKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGG  EWDNRMMD+
Sbjct: 292  EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRMMDN 351

Query: 1162 LKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 1341
            LKNFDLSSPE+KQQFDQIGLTPEEVISKIMANPDVA+AFQNPR+QAAIMDCSQNPLSIAK
Sbjct: 352  LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDCSQNPLSIAK 411

Query: 1342 YQNDKEVMDVFNKISELFPGATG 1410
            YQNDKEVMDVFNKISELFPG +G
Sbjct: 412  YQNDKEVMDVFNKISELFPGVSG 434


>gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 433

 Score =  444 bits (1142), Expect = e-122
 Identities = 263/440 (59%), Positives = 295/440 (67%), Gaps = 13/440 (2%)
 Frame = +1

Query: 130  NLGLISSHK-----IVLGIS-PN--PKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 285
            NL L+SS        +LG + PN  PKN  F + PF    NL  +  +     +  S  T
Sbjct: 5    NLALVSSSSPPLKLYLLGCNHPNYTPKNP-FKTLPFPS-SNLAPRRSRISIFAHSHSQPT 62

Query: 286  VLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWI 462
                   P+    IV  K   + FASI+SS  Q+TSSVGVN                FWI
Sbjct: 63   ------PPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWI 116

Query: 463  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 642
            GVGVGLSALF+WVA  +KKYAM+QAFKT   QMN QNN F NAA                
Sbjct: 117  GVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPG 176

Query: 643  XXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESG---PKKYAFVDVSPEETLQ 813
                      +   VTVDVPATKVE  P+ +   +V+ E+    PKKYAFVDVSPEET+Q
Sbjct: 177  PVTSPSPS--SQTAVTVDVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQ 234

Query: 814  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGP-STSKTSPLLSVEALEKM 990
            K+AFE   ++    S N+ Q  + VS NG ASKQ  GA  G  ST    P LSV+ALEKM
Sbjct: 235  KSAFE---DAAGISSSNNTQFPKDVSDNGAASKQDAGAFGGSQSTGSADPALSVDALEKM 291

Query: 991  MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1170
            MEDPTVQKMV+PYLPEEMRNP TFKWMLQNPQYRQQLQDMLNNMGGS EWDNRMMDSLKN
Sbjct: 292  MEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKN 351

Query: 1171 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1350
            FDL+SP++KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQN
Sbjct: 352  FDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 411

Query: 1351 DKEVMDVFNKISELFPGATG 1410
            DKEVMDVFNKISELFPG TG
Sbjct: 412  DKEVMDVFNKISELFPGVTG 431


>ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum]
          Length = 433

 Score =  429 bits (1102), Expect = e-117
 Identities = 246/438 (56%), Positives = 295/438 (67%), Gaps = 11/438 (2%)
 Frame = +1

Query: 130  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 309
            NL L+SS K +L    + +N     KPF          GK     N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSRNVFTRRKPFT--------FGKFFVSANSSSSHVTRAAPKSH 56

Query: 310  KATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 486
            +  K++  +  V + FASI+SS  QET+SVGV+                FWIGVGVG SA
Sbjct: 57   QNPKSVQGKLIVHN-FASISSSNSQETTSVGVSPQLSPPPSSTVGSPL-FWIGVGVGFSA 114

Query: 487  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 666
            LFS VA R+KKYAM+QAFKT   QMN QNNPF +AA                        
Sbjct: 115  LFSIVASRLKKYAMQQAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGPAAPASSA 174

Query: 667  HVASQ----------PVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQK 816
               SQ           VTVD+PATKVE  PS + K++VE ++ PKK  FVDVSPEE++QK
Sbjct: 175  GTQSQSTSARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQK 234

Query: 817  NAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMME 996
            + FE++K+  ++ S  + ++     QNG  S QG G S G S S    +LSVEALEKMME
Sbjct: 235  SPFESFKDVDESSSFKEARAPAEAFQNGAPSNQGFGNSPG-SQSGGKSVLSVEALEKMME 293

Query: 997  DPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFD 1176
            DPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL++MLNNMGGS EWD+RMMD+LKNFD
Sbjct: 294  DPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFD 353

Query: 1177 LSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDK 1356
            L+SP++KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCS NPL+IAKYQNDK
Sbjct: 354  LNSPDVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSSNPLNIAKYQNDK 413

Query: 1357 EVMDVFNKISELFPGATG 1410
            EVMDVFNKISELFPG +G
Sbjct: 414  EVMDVFNKISELFPGVSG 431


>sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Translocon
            at the inner envelope membrane of chloroplasts 40;
            Short=PsTIC40; Flags: Precursor
            gi|26000725|gb|AAN75219.1| chloroplast protein translocon
            component Tic40 precursor [Pisum sativum]
          Length = 436

 Score =  427 bits (1097), Expect = e-116
 Identities = 246/441 (55%), Positives = 294/441 (66%), Gaps = 14/441 (3%)
 Frame = +1

Query: 130  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 309
            NL L+SS K +L    + KN     K F          G      N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSKNVFSGRKSFT--------FGTFRVSANSSSSHVTRAASKSH 56

Query: 310  KATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 486
            +  K++  + +  D FASI+SS GQET+SVGV+                FWIG+GVG SA
Sbjct: 57   QNLKSVQGKVNAHD-FASISSSNGQETTSVGVSPQLSPPPPSTVGSPL-FWIGIGVGFSA 114

Query: 487  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 666
            LFS VA RVKKYAM+QAFK+   QMN QNNPF + A                        
Sbjct: 115  LFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFA 174

Query: 667  HVASQP----------VTVDVPATKVE---DPPSISVKEKVEPESGPKKYAFVDVSPEET 807
               SQ           VTVD+PATKVE     P I+VKE+VE ++ PKK AFVDVSPEET
Sbjct: 175  GNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEET 234

Query: 808  LQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEK 987
            +QKNAFE +K+  ++ S  + ++    SQNGT  KQG G S   S S+    LSV+ALEK
Sbjct: 235  VQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPS-SPSERKSALSVDALEK 293

Query: 988  MMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLK 1167
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWM+QNP+YRQQL+ MLNNMGG  EWD+RMMD+LK
Sbjct: 294  MMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLK 353

Query: 1168 NFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQ 1347
            NFDL+SP++KQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQ
Sbjct: 354  NFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQ 413

Query: 1348 NDKEVMDVFNKISELFPGATG 1410
            NDKEVMDVFNKISELFPG +G
Sbjct: 414  NDKEVMDVFNKISELFPGVSG 434


>emb|CAB50925.1| translocon Tic40 [Pisum sativum]
          Length = 436

 Score =  426 bits (1096), Expect = e-116
 Identities = 246/441 (55%), Positives = 294/441 (66%), Gaps = 14/441 (3%)
 Frame = +1

Query: 130  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 309
            NL L+SS K +L    + KN     K F          G      N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSKNVFSRRKSFT--------FGTFRVSANSSSSHVTRAASKSH 56

Query: 310  KATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 486
            +  K++  + +    FASI+SS GQET+SVGV+                FWIG+GVG SA
Sbjct: 57   QNLKSVQGKVNAHS-FASISSSNGQETTSVGVSPQLSPPPPSTVGSPL-FWIGIGVGFSA 114

Query: 487  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 666
            LFS VA RVKKYAM+QAFK+   QMN QNNPF + A                        
Sbjct: 115  LFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFA 174

Query: 667  HVASQP----------VTVDVPATKVE---DPPSISVKEKVEPESGPKKYAFVDVSPEET 807
               SQ           VTVD+PATKVE     P I+VKE+VE ++ PKK AFVDVSPEET
Sbjct: 175  GNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEET 234

Query: 808  LQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEK 987
            +QKNAFE +K+  ++ S  + ++    SQNGT  KQG G S G S S+    LSV+ALEK
Sbjct: 235  VQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPG-SPSERKSALSVDALEK 293

Query: 988  MMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLK 1167
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWM+QNP+YRQQL+ MLNNMGG  EWD+RMMD+LK
Sbjct: 294  MMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLK 353

Query: 1168 NFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQ 1347
            NFDL+SP++KQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQ
Sbjct: 354  NFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQ 413

Query: 1348 NDKEVMDVFNKISELFPGATG 1410
            NDKEVMDVFNKISELFPG +G
Sbjct: 414  NDKEVMDVFNKISELFPGVSG 434


>ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 432

 Score =  424 bits (1090), Expect = e-116
 Identities = 242/431 (56%), Positives = 290/431 (67%), Gaps = 15/431 (3%)
 Frame = +1

Query: 154  KIVLGISPNPKNSIFSSKPFVGLPN---LIRKTGKNGR-LINPSSSFTVLSLFEAPKATK 321
            K+ L +  +PK  +    P +   +     RK    GR LI P      +S   +     
Sbjct: 3    KLNLALVSSPKPLMLGHVPAIDATSRDVFRRKHFSFGRVLIAPHRCRFRVSALSSSHRNP 62

Query: 322  TIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSALFSW 498
              V EK +   FASI+SS  QE +S GVN                FWIGVGVGLSALFS 
Sbjct: 63   KSVQEKLIVKHFASISSSNTQEATSTGVNPQLSPSSTIGSPL---FWIGVGVGLSALFSV 119

Query: 499  VAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXXHVAS 678
            VA R+KKYAM+QAFKT   QMN+QNN FGNAA                           S
Sbjct: 120  VASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSATTQS 179

Query: 679  QP----------VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAFE 828
            +           +TVD+PA KVE  P+ +VK++VE ++ PKK AFVDVSPEET+Q++ FE
Sbjct: 180  RAPSASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQESPFE 239

Query: 829  NYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMMEDPTV 1008
            ++K+  ++ S  + +    VSQNG  S QG G   G  ++K S +LSV+ALEKMMEDPTV
Sbjct: 240  SFKDD-ESSSVKEARVPDEVSQNGAPSNQGFGDFPGSQSTKKS-VLSVDALEKMMEDPTV 297

Query: 1009 QKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSP 1188
            QKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNMGGS EWD+RMMD+LKNFDL+SP
Sbjct: 298  QKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSP 357

Query: 1189 EIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMD 1368
            E+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVMD
Sbjct: 358  EVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMD 417

Query: 1369 VFNKISELFPG 1401
            VFNKISELFPG
Sbjct: 418  VFNKISELFPG 428


>ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa]
            gi|550319201|gb|ERP50369.1| hypothetical protein
            POPTR_0017s02900g [Populus trichocarpa]
          Length = 435

 Score =  423 bits (1088), Expect = e-115
 Identities = 242/434 (55%), Positives = 290/434 (66%), Gaps = 13/434 (2%)
 Frame = +1

Query: 130  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 309
            +L L+S +   L     PK SI +++P +  P+   KT  +   I    S + LS    P
Sbjct: 13   SLKLVSGYPTSLKNPTTPKFSISTTRPSLPFPHRTSKTVTHTSRI----SISALSQSHGP 68

Query: 310  KATKTIVPEKDVRDCFASITS-SGQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 486
            + T      K+  + FASI+S SGQ+T+SVGVN                FW+GVGV LSA
Sbjct: 69   RRTS-----KNGSEYFASISSLSGQQTASVGVNPQSVSPPPSQIGSPL-FWVGVGVALSA 122

Query: 487  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 666
            +FSWVA R+K YAM+QAFK+ T+QMNAQNN F  A                         
Sbjct: 123  IFSWVATRLKNYAMQQAFKSLTEQMNAQNNQFNPA---FSARSPFPFSPPPASQPATSPF 179

Query: 667  HVASQP-VTVDVPATKVEDPPSISVKEKVEPES--------GPKKYAFVDVSPEETLQKN 819
              ASQP VTVD+PATKVE  P    +++ E ++         P+K+AFVDVSPEET    
Sbjct: 180  QTASQPAVTVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKFAFVDVSPEETSLNT 239

Query: 820  AFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPSTSKTSPLLSVEALEKM 990
             F + ++ I T S  D Q ++  SQNG   KQG  ASE   G  +S+ +  LSVEALEKM
Sbjct: 240  PFSSVEDVIDTSSSKDVQFAKEASQNGATFKQGPSASEPSEGSQSSQKAGSLSVEALEKM 299

Query: 991  MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1170
            M+DPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNM GS EWD+RM+DSLKN
Sbjct: 300  MDDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMSGSSEWDSRMVDSLKN 359

Query: 1171 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1350
            FDLSSPE+KQQFDQIGLTPEEVISKIMANPDVA+AFQNPRVQ AIM+CSQNPLSIAKYQN
Sbjct: 360  FDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRVQQAIMECSQNPLSIAKYQN 419

Query: 1351 DKEVMDVFNKISEL 1392
            DKEVMDVFNKISE+
Sbjct: 420  DKEVMDVFNKISEI 433


>ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 429

 Score =  421 bits (1083), Expect = e-115
 Identities = 244/436 (55%), Positives = 290/436 (66%), Gaps = 12/436 (2%)
 Frame = +1

Query: 130  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGR-LINPSSSFTVLSLFEA 306
            NL L+SS K ++ +   P   +F  K F             GR LI P      +S   +
Sbjct: 5    NLALVSSPKPLM-LGHVPARDVFRRKHF-----------SFGRVLIAPHRCRFRVSALSS 52

Query: 307  PKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLS 483
                   V EK +   FASI+SS  QET+S+GV                 FWIGVGVGLS
Sbjct: 53   SHHNPKSVQEKLIVKHFASISSSNTQETTSIGVKPQLSPSPSSTIGSPL-FWIGVGVGLS 111

Query: 484  ALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXX 663
            ALFS VA R+KKYAM+QAFKT   QMN+QNN FGNAA                       
Sbjct: 112  ALFSVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASS 171

Query: 664  XHVASQP----------VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQ 813
                S+           +TVD+PA KVE  P+ +VK++VE ++ PKK AFVDVSPEET++
Sbjct: 172  ATTQSRAPSASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVR 231

Query: 814  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMM 993
            ++ FE++K+  ++ S  +      VSQNG  S  G G   G  ++K S L SV+ALEKMM
Sbjct: 232  ESPFESFKDD-ESSSVKEAWVPDEVSQNGAPSNLGFGDFPGSQSTKKSAL-SVDALEKMM 289

Query: 994  EDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNF 1173
            EDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNMGGS EWDNRMMD+LKNF
Sbjct: 290  EDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRMMDTLKNF 349

Query: 1174 DLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQND 1353
            DL+SPE+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQND
Sbjct: 350  DLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQND 409

Query: 1354 KEVMDVFNKISELFPG 1401
            KEVMDVFNKISELFPG
Sbjct: 410  KEVMDVFNKISELFPG 425


>ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa]
            gi|222848840|gb|EEE86387.1| hypothetical protein
            POPTR_0004s08560g [Populus trichocarpa]
          Length = 429

 Score =  421 bits (1083), Expect = e-115
 Identities = 246/449 (54%), Positives = 299/449 (66%), Gaps = 20/449 (4%)
 Frame = +1

Query: 124  MEN--LGLISSH--KIVLGISPN------PKNSIFSSKPFVGLPNLIRKTGKNGRLINPS 273
            MEN  L L+SS   K+V+G   +      PK SI +++P +     I KT  +      +
Sbjct: 1    MENPRLALLSSSSPKLVMGYPTSLKNPTTPKFSISTTRPSLPFSLRISKTAPH------A 54

Query: 274  SSFTVLSLFEAPKATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXX 450
            S F++ +L  +     +        + FASI+SS G++T+SVGVN               
Sbjct: 55   SIFSISALANSHGKLGS--------EYFASISSSSGKQTASVGVNPQPVSPPPSQIGSPL 106

Query: 451  XFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXX 630
             FW+GVGVGLSA+FSWVA RVK YAM+QAFK+ T+QMN QNN F  A             
Sbjct: 107  -FWVGVGVGLSAIFSWVATRVKNYAMQQAFKSLTEQMNTQNNQFNPAFSARPPFPFSPPP 165

Query: 631  XXXXXXXXXXXXHVASQP-VTVDVPATKVEDPPSISVKEKVEPE--------SGPKKYAF 783
                          ASQP +TVD+PATKVE  P+  V ++ E +           KKYAF
Sbjct: 166  ASHPSTSPSP---AASQPAITVDIPATKVEAAPTTDVGKEKETDFLEERKIKEETKKYAF 222

Query: 784  VDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPL 963
            VD+SPEET     F + ++  +T S  D + ++ V QNG A KQG GA+EG  +  T P 
Sbjct: 223  VDISPEETSLNTPFSSVEDDNETSSSKDVEFAKKVFQNGAAFKQGPGAAEG--SQSTRPF 280

Query: 964  LSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWD 1143
            LSVEALEKMMEDPT+QKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL+DMLNNMGGS +WD
Sbjct: 281  LSVEALEKMMEDPTMQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSGKWD 340

Query: 1144 NRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQN 1323
            ++MMDSLK+FDL+S E+KQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQ AIM+CSQN
Sbjct: 341  SQMMDSLKDFDLNSAEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQQAIMECSQN 400

Query: 1324 PLSIAKYQNDKEVMDVFNKISELFPGATG 1410
            P++I KYQNDKEVMDVFNKISELFPG TG
Sbjct: 401  PINITKYQNDKEVMDVFNKISELFPGMTG 429


>ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis]
            gi|223528427|gb|EEF30461.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 465

 Score =  417 bits (1073), Expect = e-114
 Identities = 250/459 (54%), Positives = 294/459 (64%), Gaps = 35/459 (7%)
 Frame = +1

Query: 130  NLGLISSH----KIVLGIS-PNP-KN-SIFSSKPFVGLPNLIRKTG---KNGRLINPSSS 279
            N+GL+SS     K+V+G   PN  KN ++ ++K F       R      +N +++  SS 
Sbjct: 5    NMGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSR 64

Query: 280  FTVLSLFEAPKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXX-F 456
            F++ +L  +  + +     +   + FASI SS Q+TSSVGVN                 F
Sbjct: 65   FSISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLF 123

Query: 457  WIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXX 636
            WIGVGVGLSA+FS VA RVK YAM+QAFK+   QMN QN+ F N A              
Sbjct: 124  WIGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPA 183

Query: 637  XXXXXXXXXX----------------------HVASQP-VTVDVPATKVEDPPSISVKEK 747
                                             VASQP VTVDV ATKVE       K++
Sbjct: 184  SVPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDE 243

Query: 748  VEPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGA 927
             E    PKKYAFVDVSPEET  K+ F++ ++ ++T +  D Q +  V QNG AS QG   
Sbjct: 244  AEITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAAD 303

Query: 928  SEGP-STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQ 1104
              G  ST K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL+
Sbjct: 304  FTGSQSTRKAGSGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLE 363

Query: 1105 DMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQN 1284
            +MLNNM G+ EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP++AMAFQN
Sbjct: 364  EMLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQN 423

Query: 1285 PRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 1401
            PRVQ AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG
Sbjct: 424  PRVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 462


>gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 430

 Score =  416 bits (1070), Expect = e-113
 Identities = 241/432 (55%), Positives = 289/432 (66%), Gaps = 8/432 (1%)
 Frame = +1

Query: 130  NLGLISSHK-IVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGR-LINPSSSFTVLSLFE 303
            NL L+SS K ++LG  P        ++       L RK    GR LI P      +S   
Sbjct: 5    NLALVSSSKPLMLGHVP--------ARDATDRDVLRRKPFSLGRVLIAPHRFRYRVSALS 56

Query: 304  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 480
            +   +   V +K +   FASI+SS  QET+S+GVN                FWIGVGVGL
Sbjct: 57   SSHHSPKSVQDKLIVKHFASISSSNTQETTSIGVNPQLSPPPSSTIGSPL-FWIGVGVGL 115

Query: 481  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 660
            SALFS VA R+KKYAM+QAFKT   QMN+ NN FGNAA                      
Sbjct: 116  SALFSMVASRLKKYAMQQAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATAQ 175

Query: 661  XXHVASQP-----VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAF 825
                ++       VTVD+PATKVE   +  +K++VE ++ PKK AFVDVSPEET+QK+ F
Sbjct: 176  YGAPSTSSGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPF 235

Query: 826  ENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMMEDPT 1005
            E+ K++  +    + +    VSQNG    QG G   G  ++K S L SV+ALEKMMEDPT
Sbjct: 236  ESVKDNESSSVKEEARVPDEVSQNGAPFNQGFGGFPGSQSTKKSAL-SVDALEKMMEDPT 294

Query: 1006 VQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSS 1185
            VQKMV+P+LPEEMRNP TFKWMLQNPQYRQQL+ ML+NMGGS EWDNRMMD+LKNFDL+S
Sbjct: 295  VQKMVYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTLKNFDLNS 354

Query: 1186 PEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVM 1365
            PE+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVM
Sbjct: 355  PEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVM 414

Query: 1366 DVFNKISELFPG 1401
            +VFNKISELFPG
Sbjct: 415  NVFNKISELFPG 426


>gb|ABF19057.1| plastid Tic40 [Ricinus communis]
          Length = 460

 Score =  415 bits (1067), Expect = e-113
 Identities = 249/458 (54%), Positives = 293/458 (63%), Gaps = 35/458 (7%)
 Frame = +1

Query: 133  LGLISSH----KIVLGIS-PNP-KN-SIFSSKPFVGLPNLIRKTG---KNGRLINPSSSF 282
            +GL+SS     K+V+G   PN  KN ++ ++K F       R      +N +++  SS F
Sbjct: 1    MGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSRF 60

Query: 283  TVLSLFEAPKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXX-FW 459
            ++ +L  +  + +     +   + FASI SS Q+TSSVGVN                 FW
Sbjct: 61   SISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLFW 119

Query: 460  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 639
            IGVGVGLSA+FS VA RVK YAM+QAFK+   QMN QN+ F N A               
Sbjct: 120  IGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPAS 179

Query: 640  XXXXXXXXX----------------------HVASQP-VTVDVPATKVEDPPSISVKEKV 750
                                            VASQP VTVDV ATKVE       K++ 
Sbjct: 180  VPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDEA 239

Query: 751  EPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGAS 930
            E    PKKYAFVDVSPEET  K+ F++ ++ ++T +  D Q +  V QNG AS QG    
Sbjct: 240  EITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAADF 299

Query: 931  EGP-STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQD 1107
             G  ST K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL++
Sbjct: 300  TGSQSTRKAGSGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEE 359

Query: 1108 MLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNP 1287
            MLNNM G+ EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP++AMAFQNP
Sbjct: 360  MLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNP 419

Query: 1288 RVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 1401
            RVQ AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG
Sbjct: 420  RVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 457


>gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial
            [Theobroma cacao]
          Length = 412

 Score =  407 bits (1045), Expect = e-110
 Identities = 247/431 (57%), Positives = 277/431 (64%), Gaps = 13/431 (3%)
 Frame = +1

Query: 130  NLGLISSHK-----IVLGIS-PN--PKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 285
            NL L+SS        +LG + PN  PKN  F + PF    NL  +  +     +  S  T
Sbjct: 5    NLALVSSSSPPLKLYLLGCNHPNYTPKNP-FKTLPFPS-SNLAPRRSRISIFAHSHSQPT 62

Query: 286  VLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWI 462
                   P+    IV  K   + FASI+SS  Q+TSSVGVN                FWI
Sbjct: 63   ------PPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWI 116

Query: 463  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 642
            GVGVGLSALF+WVA  +KKYAM+QAFKT   QMN QNN F NAA                
Sbjct: 117  GVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPG 176

Query: 643  XXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESG---PKKYAFVDVSPEETLQ 813
                      +   VTVDVPATKVE  P+ +   +V+ E+    PKKYAFVDVSPEET+Q
Sbjct: 177  PVTSPSPS--SQTAVTVDVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQ 234

Query: 814  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGP-STSKTSPLLSVEALEKM 990
            K+AFE+              +    S N    K   GA  G  ST    P LSV+ALEKM
Sbjct: 235  KSAFED-------------AAGISSSNNTQFPKDDAGAFGGSQSTGSADPALSVDALEKM 281

Query: 991  MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1170
            MEDPTVQKMV+PYLPEEMRNP TFKWMLQNPQYRQQLQDMLNNMGGS EWDNRMMDSLKN
Sbjct: 282  MEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKN 341

Query: 1171 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1350
            FDL+SP++KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQN
Sbjct: 342  FDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 401

Query: 1351 DKEVMDVFNKI 1383
            DKEVMDVFNKI
Sbjct: 402  DKEVMDVFNKI 412


>ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus]
          Length = 419

 Score =  400 bits (1028), Expect = e-108
 Identities = 214/359 (59%), Positives = 255/359 (71%), Gaps = 3/359 (0%)
 Frame = +1

Query: 343  VRDCFASITSS--GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSALFSWVAGRVK 516
            V + FA+++SS    ++SSVGV                 FW+GVGVGLSALF+WVA  +K
Sbjct: 66   VAERFATVSSSTTSNDSSSVGV-PSVSIPPPSSYVGSPLFWVGVGVGLSALFTWVASYLK 124

Query: 517  KYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXXHVASQPVTVD 696
            KYAM+QAFKT   QMN+QN+P  N                           V+   V++D
Sbjct: 125  KYAMQQAFKTMMSQMNSQNSPMSNPTLSSGSPFPIPPTFATGTTISPS---VSEPAVSID 181

Query: 697  VPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQS 876
            V ATKVE+ P  +VK + E     KK+AFVDVSPEET QK+ F+  +++   D     Q 
Sbjct: 182  VTATKVEEEPVTNVKSRTENMEA-KKFAFVDVSPEETDQKSPFK--EDATDADVSKSAQP 238

Query: 877  SQPVSQNGTASKQGTGASEGPSTS-KTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNP 1053
            +Q + QNG ASKQ    S+G   S K   +LSVEA+EKMMEDPTVQKM++P+LPEEMRNP
Sbjct: 239  TQELPQNGAASKQAYNGSDGSQFSRKPGSVLSVEAVEKMMEDPTVQKMIYPHLPEEMRNP 298

Query: 1054 TTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEE 1233
             TFKWM+QNP YRQQL++MLNNM GSP+WD R+MDSLKNFDLSSPE+KQQFDQIGLTPEE
Sbjct: 299  ETFKWMMQNPLYRQQLEEMLNNMSGSPQWDGRLMDSLKNFDLSSPEVKQQFDQIGLTPEE 358

Query: 1234 VISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1410
            VISKIMANP++AMAFQNPRVQAAIMDCSQNPLSI KYQNDKEVMDVFNKISELFPG +G
Sbjct: 359  VISKIMANPEIAMAFQNPRVQAAIMDCSQNPLSITKYQNDKEVMDVFNKISELFPGVSG 417


>ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana]
            gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein
            TIC 40, chloroplastic; AltName: Full=Protein PIGMENT
            DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the
            inner envelope membrane of chloroplasts 40;
            Short=AtTIC40; Flags: Precursor
            gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6
            [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1|
            translocon Tic40-like protein [Arabidopsis thaliana]
            gi|20260222|gb|AAM13009.1| translocon Tic40-like protein
            [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1|
            At5g16620 [Arabidopsis thaliana]
            gi|332004935|gb|AED92318.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 447

 Score =  395 bits (1015), Expect = e-107
 Identities = 234/456 (51%), Positives = 276/456 (60%), Gaps = 27/456 (5%)
 Frame = +1

Query: 124  MENLGLIS----SHKIVLG--ISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 285
            MENL L+S    S K+++G   + + KN    S+     PN++ +  K       S+S  
Sbjct: 1    MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRR---TPNIVLRCSKI------SASAQ 51

Query: 286  VLSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFW 459
              S    P+ T  IV  K     FASI SS   Q+T+SV                   FW
Sbjct: 52   SQSPSSRPENTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 460  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 639
            IGVGVGLSALFS+V   +KKYAM+ A KT   QMN QN+ F N+                
Sbjct: 112  IGVGVGLSALFSYVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPFPPQT 171

Query: 640  XXXXXXXXXHVASQPVTVDVPATKVEDPPSISVK----------------EKVEPESGPK 771
                        S   TVDV ATKVE PPS   K                E  + +   K
Sbjct: 172  SPASSPFQSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 772  KYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPS 942
             YAF D+SPEET +++ F NY E  +T+SP + +  + V QNG     G  ASE      
Sbjct: 232  NYAFEDISPEETTKESPFSNYAEVSETNSPKETRLFEDVLQNGAGPANGATASEVFQSLG 291

Query: 943  TSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNM 1122
              K  P LSVEALEKMMEDPTVQKMV+PYLPEEMRNP TFKWML+NPQYRQQLQDMLNNM
Sbjct: 292  GGKGGPGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNM 351

Query: 1123 GGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAA 1302
             GS EWD RM D+LKNFDL+SPE+KQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA
Sbjct: 352  SGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAA 411

Query: 1303 IMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1410
            +M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 412  LMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447


>ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum]
            gi|557101290|gb|ESQ41653.1| hypothetical protein
            EUTSA_v10013528mg [Eutrema salsugineum]
          Length = 449

 Score =  390 bits (1003), Expect = e-105
 Identities = 238/457 (52%), Positives = 274/457 (59%), Gaps = 28/457 (6%)
 Frame = +1

Query: 124  MENLGLIS----SHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNG-RLINPSSSFTV 288
            MENL L+S    S K+++G      N   S K  VG     R+T K   R    S+S   
Sbjct: 1    MENLTLVSCSASSPKLLIGC-----NFTSSLKNPVGFS---RRTPKVVFRCSKISASAKS 52

Query: 289  LSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFWI 462
             S    P+    IV  K     FASI SS   Q+T+SV                   FWI
Sbjct: 53   QSHSSRPENAGEIVVVKHRSRDFASIFSSNRDQQTTSVAYPNAAVPPPSSSTIGSPLFWI 112

Query: 463  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 642
            GVGVGLSALFSWV   +KKYAM+ A KT   QMN QN+ F N                  
Sbjct: 113  GVGVGLSALFSWVTSSLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPAGSASPFPFPFPPQ 172

Query: 643  XXXXXXXXHVASQP--VTVDVPATKVEDPPSIS----------------VKEKVEPESGP 768
                       SQ    TVDV ATKV+ PPS                  V E+ + +   
Sbjct: 173  TSPTSSPFQSQSQSSGATVDVTATKVDTPPSAKPQPTPAKKTEVDKPSVVLEENKAKKEE 232

Query: 769  KKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GP 939
            K YAF DVSPEET +++ F NY E  +T +P + +  + V QNG A   G  ASE     
Sbjct: 233  KNYAFEDVSPEETTKESPFSNYAEVSETSAPKEARLFEDVMQNGAAPANGATASEVFQSL 292

Query: 940  STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNN 1119
               K  P LSVEALEKMMEDPTVQKMV+P+LPEEMRNP TFKWML+NP YRQQLQDMLNN
Sbjct: 293  GAGKGGPGLSVEALEKMMEDPTVQKMVYPHLPEEMRNPETFKWMLKNPHYRQQLQDMLNN 352

Query: 1120 MGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQA 1299
            M GS EWD RMMD+LKNFDL+SPE+KQQFDQIGLTPEEVISKIM NPDVAMAFQNPRVQA
Sbjct: 353  MSGSGEWDKRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMENPDVAMAFQNPRVQA 412

Query: 1300 AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1410
            A+M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 413  ALMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 449


>ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-like [Fragaria vesca subsp.
            vesca]
          Length = 448

 Score =  386 bits (992), Expect = e-104
 Identities = 224/415 (53%), Positives = 261/415 (62%), Gaps = 31/415 (7%)
 Frame = +1

Query: 259  LINPSSSFTVLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXX 435
            L +P+S  TV        A    V  K   + FASI+S+  QETSSVG+N          
Sbjct: 40   LSSPNSRLTV----RLSAAANQPVTSKLQTERFASISSTNSQETSSVGINPQFSAPPPPS 95

Query: 436  XXXXXXFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXX 615
                  FWIGVGV  SA+FSW AG+++KY ++QAFK    QMN QN+ F NAA       
Sbjct: 96   TIGSPLFWIGVGVAFSAVFSWAAGKLQKYVVQQAFKNVMGQMNTQNDQFSNAA---FSPG 152

Query: 616  XXXXXXXXXXXXXXXXXHVASQPVTVDVPATKVEDP--------PSISVKEKVEPESGPK 771
                                SQP   DV AT+V+ P        P+  VK + E +    
Sbjct: 153  SPFPFPSAPASPSASPFSAPSQPSFTDVSATEVDSPASSATPSTPAADVKSE-EQQMKEN 211

Query: 772  KY---------------------AFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPV 888
            ++                     AFVDV+PEET  K+ F +     +  S  +  S+   
Sbjct: 212  RFGNSFEIERNNVIQFSRQLSDRAFVDVNPEETELKSPFASSLNDTEPGSSKEINSNVEG 271

Query: 889  SQNGTASKQGTGASEG-PSTSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFK 1065
            SQNG A KQ   AS G  +T K + +LSVEALEKM+EDPTVQKMV+PYLPEEMRNPTTFK
Sbjct: 272  SQNGAAFKQAKDASMGSQTTGKENSVLSVEALEKMLEDPTVQKMVYPYLPEEMRNPTTFK 331

Query: 1066 WMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISK 1245
            WMLQNPQYRQQL+DML NM GS EWDNRMMDSLKNFDLSSPE+K+QFDQIGLTPE+VISK
Sbjct: 332  WMLQNPQYRQQLEDMLRNMTGSNEWDNRMMDSLKNFDLSSPEVKEQFDQIGLTPEQVISK 391

Query: 1246 IMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1410
            IMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQNDKEVMDVFNKISELFPG +G
Sbjct: 392  IMANPDVAMAFQNPRVQAAIMDCSQNPMSITKYQNDKEVMDVFNKISELFPGVSG 446


>ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp.
            lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein
            ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  385 bits (990), Expect = e-104
 Identities = 230/456 (50%), Positives = 269/456 (58%), Gaps = 27/456 (5%)
 Frame = +1

Query: 124  MENLGLIS----SHKIVLG--ISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 285
            MENL L+S    S K+++G   + + KN    S+    +     K   + +  +PSS   
Sbjct: 1    MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRRTPRIVLRCSKISASAQSQSPSSR-- 58

Query: 286  VLSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFW 459
                   P  T  IV  K     FASI SS   Q+T+SV                   FW
Sbjct: 59   -------PDNTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 460  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 639
            IGVGVGLSALFS V   +KKYAM+ A KT   QMN QN+ F N                 
Sbjct: 112  IGVGVGLSALFSLVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPFPPQT 171

Query: 640  XXXXXXXXXHVASQPVTVDVPATKVEDPPSISVK----------------EKVEPESGPK 771
                        S   TVDV ATKV+ PPS   K                E  + +   K
Sbjct: 172  SPASSPFQSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 772  KYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPS 942
             YAF D+SPEET +++ F NY E  +T SP + +  + V QNG     G  ASE      
Sbjct: 232  NYAFEDISPEETTKESPFSNYAEVSETSSPKETRLFEDVLQNGAGPANGATASEVFQSLG 291

Query: 943  TSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNM 1122
              K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP TFKWML+NPQYRQQLQDMLNNM
Sbjct: 292  GGKGGAGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNM 351

Query: 1123 GGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAA 1302
             GS EWD RM D+LKNFDL+SPE+KQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA
Sbjct: 352  SGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAA 411

Query: 1303 IMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1410
            +M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 412  LMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447


Top