BLASTX nr result

ID: Rehmannia22_contig00005515 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00005515
         (1902 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   525   e-146
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   520   e-144
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   484   e-134
gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein i...   444   e-122
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   429   e-117
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   427   e-116
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      426   e-116
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   424   e-116
ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu...   423   e-115
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   421   e-115
ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu...   421   e-115
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...   417   e-114
gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus...   416   e-113
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                       415   e-113
gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein i...   407   e-110
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   400   e-108
ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092...   395   e-107
ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutr...   390   e-106
ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-lik...   386   e-104
ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab...   385   e-104

>ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum]
          Length = 443

 Score =  525 bits (1352), Expect = e-146
 Identities = 281/448 (62%), Positives = 324/448 (72%), Gaps = 19/448 (4%)
 Frame = +3

Query: 120  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 299
            MEN+G++SS K+VLG+S    NS+ SSKPF GLP+L ++  KNGR + P++ F V+S F+
Sbjct: 1    MENIGIVSSPKMVLGLS---SNSVISSKPFFGLPHLPKRPFKNGRTVRPTTCFEVVSCFQ 57

Query: 300  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 476
             P+ TK IV  K  R  FAS T+SG ++TSSVGVN                FWIGVGVG 
Sbjct: 58   GPRLTKKIVLGKSGRGSFASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVGVGF 117

Query: 477  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 656
            SALF+WVA  +KKYAM+QA KT   QMN QN+ F N A                      
Sbjct: 118  SALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVSGPA 177

Query: 657  XXH---------------VASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVS 791
                               ASQPVTVDV ATKVE+PP+++VK   E E  PKK AFVD+S
Sbjct: 178  SSSPPPPTASSSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFVDIS 237

Query: 792  PEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTS---KTSPLL 962
            P+ET QK AFEN+K+S +T +    Q    V+QNG AS+ G G++   STS   K++PLL
Sbjct: 238  PDETFQKGAFENFKDSAETAAVTVDQ----VTQNGAASQSGFGSNTSDSTSSTGKSNPLL 293

Query: 963  SVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDN 1142
            SV+ALEKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDM+NNMGG+PEWDN
Sbjct: 294  SVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDN 353

Query: 1143 RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 1322
            RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP
Sbjct: 354  RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 413

Query: 1323 LSIAKYQNDKEVMDVFNKISELFPGATG 1406
            LSIAKYQNDKEVMDVFNKISELFPG +G
Sbjct: 414  LSIAKYQNDKEVMDVFNKISELFPGVSG 441


>ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum]
          Length = 443

 Score =  520 bits (1338), Expect = e-144
 Identities = 279/448 (62%), Positives = 323/448 (72%), Gaps = 19/448 (4%)
 Frame = +3

Query: 120  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 299
            MEN+ ++SS K+VLG+S NP   + S+KP  GLP+L ++  KNGR++ P++ F V+S F+
Sbjct: 1    MENICIVSSPKMVLGLSSNP---VISNKPLFGLPHLPKRPFKNGRIVRPTTCFEVVSCFQ 57

Query: 300  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 476
            +P+ TK IV  K  R  FAS T+SG Q+TSSVGVN                FWIGVGVGL
Sbjct: 58   SPRLTKKIVLGKSGRGSFASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVGVGL 117

Query: 477  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 656
            SALF+WVA  +KKYAM+QA KT   QMN QN+ F N A                      
Sbjct: 118  SALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVSGPA 177

Query: 657  XXH---------------VASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVS 791
                               ASQPVTVDV ATKVE+PP+++VK   E    PKK AFVD+S
Sbjct: 178  SSSPPPPTASTSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFVDIS 237

Query: 792  PEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTS---KTSPLL 962
            P+ET QK AFEN+K+S +T S    Q    V+QNG AS+ G G +   STS   K++PL+
Sbjct: 238  PDETFQKGAFENFKDSTETASVTVDQ----VTQNGAASQLGFGPNTSDSTSSTGKSNPLM 293

Query: 963  SVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDN 1142
            SV+ALEKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDM+NNMGG+PEWDN
Sbjct: 294  SVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDN 353

Query: 1143 RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 1322
            RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP
Sbjct: 354  RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 413

Query: 1323 LSIAKYQNDKEVMDVFNKISELFPGATG 1406
            LSIAKYQNDKEVMDVFNKISELFPG +G
Sbjct: 414  LSIAKYQNDKEVMDVFNKISELFPGVSG 441


>ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera]
            gi|296089465|emb|CBI39284.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  484 bits (1245), Expect = e-134
 Identities = 266/443 (60%), Positives = 301/443 (67%), Gaps = 14/443 (3%)
 Frame = +3

Query: 120  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 299
            M++L L+SS K+VLG SP+    I  +     LP L RK  K    I  S S        
Sbjct: 1    MDSLTLVSSPKLVLGHSPSNPRHISCAHSSFSLPLLFRKPRK---FIAASQSGA------ 51

Query: 300  APKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLS 479
            +P+  + +V  K   +CFASI+SS Q TSSVGVN                FWIGVGVGLS
Sbjct: 52   SPRTPRHVVETKLGTECFASISSSSQGTSSVGVNPQFSPPPPSSNIGSPLFWIGVGVGLS 111

Query: 480  ALFSWVAGRVKKYAMEQAFKTFTQQMNAQNN-------------PFGNAAXXXXXXXXXX 620
            ALFSWVA  +KKYAM+QAFKT   QM++QNN             PF              
Sbjct: 112  ALFSWVASNLKKYAMQQAFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTSHSG 171

Query: 621  XXXXXXXXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEE 800
                            A   VTVDVPATKVE PP+  VK+ +E ++   KYAFVDVSPEE
Sbjct: 172  PTTSPSGPTTSPSTVAAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVSPEE 231

Query: 801  TLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE-GPSTSKTSPLLSVEAL 977
            TLQ++ FEN++ES +T S  D Q S  VSQNGT  + G G SE   ST   +P LSV+AL
Sbjct: 232  TLQESPFENFEESTETSSSKDAQFSAGVSQNGTPPRPGMGVSEDSQSTRNANPFLSVDAL 291

Query: 978  EKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDS 1157
            EKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGG  EWDNRMMD+
Sbjct: 292  EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRMMDN 351

Query: 1158 LKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 1337
            LKNFDLSSPE+KQQFDQIGLTPEEVISKIMANPDVA+AFQNPR+QAAIMDCSQNPLSIAK
Sbjct: 352  LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDCSQNPLSIAK 411

Query: 1338 YQNDKEVMDVFNKISELFPGATG 1406
            YQNDKEVMDVFNKISELFPG +G
Sbjct: 412  YQNDKEVMDVFNKISELFPGVSG 434


>gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 433

 Score =  444 bits (1142), Expect = e-122
 Identities = 263/440 (59%), Positives = 295/440 (67%), Gaps = 13/440 (2%)
 Frame = +3

Query: 126  NLGLISSHK-----IVLGIS-PN--PKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 281
            NL L+SS        +LG + PN  PKN  F + PF    NL  +  +     +  S  T
Sbjct: 5    NLALVSSSSPPLKLYLLGCNHPNYTPKNP-FKTLPFPS-SNLAPRRSRISIFAHSHSQPT 62

Query: 282  VLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWI 458
                   P+    IV  K   + FASI+SS  Q+TSSVGVN                FWI
Sbjct: 63   ------PPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWI 116

Query: 459  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 638
            GVGVGLSALF+WVA  +KKYAM+QAFKT   QMN QNN F NAA                
Sbjct: 117  GVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPG 176

Query: 639  XXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESG---PKKYAFVDVSPEETLQ 809
                      +   VTVDVPATKVE  P+ +   +V+ E+    PKKYAFVDVSPEET+Q
Sbjct: 177  PVTSPSPS--SQTAVTVDVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQ 234

Query: 810  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGP-STSKTSPLLSVEALEKM 986
            K+AFE   ++    S N+ Q  + VS NG ASKQ  GA  G  ST    P LSV+ALEKM
Sbjct: 235  KSAFE---DAAGISSSNNTQFPKDVSDNGAASKQDAGAFGGSQSTGSADPALSVDALEKM 291

Query: 987  MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1166
            MEDPTVQKMV+PYLPEEMRNP TFKWMLQNPQYRQQLQDMLNNMGGS EWDNRMMDSLKN
Sbjct: 292  MEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKN 351

Query: 1167 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1346
            FDL+SP++KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQN
Sbjct: 352  FDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 411

Query: 1347 DKEVMDVFNKISELFPGATG 1406
            DKEVMDVFNKISELFPG TG
Sbjct: 412  DKEVMDVFNKISELFPGVTG 431


>ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum]
          Length = 433

 Score =  429 bits (1102), Expect = e-117
 Identities = 246/438 (56%), Positives = 295/438 (67%), Gaps = 11/438 (2%)
 Frame = +3

Query: 126  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 305
            NL L+SS K +L    + +N     KPF          GK     N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSRNVFTRRKPFT--------FGKFFVSANSSSSHVTRAAPKSH 56

Query: 306  KATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 482
            +  K++  +  V + FASI+SS  QET+SVGV+                FWIGVGVG SA
Sbjct: 57   QNPKSVQGKLIVHN-FASISSSNSQETTSVGVSPQLSPPPSSTVGSPL-FWIGVGVGFSA 114

Query: 483  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 662
            LFS VA R+KKYAM+QAFKT   QMN QNNPF +AA                        
Sbjct: 115  LFSIVASRLKKYAMQQAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGPAAPASSA 174

Query: 663  HVASQ----------PVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQK 812
               SQ           VTVD+PATKVE  PS + K++VE ++ PKK  FVDVSPEE++QK
Sbjct: 175  GTQSQSTSARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQK 234

Query: 813  NAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMME 992
            + FE++K+  ++ S  + ++     QNG  S QG G S G S S    +LSVEALEKMME
Sbjct: 235  SPFESFKDVDESSSFKEARAPAEAFQNGAPSNQGFGNSPG-SQSGGKSVLSVEALEKMME 293

Query: 993  DPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFD 1172
            DPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL++MLNNMGGS EWD+RMMD+LKNFD
Sbjct: 294  DPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFD 353

Query: 1173 LSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDK 1352
            L+SP++KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCS NPL+IAKYQNDK
Sbjct: 354  LNSPDVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSSNPLNIAKYQNDK 413

Query: 1353 EVMDVFNKISELFPGATG 1406
            EVMDVFNKISELFPG +G
Sbjct: 414  EVMDVFNKISELFPGVSG 431


>sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Translocon
            at the inner envelope membrane of chloroplasts 40;
            Short=PsTIC40; Flags: Precursor
            gi|26000725|gb|AAN75219.1| chloroplast protein translocon
            component Tic40 precursor [Pisum sativum]
          Length = 436

 Score =  427 bits (1097), Expect = e-116
 Identities = 246/441 (55%), Positives = 294/441 (66%), Gaps = 14/441 (3%)
 Frame = +3

Query: 126  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 305
            NL L+SS K +L    + KN     K F          G      N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSKNVFSGRKSFT--------FGTFRVSANSSSSHVTRAASKSH 56

Query: 306  KATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 482
            +  K++  + +  D FASI+SS GQET+SVGV+                FWIG+GVG SA
Sbjct: 57   QNLKSVQGKVNAHD-FASISSSNGQETTSVGVSPQLSPPPPSTVGSPL-FWIGIGVGFSA 114

Query: 483  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 662
            LFS VA RVKKYAM+QAFK+   QMN QNNPF + A                        
Sbjct: 115  LFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFA 174

Query: 663  HVASQP----------VTVDVPATKVE---DPPSISVKEKVEPESGPKKYAFVDVSPEET 803
               SQ           VTVD+PATKVE     P I+VKE+VE ++ PKK AFVDVSPEET
Sbjct: 175  GNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEET 234

Query: 804  LQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEK 983
            +QKNAFE +K+  ++ S  + ++    SQNGT  KQG G S   S S+    LSV+ALEK
Sbjct: 235  VQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPS-SPSERKSALSVDALEK 293

Query: 984  MMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLK 1163
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWM+QNP+YRQQL+ MLNNMGG  EWD+RMMD+LK
Sbjct: 294  MMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLK 353

Query: 1164 NFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQ 1343
            NFDL+SP++KQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQ
Sbjct: 354  NFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQ 413

Query: 1344 NDKEVMDVFNKISELFPGATG 1406
            NDKEVMDVFNKISELFPG +G
Sbjct: 414  NDKEVMDVFNKISELFPGVSG 434


>emb|CAB50925.1| translocon Tic40 [Pisum sativum]
          Length = 436

 Score =  426 bits (1096), Expect = e-116
 Identities = 246/441 (55%), Positives = 294/441 (66%), Gaps = 14/441 (3%)
 Frame = +3

Query: 126  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 305
            NL L+SS K +L    + KN     K F          G      N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSKNVFSRRKSFT--------FGTFRVSANSSSSHVTRAASKSH 56

Query: 306  KATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 482
            +  K++  + +    FASI+SS GQET+SVGV+                FWIG+GVG SA
Sbjct: 57   QNLKSVQGKVNAHS-FASISSSNGQETTSVGVSPQLSPPPPSTVGSPL-FWIGIGVGFSA 114

Query: 483  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 662
            LFS VA RVKKYAM+QAFK+   QMN QNNPF + A                        
Sbjct: 115  LFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFA 174

Query: 663  HVASQP----------VTVDVPATKVE---DPPSISVKEKVEPESGPKKYAFVDVSPEET 803
               SQ           VTVD+PATKVE     P I+VKE+VE ++ PKK AFVDVSPEET
Sbjct: 175  GNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEET 234

Query: 804  LQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEK 983
            +QKNAFE +K+  ++ S  + ++    SQNGT  KQG G S G S S+    LSV+ALEK
Sbjct: 235  VQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPG-SPSERKSALSVDALEK 293

Query: 984  MMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLK 1163
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWM+QNP+YRQQL+ MLNNMGG  EWD+RMMD+LK
Sbjct: 294  MMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLK 353

Query: 1164 NFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQ 1343
            NFDL+SP++KQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQ
Sbjct: 354  NFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQ 413

Query: 1344 NDKEVMDVFNKISELFPGATG 1406
            NDKEVMDVFNKISELFPG +G
Sbjct: 414  NDKEVMDVFNKISELFPGVSG 434


>ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 432

 Score =  424 bits (1090), Expect = e-116
 Identities = 242/431 (56%), Positives = 290/431 (67%), Gaps = 15/431 (3%)
 Frame = +3

Query: 150  KIVLGISPNPKNSIFSSKPFVGLPN---LIRKTGKNGR-LINPSSSFTVLSLFEAPKATK 317
            K+ L +  +PK  +    P +   +     RK    GR LI P      +S   +     
Sbjct: 3    KLNLALVSSPKPLMLGHVPAIDATSRDVFRRKHFSFGRVLIAPHRCRFRVSALSSSHRNP 62

Query: 318  TIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSALFSW 494
              V EK +   FASI+SS  QE +S GVN                FWIGVGVGLSALFS 
Sbjct: 63   KSVQEKLIVKHFASISSSNTQEATSTGVNPQLSPSSTIGSPL---FWIGVGVGLSALFSV 119

Query: 495  VAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXXHVAS 674
            VA R+KKYAM+QAFKT   QMN+QNN FGNAA                           S
Sbjct: 120  VASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSATTQS 179

Query: 675  QP----------VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAFE 824
            +           +TVD+PA KVE  P+ +VK++VE ++ PKK AFVDVSPEET+Q++ FE
Sbjct: 180  RAPSASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQESPFE 239

Query: 825  NYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMMEDPTV 1004
            ++K+  ++ S  + +    VSQNG  S QG G   G  ++K S +LSV+ALEKMMEDPTV
Sbjct: 240  SFKDD-ESSSVKEARVPDEVSQNGAPSNQGFGDFPGSQSTKKS-VLSVDALEKMMEDPTV 297

Query: 1005 QKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSP 1184
            QKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNMGGS EWD+RMMD+LKNFDL+SP
Sbjct: 298  QKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSP 357

Query: 1185 EIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMD 1364
            E+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVMD
Sbjct: 358  EVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMD 417

Query: 1365 VFNKISELFPG 1397
            VFNKISELFPG
Sbjct: 418  VFNKISELFPG 428


>ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa]
            gi|550319201|gb|ERP50369.1| hypothetical protein
            POPTR_0017s02900g [Populus trichocarpa]
          Length = 435

 Score =  423 bits (1088), Expect = e-115
 Identities = 242/434 (55%), Positives = 290/434 (66%), Gaps = 13/434 (2%)
 Frame = +3

Query: 126  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 305
            +L L+S +   L     PK SI +++P +  P+   KT  +   I    S + LS    P
Sbjct: 13   SLKLVSGYPTSLKNPTTPKFSISTTRPSLPFPHRTSKTVTHTSRI----SISALSQSHGP 68

Query: 306  KATKTIVPEKDVRDCFASITS-SGQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 482
            + T      K+  + FASI+S SGQ+T+SVGVN                FW+GVGV LSA
Sbjct: 69   RRTS-----KNGSEYFASISSLSGQQTASVGVNPQSVSPPPSQIGSPL-FWVGVGVALSA 122

Query: 483  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 662
            +FSWVA R+K YAM+QAFK+ T+QMNAQNN F  A                         
Sbjct: 123  IFSWVATRLKNYAMQQAFKSLTEQMNAQNNQFNPA---FSARSPFPFSPPPASQPATSPF 179

Query: 663  HVASQP-VTVDVPATKVEDPPSISVKEKVEPES--------GPKKYAFVDVSPEETLQKN 815
              ASQP VTVD+PATKVE  P    +++ E ++         P+K+AFVDVSPEET    
Sbjct: 180  QTASQPAVTVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKFAFVDVSPEETSLNT 239

Query: 816  AFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPSTSKTSPLLSVEALEKM 986
             F + ++ I T S  D Q ++  SQNG   KQG  ASE   G  +S+ +  LSVEALEKM
Sbjct: 240  PFSSVEDVIDTSSSKDVQFAKEASQNGATFKQGPSASEPSEGSQSSQKAGSLSVEALEKM 299

Query: 987  MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1166
            M+DPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNM GS EWD+RM+DSLKN
Sbjct: 300  MDDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMSGSSEWDSRMVDSLKN 359

Query: 1167 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1346
            FDLSSPE+KQQFDQIGLTPEEVISKIMANPDVA+AFQNPRVQ AIM+CSQNPLSIAKYQN
Sbjct: 360  FDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRVQQAIMECSQNPLSIAKYQN 419

Query: 1347 DKEVMDVFNKISEL 1388
            DKEVMDVFNKISE+
Sbjct: 420  DKEVMDVFNKISEI 433


>ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 429

 Score =  421 bits (1083), Expect = e-115
 Identities = 244/436 (55%), Positives = 290/436 (66%), Gaps = 12/436 (2%)
 Frame = +3

Query: 126  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGR-LINPSSSFTVLSLFEA 302
            NL L+SS K ++ +   P   +F  K F             GR LI P      +S   +
Sbjct: 5    NLALVSSPKPLM-LGHVPARDVFRRKHF-----------SFGRVLIAPHRCRFRVSALSS 52

Query: 303  PKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLS 479
                   V EK +   FASI+SS  QET+S+GV                 FWIGVGVGLS
Sbjct: 53   SHHNPKSVQEKLIVKHFASISSSNTQETTSIGVKPQLSPSPSSTIGSPL-FWIGVGVGLS 111

Query: 480  ALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXX 659
            ALFS VA R+KKYAM+QAFKT   QMN+QNN FGNAA                       
Sbjct: 112  ALFSVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASS 171

Query: 660  XHVASQP----------VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQ 809
                S+           +TVD+PA KVE  P+ +VK++VE ++ PKK AFVDVSPEET++
Sbjct: 172  ATTQSRAPSASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVR 231

Query: 810  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMM 989
            ++ FE++K+  ++ S  +      VSQNG  S  G G   G  ++K S L SV+ALEKMM
Sbjct: 232  ESPFESFKDD-ESSSVKEAWVPDEVSQNGAPSNLGFGDFPGSQSTKKSAL-SVDALEKMM 289

Query: 990  EDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNF 1169
            EDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNMGGS EWDNRMMD+LKNF
Sbjct: 290  EDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRMMDTLKNF 349

Query: 1170 DLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQND 1349
            DL+SPE+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQND
Sbjct: 350  DLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQND 409

Query: 1350 KEVMDVFNKISELFPG 1397
            KEVMDVFNKISELFPG
Sbjct: 410  KEVMDVFNKISELFPG 425


>ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa]
            gi|222848840|gb|EEE86387.1| hypothetical protein
            POPTR_0004s08560g [Populus trichocarpa]
          Length = 429

 Score =  421 bits (1083), Expect = e-115
 Identities = 246/449 (54%), Positives = 299/449 (66%), Gaps = 20/449 (4%)
 Frame = +3

Query: 120  MEN--LGLISSH--KIVLGISPN------PKNSIFSSKPFVGLPNLIRKTGKNGRLINPS 269
            MEN  L L+SS   K+V+G   +      PK SI +++P +     I KT  +      +
Sbjct: 1    MENPRLALLSSSSPKLVMGYPTSLKNPTTPKFSISTTRPSLPFSLRISKTAPH------A 54

Query: 270  SSFTVLSLFEAPKATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXX 446
            S F++ +L  +     +        + FASI+SS G++T+SVGVN               
Sbjct: 55   SIFSISALANSHGKLGS--------EYFASISSSSGKQTASVGVNPQPVSPPPSQIGSPL 106

Query: 447  XFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXX 626
             FW+GVGVGLSA+FSWVA RVK YAM+QAFK+ T+QMN QNN F  A             
Sbjct: 107  -FWVGVGVGLSAIFSWVATRVKNYAMQQAFKSLTEQMNTQNNQFNPAFSARPPFPFSPPP 165

Query: 627  XXXXXXXXXXXXHVASQP-VTVDVPATKVEDPPSISVKEKVEPE--------SGPKKYAF 779
                          ASQP +TVD+PATKVE  P+  V ++ E +           KKYAF
Sbjct: 166  ASHPSTSPSP---AASQPAITVDIPATKVEAAPTTDVGKEKETDFLEERKIKEETKKYAF 222

Query: 780  VDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPL 959
            VD+SPEET     F + ++  +T S  D + ++ V QNG A KQG GA+EG  +  T P 
Sbjct: 223  VDISPEETSLNTPFSSVEDDNETSSSKDVEFAKKVFQNGAAFKQGPGAAEG--SQSTRPF 280

Query: 960  LSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWD 1139
            LSVEALEKMMEDPT+QKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL+DMLNNMGGS +WD
Sbjct: 281  LSVEALEKMMEDPTMQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSGKWD 340

Query: 1140 NRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQN 1319
            ++MMDSLK+FDL+S E+KQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQ AIM+CSQN
Sbjct: 341  SQMMDSLKDFDLNSAEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQQAIMECSQN 400

Query: 1320 PLSIAKYQNDKEVMDVFNKISELFPGATG 1406
            P++I KYQNDKEVMDVFNKISELFPG TG
Sbjct: 401  PINITKYQNDKEVMDVFNKISELFPGMTG 429


>ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis]
            gi|223528427|gb|EEF30461.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 465

 Score =  417 bits (1073), Expect = e-114
 Identities = 250/459 (54%), Positives = 294/459 (64%), Gaps = 35/459 (7%)
 Frame = +3

Query: 126  NLGLISSH----KIVLGIS-PNP-KN-SIFSSKPFVGLPNLIRKTG---KNGRLINPSSS 275
            N+GL+SS     K+V+G   PN  KN ++ ++K F       R      +N +++  SS 
Sbjct: 5    NMGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSR 64

Query: 276  FTVLSLFEAPKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXX-F 452
            F++ +L  +  + +     +   + FASI SS Q+TSSVGVN                 F
Sbjct: 65   FSISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLF 123

Query: 453  WIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXX 632
            WIGVGVGLSA+FS VA RVK YAM+QAFK+   QMN QN+ F N A              
Sbjct: 124  WIGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPA 183

Query: 633  XXXXXXXXXX----------------------HVASQP-VTVDVPATKVEDPPSISVKEK 743
                                             VASQP VTVDV ATKVE       K++
Sbjct: 184  SVPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDE 243

Query: 744  VEPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGA 923
             E    PKKYAFVDVSPEET  K+ F++ ++ ++T +  D Q +  V QNG AS QG   
Sbjct: 244  AEITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAAD 303

Query: 924  SEGP-STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQ 1100
              G  ST K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL+
Sbjct: 304  FTGSQSTRKAGSGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLE 363

Query: 1101 DMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQN 1280
            +MLNNM G+ EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP++AMAFQN
Sbjct: 364  EMLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQN 423

Query: 1281 PRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 1397
            PRVQ AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG
Sbjct: 424  PRVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 462


>gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 430

 Score =  416 bits (1070), Expect = e-113
 Identities = 241/432 (55%), Positives = 289/432 (66%), Gaps = 8/432 (1%)
 Frame = +3

Query: 126  NLGLISSHK-IVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGR-LINPSSSFTVLSLFE 299
            NL L+SS K ++LG  P        ++       L RK    GR LI P      +S   
Sbjct: 5    NLALVSSSKPLMLGHVP--------ARDATDRDVLRRKPFSLGRVLIAPHRFRYRVSALS 56

Query: 300  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 476
            +   +   V +K +   FASI+SS  QET+S+GVN                FWIGVGVGL
Sbjct: 57   SSHHSPKSVQDKLIVKHFASISSSNTQETTSIGVNPQLSPPPSSTIGSPL-FWIGVGVGL 115

Query: 477  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 656
            SALFS VA R+KKYAM+QAFKT   QMN+ NN FGNAA                      
Sbjct: 116  SALFSMVASRLKKYAMQQAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATAQ 175

Query: 657  XXHVASQP-----VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAF 821
                ++       VTVD+PATKVE   +  +K++VE ++ PKK AFVDVSPEET+QK+ F
Sbjct: 176  YGAPSTSSGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPF 235

Query: 822  ENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMMEDPT 1001
            E+ K++  +    + +    VSQNG    QG G   G  ++K S L SV+ALEKMMEDPT
Sbjct: 236  ESVKDNESSSVKEEARVPDEVSQNGAPFNQGFGGFPGSQSTKKSAL-SVDALEKMMEDPT 294

Query: 1002 VQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSS 1181
            VQKMV+P+LPEEMRNP TFKWMLQNPQYRQQL+ ML+NMGGS EWDNRMMD+LKNFDL+S
Sbjct: 295  VQKMVYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTLKNFDLNS 354

Query: 1182 PEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVM 1361
            PE+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVM
Sbjct: 355  PEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVM 414

Query: 1362 DVFNKISELFPG 1397
            +VFNKISELFPG
Sbjct: 415  NVFNKISELFPG 426


>gb|ABF19057.1| plastid Tic40 [Ricinus communis]
          Length = 460

 Score =  415 bits (1067), Expect = e-113
 Identities = 249/458 (54%), Positives = 293/458 (63%), Gaps = 35/458 (7%)
 Frame = +3

Query: 129  LGLISSH----KIVLGIS-PNP-KN-SIFSSKPFVGLPNLIRKTG---KNGRLINPSSSF 278
            +GL+SS     K+V+G   PN  KN ++ ++K F       R      +N +++  SS F
Sbjct: 1    MGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSRF 60

Query: 279  TVLSLFEAPKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXX-FW 455
            ++ +L  +  + +     +   + FASI SS Q+TSSVGVN                 FW
Sbjct: 61   SISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLFW 119

Query: 456  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 635
            IGVGVGLSA+FS VA RVK YAM+QAFK+   QMN QN+ F N A               
Sbjct: 120  IGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPAS 179

Query: 636  XXXXXXXXX----------------------HVASQP-VTVDVPATKVEDPPSISVKEKV 746
                                            VASQP VTVDV ATKVE       K++ 
Sbjct: 180  VPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDEA 239

Query: 747  EPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGAS 926
            E    PKKYAFVDVSPEET  K+ F++ ++ ++T +  D Q +  V QNG AS QG    
Sbjct: 240  EITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAADF 299

Query: 927  EGP-STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQD 1103
             G  ST K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL++
Sbjct: 300  TGSQSTRKAGSGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEE 359

Query: 1104 MLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNP 1283
            MLNNM G+ EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP++AMAFQNP
Sbjct: 360  MLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNP 419

Query: 1284 RVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 1397
            RVQ AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG
Sbjct: 420  RVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 457


>gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial
            [Theobroma cacao]
          Length = 412

 Score =  407 bits (1045), Expect = e-110
 Identities = 247/431 (57%), Positives = 277/431 (64%), Gaps = 13/431 (3%)
 Frame = +3

Query: 126  NLGLISSHK-----IVLGIS-PN--PKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 281
            NL L+SS        +LG + PN  PKN  F + PF    NL  +  +     +  S  T
Sbjct: 5    NLALVSSSSPPLKLYLLGCNHPNYTPKNP-FKTLPFPS-SNLAPRRSRISIFAHSHSQPT 62

Query: 282  VLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWI 458
                   P+    IV  K   + FASI+SS  Q+TSSVGVN                FWI
Sbjct: 63   ------PPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWI 116

Query: 459  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 638
            GVGVGLSALF+WVA  +KKYAM+QAFKT   QMN QNN F NAA                
Sbjct: 117  GVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPG 176

Query: 639  XXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESG---PKKYAFVDVSPEETLQ 809
                      +   VTVDVPATKVE  P+ +   +V+ E+    PKKYAFVDVSPEET+Q
Sbjct: 177  PVTSPSPS--SQTAVTVDVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQ 234

Query: 810  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGP-STSKTSPLLSVEALEKM 986
            K+AFE+              +    S N    K   GA  G  ST    P LSV+ALEKM
Sbjct: 235  KSAFED-------------AAGISSSNNTQFPKDDAGAFGGSQSTGSADPALSVDALEKM 281

Query: 987  MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1166
            MEDPTVQKMV+PYLPEEMRNP TFKWMLQNPQYRQQLQDMLNNMGGS EWDNRMMDSLKN
Sbjct: 282  MEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKN 341

Query: 1167 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1346
            FDL+SP++KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQN
Sbjct: 342  FDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 401

Query: 1347 DKEVMDVFNKI 1379
            DKEVMDVFNKI
Sbjct: 402  DKEVMDVFNKI 412


>ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus]
          Length = 419

 Score =  400 bits (1028), Expect = e-108
 Identities = 214/359 (59%), Positives = 255/359 (71%), Gaps = 3/359 (0%)
 Frame = +3

Query: 339  VRDCFASITSS--GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSALFSWVAGRVK 512
            V + FA+++SS    ++SSVGV                 FW+GVGVGLSALF+WVA  +K
Sbjct: 66   VAERFATVSSSTTSNDSSSVGV-PSVSIPPPSSYVGSPLFWVGVGVGLSALFTWVASYLK 124

Query: 513  KYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXXHVASQPVTVD 692
            KYAM+QAFKT   QMN+QN+P  N                           V+   V++D
Sbjct: 125  KYAMQQAFKTMMSQMNSQNSPMSNPTLSSGSPFPIPPTFATGTTISPS---VSEPAVSID 181

Query: 693  VPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQS 872
            V ATKVE+ P  +VK + E     KK+AFVDVSPEET QK+ F+  +++   D     Q 
Sbjct: 182  VTATKVEEEPVTNVKSRTENMEA-KKFAFVDVSPEETDQKSPFK--EDATDADVSKSAQP 238

Query: 873  SQPVSQNGTASKQGTGASEGPSTS-KTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNP 1049
            +Q + QNG ASKQ    S+G   S K   +LSVEA+EKMMEDPTVQKM++P+LPEEMRNP
Sbjct: 239  TQELPQNGAASKQAYNGSDGSQFSRKPGSVLSVEAVEKMMEDPTVQKMIYPHLPEEMRNP 298

Query: 1050 TTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEE 1229
             TFKWM+QNP YRQQL++MLNNM GSP+WD R+MDSLKNFDLSSPE+KQQFDQIGLTPEE
Sbjct: 299  ETFKWMMQNPLYRQQLEEMLNNMSGSPQWDGRLMDSLKNFDLSSPEVKQQFDQIGLTPEE 358

Query: 1230 VISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1406
            VISKIMANP++AMAFQNPRVQAAIMDCSQNPLSI KYQNDKEVMDVFNKISELFPG +G
Sbjct: 359  VISKIMANPEIAMAFQNPRVQAAIMDCSQNPLSITKYQNDKEVMDVFNKISELFPGVSG 417


>ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana]
            gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein
            TIC 40, chloroplastic; AltName: Full=Protein PIGMENT
            DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the
            inner envelope membrane of chloroplasts 40;
            Short=AtTIC40; Flags: Precursor
            gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6
            [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1|
            translocon Tic40-like protein [Arabidopsis thaliana]
            gi|20260222|gb|AAM13009.1| translocon Tic40-like protein
            [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1|
            At5g16620 [Arabidopsis thaliana]
            gi|332004935|gb|AED92318.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 447

 Score =  395 bits (1015), Expect = e-107
 Identities = 234/456 (51%), Positives = 276/456 (60%), Gaps = 27/456 (5%)
 Frame = +3

Query: 120  MENLGLIS----SHKIVLG--ISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 281
            MENL L+S    S K+++G   + + KN    S+     PN++ +  K       S+S  
Sbjct: 1    MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRR---TPNIVLRCSKI------SASAQ 51

Query: 282  VLSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFW 455
              S    P+ T  IV  K     FASI SS   Q+T+SV                   FW
Sbjct: 52   SQSPSSRPENTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 456  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 635
            IGVGVGLSALFS+V   +KKYAM+ A KT   QMN QN+ F N+                
Sbjct: 112  IGVGVGLSALFSYVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPFPPQT 171

Query: 636  XXXXXXXXXHVASQPVTVDVPATKVEDPPSISVK----------------EKVEPESGPK 767
                        S   TVDV ATKVE PPS   K                E  + +   K
Sbjct: 172  SPASSPFQSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 768  KYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPS 938
             YAF D+SPEET +++ F NY E  +T+SP + +  + V QNG     G  ASE      
Sbjct: 232  NYAFEDISPEETTKESPFSNYAEVSETNSPKETRLFEDVLQNGAGPANGATASEVFQSLG 291

Query: 939  TSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNM 1118
              K  P LSVEALEKMMEDPTVQKMV+PYLPEEMRNP TFKWML+NPQYRQQLQDMLNNM
Sbjct: 292  GGKGGPGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNM 351

Query: 1119 GGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAA 1298
             GS EWD RM D+LKNFDL+SPE+KQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA
Sbjct: 352  SGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAA 411

Query: 1299 IMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1406
            +M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 412  LMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447


>ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum]
            gi|557101290|gb|ESQ41653.1| hypothetical protein
            EUTSA_v10013528mg [Eutrema salsugineum]
          Length = 449

 Score =  390 bits (1003), Expect = e-106
 Identities = 238/457 (52%), Positives = 274/457 (59%), Gaps = 28/457 (6%)
 Frame = +3

Query: 120  MENLGLIS----SHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNG-RLINPSSSFTV 284
            MENL L+S    S K+++G      N   S K  VG     R+T K   R    S+S   
Sbjct: 1    MENLTLVSCSASSPKLLIGC-----NFTSSLKNPVGFS---RRTPKVVFRCSKISASAKS 52

Query: 285  LSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFWI 458
             S    P+    IV  K     FASI SS   Q+T+SV                   FWI
Sbjct: 53   QSHSSRPENAGEIVVVKHRSRDFASIFSSNRDQQTTSVAYPNAAVPPPSSSTIGSPLFWI 112

Query: 459  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 638
            GVGVGLSALFSWV   +KKYAM+ A KT   QMN QN+ F N                  
Sbjct: 113  GVGVGLSALFSWVTSSLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPAGSASPFPFPFPPQ 172

Query: 639  XXXXXXXXHVASQP--VTVDVPATKVEDPPSIS----------------VKEKVEPESGP 764
                       SQ    TVDV ATKV+ PPS                  V E+ + +   
Sbjct: 173  TSPTSSPFQSQSQSSGATVDVTATKVDTPPSAKPQPTPAKKTEVDKPSVVLEENKAKKEE 232

Query: 765  KKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GP 935
            K YAF DVSPEET +++ F NY E  +T +P + +  + V QNG A   G  ASE     
Sbjct: 233  KNYAFEDVSPEETTKESPFSNYAEVSETSAPKEARLFEDVMQNGAAPANGATASEVFQSL 292

Query: 936  STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNN 1115
               K  P LSVEALEKMMEDPTVQKMV+P+LPEEMRNP TFKWML+NP YRQQLQDMLNN
Sbjct: 293  GAGKGGPGLSVEALEKMMEDPTVQKMVYPHLPEEMRNPETFKWMLKNPHYRQQLQDMLNN 352

Query: 1116 MGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQA 1295
            M GS EWD RMMD+LKNFDL+SPE+KQQFDQIGLTPEEVISKIM NPDVAMAFQNPRVQA
Sbjct: 353  MSGSGEWDKRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMENPDVAMAFQNPRVQA 412

Query: 1296 AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1406
            A+M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 413  ALMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 449


>ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-like [Fragaria vesca subsp.
            vesca]
          Length = 448

 Score =  386 bits (992), Expect = e-104
 Identities = 224/415 (53%), Positives = 261/415 (62%), Gaps = 31/415 (7%)
 Frame = +3

Query: 255  LINPSSSFTVLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXX 431
            L +P+S  TV        A    V  K   + FASI+S+  QETSSVG+N          
Sbjct: 40   LSSPNSRLTV----RLSAAANQPVTSKLQTERFASISSTNSQETSSVGINPQFSAPPPPS 95

Query: 432  XXXXXXFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXX 611
                  FWIGVGV  SA+FSW AG+++KY ++QAFK    QMN QN+ F NAA       
Sbjct: 96   TIGSPLFWIGVGVAFSAVFSWAAGKLQKYVVQQAFKNVMGQMNTQNDQFSNAA---FSPG 152

Query: 612  XXXXXXXXXXXXXXXXXHVASQPVTVDVPATKVEDP--------PSISVKEKVEPESGPK 767
                                SQP   DV AT+V+ P        P+  VK + E +    
Sbjct: 153  SPFPFPSAPASPSASPFSAPSQPSFTDVSATEVDSPASSATPSTPAADVKSE-EQQMKEN 211

Query: 768  KY---------------------AFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPV 884
            ++                     AFVDV+PEET  K+ F +     +  S  +  S+   
Sbjct: 212  RFGNSFEIERNNVIQFSRQLSDRAFVDVNPEETELKSPFASSLNDTEPGSSKEINSNVEG 271

Query: 885  SQNGTASKQGTGASEG-PSTSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFK 1061
            SQNG A KQ   AS G  +T K + +LSVEALEKM+EDPTVQKMV+PYLPEEMRNPTTFK
Sbjct: 272  SQNGAAFKQAKDASMGSQTTGKENSVLSVEALEKMLEDPTVQKMVYPYLPEEMRNPTTFK 331

Query: 1062 WMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISK 1241
            WMLQNPQYRQQL+DML NM GS EWDNRMMDSLKNFDLSSPE+K+QFDQIGLTPE+VISK
Sbjct: 332  WMLQNPQYRQQLEDMLRNMTGSNEWDNRMMDSLKNFDLSSPEVKEQFDQIGLTPEQVISK 391

Query: 1242 IMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1406
            IMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQNDKEVMDVFNKISELFPG +G
Sbjct: 392  IMANPDVAMAFQNPRVQAAIMDCSQNPMSITKYQNDKEVMDVFNKISELFPGVSG 446


>ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp.
            lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein
            ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  385 bits (990), Expect = e-104
 Identities = 230/456 (50%), Positives = 269/456 (58%), Gaps = 27/456 (5%)
 Frame = +3

Query: 120  MENLGLIS----SHKIVLG--ISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 281
            MENL L+S    S K+++G   + + KN    S+    +     K   + +  +PSS   
Sbjct: 1    MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRRTPRIVLRCSKISASAQSQSPSSR-- 58

Query: 282  VLSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFW 455
                   P  T  IV  K     FASI SS   Q+T+SV                   FW
Sbjct: 59   -------PDNTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 456  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 635
            IGVGVGLSALFS V   +KKYAM+ A KT   QMN QN+ F N                 
Sbjct: 112  IGVGVGLSALFSLVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPFPPQT 171

Query: 636  XXXXXXXXXHVASQPVTVDVPATKVEDPPSISVK----------------EKVEPESGPK 767
                        S   TVDV ATKV+ PPS   K                E  + +   K
Sbjct: 172  SPASSPFQSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 768  KYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPS 938
             YAF D+SPEET +++ F NY E  +T SP + +  + V QNG     G  ASE      
Sbjct: 232  NYAFEDISPEETTKESPFSNYAEVSETSSPKETRLFEDVLQNGAGPANGATASEVFQSLG 291

Query: 939  TSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNM 1118
              K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP TFKWML+NPQYRQQLQDMLNNM
Sbjct: 292  GGKGGAGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNM 351

Query: 1119 GGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAA 1298
             GS EWD RM D+LKNFDL+SPE+KQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA
Sbjct: 352  SGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAA 411

Query: 1299 IMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1406
            +M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 412  LMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447


Top