BLASTX nr result

ID: Rehmannia25_contig00001336 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00001336
         (1972 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   525   e-146
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   520   e-144
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   484   e-134
gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein i...   444   e-122
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   429   e-117
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   427   e-116
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      426   e-116
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   424   e-116
ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu...   423   e-115
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   421   e-115
ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Popu...   421   e-115
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...   417   e-114
gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus...   416   e-113
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                       415   e-113
gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein i...   407   e-110
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   400   e-108
ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana] gi|753092...   395   e-107
ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutr...   390   e-106
ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-lik...   386   e-104
ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arab...   385   e-104

>ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum lycopersicum]
          Length = 443

 Score =  525 bits (1352), Expect = e-146
 Identities = 281/448 (62%), Positives = 324/448 (72%), Gaps = 19/448 (4%)
 Frame = +2

Query: 167  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 346
            MEN+G++SS K+VLG+S    NS+ SSKPF GLP+L ++  KNGR + P++ F V+S F+
Sbjct: 1    MENIGIVSSPKMVLGLS---SNSVISSKPFFGLPHLPKRPFKNGRTVRPTTCFEVVSCFQ 57

Query: 347  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 523
             P+ TK IV  K  R  FAS T+SG ++TSSVGVN                FWIGVGVG 
Sbjct: 58   GPRLTKKIVLGKSGRGSFASTTTSGGKQTSSVGVNPQFSAPSPPSQMGSPLFWIGVGVGF 117

Query: 524  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 703
            SALF+WVA  +KKYAM+QA KT   QMN QN+ F N A                      
Sbjct: 118  SALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVSGPA 177

Query: 704  XXH---------------VASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVS 838
                               ASQPVTVDV ATKVE+PP+++VK   E E  PKK AFVD+S
Sbjct: 178  SSSPPPPTASSSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFVDIS 237

Query: 839  PEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTS---KTSPLL 1009
            P+ET QK AFEN+K+S +T +    Q    V+QNG AS+ G G++   STS   K++PLL
Sbjct: 238  PDETFQKGAFENFKDSAETAAVTVDQ----VTQNGAASQSGFGSNTSDSTSSTGKSNPLL 293

Query: 1010 SVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDN 1189
            SV+ALEKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDM+NNMGG+PEWDN
Sbjct: 294  SVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDN 353

Query: 1190 RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 1369
            RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP
Sbjct: 354  RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 413

Query: 1370 LSIAKYQNDKEVMDVFNKISELFPGATG 1453
            LSIAKYQNDKEVMDVFNKISELFPG +G
Sbjct: 414  LSIAKYQNDKEVMDVFNKISELFPGVSG 441


>ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum]
          Length = 443

 Score =  520 bits (1338), Expect = e-144
 Identities = 279/448 (62%), Positives = 323/448 (72%), Gaps = 19/448 (4%)
 Frame = +2

Query: 167  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 346
            MEN+ ++SS K+VLG+S NP   + S+KP  GLP+L ++  KNGR++ P++ F V+S F+
Sbjct: 1    MENICIVSSPKMVLGLSSNP---VISNKPLFGLPHLPKRPFKNGRIVRPTTCFEVVSCFQ 57

Query: 347  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 523
            +P+ TK IV  K  R  FAS T+SG Q+TSSVGVN                FWIGVGVGL
Sbjct: 58   SPRLTKKIVLGKSGRGSFASTTTSGGQQTSSVGVNPQFSAPSQPSQVGSPLFWIGVGVGL 117

Query: 524  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 703
            SALF+WVA  +KKYAM+QA KT   QMN QN+ F N A                      
Sbjct: 118  SALFAWVASYLKKYAMQQALKTMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVSGPA 177

Query: 704  XXH---------------VASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVS 838
                               ASQPVTVDV ATKVE+PP+++VK   E    PKK AFVD+S
Sbjct: 178  SSSPPPPTASTSSTPSASFASQPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFVDIS 237

Query: 839  PEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTS---KTSPLL 1009
            P+ET QK AFEN+K+S +T S    Q    V+QNG AS+ G G +   STS   K++PL+
Sbjct: 238  PDETFQKGAFENFKDSTETASVTVDQ----VTQNGAASQLGFGPNTSDSTSSTGKSNPLM 293

Query: 1010 SVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDN 1189
            SV+ALEKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDM+NNMGG+PEWDN
Sbjct: 294  SVDALEKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDN 353

Query: 1190 RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 1369
            RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP
Sbjct: 354  RMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNP 413

Query: 1370 LSIAKYQNDKEVMDVFNKISELFPGATG 1453
            LSIAKYQNDKEVMDVFNKISELFPG +G
Sbjct: 414  LSIAKYQNDKEVMDVFNKISELFPGVSG 441


>ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera]
            gi|296089465|emb|CBI39284.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  484 bits (1245), Expect = e-134
 Identities = 266/443 (60%), Positives = 301/443 (67%), Gaps = 14/443 (3%)
 Frame = +2

Query: 167  MENLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFE 346
            M++L L+SS K+VLG SP+    I  +     LP L RK  K    I  S S        
Sbjct: 1    MDSLTLVSSPKLVLGHSPSNPRHISCAHSSFSLPLLFRKPRK---FIAASQSGA------ 51

Query: 347  APKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLS 526
            +P+  + +V  K   +CFASI+SS Q TSSVGVN                FWIGVGVGLS
Sbjct: 52   SPRTPRHVVETKLGTECFASISSSSQGTSSVGVNPQFSPPPPSSNIGSPLFWIGVGVGLS 111

Query: 527  ALFSWVAGRVKKYAMEQAFKTFTQQMNAQNN-------------PFGNAAXXXXXXXXXX 667
            ALFSWVA  +KKYAM+QAFKT   QM++QNN             PF              
Sbjct: 112  ALFSWVASNLKKYAMQQAFKTLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTSHSG 171

Query: 668  XXXXXXXXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEE 847
                            A   VTVDVPATKVE PP+  VK+ +E ++   KYAFVDVSPEE
Sbjct: 172  PTTSPSGPTTSPSTVAAQSMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVSPEE 231

Query: 848  TLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE-GPSTSKTSPLLSVEAL 1024
            TLQ++ FEN++ES +T S  D Q S  VSQNGT  + G G SE   ST   +P LSV+AL
Sbjct: 232  TLQESPFENFEESTETSSSKDAQFSAGVSQNGTPPRPGMGVSEDSQSTRNANPFLSVDAL 291

Query: 1025 EKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDS 1204
            EKMMEDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGG  EWDNRMMD+
Sbjct: 292  EKMMEDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRMMDN 351

Query: 1205 LKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAK 1384
            LKNFDLSSPE+KQQFDQIGLTPEEVISKIMANPDVA+AFQNPR+QAAIMDCSQNPLSIAK
Sbjct: 352  LKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRIQAAIMDCSQNPLSIAK 411

Query: 1385 YQNDKEVMDVFNKISELFPGATG 1453
            YQNDKEVMDVFNKISELFPG +G
Sbjct: 412  YQNDKEVMDVFNKISELFPGVSG 434


>gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 433

 Score =  444 bits (1142), Expect = e-122
 Identities = 263/440 (59%), Positives = 295/440 (67%), Gaps = 13/440 (2%)
 Frame = +2

Query: 173  NLGLISSHK-----IVLGIS-PN--PKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 328
            NL L+SS        +LG + PN  PKN  F + PF    NL  +  +     +  S  T
Sbjct: 5    NLALVSSSSPPLKLYLLGCNHPNYTPKNP-FKTLPFPS-SNLAPRRSRISIFAHSHSQPT 62

Query: 329  VLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWI 505
                   P+    IV  K   + FASI+SS  Q+TSSVGVN                FWI
Sbjct: 63   ------PPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWI 116

Query: 506  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 685
            GVGVGLSALF+WVA  +KKYAM+QAFKT   QMN QNN F NAA                
Sbjct: 117  GVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPG 176

Query: 686  XXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESG---PKKYAFVDVSPEETLQ 856
                      +   VTVDVPATKVE  P+ +   +V+ E+    PKKYAFVDVSPEET+Q
Sbjct: 177  PVTSPSPS--SQTAVTVDVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQ 234

Query: 857  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGP-STSKTSPLLSVEALEKM 1033
            K+AFE   ++    S N+ Q  + VS NG ASKQ  GA  G  ST    P LSV+ALEKM
Sbjct: 235  KSAFE---DAAGISSSNNTQFPKDVSDNGAASKQDAGAFGGSQSTGSADPALSVDALEKM 291

Query: 1034 MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1213
            MEDPTVQKMV+PYLPEEMRNP TFKWMLQNPQYRQQLQDMLNNMGGS EWDNRMMDSLKN
Sbjct: 292  MEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKN 351

Query: 1214 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1393
            FDL+SP++KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQN
Sbjct: 352  FDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 411

Query: 1394 DKEVMDVFNKISELFPGATG 1453
            DKEVMDVFNKISELFPG TG
Sbjct: 412  DKEVMDVFNKISELFPGVTG 431


>ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum]
          Length = 433

 Score =  429 bits (1102), Expect = e-117
 Identities = 246/438 (56%), Positives = 295/438 (67%), Gaps = 11/438 (2%)
 Frame = +2

Query: 173  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 352
            NL L+SS K +L    + +N     KPF          GK     N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSRNVFTRRKPFT--------FGKFFVSANSSSSHVTRAAPKSH 56

Query: 353  KATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 529
            +  K++  +  V + FASI+SS  QET+SVGV+                FWIGVGVG SA
Sbjct: 57   QNPKSVQGKLIVHN-FASISSSNSQETTSVGVSPQLSPPPSSTVGSPL-FWIGVGVGFSA 114

Query: 530  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 709
            LFS VA R+KKYAM+QAFKT   QMN QNNPF +AA                        
Sbjct: 115  LFSIVASRLKKYAMQQAFKTMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGPAAPASSA 174

Query: 710  HVASQ----------PVTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQK 859
               SQ           VTVD+PATKVE  PS + K++VE ++ PKK  FVDVSPEE++QK
Sbjct: 175  GTQSQSTSARTASQSTVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQK 234

Query: 860  NAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMME 1039
            + FE++K+  ++ S  + ++     QNG  S QG G S G S S    +LSVEALEKMME
Sbjct: 235  SPFESFKDVDESSSFKEARAPAEAFQNGAPSNQGFGNSPG-SQSGGKSVLSVEALEKMME 293

Query: 1040 DPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFD 1219
            DPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL++MLNNMGGS EWD+RMMD+LKNFD
Sbjct: 294  DPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFD 353

Query: 1220 LSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDK 1399
            L+SP++KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCS NPL+IAKYQNDK
Sbjct: 354  LNSPDVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSSNPLNIAKYQNDK 413

Query: 1400 EVMDVFNKISELFPGATG 1453
            EVMDVFNKISELFPG +G
Sbjct: 414  EVMDVFNKISELFPGVSG 431


>sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName: Full=Translocon
            at the inner envelope membrane of chloroplasts 40;
            Short=PsTIC40; Flags: Precursor
            gi|26000725|gb|AAN75219.1| chloroplast protein translocon
            component Tic40 precursor [Pisum sativum]
          Length = 436

 Score =  427 bits (1097), Expect = e-116
 Identities = 246/441 (55%), Positives = 294/441 (66%), Gaps = 14/441 (3%)
 Frame = +2

Query: 173  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 352
            NL L+SS K +L    + KN     K F          G      N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSKNVFSGRKSFT--------FGTFRVSANSSSSHVTRAASKSH 56

Query: 353  KATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 529
            +  K++  + +  D FASI+SS GQET+SVGV+                FWIG+GVG SA
Sbjct: 57   QNLKSVQGKVNAHD-FASISSSNGQETTSVGVSPQLSPPPPSTVGSPL-FWIGIGVGFSA 114

Query: 530  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 709
            LFS VA RVKKYAM+QAFK+   QMN QNNPF + A                        
Sbjct: 115  LFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFA 174

Query: 710  HVASQP----------VTVDVPATKVE---DPPSISVKEKVEPESGPKKYAFVDVSPEET 850
               SQ           VTVD+PATKVE     P I+VKE+VE ++ PKK AFVDVSPEET
Sbjct: 175  GNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEET 234

Query: 851  LQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEK 1030
            +QKNAFE +K+  ++ S  + ++    SQNGT  KQG G S   S S+    LSV+ALEK
Sbjct: 235  VQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPS-SPSERKSALSVDALEK 293

Query: 1031 MMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLK 1210
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWM+QNP+YRQQL+ MLNNMGG  EWD+RMMD+LK
Sbjct: 294  MMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLK 353

Query: 1211 NFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQ 1390
            NFDL+SP++KQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQ
Sbjct: 354  NFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQ 413

Query: 1391 NDKEVMDVFNKISELFPGATG 1453
            NDKEVMDVFNKISELFPG +G
Sbjct: 414  NDKEVMDVFNKISELFPGVSG 434


>emb|CAB50925.1| translocon Tic40 [Pisum sativum]
          Length = 436

 Score =  426 bits (1096), Expect = e-116
 Identities = 246/441 (55%), Positives = 294/441 (66%), Gaps = 14/441 (3%)
 Frame = +2

Query: 173  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 352
            NL L+SS K +L    + KN     K F          G      N SSS    +  ++ 
Sbjct: 5    NLALVSSPKPLLLGHSSSKNVFSRRKSFT--------FGTFRVSANSSSSHVTRAASKSH 56

Query: 353  KATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 529
            +  K++  + +    FASI+SS GQET+SVGV+                FWIG+GVG SA
Sbjct: 57   QNLKSVQGKVNAHS-FASISSSNGQETTSVGVSPQLSPPPPSTVGSPL-FWIGIGVGFSA 114

Query: 530  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 709
            LFS VA RVKKYAM+QAFK+   QMN QNNPF + A                        
Sbjct: 115  LFSVVASRVKKYAMQQAFKSMMGQMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFA 174

Query: 710  HVASQP----------VTVDVPATKVE---DPPSISVKEKVEPESGPKKYAFVDVSPEET 850
               SQ           VTVD+PATKVE     P I+VKE+VE ++ PKK AFVDVSPEET
Sbjct: 175  GNQSQATSTRSASQSTVTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEET 234

Query: 851  LQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEK 1030
            +QKNAFE +K+  ++ S  + ++    SQNGT  KQG G S G S S+    LSV+ALEK
Sbjct: 235  VQKNAFERFKDVDESSSFKEARAPAEASQNGTPFKQGFGDSPG-SPSERKSALSVDALEK 293

Query: 1031 MMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLK 1210
            MMEDPTVQ+MV+PYLPEEMRNP+TFKWM+QNP+YRQQL+ MLNNMGG  EWD+RMMD+LK
Sbjct: 294  MMEDPTVQQMVYPYLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLK 353

Query: 1211 NFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQ 1390
            NFDL+SP++KQQFDQIGL+P+EVISKIMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQ
Sbjct: 354  NFDLNSPDVKQQFDQIGLSPQEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPMSIVKYQ 413

Query: 1391 NDKEVMDVFNKISELFPGATG 1453
            NDKEVMDVFNKISELFPG +G
Sbjct: 414  NDKEVMDVFNKISELFPGVSG 434


>ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 432

 Score =  424 bits (1090), Expect = e-116
 Identities = 242/431 (56%), Positives = 290/431 (67%), Gaps = 15/431 (3%)
 Frame = +2

Query: 197  KIVLGISPNPKNSIFSSKPFVGLPN---LIRKTGKNGR-LINPSSSFTVLSLFEAPKATK 364
            K+ L +  +PK  +    P +   +     RK    GR LI P      +S   +     
Sbjct: 3    KLNLALVSSPKPLMLGHVPAIDATSRDVFRRKHFSFGRVLIAPHRCRFRVSALSSSHRNP 62

Query: 365  TIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSALFSW 541
              V EK +   FASI+SS  QE +S GVN                FWIGVGVGLSALFS 
Sbjct: 63   KSVQEKLIVKHFASISSSNTQEATSTGVNPQLSPSSTIGSPL---FWIGVGVGLSALFSV 119

Query: 542  VAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXXHVAS 721
            VA R+KKYAM+QAFKT   QMN+QNN FGNAA                           S
Sbjct: 120  VASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASSATTQS 179

Query: 722  QP----------VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAFE 871
            +           +TVD+PA KVE  P+ +VK++VE ++ PKK AFVDVSPEET+Q++ FE
Sbjct: 180  RAPSASSASQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQESPFE 239

Query: 872  NYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMMEDPTV 1051
            ++K+  ++ S  + +    VSQNG  S QG G   G  ++K S +LSV+ALEKMMEDPTV
Sbjct: 240  SFKDD-ESSSVKEARVPDEVSQNGAPSNQGFGDFPGSQSTKKS-VLSVDALEKMMEDPTV 297

Query: 1052 QKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSP 1231
            QKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNMGGS EWD+RMMD+LKNFDL+SP
Sbjct: 298  QKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSP 357

Query: 1232 EIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMD 1411
            E+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVMD
Sbjct: 358  EVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVMD 417

Query: 1412 VFNKISELFPG 1444
            VFNKISELFPG
Sbjct: 418  VFNKISELFPG 428


>ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa]
            gi|550319201|gb|ERP50369.1| hypothetical protein
            POPTR_0017s02900g [Populus trichocarpa]
          Length = 435

 Score =  423 bits (1088), Expect = e-115
 Identities = 242/434 (55%), Positives = 290/434 (66%), Gaps = 13/434 (2%)
 Frame = +2

Query: 173  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFTVLSLFEAP 352
            +L L+S +   L     PK SI +++P +  P+   KT  +   I    S + LS    P
Sbjct: 13   SLKLVSGYPTSLKNPTTPKFSISTTRPSLPFPHRTSKTVTHTSRI----SISALSQSHGP 68

Query: 353  KATKTIVPEKDVRDCFASITS-SGQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSA 529
            + T      K+  + FASI+S SGQ+T+SVGVN                FW+GVGV LSA
Sbjct: 69   RRTS-----KNGSEYFASISSLSGQQTASVGVNPQSVSPPPSQIGSPL-FWVGVGVALSA 122

Query: 530  LFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXX 709
            +FSWVA R+K YAM+QAFK+ T+QMNAQNN F  A                         
Sbjct: 123  IFSWVATRLKNYAMQQAFKSLTEQMNAQNNQFNPA---FSARSPFPFSPPPASQPATSPF 179

Query: 710  HVASQP-VTVDVPATKVEDPPSISVKEKVEPES--------GPKKYAFVDVSPEETLQKN 862
              ASQP VTVD+PATKVE  P    +++ E ++         P+K+AFVDVSPEET    
Sbjct: 180  QTASQPAVTVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKFAFVDVSPEETSLNT 239

Query: 863  AFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPSTSKTSPLLSVEALEKM 1033
             F + ++ I T S  D Q ++  SQNG   KQG  ASE   G  +S+ +  LSVEALEKM
Sbjct: 240  PFSSVEDVIDTSSSKDVQFAKEASQNGATFKQGPSASEPSEGSQSSQKAGSLSVEALEKM 299

Query: 1034 MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1213
            M+DPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNM GS EWD+RM+DSLKN
Sbjct: 300  MDDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMSGSSEWDSRMVDSLKN 359

Query: 1214 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1393
            FDLSSPE+KQQFDQIGLTPEEVISKIMANPDVA+AFQNPRVQ AIM+CSQNPLSIAKYQN
Sbjct: 360  FDLSSPEVKQQFDQIGLTPEEVISKIMANPDVALAFQNPRVQQAIMECSQNPLSIAKYQN 419

Query: 1394 DKEVMDVFNKISEL 1435
            DKEVMDVFNKISE+
Sbjct: 420  DKEVMDVFNKISEI 433


>ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 429

 Score =  421 bits (1083), Expect = e-115
 Identities = 244/436 (55%), Positives = 290/436 (66%), Gaps = 12/436 (2%)
 Frame = +2

Query: 173  NLGLISSHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGR-LINPSSSFTVLSLFEA 349
            NL L+SS K ++ +   P   +F  K F             GR LI P      +S   +
Sbjct: 5    NLALVSSPKPLM-LGHVPARDVFRRKHF-----------SFGRVLIAPHRCRFRVSALSS 52

Query: 350  PKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLS 526
                   V EK +   FASI+SS  QET+S+GV                 FWIGVGVGLS
Sbjct: 53   SHHNPKSVQEKLIVKHFASISSSNTQETTSIGVKPQLSPSPSSTIGSPL-FWIGVGVGLS 111

Query: 527  ALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXX 706
            ALFS VA R+KKYAM+QAFKT   QMN+QNN FGNAA                       
Sbjct: 112  ALFSVVASRLKKYAMQQAFKTMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPTAPASS 171

Query: 707  XHVASQP----------VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQ 856
                S+           +TVD+PA KVE  P+ +VK++VE ++ PKK AFVDVSPEET++
Sbjct: 172  ATTQSRAPSASSASQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVR 231

Query: 857  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMM 1036
            ++ FE++K+  ++ S  +      VSQNG  S  G G   G  ++K S L SV+ALEKMM
Sbjct: 232  ESPFESFKDD-ESSSVKEAWVPDEVSQNGAPSNLGFGDFPGSQSTKKSAL-SVDALEKMM 289

Query: 1037 EDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNF 1216
            EDPTVQKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL++MLNNMGGS EWDNRMMD+LKNF
Sbjct: 290  EDPTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRMMDTLKNF 349

Query: 1217 DLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQND 1396
            DL+SPE+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQND
Sbjct: 350  DLNSPEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQND 409

Query: 1397 KEVMDVFNKISELFPG 1444
            KEVMDVFNKISELFPG
Sbjct: 410  KEVMDVFNKISELFPG 425


>ref|XP_002305876.1| hypothetical protein POPTR_0004s08560g [Populus trichocarpa]
            gi|222848840|gb|EEE86387.1| hypothetical protein
            POPTR_0004s08560g [Populus trichocarpa]
          Length = 429

 Score =  421 bits (1083), Expect = e-115
 Identities = 246/449 (54%), Positives = 299/449 (66%), Gaps = 20/449 (4%)
 Frame = +2

Query: 167  MEN--LGLISSH--KIVLGISPN------PKNSIFSSKPFVGLPNLIRKTGKNGRLINPS 316
            MEN  L L+SS   K+V+G   +      PK SI +++P +     I KT  +      +
Sbjct: 1    MENPRLALLSSSSPKLVMGYPTSLKNPTTPKFSISTTRPSLPFSLRISKTAPH------A 54

Query: 317  SSFTVLSLFEAPKATKTIVPEKDVRDCFASITSS-GQETSSVGVNXXXXXXXXXXXXXXX 493
            S F++ +L  +     +        + FASI+SS G++T+SVGVN               
Sbjct: 55   SIFSISALANSHGKLGS--------EYFASISSSSGKQTASVGVNPQPVSPPPSQIGSPL 106

Query: 494  XFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXX 673
             FW+GVGVGLSA+FSWVA RVK YAM+QAFK+ T+QMN QNN F  A             
Sbjct: 107  -FWVGVGVGLSAIFSWVATRVKNYAMQQAFKSLTEQMNTQNNQFNPAFSARPPFPFSPPP 165

Query: 674  XXXXXXXXXXXXHVASQP-VTVDVPATKVEDPPSISVKEKVEPE--------SGPKKYAF 826
                          ASQP +TVD+PATKVE  P+  V ++ E +           KKYAF
Sbjct: 166  ASHPSTSPSP---AASQPAITVDIPATKVEAAPTTDVGKEKETDFLEERKIKEETKKYAF 222

Query: 827  VDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPL 1006
            VD+SPEET     F + ++  +T S  D + ++ V QNG A KQG GA+EG  +  T P 
Sbjct: 223  VDISPEETSLNTPFSSVEDDNETSSSKDVEFAKKVFQNGAAFKQGPGAAEG--SQSTRPF 280

Query: 1007 LSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWD 1186
            LSVEALEKMMEDPT+QKMV+PYLPEEMRNPTTFKWMLQNPQYRQQL+DMLNNMGGS +WD
Sbjct: 281  LSVEALEKMMEDPTMQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEDMLNNMGGSGKWD 340

Query: 1187 NRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQN 1366
            ++MMDSLK+FDL+S E+KQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQ AIM+CSQN
Sbjct: 341  SQMMDSLKDFDLNSAEVKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQQAIMECSQN 400

Query: 1367 PLSIAKYQNDKEVMDVFNKISELFPGATG 1453
            P++I KYQNDKEVMDVFNKISELFPG TG
Sbjct: 401  PINITKYQNDKEVMDVFNKISELFPGMTG 429


>ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis]
            gi|223528427|gb|EEF30461.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 465

 Score =  417 bits (1073), Expect = e-114
 Identities = 250/459 (54%), Positives = 294/459 (64%), Gaps = 35/459 (7%)
 Frame = +2

Query: 173  NLGLISSH----KIVLGIS-PNP-KN-SIFSSKPFVGLPNLIRKTG---KNGRLINPSSS 322
            N+GL+SS     K+V+G   PN  KN ++ ++K F       R      +N +++  SS 
Sbjct: 5    NMGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSR 64

Query: 323  FTVLSLFEAPKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXX-F 499
            F++ +L  +  + +     +   + FASI SS Q+TSSVGVN                 F
Sbjct: 65   FSISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLF 123

Query: 500  WIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXX 679
            WIGVGVGLSA+FS VA RVK YAM+QAFK+   QMN QN+ F N A              
Sbjct: 124  WIGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPA 183

Query: 680  XXXXXXXXXX----------------------HVASQP-VTVDVPATKVEDPPSISVKEK 790
                                             VASQP VTVDV ATKVE       K++
Sbjct: 184  SVPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDE 243

Query: 791  VEPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGA 970
             E    PKKYAFVDVSPEET  K+ F++ ++ ++T +  D Q +  V QNG AS QG   
Sbjct: 244  AEITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAAD 303

Query: 971  SEGP-STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQ 1147
              G  ST K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL+
Sbjct: 304  FTGSQSTRKAGSGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLE 363

Query: 1148 DMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQN 1327
            +MLNNM G+ EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP++AMAFQN
Sbjct: 364  EMLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQN 423

Query: 1328 PRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 1444
            PRVQ AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG
Sbjct: 424  PRVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 462


>gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 430

 Score =  416 bits (1070), Expect = e-113
 Identities = 241/432 (55%), Positives = 289/432 (66%), Gaps = 8/432 (1%)
 Frame = +2

Query: 173  NLGLISSHK-IVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNGR-LINPSSSFTVLSLFE 346
            NL L+SS K ++LG  P        ++       L RK    GR LI P      +S   
Sbjct: 5    NLALVSSSKPLMLGHVP--------ARDATDRDVLRRKPFSLGRVLIAPHRFRYRVSALS 56

Query: 347  APKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGL 523
            +   +   V +K +   FASI+SS  QET+S+GVN                FWIGVGVGL
Sbjct: 57   SSHHSPKSVQDKLIVKHFASISSSNTQETTSIGVNPQLSPPPSSTIGSPL-FWIGVGVGL 115

Query: 524  SALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXX 703
            SALFS VA R+KKYAM+QAFKT   QMN+ NN FGNAA                      
Sbjct: 116  SALFSMVASRLKKYAMQQAFKTMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATAQ 175

Query: 704  XXHVASQP-----VTVDVPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAF 868
                ++       VTVD+PATKVE   +  +K++VE ++ PKK AFVDVSPEET+QK+ F
Sbjct: 176  YGAPSTSSGSQSTVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPF 235

Query: 869  ENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGPSTSKTSPLLSVEALEKMMEDPT 1048
            E+ K++  +    + +    VSQNG    QG G   G  ++K S L SV+ALEKMMEDPT
Sbjct: 236  ESVKDNESSSVKEEARVPDEVSQNGAPFNQGFGGFPGSQSTKKSAL-SVDALEKMMEDPT 294

Query: 1049 VQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSS 1228
            VQKMV+P+LPEEMRNP TFKWMLQNPQYRQQL+ ML+NMGGS EWDNRMMD+LKNFDL+S
Sbjct: 295  VQKMVYPHLPEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTLKNFDLNS 354

Query: 1229 PEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVM 1408
            PE+KQQFDQIGL+PEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNP++I KYQNDKEVM
Sbjct: 355  PEVKQQFDQIGLSPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPMNITKYQNDKEVM 414

Query: 1409 DVFNKISELFPG 1444
            +VFNKISELFPG
Sbjct: 415  NVFNKISELFPG 426


>gb|ABF19057.1| plastid Tic40 [Ricinus communis]
          Length = 460

 Score =  415 bits (1067), Expect = e-113
 Identities = 249/458 (54%), Positives = 293/458 (63%), Gaps = 35/458 (7%)
 Frame = +2

Query: 176  LGLISSH----KIVLGIS-PNP-KN-SIFSSKPFVGLPNLIRKTG---KNGRLINPSSSF 325
            +GL+SS     K+V+G   PN  KN ++ ++K F       R      +N +++  SS F
Sbjct: 1    MGLLSSFYASPKLVMGCCYPNSLKNPTVTTNKQFSRTSTSTRALPFSLRNYKIVTRSSRF 60

Query: 326  TVLSLFEAPKATKTIVPEKDVRDCFASITSSGQETSSVGVNXXXXXXXXXXXXXXXX-FW 502
            ++ +L  +  + +     +   + FASI SS Q+TSSVGVN                 FW
Sbjct: 61   SISALAHSHSSPRISGSSRLGAEHFASI-SSRQQTSSVGVNPQPLPPPSSSSQFGSPLFW 119

Query: 503  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 682
            IGVGVGLSA+FS VA RVK YAM+QAFK+   QMN QN+ F N A               
Sbjct: 120  IGVGVGLSAIFSLVATRVKNYAMQQAFKSMMNQMNTQNDQFNNPAFSPGSAFPFPTPPAS 179

Query: 683  XXXXXXXXX----------------------HVASQP-VTVDVPATKVEDPPSISVKEKV 793
                                            VASQP VTVDV ATKVE       K++ 
Sbjct: 180  VPASSPPFPTSSTSRPATSPSYPTSSASTSPSVASQPAVTVDVSATKVEAASVTDAKDEA 239

Query: 794  EPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGAS 973
            E    PKKYAFVDVSPEET  K+ F++ ++ ++T +  D Q +  V QNG AS QG    
Sbjct: 240  EITKEPKKYAFVDVSPEETFPKSPFKSNEDILETSTSKDTQFNPEVLQNGAASNQGAADF 299

Query: 974  EGP-STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQD 1150
             G  ST K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP+TFKWMLQNPQYRQQL++
Sbjct: 300  TGSQSTRKAGSGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPSTFKWMLQNPQYRQQLEE 359

Query: 1151 MLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNP 1330
            MLNNM G+ EWDNRMMDSLKNFDLSSPE+KQQFDQIGLTPEEVISKIMANP++AMAFQNP
Sbjct: 360  MLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQQFDQIGLTPEEVISKIMANPEIAMAFQNP 419

Query: 1331 RVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 1444
            RVQ AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG
Sbjct: 420  RVQQAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPG 457


>gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial
            [Theobroma cacao]
          Length = 412

 Score =  407 bits (1045), Expect = e-110
 Identities = 247/431 (57%), Positives = 277/431 (64%), Gaps = 13/431 (3%)
 Frame = +2

Query: 173  NLGLISSHK-----IVLGIS-PN--PKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 328
            NL L+SS        +LG + PN  PKN  F + PF    NL  +  +     +  S  T
Sbjct: 5    NLALVSSSSPPLKLYLLGCNHPNYTPKNP-FKTLPFPS-SNLAPRRSRISIFAHSHSQPT 62

Query: 329  VLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXXXXXXXXFWI 505
                   P+    IV  K   + FASI+SS  Q+TSSVGVN                FWI
Sbjct: 63   ------PPRRLPHIVLRKLGDERFASISSSSSQQTSSVGVNPNPTVPPPSSQIGSPLFWI 116

Query: 506  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 685
            GVGVGLSALF+WVA  +KKYAM+QAFKT   QMN QNN F NAA                
Sbjct: 117  GVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSNAAFPLGSPFPFPAPPSPG 176

Query: 686  XXXXXXXXHVASQPVTVDVPATKVEDPPSISVKEKVEPESG---PKKYAFVDVSPEETLQ 856
                      +   VTVDVPATKVE  P+ +   +V+ E+    PKKYAFVDVSPEET+Q
Sbjct: 177  PVTSPSPS--SQTAVTVDVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQ 234

Query: 857  KNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASEGP-STSKTSPLLSVEALEKM 1033
            K+AFE+              +    S N    K   GA  G  ST    P LSV+ALEKM
Sbjct: 235  KSAFED-------------AAGISSSNNTQFPKDDAGAFGGSQSTGSADPALSVDALEKM 281

Query: 1034 MEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKN 1213
            MEDPTVQKMV+PYLPEEMRNP TFKWMLQNPQYRQQLQDMLNNMGGS EWDNRMMDSLKN
Sbjct: 282  MEDPTVQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKN 341

Query: 1214 FDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 1393
            FDL+SP++KQQFDQIGLTPEEVISKIMANP+VAMAFQNPRVQAAIMDCSQNPLSIAKYQN
Sbjct: 342  FDLNSPDVKQQFDQIGLTPEEVISKIMANPEVAMAFQNPRVQAAIMDCSQNPLSIAKYQN 401

Query: 1394 DKEVMDVFNKI 1426
            DKEVMDVFNKI
Sbjct: 402  DKEVMDVFNKI 412


>ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus]
          Length = 419

 Score =  400 bits (1028), Expect = e-108
 Identities = 214/359 (59%), Positives = 255/359 (71%), Gaps = 3/359 (0%)
 Frame = +2

Query: 386  VRDCFASITSS--GQETSSVGVNXXXXXXXXXXXXXXXXFWIGVGVGLSALFSWVAGRVK 559
            V + FA+++SS    ++SSVGV                 FW+GVGVGLSALF+WVA  +K
Sbjct: 66   VAERFATVSSSTTSNDSSSVGV-PSVSIPPPSSYVGSPLFWVGVGVGLSALFTWVASYLK 124

Query: 560  KYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXXXXXXXXXXHVASQPVTVD 739
            KYAM+QAFKT   QMN+QN+P  N                           V+   V++D
Sbjct: 125  KYAMQQAFKTMMSQMNSQNSPMSNPTLSSGSPFPIPPTFATGTTISPS---VSEPAVSID 181

Query: 740  VPATKVEDPPSISVKEKVEPESGPKKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQS 919
            V ATKVE+ P  +VK + E     KK+AFVDVSPEET QK+ F+  +++   D     Q 
Sbjct: 182  VTATKVEEEPVTNVKSRTENMEA-KKFAFVDVSPEETDQKSPFK--EDATDADVSKSAQP 238

Query: 920  SQPVSQNGTASKQGTGASEGPSTS-KTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNP 1096
            +Q + QNG ASKQ    S+G   S K   +LSVEA+EKMMEDPTVQKM++P+LPEEMRNP
Sbjct: 239  TQELPQNGAASKQAYNGSDGSQFSRKPGSVLSVEAVEKMMEDPTVQKMIYPHLPEEMRNP 298

Query: 1097 TTFKWMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEE 1276
             TFKWM+QNP YRQQL++MLNNM GSP+WD R+MDSLKNFDLSSPE+KQQFDQIGLTPEE
Sbjct: 299  ETFKWMMQNPLYRQQLEEMLNNMSGSPQWDGRLMDSLKNFDLSSPEVKQQFDQIGLTPEE 358

Query: 1277 VISKIMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1453
            VISKIMANP++AMAFQNPRVQAAIMDCSQNPLSI KYQNDKEVMDVFNKISELFPG +G
Sbjct: 359  VISKIMANPEIAMAFQNPRVQAAIMDCSQNPLSITKYQNDKEVMDVFNKISELFPGVSG 417


>ref|NP_197165.1| protein TIC 40 [Arabidopsis thaliana]
            gi|75309208|sp|Q9FMD5.1|TIC40_ARATH RecName: Full=Protein
            TIC 40, chloroplastic; AltName: Full=Protein PIGMENT
            DEFECTIVE EMBRYO 120; AltName: Full=Translocon at the
            inner envelope membrane of chloroplasts 40;
            Short=AtTIC40; Flags: Precursor
            gi|16226313|gb|AAL16131.1|AF428299_1 AT5g16620/MTG13_6
            [Arabidopsis thaliana] gi|10176971|dbj|BAB10189.1|
            translocon Tic40-like protein [Arabidopsis thaliana]
            gi|20260222|gb|AAM13009.1| translocon Tic40-like protein
            [Arabidopsis thaliana] gi|30387547|gb|AAP31939.1|
            At5g16620 [Arabidopsis thaliana]
            gi|332004935|gb|AED92318.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 447

 Score =  395 bits (1015), Expect = e-107
 Identities = 234/456 (51%), Positives = 276/456 (60%), Gaps = 27/456 (5%)
 Frame = +2

Query: 167  MENLGLIS----SHKIVLG--ISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 328
            MENL L+S    S K+++G   + + KN    S+     PN++ +  K       S+S  
Sbjct: 1    MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRR---TPNIVLRCSKI------SASAQ 51

Query: 329  VLSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFW 502
              S    P+ T  IV  K     FASI SS   Q+T+SV                   FW
Sbjct: 52   SQSPSSRPENTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 503  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 682
            IGVGVGLSALFS+V   +KKYAM+ A KT   QMN QN+ F N+                
Sbjct: 112  IGVGVGLSALFSYVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNSGFPSGSPFPFPFPPQT 171

Query: 683  XXXXXXXXXHVASQPVTVDVPATKVEDPPSISVK----------------EKVEPESGPK 814
                        S   TVDV ATKVE PPS   K                E  + +   K
Sbjct: 172  SPASSPFQSQSQSSGATVDVTATKVETPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 815  KYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPS 985
             YAF D+SPEET +++ F NY E  +T+SP + +  + V QNG     G  ASE      
Sbjct: 232  NYAFEDISPEETTKESPFSNYAEVSETNSPKETRLFEDVLQNGAGPANGATASEVFQSLG 291

Query: 986  TSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNM 1165
              K  P LSVEALEKMMEDPTVQKMV+PYLPEEMRNP TFKWML+NPQYRQQLQDMLNNM
Sbjct: 292  GGKGGPGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNM 351

Query: 1166 GGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAA 1345
             GS EWD RM D+LKNFDL+SPE+KQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA
Sbjct: 352  SGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAA 411

Query: 1346 IMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1453
            +M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 412  LMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447


>ref|XP_006400200.1| hypothetical protein EUTSA_v10013528mg [Eutrema salsugineum]
            gi|557101290|gb|ESQ41653.1| hypothetical protein
            EUTSA_v10013528mg [Eutrema salsugineum]
          Length = 449

 Score =  390 bits (1003), Expect = e-106
 Identities = 238/457 (52%), Positives = 274/457 (59%), Gaps = 28/457 (6%)
 Frame = +2

Query: 167  MENLGLIS----SHKIVLGISPNPKNSIFSSKPFVGLPNLIRKTGKNG-RLINPSSSFTV 331
            MENL L+S    S K+++G      N   S K  VG     R+T K   R    S+S   
Sbjct: 1    MENLTLVSCSASSPKLLIGC-----NFTSSLKNPVGFS---RRTPKVVFRCSKISASAKS 52

Query: 332  LSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFWI 505
             S    P+    IV  K     FASI SS   Q+T+SV                   FWI
Sbjct: 53   QSHSSRPENAGEIVVVKHRSRDFASIFSSNRDQQTTSVAYPNAAVPPPSSSTIGSPLFWI 112

Query: 506  GVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXXX 685
            GVGVGLSALFSWV   +KKYAM+ A KT   QMN QN+ F N                  
Sbjct: 113  GVGVGLSALFSWVTSSLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPAGSASPFPFPFPPQ 172

Query: 686  XXXXXXXXHVASQP--VTVDVPATKVEDPPSIS----------------VKEKVEPESGP 811
                       SQ    TVDV ATKV+ PPS                  V E+ + +   
Sbjct: 173  TSPTSSPFQSQSQSSGATVDVTATKVDTPPSAKPQPTPAKKTEVDKPSVVLEENKAKKEE 232

Query: 812  KKYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GP 982
            K YAF DVSPEET +++ F NY E  +T +P + +  + V QNG A   G  ASE     
Sbjct: 233  KNYAFEDVSPEETTKESPFSNYAEVSETSAPKEARLFEDVMQNGAAPANGATASEVFQSL 292

Query: 983  STSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNN 1162
               K  P LSVEALEKMMEDPTVQKMV+P+LPEEMRNP TFKWML+NP YRQQLQDMLNN
Sbjct: 293  GAGKGGPGLSVEALEKMMEDPTVQKMVYPHLPEEMRNPETFKWMLKNPHYRQQLQDMLNN 352

Query: 1163 MGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQA 1342
            M GS EWD RMMD+LKNFDL+SPE+KQQFDQIGLTPEEVISKIM NPDVAMAFQNPRVQA
Sbjct: 353  MSGSGEWDKRMMDTLKNFDLNSPEVKQQFDQIGLTPEEVISKIMENPDVAMAFQNPRVQA 412

Query: 1343 AIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1453
            A+M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 413  ALMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 449


>ref|XP_004307173.1| PREDICTED: protein TIC 40, chloroplastic-like [Fragaria vesca subsp.
            vesca]
          Length = 448

 Score =  386 bits (992), Expect = e-104
 Identities = 224/415 (53%), Positives = 261/415 (62%), Gaps = 31/415 (7%)
 Frame = +2

Query: 302  LINPSSSFTVLSLFEAPKATKTIVPEKDVRDCFASITSSG-QETSSVGVNXXXXXXXXXX 478
            L +P+S  TV        A    V  K   + FASI+S+  QETSSVG+N          
Sbjct: 40   LSSPNSRLTV----RLSAAANQPVTSKLQTERFASISSTNSQETSSVGINPQFSAPPPPS 95

Query: 479  XXXXXXFWIGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXX 658
                  FWIGVGV  SA+FSW AG+++KY ++QAFK    QMN QN+ F NAA       
Sbjct: 96   TIGSPLFWIGVGVAFSAVFSWAAGKLQKYVVQQAFKNVMGQMNTQNDQFSNAA---FSPG 152

Query: 659  XXXXXXXXXXXXXXXXXHVASQPVTVDVPATKVEDP--------PSISVKEKVEPESGPK 814
                                SQP   DV AT+V+ P        P+  VK + E +    
Sbjct: 153  SPFPFPSAPASPSASPFSAPSQPSFTDVSATEVDSPASSATPSTPAADVKSE-EQQMKEN 211

Query: 815  KY---------------------AFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPV 931
            ++                     AFVDV+PEET  K+ F +     +  S  +  S+   
Sbjct: 212  RFGNSFEIERNNVIQFSRQLSDRAFVDVNPEETELKSPFASSLNDTEPGSSKEINSNVEG 271

Query: 932  SQNGTASKQGTGASEG-PSTSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFK 1108
            SQNG A KQ   AS G  +T K + +LSVEALEKM+EDPTVQKMV+PYLPEEMRNPTTFK
Sbjct: 272  SQNGAAFKQAKDASMGSQTTGKENSVLSVEALEKMLEDPTVQKMVYPYLPEEMRNPTTFK 331

Query: 1109 WMLQNPQYRQQLQDMLNNMGGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISK 1288
            WMLQNPQYRQQL+DML NM GS EWDNRMMDSLKNFDLSSPE+K+QFDQIGLTPE+VISK
Sbjct: 332  WMLQNPQYRQQLEDMLRNMTGSNEWDNRMMDSLKNFDLSSPEVKEQFDQIGLTPEQVISK 391

Query: 1289 IMANPDVAMAFQNPRVQAAIMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1453
            IMANPDVAMAFQNPRVQAAIMDCSQNP+SI KYQNDKEVMDVFNKISELFPG +G
Sbjct: 392  IMANPDVAMAFQNPRVQAAIMDCSQNPMSITKYQNDKEVMDVFNKISELFPGVSG 446


>ref|XP_002871727.1| hypothetical protein ARALYDRAFT_488521 [Arabidopsis lyrata subsp.
            lyrata] gi|297317564|gb|EFH47986.1| hypothetical protein
            ARALYDRAFT_488521 [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  385 bits (990), Expect = e-104
 Identities = 230/456 (50%), Positives = 269/456 (58%), Gaps = 27/456 (5%)
 Frame = +2

Query: 167  MENLGLIS----SHKIVLG--ISPNPKNSIFSSKPFVGLPNLIRKTGKNGRLINPSSSFT 328
            MENL L+S    S K+++G   + + KN    S+    +     K   + +  +PSS   
Sbjct: 1    MENLTLVSCSASSPKLLIGCNFTSSLKNPTGFSRRTPRIVLRCSKISASAQSQSPSSR-- 58

Query: 329  VLSLFEAPKATKTIVPEKDVRDCFASITSSG--QETSSVGVNXXXXXXXXXXXXXXXXFW 502
                   P  T  IV  K     FASI SS   Q+T+SV                   FW
Sbjct: 59   -------PDNTGEIVVVKQRSKAFASIFSSSRDQQTTSVASPSVPVPPPSSSTIGSPLFW 111

Query: 503  IGVGVGLSALFSWVAGRVKKYAMEQAFKTFTQQMNAQNNPFGNAAXXXXXXXXXXXXXXX 682
            IGVGVGLSALFS V   +KKYAM+ A KT   QMN QN+ F N                 
Sbjct: 112  IGVGVGLSALFSLVTSNLKKYAMQTAMKTMMNQMNTQNSQFNNPGFPSGSPFPFPFPPQT 171

Query: 683  XXXXXXXXXHVASQPVTVDVPATKVEDPPSISVK----------------EKVEPESGPK 814
                        S   TVDV ATKV+ PPS   K                E  + +   K
Sbjct: 172  SPASSPFQSQSQSSGATVDVTATKVDTPPSTKPKPTPAKDIEVDKPSVVLEASKEKKEEK 231

Query: 815  KYAFVDVSPEETLQKNAFENYKESIQTDSPNDPQSSQPVSQNGTASKQGTGASE---GPS 985
             YAF D+SPEET +++ F NY E  +T SP + +  + V QNG     G  ASE      
Sbjct: 232  NYAFEDISPEETTKESPFSNYAEVSETSSPKETRLFEDVLQNGAGPANGATASEVFQSLG 291

Query: 986  TSKTSPLLSVEALEKMMEDPTVQKMVFPYLPEEMRNPTTFKWMLQNPQYRQQLQDMLNNM 1165
              K    LSVEALEKMMEDPTVQKMV+PYLPEEMRNP TFKWML+NPQYRQQLQDMLNNM
Sbjct: 292  GGKGGAGLSVEALEKMMEDPTVQKMVYPYLPEEMRNPETFKWMLKNPQYRQQLQDMLNNM 351

Query: 1166 GGSPEWDNRMMDSLKNFDLSSPEIKQQFDQIGLTPEEVISKIMANPDVAMAFQNPRVQAA 1345
             GS EWD RM D+LKNFDL+SPE+KQQF+QIGLTPEEVISKIM NPDVAMAFQNPRVQAA
Sbjct: 352  SGSGEWDKRMTDTLKNFDLNSPEVKQQFNQIGLTPEEVISKIMENPDVAMAFQNPRVQAA 411

Query: 1346 IMDCSQNPLSIAKYQNDKEVMDVFNKISELFPGATG 1453
            +M+CS+NP++I KYQNDKEVMDVFNKIS+LFPG TG
Sbjct: 412  LMECSENPMNIMKYQNDKEVMDVFNKISQLFPGMTG 447


Top