BLASTX nr result

ID: Atropa21_contig00007933 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00007933
         (702 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...   367   3e-99
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...   365   6e-99
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...   273   4e-71
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...   261   2e-67
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...   259   8e-67
gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein i...   258   1e-66
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...   257   2e-66
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                      256   6e-66
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...   254   2e-65
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                       254   2e-65
gb|EXB68023.1| hypothetical protein L484_009630 [Morus notabilis]     253   4e-65
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...   253   4e-65
gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein i...   250   3e-64
ref|XP_006858564.1| hypothetical protein AMTR_s00071p00175860 [A...   247   3e-63
gb|ESW18933.1| hypothetical protein PHAVU_006G083300g [Phaseolus...   246   4e-63
gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus...   246   4e-63
ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-lik...   239   5e-61
gb|EOY03908.1| Hydroxyproline-rich glycoprotein family protein i...   238   2e-60
ref|XP_006372573.1| hypothetical protein POPTR_0017s02900g [Popu...   234   2e-59
ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Popu...   234   2e-59

>ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum
           lycopersicum]
          Length = 443

 Score =  367 bits (941), Expect = 3e-99
 Identities = 182/196 (92%), Positives = 187/196 (95%)
 Frame = -3

Query: 589 SQPVTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEA 410
           SQPVTVDVSATKVEE  P VNVKND EAEKEPKKNAFVDISPDETFQKGAFEN+K+S E 
Sbjct: 198 SQPVTVDVSATKVEEP-PTVNVKNDKEAEKEPKKNAFVDISPDETFQKGAFENFKDSAET 256

Query: 409 VSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP 230
            + TVD+VTQNGAASQSGFG+NTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP
Sbjct: 257 AAVTVDQVTQNGAASQSGFGSNTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP 316

Query: 229 EEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 50
           EEMRN TTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI
Sbjct: 317 EEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 376

Query: 49  GLTPEEVISKIMANPD 2
           GLTPEEVISKIMANPD
Sbjct: 377 GLTPEEVISKIMANPD 392


>ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum]
          Length = 443

 Score =  365 bits (938), Expect = 6e-99
 Identities = 182/196 (92%), Positives = 186/196 (94%)
 Frame = -3

Query: 589 SQPVTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEA 410
           SQPVTVDVSATKVEE  P VNVKNDTEA KEPKKNAFVDISPDETFQKGAFEN+K+STE 
Sbjct: 198 SQPVTVDVSATKVEEP-PTVNVKNDTEAGKEPKKNAFVDISPDETFQKGAFENFKDSTET 256

Query: 409 VSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP 230
            S TVD+VTQNGAASQ GFG NTSDSTSSTGKSNPL+SVDALEKMMEDPTVQKMVYPYLP
Sbjct: 257 ASVTVDQVTQNGAASQLGFGPNTSDSTSSTGKSNPLMSVDALEKMMEDPTVQKMVYPYLP 316

Query: 229 EEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 50
           EEMRN TTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI
Sbjct: 317 EEMRNPTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 376

Query: 49  GLTPEEVISKIMANPD 2
           GLTPEEVISKIMANPD
Sbjct: 377 GLTPEEVISKIMANPD 392


>ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera]
           gi|296089465|emb|CBI39284.3| unnamed protein product
           [Vitis vinifera]
          Length = 436

 Score =  273 bits (698), Expect = 4e-71
 Identities = 140/197 (71%), Positives = 158/197 (80%), Gaps = 4/197 (2%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEAVSA 401
           VTVDV ATKVE   PA +VK+D E + E  K AFVD+SP+ET Q+  FEN++ESTE  S+
Sbjct: 192 VTVDVPATKVETP-PATDVKDDIEKKNEQNKYAFVDVSPEETLQESPFENFEESTETSSS 250

Query: 400 TVDE----VTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYL 233
              +    V+QNG   + G G   S+ + ST  +NP LSVDALEKMMEDPTVQKMVYPYL
Sbjct: 251 KDAQFSAGVSQNGTPPRPGMGV--SEDSQSTRNANPFLSVDALEKMMEDPTVQKMVYPYL 308

Query: 232 PEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQ 53
           PEEMRN TTFKWMLQNPQYRQQLQDM+NNMGG  EWDNRMMD+LKNFDLSSPE+KQQFDQ
Sbjct: 309 PEEMRNPTTFKWMLQNPQYRQQLQDMLNNMGGGAEWDNRMMDNLKNFDLSSPEVKQQFDQ 368

Query: 52  IGLTPEEVISKIMANPD 2
           IGLTPEEVISKIMANPD
Sbjct: 369 IGLTPEEVISKIMANPD 385


>ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 432

 Score =  261 bits (667), Expect = 2e-67
 Identities = 133/196 (67%), Positives = 157/196 (80%), Gaps = 3/196 (1%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYK--ESTEAV 407
           +TVD+ A KVE  +P  NVK++ E + EPKK AFVD+SP+ET Q+  FE++K  ES+   
Sbjct: 192 ITVDIPAAKVEV-APTTNVKDEVEVKNEPKKIAFVDVSPEETVQESPFESFKDDESSSVK 250

Query: 406 SATV-DEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP 230
            A V DEV+QNGA S  GFG    D   S      +LSVDALEKMMEDPTVQKMVYPYLP
Sbjct: 251 EARVPDEVSQNGAPSNQGFG----DFPGSQSTKKSVLSVDALEKMMEDPTVQKMVYPYLP 306

Query: 229 EEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 50
           EEMRN TTFKWMLQNPQYRQQL++M+NNMGG+ EWD+RMMD+LKNFDL+SPE+KQQFDQI
Sbjct: 307 EEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSPEVKQQFDQI 366

Query: 49  GLTPEEVISKIMANPD 2
           GL+PEEVISKIMANP+
Sbjct: 367 GLSPEEVISKIMANPE 382


>ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 429

 Score =  259 bits (661), Expect = 8e-67
 Identities = 133/196 (67%), Positives = 156/196 (79%), Gaps = 3/196 (1%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYK--ESTEAV 407
           +TVD+ A KVE  +P  NVK++ E + EPKK AFVD+SP+ET ++  FE++K  ES+   
Sbjct: 189 ITVDLPAAKVEA-APTTNVKDEVELKNEPKKIAFVDVSPEETVRESPFESFKDDESSSVK 247

Query: 406 SATV-DEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP 230
            A V DEV+QNGA S  GFG    D   S       LSVDALEKMMEDPTVQKMVYPYLP
Sbjct: 248 EAWVPDEVSQNGAPSNLGFG----DFPGSQSTKKSALSVDALEKMMEDPTVQKMVYPYLP 303

Query: 229 EEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 50
           EEMRN TTFKWMLQNPQYRQQL++M+NNMGG+ EWDNRMMD+LKNFDL+SPE+KQQFDQI
Sbjct: 304 EEMRNPTTFKWMLQNPQYRQQLEEMLNNMGGSTEWDNRMMDTLKNFDLNSPEVKQQFDQI 363

Query: 49  GLTPEEVISKIMANPD 2
           GL+PEEVISKIMANP+
Sbjct: 364 GLSPEEVISKIMANPE 379


>gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2
           [Theobroma cacao]
          Length = 433

 Score =  258 bits (660), Expect = 1e-66
 Identities = 136/197 (69%), Positives = 157/197 (79%), Gaps = 4/197 (2%)
 Frame = -3

Query: 580 VTVDVSATKVEEK---SPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEA 410
           VTVDV ATKVE     +PA  VK++TE   EPKK AFVD+SP+ET QK AFE+    + +
Sbjct: 189 VTVDVPATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAFEDAAGISSS 247

Query: 409 VSATVD-EVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYL 233
            +     +V+ NGAAS+   GA     + STG ++P LSVDALEKMMEDPTVQKMVYPYL
Sbjct: 248 NNTQFPKDVSDNGAASKQDAGA--FGGSQSTGSADPALSVDALEKMMEDPTVQKMVYPYL 305

Query: 232 PEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQ 53
           PEEMRN  TFKWMLQNPQYRQQLQDM+NNMGG+ EWDNRMMDSLKNFDL+SP++KQQFDQ
Sbjct: 306 PEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNFDLNSPDVKQQFDQ 365

Query: 52  IGLTPEEVISKIMANPD 2
           IGLTPEEVISKIMANP+
Sbjct: 366 IGLTPEEVISKIMANPE 382


>sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName:
           Full=Translocon at the inner envelope membrane of
           chloroplasts 40; Short=PsTIC40; Flags: Precursor
           gi|26000725|gb|AAN75219.1| chloroplast protein
           translocon component Tic40 precursor [Pisum sativum]
          Length = 436

 Score =  257 bits (657), Expect = 2e-66
 Identities = 129/199 (64%), Positives = 156/199 (78%), Gaps = 6/199 (3%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPA--VNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEAV 407
           VTVD+ ATKVE  +PA  +NVK + E + EPKK+AFVD+SP+ET QK AFE +K+  E+ 
Sbjct: 191 VTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNAFERFKDVDESS 250

Query: 406 S----ATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYP 239
           S        E +QNG   + GFG    DS SS  +    LSVDALEKMMEDPTVQ+MVYP
Sbjct: 251 SFKEARAPAEASQNGTPFKQGFG----DSPSSPSERKSALSVDALEKMMEDPTVQQMVYP 306

Query: 238 YLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQF 59
           YLPEEMRN +TFKWM+QNP+YRQQL+ M+NNMGG  EWD+RMMD+LKNFDL+SP++KQQF
Sbjct: 307 YLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLKNFDLNSPDVKQQF 366

Query: 58  DQIGLTPEEVISKIMANPD 2
           DQIGL+P+EVISKIMANPD
Sbjct: 367 DQIGLSPQEVISKIMANPD 385


>emb|CAB50925.1| translocon Tic40 [Pisum sativum]
          Length = 436

 Score =  256 bits (653), Expect = 6e-66
 Identities = 128/199 (64%), Positives = 155/199 (77%), Gaps = 6/199 (3%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPA--VNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEAV 407
           VTVD+ ATKVE  +PA  +NVK + E + EPKK+AFVD+SP+ET QK AFE +K+  E+ 
Sbjct: 191 VTVDIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNAFERFKDVDESS 250

Query: 406 S----ATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYP 239
           S        E +QNG   + GFG    DS  S  +    LSVDALEKMMEDPTVQ+MVYP
Sbjct: 251 SFKEARAPAEASQNGTPFKQGFG----DSPGSPSERKSALSVDALEKMMEDPTVQQMVYP 306

Query: 238 YLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQF 59
           YLPEEMRN +TFKWM+QNP+YRQQL+ M+NNMGG  EWD+RMMD+LKNFDL+SP++KQQF
Sbjct: 307 YLPEEMRNPSTFKWMMQNPEYRQQLEAMLNNMGGGTEWDSRMMDTLKNFDLNSPDVKQQF 366

Query: 58  DQIGLTPEEVISKIMANPD 2
           DQIGL+P+EVISKIMANPD
Sbjct: 367 DQIGLSPQEVISKIMANPD 385


>ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis]
           gi|223528427|gb|EEF30461.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 465

 Score =  254 bits (649), Expect = 2e-65
 Identities = 137/201 (68%), Positives = 156/201 (77%), Gaps = 5/201 (2%)
 Frame = -3

Query: 589 SQP-VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKE--- 422
           SQP VTVDVSATKVE  S   + K++ E  KEPKK AFVD+SP+ETF K  F++ ++   
Sbjct: 219 SQPAVTVDVSATKVEAAS-VTDAKDEAEITKEPKKYAFVDVSPEETFPKSPFKSNEDILE 277

Query: 421 -STEAVSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMV 245
            ST   +    EV QNGAAS  G    T   + ST K+   LSV+ALEKMMEDPTVQKMV
Sbjct: 278 TSTSKDTQFNPEVLQNGAASNQGAADFTG--SQSTRKAGSGLSVEALEKMMEDPTVQKMV 335

Query: 244 YPYLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQ 65
           YPYLPEEMRN +TFKWMLQNPQYRQQL++M+NNM G  EWDNRMMDSLKNFDLSSPE+KQ
Sbjct: 336 YPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQ 395

Query: 64  QFDQIGLTPEEVISKIMANPD 2
           QFDQIGLTPEEVISKIMANP+
Sbjct: 396 QFDQIGLTPEEVISKIMANPE 416


>gb|ABF19057.1| plastid Tic40 [Ricinus communis]
          Length = 460

 Score =  254 bits (649), Expect = 2e-65
 Identities = 137/201 (68%), Positives = 156/201 (77%), Gaps = 5/201 (2%)
 Frame = -3

Query: 589 SQP-VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKE--- 422
           SQP VTVDVSATKVE  S   + K++ E  KEPKK AFVD+SP+ETF K  F++ ++   
Sbjct: 214 SQPAVTVDVSATKVEAAS-VTDAKDEAEITKEPKKYAFVDVSPEETFPKSPFKSNEDILE 272

Query: 421 -STEAVSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMV 245
            ST   +    EV QNGAAS  G    T   + ST K+   LSV+ALEKMMEDPTVQKMV
Sbjct: 273 TSTSKDTQFNPEVLQNGAASNQGAADFTG--SQSTRKAGSGLSVEALEKMMEDPTVQKMV 330

Query: 244 YPYLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQ 65
           YPYLPEEMRN +TFKWMLQNPQYRQQL++M+NNM G  EWDNRMMDSLKNFDLSSPE+KQ
Sbjct: 331 YPYLPEEMRNPSTFKWMLQNPQYRQQLEEMLNNMSGTGEWDNRMMDSLKNFDLSSPEVKQ 390

Query: 64  QFDQIGLTPEEVISKIMANPD 2
           QFDQIGLTPEEVISKIMANP+
Sbjct: 391 QFDQIGLTPEEVISKIMANPE 411


>gb|EXB68023.1| hypothetical protein L484_009630 [Morus notabilis]
          Length = 391

 Score =  253 bits (646), Expect = 4e-65
 Identities = 132/197 (67%), Positives = 156/197 (79%), Gaps = 4/197 (2%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFEN-YKESTEAVS 404
           VTVDV+AT VE  +PA +VK++TE + E KK AFVD+SP+ET QK  FE+  K++ E +S
Sbjct: 164 VTVDVAATTVEA-TPAADVKDETEQKTEAKKFAFVDVSPEETKQKSPFESSLKDAEETIS 222

Query: 403 ATVDE---VTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYL 233
           +       V+QNG  S+ G GA    S  S  +    +SV+ALEKMMEDPTVQKMVYPYL
Sbjct: 223 SNEGPTAGVSQNGTTSKHGVGA----SQESPPRQESTISVEALEKMMEDPTVQKMVYPYL 278

Query: 232 PEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQ 53
           PEEMRN TTFKWMLQNPQYRQQL+DM+ NMGGN +WDNR+MDSLKNFDLSSP++KQQFDQ
Sbjct: 279 PEEMRNPTTFKWMLQNPQYRQQLEDMLKNMGGNSQWDNRVMDSLKNFDLSSPDVKQQFDQ 338

Query: 52  IGLTPEEVISKIMANPD 2
           IGLTPEEVISKIMANPD
Sbjct: 339 IGLTPEEVISKIMANPD 355


>ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum]
          Length = 433

 Score =  253 bits (646), Expect = 4e-65
 Identities = 129/197 (65%), Positives = 157/197 (79%), Gaps = 4/197 (2%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEAVS- 404
           VTVD+ ATKVE  +P+ N K++ E + EPKK  FVD+SP+E+ QK  FE++K+  E+ S 
Sbjct: 191 VTVDIPATKVEA-APSTNAKDEVEVKNEPKKIGFVDVSPEESVQKSPFESFKDVDESSSF 249

Query: 403 ---ATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYL 233
                  E  QNGA S  GFG   S  + S GKS  +LSV+ALEKMMEDPTVQKMVYPYL
Sbjct: 250 KEARAPAEAFQNGAPSNQGFG--NSPGSQSGGKS--VLSVEALEKMMEDPTVQKMVYPYL 305

Query: 232 PEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQ 53
           PEEMRN +TFKWMLQNPQYRQQL++M+NNMGG+ EWD+RMMD+LKNFDL+SP++KQQFDQ
Sbjct: 306 PEEMRNPSTFKWMLQNPQYRQQLEEMLNNMGGSTEWDSRMMDTLKNFDLNSPDVKQQFDQ 365

Query: 52  IGLTPEEVISKIMANPD 2
           IGL+PEEVISKIMANP+
Sbjct: 366 IGLSPEEVISKIMANPE 382


>gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial
           [Theobroma cacao]
          Length = 412

 Score =  250 bits (639), Expect = 3e-64
 Identities = 133/196 (67%), Positives = 149/196 (76%), Gaps = 3/196 (1%)
 Frame = -3

Query: 580 VTVDVSATKVEEK---SPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEA 410
           VTVDV ATKVE     +PA  VK++TE   EPKK AFVD+SP+ET QK AFE      +A
Sbjct: 189 VTVDVPATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAFE------DA 241

Query: 409 VSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP 230
              +    TQ        FG +      STG ++P LSVDALEKMMEDPTVQKMVYPYLP
Sbjct: 242 AGISSSNNTQFPKDDAGAFGGS-----QSTGSADPALSVDALEKMMEDPTVQKMVYPYLP 296

Query: 229 EEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 50
           EEMRN  TFKWMLQNPQYRQQLQDM+NNMGG+ EWDNRMMDSLKNFDL+SP++KQQFDQI
Sbjct: 297 EEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNFDLNSPDVKQQFDQI 356

Query: 49  GLTPEEVISKIMANPD 2
           GLTPEEVISKIMANP+
Sbjct: 357 GLTPEEVISKIMANPE 372


>ref|XP_006858564.1| hypothetical protein AMTR_s00071p00175860 [Amborella trichopoda]
           gi|548862673|gb|ERN20031.1| hypothetical protein
           AMTR_s00071p00175860 [Amborella trichopoda]
          Length = 416

 Score =  247 bits (630), Expect = 3e-63
 Identities = 128/196 (65%), Positives = 150/196 (76%), Gaps = 3/196 (1%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEAVSA 401
           VTVDV+A+ V   S  V VK DT+ +K+ K   FVDISP+E  Q    E  KEST+   A
Sbjct: 183 VTVDVTASDVAPASSTVEVKEDTKTKKQTKTFEFVDISPEEVMQNRPSEQPKESTDGSPA 242

Query: 400 T---VDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYLP 230
                 EV+QNGA  Q+    +T ++  S+  ++ +LSV+ALEKMMEDPTVQKMVYPYLP
Sbjct: 243 KDVHFAEVSQNGALPQTEKSVST-ENVQSSRPADSVLSVEALEKMMEDPTVQKMVYPYLP 301

Query: 229 EEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQI 50
           EEMRN  TFKWMLQNPQYRQQL+DM+NNMGG+ +WDNRMMDSLKNFDLS PE+KQQFDQI
Sbjct: 302 EEMRNPATFKWMLQNPQYRQQLEDMLNNMGGSSDWDNRMMDSLKNFDLSKPEVKQQFDQI 361

Query: 49  GLTPEEVISKIMANPD 2
           GLTPEEVISKIMANPD
Sbjct: 362 GLTPEEVISKIMANPD 377


>gb|ESW18933.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 300

 Score =  246 bits (629), Expect = 4e-63
 Identities = 128/197 (64%), Positives = 154/197 (78%), Gaps = 4/197 (2%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKE----STE 413
           VTVD+ ATKVE  +   ++K++ E + +PKK AFVD+SP+ET QK  FE+ K+    S +
Sbjct: 59  VTVDIPATKVEA-TRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPFESVKDNESSSVK 117

Query: 412 AVSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYL 233
             +   DEV+QNGA    GFG      + ST KS   LSVDALEKMMEDPTVQKMVYP+L
Sbjct: 118 EEARVPDEVSQNGAPFNQGFGG--FPGSQSTKKS--ALSVDALEKMMEDPTVQKMVYPHL 173

Query: 232 PEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQ 53
           PEEMRN  TFKWMLQNPQYRQQL+ M++NMGG+ EWDNRMMD+LKNFDL+SPE+KQQFDQ
Sbjct: 174 PEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTLKNFDLNSPEVKQQFDQ 233

Query: 52  IGLTPEEVISKIMANPD 2
           IGL+PEEVISKIMANP+
Sbjct: 234 IGLSPEEVISKIMANPE 250


>gb|ESW18931.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 430

 Score =  246 bits (629), Expect = 4e-63
 Identities = 128/197 (64%), Positives = 154/197 (78%), Gaps = 4/197 (2%)
 Frame = -3

Query: 580 VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKE----STE 413
           VTVD+ ATKVE  +   ++K++ E + +PKK AFVD+SP+ET QK  FE+ K+    S +
Sbjct: 189 VTVDIPATKVEA-TRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPFESVKDNESSSVK 247

Query: 412 AVSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMVYPYL 233
             +   DEV+QNGA    GFG      + ST KS   LSVDALEKMMEDPTVQKMVYP+L
Sbjct: 248 EEARVPDEVSQNGAPFNQGFGG--FPGSQSTKKS--ALSVDALEKMMEDPTVQKMVYPHL 303

Query: 232 PEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQQFDQ 53
           PEEMRN  TFKWMLQNPQYRQQL+ M++NMGG+ EWDNRMMD+LKNFDL+SPE+KQQFDQ
Sbjct: 304 PEEMRNPDTFKWMLQNPQYRQQLEAMLSNMGGSTEWDNRMMDTLKNFDLNSPEVKQQFDQ 363

Query: 52  IGLTPEEVISKIMANPD 2
           IGL+PEEVISKIMANP+
Sbjct: 364 IGLSPEEVISKIMANPE 380


>ref|XP_004148914.1| PREDICTED: protein TIC 40, chloroplastic-like [Cucumis sativus]
          Length = 419

 Score =  239 bits (611), Expect = 5e-61
 Identities = 126/201 (62%), Positives = 159/201 (79%), Gaps = 5/201 (2%)
 Frame = -3

Query: 589 SQP-VTVDVSATKVEEKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTE 413
           S+P V++DV+ATKVEE+ P  NVK+ TE   E KK AFVD+SP+ET QK  F+  +++T+
Sbjct: 174 SEPAVSIDVTATKVEEE-PVTNVKSRTE-NMEAKKFAFVDVSPEETDQKSPFK--EDATD 229

Query: 412 A----VSATVDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMEDPTVQKMV 245
           A     +    E+ QNGAAS+  +  N SD +  + K   +LSV+A+EKMMEDPTVQKM+
Sbjct: 230 ADVSKSAQPTQELPQNGAASKQAY--NGSDGSQFSRKPGSVLSVEAVEKMMEDPTVQKMI 287

Query: 244 YPYLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSSPEIKQ 65
           YP+LPEEMRN  TFKWM+QNP YRQQL++M+NNM G+P+WD R+MDSLKNFDLSSPE+KQ
Sbjct: 288 YPHLPEEMRNPETFKWMMQNPLYRQQLEEMLNNMSGSPQWDGRLMDSLKNFDLSSPEVKQ 347

Query: 64  QFDQIGLTPEEVISKIMANPD 2
           QFDQIGLTPEEVISKIMANP+
Sbjct: 348 QFDQIGLTPEEVISKIMANPE 368


>gb|EOY03908.1| Hydroxyproline-rich glycoprotein family protein isoform 1
           [Theobroma cacao]
          Length = 531

 Score =  238 bits (606), Expect = 2e-60
 Identities = 137/234 (58%), Positives = 158/234 (67%), Gaps = 41/234 (17%)
 Frame = -3

Query: 580 VTVDVSATKVEEK---SPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFENYKESTEA 410
           VTVDV ATKVE     +PA  VK++TE   EPKK AFVD+SP+ET QK AFE+    + +
Sbjct: 248 VTVDVPATKVEAAPATAPATEVKSETETA-EPKKYAFVDVSPEETVQKSAFEDAAGISSS 306

Query: 409 VSATVD-EVTQNGAASQSGFGA-NTSDST--------SSTGKSNPLLSVDALEKMMEDPT 260
            +     +V+ NGAAS+   GA   S ST        +  G ++P LSVDALEKMMEDPT
Sbjct: 307 NNTQFPKDVSDNGAASKQDAGAFGGSQSTVKLNKHPIALAGSADPALSVDALEKMMEDPT 366

Query: 259 VQKMVYPYLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDLSS 80
           VQKMVYPYLPEEMRN  TFKWMLQNPQYRQQLQDM+NNMGG+ EWDNRMMDSLKNFDL+S
Sbjct: 367 VQKMVYPYLPEEMRNPETFKWMLQNPQYRQQLQDMLNNMGGSTEWDNRMMDSLKNFDLNS 426

Query: 79  PEIKQQF----------------------------DQIGLTPEEVISKIMANPD 2
           P++KQQF                            DQIGLTPEEVISKIMANP+
Sbjct: 427 PDVKQQFVSRWSVSVVLECSLVPEEGSYISLSPFADQIGLTPEEVISKIMANPE 480


>ref|XP_006372573.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa]
           gi|550319202|gb|ERP50370.1| hypothetical protein
           POPTR_0017s02900g [Populus trichocarpa]
          Length = 298

 Score =  234 bits (598), Expect = 2e-59
 Identities = 123/208 (59%), Positives = 150/208 (72%), Gaps = 12/208 (5%)
 Frame = -3

Query: 589 SQP-VTVDVSATKVE-------EKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFE 434
           SQP VTVD+ ATKVE        K    +   + E ++EP+K AFVD+SP+ET     F 
Sbjct: 46  SQPAVTVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKFAFVDVSPEETSLNTPFS 105

Query: 433 NYKESTEAVSAT----VDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMED 266
           + ++  +  S+       E +QNGA  + G  A+     S + +    LSV+ALEKMM+D
Sbjct: 106 SVEDVIDTSSSKDVQFAKEASQNGATFKQGPSASEPSEGSQSSQKAGSLSVEALEKMMDD 165

Query: 265 PTVQKMVYPYLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDL 86
           PTVQKMVYPYLPEEMRN TTFKWMLQNPQYRQQL++M+NNM G+ EWD+RM+DSLKNFDL
Sbjct: 166 PTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMSGSSEWDSRMVDSLKNFDL 225

Query: 85  SSPEIKQQFDQIGLTPEEVISKIMANPD 2
           SSPE+KQQFDQIGLTPEEVISKIMANPD
Sbjct: 226 SSPEVKQQFDQIGLTPEEVISKIMANPD 253


>ref|XP_006372572.1| hypothetical protein POPTR_0017s02900g [Populus trichocarpa]
           gi|550319201|gb|ERP50369.1| hypothetical protein
           POPTR_0017s02900g [Populus trichocarpa]
          Length = 435

 Score =  234 bits (598), Expect = 2e-59
 Identities = 123/208 (59%), Positives = 150/208 (72%), Gaps = 12/208 (5%)
 Frame = -3

Query: 589 SQP-VTVDVSATKVE-------EKSPAVNVKNDTEAEKEPKKNAFVDISPDETFQKGAFE 434
           SQP VTVD+ ATKVE        K    +   + E ++EP+K AFVD+SP+ET     F 
Sbjct: 183 SQPAVTVDIPATKVEAAPETDARKEKETDTLEEREIKEEPRKFAFVDVSPEETSLNTPFS 242

Query: 433 NYKESTEAVSAT----VDEVTQNGAASQSGFGANTSDSTSSTGKSNPLLSVDALEKMMED 266
           + ++  +  S+       E +QNGA  + G  A+     S + +    LSV+ALEKMM+D
Sbjct: 243 SVEDVIDTSSSKDVQFAKEASQNGATFKQGPSASEPSEGSQSSQKAGSLSVEALEKMMDD 302

Query: 265 PTVQKMVYPYLPEEMRNSTTFKWMLQNPQYRQQLQDMMNNMGGNPEWDNRMMDSLKNFDL 86
           PTVQKMVYPYLPEEMRN TTFKWMLQNPQYRQQL++M+NNM G+ EWD+RM+DSLKNFDL
Sbjct: 303 PTVQKMVYPYLPEEMRNPTTFKWMLQNPQYRQQLEEMLNNMSGSSEWDSRMVDSLKNFDL 362

Query: 85  SSPEIKQQFDQIGLTPEEVISKIMANPD 2
           SSPE+KQQFDQIGLTPEEVISKIMANPD
Sbjct: 363 SSPEVKQQFDQIGLTPEEVISKIMANPD 390

