BLASTX nr result

ID: Mentha23_contig00007303 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00007303
         (1423 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38164.1| hypothetical protein MIMGU_mgv1a002443mg [Mimulus...   816   0.0  
gb|EPS58819.1| hypothetical protein M569_15993, partial [Genlise...   785   0.0  
ref|XP_006345081.1| PREDICTED: serine protease SPPA, chloroplast...   766   0.0  
ref|XP_002268894.1| PREDICTED: protease 4-like [Vitis vinifera]       761   0.0  
ref|XP_004236086.1| PREDICTED: protease 4-like [Solanum lycopers...   761   0.0  
ref|XP_006485727.1| PREDICTED: serine protease SPPA, chloroplast...   753   0.0  
ref|XP_007037705.1| Signal peptide peptidase isoform 1 [Theobrom...   736   0.0  
ref|XP_007037706.1| Signal peptide peptidase isoform 2, partial ...   731   0.0  
ref|XP_004138209.1| PREDICTED: protease 4-like [Cucumis sativus]...   729   0.0  
ref|XP_004299267.1| PREDICTED: protease 4-like [Fragaria vesca s...   728   0.0  
ref|XP_007210885.1| hypothetical protein PRUPE_ppa002273mg [Prun...   724   0.0  
ref|XP_004488395.1| PREDICTED: protease 4-like [Cicer arietinum]      720   0.0  
ref|XP_003522978.1| PREDICTED: serine protease SPPA, chloroplast...   716   0.0  
ref|XP_003595673.1| Protease [Medicago truncatula] gi|355484721|...   714   0.0  
ref|XP_006390483.1| hypothetical protein EUTSA_v10018225mg [Eutr...   714   0.0  
ref|XP_007138385.1| hypothetical protein PHAVU_009G204100g [Phas...   711   0.0  
ref|XP_006301110.1| hypothetical protein CARUB_v10021504mg [Caps...   710   0.0  
ref|XP_002887518.1| hypothetical protein ARALYDRAFT_476539 [Arab...   708   0.0  
ref|NP_565077.2| signal peptide peptidase [Arabidopsis thaliana]...   705   0.0  
ref|XP_002322054.2| hypothetical protein POPTR_0015s03620g [Popu...   704   0.0  

>gb|EYU38164.1| hypothetical protein MIMGU_mgv1a002443mg [Mimulus guttatus]
          Length = 674

 Score =  816 bits (2108), Expect = 0.0
 Identities = 408/473 (86%), Positives = 441/473 (93%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMK+RGEISDQLKSRFSSGLSLPQICEN IKAAYDPRISGIYLQIEPLSCGWGKV
Sbjct: 122  KGSVLTMKIRGEISDQLKSRFSSGLSLPQICENLIKAAYDPRISGIYLQIEPLSCGWGKV 181

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRHV+DFKKSGKFIVGYV  CGEKEYY+GSAC+ELY+PPSAYFQLYGLTV ASFLGG
Sbjct: 182  EEIRRHVLDFKKSGKFIVGYVPACGEKEYYIGSACQELYAPPSAYFQLYGLTVSASFLGG 241

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VLEKIGIEPQV+RIGKYKSAGDQLTRKSIS+ENRE LTALLDNIYGNWVE ISL KGK K
Sbjct: 242  VLEKIGIEPQVERIGKYKSAGDQLTRKSISNENREMLTALLDNIYGNWVETISLVKGKKK 301

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI +F++EGV+E+ RLKEDGWITDIKY+DEV S+LK+RL IP+ KKLPTVDYRKYCRVK
Sbjct: 302  EDIENFVNEGVYEVERLKEDGWITDIKYEDEVESLLKERLAIPSSKKLPTVDYRKYCRVK 361

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWT+GLAG +N+I +IRASGSISRVRG LS S+SGIV+EQFIEKIRTVR+SK+YKAVV+R
Sbjct: 362  KWTIGLAGSRNRIAIIRASGSISRVRGSLSTSSSGIVSEQFIEKIRTVRESKRYKAVVLR 421

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREIKLL+A KPV+ASMADV            QTIVAE LTITGSI
Sbjct: 422  IDSPGGDALASDLMWREIKLLAAKKPVVASMADVAASGGYYMAMAAQTIVAENLTITGSI 481

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNLE+LYEKIGFNKEIISRGRYAELTAA+QRPFR DEAELFAKSAQ+AYRSFRD
Sbjct: 482  GVVTGKFNLERLYEKIGFNKEIISRGRYAELTAADQRPFRTDEAELFAKSAQSAYRSFRD 541

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAASSRSMTVDKMEEVAQGRVW+GNDAASRGLVDAIGG+SRAVAIAKQKAN+P
Sbjct: 542  KAASSRSMTVDKMEEVAQGRVWSGNDAASRGLVDAIGGISRAVAIAKQKANIP 594


>gb|EPS58819.1| hypothetical protein M569_15993, partial [Genlisea aurea]
          Length = 560

 Score =  785 bits (2028), Expect = 0.0
 Identities = 394/473 (83%), Positives = 433/473 (91%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVL++KLRGEISDQ + RFSSGLSLPQICENFIKAAYDPR+SGIYL IEPLSCGWGKV
Sbjct: 38   KGSVLSLKLRGEISDQFRGRFSSGLSLPQICENFIKAAYDPRVSGIYLHIEPLSCGWGKV 97

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DF+KSGKF VGY  VCGEKEYY+GSACEELY+PPSAYFQLYGLTVQASFLGG
Sbjct: 98   EEIRRHLLDFRKSGKFAVGYAPVCGEKEYYIGSACEELYAPPSAYFQLYGLTVQASFLGG 157

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VLEK+GIEPQVQRIGKYKSAGDQLTRK+ISDENRE LTALL+NI+ NWVEKIS+A GKTK
Sbjct: 158  VLEKVGIEPQVQRIGKYKSAGDQLTRKNISDENREALTALLNNIFENWVEKISVATGKTK 217

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI +FI+EGV+E+ RLKE+GWITDIKYDDEV+++LK+RL IP+ K LPTVDYRKY RVK
Sbjct: 218  EDIEAFINEGVYEVQRLKEEGWITDIKYDDEVLAILKERLAIPSAKNLPTVDYRKYSRVK 277

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGL GYK+QI +IRASGSISR R PLS   SGIVA+Q IEKI   RDSKKYKAVV+R
Sbjct: 278  KWTLGLTGYKDQIAIIRASGSISRGRSPLS---SGIVADQLIEKISKARDSKKYKAVVLR 334

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREIKLL+ASKPV+ASM+DV            QTIVAE LT+TGSI
Sbjct: 335  IDSPGGDALASDLMWREIKLLAASKPVVASMSDVAASGGYYMAMAAQTIVAEYLTLTGSI 394

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNLE+LYE+IGFNKEIISRG+YAELTAAEQRPFR DEAELFAKSA+NAYRSFRD
Sbjct: 395  GVVTGKFNLERLYERIGFNKEIISRGKYAELTAAEQRPFRPDEAELFAKSAENAYRSFRD 454

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA+SRSMTV+KMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAK+KANLP
Sbjct: 455  KAAASRSMTVEKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKKKANLP 507


>ref|XP_006345081.1| PREDICTED: serine protease SPPA, chloroplastic-like isoform X1
            [Solanum tuberosum] gi|565356460|ref|XP_006345082.1|
            PREDICTED: serine protease SPPA, chloroplastic-like
            isoform X2 [Solanum tuberosum]
          Length = 699

 Score =  766 bits (1978), Expect = 0.0
 Identities = 379/473 (80%), Positives = 421/473 (89%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQL+SRFSSGLSLPQICEN +KAAYDPRISG+YL IEPL CGWGKV
Sbjct: 147  KGSVLTMKLRGQISDQLQSRFSSGLSLPQICENLMKAAYDPRISGVYLHIEPLGCGWGKV 206

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKFIVGY   CGEKEYY+G AC+ELY+PPSAYF LYGLTVQASFLGG
Sbjct: 207  EEIRRHILDFKKSGKFIVGYAPACGEKEYYIGCACQELYAPPSAYFALYGLTVQASFLGG 266

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EK+GIEPQVQRIGKYKSAGDQL RKSISDENRE LTALLDNIYGNW+EK++L KGK K
Sbjct: 267  VFEKVGIEPQVQRIGKYKSAGDQLMRKSISDENREMLTALLDNIYGNWLEKVALTKGKKK 326

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI  F+++GV++I RLKE+ WITDIKYDDEV+SMLK+RL I  DKKLP VDYRKY +V+
Sbjct: 327  EDIEQFVNDGVYQIERLKEESWITDIKYDDEVMSMLKERLGILKDKKLPEVDYRKYSKVR 386

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            +WTLGL GYK+QI +IRASGSISR RGP S+ +SGI+AE+ IEKIR+VR+SK++KAVV+R
Sbjct: 387  RWTLGLTGYKDQIAIIRASGSISRTRGPFSSPSSGIIAEKLIEKIRSVRESKRFKAVVLR 446

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ SKPVIASMADV            Q IVAE LT+TGSI
Sbjct: 447  IDSPGGDALASDLMWREIRLLAESKPVIASMADVAASGGYYMAMAAQAIVAENLTLTGSI 506

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL  LYEKIGFNKE ISRGRYAELTAAEQRPFR +EAELFAKSAQ+AY  FRD
Sbjct: 507  GVVTGKFNLGNLYEKIGFNKETISRGRYAELTAAEQRPFRPEEAELFAKSAQHAYTQFRD 566

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSMTVDKMEEVAQGRVWTG DA SRGLVDA+GGLSRAVAIAKQKAN+P
Sbjct: 567  KAALSRSMTVDKMEEVAQGRVWTGKDALSRGLVDAVGGLSRAVAIAKQKANIP 619


>ref|XP_002268894.1| PREDICTED: protease 4-like [Vitis vinifera]
          Length = 686

 Score =  761 bits (1966), Expect = 0.0
 Identities = 381/473 (80%), Positives = 421/473 (89%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSV TMKLRG+ISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYL IEPLSCGWGKV
Sbjct: 134  KGSVFTMKLRGQISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLHIEPLSCGWGKV 193

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKFIV Y   CGEKEYY+GSAC+ELY+PPSAYF LYGLTVQASFLGG
Sbjct: 194  EEIRRHILDFKKSGKFIVAYAPACGEKEYYLGSACDELYAPPSAYFSLYGLTVQASFLGG 253

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EK+GIEPQVQRIGKYKSAGDQLTRK++S+EN E LTALLDNIYGNW++KIS AKGK +
Sbjct: 254  VFEKVGIEPQVQRIGKYKSAGDQLTRKTMSEENCEMLTALLDNIYGNWLDKISSAKGKKR 313

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED  +FI+EGV+++ +LKE+GWIT+I YDDEV+S+LK+RL  P DK LP VDYRKY +V+
Sbjct: 314  EDTENFINEGVYQVEKLKEEGWITNINYDDEVISILKERLGQPKDKNLPMVDYRKYSKVR 373

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGL+G K+QI VIRASGSISRVR P S   SGI +EQFIEKIR+VRDSK+YKAV+IR
Sbjct: 374  KWTLGLSGGKDQIAVIRASGSISRVRSPFSIPGSGITSEQFIEKIRSVRDSKRYKAVIIR 433

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ASKPVIASM+DV             TIVAE LT+TGSI
Sbjct: 434  IDSPGGDALASDLMWREIRLLAASKPVIASMSDVAASGGYYMAMGAGTIVAENLTLTGSI 493

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL  LYEKIGFNKEIISRGR+AELTAAEQRPFR DEAELFAKSAQNAY+ FRD
Sbjct: 494  GVVTGKFNLGTLYEKIGFNKEIISRGRFAELTAAEQRPFRPDEAELFAKSAQNAYKQFRD 553

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSM VDKMEE AQGRVWTG DAASRGLVDAIGGLSRAVAIAKQKA++P
Sbjct: 554  KAAFSRSMAVDKMEENAQGRVWTGKDAASRGLVDAIGGLSRAVAIAKQKADIP 606


>ref|XP_004236086.1| PREDICTED: protease 4-like [Solanum lycopersicum]
          Length = 705

 Score =  761 bits (1964), Expect = 0.0
 Identities = 377/473 (79%), Positives = 419/473 (88%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLR EISDQL+SRFSSGLSLPQICEN +KAAYDPRISG+YL IEPL CGWGKV
Sbjct: 153  KGSVLTMKLRNEISDQLQSRFSSGLSLPQICENLMKAAYDPRISGVYLHIEPLGCGWGKV 212

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DF+KSGKFIVGY   CGEKEYY+G AC+ELY PPSAYF LYGLTVQASFLGG
Sbjct: 213  EEIRRHILDFRKSGKFIVGYAPACGEKEYYIGCACQELYVPPSAYFALYGLTVQASFLGG 272

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EK+GIEPQVQRIGKYKSAGDQL RKSISDENRE LTALLDNIYGNW+EK++L KGK  
Sbjct: 273  VFEKVGIEPQVQRIGKYKSAGDQLMRKSISDENREMLTALLDNIYGNWLEKVALTKGKKI 332

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI  F+++GV+++ RLKE+ WITDIKYDDEV+SMLK+RL I  D+ LP VDYRKY +V+
Sbjct: 333  EDIEQFVNDGVYQVERLKEESWITDIKYDDEVMSMLKERLGISKDENLPEVDYRKYSKVR 392

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            +WTLGL GYK+QI VIRASGSISR RGP S+S+SGI+AE+ IEKIR+VR+SK++KAVV+R
Sbjct: 393  RWTLGLTGYKDQIAVIRASGSISRTRGPFSSSSSGIIAEKLIEKIRSVRESKRFKAVVLR 452

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ SKPVIASMADV            Q IVAE LT+TGSI
Sbjct: 453  IDSPGGDALASDLMWREIRLLAESKPVIASMADVAASGGYYMAMAAQAIVAENLTLTGSI 512

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKE ISRGRYAELTAAEQRPFR +EAELFAKSAQ+AY  FRD
Sbjct: 513  GVVTGKFNLGKLYEKIGFNKETISRGRYAELTAAEQRPFRPEEAELFAKSAQHAYTQFRD 572

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSMTVDKMEEVAQGRVWTG DA SRGLVDA+GGLSRAVAIAKQKAN+P
Sbjct: 573  KAALSRSMTVDKMEEVAQGRVWTGKDALSRGLVDAVGGLSRAVAIAKQKANIP 625


>ref|XP_006485727.1| PREDICTED: serine protease SPPA, chloroplastic-like [Citrus sinensis]
          Length = 690

 Score =  753 bits (1944), Expect = 0.0
 Identities = 376/473 (79%), Positives = 419/473 (88%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+I+DQLKSRFSSGLSLPQICENF+KAAYDPRI GIYL IEPLSCGWGKV
Sbjct: 138  KGSVLTMKLRGQIADQLKSRFSSGLSLPQICENFVKAAYDPRIVGIYLHIEPLSCGWGKV 197

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRHV+DFKKSGKFI+GYV VCGEKEYY+  ACEELY+PPSAYF LYGLTVQASFLGG
Sbjct: 198  EEIRRHVVDFKKSGKFIIGYVPVCGEKEYYLACACEELYAPPSAYFSLYGLTVQASFLGG 257

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VLEK+GIEPQVQRIGKYKSAGDQLTRK++S+EN E LTALLDNIYGNW++K+S  KGK K
Sbjct: 258  VLEKVGIEPQVQRIGKYKSAGDQLTRKTMSEENCEMLTALLDNIYGNWLDKVSSTKGKRK 317

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI  FI++GV+++ RLKE+G+IT++ YDDEV+SMLK+RL +  DK LP VDYRKY  V+
Sbjct: 318  EDIERFINDGVYKVERLKEEGFITNVLYDDEVISMLKERLGVQKDKNLPMVDYRKYSGVR 377

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            +WTLGL G  +QI VIRASGSISRVR PLS S+SGI+ EQ IEKIR VR+SK+YKA +IR
Sbjct: 378  RWTLGLTGGGDQIAVIRASGSISRVRSPLSLSSSGIIGEQLIEKIRKVRESKRYKAAIIR 437

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LLS SKPVIASM+DV             TI+AE LT+TGSI
Sbjct: 438  IDSPGGDALASDLMWREIRLLSESKPVIASMSDVAASGGYYMAMAAGTILAENLTLTGSI 497

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKEIISRG+YAE+ AAEQRPFR DEAELFAKSAQNAY+ FRD
Sbjct: 498  GVVTGKFNLGKLYEKIGFNKEIISRGKYAEVLAAEQRPFRPDEAELFAKSAQNAYKLFRD 557

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSMTVDKMEE AQGRVWTGNDAASRGLVDA+GG SRAVAIAKQKAN+P
Sbjct: 558  KAAFSRSMTVDKMEEYAQGRVWTGNDAASRGLVDALGGFSRAVAIAKQKANIP 610


>ref|XP_007037705.1| Signal peptide peptidase isoform 1 [Theobroma cacao]
            gi|508774950|gb|EOY22206.1| Signal peptide peptidase
            isoform 1 [Theobroma cacao]
          Length = 689

 Score =  736 bits (1899), Expect = 0.0
 Identities = 362/473 (76%), Positives = 418/473 (88%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQLKSRFSSGLSLPQICENF+KAAYDPRISG+YL +EPL+CGWGKV
Sbjct: 137  KGSVLTMKLRGQISDQLKSRFSSGLSLPQICENFVKAAYDPRISGVYLHMEPLNCGWGKV 196

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH+++FKKSGKFI+ Y+  CGEKEYY+  ACEE+Y+PPSAYF LYGLTVQASFLGG
Sbjct: 197  EEIRRHILNFKKSGKFIIAYIPACGEKEYYLACACEEIYAPPSAYFSLYGLTVQASFLGG 256

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EKIGIEPQVQRIGKYKSAGDQLTRK++S+EN E LT+LLDNIYGNW++ +S +KGK +
Sbjct: 257  VFEKIGIEPQVQRIGKYKSAGDQLTRKTMSEENCEMLTSLLDNIYGNWLDVVSSSKGKKR 316

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED+ +FI+EG++++ +LKE+G IT+I YDD+V+SMLK+RL +P DK L  VDYRKY +V+
Sbjct: 317  EDVENFINEGIYKVEKLKEEGLITNIHYDDQVISMLKERLGVPKDKNLLMVDYRKYSKVR 376

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGLAG ++QI VIRASGSISRVR PLSA +SGI+AEQ  EKIR+VR+SK+YKA +IR
Sbjct: 377  KWTLGLAGGRDQIAVIRASGSISRVRSPLSAPSSGIIAEQINEKIRSVRESKRYKAAIIR 436

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ SKPVIASM+DV             TIVAE LT+TGSI
Sbjct: 437  IDSPGGDALASDLMWREIRLLAESKPVIASMSDVAASGGYYMAMAAGTIVAENLTLTGSI 496

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKE+ISRGRYAEL AAEQRP R DEAELFAKSAQNAY+ FRD
Sbjct: 497  GVVTGKFNLGKLYEKIGFNKEVISRGRYAELFAAEQRPLRLDEAELFAKSAQNAYKQFRD 556

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSM V+KMEEVAQGRVW G DAASRGLVDAIGGLSRA+AIAK +AN+P
Sbjct: 557  KAAFSRSMPVEKMEEVAQGRVWAGRDAASRGLVDAIGGLSRAIAIAKHRANIP 609


>ref|XP_007037706.1| Signal peptide peptidase isoform 2, partial [Theobroma cacao]
            gi|508774951|gb|EOY22207.1| Signal peptide peptidase
            isoform 2, partial [Theobroma cacao]
          Length = 620

 Score =  731 bits (1887), Expect = 0.0
 Identities = 362/474 (76%), Positives = 418/474 (88%), Gaps = 1/474 (0%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQLKSRFSSGLSLPQICENF+KAAYDPRISG+YL +EPL+CGWGKV
Sbjct: 137  KGSVLTMKLRGQISDQLKSRFSSGLSLPQICENFVKAAYDPRISGVYLHMEPLNCGWGKV 196

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH+++FKKSGKFI+ Y+  CGEKEYY+  ACEE+Y+PPSAYF LYGLTVQASFLGG
Sbjct: 197  EEIRRHILNFKKSGKFIIAYIPACGEKEYYLACACEEIYAPPSAYFSLYGLTVQASFLGG 256

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EKIGIEPQVQRIGKYKSAGDQLTRK++S+EN E LT+LLDNIYGNW++ +S +KGK +
Sbjct: 257  VFEKIGIEPQVQRIGKYKSAGDQLTRKTMSEENCEMLTSLLDNIYGNWLDVVSSSKGKKR 316

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED+ +FI+EG++++ +LKE+G IT+I YDD+V+SMLK+RL +P DK L  VDYRKY +V+
Sbjct: 317  EDVENFINEGIYKVEKLKEEGLITNIHYDDQVISMLKERLGVPKDKNLLMVDYRKYSKVR 376

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGLAG ++QI VIRASGSISRVR PLSA +SGI+AEQ  EKIR+VR+SK+YKA +IR
Sbjct: 377  KWTLGLAGGRDQIAVIRASGSISRVRSPLSAPSSGIIAEQINEKIRSVRESKRYKAAIIR 436

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ SKPVIASM+DV             TIVAE LT+TGSI
Sbjct: 437  IDSPGGDALASDLMWREIRLLAESKPVIASMSDVAASGGYYMAMAAGTIVAENLTLTGSI 496

Query: 1083 GVVT-GKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFR 1259
            GVVT GKFNL KLYEKIGFNKE+ISRGRYAEL AAEQRP R DEAELFAKSAQNAY+ FR
Sbjct: 497  GVVTAGKFNLGKLYEKIGFNKEVISRGRYAELFAAEQRPLRLDEAELFAKSAQNAYKQFR 556

Query: 1260 DKAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            DKAA SRSM V+KMEEVAQGRVW G DAASRGLVDAIGGLSRA+AIAK +AN+P
Sbjct: 557  DKAAFSRSMPVEKMEEVAQGRVWAGRDAASRGLVDAIGGLSRAIAIAKHRANIP 610


>ref|XP_004138209.1| PREDICTED: protease 4-like [Cucumis sativus]
            gi|449477130|ref|XP_004154939.1| PREDICTED: protease
            4-like [Cucumis sativus]
          Length = 684

 Score =  729 bits (1883), Expect = 0.0
 Identities = 358/472 (75%), Positives = 415/472 (87%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQLKSRFSSGLSLPQICENF+KAAYDPRISGIYLQIE L+CGWGKV
Sbjct: 132  KGSVLTMKLRGQISDQLKSRFSSGLSLPQICENFVKAAYDPRISGIYLQIEALNCGWGKV 191

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKF+V Y+  C EKEYY+  ACEE+Y+PPSAY  L+GLTVQASFL G
Sbjct: 192  EEIRRHILDFKKSGKFVVAYIPTCQEKEYYLACACEEIYAPPSAYVSLFGLTVQASFLRG 251

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            + +K+GIEPQV+RIGKYKSAGDQL R+++S+EN E LT LLDNIYGNW++K+S   GK K
Sbjct: 252  IFDKVGIEPQVERIGKYKSAGDQLARRNMSEENCEMLTTLLDNIYGNWLDKVSSTNGKKK 311

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            +D+ +FI+EGV++I +LKEDGWIT+I+Y+DEV+SML +RL +P DKK+P VDYRKY RV+
Sbjct: 312  DDVENFINEGVYQIEKLKEDGWITNIQYEDEVLSMLSERLGLPKDKKVPMVDYRKYSRVR 371

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            +WT+GL+G  +QI VIRA GSI+RVR PLS  +SGI+ EQFIEKIRTVR+SK++KA +IR
Sbjct: 372  QWTVGLSGGGDQIAVIRAGGSITRVRSPLSVPSSGIIGEQFIEKIRTVRESKRFKAAIIR 431

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ASKPV+ASMADV             TIVAE LT+TGSI
Sbjct: 432  IDSPGGDALASDLMWREIRLLAASKPVVASMADVAASGGYYMAMAAGTIVAEDLTLTGSI 491

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKEIISRGR+AEL AAEQRPFR DEAELFAKSAQNAY+ FRD
Sbjct: 492  GVVTGKFNLGKLYEKIGFNKEIISRGRFAELLAAEQRPFRPDEAELFAKSAQNAYKQFRD 551

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANL 1418
            KAA SRSMTVD+ME+VAQGRVWTG DAASRGLVDAIGG SRAVAIAK KAN+
Sbjct: 552  KAAFSRSMTVDEMEKVAQGRVWTGKDAASRGLVDAIGGFSRAVAIAKLKANI 603


>ref|XP_004299267.1| PREDICTED: protease 4-like [Fragaria vesca subsp. vesca]
          Length = 678

 Score =  728 bits (1878), Expect = 0.0
 Identities = 357/473 (75%), Positives = 415/473 (87%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTM LRG+I+DQLKSRFSSGLSLPQICENF+KAAYDPRI+G+YLQIE L+CGWGKV
Sbjct: 126  KGSVLTMTLRGQITDQLKSRFSSGLSLPQICENFVKAAYDPRIAGVYLQIESLNCGWGKV 185

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DF+KSGKF+V Y   C EKEYY+ SAC+E+Y+PPSAYF L+GL+VQASF+ G
Sbjct: 186  EEIRRHILDFQKSGKFVVAYAPACSEKEYYLASACQEIYAPPSAYFSLFGLSVQASFVRG 245

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VLEKIG+EPQV+RIGKYKSAGDQL R ++S+EN E LTALLDNIYGNW++ IS  +GK +
Sbjct: 246  VLEKIGVEPQVERIGKYKSAGDQLARTTMSEENCEMLTALLDNIYGNWLDIISFTRGKKR 305

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI +FI+EGV+++ +LKE+GWIT+I+YDDEV SMLK+RL +  +KKLP VDYRKY +V+
Sbjct: 306  EDIENFINEGVYQVEKLKEEGWITNIQYDDEVTSMLKERLGVEKEKKLPMVDYRKYSKVR 365

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGL+G K++I +IRASGSISRVRG  S   S IV EQFIEKIRT+R+SK+YKA +IR
Sbjct: 366  KWTLGLSGGKDKIAIIRASGSISRVRGSFSLPGSSIVGEQFIEKIRTIRESKRYKAAIIR 425

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREIKLL+ASKPVIASM+DV              IVAE LT+TGSI
Sbjct: 426  IDSPGGDALASDLMWREIKLLAASKPVIASMSDVAASGGYYMAMAADAIVAENLTLTGSI 485

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKEIISRG++AE+ AAEQRPFR +EAELFAKSAQN+Y+ FRD
Sbjct: 486  GVVTGKFNLGKLYEKIGFNKEIISRGKFAEVLAAEQRPFRAEEAELFAKSAQNSYKQFRD 545

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAASSRSMTVDKMEEVAQGRVW G DAASRGLVDAIGGLSRAVAIAK KAN+P
Sbjct: 546  KAASSRSMTVDKMEEVAQGRVWAGKDAASRGLVDAIGGLSRAVAIAKLKANIP 598


>ref|XP_007210885.1| hypothetical protein PRUPE_ppa002273mg [Prunus persica]
            gi|462406620|gb|EMJ12084.1| hypothetical protein
            PRUPE_ppa002273mg [Prunus persica]
          Length = 693

 Score =  724 bits (1870), Expect = 0.0
 Identities = 358/473 (75%), Positives = 415/473 (87%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG++SDQLKSRFSSGLSLPQICEN +KAAYDPRISG+YLQIE L+CGWGKV
Sbjct: 140  KGSVLTMKLRGQVSDQLKSRFSSGLSLPQICENLVKAAYDPRISGVYLQIESLNCGWGKV 199

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKFI+ YV  CGEKEYY+ SAC+E+Y+PPSAYF L+GLTVQASF+ G
Sbjct: 200  EEIRRHILDFKKSGKFILAYVPACGEKEYYLASACQEIYAPPSAYFSLFGLTVQASFVRG 259

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VLE +GIEPQV+RIGKYKSAGDQL RK++S+EN E LTALLDNIYGNW++ IS  +GK +
Sbjct: 260  VLENVGIEPQVERIGKYKSAGDQLARKTMSEENCEMLTALLDNIYGNWLDVISSTRGKKR 319

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI +FI+EGV+++++ KE+GWIT+I YDDEV+S+LK+RL +  +K LP VDYRKY +V+
Sbjct: 320  EDIENFINEGVYQVDKFKEEGWITNIHYDDEVISLLKERLGVQKEKVLPMVDYRKYSKVR 379

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            + T+GL+G K++I +IRASGSISRVRG  S   SGI+ EQFIEKIR+VR+SKKYKA +IR
Sbjct: 380  QSTVGLSGSKDKIAIIRASGSISRVRGSFSLPGSGIIGEQFIEKIRSVRESKKYKAAIIR 439

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ASKPVIASM+DV             TIVAE LT+TGSI
Sbjct: 440  IDSPGGDALASDLMWREIRLLAASKPVIASMSDVAASGGYYMAMAADTIVAENLTLTGSI 499

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKEIISRG+YAEL AAEQR FR +EAELFAKSAQNAY+ FRD
Sbjct: 500  GVVTGKFNLGKLYEKIGFNKEIISRGKYAELLAAEQRSFRPEEAELFAKSAQNAYKQFRD 559

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSMTVDKMEEVAQGRVW G DAASRGLVDAIGGLSRAVAIAK KAN+P
Sbjct: 560  KAAFSRSMTVDKMEEVAQGRVWAGKDAASRGLVDAIGGLSRAVAIAKLKANIP 612


>ref|XP_004488395.1| PREDICTED: protease 4-like [Cicer arietinum]
          Length = 675

 Score =  720 bits (1858), Expect = 0.0
 Identities = 358/473 (75%), Positives = 413/473 (87%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQ KSRFS GLSLPQICENF+KAAYDPRISG+YL I+ L+CGWGKV
Sbjct: 126  KGSVLTMKLRGQISDQAKSRFSPGLSLPQICENFLKAAYDPRISGVYLHIDSLNCGWGKV 185

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH+++FKKSGKF+V YV  C EKEYY+ SACEE+Y+PPSAYF L+GL+VQASFL G
Sbjct: 186  EEIRRHILNFKKSGKFVVAYVPTCQEKEYYLASACEEIYAPPSAYFSLFGLSVQASFLRG 245

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VLE IGIEPQV+RIGKYKSAGDQL R+++SDEN E LTALLDNIY NW++K+S AKGK +
Sbjct: 246  VLENIGIEPQVERIGKYKSAGDQLARRTMSDENCEMLTALLDNIYTNWLDKVSSAKGKGR 305

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI  FI+EGV+++++LKE+G I++I YDDEV +MLK+RL +  DK LP VDYRKY RV+
Sbjct: 306  EDIEKFINEGVYQVDKLKEEGLISNIIYDDEVTAMLKERLGVKTDKNLPMVDYRKYSRVR 365

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWT+G++G K  I +IRASGSISRV+  LS S+SGI+AE+FIEKIRTVR+SK++KA +IR
Sbjct: 366  KWTVGISGGKELIAIIRASGSISRVKSQLSISSSGIIAEEFIEKIRTVRESKRFKAAIIR 425

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ASKPVIASM+DV            Q IVAE LT+TGSI
Sbjct: 426  IDSPGGDALASDLMWREIRLLAASKPVIASMSDVAASGGYYMAMAAQAIVAESLTLTGSI 485

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKEIISRGRYAE+ AAEQR FR DEAELFAKSAQNAY+ FRD
Sbjct: 486  GVVTGKFNLGKLYEKIGFNKEIISRGRYAEVLAAEQRSFRPDEAELFAKSAQNAYKQFRD 545

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSMTVDKMEEVAQGRVWTG DAAS GLVDAIGGLSRA+AIAK KAN+P
Sbjct: 546  KAALSRSMTVDKMEEVAQGRVWTGKDAASHGLVDAIGGLSRAIAIAKLKANIP 598


>ref|XP_003522978.1| PREDICTED: serine protease SPPA, chloroplastic-like [Glycine max]
          Length = 683

 Score =  716 bits (1847), Expect = 0.0
 Identities = 354/473 (74%), Positives = 413/473 (87%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQ+KSRFS GLSLPQICENF+KAAYDPRISGIYL I+ L+CGWGKV
Sbjct: 131  KGSVLTMKLRGQISDQVKSRFSPGLSLPQICENFLKAAYDPRISGIYLHIDSLNCGWGKV 190

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKF++ YV +C EKEYY+ SACEE+Y+PPSAYF L+GLTVQASFL G
Sbjct: 191  EEIRRHILDFKKSGKFVLAYVPLCQEKEYYLASACEEIYAPPSAYFSLFGLTVQASFLKG 250

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VL+ IGIEPQV+RIGKYKSAGDQL R+++S+EN E LT LLDNIY NW++K+S AKGKT+
Sbjct: 251  VLDNIGIEPQVERIGKYKSAGDQLARRTMSEENCEMLTTLLDNIYTNWLDKVSSAKGKTR 310

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI +FI+EGV+++++LKE+G I++I YDDE+ +MLK+RL + +DK L  VDYRKY RV+
Sbjct: 311  EDIENFINEGVYQVDKLKEEGLISNINYDDEITAMLKERLGVKSDKDLRMVDYRKYSRVR 370

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWT+G+ G K  I +IRASGSISRV    S S+SGI+AE+FIEKIRTVR+SKK+KA +IR
Sbjct: 371  KWTVGIPGGKELIAIIRASGSISRVESQFSVSSSGIIAEKFIEKIRTVRESKKFKAAIIR 430

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ASKPVIASM+DV              IVAE LT+TGSI
Sbjct: 431  IDSPGGDALASDLMWREIRLLAASKPVIASMSDVAASGGYYMAMGADVIVAESLTLTGSI 490

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKEIISRGRYAEL AAEQRPFR DEAELFAKSAQ+AY+ FRD
Sbjct: 491  GVVTGKFNLGKLYEKIGFNKEIISRGRYAELLAAEQRPFRPDEAELFAKSAQHAYKQFRD 550

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAASSRSMTV+KMEE AQGRVWTG DAA RGLVDAIGGLSRA+AIAK KA++P
Sbjct: 551  KAASSRSMTVEKMEEFAQGRVWTGKDAALRGLVDAIGGLSRAIAIAKMKADIP 603


>ref|XP_003595673.1| Protease [Medicago truncatula] gi|355484721|gb|AES65924.1| Protease
            [Medicago truncatula]
          Length = 670

 Score =  714 bits (1844), Expect = 0.0
 Identities = 354/473 (74%), Positives = 410/473 (86%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRGEISDQ+KS+FS GLSLPQICENF+KAAYDPRISG+YL I+ L CGWGKV
Sbjct: 118  KGSVLTMKLRGEISDQVKSKFSPGLSLPQICENFLKAAYDPRISGVYLHIDSLDCGWGKV 177

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH+++FKKSGKF+V Y+  C EKEYY+  ACEE+Y+PPSAYF L+GL+VQASF+ G
Sbjct: 178  EEIRRHILNFKKSGKFVVAYLPTCQEKEYYLACACEEIYAPPSAYFSLFGLSVQASFIRG 237

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            VL+KIG+EPQV+RIGKYKSAGDQL R S+SDEN E LTALLDNIY NW++K+S AKGK +
Sbjct: 238  VLDKIGVEPQVERIGKYKSAGDQLARTSMSDENCEMLTALLDNIYTNWLDKVSSAKGKGR 297

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI +FI+EGV+++++LKE+G I+++ YDDEV  MLK+RL +   KKLPTVDYRKY RV 
Sbjct: 298  EDIENFINEGVYQVDKLKEEGLISNLMYDDEVTDMLKKRLGVKKKKKLPTVDYRKYSRVS 357

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWT+G++G K  I +IRASGSISRV+G LS  +SGI AE+FIEKIRTVR+SKK+KA +IR
Sbjct: 358  KWTVGISGGKKLIAIIRASGSISRVKGQLSLFSSGITAEEFIEKIRTVRESKKFKAAIIR 417

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ASKPVIASMADV              IVAE LT+TGSI
Sbjct: 418  IDSPGGDALASDLMWREIRLLAASKPVIASMADVAASGGYYMAMGTDAIVAESLTLTGSI 477

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKEIISRGRYAEL +A+QR FR DEAELFAKSAQNAY+ FRD
Sbjct: 478  GVVTGKFNLAKLYEKIGFNKEIISRGRYAELVSADQRSFRPDEAELFAKSAQNAYKQFRD 537

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSMTVDKME+VAQGRVWTG DAAS GLVDAIGGLSRA+AIAK KAN+P
Sbjct: 538  KAALSRSMTVDKMEKVAQGRVWTGKDAASHGLVDAIGGLSRAIAIAKLKANIP 590


>ref|XP_006390483.1| hypothetical protein EUTSA_v10018225mg [Eutrema salsugineum]
            gi|312283239|dbj|BAJ34485.1| unnamed protein product
            [Thellungiella halophila] gi|557086917|gb|ESQ27769.1|
            hypothetical protein EUTSA_v10018225mg [Eutrema
            salsugineum]
          Length = 682

 Score =  714 bits (1843), Expect = 0.0
 Identities = 354/473 (74%), Positives = 408/473 (86%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTM LRG+ISDQLKSRFSSGLSLPQI EN +KAAYDPRI+G+YL IEPLSCGWGKV
Sbjct: 129  KGSVLTMTLRGQISDQLKSRFSSGLSLPQISENLVKAAYDPRIAGVYLHIEPLSCGWGKV 188

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKFIVGY+ +CG KEYY+G AC ELY+PPSAY  LYGLTVQASFLGG
Sbjct: 189  EEIRRHILDFKKSGKFIVGYINICGLKEYYLGCACNELYAPPSAYSFLYGLTVQASFLGG 248

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EK+GIEPQVQRIGKYKSAGDQL+RK+IS+EN E L+ LLDNIY NW++ +S + GK +
Sbjct: 249  VFEKVGIEPQVQRIGKYKSAGDQLSRKNISEENYEMLSVLLDNIYANWLDGVSDSTGKQR 308

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED+ SFI++GV+EI +LKE+G I DI+YDDEV+SMLK+RL +  DKKLPTVDY+KY  VK
Sbjct: 309  EDVESFINQGVYEIEKLKEEGLIKDIRYDDEVISMLKERLGVEKDKKLPTVDYKKYSGVK 368

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGL+G ++QI +IRA GSISRV+GPLS   S I+AEQ IEKIR+VR+SKKYKA +IR
Sbjct: 369  KWTLGLSGGRDQIAIIRAGGSISRVKGPLSTPGSAIIAEQLIEKIRSVRESKKYKAAIIR 428

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREIKLL+ SKPVIASM+DV             TIVAE LT+TGSI
Sbjct: 429  IDSPGGDALASDLMWREIKLLAESKPVIASMSDVAASGGYYMAMAANTIVAENLTLTGSI 488

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVT +F L KLYEKIGFNKE ISRG+YAEL  AE+RPF+ +EAELF KSAQ+AY+ FRD
Sbjct: 489  GVVTARFTLAKLYEKIGFNKETISRGKYAELLGAEERPFKPEEAELFGKSAQHAYQLFRD 548

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSM VDKMEEVAQGRVWTG DA SRGLVDA+GGLSRA+AIAK+KAN+P
Sbjct: 549  KAALSRSMPVDKMEEVAQGRVWTGKDAHSRGLVDALGGLSRAIAIAKKKANIP 601


>ref|XP_007138385.1| hypothetical protein PHAVU_009G204100g [Phaseolus vulgaris]
            gi|561011472|gb|ESW10379.1| hypothetical protein
            PHAVU_009G204100g [Phaseolus vulgaris]
          Length = 668

 Score =  711 bits (1835), Expect = 0.0
 Identities = 349/473 (73%), Positives = 413/473 (87%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQ+KSRFS GLSLPQICENF+KAAYDPR+SGIYL I+ L+CGWGKV
Sbjct: 116  KGSVLTMKLRGQISDQVKSRFSPGLSLPQICENFLKAAYDPRVSGIYLHIDSLNCGWGKV 175

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKFI+ YV +C EKEYY+  AC+E+YSPPSAYF L+GLTVQASFL G
Sbjct: 176  EEIRRHILDFKKSGKFILAYVPLCQEKEYYLACACDEIYSPPSAYFSLFGLTVQASFLRG 235

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            +L+ IGIEPQV+RIGKYKSAGDQL R+++S+EN E LTALLDNIY NW++K+S +KGK++
Sbjct: 236  ILDNIGIEPQVERIGKYKSAGDQLARRTMSEENCEMLTALLDNIYTNWLDKVSSSKGKSR 295

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            EDI   I+EGV+++++LKE+G I+++ YDDE+++MLK+RL +  DK LP VDYRKY RV+
Sbjct: 296  EDIEKLINEGVYQVDKLKEEGLISNVIYDDEIITMLKERLGVKLDKDLPMVDYRKYSRVR 355

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWT+G++G +  I +IRASGSISRV   LS S+SGI AE+FIEKIRTVR+SKK+KA +IR
Sbjct: 356  KWTVGISGGRELIAIIRASGSISRVESQLSVSSSGITAEKFIEKIRTVRESKKFKAAIIR 415

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+A KPVIASM+DV              IVAE LT+TGSI
Sbjct: 416  IDSPGGDALASDLMWREIRLLAAKKPVIASMSDVAASGGYYMAMGADAIVAESLTLTGSI 475

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKFNL KLYEKIGFNKE+ISRGRYAEL AAEQRPFR DEAELFAKSA++AY+ FRD
Sbjct: 476  GVVTGKFNLGKLYEKIGFNKEVISRGRYAELLAAEQRPFRPDEAELFAKSARHAYKQFRD 535

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSMTV+KMEEVAQGRVWTGNDAAS GLVDAIGGLSRA+AIAK KAN+P
Sbjct: 536  KAALSRSMTVEKMEEVAQGRVWTGNDAASHGLVDAIGGLSRAIAIAKVKANIP 588


>ref|XP_006301110.1| hypothetical protein CARUB_v10021504mg [Capsella rubella]
            gi|482569820|gb|EOA34008.1| hypothetical protein
            CARUB_v10021504mg [Capsella rubella]
          Length = 677

 Score =  710 bits (1832), Expect = 0.0
 Identities = 352/473 (74%), Positives = 407/473 (86%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVL M LRG+ISDQLKSRF+SGLSLPQ+ ENF+KAAYDPRI+GIYL IEPLSCGWGKV
Sbjct: 124  KGSVLNMTLRGQISDQLKSRFNSGLSLPQLSENFVKAAYDPRIAGIYLHIEPLSCGWGKV 183

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH++DFKKSGKFIVGY+ +CG KEYY+G AC ELY+PPSAY  LYGLTVQASFLGG
Sbjct: 184  EEIRRHILDFKKSGKFIVGYINICGLKEYYLGCACNELYAPPSAYSFLYGLTVQASFLGG 243

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EK+GIEPQVQRIGKYKSAGDQL+RK+IS+EN E L+ LLDNIY NW++ +S + GK +
Sbjct: 244  VFEKVGIEPQVQRIGKYKSAGDQLSRKNISEENYEMLSVLLDNIYANWLDGVSDSIGKKR 303

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED+ SFI++GV+EI +LKE+G I DI YDDEV+SMLK+RL +  DKKLPTVDY+KY  VK
Sbjct: 304  EDVESFINQGVYEIEKLKEEGLIKDIMYDDEVISMLKERLGVEKDKKLPTVDYKKYSGVK 363

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGL+G ++QI +IRA GSISRV+GPLS   S I+AEQ IEKIR+VR+SKKYKA +IR
Sbjct: 364  KWTLGLSGGRDQIAIIRAGGSISRVKGPLSTPGSAIIAEQLIEKIRSVRESKKYKAAIIR 423

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREIKLL+ +KPVIASM+DV             TIVAE LT+TGSI
Sbjct: 424  IDSPGGDALASDLMWREIKLLAETKPVIASMSDVAASGGYYMAMAANTIVAENLTLTGSI 483

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVT +F L KLYEKIGFNKE ISRG+YAEL  AE+RPF+ +EAELF KSAQ+AY+ FRD
Sbjct: 484  GVVTARFTLAKLYEKIGFNKETISRGKYAELLGAEERPFKPEEAELFGKSAQHAYQLFRD 543

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSM V+KMEEVAQGRVWTG DA SRGLVDA+GGLSRA+AIAKQKAN+P
Sbjct: 544  KAAISRSMPVEKMEEVAQGRVWTGKDAHSRGLVDALGGLSRAIAIAKQKANIP 596


>ref|XP_002887518.1| hypothetical protein ARALYDRAFT_476539 [Arabidopsis lyrata subsp.
            lyrata] gi|297333359|gb|EFH63777.1| hypothetical protein
            ARALYDRAFT_476539 [Arabidopsis lyrata subsp. lyrata]
          Length = 676

 Score =  708 bits (1828), Expect = 0.0
 Identities = 347/473 (73%), Positives = 410/473 (86%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTM LRG+ISDQLKSRF+SGLSLPQ+ ENF+KAAYDPRI+G+YL I+PLSCGWGKV
Sbjct: 123  KGSVLTMTLRGQISDQLKSRFNSGLSLPQLSENFVKAAYDPRIAGVYLHIDPLSCGWGKV 182

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EE+RRH++DFKKSGKFIVGY+++CG KE+Y+G AC ELY+PPSAY  LYGLTVQASFLGG
Sbjct: 183  EELRRHILDFKKSGKFIVGYISICGLKEFYLGCACNELYAPPSAYSFLYGLTVQASFLGG 242

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EK+GIEPQVQRIGKYKSAGDQL+RK+IS+EN E L+ LLDNIY NW++ +S + GK +
Sbjct: 243  VFEKVGIEPQVQRIGKYKSAGDQLSRKNISEENYEMLSVLLDNIYANWLDGVSDSTGKKR 302

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED+ +FI++GV+EI +LKE+G I DI+YDDEV++MLK+RL +  DKKLPTVDY+KY  VK
Sbjct: 303  EDVENFINQGVYEIEKLKEEGLIKDIRYDDEVIAMLKERLGVEKDKKLPTVDYKKYSGVK 362

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGL+G ++QI +IRA GSISRV+GPLS   S I+AEQ IEKIR+VR+SKK+KA +IR
Sbjct: 363  KWTLGLSGGRDQIAIIRAGGSISRVKGPLSTPGSAIIAEQLIEKIRSVRESKKFKAAIIR 422

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREIKLL+ +KPVIASM+DV             TIVAE LT+TGSI
Sbjct: 423  IDSPGGDALASDLMWREIKLLAETKPVIASMSDVAASGGYYMAMAANTIVAENLTLTGSI 482

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVT +F L KLYEKIGFNKE ISRG+YAEL  AE+RPF+ +EAELF KSAQ+AY+ FRD
Sbjct: 483  GVVTARFTLAKLYEKIGFNKETISRGKYAELLGAEERPFKPEEAELFEKSAQHAYQLFRD 542

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSM VDKMEEVAQGRVWTG DA SRGLVDA+GGLSRA+AIAKQKAN+P
Sbjct: 543  KAALSRSMPVDKMEEVAQGRVWTGRDAHSRGLVDALGGLSRAIAIAKQKANIP 595


>ref|NP_565077.2| signal peptide peptidase [Arabidopsis thaliana]
            gi|75169679|sp|Q9C9C0.1|SPPA1_ARATH RecName: Full=Serine
            protease SPPA, chloroplastic; AltName: Full=Signal
            peptide peptidase SPPA; Flags: Precursor
            gi|12325146|gb|AAG52522.1|AC016662_16 putative protease
            IV; 48713-44371 [Arabidopsis thaliana]
            gi|332197414|gb|AEE35535.1| signal peptide peptidase
            [Arabidopsis thaliana]
          Length = 677

 Score =  705 bits (1819), Expect = 0.0
 Identities = 347/473 (73%), Positives = 406/473 (85%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTM LRG+ISDQLKSRF+SGLSLPQ+ ENF+KAAYDPRI+G+YL I+PLSCGWGKV
Sbjct: 124  KGSVLTMTLRGQISDQLKSRFNSGLSLPQLSENFVKAAYDPRIAGVYLHIDPLSCGWGKV 183

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH+++FKKSGKFIVGY+++CG KEYY+G AC EL++PPSAY  LYGLTVQASFLGG
Sbjct: 184  EEIRRHILNFKKSGKFIVGYISICGLKEYYLGCACNELFAPPSAYSFLYGLTVQASFLGG 243

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V EK+GIEPQVQRIGKYKSAGDQL+RKSIS+EN E L+ LLDNIY NW++ +S A GK +
Sbjct: 244  VFEKVGIEPQVQRIGKYKSAGDQLSRKSISEENYEMLSVLLDNIYSNWLDGVSDATGKKR 303

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED+ +FI++GV+EI +LKE G I DI+YDDEV++MLK+RL +  DKKLPTVDY+KY  VK
Sbjct: 304  EDVENFINQGVYEIEKLKEAGLIKDIRYDDEVITMLKERLGVEKDKKLPTVDYKKYSGVK 363

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
            KWTLGL G ++QI +IRA GSISRV+GPLS   S I+AEQ IEKIR+VR+SKKYKA +IR
Sbjct: 364  KWTLGLTGGRDQIAIIRAGGSISRVKGPLSTPGSAIIAEQLIEKIRSVRESKKYKAAIIR 423

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREIKLL+ +KPVIASM+DV              IVAE LT+TGSI
Sbjct: 424  IDSPGGDALASDLMWREIKLLAETKPVIASMSDVAASGGYYMAMAANAIVAENLTLTGSI 483

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVT +F L KLYEKIGFNKE ISRG+YAEL  AE+RP + +EAELF KSAQ+AY+ FRD
Sbjct: 484  GVVTARFTLAKLYEKIGFNKETISRGKYAELLGAEERPLKPEEAELFEKSAQHAYQLFRD 543

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSM VDKMEEVAQGRVWTG DA SRGL+DA+GGLSRA+AIAKQKAN+P
Sbjct: 544  KAALSRSMPVDKMEEVAQGRVWTGKDAHSRGLIDAVGGLSRAIAIAKQKANIP 596


>ref|XP_002322054.2| hypothetical protein POPTR_0015s03620g [Populus trichocarpa]
            gi|550321867|gb|EEF06181.2| hypothetical protein
            POPTR_0015s03620g [Populus trichocarpa]
          Length = 691

 Score =  704 bits (1818), Expect = 0.0
 Identities = 351/473 (74%), Positives = 400/473 (84%)
 Frame = +3

Query: 3    KGSVLTMKLRGEISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLQIEPLSCGWGKV 182
            KGSVLTMKLRG+ISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYL I+ L+CGW KV
Sbjct: 139  KGSVLTMKLRGQISDQLKSRFSSGLSLPQICENFIKAAYDPRISGIYLHIDGLNCGWAKV 198

Query: 183  EEIRRHVIDFKKSGKFIVGYVAVCGEKEYYVGSACEELYSPPSAYFQLYGLTVQASFLGG 362
            EEIRRH+ +FKKSGKF+V Y+  C EKEYY+ SAC++LY PP+AYF  YG TVQA+FL G
Sbjct: 199  EEIRRHIFNFKKSGKFVVAYLPACREKEYYLASACDDLYLPPTAYFSFYGFTVQAAFLAG 258

Query: 363  VLEKIGIEPQVQRIGKYKSAGDQLTRKSISDENRETLTALLDNIYGNWVEKISLAKGKTK 542
            V E +GI+P VQRIGKYKSAGDQLTRKS+S EN E LTA+LDNIYGNW++K+S  KGK  
Sbjct: 259  VFENVGIQPDVQRIGKYKSAGDQLTRKSMSKENCEMLTAILDNIYGNWLDKVSSTKGKKI 318

Query: 543  EDIVSFIDEGVFEINRLKEDGWITDIKYDDEVVSMLKQRLEIPNDKKLPTVDYRKYCRVK 722
            ED+ +FI+EGV+++ RLKE+G IT++ YDDEV+SMLK+++ +  DK LP VDY KY RV+
Sbjct: 319  EDMKNFINEGVYKVERLKEEGLITNMHYDDEVISMLKEKVGVQKDKVLPMVDYSKYSRVR 378

Query: 723  KWTLGLAGYKNQIVVIRASGSISRVRGPLSASNSGIVAEQFIEKIRTVRDSKKYKAVVIR 902
             WTLGL G ++ I +IRASGSISRV+ PLS S SGI+ EQ IEKIR  R+SKKYKA +IR
Sbjct: 379  NWTLGLTGGRDLIAIIRASGSISRVKSPLSLSGSGIIGEQLIEKIRQARESKKYKAAIIR 438

Query: 903  IDSPGGDALASDLMWREIKLLSASKPVIASMADVXXXXXXXXXXXXQTIVAEKLTITGSI 1082
            IDSPGGDALASDLMWREI+LL+ SKPVIASM+DV             TIVAE LT+TGSI
Sbjct: 439  IDSPGGDALASDLMWREIRLLAESKPVIASMSDVAASGGYYMAMAADTIVAENLTLTGSI 498

Query: 1083 GVVTGKFNLEKLYEKIGFNKEIISRGRYAELTAAEQRPFRNDEAELFAKSAQNAYRSFRD 1262
            GVVTGKF+L KLYEKIGFNKEIISRG+YAEL AA+QRP R DEAELFAKSAQNAY  FRD
Sbjct: 499  GVVTGKFSLGKLYEKIGFNKEIISRGKYAELLAADQRPLRPDEAELFAKSAQNAYEQFRD 558

Query: 1263 KAASSRSMTVDKMEEVAQGRVWTGNDAASRGLVDAIGGLSRAVAIAKQKANLP 1421
            KAA SRSM VDKMEEVAQGRVWTG DAASRGLVDAIGG SRAVAIAKQKAN+P
Sbjct: 559  KAAFSRSMPVDKMEEVAQGRVWTGQDAASRGLVDAIGGFSRAVAIAKQKANIP 611

