BLASTX nr result

ID: Mentha26_contig00031488 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00031488
         (1364 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoide...    79   6e-12
emb|CDJ83395.1| Peptidase C1A domain containing protein [Haemonc...    69   4e-09
gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]              69   4e-09
emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]             69   4e-09
ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula] gi...    69   6e-09
ref|XP_007148042.1| hypothetical protein PHAVU_006G175500g [Phas...    68   8e-09
gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]              68   8e-09
gb|EYC39737.1| hypothetical protein Y032_0643g1066 [Ancylostoma ...    68   1e-08
gb|EYC39735.1| hypothetical protein Y032_0643g1066 [Ancylostoma ...    68   1e-08
gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]              68   1e-08
ref|XP_002317417.2| hypothetical protein POPTR_0011s07300g [Popu...    67   1e-08
ref|XP_004293953.1| PREDICTED: cysteine proteinase RD21a-like [F...    67   1e-08
gb|ABK95906.1| unknown [Populus trichocarpa]                           67   1e-08
ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans] gi|35...    67   2e-08
gb|AAP41846.1| cysteine protease [Anthurium andraeanum]                67   2e-08
ref|XP_007025363.1| Xylem bark cysteine peptidase 3 isoform 1 [T...    66   3e-08
ref|XP_006362397.1| PREDICTED: oryzain alpha chain-like [Solanum...    66   4e-08
ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutr...    66   4e-08
gb|EYB81604.1| hypothetical protein Y032_0378g287 [Ancylostoma c...    65   5e-08
gb|EYB81603.1| hypothetical protein Y032_0378g287 [Ancylostoma c...    65   5e-08

>gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score = 78.6 bits (192), Expect = 6e-12
 Identities = 63/222 (28%), Positives = 103/222 (46%), Gaps = 5/222 (2%)
 Frame = -1

Query: 1112 PDRLLKTRDQGNTNACTAMVVRACMYGEEVG--FSFIDVQKLMGVPEEGSFNVPIKGFTC 939
            P+ +L  RDQ    +C A  +      E +G  F  +   K    P++   +       C
Sbjct: 75   PNAILPVRDQEKCGSCWAFSI-----AESLGDRFGILGCGKGHLSPQD-LISCDSNDLGC 128

Query: 938  SYGLV---FSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFI 768
            + G     +++V   G+++ES + + +G  R  SC     +G +  R            I
Sbjct: 129  NGGYQENSWTWVLTTGITTESCWPYRSGSGRIPSCPHRCVNGSVLQRNT----------I 178

Query: 767  DSYRRIVSKETFDNMAVCGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKPIPGQ 588
            ++YRR+ S E  D +               +N P+  T+ +   F Y+    IYK + G 
Sbjct: 179  NNYRRLDSSELQDEL--------------YNNGPIQVTYVVYEDFFYYSKG-IYKHLSGN 223

Query: 587  KVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
            KV G HA+ L+G+G E G +Y+++QNSWG  WG +GY R+LR
Sbjct: 224  KVGG-HAVVLMGWGIEDGVKYWLVQNSWGYEWGEQGYFRILR 264


>emb|CDJ83395.1| Peptidase C1A domain containing protein [Haemonchus contortus]
          Length = 341

 Score = 69.3 bits (168), Expect = 4e-09
 Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 27/237 (11%)
 Frame = -1

Query: 1091 RDQGNTNAC------TAMVVRACMYG---EEVGFSFIDVQKLMGVPEEGSFNVPIKGFTC 939
            RDQ N  +C      +A+  R C+     ++V  S  D+    G            G+ C
Sbjct: 110  RDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQ---------CGYGC 160

Query: 938  SYGL---VFSYVRVRGVSSESFYKWVNG--PSRFQSCSGEPWD---GQITNREGFT---I 792
            + G     F+Y   +G  +   YK  +G  P  F  C     D   G+  N E  T   +
Sbjct: 161  NGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPN-EATTPKCV 219

Query: 791  EKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAA-------LDNQPLTGTFGITLRF 633
             K +K +  SY++  S          G +AY+  N+        + N P+ G F +   F
Sbjct: 220  RKCQKSYKKSYKKDRS---------IGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDF 270

Query: 632  RYWEGDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
             Y++   IYK   G K  G HAI ++G+G+E G  Y++I NSW   WG  GY R+LR
Sbjct: 271  SYYKKG-IYKHTAG-KARGGHAIKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILR 325


>gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 69.3 bits (168), Expect = 4e-09
 Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 27/237 (11%)
 Frame = -1

Query: 1091 RDQGNTNAC------TAMVVRACMYG---EEVGFSFIDVQKLMGVPEEGSFNVPIKGFTC 939
            RDQ N  +C      +A+  R C+     ++V  S  D+    G            G+ C
Sbjct: 22   RDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQ---------CGYGC 72

Query: 938  SYGL---VFSYVRVRGVSSESFYKWVNG--PSRFQSCSGEPWD---GQITNREGFT---I 792
            + G     F+Y   +G  +   YK  +G  P  F  C     D   G+  N E  T   +
Sbjct: 73   NGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPN-EATTPKCV 131

Query: 791  EKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAA-------LDNQPLTGTFGITLRF 633
             K +K +  SY++  S          G +AY+  N+        + N P+ G F +   F
Sbjct: 132  RKCQKSYKKSYKKDRS---------IGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDF 182

Query: 632  RYWEGDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
             Y++   IYK   G K  G HAI ++G+G+E G  Y++I NSW   WG  GY R+LR
Sbjct: 183  SYYKKG-IYKHTAG-KARGGHAIKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILR 237


>emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score = 69.3 bits (168), Expect = 4e-09
 Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 27/237 (11%)
 Frame = -1

Query: 1091 RDQGNTNAC------TAMVVRACMYG---EEVGFSFIDVQKLMGVPEEGSFNVPIKGFTC 939
            RDQ N  +C      +A+  R C+     ++V  S  D+    G            G+ C
Sbjct: 110  RDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQ---------CGYGC 160

Query: 938  SYGL---VFSYVRVRGVSSESFYKWVNG--PSRFQSCSGEPWD---GQITNREGFT---I 792
            + G     F+Y   +G  +   YK  +G  P  F  C     D   G+  N E  T   +
Sbjct: 161  NGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPN-EATTPKCV 219

Query: 791  EKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAA-------LDNQPLTGTFGITLRF 633
             K +K +  SY++  S          G +AY+  N+        + N P+ G F +   F
Sbjct: 220  RKCQKSYKKSYKKDRS---------IGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDF 270

Query: 632  RYWEGDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
             Y++   IYK   G K  G HAI ++G+G+E G  Y++I NSW   WG  GY R+LR
Sbjct: 271  SYYKKG-IYKHTAG-KARGGHAIKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILR 325


>ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula] gi|355482813|gb|AES64016.1|
            Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score = 68.6 bits (166), Expect = 6e-09
 Identities = 60/215 (27%), Positives = 98/215 (45%), Gaps = 4/215 (1%)
 Frame = -1

Query: 1091 RDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLV---F 921
            +DQG+   C A  V A + G       I+  +L+ + E+   +   +   C  G +   F
Sbjct: 166  KDQGSCGCCWAFSVVAAVEGAVK----INTGELISLSEQQLVDCDERNSGCHGGNMDSAF 221

Query: 920  SYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVSK 741
             Y+  +G+ SE+ Y +  G    Q      ++ QITN            FID        
Sbjct: 222  KYIIQKGIVSEADYPYQEGSQTCQLNDQMKFEAQITN------------FID-------- 261

Query: 740  ETFDNMAVCGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKPIPGQKVHGKHAIT 561
                   V   +    L  A+  QP++    +   F+++ GD +Y    GQ ++  HA+T
Sbjct: 262  -------VPANDEQQLLQ-AVAQQPVSVGIEVGDEFQHYMGD-VYSGTCGQSMN--HAVT 310

Query: 560  LVGYG-QEAGDEYFVIQNSWGRTWGCRGYGRVLRK 459
             VGYG  E G +Y++I+NSWG+ WG  GY ++LR+
Sbjct: 311  AVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRE 345


>ref|XP_007148042.1| hypothetical protein PHAVU_006G175500g [Phaseolus vulgaris]
            gi|561021265|gb|ESW20036.1| hypothetical protein
            PHAVU_006G175500g [Phaseolus vulgaris]
          Length = 507

 Score = 68.2 bits (165), Expect = 8e-09
 Identities = 66/269 (24%), Positives = 103/269 (38%)
 Frame = -1

Query: 1091 RDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLVFSYV 912
            +DQGN  +C A        G   G + +    L+ + E+   +       C YG +  Y 
Sbjct: 166  KDQGNCGSCWAF----SSTGAIEGINALVTGDLVSLSEQELVDCDSTNEGC-YGGLMDYA 220

Query: 911  RVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVSKETF 732
                      ++WV       S +  P+ G +  R   T EK + + ID Y  +      
Sbjct: 221  ----------FEWVMHNGGIDSETEYPYTG-VDARCNVTKEKTKVVSIDGYSDVGQS--- 266

Query: 731  DNMAVCGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVG 552
            DN  +C T A   ++ A+D        G +L F+ + G               HA+ +VG
Sbjct: 267  DNSLLCAT-AKQPISVAID--------GSSLDFQLYAGGIYDGDCSSDPDDIDHAVLIVG 317

Query: 551  YGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYAPVNTHIPADSPTYIPNQGDKRD 372
            YG E  ++Y++++NSWG +WG  GY   +R++ D  Y     +  A  PT          
Sbjct: 318  YGSEDDEDYWIVKNSWGTSWGMEGY-IYIRRNTDLKYGVCAINYMASYPTKEITAPSPSS 376

Query: 371  RDSPXXXXXXXXXXXEAQPFKKQHIDCGD 285
              SP              P     I CGD
Sbjct: 377  SPSPPSPSPPQPLPPPPPPPPPPPIRCGD 405


>gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 68.2 bits (165), Expect = 8e-09
 Identities = 69/236 (29%), Positives = 102/236 (43%), Gaps = 27/236 (11%)
 Frame = -1

Query: 1091 RDQGNTNAC------TAMVVRACMYG---EEVGFSFIDVQKLMGVPEEGSFNVPIKGFTC 939
            RDQ N  +C      +A+  R C+     ++V  S  D+    G            G+ C
Sbjct: 22   RDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQ---------CGYGC 72

Query: 938  SYGL---VFSYVRVRGVSSESFYKWVNG--PSRFQSCSGEPWD---GQITNREGFT---I 792
            + G     F+Y   +G  +   YK  +G  P  F  C     D   G+  N E  T   +
Sbjct: 73   NGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPN-EATTPKCV 131

Query: 791  EKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAA-------LDNQPLTGTFGITLRF 633
             K +K +  SY++  S          G +AY+  NA        + N P+ G F +   F
Sbjct: 132  RKCQKSYKKSYKKDRS---------IGKDAYEEPNAEKATQREIMKNGPVVGAFTVYEDF 182

Query: 632  RYWEGDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVL 465
             Y++   IYK   G K  G HAI ++G+G+E G  Y++I NSW   WG  GY R+L
Sbjct: 183  SYYKKG-IYKHTAG-KARGGHAIKIIGWGKEGGVPYWLIANSWHNDWGENGYFRIL 236


>gb|EYC39737.1| hypothetical protein Y032_0643g1066 [Ancylostoma ceylanicum]
          Length = 360

 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 63/233 (27%), Positives = 99/233 (42%), Gaps = 22/233 (9%)
 Frame = -1

Query: 1094 TRDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGL---V 924
            TRDQ N  +C A+   + M       S   + +++   E  S  +P  G  C+ G    V
Sbjct: 124  TRDQSNCGSCYAVSAASVMSDRACILSNGRINRILSDTEVMSCCIPNCGSGCNGGQPSRV 183

Query: 923  FSYVRVRGVSSESFYKWVNG--PSRFQSCS---GEPWDGQITNRE------------GFT 795
            F Y    G+ +   Y+  +   P  F  C     +P+ G  +NR             G+ 
Sbjct: 184  FGYAWRHGICTGGRYREKDACQPYAFYPCGQHKNQPYYGPCSNRLWPTPTCRKTCQLGYP 243

Query: 794  I--EKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAALDNQPLTGTFGITLRFRYWE 621
            I  EKD+         I +KETF    +   E Y  ++      P+  T+ +   F Y+ 
Sbjct: 244  IPFEKDK---------IFNKETF---YIRANETY-IMHEIFTRGPVVATYDVYKDFNYYR 290

Query: 620  GDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
               IY    G +  G HA+ ++G+G+E    +++I NSW   WG  GY R+LR
Sbjct: 291  KG-IYIHKDGGRPTGSHAVKIIGWGKENDVPFWLIANSWNSDWGEDGYFRILR 342


>gb|EYC39735.1| hypothetical protein Y032_0643g1066 [Ancylostoma ceylanicum]
          Length = 349

 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 63/233 (27%), Positives = 99/233 (42%), Gaps = 22/233 (9%)
 Frame = -1

Query: 1094 TRDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGL---V 924
            TRDQ N  +C A+   + M       S   + +++   E  S  +P  G  C+ G    V
Sbjct: 113  TRDQSNCGSCYAVSAASVMSDRACILSNGRINRILSDTEVMSCCIPNCGSGCNGGQPSRV 172

Query: 923  FSYVRVRGVSSESFYKWVNG--PSRFQSCS---GEPWDGQITNRE------------GFT 795
            F Y    G+ +   Y+  +   P  F  C     +P+ G  +NR             G+ 
Sbjct: 173  FGYAWRHGICTGGRYREKDACQPYAFYPCGQHKNQPYYGPCSNRLWPTPTCRKTCQLGYP 232

Query: 794  I--EKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAALDNQPLTGTFGITLRFRYWE 621
            I  EKD+         I +KETF    +   E Y  ++      P+  T+ +   F Y+ 
Sbjct: 233  IPFEKDK---------IFNKETF---YIRANETY-IMHEIFTRGPVVATYDVYKDFNYYR 279

Query: 620  GDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
               IY    G +  G HA+ ++G+G+E    +++I NSW   WG  GY R+LR
Sbjct: 280  KG-IYIHKDGGRPTGSHAVKIIGWGKENDVPFWLIANSWNSDWGEDGYFRILR 331


>gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 58/185 (31%), Positives = 83/185 (44%), Gaps = 22/185 (11%)
 Frame = -1

Query: 950 GFTCSYGL-------VFSYVRVRGVSSESFYKWVNG--PSRFQSCSGEPWD---GQITNR 807
           G  C YG         F+Y   +G  +   YK  +G  P  F  C     D   G+  N 
Sbjct: 65  GNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPN- 123

Query: 806 EGFT---IEKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAA-------LDNQPLTG 657
           E  T   + K +K +  SY++  S          G +AY+  N+        + N P+ G
Sbjct: 124 EATTPKCVRKCQKSYKKSYKKDRS---------IGKDAYEVPNSEKAIQREIMKNGPVVG 174

Query: 656 TFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGY 477
            F +   F Y++   IYK   G K  G HAI ++G+G+E G  Y++I NSW   WG  GY
Sbjct: 175 AFTVYEDFSYYKKG-IYKHTAG-KARGGHAIKIIGWGKENGVPYWLIANSWHNDWGENGY 232

Query: 476 GRVLR 462
            R+LR
Sbjct: 233 FRILR 237


>ref|XP_002317417.2| hypothetical protein POPTR_0011s07300g [Populus trichocarpa]
           gi|550327861|gb|EEE98029.2| hypothetical protein
           POPTR_0011s07300g [Populus trichocarpa]
          Length = 498

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 54/208 (25%), Positives = 86/208 (41%), Gaps = 1/208 (0%)
 Frame = -1

Query: 902 GVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVSKETFDNM 723
           G   +S ++WV G     + +  P+ G +        E+ + + I+ Y   V  +  D+ 
Sbjct: 202 GGDMDSAFQWVIGNGGIDTEADYPYTG-VDGTCNTAKEEKKVVSIEGY---VDVDPSDSA 257

Query: 722 AVCGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVGYG 546
            +C T            QP++ G  G  L F+ + G        G      HAI +VGYG
Sbjct: 258 LLCATV----------QQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYG 307

Query: 545 QEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYAPVNTHIPADSPTYIPNQGDKRDRD 366
            E  ++Y++++NSWG  WG  GY   +R++  + Y     +  A  PT +P+        
Sbjct: 308 SENDEDYWIVKNSWGTEWGMEGY-FYIRRNTSKPYGVCAINADASYPTKVPSPPSP---P 363

Query: 365 SPXXXXXXXXXXXEAQPFKKQHIDCGDS 282
           SP              P   Q  DCGDS
Sbjct: 364 SPPPPPSPPPPPPSPPPPCPQPSDCGDS 391


>ref|XP_004293953.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 519

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 61/264 (23%), Positives = 108/264 (40%), Gaps = 4/264 (1%)
 Frame = -1

Query: 1181 LEDEGRGDIPIFKNATSCLVKSHPDRLLKTRDQGNTNACTAMVVRACMYGEEVGFSFIDV 1002
            ++ + + ++P  K+A S L       +   +DQG+  +C A        G   G + +  
Sbjct: 135  MQQQAKAELP--KDAPSSLDWRKKGIVTPIKDQGSCGSCWAF----SSTGGIEGINALVT 188

Query: 1001 QKLMGVPEEGSFNVPIKGFTCSYGLV---FSYVRVRG-VSSESFYKWVNGPSRFQSCSGE 834
              L+ + E+   +     + CS G +   F +V   G + +E+ Y + +      +C+  
Sbjct: 189  GDLISLSEQELVDCDTTNYGCSGGYMDYAFEWVISNGGIDTEADYPYTSTTGFGGTCN-- 246

Query: 833  PWDGQITNREGFTIEKDRKIFIDSYRRIVSKETFDNMAVCGTEAYDTLNAALDNQPLTGT 654
                        T E+ + + ID Y  +   ET               NA L      G 
Sbjct: 247  -----------VTKEETKVVTIDGYTDVEETET------------GLFNAVLQQPISVGI 283

Query: 653  FGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYG 474
             G T  F+ +              +  HA+ +VGYG E+G++Y++++NSWG +WG  GY 
Sbjct: 284  DGSTWDFQLYSSGIYDGDCSDDPNNIDHAVLIVGYGSESGEDYWIVKNSWGTSWGMEGY- 342

Query: 473  RVLRKDVDRIYAPVNTHIPADSPT 402
              LR++ D  Y     +  A  PT
Sbjct: 343  FYLRRNTDLPYGVCAVNAMASYPT 366


>gb|ABK95906.1| unknown [Populus trichocarpa]
          Length = 498

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 54/208 (25%), Positives = 86/208 (41%), Gaps = 1/208 (0%)
 Frame = -1

Query: 902 GVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVSKETFDNM 723
           G   +S ++WV G     + +  P+ G +        E+ + + I+ Y   V  +  D+ 
Sbjct: 202 GGDMDSAFQWVIGNGGIDTEADYPYTG-VDGTCNTAKEEKKVVSIEGY---VDVDPSDSA 257

Query: 722 AVCGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVGYG 546
            +C T            QP++ G  G  L F+ + G        G      HAI +VGYG
Sbjct: 258 LLCATV----------QQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYG 307

Query: 545 QEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYAPVNTHIPADSPTYIPNQGDKRDRD 366
            E  ++Y++++NSWG  WG  GY   +R++  + Y     +  A  PT +P+        
Sbjct: 308 SENDEDYWIVKNSWGTEWGMEGY-FYIRRNTSKPYGVCAINADASYPTKVPSPPSP---P 363

Query: 365 SPXXXXXXXXXXXEAQPFKKQHIDCGDS 282
           SP              P   Q  DCGDS
Sbjct: 364 SPPPPPSPPPPPPSPPPPCPQPSDCGDS 391


>ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans]
            gi|351061560|emb|CCD69414.1| Protein R09F10.1
            [Caenorhabditis elegans]
          Length = 383

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 55/222 (24%), Positives = 93/222 (41%), Gaps = 7/222 (3%)
 Frame = -1

Query: 1106 RLLKTRDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGL 927
            +L   ++QG   +C A    A +  +      I   KL+ + E+   +   +   CS G 
Sbjct: 179  KLTPIKNQGQCGSCWAFATVASVEAQNA----IKKGKLVSLSEQEMVDCDGRNNGCSGGY 234

Query: 926  ---VFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYR 756
                  +V+  G+ SE  Y +                  + + + F  E D ++FID +R
Sbjct: 235  RPYAMKFVKENGLESEKEYPY----------------SALKHDQCFLKENDTRVFIDDFR 278

Query: 755  RIVSKETFDNMAVCGTEAYDTLNAALDNQPLTGTFGITL-RFRYWEGDKIYKPIPG---Q 588
             + + E             D  N      P+T  FG+ + +  Y     I+ P      +
Sbjct: 279  MLSNNEE------------DIANWVGTKGPVT--FGMNVVKAMYSYRSGIFNPSVEDCTE 324

Query: 587  KVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
            K  G HA+T++GYG E    Y++++NSWG +WG  GY R+ R
Sbjct: 325  KSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLAR 366


>gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 54/225 (24%), Positives = 91/225 (40%)
 Frame = -1

Query: 1091 RDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLVFSYV 912
            ++QG+  +C A      M G     + I   +L+ + E+   +       C  G +    
Sbjct: 162  KNQGDCGSCWAFSSTGAMEG----INAITTGELISLSEQELVDCDTTNEGCDGGYM---- 213

Query: 911  RVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVSKETF 732
                   +  ++WV       S +  P+ GQ  +    T E+ + + ID Y  + + E+ 
Sbjct: 214  -------DYAFEWVINNGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVATSESA 266

Query: 731  DNMAVCGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVG 552
                         L AA+      G  G +L F+ + G        G      HA+ +VG
Sbjct: 267  ------------LLCAAVQQPVSVGIDGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVG 314

Query: 551  YGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYAPVNTHIP 417
            YGQ+ G +Y++++NSWG  WG +GY          IY   NT +P
Sbjct: 315  YGQQGGTDYWIVKNSWGTDWGMQGY----------IYIRRNTGLP 349


>ref|XP_007025363.1| Xylem bark cysteine peptidase 3 isoform 1 [Theobroma cacao]
            gi|508780729|gb|EOY27985.1| Xylem bark cysteine peptidase
            3 isoform 1 [Theobroma cacao]
          Length = 501

 Score = 66.2 bits (160), Expect = 3e-08
 Identities = 54/231 (23%), Positives = 98/231 (42%), Gaps = 1/231 (0%)
 Frame = -1

Query: 1091 RDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLVFSYV 912
            +DQG+  +C A      M G     + +    L+ + E+   +     + C  G +    
Sbjct: 156  KDQGSCGSCWAFSSTGAMEG----INALVTGNLISLSEQELMDCDSTNYGCDGGYM---- 207

Query: 911  RVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVSKETF 732
                   +  ++WV       S +  P++G +      T E+ + + ID Y+ +   E  
Sbjct: 208  -------DYAFEWVINNGGIDSEADYPYEG-VDGTCNITKEETKVVSIDGYKDV---EES 256

Query: 731  DNMAVCGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLV 555
            D+  +C          A+  QP++ G    ++ F+ + G               HA+ +V
Sbjct: 257  DSALLC----------AVVQQPVSVGIDASSIDFQLYTGGIFDGSCSDNPDDIDHAVLIV 306

Query: 554  GYGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYAPVNTHIPADSPT 402
            GYG E G++Y++++NSWG +WG  GY   L++D D  Y     +  A  PT
Sbjct: 307  GYGSEDGEDYWIVKNSWGTSWGMDGY-FYLKRDTDLPYGVCAVNAMASYPT 356


>ref|XP_006362397.1| PREDICTED: oryzain alpha chain-like [Solanum tuberosum]
          Length = 496

 Score = 65.9 bits (159), Expect = 4e-08
 Identities = 52/214 (24%), Positives = 87/214 (40%), Gaps = 2/214 (0%)
 Frame = -1

Query: 1097 KTRDQGNTNACTAMVVRACMYGEE--VGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLV 924
            K +DQG   AC A      M G    V    I + +   +  + S N   KG     GL+
Sbjct: 160  KVKDQGECGACWAFSASGAMEGINAIVAGELISLSEQELIDCDTSHNSGCKG-----GLM 214

Query: 923  FSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVS 744
                       +  ++WV   S   S +  P+         ++    + + ID YR +  
Sbjct: 215  -----------DPAFEWVINNSGIDSAADYPYTAHSQGHCNYSKVNHKVVTIDGYRDVPK 263

Query: 743  KETFDNMAVCGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKPIPGQKVHGKHAI 564
            +E+    A+    A   ++ A+D        G +  F+ ++G            +  H +
Sbjct: 264  EES----ALLCAAAQQPVSVAID--------GSSPDFQLYQGGIYDGECSDDPNNVSHGV 311

Query: 563  TLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 462
             +VGYG +  D+Y++I+NSWG  WG  GYG + R
Sbjct: 312  LIVGYGSDGHDDYWIIKNSWGTEWGMEGYGYIRR 345


>ref|XP_006396923.1| hypothetical protein EUTSA_v10028733mg [Eutrema salsugineum]
            gi|557097940|gb|ESQ38376.1| hypothetical protein
            EUTSA_v10028733mg [Eutrema salsugineum]
          Length = 379

 Score = 65.9 bits (159), Expect = 4e-08
 Identities = 52/217 (23%), Positives = 96/217 (44%), Gaps = 1/217 (0%)
 Frame = -1

Query: 1097 KTRDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLVFS 918
            + +DQG+  +C A        G   G + I   +L+ + E+   N   +   C  G V  
Sbjct: 166  EVKDQGHCRSCWAFST----VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKV-- 219

Query: 917  YVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVSKE 738
                     E+ Y+++       + +  P+       +G   E ++K+ ID Y  + + +
Sbjct: 220  ---------ETAYEFIVNNGGLGTDNDYPYKAVNGACDGRLKENNKKVMIDGYENLPAND 270

Query: 737  TFDNMAVCGTEAYDTLNAALDNQPLTGTFGITLR-FRYWEGDKIYKPIPGQKVHGKHAIT 561
             F             L  A+ +QP+T     + R F+ +E   ++    G  ++  H + 
Sbjct: 271  EF------------ALMKAVAHQPVTAVIDSSSRDFQLYESG-VFDGTCGTNLN--HGVV 315

Query: 560  LVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVD 450
            +VGYG E G +Y++++NSWG TWG  GY ++ R  V+
Sbjct: 316  VVGYGTENGHDYWIVRNSWGNTWGEAGYMKMARNIVN 352


>gb|EYB81604.1| hypothetical protein Y032_0378g287 [Ancylostoma ceylanicum]
          Length = 342

 Score = 65.5 bits (158), Expect = 5e-08
 Identities = 63/226 (27%), Positives = 103/226 (45%), Gaps = 16/226 (7%)
 Frame = -1

Query: 1091 RDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLV---F 921
            RDQ    +C A+ V A + G+ V       +K+     +        GF C  G     +
Sbjct: 108  RDQSACGSCWAVSVGAAI-GDRVCTQSNSTKKMDASDTDLLACCKNCGFGCQGGYTIRAW 166

Query: 920  SYVRVRGVSSESFYKW--VNGPSRFQSC---SGEPWDGQITNREGFTIEKDRKIFIDSYR 756
             Y+   GV +   YK   V  P  F  C   + + + G+  + EG+   K RKI    YR
Sbjct: 167  EYLMNEGVCTGGRYKQKGVCKPYAFHPCGRHANQKYYGECPD-EGWKTPKCRKICQLRYR 225

Query: 755  RIVSKETFDNMAVCGTEAYDTLN-------AALDNQPLTGTFGITLRFRYWEGDKIYKPI 597
            +     ++++  + G  AY   N         ++N P+  +F +   F+++ G  IY   
Sbjct: 226  K-----SYEDDKIYGIRAYQLPNNERSIRKEIMNNGPVVASFRVYSDFKFYTGG-IYIHT 279

Query: 596  PGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCR-GYGRVLR 462
             G +  G HA+ ++G+G+E G +Y++I NSW   WG   GY R+LR
Sbjct: 280  GGYE-RGGHAVKVIGWGRENGTDYWLIANSWNTDWGENGGYFRILR 324


>gb|EYB81603.1| hypothetical protein Y032_0378g287 [Ancylostoma ceylanicum]
          Length = 2733

 Score = 65.5 bits (158), Expect = 5e-08
 Identities = 63/226 (27%), Positives = 103/226 (45%), Gaps = 16/226 (7%)
 Frame = -1

Query: 1091 RDQGNTNACTAMVVRACMYGEEVGFSFIDVQKLMGVPEEGSFNVPIKGFTCSYGLV---F 921
            RDQ    +C A+ V A + G+ V       +K+     +        GF C  G     +
Sbjct: 2499 RDQSACGSCWAVSVGAAI-GDRVCTQSNSTKKMDASDTDLLACCKNCGFGCQGGYTIRAW 2557

Query: 920  SYVRVRGVSSESFYKW--VNGPSRFQSC---SGEPWDGQITNREGFTIEKDRKIFIDSYR 756
             Y+   GV +   YK   V  P  F  C   + + + G+  + EG+   K RKI    YR
Sbjct: 2558 EYLMNEGVCTGGRYKQKGVCKPYAFHPCGRHANQKYYGECPD-EGWKTPKCRKICQLRYR 2616

Query: 755  RIVSKETFDNMAVCGTEAYDTLN-------AALDNQPLTGTFGITLRFRYWEGDKIYKPI 597
            +     ++++  + G  AY   N         ++N P+  +F +   F+++ G  IY   
Sbjct: 2617 K-----SYEDDKIYGIRAYQLPNNERSIRKEIMNNGPVVASFRVYSDFKFYTGG-IYIHT 2670

Query: 596  PGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCR-GYGRVLR 462
             G +  G HA+ ++G+G+E G +Y++I NSW   WG   GY R+LR
Sbjct: 2671 GGYE-RGGHAVKVIGWGRENGTDYWLIANSWNTDWGENGGYFRILR 2715


Top