BLASTX nr result

ID: Mentha23_contig00017446 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00017446
         (1708 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoide...    75   1e-10
ref|XP_006362397.1| PREDICTED: oryzain alpha chain-like [Solanum...    69   7e-09
ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula] gi...    69   8e-09
gb|AAP41846.1| cysteine protease [Anthurium andraeanum]                68   1e-08
ref|XP_006655467.1| PREDICTED: cysteine proteinase RD21a-like [O...    67   3e-08
ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [S...    67   3e-08
gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]                    67   3e-08
gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japo...    67   3e-08
gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi...    67   3e-08
ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] g...    67   3e-08
gb|ETN70308.1| papain family cysteine protease [Necator americanus]    66   4e-08
ref|XP_004232961.1| PREDICTED: thiol protease SEN102-like [Solan...    66   4e-08
ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tub...    66   6e-08
ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578...    66   6e-08
ref|XP_002317417.2| hypothetical protein POPTR_0011s07300g [Popu...    65   7e-08
ref|XP_002139627.1| papain family cysteine protease [Cryptospori...    65   7e-08
ref|NP_001141813.1| uncharacterized protein LOC100273952 precurs...    65   7e-08
gb|ABK95906.1| unknown [Populus trichocarpa]                           65   7e-08
emb|CDJ83395.1| Peptidase C1A domain containing protein [Haemonc...    65   9e-08
gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditi...    65   9e-08

>gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score = 74.7 bits (182), Expect = 1e-10
 Identities = 48/154 (31%), Positives = 78/154 (50%)
 Frame = +3

Query: 765  FSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVP 944
            +++V   G+++ES + + +G  R  SC     +G +  R            I++YRR+  
Sbjct: 137  WTWVLTTGITTESCWPYRSGSGRIPSCPHRCVNGSVLQRNT----------INNYRRLDS 186

Query: 945  KETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKPIPGQKVHGKHAI 1124
             E  D +               +N P+  T+ +   F Y+    IYK + G KV G HA+
Sbjct: 187  SELQDEL--------------YNNGPIQVTYVVYEDFFYYSKG-IYKHLSGNKVGG-HAV 230

Query: 1125 TLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 1226
             L+G+G E G +Y+++QNSWG  WG +GY R+LR
Sbjct: 231  VLMGWGIEDGVKYWLVQNSWGYEWGEQGYFRILR 264


>ref|XP_006362397.1| PREDICTED: oryzain alpha chain-like [Solanum tuberosum]
          Length = 496

 Score = 68.9 bits (167), Expect = 7e-09
 Identities = 61/243 (25%), Positives = 93/243 (38%)
 Frame = +3

Query: 543  KTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSF 722
            K +DQG   AC A      M G   +  G     S QE+ D               + S 
Sbjct: 160  KVKDQGECGACWAFSASGAMEGINAIVAGELISLSEQELIDC--------------DTSH 205

Query: 723  NVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEK 902
            N   KG               G+   +F +WV   S   S +  P+         ++   
Sbjct: 206  NSGCKG---------------GLMDPAF-EWVINNSGIDSAADYPYTAHSQGHCNYSKVN 249

Query: 903  DRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIY 1082
             + + ID YR  VPKE     A+    A   ++ A+D        G +  F+ ++G    
Sbjct: 250  HKVVTIDGYRD-VPKE---ESALLCAAAQQPVSVAID--------GSSPDFQLYQGGIYD 297

Query: 1083 KPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYVPVNT 1262
                    +  H + +VGYG +  D+Y++I+NSWG  WG  GYG          Y+  NT
Sbjct: 298  GECSDDPNNVSHGVLIVGYGSDGHDDYWIIKNSWGTEWGMEGYG----------YIRRNT 347

Query: 1263 HIP 1271
            ++P
Sbjct: 348  NLP 350


>ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula] gi|355482813|gb|AES64016.1|
            Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score = 68.6 bits (166), Expect = 8e-09
 Identities = 63/228 (27%), Positives = 97/228 (42%), Gaps = 1/228 (0%)
 Frame = +3

Query: 549  RDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSFNV 728
            +DQG+   C A  V A + G   + TG     S Q++ D     R     G         
Sbjct: 166  KDQGSCGCCWAFSVVAAVEGAVKINTGELISLSEQQLVDCD--ERNSGCHGG-------- 215

Query: 729  PIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDR 908
                   + +  F Y+  +G+ SE+ Y +  G    Q      ++ QITN          
Sbjct: 216  -------NMDSAFKYIIQKGIVSEADYPYQEGSQTCQLNDQMKFEAQITN---------- 258

Query: 909  KIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKP 1088
              FID     VP                 L  A+  QP++    +   F+++ GD +Y  
Sbjct: 259  --FID-----VP-----------ANDEQQLLQAVAQQPVSVGIEVGDEFQHYMGD-VYSG 299

Query: 1089 IPGQKVHGKHAITLVGYG-QEAGDEYFVIQNSWGRTWGCRGYGRVLRK 1229
              GQ ++  HA+T VGYG  E G +Y++I+NSWG+ WG  GY ++LR+
Sbjct: 300  TCGQSMN--HAVTAVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRE 345


>gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 59/241 (24%), Positives = 91/241 (37%)
 Frame = +3

Query: 549  RDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSFNV 728
            ++QG+  +C A      M G   + TG     S QE+ D               EG    
Sbjct: 162  KNQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDC----------DTTNEG---- 207

Query: 729  PIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDR 908
                  C            G   +  ++WV       S +  P+ GQ  +    T E+ +
Sbjct: 208  ------CD-----------GGYMDYAFEWVINNGGIDSEANYPYTGQADSVCNTTKEEIK 250

Query: 909  KIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIYKP 1088
             + ID Y  +   E+              L AA+      G  G +L F+ + G      
Sbjct: 251  VVSIDGYEDVATSESA------------LLCAAVQQPVSVGIDGSSLDFQLYAGGIYDGD 298

Query: 1089 IPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYVPVNTHI 1268
              G      HA+ +VGYGQ+ G +Y++++NSWG  WG +GY          IY+  NT +
Sbjct: 299  CSGNPDDIDHAVLVVGYGQQGGTDYWIVKNSWGTDWGMQGY----------IYIRRNTGL 348

Query: 1269 P 1271
            P
Sbjct: 349  P 349


>ref|XP_006655467.1| PREDICTED: cysteine proteinase RD21a-like [Oryza brachyantha]
          Length = 377

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 57/224 (25%), Positives = 87/224 (38%), Gaps = 1/224 (0%)
 Frame = +3

Query: 543  KTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSF 722
            K +DQG+  AC +      M G   ++TG     S QE+ D               + S+
Sbjct: 67   KVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDC--------------DRSY 112

Query: 723  NVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEK 902
            N    G    Y   F  V+  G+ +E  Y +               DG     +     K
Sbjct: 113  NTGCGGGLMDYAYKF-VVKNGGIDTEEDYPY------------RETDGTCNKNK----LK 155

Query: 903  DRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKI 1079
             R + ID Y+ +                 D L  A+  QP++ G  G    F+ +     
Sbjct: 156  RRVVTIDGYKDVPANNE------------DLLLQAVAQQPVSVGICGSARAFQLYSKGIF 203

Query: 1080 YKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGY 1211
              P P       HA+ +VGYG E G +Y++++NSWG +WG +GY
Sbjct: 204  DGPCPTSL---DHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGY 244


>ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
            gi|241945324|gb|EES18469.1| hypothetical protein
            SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 60/234 (25%), Positives = 92/234 (39%), Gaps = 2/234 (0%)
 Frame = +3

Query: 543  KTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSF 722
            K +DQG+  AC +      M G   ++TG     S QE+ D               + S+
Sbjct: 153  KVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDC--------------DRSY 198

Query: 723  NVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEK 902
            N    G    Y   F  V+  G+ +E  Y +               DG     +     K
Sbjct: 199  NSGCGGGLMDYAYKF-VVKNGGIDTEEDYPYREA------------DGTCNKNK----LK 241

Query: 903  DRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKI 1079
             R + ID Y  +   +             D L  A+  QP++ G  G    F+ +    I
Sbjct: 242  KRIVTIDGYSDVPSNKE------------DLLLQAVAQQPVSVGICGSARAFQLYSQQGI 289

Query: 1080 YK-PIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVD 1238
            +  P P       HA+ +VGYG E G +Y++++NSWG +WG +GY  + R   D
Sbjct: 290  FDGPCPTSL---DHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTGD 340


>gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 58/230 (25%), Positives = 103/230 (44%)
 Frame = +3

Query: 537  LLKTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEG 716
            ++  ++QG  ++C A    A +     + TG     S QE+ D    +R P  +G   +G
Sbjct: 138  VVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDC---NRTPINEGC--KG 192

Query: 717  SFNVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTI 896
             F         +YE +   +   G+++E  Y ++    +      EP   Q         
Sbjct: 193  GF------MDDAYEFI---INNGGINTEENYPYIGQDDQCD----EPKKNQ--------- 230

Query: 897  EKDRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDK 1076
                 + IDSY ++ P    D +A++   AY  ++ A+D           L FR+++   
Sbjct: 231  ---NYVTIDSYEQVPPN---DELAMKRAVAYQPVSVAID--------AYCLGFRFYQSGI 276

Query: 1077 IYKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 1226
                  G  ++  HA+T++GYG E G +Y++++NS+G  WG  GYG+V R
Sbjct: 277  FTGGSCGTTLN--HAVTIIGYGTENGIDYWIVKNSYGTQWGESGYGKVQR 324


>gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
          Length = 1105

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 58/224 (25%), Positives = 88/224 (39%), Gaps = 1/224 (0%)
 Frame = +3

Query: 543  KTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSF 722
            K +DQG+  AC +      M G   ++TG     S QE+ D               + S+
Sbjct: 143  KVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDC--------------DRSY 188

Query: 723  NVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEK 902
            N    G    Y   F  V+  G+ +E+ Y +               DG     +     K
Sbjct: 189  NSGCGGGLMDYAYKF-VVKNGGIDTEADYPY------------RETDGTCNKNK----LK 231

Query: 903  DRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKI 1079
             R + ID Y+ +                 D L  A+  QP++ G  G    F+ +     
Sbjct: 232  RRVVTIDGYKDVPANNE------------DMLLQAVAQQPVSVGICGSARAFQLYSKGIF 279

Query: 1080 YKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGY 1211
              P P       HAI +VGYG E G +Y++++NSWG +WG +GY
Sbjct: 280  DGPCPTSL---DHAILIVGYGSEGGKDYWIVKNSWGESWGMKGY 320


>gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 58/224 (25%), Positives = 88/224 (39%), Gaps = 1/224 (0%)
 Frame = +3

Query: 543  KTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSF 722
            K +DQG+  AC +      M G   ++TG     S QE+ D               + S+
Sbjct: 139  KVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDC--------------DRSY 184

Query: 723  NVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEK 902
            N    G    Y   F  V+  G+ +E+ Y +               DG     +     K
Sbjct: 185  NSGCGGGLMDYAYKF-VVKNGGIDTEADYPY------------RETDGTCNKNK----LK 227

Query: 903  DRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKI 1079
             R + ID Y+ +                 D L  A+  QP++ G  G    F+ +     
Sbjct: 228  RRVVTIDGYKDVPANNE------------DMLLQAVAQQPVSVGICGSARAFQLYSKGIF 275

Query: 1080 YKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGY 1211
              P P       HAI +VGYG E G +Y++++NSWG +WG +GY
Sbjct: 276  DGPCPTSL---DHAILIVGYGSEGGKDYWIVKNSWGESWGMKGY 316


>ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] gi|48475189|gb|AAT44258.1|
            hypothetical protein [Oryza sativa Japonica Group]
            gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa
            Japonica Group]
          Length = 450

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 58/224 (25%), Positives = 88/224 (39%), Gaps = 1/224 (0%)
 Frame = +3

Query: 543  KTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSF 722
            K +DQG+  AC +      M G   ++TG     S QE+ D               + S+
Sbjct: 140  KVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDC--------------DRSY 185

Query: 723  NVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEK 902
            N    G    Y   F  V+  G+ +E+ Y +               DG     +     K
Sbjct: 186  NSGCGGGLMDYAYKF-VVKNGGIDTEADYPY------------RETDGTCNKNK----LK 228

Query: 903  DRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKI 1079
             R + ID Y+ +                 D L  A+  QP++ G  G    F+ +     
Sbjct: 229  RRVVTIDGYKDVPANNE------------DMLLQAVAQQPVSVGICGSARAFQLYSKGIF 276

Query: 1080 YKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGY 1211
              P P       HAI +VGYG E G +Y++++NSWG +WG +GY
Sbjct: 277  DGPCPTSL---DHAILIVGYGSEGGKDYWIVKNSWGESWGMKGY 317


>gb|ETN70308.1| papain family cysteine protease [Necator americanus]
          Length = 414

 Score = 66.2 bits (160), Expect = 4e-08
 Identities = 66/237 (27%), Positives = 101/237 (42%), Gaps = 6/237 (2%)
 Frame = +3

Query: 534  RLLKTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEE 713
            +++  +DQG   +C A    A +     ++TG  T  S QE+           VD     
Sbjct: 188  KVIDVKDQGQCGSCWAFATVASVEAAYAIKTGQLTRLSEQEL-----------VDCDSRN 236

Query: 714  GSFNVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFT 893
               N   + +  SY  V++    R      F K V      +  +    D  +T    +T
Sbjct: 237  NGCNGGYRPYAMSYNKVYADENERNFRFTVFVKNVVYFEE-EERNHPGLDLDVTRFADWT 295

Query: 894  IEKDRK--IFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLR-FRYW 1064
             E+ RK  ++I SYR +   E  D +A       D + A   N P+T    +T   + Y 
Sbjct: 296  EEEMRKERVYIRSYRTLSSNE--DAVA-------DWIAA---NGPVTFGMNVTKSLYSYR 343

Query: 1065 EGDKIYKPIPGQ-KVH--GKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 1226
             G  I+ P     + H  G HA+T VGYG E G  Y++++NSWG  WG  GY ++ R
Sbjct: 344  SG--IFSPSKEDCEEHSLGSHALTFVGYGTEGGQPYWLVKNSWGSRWGQNGYFKMAR 398


>ref|XP_004232961.1| PREDICTED: thiol protease SEN102-like [Solanum lycopersicum]
          Length = 341

 Score = 66.2 bits (160), Expect = 4e-08
 Identities = 60/233 (25%), Positives = 100/233 (42%), Gaps = 3/233 (1%)
 Frame = +3

Query: 549  RDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSFNV 728
            +DQG   +C A    A + G   L+TG+    S Q++ D    +      GV  E    +
Sbjct: 141  KDQGKCGSCWAFSAVAAVEGLNQLKTGNLISLSEQQLLDCESRNNNGCGGGVRNEAFLYI 200

Query: 729  PIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDR 908
               G               G+++ES Y +   P    SC+ +             + +  
Sbjct: 201  AENG---------------GLTTESNYPYTGIPG---SCNSK-------------MAEST 229

Query: 909  KIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGI-TLRFRYWEGDKIYK 1085
             + I SY+ + P E+              L  A+  QP++    I + +FR+++G  I+ 
Sbjct: 230  AVTISSYKTVDPSES-------------ALLQAVLIQPVSAGVNIGSDKFRFYKGG-IFS 275

Query: 1086 PIPGQKVHGKHAITLVGYG--QEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVD 1238
               G+  H  HA+T+VGYG  ++    Y++++NSWG  WG  GY R+ R  VD
Sbjct: 276  GECGESSH--HAVTVVGYGTSEDGSSNYWLVKNSWGENWGESGYMRMARDVVD 326


>ref|XP_006360817.1| PREDICTED: pro-cathepsin H-like [Solanum tuberosum]
          Length = 346

 Score = 65.9 bits (159), Expect = 6e-08
 Identities = 56/254 (22%), Positives = 101/254 (39%), Gaps = 14/254 (5%)
 Frame = +3

Query: 537  LLKTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEG 716
            L    DQG+   C A V    +     ++       S + + D +F      V   P+E 
Sbjct: 126  LSSVEDQGDCPTCWAFVAAEAISALYHIKFNKEKFLSKKHVIDDLF-----TVIKEPKEI 180

Query: 717  SFNVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTI 896
             ++     F   Y   F Y   +GV  +  Y ++                     E   +
Sbjct: 181  QYHKATGCFPSHYNNYFQYAIEKGVYPDKPYPYL-----------------AERGECLEL 223

Query: 897  EKDRKIFIDSYRRI----VPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLRFRYW 1064
              + K  I +Y+++    + K++ + +              +  QP+ G+  +   F+  
Sbjct: 224  PNEEKTKIKAYKKVNDLGLDKKSIEEL--------------IQKQPICGSVKLAKNFQKH 269

Query: 1065 EGDKIYKPIPGQKVH----------GKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYG 1214
            +G  IY     ++++          G+HA+ ++G+G E G EY++I+NSWG  WG  GY 
Sbjct: 270  KGKDIYMGQTKEEIYSEASKNNQSRGRHAVLIIGFGIENGIEYYLIKNSWGVNWGYLGYA 329

Query: 1215 RVLRKDVDRIYVPV 1256
            RV R+ V  +  PV
Sbjct: 330  RVERRLVTSLSFPV 343


>ref|XP_006347646.1| PREDICTED: uncharacterized protein LOC102578529 [Solanum tuberosum]
          Length = 893

 Score = 65.9 bits (159), Expect = 6e-08
 Identities = 44/186 (23%), Positives = 82/186 (44%), Gaps = 14/186 (7%)
 Frame = +3

Query: 741  FTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFI 920
            F CSY   + +    G++ E+ Y ++    + + C  E                 R I I
Sbjct: 724  FPCSYNKAYKFAMDYGITVETKYPFMEERGKCE-CQSEM----------------RIIKI 766

Query: 921  DSYRRIVPK-ETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLRFRYWEGDKIY----- 1082
            + ++R+    +  +  A+   +  + +   +  QP+T         +   G  +Y     
Sbjct: 767  NGFQRVSELIKELEEKAIEKLDEKEIIEKLIRQQPITCAALHVPSLQLHRGKGVYMGPTE 826

Query: 1083 --------KPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVD 1238
                    K   GQ V GKHA+ +VGYG+E G E+++++NSWG  WG +GY ++ R  + 
Sbjct: 827  NEIAQVRQKETEGQVV-GKHAMLIVGYGEEEGVEFYLVKNSWGTEWGYQGYAKIKRSALS 885

Query: 1239 RIYVPV 1256
            ++  P+
Sbjct: 886  KLSYPI 891


>ref|XP_002317417.2| hypothetical protein POPTR_0011s07300g [Populus trichocarpa]
            gi|550327861|gb|EEE98029.2| hypothetical protein
            POPTR_0011s07300g [Populus trichocarpa]
          Length = 498

 Score = 65.5 bits (158), Expect = 7e-08
 Identities = 53/208 (25%), Positives = 85/208 (40%), Gaps = 1/208 (0%)
 Frame = +3

Query: 786  GVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVPKETFDNM 965
            G   +S ++WV G     + +  P+ G +        E+ + + I+ Y  + P ++    
Sbjct: 202  GGDMDSAFQWVIGNGGIDTEADYPYTG-VDGTCNTAKEEKKVVSIEGYVDVDPSDS---- 256

Query: 966  AVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVGYG 1142
                      L  A   QP++ G  G  L F+ + G        G      HAI +VGYG
Sbjct: 257  ---------ALLCATVQQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYG 307

Query: 1143 QEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYVPVNTHIPADSPTYIPNQGDKRDRD 1322
             E  ++Y++++NSWG  WG  GY   +R++  + Y     +  A  PT +P+        
Sbjct: 308  SENDEDYWIVKNSWGTEWGMEGY-FYIRRNTSKPYGVCAINADASYPTKVPSPPSP---P 363

Query: 1323 SPXXXXXXXXXXXXAQPFKKQHIDCGDS 1406
            SP              P   Q  DCGDS
Sbjct: 364  SPPPPPSPPPPPPSPPPPCPQPSDCGDS 391


>ref|XP_002139627.1| papain family cysteine protease [Cryptosporidium muris RN66]
            gi|209555233|gb|EEA05278.1| papain family cysteine
            protease, putative [Cryptosporidium muris RN66]
          Length = 661

 Score = 65.5 bits (158), Expect = 7e-08
 Identities = 66/256 (25%), Positives = 101/256 (39%), Gaps = 8/256 (3%)
 Frame = +3

Query: 552  DQGNTNACTAMVVRACMYGEEVLRTGHSTEA-SPQEIHDWVFFHRRPKVDGVPEEGSFNV 728
            +QG    C A V  + +     ++TG    + SPQ+I D    +     DG         
Sbjct: 397  NQGECGGCYAFVAVSTINTATCVQTGLLIASLSPQQIIDCSGQYGNEGCDG--------- 447

Query: 729  PIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDR 908
               GF   Y   +SY      S  S   W N P  +    G     Q T  E        
Sbjct: 448  ---GF---YANSWSYALSSNSSPNSICSWDNYP--YDDNVGTCTASQCTGCET------- 492

Query: 909  KIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITLR-----FRYWEGD 1073
               + SY      E F + ++ GT+ +D + + L   P  G   +++      F  + G 
Sbjct: 493  ---VGSY------EIFSSYSINGTDGWDFVTSML---PRYGVIAVSVNSQLPGFHEYSGG 540

Query: 1074 KIYKPIPGQKVHGKHAITLVGYG-QEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYV 1250
                P    +    HA+ L+GYG  + G++Y+V+QNSWG TWG  G+  V     D  ++
Sbjct: 541  IYKAPKCSSREELDHAVLLIGYGISDQGEKYYVMQNSWGITWGIEGFMNVSADSCDMFWL 600

Query: 1251 P-VNTHIPADSPTYIP 1295
            P +    P+DS    P
Sbjct: 601  PGIINQFPSDSINTCP 616


>ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
            gi|194706024|gb|ACF87096.1| unknown [Zea mays]
            gi|413945958|gb|AFW78607.1| hypothetical protein
            ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score = 65.5 bits (158), Expect = 7e-08
 Identities = 58/233 (24%), Positives = 90/233 (38%), Gaps = 1/233 (0%)
 Frame = +3

Query: 543  KTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEEGSF 722
            K +DQG+  AC +      M G   ++TG     S QE+ D               + S+
Sbjct: 151  KVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDC--------------DRSY 196

Query: 723  NVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEK 902
            N    G    Y   F  ++  G+ +E  Y +               DG     +     K
Sbjct: 197  NSGCGGGLMDYAYKF-VIKNGGIDTEEDYPYREA------------DGTCNKNK----LK 239

Query: 903  DRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKI 1079
             R + ID Y  +   +             D L  A+  QP++ G  G    F+ +     
Sbjct: 240  KRVVTIDGYTDVPSNKE------------DLLLQAVAQQPVSVGICGSARAFQLYYQGIF 287

Query: 1080 YKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVD 1238
              P P       HA+ +VGYG E G +Y++++NSWG +WG +GY  + R   D
Sbjct: 288  DGPCPTSL---DHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTGD 337


>gb|ABK95906.1| unknown [Populus trichocarpa]
          Length = 498

 Score = 65.5 bits (158), Expect = 7e-08
 Identities = 53/208 (25%), Positives = 85/208 (40%), Gaps = 1/208 (0%)
 Frame = +3

Query: 786  GVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFTIEKDRKIFIDSYRRIVPKETFDNM 965
            G   +S ++WV G     + +  P+ G +        E+ + + I+ Y  + P ++    
Sbjct: 202  GGDMDSAFQWVIGNGGIDTEADYPYTG-VDGTCNTAKEEKKVVSIEGYVDVDPSDS---- 256

Query: 966  AVRGTEAYDTLNAALDNQPLT-GTFGITLRFRYWEGDKIYKPIPGQKVHGKHAITLVGYG 1142
                      L  A   QP++ G  G  L F+ + G        G      HAI +VGYG
Sbjct: 257  ---------ALLCATVQQPISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYG 307

Query: 1143 QEAGDEYFVIQNSWGRTWGCRGYGRVLRKDVDRIYVPVNTHIPADSPTYIPNQGDKRDRD 1322
             E  ++Y++++NSWG  WG  GY   +R++  + Y     +  A  PT +P+        
Sbjct: 308  SENDEDYWIVKNSWGTEWGMEGY-FYIRRNTSKPYGVCAINADASYPTKVPSPPSP---P 363

Query: 1323 SPXXXXXXXXXXXXAQPFKKQHIDCGDS 1406
            SP              P   Q  DCGDS
Sbjct: 364  SPPPPPSPPPPPPSPPPPCPQPSDCGDS 391


>emb|CDJ83395.1| Peptidase C1A domain containing protein [Haemonchus contortus]
          Length = 341

 Score = 65.1 bits (157), Expect = 9e-08
 Identities = 53/169 (31%), Positives = 81/169 (47%), Gaps = 15/169 (8%)
 Frame = +3

Query: 765  FSYVRVRGVSSESFYKWVNG--PSRFQSCSGEPWD---GQITNREGFT---IEKDRKIFI 920
            F+Y   +G  +   YK  +G  P  F  C     D   G+  N E  T   + K +K + 
Sbjct: 169  FNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPN-EATTPKCVRKCQKSYK 227

Query: 921  DSYRRIVPKETFDNMAVRGTEAYDTLNAA-------LDNQPLTGTFGITLRFRYWEGDKI 1079
             SY++        + ++ G +AY+  N+        + N P+ G F +   F Y++   I
Sbjct: 228  KSYKK--------DRSI-GKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKG-I 277

Query: 1080 YKPIPGQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 1226
            YK   G K  G HAI ++G+G+E G  Y++I NSW   WG  GY R+LR
Sbjct: 278  YKHTAG-KARGGHAIKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILR 325


>gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditis brenneri]
          Length = 389

 Score = 65.1 bits (157), Expect = 9e-08
 Identities = 55/235 (23%), Positives = 93/235 (39%), Gaps = 4/235 (1%)
 Frame = +3

Query: 534  RLLKTRDQGNTNACTAMVVRACMYGEEVLRTGHSTEASPQEIHDWVFFHRRPKVDGVPEE 713
            +L   ++QG   +C A    A +  +  ++ G     S QE+ D               +
Sbjct: 185  KLTPIKNQGQCGSCWAFATVAAVEAQHAIKKGQLVSLSEQEMVDC--------------D 230

Query: 714  GSFNVPIKGFTCSYELVFSYVRVRGVSSESFYKWVNGPSRFQSCSGEPWDGQITNREGFT 893
            G  N    G+         +V+  G+ SE  Y +                  + + + F 
Sbjct: 231  GRNNGCSGGYR---PYAMRFVKENGLESEKEYPY----------------SALKHDQCFL 271

Query: 894  IEKDRKIFIDSYRRIVPKETFDNMAVRGTEAYDTLNAALDNQPLTGTFGITL-RFRYWEG 1070
             + D ++FID +R +             T   D  N      P+T  FG+ + +  Y   
Sbjct: 272  KQNDTRVFIDDFRML------------STNEEDIANWVGTKGPVT--FGMNVVKAMYSYR 317

Query: 1071 DKIYKPIP---GQKVHGKHAITLVGYGQEAGDEYFVIQNSWGRTWGCRGYGRVLR 1226
              I+ P      +K  G HA+T+VGYG E    +++++NSWG +WG  GY R+ R
Sbjct: 318  SGIFNPSSEDCAEKSMGAHALTIVGYGGEGSSAFWIVKNSWGTSWGSSGYFRLAR 372


Top