BLASTX nr result

ID: Cocculus23_contig00022731 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00022731
         (1472 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007037884.1| Cysteine proteinases superfamily protein, pu...   316   1e-83
ref|XP_007037882.1| Cysteine proteinases superfamily protein, pu...   316   1e-83
ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251...   315   4e-83
ref|XP_007037885.1| Cysteine proteinases superfamily protein, pu...   305   4e-80
ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ri...   303   1e-79
ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Popu...   301   4e-79
ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305...   294   8e-77
ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Popu...   286   2e-74
ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citr...   284   6e-74
ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like ...   284   8e-74
ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303...   280   9e-73
ref|XP_007037886.1| Cysteine proteinases superfamily protein, pu...   277   8e-72
ref|XP_007037883.1| Cysteine proteinases superfamily protein, pu...   277   8e-72
ref|XP_007210601.1| hypothetical protein PRUPE_ppa017098mg [Prun...   273   2e-70
gb|AFK37750.1| unknown [Lotus japonicus]                              266   1e-68
ref|XP_006366658.1| PREDICTED: probable ubiquitin-like-specific ...   265   5e-68
ref|XP_007137950.1| hypothetical protein PHAVU_009G168700g [Phas...   263   1e-67
ref|XP_006852167.1| hypothetical protein AMTR_s00049p00094540 [A...   261   4e-67
ref|XP_004501822.1| PREDICTED: probable ubiquitin-like-specific ...   259   2e-66
ref|XP_004253246.1| PREDICTED: uncharacterized protein LOC101254...   259   2e-66

>ref|XP_007037884.1| Cysteine proteinases superfamily protein, putative isoform 3
           [Theobroma cacao] gi|508775129|gb|EOY22385.1| Cysteine
           proteinases superfamily protein, putative isoform 3
           [Theobroma cacao]
          Length = 273

 Score =  316 bits (810), Expect = 1e-83
 Identities = 145/243 (59%), Positives = 189/243 (77%), Gaps = 1/243 (0%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLW 346
           +++++++Q++++  LT  CFL   P R+RSK R  ++  I K   +LDS  F+CY+E LW
Sbjct: 30  KISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLW 89

Query: 347 KNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLI 526
            +F EEKRTSFAY DC WFA YRK + + KVL WIK + IFS+KYV VP+VCW HWSLLI
Sbjct: 90  SSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSHWSLLI 149

Query: 527 LCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPL 703
            CH GES +S+T+ PCMLLLDSLE+ANP+R+EPDIRKFVLDIYR E RPE+ ++I  IPL
Sbjct: 150 FCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPL 209

Query: 704 LVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKV 883
           LVPKVPQQ++  +CG FVLYFINLF+E APENF + EG P+FM ++WFN+E +E F EK+
Sbjct: 210 LVPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVECFCEKL 268

Query: 884 HTF 892
            +F
Sbjct: 269 DSF 271


>ref|XP_007037882.1| Cysteine proteinases superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508775127|gb|EOY22383.1| Cysteine
           proteinases superfamily protein, putative isoform 1
           [Theobroma cacao]
          Length = 291

 Score =  316 bits (810), Expect = 1e-83
 Identities = 145/243 (59%), Positives = 189/243 (77%), Gaps = 1/243 (0%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLW 346
           +++++++Q++++  LT  CFL   P R+RSK R  ++  I K   +LDS  F+CY+E LW
Sbjct: 48  KISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLW 107

Query: 347 KNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLI 526
            +F EEKRTSFAY DC WFA YRK + + KVL WIK + IFS+KYV VP+VCW HWSLLI
Sbjct: 108 SSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSHWSLLI 167

Query: 527 LCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPL 703
            CH GES +S+T+ PCMLLLDSLE+ANP+R+EPDIRKFVLDIYR E RPE+ ++I  IPL
Sbjct: 168 FCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPL 227

Query: 704 LVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKV 883
           LVPKVPQQ++  +CG FVLYFINLF+E APENF + EG P+FM ++WFN+E +E F EK+
Sbjct: 228 LVPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVECFCEKL 286

Query: 884 HTF 892
            +F
Sbjct: 287 DSF 289


>ref|XP_002262951.2| PREDICTED: uncharacterized protein LOC100251251 [Vitis vinifera]
           gi|297733618|emb|CBI14865.3| unnamed protein product
           [Vitis vinifera]
          Length = 295

 Score =  315 bits (806), Expect = 4e-83
 Identities = 150/242 (61%), Positives = 186/242 (76%), Gaps = 3/242 (1%)
 Frame = +2

Query: 167 RLARKKSQEMQNM-GLTLTCFLEKFPRRERSKMRTNNRKPI-PKLPQKLDSNMFQCYLEN 340
           R+ + + +E++ +   T  CF   FPR ERSK R N +  I  K  +KLD+  F+ Y  N
Sbjct: 46  RMTKHEIEEIKEIFEFTTPCFSNTFPRHERSKRRINCKNIIIRKEKKKLDTAAFEWYFRN 105

Query: 341 LWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSL 520
           LWK+FS++K++SF Y+DCLWF+ Y K +++ KVL WIK K IFSRKYVFVPIVCW+HWSL
Sbjct: 106 LWKSFSDDKKSSFGYLDCLWFSFYLKTSSREKVLNWIKKKRIFSRKYVFVPIVCWNHWSL 165

Query: 521 LILCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESI 697
           LILCH GES +SK R PCMLLLDSL+MANPKR+EP+IRKFV DIY+EE RPE  +LI  I
Sbjct: 166 LILCHFGESLESKIRAPCMLLLDSLQMANPKRLEPNIRKFVFDIYKEEGRPESKQLISKI 225

Query: 698 PLLVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFE 877
           PLLVPKVPQQ+N  +CG FVLYFINLF++ APENF VSEG P+FM +NWF  E+LE FF 
Sbjct: 226 PLLVPKVPQQRNGEECGNFVLYFINLFMDGAPENFSVSEGYPYFMKKNWFGPEALEHFFR 285

Query: 878 KV 883
           K+
Sbjct: 286 KL 287


>ref|XP_007037885.1| Cysteine proteinases superfamily protein, putative isoform 4
           [Theobroma cacao] gi|508775130|gb|EOY22386.1| Cysteine
           proteinases superfamily protein, putative isoform 4
           [Theobroma cacao]
          Length = 270

 Score =  305 bits (780), Expect = 4e-80
 Identities = 142/243 (58%), Positives = 186/243 (76%), Gaps = 1/243 (0%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLW 346
           +++++++Q++++  LT  CFL   P R+RSK R  ++  I K   +LDS  F+CY+E LW
Sbjct: 30  KISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLW 89

Query: 347 KNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLI 526
            +F EEKRTSFAY DC WFA YRK + + KVL WIK + IFS+KYV VP+VCW HWSLLI
Sbjct: 90  SSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCWSHWSLLI 149

Query: 527 LCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPL 703
            CH GES +S+T+ PCMLLLDSLE+ANP+R+EPDIRKFVLDIYR E RPE+ ++I  IPL
Sbjct: 150 FCHFGESLQSETKTPCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPL 209

Query: 704 LVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKV 883
           LVPK   Q++  +CG FVLYFINLF+E APENF + EG P+FM ++WFN+E +E F EK+
Sbjct: 210 LVPK---QRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVECFCEKL 265

Query: 884 HTF 892
            +F
Sbjct: 266 DSF 268


>ref|XP_002511251.1| sentrin/sumo-specific protease, putative [Ricinus communis]
           gi|223550366|gb|EEF51853.1| sentrin/sumo-specific
           protease, putative [Ricinus communis]
          Length = 294

 Score =  303 bits (776), Expect = 1e-79
 Identities = 144/248 (58%), Positives = 185/248 (74%), Gaps = 6/248 (2%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKL-----PQKLDSNMFQCY 331
           ++ +K++++++   L   CFL  FP R+RS+ R  ++  I ++      ++LDS  F CY
Sbjct: 46  KIKKKEAEKLRRFDLISQCFLGTFPTRQRSRRRIKHKFAITRVIKEKEKKRLDSGEFDCY 105

Query: 332 LENLWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDH 511
            +NLWK+FS+EKRTSF Y+D LWF  Y K + K KVL WIK K IFS+KYV VPIVCW H
Sbjct: 106 FQNLWKSFSKEKRTSFVYLDSLWFYWYLKASWKGKVLTWIKRKQIFSKKYVLVPIVCWGH 165

Query: 512 WSLLILCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLI 688
           WSLLI CHLGE S+S  R PCMLLLDSLEMANP+R+EPDIRKFVLDIY  E RPE+ KLI
Sbjct: 166 WSLLIFCHLGEVSESNDRTPCMLLLDSLEMANPRRLEPDIRKFVLDIYTSEGRPEDKKLI 225

Query: 689 ESIPLLVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLER 868
             IPLLVPKVPQQ+N  +CG +VLYFINLF+  AP++F + +  P+FMN+NWF+ E LER
Sbjct: 226 SQIPLLVPKVPQQRNGEECGNYVLYFINLFMLGAPDDFSIKD-YPYFMNKNWFSPECLER 284

Query: 869 FFEKVHTF 892
           F E++ +F
Sbjct: 285 FSEELESF 292


>ref|XP_002315114.1| hypothetical protein POPTR_0010s18760g [Populus trichocarpa]
           gi|222864154|gb|EEF01285.1| hypothetical protein
           POPTR_0010s18760g [Populus trichocarpa]
          Length = 298

 Score =  301 bits (772), Expect = 4e-79
 Identities = 143/242 (59%), Positives = 181/242 (74%), Gaps = 1/242 (0%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLW 346
           R+ +K+++E+++  LT  CFL+  P RERSK R      + KL ++LDS  F CY+ENLW
Sbjct: 52  RMTKKQAEEIESFKLTSPCFLQTIPCRERSKKRFKRNNAVSKLKKELDSVSFNCYMENLW 111

Query: 347 KNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLI 526
           K+FSE+K+ SFAY+D LWF +Y + ++  KVL+WIK KHIFS+KYV VPIV W HWSLLI
Sbjct: 112 KSFSEDKKMSFAYLDSLWFTMYTEASSGVKVLEWIKRKHIFSKKYVLVPIVRWCHWSLLI 171

Query: 527 LCHLGESSKSKT-RPCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPL 703
            CH GES  S+   PCMLLLDSLEMA+PKR+EPDIRKFV DIY  E RPE   +I  IPL
Sbjct: 172 FCHFGESLLSENITPCMLLLDSLEMASPKRLEPDIRKFVWDIYESEGRPENKHMISQIPL 231

Query: 704 LVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKV 883
           LVPKVPQQ+N  +CG +VL FINLF++ APENF + EG P+FM +NWF+ E LE F EK+
Sbjct: 232 LVPKVPQQRNGVECGNYVLNFINLFVQDAPENFHM-EGYPYFMKDNWFSPEGLEHFCEKL 290

Query: 884 HT 889
            +
Sbjct: 291 ES 292


>ref|XP_004309800.1| PREDICTED: uncharacterized protein LOC101305332 [Fragaria vesca
           subsp. vesca]
          Length = 330

 Score =  294 bits (752), Expect = 8e-77
 Identities = 154/280 (55%), Positives = 191/280 (68%), Gaps = 40/280 (14%)
 Frame = +2

Query: 170 LARKKSQEMQNMGLTLT--------CFLEKFPRRERS-------------KMRTNN---- 274
           LA  K+Q  Q++GL  T        CFL   P R RS               +T N    
Sbjct: 50  LAFSKAQR-QSLGLRETEEIKKISPCFLTSCPHRRRSVRSFKTKYVNLEVSRKTQNQESK 108

Query: 275 ------RKPI--------PKLPQKLDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFALY 412
                 RKP+         +  Q+LDS  FQCY E+LWK+FSE+K+TSF Y+DC+WF+LY
Sbjct: 109 ACAVSRRKPVLVSRGCRVSRRKQELDSGTFQCYFESLWKSFSEDKKTSFTYLDCIWFSLY 168

Query: 413 RKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLILCHLGESSKSKT-RPCMLLLDS 589
            K  TK KVL WIK KHIFS+KYVFVPIVCW HW+LLILCH GE+ +SKT RPCMLLLDS
Sbjct: 169 IKPTTKDKVLTWIKKKHIFSKKYVFVPIVCWSHWNLLILCHFGENLESKTQRPCMLLLDS 228

Query: 590 LEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYFI 769
           LEMA+P+R+EPDIRKFV+DI+REE RPE   L+  IPLLVPKVPQQ+N+ +CG FVLYFI
Sbjct: 229 LEMADPRRLEPDIRKFVVDIFREEGRPENMDLLRKIPLLVPKVPQQRNDQECGNFVLYFI 288

Query: 770 NLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKVHT 889
           NLF+ESAP+ F + E  P+FM +NWF  ESL+ F + +++
Sbjct: 289 NLFMESAPQTFSMEE-YPYFMKKNWFAYESLDCFCQDIYS 327


>ref|XP_002322226.2| hypothetical protein POPTR_0015s10250g [Populus trichocarpa]
           gi|550322421|gb|EEF06353.2| hypothetical protein
           POPTR_0015s10250g [Populus trichocarpa]
          Length = 292

 Score =  286 bits (731), Expect = 2e-74
 Identities = 139/243 (57%), Positives = 175/243 (72%), Gaps = 4/243 (1%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQ---KLDSNMFQCYLE 337
           ++ +++++E+++  LT  C+L  FP   RSK R   +K I K  +   KLDS  F CY E
Sbjct: 46  KITKQEAEELRSFKLTSQCYLGTFPCSARSKRRIKRKKAIVKEIREKIKLDSGAFDCYFE 105

Query: 338 NLWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWS 517
           ++W+NFSE+KRT   Y DCLWF LY K + K KVL WIK K IFS+KYV VPIV W HWS
Sbjct: 106 HMWRNFSEDKRTFITYFDCLWFNLYTKASFKGKVLTWIKKKQIFSKKYVLVPIVHWSHWS 165

Query: 518 LLILCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIES 694
           LLI CHLGES +SK R PCMLLLDSLE A P+ +EPDIRKFVLDIY+ E R E  +LI  
Sbjct: 166 LLIFCHLGESLQSKLRTPCMLLLDSLEKAGPRCLEPDIRKFVLDIYKSEGRAENKELISK 225

Query: 695 IPLLVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFF 874
           IPLLVPKVPQQ+   +CG +VLY+INLF++ APENF + +  P+FM +NWF+   LE FF
Sbjct: 226 IPLLVPKVPQQRGGEECGNYVLYYINLFVQGAPENFCMDD-YPYFMKQNWFSPGCLEAFF 284

Query: 875 EKV 883
           EK+
Sbjct: 285 EKL 287


>ref|XP_006440039.1| hypothetical protein CICLE_v10021330mg [Citrus clementina]
           gi|557542301|gb|ESR53279.1| hypothetical protein
           CICLE_v10021330mg [Citrus clementina]
          Length = 303

 Score =  284 bits (727), Expect = 6e-74
 Identities = 146/267 (54%), Positives = 179/267 (67%), Gaps = 18/267 (6%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKP-----------------IPKL 295
           R  +   Q+++N  LT  CFL  F  R RSK R   +                     + 
Sbjct: 43  RKMKISKQKIRNFELTAPCFLGTFSCRRRSKRRVKCKNTSLIKGKNSSSVKCKDMITKRR 102

Query: 296 PQKLDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSR 475
             KLDS  F+  L+NLW++FSE+K+  F Y+D LWF LYRK ++KAKVL WIK KHIFS+
Sbjct: 103 KNKLDSGKFEHLLDNLWRSFSEDKKAGFTYLDSLWFDLYRKPSSKAKVLTWIKRKHIFSK 162

Query: 476 KYVFVPIVCWDHWSLLILCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIY 652
           KYV VPIVCW HW+LLILC+ G S +SKTR PCMLLLDSLEM+NP R EPDIRKFV+DIY
Sbjct: 163 KYVLVPIVCWRHWNLLILCNFGGSFESKTRTPCMLLLDSLEMSNPWRFEPDIRKFVMDIY 222

Query: 653 REEARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFM 832
           + E RPE  +LI  IPLLVPKVPQQ+N  +CG FVLYFINLF+E APENF + E  P+FM
Sbjct: 223 KAEERPETKELISRIPLLVPKVPQQRNGEECGNFVLYFINLFVEGAPENFNL-EDYPYFM 281

Query: 833 NENWFNSESLERFFEKVHTFCRRMNNS 913
            +NWF +E L+        FC R+N+S
Sbjct: 282 EKNWFTAEDLD-------CFCERLNSS 301


>ref|XP_006476972.1| PREDICTED: sentrin-specific protease 1-like isoform X1 [Citrus
           sinensis] gi|568883543|ref|XP_006494525.1| PREDICTED:
           sentrin-specific protease 1-like isoform X1 [Citrus
           sinensis]
          Length = 303

 Score =  284 bits (726), Expect = 8e-74
 Identities = 146/267 (54%), Positives = 179/267 (67%), Gaps = 18/267 (6%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKP-----------------IPKL 295
           R  +   Q+++N  LT  CFL  F  R RSK R   +                     + 
Sbjct: 43  RKMKISKQKIRNFELTAPCFLGTFSCRRRSKRRVKCKNTSLIKGKNSSSVKCKDMITKRK 102

Query: 296 PQKLDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSR 475
             KLDS  F+  L+NLW++FSE+K+  F Y+D LWF LYRK ++KAKVL WIK KHIFS+
Sbjct: 103 KNKLDSGKFEHLLDNLWRSFSEDKKAGFTYLDSLWFDLYRKPSSKAKVLTWIKRKHIFSK 162

Query: 476 KYVFVPIVCWDHWSLLILCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIY 652
           KYV VPIVCW HW+LLILC+ G S +SKTR PCMLLLDSLEM+NP R EPDIRKFV+DIY
Sbjct: 163 KYVLVPIVCWRHWNLLILCNFGGSFESKTRTPCMLLLDSLEMSNPWRFEPDIRKFVMDIY 222

Query: 653 REEARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFM 832
           + E RPE  +LI  IPLLVPKVPQQ+N  +CG FVLYFINLF+E APENF + E  P+FM
Sbjct: 223 KAEDRPETKELISRIPLLVPKVPQQRNGEECGNFVLYFINLFVEGAPENFNL-EDYPYFM 281

Query: 833 NENWFNSESLERFFEKVHTFCRRMNNS 913
            +NWF +E L+        FC R+N+S
Sbjct: 282 EKNWFTAEDLD-------CFCERLNSS 301


>ref|XP_004293126.1| PREDICTED: uncharacterized protein LOC101303677 [Fragaria vesca
           subsp. vesca]
          Length = 360

 Score =  280 bits (717), Expect = 9e-73
 Identities = 143/279 (51%), Positives = 191/279 (68%), Gaps = 30/279 (10%)
 Frame = +2

Query: 170 LARKKSQEMQNMGLTLTCFLEKFPRRERS-----------------------KMRTNNRK 280
           L+++++QE++ +     CFL   P R+RS                           + RK
Sbjct: 57  LSQRETQEIKKIS---PCFLTFHPHRQRSVRSFKTKYVKRQVLRKKQNQESKACAVSRRK 113

Query: 281 PIPK------LPQKLDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVL 442
           P+ +        Q+LDS  FQ   E+LWK+FSE+K+T F Y+DCLWF+LY +  TK KVL
Sbjct: 114 PVSRGCRVSRKKQELDSGSFQSCFESLWKSFSEDKKTYFTYLDCLWFSLYIEPTTKDKVL 173

Query: 443 KWIKGKHIFSRKYVFVPIVCWDHWSLLILCHLGESSKSKT-RPCMLLLDSLEMANPKRIE 619
            WIK KHIFS+KYVFVPIVCW HWSLLILCH GE+ +SKT RPCMLLLDSLEM +PKR+E
Sbjct: 174 TWIKKKHIFSKKYVFVPIVCWCHWSLLILCHFGENLESKTQRPCMLLLDSLEMTDPKRLE 233

Query: 620 PDIRKFVLDIYREEARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYFINLFLESAPEN 799
           P+IR+FV+DI+REE R E   L+  IPLLVPKVP+Q+N+ +CG FVLYFINLF+ESAP+ 
Sbjct: 234 PNIRRFVVDIFREEGRRENMDLLRKIPLLVPKVPKQRNDQECGNFVLYFINLFMESAPQT 293

Query: 800 FKVSEGCPHFMNENWFNSESLERFFEKVHTFCRRMNNSS 916
           F + EG P+FM +NWF  ESL+ F +++++  +  + +S
Sbjct: 294 FSM-EGYPYFMKKNWFAYESLDCFCQEIYSSAKGCSQNS 331


>ref|XP_007037886.1| Cysteine proteinases superfamily protein, putative isoform 5
           [Theobroma cacao] gi|508775131|gb|EOY22387.1| Cysteine
           proteinases superfamily protein, putative isoform 5
           [Theobroma cacao]
          Length = 259

 Score =  277 bits (709), Expect = 8e-72
 Identities = 134/242 (55%), Positives = 177/242 (73%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLW 346
           +++++++Q++++  LT  CFL   P R+RSK R  ++  I K   +LDS  F+CY+E LW
Sbjct: 30  KISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLW 89

Query: 347 KNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLI 526
            +F EEKRTSFAY DC WFA YRK + + KVL WIK + IFS+KYV VP+VC        
Sbjct: 90  SSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCC------- 142

Query: 527 LCHLGESSKSKTRPCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLL 706
                  S++KT PCMLLLDSLE+ANP+R+EPDIRKFVLDIYR E RPE+ ++I  IPLL
Sbjct: 143 -----LQSETKT-PCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLL 196

Query: 707 VPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKVH 886
           VPKVPQQ++  +CG FVLYFINLF+E APENF + EG P+FM ++WFN+E +E F EK+ 
Sbjct: 197 VPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVECFCEKLD 255

Query: 887 TF 892
           +F
Sbjct: 256 SF 257


>ref|XP_007037883.1| Cysteine proteinases superfamily protein, putative isoform 2
           [Theobroma cacao] gi|508775128|gb|EOY22384.1| Cysteine
           proteinases superfamily protein, putative isoform 2
           [Theobroma cacao]
          Length = 277

 Score =  277 bits (709), Expect = 8e-72
 Identities = 134/242 (55%), Positives = 177/242 (73%)
 Frame = +2

Query: 167 RLARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLW 346
           +++++++Q++++  LT  CFL   P R+RSK R  ++  I K   +LDS  F+CY+E LW
Sbjct: 48  KISKQEAQKLRDFRLTAPCFLGNIPCRQRSKRRVKSKNSISKQTNRLDSGAFECYMEKLW 107

Query: 347 KNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLI 526
            +F EEKRTSFAY DC WFA YRK + + KVL WIK + IFS+KYV VP+VC        
Sbjct: 108 SSFPEEKRTSFAYFDCQWFAWYRKASFREKVLSWIKREQIFSKKYVLVPVVCC------- 160

Query: 527 LCHLGESSKSKTRPCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLL 706
                  S++KT PCMLLLDSLE+ANP+R+EPDIRKFVLDIYR E RPE+ ++I  IPLL
Sbjct: 161 -----LQSETKT-PCMLLLDSLEIANPRRLEPDIRKFVLDIYRAEGRPEKKEMIYRIPLL 214

Query: 707 VPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKVH 886
           VPKVPQQ++  +CG FVLYFINLF+E APENF + EG P+FM ++WFN+E +E F EK+ 
Sbjct: 215 VPKVPQQRDGEECGKFVLYFINLFVEGAPENFSI-EGYPYFMRKDWFNAEGVECFCEKLD 273

Query: 887 TF 892
           +F
Sbjct: 274 SF 275


>ref|XP_007210601.1| hypothetical protein PRUPE_ppa017098mg [Prunus persica]
           gi|462406336|gb|EMJ11800.1| hypothetical protein
           PRUPE_ppa017098mg [Prunus persica]
          Length = 303

 Score =  273 bits (697), Expect = 2e-70
 Identities = 133/243 (54%), Positives = 170/243 (69%), Gaps = 14/243 (5%)
 Frame = +2

Query: 233 KFPRRERSKMRT--NNRKPIPKLPQKLDSNMFQC-----------YLENLWKNFSEEKRT 373
           K  R E  ++R   + +  + +  +KLDS  F+C           Y +NLWKN SE+KRT
Sbjct: 68  KGKREEMKELRPPKDAKNAVSRKKEKLDSQAFECKEKLGSEAFDRYFQNLWKNLSEDKRT 127

Query: 374 SFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLILCHLGESSK 553
           SFAY+DC+WF+LY + +++ KVL WIK KHIFS+KYV VPIVCW HW+LLI CH GES +
Sbjct: 128 SFAYLDCMWFSLYLQPSSRDKVLTWIKKKHIFSKKYVIVPIVCWGHWNLLIFCHFGESEQ 187

Query: 554 SKT-RPCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLLVPKVPQQK 730
           S+T +PCMLLLDSLE A+P+R EPDIRKFVLDIY  E R E    I  IP LVPKVPQQ+
Sbjct: 188 SETHKPCMLLLDSLENADPRRYEPDIRKFVLDIYEAEGRSETKDFIYRIPFLVPKVPQQR 247

Query: 731 NNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKVHTFCRRMNN 910
           N+ +CG FVLY+INLF+E APENF +  G P+FM +NWF  E LE        FC+++ +
Sbjct: 248 NDVECGNFVLYYINLFIEGAPENFSIEGGYPYFMKKNWFTPEGLE-------CFCQQLYS 300

Query: 911 SSQ 919
           SS+
Sbjct: 301 SSE 303


>gb|AFK37750.1| unknown [Lotus japonicus]
          Length = 284

 Score =  266 bits (681), Expect = 1e-68
 Identities = 131/227 (57%), Positives = 163/227 (71%), Gaps = 1/227 (0%)
 Frame = +2

Query: 212 TLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLWKNFSEEKRTSFAYVD 391
           +L  +L   PRR R+K +    +     P KLDS +F   L  +W +FSE+KR  FAY D
Sbjct: 58  SLPSYLSDIPRRPRTKKKKFKAEEALPRP-KLDSGVFDNNLVKIWNSFSEDKRKPFAYFD 116

Query: 392 CLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLILCHLGESSKSKTRP- 568
            LWF+LYR  ++K KVL WIK +HIFS+ YVFVPIVCW HWSLLI CH GES +S TR  
Sbjct: 117 SLWFSLYRAASSKDKVLTWIKKEHIFSKAYVFVPIVCWGHWSLLIFCHFGESLQSTTRSR 176

Query: 569 CMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLLVPKVPQQKNNNDCG 748
           CMLLLDSLEM NP+R+EPDIR+FV+DIY+   RPE   LI  IPLLVPKVPQQ++ N+CG
Sbjct: 177 CMLLLDSLEMVNPRRLEPDIRRFVVDIYKAWDRPETKNLIYQIPLLVPKVPQQRDGNECG 236

Query: 749 VFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKVHT 889
            FVLYFINLFL  APENF +  G P+FM ++WF  E  +RF E++++
Sbjct: 237 NFVLYFINLFLRCAPENFSMG-GYPYFMKKDWFTFEDFDRFCERLYS 282


>ref|XP_006366658.1| PREDICTED: probable ubiquitin-like-specific protease 2B-like
           [Solanum tuberosum]
          Length = 427

 Score =  265 bits (676), Expect = 5e-68
 Identities = 130/242 (53%), Positives = 171/242 (70%), Gaps = 1/242 (0%)
 Frame = +2

Query: 173 ARKKSQEMQNMGLTLTCFLEKFPRRERSKMRTNNRKPIPKLPQKLDSNMFQCYLENLWKN 352
           +RK+S+       T +  + +   R   + R NN +      + L S+ F+ YLE++WK 
Sbjct: 171 SRKRSKSKITADSTDSEVIPQRASRCHGQSRRNNSQ------KGLGSSKFELYLESIWKL 224

Query: 353 FSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLILC 532
             E++R +F+Y+D LWF+LY + + KAKVL WI  K IFS++YVFVPIV W HWSLLI C
Sbjct: 225 HPEDRRNTFSYLDSLWFSLYSERSHKAKVLNWIAKKKIFSKEYVFVPIVLWGHWSLLIFC 284

Query: 533 HLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLLV 709
           HLGES +SK R PCMLLLDSL MANP+R +P IRKFV+D+++ E RPE    I  IPL++
Sbjct: 285 HLGESLQSKERSPCMLLLDSLHMANPERFDPGIRKFVVDLFKAEQRPETKDQIMKIPLMI 344

Query: 710 PKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKVHT 889
           PKVPQQ+N+ DCG FVLY+INLFLESAPENF +S+G P+FM E+WF  E LE F +KV +
Sbjct: 345 PKVPQQRNDEDCGNFVLYYINLFLESAPENFSISKGYPYFMTEDWFTPERLECFLQKVQS 404

Query: 890 FC 895
            C
Sbjct: 405 TC 406


>ref|XP_007137950.1| hypothetical protein PHAVU_009G168700g [Phaseolus vulgaris]
           gi|561011037|gb|ESW09944.1| hypothetical protein
           PHAVU_009G168700g [Phaseolus vulgaris]
          Length = 268

 Score =  263 bits (672), Expect = 1e-67
 Identities = 123/221 (55%), Positives = 165/221 (74%), Gaps = 4/221 (1%)
 Frame = +2

Query: 239 PRRERSKMRTNNRKP---IPKLPQKLDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFAL 409
           P +E S+ +    +P   + ++ +KLDS +F  +L+ +WK F E+++  F Y D LWF+L
Sbjct: 47  PNKEHSRPKEAPCRPKETLSRIKEKLDSGIFDTFLKKIWKIFPEDRKGQFTYFDSLWFSL 106

Query: 410 YRKGATKAKVLKWIKGKHIFSRKYVFVPIVCWDHWSLLILCHLGESSKSKTRP-CMLLLD 586
           YR  ++K KVL WIK + IFS+ YVFVPIVCW HWSLLILCH GES +S TR  CMLLLD
Sbjct: 107 YRSASSKDKVLAWIKREPIFSKAYVFVPIVCWGHWSLLILCHFGESLQSSTRSRCMLLLD 166

Query: 587 SLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYF 766
           SLEMANP+R+EP+IR+FVLDIY+   RPE   ++  IP LVPKVPQQ++ N+CG FVLYF
Sbjct: 167 SLEMANPRRLEPEIRRFVLDIYKSGDRPETKNILSQIPFLVPKVPQQRDGNECGFFVLYF 226

Query: 767 INLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKVHT 889
           INLFLE AP+NF + EG P+FM ++WF+ + L+RF E +++
Sbjct: 227 INLFLEHAPDNFSM-EGYPYFMTKDWFSFDGLDRFHEGLNS 266


>ref|XP_006852167.1| hypothetical protein AMTR_s00049p00094540 [Amborella trichopoda]
           gi|548855771|gb|ERN13634.1| hypothetical protein
           AMTR_s00049p00094540 [Amborella trichopoda]
          Length = 319

 Score =  261 bits (668), Expect = 4e-67
 Identities = 119/207 (57%), Positives = 155/207 (74%), Gaps = 1/207 (0%)
 Frame = +2

Query: 284 IPKLPQKLDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKH 463
           + KL  K+D+N+F+ YLE LWK   E+K+ S  Y+DCLWF LY  G++  KVL W++ KH
Sbjct: 91  LSKLQHKIDTNIFEFYLETLWKKLPEDKQRSCTYLDCLWFHLYGVGSSSTKVLDWVRRKH 150

Query: 464 IFSRKYVFVPIVCWDHWSLLILCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFV 640
           IFSRKYVFVPI+ W HWSLLILCHLGE   SK R PC+LLLDSL MA P+R+EPDIRKFV
Sbjct: 151 IFSRKYVFVPIIRWRHWSLLILCHLGEDLDSKERTPCLLLLDSLRMAEPRRLEPDIRKFV 210

Query: 641 LDIYREEARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGC 820
            DIY+ E   E  +++  IPLLVPKVPQQ++   CG+FVL FI+LFL++APENF   +G 
Sbjct: 211 WDIYKSEGGKESKEIVSRIPLLVPKVPQQRDEKQCGMFVLQFIDLFLQNAPENFCPFKGY 270

Query: 821 PHFMNENWFNSESLERFFEKVHTFCRR 901
           P+F+ E+WF+ + +E F + +H+F  R
Sbjct: 271 PYFLKEDWFDPKDIESFCKDIHSFSLR 297


>ref|XP_004501822.1| PREDICTED: probable ubiquitin-like-specific protease 2A-like [Cicer
           arietinum]
          Length = 385

 Score =  259 bits (663), Expect = 2e-66
 Identities = 130/219 (59%), Positives = 161/219 (73%), Gaps = 4/219 (1%)
 Frame = +2

Query: 239 PRRERSKMRTN-NRKPIPKLP-QKLDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFALY 412
           PRR R+K +   N    P  P +KL+S +F  YL  +WK+FSE+++ SFAY+D LWF+LY
Sbjct: 166 PRRPRTKSKRKFNGNEAPSRPKEKLNSEVFDNYLAKIWKSFSEDRKRSFAYLDSLWFSLY 225

Query: 413 RKGATKAKVLKWIKGK-HIFSRKYVFVPIVCWDHWSLLILCHLGESSKSKTRP-CMLLLD 586
           R  ++K KVL WIK K HIF++ YVFVPIVCW HWSLLILCH GE  +  T   CMLLLD
Sbjct: 226 RNASSKDKVLNWIKKKEHIFTKAYVFVPIVCWGHWSLLILCHFGEDLQLVTGSRCMLLLD 285

Query: 587 SLEMANPKRIEPDIRKFVLDIYREEARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYF 766
           SLEMA+P+R+EP+IR+FV DIY+   RPE   LI  IPLLVPKVPQQK+  DCG FVLYF
Sbjct: 286 SLEMADPRRLEPEIRRFVQDIYKAGDRPETKHLISKIPLLVPKVPQQKDGTDCGNFVLYF 345

Query: 767 INLFLESAPENFKVSEGCPHFMNENWFNSESLERFFEKV 883
           I LFLE AP+NF + EG P+FM ++WF  E L+RF E +
Sbjct: 346 IKLFLELAPKNFSI-EGYPYFMKKDWFTFEDLDRFCENL 383


>ref|XP_004253246.1| PREDICTED: uncharacterized protein LOC101254774 [Solanum
           lycopersicum]
          Length = 460

 Score =  259 bits (662), Expect = 2e-66
 Identities = 121/206 (58%), Positives = 156/206 (75%), Gaps = 1/206 (0%)
 Frame = +2

Query: 305 LDSNMFQCYLENLWKNFSEEKRTSFAYVDCLWFALYRKGATKAKVLKWIKGKHIFSRKYV 484
           L S+ F+ YLE++WK   E++R +F Y+D LWF+LY + + KAKVL WI  K IFS++YV
Sbjct: 242 LGSSKFELYLESIWKLHPEDRRNTFTYLDSLWFSLYSERSHKAKVLNWIAKKKIFSKEYV 301

Query: 485 FVPIVCWDHWSLLILCHLGESSKSKTR-PCMLLLDSLEMANPKRIEPDIRKFVLDIYREE 661
           FVPIV W HWSLLI CHLGES +SK R PCMLLLDSL MANP+R +P IRKFV+D+++ E
Sbjct: 302 FVPIVLWGHWSLLIFCHLGESLQSKERSPCMLLLDSLHMANPERFDPGIRKFVIDLFKAE 361

Query: 662 ARPEENKLIESIPLLVPKVPQQKNNNDCGVFVLYFINLFLESAPENFKVSEGCPHFMNEN 841
            RPE    I  IPL++PKVPQQ+N+ DCG FVLY+INLFLESAPENF +S+G P+FM E+
Sbjct: 362 QRPETKDQIMKIPLMIPKVPQQQNDEDCGNFVLYYINLFLESAPENFSISKGYPYFMTED 421

Query: 842 WFNSESLERFFEKVHTFCRRMNNSSQ 919
           WF  E LE F ++V +     ++S +
Sbjct: 422 WFTPERLECFLQEVQSASGSTSDSDE 447

