BLASTX nr result

ID: Mentha25_contig00026193 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00026193
         (703 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253...   278   1e-72
ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606...   277   3e-72
gb|EYU38064.1| hypothetical protein MIMGU_mgv1a011222mg [Mimulus...   274   2e-71
ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Popu...   267   2e-69
ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Popu...   260   3e-67
ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prun...   257   2e-66
ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3...   256   5e-66
ref|XP_002267087.2| PREDICTED: uncharacterized protein LOC100245...   251   2e-64
ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793...   251   2e-64
emb|CBI40221.3| unnamed protein product [Vitis vinifera]              251   2e-64
ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3...   251   2e-64
ref|XP_007010219.1| Cysteine proteinases superfamily protein iso...   249   8e-64
ref|XP_004307032.1| PREDICTED: OTU domain-containing protein At3...   246   4e-63
ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phas...   246   7e-63
ref|XP_007010220.1| Cysteine proteinases superfamily protein iso...   243   3e-62
dbj|BAE71258.1| hypothetical protein [Trifolium pratense]             243   4e-62
ref|XP_006436685.1| hypothetical protein CICLE_v10032126mg [Citr...   240   3e-61
ref|XP_004496177.1| PREDICTED: OTU domain-containing protein At3...   237   3e-60
gb|EXC25419.1| hypothetical protein L484_016802 [Morus notabilis]     236   7e-60
ref|XP_002315401.2| hypothetical protein POPTR_0010s24050g [Popu...   231   1e-58

>ref|XP_004250001.1| PREDICTED: uncharacterized protein LOC101253339 [Solanum
           lycopersicum]
          Length = 338

 Score =  278 bits (712), Expect = 1e-72
 Identities = 146/247 (59%), Positives = 170/247 (68%), Gaps = 34/247 (13%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAA----AVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDS 525
           GAAS+WH ILP+  R + +       VF +H  E  K  EGSWNV WD+RPARWLHN DS
Sbjct: 81  GAASIWHAILPAGRRNKKDINRRNNTVF-KHHYELAKKGEGSWNVNWDSRPARWLHNPDS 139

Query: 524 AWLLFGVSA--GAPPVDLDPDSNSEVLVATD-----------DSKINNYRVIGVTADGRC 384
           AWLLFGV +   AP +DL PD+NS+V V  D           D    NYRV GV ADGRC
Sbjct: 140 AWLLFGVCSCLAAPSLDLLPDANSDVAVPIDKQSAVNSSDEDDQNSANYRVTGVPADGRC 199

Query: 383 LFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYVKRI 204
           LFRAIAHMACLR GE APDE RQ+ELADELRAQVV+ELLKRRKE EWFI+ +FD YV+RI
Sbjct: 200 LFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRRKEAEWFIEGDFDAYVERI 259

Query: 203 EQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINVLFYGY 75
           E+PY+WGGEPELLM SHVL++                   YG+EY+ E  + INVLF+GY
Sbjct: 260 EKPYVWGGEPELLMASHVLKSAISVYMVDRSSGSLINISNYGEEYRKEGESPINVLFHGY 319

Query: 74  GHYDVLE 54
           GHYD+LE
Sbjct: 320 GHYDILE 326


>ref|XP_006360486.1| PREDICTED: uncharacterized protein LOC102606023 isoform X1 [Solanum
           tuberosum]
          Length = 338

 Score =  277 bits (708), Expect = 3e-72
 Identities = 145/250 (58%), Positives = 170/250 (68%), Gaps = 34/250 (13%)
 Frame = -3

Query: 701 KSSGAASVWHTILPSYWRQRPEAA----AVFGRHESEPVKHAEGSWNVAWDARPARWLHN 534
           +  GAAS+WH ILP+  R + +       VF +H  E  K  EGSWNV WD+RPARWLHN
Sbjct: 78  RGGGAASIWHAILPAGRRNKKDINRRNNTVF-KHHYELAKKGEGSWNVNWDSRPARWLHN 136

Query: 533 SDSAWLLFGVSA--GAPPVDLDPDSNSEVLVATD-----------DSKINNYRVIGVTAD 393
            DSAWLLFGV +   AP +DL PD+N +V V  D           D    NYRV GV AD
Sbjct: 137 PDSAWLLFGVCSCLAAPSLDLLPDANFDVAVPIDKQSVVNSSDEDDQNSANYRVTGVPAD 196

Query: 392 GRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYV 213
           GRCLFRAIAHMACLR GE APDE RQ+ELADELRAQVV+ELLKRRKE EWFI+ +FD YV
Sbjct: 197 GRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRRKEAEWFIEGDFDAYV 256

Query: 212 KRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINVLF 84
           +RIE+PY+WGGEPELLM SHVL++                   YG+EY+ E  + INVLF
Sbjct: 257 ERIEKPYVWGGEPELLMASHVLKSSISVYMVDRSSGSLINISNYGEEYRKEGESPINVLF 316

Query: 83  YGYGHYDVLE 54
           +GYGHYD+LE
Sbjct: 317 HGYGHYDILE 326


>gb|EYU38064.1| hypothetical protein MIMGU_mgv1a011222mg [Mimulus guttatus]
          Length = 288

 Score =  274 bits (700), Expect = 2e-71
 Identities = 148/243 (60%), Positives = 171/243 (70%), Gaps = 26/243 (10%)
 Frame = -3

Query: 701 KSSGAASVWHTILPSYWRQRPEAAAVFGRHESEPV-KHAEGSWNVAWDARPARWLHNSDS 525
           K   AASVWHTILP   R+R   AAV GRHE+E V K  EGSWN AWD+RPARWLH++DS
Sbjct: 47  KCRAAASVWHTILPCR-RRRRRNAAVLGRHENEAVVKRGEGSWNAAWDSRPARWLHHTDS 105

Query: 524 AWLLFGV-------SAGAPPVDLDPDSNSEVLVATDDSKINNYRVIGVTADGRCLFRAIA 366
           AW LFGV       +A AP +D   DSN EVL    DS  +NYRV GVTADGRCLFRAIA
Sbjct: 106 AWFLFGVCATLASAAAAAPAIDSPCDSNPEVLSLKTDSS-SNYRVRGVTADGRCLFRAIA 164

Query: 365 HMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVE-WFIDEEFDMYVKRIEQPYI 189
           HM CLR GE APDE  Q+ELADELRAQVVEE+LKRRKE+  +F++EEFD YV+ I QPY+
Sbjct: 165 HMVCLRNGENAPDENHQRELADELRAQVVEEMLKRRKELAGFFLEEEFDGYVENIRQPYV 224

Query: 188 WGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINVLFYGYGHYDV 60
           WGGE ELLM SHVLR                    YG+EYK +  N+I+VLF+ YGHY++
Sbjct: 225 WGGEHELLMASHVLRTPISVFEEKRGSNSLINKANYGEEYKRDGENAISVLFHDYGHYEI 284

Query: 59  LEA 51
           LEA
Sbjct: 285 LEA 287


>ref|XP_002316423.1| hypothetical protein POPTR_0010s24050g [Populus trichocarpa]
           gi|222865463|gb|EEF02594.1| hypothetical protein
           POPTR_0010s24050g [Populus trichocarpa]
          Length = 318

 Score =  267 bits (683), Expect = 2e-69
 Identities = 140/256 (54%), Positives = 169/256 (66%), Gaps = 41/256 (16%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAA++WH I P+ WR+R E  +V G          EGSWN AWD RPARWLH  DSAWLL
Sbjct: 63  GAAAIWHVIQPADWRRRTERRSVRG----------EGSWNAAWDGRPARWLHRPDSAWLL 112

Query: 512 FGVSAG-APPVDLDPDSNS---------------EVLVATDDSKINN--------YRVIG 405
           FGV A  AP ++   D N+               ++  ++DD+K +N        Y+V G
Sbjct: 113 FGVCACLAPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDAKQDNSDATVGSDYKVTG 172

Query: 404 VTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEF 225
           V ADGRCLFRAIAHMACLR GE APDE RQ+ELADELRAQVV+ELLKRR+E EWFI+ +F
Sbjct: 173 VLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDF 232

Query: 224 DMYVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSI 96
           D YVKRI+QPY+WGGEPELLM SHVL+                    YG+EY+ +EVN I
Sbjct: 233 DAYVKRIQQPYVWGGEPELLMASHVLKTMISVFMRDRTTGNLVNIVNYGEEYQKDEVNPI 292

Query: 95  NVLFYGYGHYDVLEAS 48
           NVLF+GYGHYD+LE +
Sbjct: 293 NVLFHGYGHYDILETT 308


>ref|XP_002311041.1| hypothetical protein POPTR_0008s02620g [Populus trichocarpa]
           gi|222850861|gb|EEE88408.1| hypothetical protein
           POPTR_0008s02620g [Populus trichocarpa]
          Length = 326

 Score =  260 bits (665), Expect = 3e-67
 Identities = 141/265 (53%), Positives = 167/265 (63%), Gaps = 50/265 (18%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAA++WH + P+ WR+R    +V G          EGSWNVAWD RPARWLH  DSAWLL
Sbjct: 62  GAAAIWHVVQPADWRRRRGRRSVRG----------EGSWNVAWDGRPARWLHRPDSAWLL 111

Query: 512 FGVSAG-APPVDLDPDSNSE------------------------VLVATDDSKINN---- 420
           FGV A  AP ++L  D N E                          V +DD K ++    
Sbjct: 112 FGVCACLAPAIELFCDVNIEGGENVVVDVDHQEKERIDGGDLNASAVNSDDVKQDSSSST 171

Query: 419 ----YRVIGVTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKE 252
               Y+V GV ADGRCLFRAIAHMACLR GE APDE RQ+ELADELRAQVV+ELLKRR+E
Sbjct: 172 AGSDYKVTGVLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREE 231

Query: 251 VEWFIDEEFDMYVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDE 123
            EWFI+ +FD YVKRI+QPY+WGGEPELLM SHVL+                    YG+E
Sbjct: 232 TEWFIEGDFDAYVKRIQQPYVWGGEPELLMASHVLKTMISVFMRDRTTGNLVNIANYGEE 291

Query: 122 YKNEEVNSINVLFYGYGHYDVLEAS 48
           Y+ +EVN INVLF+GYGHYD+LE +
Sbjct: 292 YRKDEVNPINVLFHGYGHYDILETT 316


>ref|XP_007220473.1| hypothetical protein PRUPE_ppa008484mg [Prunus persica]
           gi|462416935|gb|EMJ21672.1| hypothetical protein
           PRUPE_ppa008484mg [Prunus persica]
          Length = 329

 Score =  257 bits (657), Expect = 2e-66
 Identities = 136/255 (53%), Positives = 163/255 (63%), Gaps = 40/255 (15%)
 Frame = -3

Query: 695 SGAASVWHTILPSYWRQRPEAAAVFGRHESEPVKH----AEGSWNVAWDARPARWLHNSD 528
           +GAAS+WH +LPS   +R        R    P  H     EGSWN AWDARPARWLH  D
Sbjct: 71  TGAASIWHALLPSSCNRR-------SRDLRRPAIHYELKGEGSWNAAWDARPARWLHRPD 123

Query: 527 SAWLLFGVSAGAPPVDLDPDS----------------NSEVLVATDDSKINN---YRVIG 405
           SAWLLFGV     P+D   DS                +S+   A D + I++   YRV G
Sbjct: 124 SAWLLFGVCNCLAPIDWADDSTPDGNDGVSNENAESFDSKCSAAPDQNNIDSSADYRVTG 183

Query: 404 VTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEF 225
           V ADGRCLFRAIAH+ACLR GE APDE RQ++LADELRAQVV+ELLKRR+E EWFI+ +F
Sbjct: 184 VPADGRCLFRAIAHVACLRNGEEAPDENRQRDLADELRAQVVDELLKRREETEWFIEGDF 243

Query: 224 DMYVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSI 96
           D YVKR++QPY+WGGEPELLM SHVL+                    YG+EY+ EE   I
Sbjct: 244 DAYVKRLQQPYVWGGEPELLMASHVLKTPISVFMIDRSSAGLVNIANYGEEYRKEEEKPI 303

Query: 95  NVLFYGYGHYDVLEA 51
           NVLF+GYGHYD+L++
Sbjct: 304 NVLFHGYGHYDILDS 318


>ref|XP_003556279.1| PREDICTED: OTU domain-containing protein At3g57810-like [Glycine
           max]
          Length = 294

 Score =  256 bits (654), Expect = 5e-66
 Identities = 133/245 (54%), Positives = 164/245 (66%), Gaps = 30/245 (12%)
 Frame = -3

Query: 698 SSGAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAW 519
           + GAAS+WH I+P           V   H+ +     EGSWNVAWDARPARWLH  DSAW
Sbjct: 52  AGGAASIWHAIMPRVNDDDGFRRGVVAFHDMK----GEGSWNVAWDARPARWLHRPDSAW 107

Query: 518 LLFGVSAG-APPVD-LDPDSNSEVLVATDDSKI-----------NNYRVIGVTADGRCLF 378
           LLFGV A  APP   +D D+N++ +   +  ++            +YRV GV ADGRCLF
Sbjct: 108 LLFGVCACLAPPSSCVDADTNTDAIAVDESCRLLDKEREEYEVSADYRVTGVPADGRCLF 167

Query: 377 RAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYVKRIEQ 198
           RAIAH ACLR GE APDE RQ+ELADELRA+VV+EL+KRR+E EWFI+ +FD YV+RI+Q
Sbjct: 168 RAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYVQRIQQ 227

Query: 197 PYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINVLFYGYGH 69
           PY+WGGEPELLM SHVL+                    YG+EY+N++  SINVLF+GYGH
Sbjct: 228 PYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEEYRNDKEISINVLFHGYGH 287

Query: 68  YDVLE 54
           YD+LE
Sbjct: 288 YDILE 292


>ref|XP_002267087.2| PREDICTED: uncharacterized protein LOC100245448 [Vitis vinifera]
          Length = 380

 Score =  251 bits (641), Expect = 2e-64
 Identities = 140/246 (56%), Positives = 160/246 (65%), Gaps = 33/246 (13%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAAS+WH ILPS   +R         H+ +     EGSWNVAWDARPARWLH  DSAWLL
Sbjct: 130 GAASIWHAILPSGGDRRSSLRPAL-LHDQK----GEGSWNVAWDARPARWLHRPDSAWLL 184

Query: 512 FGVSAGAPPVDLDPDSNSEVLVATDD-----------SKINN-----YRVIGVTADGRCL 381
           FGV A   P+D   D ++EV VA DD           S  NN     YRV GV ADGRCL
Sbjct: 185 FGVCACLAPLD-SFDVDNEV-VAVDDKIEGCNQVNEISDENNNSSADYRVTGVPADGRCL 242

Query: 380 FRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYVKRIE 201
           FRAIAH ACLR GE APDE RQ ELAD+LRAQVV+ELLKRR+E EWFI+  FD YVKRI+
Sbjct: 243 FRAIAHSACLRSGEEAPDENRQTELADDLRAQVVDELLKRREETEWFIEGNFDAYVKRIQ 302

Query: 200 QPYIWGGEPELLMCSHVLR-----------------AXXYGDEYKNEEVNSINVLFYGYG 72
           QPY+WGGEPEL+M SHVL+                    YG EY+ +  + INVLF+GYG
Sbjct: 303 QPYVWGGEPELIMASHVLKMPISVFMIGRSSGDLKNIANYGKEYRIDNESPINVLFHGYG 362

Query: 71  HYDVLE 54
           HYD+LE
Sbjct: 363 HYDILE 368


>ref|XP_003536306.1| PREDICTED: uncharacterized protein LOC100793001 [Glycine max]
          Length = 296

 Score =  251 bits (641), Expect = 2e-64
 Identities = 135/245 (55%), Positives = 161/245 (65%), Gaps = 30/245 (12%)
 Frame = -3

Query: 689 AASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLLF 510
           AAS+WH I+P     R +     G      +K  EGSWNVAWDARPARWLH  DSAWLLF
Sbjct: 57  AASIWHAIMP-----RGDDGLRRGVVAVHDLK-GEGSWNVAWDARPARWLHRPDSAWLLF 110

Query: 509 GVSA--GAPPVDLDPDSNSEVLVATD-----------DSKINNYRVIGVTADGRCLFRAI 369
           GV A    PP  +D D+NS  +   +           D    +YRV GV ADGRCLFRAI
Sbjct: 111 GVCACLAPPPGCVDADTNSAGIAVDESCGLLDKEREEDEVSADYRVTGVPADGRCLFRAI 170

Query: 368 AHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYVKRIEQPYI 189
           AH ACLR GE APDE RQ+ELADELRA+VV+ELLKRR+E EWFI+ +FD Y++RI+QPY+
Sbjct: 171 AHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQPYV 230

Query: 188 WGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINVLFYGYGHYDV 60
           WGGEPELLM SHVL+                    YG+EY+N++  SINVLF+GYGHYD+
Sbjct: 231 WGGEPELLMASHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFHGYGHYDI 290

Query: 59  LEASR 45
           LE  R
Sbjct: 291 LETLR 295


>emb|CBI40221.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score =  251 bits (641), Expect = 2e-64
 Identities = 140/246 (56%), Positives = 160/246 (65%), Gaps = 33/246 (13%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAAS+WH ILPS   +R         H+ +     EGSWNVAWDARPARWLH  DSAWLL
Sbjct: 67  GAASIWHAILPSGGDRRSSLRPAL-LHDQK----GEGSWNVAWDARPARWLHRPDSAWLL 121

Query: 512 FGVSAGAPPVDLDPDSNSEVLVATDD-----------SKINN-----YRVIGVTADGRCL 381
           FGV A   P+D   D ++EV VA DD           S  NN     YRV GV ADGRCL
Sbjct: 122 FGVCACLAPLD-SFDVDNEV-VAVDDKIEGCNQVNEISDENNNSSADYRVTGVPADGRCL 179

Query: 380 FRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYVKRIE 201
           FRAIAH ACLR GE APDE RQ ELAD+LRAQVV+ELLKRR+E EWFI+  FD YVKRI+
Sbjct: 180 FRAIAHSACLRSGEEAPDENRQTELADDLRAQVVDELLKRREETEWFIEGNFDAYVKRIQ 239

Query: 200 QPYIWGGEPELLMCSHVLR-----------------AXXYGDEYKNEEVNSINVLFYGYG 72
           QPY+WGGEPEL+M SHVL+                    YG EY+ +  + INVLF+GYG
Sbjct: 240 QPYVWGGEPELIMASHVLKMPISVFMIGRSSGDLKNIANYGKEYRIDNESPINVLFHGYG 299

Query: 71  HYDVLE 54
           HYD+LE
Sbjct: 300 HYDILE 305


>ref|XP_004142455.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis
           sativus] gi|449520841|ref|XP_004167441.1| PREDICTED: OTU
           domain-containing protein At3g57810-like [Cucumis
           sativus]
          Length = 313

 Score =  251 bits (640), Expect = 2e-64
 Identities = 135/251 (53%), Positives = 159/251 (63%), Gaps = 36/251 (14%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHA-----EGSWNVAWDARPARWLHNSD 528
           GAAS+WH I+PS         A    +   P  H      EGSWNVAWDARPARWLH  D
Sbjct: 61  GAASIWHAIMPS--------GAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPD 112

Query: 527 SAWLLFGVSAGAPPVD--------LDPDSNSEVLVAT------DDSKINNYRVIGVTADG 390
           SAWLLFGV A   P+D        +  D   EV  ++      +D    +YRV GV ADG
Sbjct: 113 SAWLLFGVCACIAPLDWVDASHEAVSLDQKKEVCESSGPEFNQNDESSADYRVTGVLADG 172

Query: 389 RCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYVK 210
           RCLFRAIAH ACLR GE APD+ RQ+ELADELRA+VV+ELLKRRKE EW+I+ +FD YVK
Sbjct: 173 RCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDAYVK 232

Query: 209 RIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINVLFY 81
           RI+QP++WGGEPELLM SHVL+                    YG EY+  E + INVLF+
Sbjct: 233 RIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKGEESPINVLFH 292

Query: 80  GYGHYDVLEAS 48
           GYGHYD+LE S
Sbjct: 293 GYGHYDILETS 303


>ref|XP_007010219.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma
           cacao] gi|508727132|gb|EOY19029.1| Cysteine proteinases
           superfamily protein isoform 1 [Theobroma cacao]
          Length = 327

 Score =  249 bits (635), Expect = 8e-64
 Identities = 136/257 (52%), Positives = 162/257 (63%), Gaps = 43/257 (16%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAE----GSWNVAWDARPARWLHNSDS 525
           GAAS+WH ILP             GR   E  K+ E    GSWNVAWDARPARWLH  DS
Sbjct: 67  GAASIWHAILPC-------GGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 524 AWLLFGVSAGAPP----VDLDPDSNSEV----------LVATDDSK--------INNYRV 411
           AWLLFGV A   P    VD++PD++ ++          L A + S          +N +V
Sbjct: 120 AWLLFGVCACLAPMIEFVDVNPDADDKIEGAELNLVSRLSADEKSSSSSSSVAAADNCKV 179

Query: 410 IGVTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDE 231
            GV ADGRCLFRAIAH ACLR GE APDE  Q+ELADELRAQVV ELLKRR+E EWFI+ 
Sbjct: 180 TGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRREETEWFIEG 239

Query: 230 EFDMYVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVN 102
           +FD YVK I+QPY+WGGEPE+LM SHVL+                    YG+EY+ ++ N
Sbjct: 240 DFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGEEYQKDKEN 299

Query: 101 SINVLFYGYGHYDVLEA 51
            INVLF+GYGHYD+LE+
Sbjct: 300 PINVLFHGYGHYDILES 316


>ref|XP_004307032.1| PREDICTED: OTU domain-containing protein At3g57810-like [Fragaria
           vesca subsp. vesca]
          Length = 324

 Score =  246 bits (629), Expect = 4e-63
 Identities = 136/253 (53%), Positives = 155/253 (61%), Gaps = 40/253 (15%)
 Frame = -3

Query: 689 AASVWHTILPS--YWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWL 516
           AAS+WH ILPS   WR+R        R         EGSWN A DARPARWLH  DSAWL
Sbjct: 70  AASIWHAILPSSGLWRRRD-----LRRPAIHYELKGEGSWNAALDARPARWLHRPDSAWL 124

Query: 515 LFGVSAGAPPVDLDPDSNSEVLVATDDSKINN---------------------YRVIGVT 399
           LFGV     P+D    +NS     T+D   NN                     YRV GV 
Sbjct: 125 LFGVCNCLAPIDWGSTTNS----TTNDEVSNNKTEACDSKSSITSDVQLETPDYRVTGVL 180

Query: 398 ADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDM 219
           ADGRCLFRAIAH+ACLR GE  PDE RQ+ELADELRAQVV+ELLKRR+E EWFI+ +FD 
Sbjct: 181 ADGRCLFRAIAHVACLRNGEEPPDENRQRELADELRAQVVDELLKRREETEWFIEGDFDA 240

Query: 218 YVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINV 90
           YVKRI+QPY+WGGEPELLM SHV +A                   YG+EY  +E   INV
Sbjct: 241 YVKRIQQPYVWGGEPELLMASHVKKAPISVYMVDRSSGGLVNIAKYGEEYGKQEEKPINV 300

Query: 89  LFYGYGHYDVLEA 51
           LF+GYGHYD+LE+
Sbjct: 301 LFHGYGHYDILES 313


>ref|XP_007143828.1| hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris]
           gi|561017018|gb|ESW15822.1| hypothetical protein
           PHAVU_007G105100g [Phaseolus vulgaris]
          Length = 305

 Score =  246 bits (627), Expect = 7e-63
 Identities = 140/245 (57%), Positives = 166/245 (67%), Gaps = 30/245 (12%)
 Frame = -3

Query: 698 SSGAASVWHTILP-SYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSA 522
           + GAAS+WH I+P S  R R     V   H+ +     EGSWNVAWD RPARWLH  DSA
Sbjct: 67  AGGAASIWHAIMPRSGDRFRRGVVPV---HDLK----GEGSWNVAWDTRPARWLHRPDSA 119

Query: 521 WLLFGVSAG-APPVDLDPDSNSEVLVATDDS----KIN------NYRVIGVTADGRCLFR 375
           WLLFGV A  APP  +D  ++ E  VA D+S    K+       +YRV GV ADGRCLFR
Sbjct: 120 WLLFGVCACLAPPGCVDVVTDFEA-VAVDESCGVLKVEASADYADYRVTGVPADGRCLFR 178

Query: 374 AIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMYVKRIEQP 195
           AIAH  CLR GE APDE  Q+ELADELRA+VV+ELLKRR+E EWFI+ +FD YVKRI+QP
Sbjct: 179 AIAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKRIQQP 238

Query: 194 YIWGGEPELLMCSHVLRA-----------------XXYGDEYKNE-EVNSINVLFYGYGH 69
           ++WGGEPELLM SHVL+                    YG+EY+N+ E NSINVLF+GYGH
Sbjct: 239 FVWGGEPELLMASHVLKTPISVFMRATGSVGLVNIAKYGEEYRNDKEENSINVLFHGYGH 298

Query: 68  YDVLE 54
           YD+LE
Sbjct: 299 YDILE 303


>ref|XP_007010220.1| Cysteine proteinases superfamily protein isoform 2 [Theobroma
           cacao] gi|508727133|gb|EOY19030.1| Cysteine proteinases
           superfamily protein isoform 2 [Theobroma cacao]
          Length = 330

 Score =  243 bits (621), Expect = 3e-62
 Identities = 136/260 (52%), Positives = 162/260 (62%), Gaps = 46/260 (17%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAE----GSWNVAWDARPARWLHNSDS 525
           GAAS+WH ILP             GR   E  K+ E    GSWNVAWDARPARWLH  DS
Sbjct: 67  GAASIWHAILPC-------GGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119

Query: 524 AWLLFGVSAGAPP----VDLDPDSNSEV----------LVATDDSK--------INNYRV 411
           AWLLFGV A   P    VD++PD++ ++          L A + S          +N +V
Sbjct: 120 AWLLFGVCACLAPMIEFVDVNPDADDKIEGAELNLVSRLSADEKSSSSSSSVAAADNCKV 179

Query: 410 IGVTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQV---VEELLKRRKEVEWF 240
            GV ADGRCLFRAIAH ACLR GE APDE  Q+ELADELRAQV   V ELLKRR+E EWF
Sbjct: 180 TGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVSLVVNELLKRREETEWF 239

Query: 239 IDEEFDMYVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNE 111
           I+ +FD YVK I+QPY+WGGEPE+LM SHVL+                    YG+EY+ +
Sbjct: 240 IEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGEEYQKD 299

Query: 110 EVNSINVLFYGYGHYDVLEA 51
           + N INVLF+GYGHYD+LE+
Sbjct: 300 KENPINVLFHGYGHYDILES 319


>dbj|BAE71258.1| hypothetical protein [Trifolium pratense]
          Length = 326

 Score =  243 bits (620), Expect = 4e-62
 Identities = 138/254 (54%), Positives = 162/254 (63%), Gaps = 41/254 (16%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAAS+WH I+P         A  F  H    +K  EGSWNVAWDARPARWLH SDSAWLL
Sbjct: 66  GAASIWHAIMPCGGDGFQRGA--FMVHHDHELK-GEGSWNVAWDARPARWLHRSDSAWLL 122

Query: 512 FGVSAG-APP---VDLDPD-------------SNSEVLVATD-------DSKINNYRVIG 405
           FGV A  APP   VD+DP+             S SE L   D       D   ++YRV G
Sbjct: 123 FGVRAWLAPPPVIVDVDPEVPLPTSVISPDEISRSEGLEIKDAESDKPNDELSSDYRVTG 182

Query: 404 VTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEF 225
           V ADGRCLFRA+AH ACL+ GE AP+E RQ+ELADELRA+V EELLKRRKE EWFI+ +F
Sbjct: 183 VLADGRCLFRALAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETEWFIEGDF 242

Query: 224 DMYVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSI 96
           D YV RI+Q ++WGGEPELLM SHVL+                    YG+EY N+E  SI
Sbjct: 243 DTYVTRIQQSFVWGGEPELLMASHVLKTPIFVFMRDPNSIDLVNIAKYGEEYMNDEGISI 302

Query: 95  NVLFYGYGHYDVLE 54
           NVLF+ +GHY++LE
Sbjct: 303 NVLFHRHGHYELLE 316


>ref|XP_006436685.1| hypothetical protein CICLE_v10032126mg [Citrus clementina]
           gi|568878376|ref|XP_006492172.1| PREDICTED:
           uncharacterized protein LOC102630016 [Citrus sinensis]
           gi|557538881|gb|ESR49925.1| hypothetical protein
           CICLE_v10032126mg [Citrus clementina]
          Length = 322

 Score =  240 bits (613), Expect = 3e-61
 Identities = 131/256 (51%), Positives = 159/256 (62%), Gaps = 43/256 (16%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAAS+WH ILPS      +  +   R  +   K  EGSWN A D RPARWLH +DSAWLL
Sbjct: 67  GAASIWHAILPS------DGCSGCRRRRNGRRKPGEGSWNAASDERPARWLHRADSAWLL 120

Query: 512 FGVSAGAPPVDL---DPDSNSEVLVATDD--SKINN---------------------YRV 411
           FGV +   P++      DSN E +   ++  SKI+                      ++V
Sbjct: 121 FGVCSCLAPIEYWTDSNDSNPETVTFYEEKISKIDGGGGGGDDDLNVKRCEIINERPFKV 180

Query: 410 IGVTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDE 231
            GV ADGRCLFRAIAH ACLR GE  PDE RQ+ELADELRAQVV+ELLKRRKE EWFI+ 
Sbjct: 181 TGVLADGRCLFRAIAHGACLRSGEEVPDEERQRELADELRAQVVDELLKRRKETEWFIEG 240

Query: 230 EFDMYVKRIEQPYIWGGEPELLMCSHVLR-----------------AXXYGDEYKNEEVN 102
           +FD YVK I+QPY+WGGEPELLM SHVL+                    YG+EY+ ++ +
Sbjct: 241 DFDTYVKEIQQPYVWGGEPELLMASHVLKKPIAVFMVVQSSGNLVNIANYGEEYQKDKES 300

Query: 101 SINVLFYGYGHYDVLE 54
            INVLF+GYGHYD+LE
Sbjct: 301 PINVLFHGYGHYDILE 316


>ref|XP_004496177.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cicer
           arietinum]
          Length = 313

 Score =  237 bits (604), Expect = 3e-60
 Identities = 136/251 (54%), Positives = 157/251 (62%), Gaps = 38/251 (15%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAAS+WH I P           V  +H+ +     EGSWNVAWDARPARWLH SDSAWLL
Sbjct: 64  GAASIWHAIRPC-GGDGFRRGVVTVQHDHD--LKGEGSWNVAWDARPARWLHRSDSAWLL 120

Query: 512 FGVSAG-APPVDLD------------PDSNSE---VLVATDDSKINN-----YRVIGVTA 396
           FGV A  APPV  D             D NSE   +  A  D + N+     YRV GV A
Sbjct: 121 FGVCACLAPPVIADVDLEAPPTPAINTDENSEGREMKYAEGDKERNDELSADYRVTGVLA 180

Query: 395 DGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEFDMY 216
           DGRCLFRAIAH ACL  GE AP+E RQ+ELADELRA+V EELLKRRKE EWFI+ +FD Y
Sbjct: 181 DGRCLFRAIAHGACLNNGEEAPNENRQRELADELRARVAEELLKRRKETEWFIEGDFDAY 240

Query: 215 VKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYGDEYKNEEVNSINVL 87
           V RI Q Y+WGGEPELLM SHVL+                    YG+EY N++  SINVL
Sbjct: 241 VNRIRQTYVWGGEPELLMASHVLKTPIYVFMRDASSIDLVNIAKYGEEYMNDKEISINVL 300

Query: 86  FYGYGHYDVLE 54
           F+ +GHY++LE
Sbjct: 301 FHRHGHYEILE 311


>gb|EXC25419.1| hypothetical protein L484_016802 [Morus notabilis]
          Length = 338

 Score =  236 bits (601), Expect = 7e-60
 Identities = 138/274 (50%), Positives = 158/274 (57%), Gaps = 51/274 (18%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESE---PVKH-----AEGSWNVAWDARPARWLH 537
           GAAS+WH ILPS        +   GR       P  H      EGSWN A DARPARWLH
Sbjct: 68  GAASIWHAILPS--------SGAGGRRFDRWRLPAIHFELLKGEGSWNAAVDARPARWLH 119

Query: 536 NSDSAWLLFGVSAGAPPVDLD-------PDSNSEVLVATDDSKI---------------- 426
            +DSAWLLFGV A   P  LD        D +SE      + ++                
Sbjct: 120 RADSAWLLFGVCACLAPATLDVVGGGDGEDVSSETPAVVSEQRLVVSSASDGSFSGANID 179

Query: 425 --NNYRVIGVTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKE 252
              +YRV GV ADGRCLFRAIAH+A LR GE APDE RQ+ELADELRAQVV ELLKRR+E
Sbjct: 180 SSADYRVTGVLADGRCLFRAIAHVAFLRNGEEAPDENRQRELADELRAQVVNELLKRREE 239

Query: 251 VEWFIDEEFDMYVKRIEQPYIWGGEPELLMCSHVLRA-----------------XXYG-D 126
            EWFI+ +FD YVK I+QPY+WGGEPELLM SHVL+                    YG +
Sbjct: 240 SEWFIEGDFDAYVKNIQQPYVWGGEPELLMASHVLKTPIWVFMRDRSTGALVNIAKYGEE 299

Query: 125 EYKNEEVNSINVLFYGYGHYDVLEASR*PICTLT 24
           EY  +E N INVLF+GYGHYD+LE      C  T
Sbjct: 300 EYGKDEQNPINVLFHGYGHYDILETPSDKSCQKT 333


>ref|XP_002315401.2| hypothetical protein POPTR_0010s24050g [Populus trichocarpa]
           gi|550330486|gb|EEF01572.2| hypothetical protein
           POPTR_0010s24050g [Populus trichocarpa]
          Length = 303

 Score =  231 bits (590), Expect = 1e-58
 Identities = 120/207 (57%), Positives = 143/207 (69%), Gaps = 24/207 (11%)
 Frame = -3

Query: 692 GAASVWHTILPSYWRQRPEAAAVFGRHESEPVKHAEGSWNVAWDARPARWLHNSDSAWLL 513
           GAA++WH I P+ WR+R E  +V G          EGSWN AWD RPARWLH  DSAWLL
Sbjct: 63  GAAAIWHVIQPADWRRRTERRSVRG----------EGSWNAAWDGRPARWLHRPDSAWLL 112

Query: 512 FGVSAG-APPVDLDPDSNS---------------EVLVATDDSKINN--------YRVIG 405
           FGV A  AP ++   D N+               ++  ++DD+K +N        Y+V G
Sbjct: 113 FGVCACLAPAIEFLSDVNNIDDVDHQEKERIDGGDLNASSDDAKQDNSDATVGSDYKVTG 172

Query: 404 VTADGRCLFRAIAHMACLRKGEVAPDEIRQKELADELRAQVVEELLKRRKEVEWFIDEEF 225
           V ADGRCLFRAIAHMACLR GE APDE RQ+ELADELRAQVV+ELLKRR+E EWFI+ +F
Sbjct: 173 VLADGRCLFRAIAHMACLRNGEEAPDENRQRELADELRAQVVDELLKRREETEWFIEGDF 232

Query: 224 DMYVKRIEQPYIWGGEPELLMCSHVLR 144
           D YVKRI+QPY+WGGEPELLM SHVL+
Sbjct: 233 DAYVKRIQQPYVWGGEPELLMASHVLK 259


Top