BLASTX nr result

ID: Perilla23_contig00014253

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00014253
         (1025 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011095837.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   283   1e-73
ref|XP_012849177.1| PREDICTED: aspartic proteinase nepenthesin-1...   265   6e-68
emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   221   1e-54
ref|XP_002265771.3| PREDICTED: aspartic proteinase nepenthesin-2...   220   2e-54
ref|XP_010092446.1| Aspartic proteinase nepenthesin-1 [Morus not...   202   4e-49
ref|XP_010064103.1| PREDICTED: aspartic proteinase CDR1 [Eucalyp...   199   3e-48
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              192   3e-46
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...   189   4e-45
ref|XP_010543040.1| PREDICTED: aspartic proteinase nepenthesin-1...   188   7e-45
ref|XP_010260839.1| PREDICTED: aspartic proteinase nepenthesin-1...   187   1e-44
ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arab...   187   2e-44
gb|KDO61509.1| hypothetical protein CISIN_1g046757mg [Citrus sin...   186   3e-44
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...   186   3e-44
gb|KHG15209.1| Asparticase nepenthesin-1 [Gossypium arboreum]         184   1e-43
ref|XP_012463657.1| PREDICTED: aspartic proteinase nepenthesin-1...   184   1e-43
ref|XP_009363310.1| PREDICTED: aspartic proteinase nepenthesin-1...   184   1e-43
ref|XP_010468142.1| PREDICTED: aspartic proteinase nepenthesin-2...   183   2e-43
ref|NP_187876.2| aspartyl protease family protein [Arabidopsis t...   183   2e-43
gb|AAL49921.1| unknown protein [Arabidopsis thaliana]                 183   2e-43
ref|XP_007049083.1| Eukaryotic aspartyl protease family protein,...   182   3e-43

>ref|XP_011095837.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Sesamum
           indicum]
          Length = 488

 Score =  283 bits (725), Expect = 1e-73
 Identities = 149/246 (60%), Positives = 177/246 (71%), Gaps = 1/246 (0%)
 Frame = -3

Query: 735 GGLKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKI 556
           GG K +LIHRH L  K    A+ ++RLRQL+HSDT+R+  IS K+R R +G    SRR++
Sbjct: 31  GGTKFELIHRHHLERK---PATQIQRLRQLLHSDTIRLPEISHKVRLR-QGHFDASRRQL 86

Query: 555 QEKNGYEPAC-NNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTG 379
            E+  Y PAC N+           GE+PMHSGADYG GQY V+ RVGSPAQK++LIADTG
Sbjct: 87  PEETAYYPACTNSSRRSKNDNNVSGEMPMHSGADYGTGQYFVRFRVGSPAQKLMLIADTG 146

Query: 378 SDLTWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPS 199
           SDLTW+N         C  +S   RVF AD SSSF TV CSS++CKIDLANLFSLA CPS
Sbjct: 147 SDLTWMNCKYRCRGGRCRKSSNKGRVFLADHSSSFRTVHCSSSMCKIDLANLFSLARCPS 206

Query: 198 PLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGV 19
           P+DPC YDYRYSDGSA +GLFA E VTF L+N RK R+ NVLVGCSES+ GQS +  DGV
Sbjct: 207 PMDPCAYDYRYSDGSAALGLFANEMVTFTLTNRRKTRLRNVLVGCSESTRGQSFQGADGV 266

Query: 18  IGLGYS 1
           +GLGYS
Sbjct: 267 MGLGYS 272


>ref|XP_012849177.1| PREDICTED: aspartic proteinase nepenthesin-1 [Erythranthe guttatus]
           gi|604314897|gb|EYU27603.1| hypothetical protein
           MIMGU_mgv1a004950mg [Erythranthe guttata]
          Length = 503

 Score =  265 bits (676), Expect = 6e-68
 Identities = 152/290 (52%), Positives = 178/290 (61%), Gaps = 14/290 (4%)
 Frame = -3

Query: 828 MVTCTRQGXXXXXXXXXXXXXSNSVEILEG----HGGLKLQLIHRHDLHPKWRG-GASPL 664
           MVT TRQ              + S++  EG     G +KL+LIHRH L  + R   A PL
Sbjct: 1   MVTHTRQRGFSLFIICLFTIVNYSLKFTEGIRVSDGAVKLELIHRHHLQGERRNVAAQPL 60

Query: 663 ERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQEKNGYEPACNNXXXXXXXXXXXG 484
           ERLRQLVHSD VR+R IS K+   + G   V RR  +  + + PA  N            
Sbjct: 61  ERLRQLVHSDAVRLRGISLKVMLIQGGAGPVRRRVSETDDAFIPASTNGGGGGGSNNKEQ 120

Query: 483 ------EVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDLTWINXXXXXXXXXCSG 322
                 ++P+ SGAD+G GQY V+ RVGSPAQKVVLIADTGSDLTW+N           G
Sbjct: 121 FSNVSGQLPISSGADFGTGQYFVQFRVGSPAQKVVLIADTGSDLTWMNCKYRCRGGGGGG 180

Query: 321 ---NSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLDPCTYDYRYSDGSA 151
              NS  RR+F ADRSSSF TVPCSS  C  DLANLFSL  CPSP+ PC YDYRYSDGSA
Sbjct: 181 CRRNSNKRRLFWADRSSSFRTVPCSSTTCTNDLANLFSLTRCPSPISPCAYDYRYSDGSA 240

Query: 150 TVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGLGYS 1
             GLF  ETVT  L+NGRK R+ NVL+GCS SS G + ++ DGVIGLGYS
Sbjct: 241 AQGLFGNETVTLSLTNGRKTRLHNVLIGCSISSSGPTFQSADGVIGLGYS 290


>emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  221 bits (562), Expect = 1e-54
 Identities = 120/247 (48%), Positives = 155/247 (62%), Gaps = 4/247 (1%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++L+LIHRH      R   + L+RL++LVHSD+VR   I  K+R  +     + RRK +E
Sbjct: 1   MRLELIHRHSPQVMGRP-KTQLQRLKELVHSDSVRQLMILHKLRGGQ-----IPRRKAKE 54

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                    +            EVPMH  ADYG GQY V  +VG+P+QK +L+ADTGSDL
Sbjct: 55  VLSSSSGRGSDDAI--------EVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDL 106

Query: 369 TWINXXXXXXXXXCSGNST----HRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCP 202
           TW++         CS        H+RVF A+ SSSF T+PC + +CKI+L +LFSL +CP
Sbjct: 107 TWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCP 166

Query: 201 SPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDG 22
           +PL PC YDYRYSDGS  +G FA ETVT  L  GRK ++ NVL+GCSES  GQS +A DG
Sbjct: 167 TPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADG 226

Query: 21  VIGLGYS 1
           V+GLGYS
Sbjct: 227 VMGLGYS 233


>ref|XP_002265771.3| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 489

 Score =  220 bits (560), Expect = 2e-54
 Identities = 120/247 (48%), Positives = 155/247 (62%), Gaps = 4/247 (1%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++L+LIHRH      R   + L+RL++LVHSD+VR   I  K+R  +     + RRK +E
Sbjct: 41  MRLELIHRHSPQVMGRP-KTQLQRLKELVHSDSVRQLMILHKLRGGQ-----IPRRKAKE 94

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                    +            EVPMH  ADYG GQY V  +VG+P+QK +L+ADTGSDL
Sbjct: 95  VLSSSSGRGSDDAI--------EVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDL 146

Query: 369 TWINXXXXXXXXXCSGNST----HRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCP 202
           TW++         CS        H+RVF A+ SSSF T+PC + +CKI+L +LFSL +CP
Sbjct: 147 TWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCP 206

Query: 201 SPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDG 22
           +PL PC YDYRYSDGS  +G FA ETVT  L  GRK ++ NVL+GCSES  GQS +A DG
Sbjct: 207 TPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADG 266

Query: 21  VIGLGYS 1
           V+GLGYS
Sbjct: 267 VMGLGYS 273


>ref|XP_010092446.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]
           gi|587861358|gb|EXB51212.1| Aspartic proteinase
           nepenthesin-1 [Morus notabilis]
          Length = 464

 Score =  202 bits (514), Expect = 4e-49
 Identities = 104/249 (41%), Positives = 151/249 (60%), Gaps = 5/249 (2%)
 Frame = -3

Query: 735 GGLKLQLIHRHD--LHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRR 562
           G  +L+L+HR+   L  KW+   + +E+L +    D +R R +S +              
Sbjct: 22  GATRLELLHRNSPKLSEKWQIPETTMEKLIEFHRRDVLRHRMVSHR-------------- 67

Query: 561 KIQEKNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADT 382
               + G E A ++             +PM++GADYG G+Y V + VG+P Q+ +L+ADT
Sbjct: 68  ----RMGIETASSSASSIA--------MPMNAGADYGVGEYFVHVTVGTPGQRFMLVADT 115

Query: 381 GSDLTWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCP 202
           GSDLTW++           G   +RRVF ADRSSSF T+PC S +CK++LANLFSL+ CP
Sbjct: 116 GSDLTWMHCRCGRRCGTHKGRLNNRRVFHADRSSSFKTIPCLSEMCKVELANLFSLSKCP 175

Query: 201 SPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVG---QSLEA 31
           +PL PC YDYRY +GS+ +G FA ET++ RL+NG+KR++ +VLVGC+ES  G      + 
Sbjct: 176 TPLTPCAYDYRYLEGSSAIGFFANETISVRLANGKKRKLRDVLVGCTESVQGAEESGFKG 235

Query: 30  GDGVIGLGY 4
            DGV+GLG+
Sbjct: 236 ADGVLGLGF 244


>ref|XP_010064103.1| PREDICTED: aspartic proteinase CDR1 [Eucalyptus grandis]
           gi|629105951|gb|KCW71420.1| hypothetical protein
           EUGRSUZ_F04481 [Eucalyptus grandis]
          Length = 477

 Score =  199 bits (506), Expect = 3e-48
 Identities = 111/247 (44%), Positives = 152/247 (61%), Gaps = 2/247 (0%)
 Frame = -3

Query: 735 GGLKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKI 556
           G ++L+LIH     PK       ++R+R+LVHSD +R     R + + K  +   +RRK+
Sbjct: 33  GNVRLKLIHSQAYAPK--SNYDQMKRIRELVHSDILR-----RGIMFSKHHQS--TRRKV 83

Query: 555 QEKNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGS 376
            EK      C+N             +P+ SG DYG GQY V++ VG+P QK++LIADTGS
Sbjct: 84  WEKPRRRTNCSNISIG---------MPISSGRDYGTGQYFVEVNVGTPPQKMLLIADTGS 134

Query: 375 DLTWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSP 196
           +LTW+N                RR F++ RSS+F TVPCSS  CKID  +LFSLA CP+P
Sbjct: 135 ELTWMNCKRHG----------RRRGFQSTRSSTFKTVPCSSRTCKIDFMDLFSLARCPTP 184

Query: 195 LDPCTYDYRYSDGSATVGLFAKETVTFRLSN--GRKRRVENVLVGCSESSVGQSLEAGDG 22
             PC+YDYRYSDGS  +G+FA+ETVT  ++N  GR  +VE+V+VGC+ +  GQ  +  DG
Sbjct: 185 STPCSYDYRYSDGSGALGIFARETVTAEITNEKGRATKVEDVVVGCTLTLQGQGFQGADG 244

Query: 21  VIGLGYS 1
           V+GL YS
Sbjct: 245 VLGLAYS 251


>emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  192 bits (489), Expect = 3e-46
 Identities = 93/162 (57%), Positives = 115/162 (70%), Gaps = 4/162 (2%)
 Frame = -3

Query: 474 MHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDLTWINXXXXXXXXXCSGNST----HR 307
           MH  ADYG GQY V  +VG+P+QK +L+ADTGSDLTW++         CS        H+
Sbjct: 1   MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60

Query: 306 RVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLDPCTYDYRYSDGSATVGLFAKE 127
           RVF A+ SSSF T+PC + +CKI+L +LFSL +CP+PL PC YDYRYSDGS  +G FA E
Sbjct: 61  RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120

Query: 126 TVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGLGYS 1
           TVT  L  GRK ++ NVL+GCSES  GQS +A DGV+GLGYS
Sbjct: 121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYS 162


>ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella]
           gi|482566377|gb|EOA30566.1| hypothetical protein
           CARUB_v10013693mg [Capsella rubella]
          Length = 448

 Score =  189 bits (479), Expect = 4e-45
 Identities = 102/243 (41%), Positives = 138/243 (56%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           L+L+L HR  L P      +PL R+  ++ +D  R   ISR  +++   +M         
Sbjct: 32  LRLELAHRDTLWP------NPLSRIEDIIGADHKRHSLISRNRKYKGGVKM--------- 76

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                                   P+ SG DYG  QY  ++RVG+PA+K  ++ DTGS+L
Sbjct: 77  ------------------------PLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSEL 112

Query: 369 TWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLD 190
           TW+N           G   +RRVFRA+ S SF TV C +  CK+DL NLFSL++CP+P  
Sbjct: 113 TWVNCKYRGRG---KGRVENRRVFRAEESKSFRTVGCFTQTCKVDLMNLFSLSTCPTPST 169

Query: 189 PCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGL 10
           PC+YDYRY+DGSA  G+FAKETVT  L+NGRK R+  +L+GCS S  GQS    DGV+GL
Sbjct: 170 PCSYDYRYADGSAAQGIFAKETVTVGLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGL 229

Query: 9   GYS 1
            +S
Sbjct: 230 AFS 232


>ref|XP_010543040.1| PREDICTED: aspartic proteinase nepenthesin-1 [Tarenaya hassleriana]
          Length = 440

 Score =  188 bits (477), Expect = 7e-45
 Identities = 90/161 (55%), Positives = 110/161 (68%)
 Frame = -3

Query: 483 EVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDLTWINXXXXXXXXXCSGNSTHRR 304
           E+P+ SG D+G GQYL +LRVG+P+QK  ++ DTGS+LTW+N                RR
Sbjct: 65  EMPLGSGRDFGTGQYLTELRVGTPSQKFTVVVDTGSELTWVNCRYGCRRNCTERRRKRRR 124

Query: 303 VFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLDPCTYDYRYSDGSATVGLFAKET 124
           VFRAD+SSSF TV C S  CKIDL NLFSL++CPSP  PC Y YRY DGS   G+F +ET
Sbjct: 125 VFRADQSSSFRTVACESQTCKIDLMNLFSLSTCPSPSSPCAYHYRYVDGSEAEGIFGEET 184

Query: 123 VTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGLGYS 1
           VT  L+NGR+ RV+ VLVGCS S  G S    DGV+GL +S
Sbjct: 185 VTVGLTNGRRGRVKGVLVGCSHSFSGLSFRRADGVLGLAFS 225


>ref|XP_010260839.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Nelumbo
           nucifera]
          Length = 481

 Score =  187 bits (475), Expect = 1e-44
 Identities = 108/248 (43%), Positives = 141/248 (56%), Gaps = 5/248 (2%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGA----SPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRR 562
           ++ ++IHRH      R GA    + LE++R+LV  D  R + I  ++  R E      RR
Sbjct: 27  MRFEMIHRHSPELSGRLGAGLQKTRLEQVRELVRLDEQRTQMIYHRIGQRTE------RR 80

Query: 561 KIQEKNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADT 382
           K  E                       VPM SG+  G G Y V  RVG+PAQ V+L+ADT
Sbjct: 81  KDAEGGADGQIGAAAWTGKVIGSSGASVPMFSGSFAGEGLYFVPFRVGTPAQNVLLVADT 140

Query: 381 GSDLTWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCP 202
           GSDLTW+N            +   RR F AD SSSFTT+PC S +CK DLA +FSL  CP
Sbjct: 141 GSDLTWMNCIHGCRNCGRKVD--RRRFFNADLSSSFTTIPCLSRMCKNDLAVMFSLTDCP 198

Query: 201 SPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSL-EAGD 25
            PL+PC YDY YS G +  G FA E+VT RL+NGRK ++ +VLVGC++++ GQ      D
Sbjct: 199 KPLNPCKYDYSYSSGQSAQGFFANESVTVRLTNGRKMKIHHVLVGCTQTTQGQKFSNVVD 258

Query: 24  GVIGLGYS 1
           G++GLGYS
Sbjct: 259 GILGLGYS 266


>ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata] gi|297328626|gb|EFH59045.1| hypothetical protein
           ARALYDRAFT_478632 [Arabidopsis lyrata subsp. lyrata]
          Length = 449

 Score =  187 bits (474), Expect = 2e-44
 Identities = 101/243 (41%), Positives = 139/243 (57%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++L+L HR  L P      +PL R+  ++ +D  R   ISRK +++   +M +       
Sbjct: 31  VRLKLAHRDTLWP------NPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDLG------ 78

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                                      SG DYG  QY  ++RVG+PA+K  ++ DTGS+L
Sbjct: 79  ---------------------------SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSEL 111

Query: 369 TWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLD 190
           TW+N           G   +RRVFRA+ S SF TV C +  CK+DL NLFSL++CP+P  
Sbjct: 112 TWVNCRYRGRG---KGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPST 168

Query: 189 PCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGL 10
           PC+YDYRY+DGSA  G+FAKET+T  L+NGRK R+  +LVGCS S  GQS +  DGV+GL
Sbjct: 169 PCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGL 228

Query: 9   GYS 1
            +S
Sbjct: 229 AFS 231


>gb|KDO61509.1| hypothetical protein CISIN_1g046757mg [Citrus sinensis]
          Length = 445

 Score =  186 bits (471), Expect = 3e-44
 Identities = 99/245 (40%), Positives = 141/245 (57%), Gaps = 3/245 (1%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++++LIHRH          S +ER+++L+H+D +R      K R R+     + +     
Sbjct: 7   VRMELIHRHSPKLNNMPMMSEVERMKELLHNDIIR----QNKRRGRR-----LRQTNNNN 57

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
            NG   +               E+P+ +G DYG G Y V+++VG+P+QK+ LI DTGS+ 
Sbjct: 58  NNGASGSA-------------IEMPLQAGRDYGTGMYFVEIKVGTPSQKLRLIVDTGSEF 104

Query: 369 TWINXXXXXXXXXCSGNS---THRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPS 199
           +WI+             +   + RRVF+AD SSSF T+PCSS +CK + A LFSL  CP+
Sbjct: 105 SWISCRYHCGPSCTKKGTIAGSRRRVFKADLSSSFKTIPCSSDMCKSEFARLFSLTFCPT 164

Query: 198 PLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGV 19
           P  PC YDYRY+DGSA  G+F KE VT  L NG K R+E V++GCS++  GQ     DGV
Sbjct: 165 PTSPCAYDYRYADGSAAKGIFGKERVTIGLENGGKTRIEEVVMGCSDTIQGQIFAEADGV 224

Query: 18  IGLGY 4
           +GL Y
Sbjct: 225 LGLSY 229


>ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina]
           gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic
           proteinase nepenthesin-1-like [Citrus sinensis]
           gi|557524190|gb|ESR35557.1| hypothetical protein
           CICLE_v10004908mg [Citrus clementina]
          Length = 470

 Score =  186 bits (471), Expect = 3e-44
 Identities = 99/245 (40%), Positives = 141/245 (57%), Gaps = 3/245 (1%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++++LIHRH          S +ER+++L+H+D +R      K R R+     + +     
Sbjct: 32  VRMELIHRHSPKLNNMPMMSEVERMKELLHNDIIR----QNKRRGRR-----LRQTNNNN 82

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
            NG   +               E+P+ +G DYG G Y V+++VG+P+QK+ LI DTGS+ 
Sbjct: 83  NNGASGSA-------------IEMPLQAGRDYGTGMYFVEIKVGTPSQKLRLIVDTGSEF 129

Query: 369 TWINXXXXXXXXXCSGNS---THRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPS 199
           +WI+             +   + RRVF+AD SSSF T+PCSS +CK + A LFSL  CP+
Sbjct: 130 SWISCRYHCGPSCTKKGTIAGSRRRVFKADLSSSFKTIPCSSDMCKSEFARLFSLTFCPT 189

Query: 198 PLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGV 19
           P  PC YDYRY+DGSA  G+F KE VT  L NG K R+E V++GCS++  GQ     DGV
Sbjct: 190 PTSPCAYDYRYADGSAAKGIFGKERVTIGLENGGKTRIEEVVMGCSDTIQGQIFAEADGV 249

Query: 18  IGLGY 4
           +GL Y
Sbjct: 250 LGLSY 254


>gb|KHG15209.1| Asparticase nepenthesin-1 [Gossypium arboreum]
          Length = 473

 Score =  184 bits (467), Expect = 1e-43
 Identities = 100/244 (40%), Positives = 139/244 (56%), Gaps = 4/244 (1%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           + L+LIHRH          +  +RL  L++ D +R   +S + R ++E  +  S      
Sbjct: 36  ITLELIHRHAPQFTNNNPITQHQRLVDLLYHDIIRHGIMSHRRRAKEEDPLTAS------ 89

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                                 ++P+ SG D+G GQY+   +VG+P+QK  LI DTGSDL
Sbjct: 90  ---------------------IKMPLASGRDFGIGQYITSFKVGTPSQKFWLIVDTGSDL 128

Query: 369 TWINXXXXXXXXXCS----GNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCP 202
           TWI           S    G    +RVF A  SSSF  VPC S +CK++L NLFSL +CP
Sbjct: 129 TWIRCRYRCSRGDRSCTSKGRINRKRVFHAPLSSSFNPVPCFSEMCKVELMNLFSLTTCP 188

Query: 201 SPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDG 22
           +P+ PC YDYRYSDGSA +G+FA ETV+  L+NGRK R+ NVL+GC++S  G +L+  DG
Sbjct: 189 TPITPCAYDYRYSDGSAAMGVFANETVSAGLTNGRKTRLHNVLIGCTDSFQGPTLQNVDG 248

Query: 21  VIGL 10
           ++GL
Sbjct: 249 IMGL 252


>ref|XP_012463657.1| PREDICTED: aspartic proteinase nepenthesin-1 [Gossypium raimondii]
           gi|763814626|gb|KJB81478.1| hypothetical protein
           B456_013G147300 [Gossypium raimondii]
          Length = 473

 Score =  184 bits (466), Expect = 1e-43
 Identities = 100/244 (40%), Positives = 140/244 (57%), Gaps = 4/244 (1%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           + L+LIHRH          +  +RL  L++ D +R   +S + R ++E  +  S      
Sbjct: 36  ITLELIHRHAPQFTNNHPITQHQRLVDLLYHDIIRHGIMSHRRRAKEEDPLTAS------ 89

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                                 ++P+ SG D+G GQY+   +VG+P+QK  LI DTGSDL
Sbjct: 90  ---------------------IKMPLASGRDFGIGQYITSFKVGTPSQKFWLIVDTGSDL 128

Query: 369 TWINXXXXXXXXXCS----GNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCP 202
           TWI           S    G    +RVF A  SSSF+ VPC S +CK++L NLFSL +CP
Sbjct: 129 TWIRCRYRCSRGDRSCTRKGRINRKRVFHAPLSSSFSPVPCFSEMCKVELMNLFSLTTCP 188

Query: 201 SPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDG 22
           +P+ PC YDYRYSDGSA +G+FA ETV+  L+NGRK R+ NVL+GC++S  G +L+  DG
Sbjct: 189 TPITPCAYDYRYSDGSAAMGVFANETVSAGLTNGRKTRLHNVLIGCTDSFQGPTLQNVDG 248

Query: 21  VIGL 10
           ++GL
Sbjct: 249 IMGL 252


>ref|XP_009363310.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Pyrus x
           bretschneideri]
          Length = 497

 Score =  184 bits (466), Expect = 1e-43
 Identities = 107/261 (40%), Positives = 147/261 (56%), Gaps = 19/261 (7%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGG----ASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMG---- 574
           LKL+LIHR+  H     G     +  E LR L   D VR + IS + + ++E  +     
Sbjct: 36  LKLKLIHRYSPHYNGLHGDEKPKNQQELLRLLHRHDVVRHQMISYRRQQQQEESLLDAEE 95

Query: 573 --------VSRRKIQEKNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVG 418
                    +RR   EK G                    +P+ SG+DYG GQYLVK+++G
Sbjct: 96  VILNSSRIAARRMAWEKRG-----------------SMVMPISSGSDYGWGQYLVKIKIG 138

Query: 417 SPAQKVVLIADTGSDLTWINXXXXXXXXXCS--GNSTHRRVFRADRSSSFTTVPCSSAIC 244
           +PAQK +L+ADTGSDLTWIN             G   H+RVFRA+ SSSF TVPCSS +C
Sbjct: 139 TPAQKFLLVADTGSDLTWINCRYRCRNRCEKHQGRLQHKRVFRAELSSSFKTVPCSSKLC 198

Query: 243 KIDLANLFSLASCPSPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGC 64
           K+ L  +FSL  C +P  PC YDY Y +G+   GLFA ETV   L++GR+ ++ENV++GC
Sbjct: 199 KVGLWTMFSLQQCSTPTSPCRYDYSYIEGTHAFGLFANETVRATLASGRRTKLENVIIGC 258

Query: 63  SESSVGQ-SLEAGDGVIGLGY 4
           ++   G   +  GDG++GLG+
Sbjct: 259 TDHIKGSGGIRHGDGILGLGF 279


>ref|XP_010468142.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Camelina sativa]
          Length = 498

 Score =  183 bits (464), Expect = 2e-43
 Identities = 100/243 (41%), Positives = 136/243 (55%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++L+L HR  L P      +PL R+   + +D  R   ISRK    K G           
Sbjct: 81  VRLELTHRDTLWP------NPLSRIGDSIGADHKRHSLISRKRTMYKGG----------- 123

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                                 ++P+ SG DY   QY  ++RVG+PA+   ++ DTGS+L
Sbjct: 124 ---------------------VKMPLGSGIDYRTAQYFTEIRVGTPAKTFRVVVDTGSEL 162

Query: 369 TWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLD 190
           TW+N           G + +RRVFRA+ S SF TV CS+  CK+DL NLFSL++CP+P  
Sbjct: 163 TWVNCRYRGRG---KGKAENRRVFRAEESKSFRTVGCSTQTCKVDLMNLFSLSTCPTPST 219

Query: 189 PCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGL 10
           PC+YDYRY+DGSA  G+FAKET+T  L+ GRK R+  +L+GCS S  GQS    DGV+GL
Sbjct: 220 PCSYDYRYADGSAAQGVFAKETITVGLTTGRKARLHGLLIGCSSSFSGQSFTGADGVLGL 279

Query: 9   GYS 1
            +S
Sbjct: 280 AFS 282


>ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
           gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA
           binding protein-like [Arabidopsis thaliana]
           gi|332641715|gb|AEE75236.1| aspartyl protease family
           protein [Arabidopsis thaliana]
          Length = 461

 Score =  183 bits (464), Expect = 2e-43
 Identities = 101/243 (41%), Positives = 137/243 (56%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++L+L HR  L PK      PL R+  ++ +D  R   ISRK    +   +GV       
Sbjct: 49  VRLKLAHRDTLLPK------PLSRIEDVIGADQKRHSLISRK----RNSTVGV------- 91

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                                 ++ + SG DYG  QY  ++RVG+PA+K  ++ DTGS+L
Sbjct: 92  ----------------------KMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSEL 129

Query: 369 TWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLD 190
           TW+N          +    +RRVFRAD S SF TV C +  CK+DL NLFSL +CP+P  
Sbjct: 130 TWVNCRYR------ARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPST 183

Query: 189 PCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGL 10
           PC+YDYRY+DGSA  G+FAKET+T  L+NGR  R+   L+GCS S  GQS +  DGV+GL
Sbjct: 184 PCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGL 243

Query: 9   GYS 1
            +S
Sbjct: 244 AFS 246


>gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  183 bits (464), Expect = 2e-43
 Identities = 101/243 (41%), Positives = 137/243 (56%)
 Frame = -3

Query: 729 LKLQLIHRHDLHPKWRGGASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVSRRKIQE 550
           ++L+L HR  L PK      PL R+  ++ +D  R   ISRK    +   +GV       
Sbjct: 27  VRLKLAHRDTLLPK------PLSRIEDVIGADQKRHSLISRK----RNSTVGV------- 69

Query: 549 KNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIADTGSDL 370
                                 ++ + SG DYG  QY  ++RVG+PA+K  ++ DTGS+L
Sbjct: 70  ----------------------KMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSEL 107

Query: 369 TWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLASCPSPLD 190
           TW+N          +    +RRVFRAD S SF TV C +  CK+DL NLFSL +CP+P  
Sbjct: 108 TWVNCRYR------ARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPST 161

Query: 189 PCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAGDGVIGL 10
           PC+YDYRY+DGSA  G+FAKET+T  L+NGR  R+   L+GCS S  GQS +  DGV+GL
Sbjct: 162 PCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGL 221

Query: 9   GYS 1
            +S
Sbjct: 222 AFS 224


>ref|XP_007049083.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
           cacao] gi|508701344|gb|EOX93240.1| Eukaryotic aspartyl
           protease family protein, putative [Theobroma cacao]
          Length = 478

 Score =  182 bits (463), Expect = 3e-43
 Identities = 101/248 (40%), Positives = 142/248 (57%), Gaps = 6/248 (2%)
 Frame = -3

Query: 729 LKLQLIHRH-----DLHPKWRGG-ASPLERLRQLVHSDTVRVRAISRKMRWRKEGEMGVS 568
           ++ +LIHRH     + H    G   S  ER++QLVHSD  R+  IS+++  R+   M   
Sbjct: 37  VRFKLIHRHSPELGEDHGTTLGPPTSTRERIKQLVHSDNARLHTISQRLGPRR---MTFE 93

Query: 567 RRKIQEKNGYEPACNNXXXXXXXXXXXGEVPMHSGADYGAGQYLVKLRVGSPAQKVVLIA 388
            + +   N                    E+PM S AD G GQY V  RVGSP +K ++IA
Sbjct: 94  MKMMGSSN------------------LVELPMRSAADIGTGQYFVSFRVGSPPKKFIMIA 135

Query: 387 DTGSDLTWINXXXXXXXXXCSGNSTHRRVFRADRSSSFTTVPCSSAICKIDLANLFSLAS 208
           DTGS LTW+                H R+F A++S +F  +PCSS +CK++L+  FSLA 
Sbjct: 136 DTGSSLTWMRCSYKCKNFSMDRTKLHERIFYANQSRTFKPIPCSSDVCKVELSQSFSLAL 195

Query: 207 CPSPLDPCTYDYRYSDGSATVGLFAKETVTFRLSNGRKRRVENVLVGCSESSVGQSLEAG 28
           CP+P+ PC YDYRY+DG+  VG+F  +TV  RLS G+K +V +V+VGCSE+  G   +  
Sbjct: 196 CPTPMAPCAYDYRYADGTRVVGIFGNDTVKVRLSGGQKIKVTDVMVGCSEAIRGNFHDI- 254

Query: 27  DGVIGLGY 4
           DGV+GLG+
Sbjct: 255 DGVMGLGF 262

