BLASTX nr result

ID: Paeonia25_contig00011459 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00011459
         (1576 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   599   e-168
ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2...   596   e-167
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              545   e-152
gb|EYU27603.1| hypothetical protein MIMGU_mgv1a004950mg [Mimulus...   510   e-142
gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]    492   e-136
ref|XP_007022806.1| Eukaryotic aspartyl protease family protein,...   491   e-136
ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1...   479   e-132
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...   478   e-132
ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arab...   467   e-129
ref|XP_007049083.1| Eukaryotic aspartyl protease family protein,...   458   e-126
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...   456   e-125
ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutr...   451   e-124
ref|NP_187876.2| aspartyl protease family protein [Arabidopsis t...   446   e-122
gb|AAL49921.1| unknown protein [Arabidopsis thaliana]                 446   e-122
ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative...   420   e-115
gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]       419   e-114
ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prun...   410   e-111
ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citr...   407   e-111
ref|XP_003532899.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   403   e-109
ref|XP_006598230.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   392   e-106

>emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  599 bits (1544), Expect = e-168
 Identities = 298/450 (66%), Positives = 344/450 (76%), Gaps = 13/450 (2%)
 Frame = -3

Query: 1445 MRLELIHRHS--LGGRPKTQLERINELLHSDSIRHRMISHKRQXXXXXXXRKAWERSSHK 1272
            MRLELIHRHS  + GRPKTQL+R+ EL+HSDS+R  MI HK +        KA E  S  
Sbjct: 1    MRLELIHRHSPQVMGRPKTQLQRLKELVHSDSVRQLMILHKLRGGQIPRR-KAKEVLSSS 59

Query: 1271 S------TIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHN 1110
            S       I++P+   AD   GQYFV  KVGTP+QKFML+ADTGSDLTWM+CKY C   N
Sbjct: 60   SGRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119

Query: 1109 C-----RRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYS 945
            C     RR+ H+RVF++  SS+FKT+PC +DMCK+ELM+LFSL +CP PLTPC YDYRYS
Sbjct: 120  CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 944  DGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFAT 765
            DGS  +G FANETVTV L  GRK KLHN+LIGCSESF+GQSFQAADGV+GLG+SKYSFA 
Sbjct: 180  DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 764  KAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPFYAVNV 585
            KAAEK+ GKFSYCLVDHLS KNVSNYLTFG   +K    +   +T+LVLG++N FYAVN+
Sbjct: 240  KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 584  MGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXL 405
            MGISIG  ML+IPSEVWD+   GGTILDSG+SLT LT PAYQPVM              +
Sbjct: 300  MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 404  NIGPLEYCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGT 225
            +IGPLEYCFNSTGF +SLVPRL  HFADGA FEPPVKSYVI  ADGV+CLGFVS  WPGT
Sbjct: 360  DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419

Query: 224  SVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
            SV+GNIMQQ+HLWEFDL   KL F+PS+CT
Sbjct: 420  SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  596 bits (1536), Expect = e-167
 Identities = 297/450 (66%), Positives = 343/450 (76%), Gaps = 13/450 (2%)
 Frame = -3

Query: 1445 MRLELIHRHS--LGGRPKTQLERINELLHSDSIRHRMISHKRQXXXXXXXRKAWERSSHK 1272
            MRLELIHRHS  + GRPKTQL+R+ EL+HSDS+R  MI HK +        KA E  S  
Sbjct: 1    MRLELIHRHSPQVMGRPKTQLQRLKELVHSDSVRQLMILHKLRGGQIPRR-KAKEVLSSS 59

Query: 1271 S------TIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHN 1110
            S       I++P+   AD   GQY V  KVGTP+QKFML+ADTGSDLTWM+CKY C   N
Sbjct: 60   SGRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119

Query: 1109 C-----RRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYS 945
            C     RR+ H+RVF++  SS+FKT+PC +DMCK+ELM+LFSL +CP PLTPC YDYRYS
Sbjct: 120  CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 944  DGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFAT 765
            DGS  +G FANETVTV L  GRK KLHN+LIGCSESF+GQSFQAADGV+GLG+SKYSFA 
Sbjct: 180  DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 764  KAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPFYAVNV 585
            KAAEK+ GKFSYCLVDHLS KNVSNYLTFG   +K    +   +T+LVLG++N FYAVN+
Sbjct: 240  KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 584  MGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXL 405
            MGISIG  ML+IPSEVWD+   GGTILDSG+SLT LT PAYQPVM              +
Sbjct: 300  MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 404  NIGPLEYCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGT 225
            +IGPLEYCFNSTGF +SLVPRL  HFADGA FEPPVKSYVI  ADGV+CLGFVS  WPGT
Sbjct: 360  DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419

Query: 224  SVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
            SV+GNIMQQ+HLWEFDL   KL F+PS+CT
Sbjct: 420  SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  545 bits (1405), Expect = e-152
 Identities = 261/374 (69%), Positives = 298/374 (79%), Gaps = 5/374 (1%)
 Frame = -3

Query: 1241 ADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHNC-----RRLHHRRVFN 1077
            AD   GQY V  KVGTP+QKFML+ADTGSDLTWM+CKY C   NC     RR+ H+RVF+
Sbjct: 5    ADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 64

Query: 1076 SERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSATVGLFANETVTV 897
            +  SS+FKT+PC +DMCK+ELM+LFSL +CP PLTPC YDYRYSDGS  +G FANETVTV
Sbjct: 65   ANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTV 124

Query: 896  GLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFATKAAEKYNGKFSYCLVD 717
             L  GRK KLHN+LIGCSESF+GQSFQAADGV+GLG+SKYSFA KAAEK+ GKFSYCLVD
Sbjct: 125  ELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVD 184

Query: 716  HLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPFYAVNVMGISIGNIMLRIPSEV 537
            HLS KNVSNYLTFG   +K    +   +T+LVLG++N FYAVN+MGISIG  ML+IPSEV
Sbjct: 185  HLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 244

Query: 536  WDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLNIGPLEYCFNSTGFHD 357
            WD+   GGTILDSG+SLT LT PAYQPVM              ++IGPLEYCFNSTGF +
Sbjct: 245  WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEE 304

Query: 356  SLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVIGNIMQQDHLWEFD 177
            SLVPRL  HFADGA FEPPVKSYVI  ADGV+CLGFVS  WPGTSV+GNIMQQ+HLWEFD
Sbjct: 305  SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFD 364

Query: 176  LVQNKLSFSPSTCT 135
            L   KL F+PS+CT
Sbjct: 365  LGLKKLGFAPSSCT 378


>gb|EYU27603.1| hypothetical protein MIMGU_mgv1a004950mg [Mimulus guttatus]
          Length = 503

 Score =  510 bits (1313), Expect = e-142
 Identities = 270/469 (57%), Positives = 324/469 (69%), Gaps = 32/469 (6%)
 Frame = -3

Query: 1448 AMRLELIHRHSLGGRPKT----QLERINELLHSDSIRHRMISHKR---QXXXXXXXRKAW 1290
            A++LELIHRH L G  +      LER+ +L+HSD++R R IS K    Q       R+  
Sbjct: 37   AVKLELIHRHHLQGERRNVAAQPLERLRQLVHSDAVRLRGISLKVMLIQGGAGPVRRRVS 96

Query: 1289 ER----------------SSHKSTI-----QMPIRSGADDRTGQYFVGIKVGTPAQKFML 1173
            E                 S++K        Q+PI SGAD  TGQYFV  +VG+PAQK +L
Sbjct: 97   ETDDAFIPASTNGGGGGGSNNKEQFSNVSGQLPISSGADFGTGQYFVQFRVGSPAQKVVL 156

Query: 1172 IADTGSDLTWMNCKYRCVGHN---CRR-LHHRRVFNSERSSTFKTVPCFSDMCKVELMNL 1005
            IADTGSDLTWMNCKYRC G     CRR  + RR+F ++RSS+F+TVPC S  C  +L NL
Sbjct: 157  IADTGSDLTWMNCKYRCRGGGGGGCRRNSNKRRLFWADRSSSFRTVPCSSTTCTNDLANL 216

Query: 1004 FSLASCPAPLTPCAYDYRYSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQ 825
            FSL  CP+P++PCAYDYRYSDGSA  GLF NETVT+ L NGRKT+LHN+LIGCS S  G 
Sbjct: 217  FSLTRCPSPISPCAYDYRYSDGSAAQGLFGNETVTLSLTNGRKTRLHNVLIGCSISSSGP 276

Query: 824  SFQAADGVLGLGFSKYSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPS 645
            +FQ+ADGV+GLG+S YS A KA+  + G FSYCLVDHLSPKN+S+YLTFG  + +T    
Sbjct: 277  TFQSADGVIGLGYSNYSLAVKASNLFRGIFSYCLVDHLSPKNISSYLTFGSAKQQT---D 333

Query: 644  KTRFTKLVLGVINPFYAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPA 465
               +T L+L VINPFYAV++ GISIG  ML IP+EVWD+   GG ILDSGTSLT L  PA
Sbjct: 334  TMHYTALILDVINPFYAVSMNGISIGGSMLDIPAEVWDVKGSGGVILDSGTSLTSLVGPA 393

Query: 464  YQPVMDTXXXXXXXXXXXXLNIGPLEYCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYV 285
            Y+PVM              L++GPLEYCFNSTGF +S+VPRL  HF DGARFEPPVKSYV
Sbjct: 394  YRPVMAALTASLSGFEKLGLDVGPLEYCFNSTGFVESVVPRLVFHFGDGARFEPPVKSYV 453

Query: 284  IDTADGVKCLGFVSANWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTC 138
            ID A GVKCLGFV   WPG SV+GNIMQQ++ WEFDLV  +L F  S+C
Sbjct: 454  IDAAPGVKCLGFVGGAWPGVSVVGNIMQQNYFWEFDLVNKRLGFGSSSC 502


>gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]
          Length = 464

 Score =  492 bits (1266), Expect = e-136
 Identities = 259/481 (53%), Positives = 331/481 (68%), Gaps = 18/481 (3%)
 Frame = -3

Query: 1523 NLSIFFLFITNCLVSTPAFSENHST-----AMRLELIHRHS--LGGR---PKTQLERINE 1374
            N S+FF F          F+ N+       A RLEL+HR+S  L  +   P+T +E++ E
Sbjct: 3    NFSLFFFF----------FAINYGLIIDVGATRLELLHRNSPKLSEKWQIPETTMEKLIE 52

Query: 1373 LLHSDSIRHRMISHKRQXXXXXXXRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIKVGT 1194
                D +RHRM+SH+R              SS  S+I MP+ +GAD   G+YFV + VGT
Sbjct: 53   FHRRDVLRHRMVSHRRMGIETA--------SSSASSIAMPMNAGADYGVGEYFVHVTVGT 104

Query: 1193 PAQKFMLIADTGSDLTWMNCKY--RCVGHNCRRLHHRRVFNSERSSTFKTVPCFSDMCKV 1020
            P Q+FML+ADTGSDLTWM+C+   RC  H   RL++RRVF+++RSS+FKT+PC S+MCKV
Sbjct: 105  PGQRFMLVADTGSDLTWMHCRCGRRCGTHK-GRLNNRRVFHADRSSSFKTIPCLSEMCKV 163

Query: 1019 ELMNLFSLASCPAPLTPCAYDYRYSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSE 840
            EL NLFSL+ CP PLTPCAYDYRY +GS+ +G FANET++V LANG+K KL ++L+GC+E
Sbjct: 164  ELANLFSLSKCPTPLTPCAYDYRYLEGSSAIGFFANETISVRLANGKKRKLRDVLVGCTE 223

Query: 839  SFKG---QSFQAADGVLGLGFSKYSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGH 669
            S +G     F+ ADGVLGLGF  ++F  KAA+ + GKFSYCLVDHLSPKN+SNY+ FG  
Sbjct: 224  SVQGAEESGFKGADGVLGLGFGNHTFTRKAAQYFGGKFSYCLVDHLSPKNLSNYIIFGHD 283

Query: 668  E-NKTFSPSKTRFTKLVL-GVINPFYAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSG 495
            + +K    S  + T LVL G   PFY VN+ GISIG ++LRIPS  W+ S  GG IL+SG
Sbjct: 284  KADKASCSSSLQHTDLVLGGDYGPFYGVNLSGISIGGVLLRIPSVAWNASLGGGAILESG 343

Query: 494  TSLTVLTMPAYQPV-MDTXXXXXXXXXXXXLNIGPLEYCFNSTGFHDSLVPRLKIHFADG 318
            TSLT LT P Y PV  +                GP E+CFNSTG+ +S +P L+IHF++G
Sbjct: 344  TSLTFLTDPVYGPVTSELNKFTSRFGTLLPPGGGPFEFCFNSTGYDESKMPPLRIHFSNG 403

Query: 317  ARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTC 138
            A FEPPVKSY++D A   KCLGFVSA+WPGTS+IGNIMQQ+HLWEFDL   +L F+PSTC
Sbjct: 404  AIFEPPVKSYILDIAPEKKCLGFVSASWPGTSIIGNIMQQNHLWEFDLENTRLGFAPSTC 463

Query: 137  T 135
            T
Sbjct: 464  T 464


>ref|XP_007022806.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508722434|gb|EOY14331.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 473

 Score =  491 bits (1263), Expect = e-136
 Identities = 258/455 (56%), Positives = 314/455 (69%), Gaps = 17/455 (3%)
 Frame = -3

Query: 1448 AMRLELIHRHS--LGGRPKTQLERINELLHSDSIRHRMISHKRQXXXXXXXRKAWERSSH 1275
            +++LEL+HRH+  L  RPKTQ ER+ +L+H D IRH    ++RQ         A   S  
Sbjct: 22   SIKLELLHRHAPQLHARPKTQHERLKDLVHHDFIRH----NRRQAWETPKTTTA-TASKT 76

Query: 1274 KSTIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCV-GHNC--- 1107
             + IQMP+ +G D   GQY    KVGTP+QKF LI DTGSDLTW+NC+YRC  G NC   
Sbjct: 77   NAAIQMPLSAGRDFGIGQYVTTFKVGTPSQKFRLIVDTGSDLTWINCRYRCARGDNCTTQ 136

Query: 1106 -RRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYR------- 951
             R +   RVF +  SS+F+ +PCFS MCKVEL NLFSL  CP PLTPCAYDYR       
Sbjct: 137  ERGIKRGRVFRAHLSSSFRPIPCFSQMCKVELRNLFSLTICPTPLTPCAYDYRFNSLKLV 196

Query: 950  ---YSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSK 780
               Y DGS  +G+FA E+VTVGL N R  +LH++LIGCS+S +G++ +  DGVLGL  SK
Sbjct: 197  LNRYIDGSDAMGVFAKESVTVGLTNSRMARLHDVLIGCSDSSQGRTVKNVDGVLGLANSK 256

Query: 779  YSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPF 600
            YSF TKAAE++ GKFSYCLVDHLS  N SNYL FG + N+      TR+T+L L +++  
Sbjct: 257  YSFVTKAAERWGGKFSYCLVDHLSHINASNYLIFGANNNQLTVLGNTRYTRLELNLVSFS 316

Query: 599  YAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXX 420
            YAVNV GISIG  ML IP +VWD    GGTILDSGTSL+ LT PAYQPVM          
Sbjct: 317  YAVNVQGISIGGKMLDIPLQVWDTRKGGGTILDSGTSLSFLTDPAYQPVMAAIKMSVSKY 376

Query: 419  XXXXLNIGPLEYCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSA 240
                L+  P+EYCFNSTGF ++LVP+L IHFADGARFEP  +SYVI  ADGV+CLGF+ A
Sbjct: 377  PQVKLHGVPMEYCFNSTGFDETLVPKLIIHFADGARFEPHWRSYVISAADGVRCLGFLPA 436

Query: 239  NWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
             +P  SVIGNIMQQ++LWEFDL  NKL F+PS+CT
Sbjct: 437  RFPSVSVIGNIMQQNYLWEFDLEGNKLRFAPSSCT 471


>ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Fragaria vesca
            subsp. vesca]
          Length = 482

 Score =  479 bits (1232), Expect = e-132
 Identities = 263/486 (54%), Positives = 323/486 (66%), Gaps = 16/486 (3%)
 Frame = -3

Query: 1544 MKWRLISNLSI-FFLFITNCLVSTPAFSEN--HSTAMRLELIHRHSLGGR-PKTQLERIN 1377
            MK R  S L I F L+  + ++S  +F +       M+LELIHRHSL    PKTQLE I 
Sbjct: 1    MKGRTSSFLFIVFLLYALHGILSVHSFLDESKQDEPMKLELIHRHSLRVEMPKTQLELIE 60

Query: 1376 ELLHSDSIRHRMISHKRQXXXXXXXRKAWERSSHKS-TIQMPIRSGADDRTGQYFVGIKV 1200
            EL   D IRH+MIS +RQ             +   + +I MP+ S  D   GQYFV IKV
Sbjct: 61   ELQRHDVIRHQMISRRRQHHHHSIPTGLRRNALETAASIAMPLSSAWDFGAGQYFVQIKV 120

Query: 1199 GTPAQKFMLIADTGSDLTWMNCKYRCVGHNC------RRLHHRRVFNSERSSTFKTVPCF 1038
            GTP+Q+F+LIADTGSDLTWM CKYRCV   C       + + ++VF   +SSTFK +PC 
Sbjct: 121  GTPSQRFLLIADTGSDLTWMKCKYRCVADKCGLKRATMKKNKKKVFRPAQSSTFKIIPCS 180

Query: 1037 SDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSATVGLFANETVTVGLANGRKTKLHNM 858
            S+MCK EL   FS   CP PL+PC YDYRY++ S  +G FANETV V L NGR+ +L+++
Sbjct: 181  SEMCKFELE--FSRQECPTPLSPCKYDYRYAESSGALGFFANETVRVPLTNGRRARLNDV 238

Query: 857  LIGCSESF---KGQSFQAADGVLGLGFSKYSFATKAAEKYNGKFSYCLVDHLSPKNVSNY 687
            LIGC+ES    KG S +A DG+LGLGF K+SF  KAA     KFSYCLVDH+S KNVS+Y
Sbjct: 239  LIGCTESIEGPKGASIRAGDGILGLGFGKHSFVAKAASNLGDKFSYCLVDHMSNKNVSSY 298

Query: 686  LTFGGHENKTFSPSKTRFTKLVLG--VINPFYAVNVMGISIGNIMLRIPSEVWDISSVGG 513
            LTFG +       S+ R+TKL LG   I PFYAVN++GIS G+ ML+IP+EVW+ +  GG
Sbjct: 299  LTFGRNAETAQQNSRMRYTKLALGGPKIGPFYAVNLVGISAGSKMLKIPNEVWNENLGGG 358

Query: 512  TILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLNIGPLEYCFNSTGFHDSLVPRLKI 333
            TI+DSGTSLT LT PAY  VMD              +    E+CFNSTG+  SLVPR  I
Sbjct: 359  TIVDSGTSLTFLTSPAYIHVMDELTMALSKYKKIPSDA--FEFCFNSTGYDQSLVPRFAI 416

Query: 332  HFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVIGNIMQQDHLWEFDLVQNKLSF 153
            HFADGA+FEPPVKSYVID A   KCLGF SA +PGT VIGNIMQQ++LWEFDL   +L +
Sbjct: 417  HFADGAKFEPPVKSYVIDVAIQTKCLGFQSAPFPGTIVIGNIMQQNYLWEFDLRGGRLGY 476

Query: 152  SPSTCT 135
            +PS+CT
Sbjct: 477  APSSCT 482


>ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella]
            gi|482566377|gb|EOA30566.1| hypothetical protein
            CARUB_v10013693mg [Capsella rubella]
          Length = 448

 Score =  478 bits (1231), Expect = e-132
 Identities = 245/456 (53%), Positives = 315/456 (69%), Gaps = 1/456 (0%)
 Frame = -3

Query: 1499 ITNCLVSTPAFSENHSTAMRLELIHRHSLGGRPKTQLERINELLHSDSIRHRMISHKRQX 1320
            IT  L+   A      TA+RLEL HR +L   P   L RI +++ +D  RH +IS  R+ 
Sbjct: 14   ITTMLLLISAADSVKDTALRLELAHRDTLWPNP---LSRIEDIIGADHKRHSLISRNRK- 69

Query: 1319 XXXXXXRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWM 1140
                          +K  ++MP+ SG D  T QYF  ++VGTPA+KF ++ DTGS+LTW+
Sbjct: 70   --------------YKGGVKMPLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWV 115

Query: 1139 NCKYRCVGHNCRRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAY 960
            NCKYR  G    R+ +RRVF +E S +F+TV CF+  CKV+LMNLFSL++CP P TPC+Y
Sbjct: 116  NCKYR--GRGKGRVENRRVFRAEESKSFRTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSY 173

Query: 959  DYRYSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSK 780
            DYRY+DGSA  G+FA ETVTVGL NGRK +LH +LIGCS SF GQSF+ ADGVLGL FS 
Sbjct: 174  DYRYADGSAAQGIFAKETVTVGLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGLAFSD 233

Query: 779  YSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPF 600
            +SF + A   +  KFSYCLVDHLSPKNVSNYL FG   + T   +  R T L L +I PF
Sbjct: 234  FSFTSTATSLFGAKFSYCLVDHLSPKNVSNYLIFGSSSSAT-KNAPGRTTPLDLTLIPPF 292

Query: 599  YAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXX 420
            YA++V+GIS+G  ML IP++VWD ++ GGT+LDSGTSLT+L+  AY+PV+          
Sbjct: 293  YAISVIGISLGEDMLDIPAQVWDATTGGGTVLDSGTSLTLLSEAAYKPVVTGLARYLDEL 352

Query: 419  XXXXLNIGPLEYCFNST-GFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVS 243
                    P+EYCF+ST GF++S +P+L  H   GARFEP  KSY+IDTA GVKCLGF+S
Sbjct: 353  ERVKPEGVPIEYCFSSTSGFNESKLPQLTFHMKGGARFEPHRKSYLIDTAPGVKCLGFMS 412

Query: 242  ANWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
            A  P T+V+GNIMQQ++LWEFDL+ + LSF+PS+CT
Sbjct: 413  AGTPATNVVGNIMQQNYLWEFDLMASTLSFAPSSCT 448


>ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
            lyrata] gi|297328626|gb|EFH59045.1| hypothetical protein
            ARALYDRAFT_478632 [Arabidopsis lyrata subsp. lyrata]
          Length = 449

 Score =  467 bits (1201), Expect = e-129
 Identities = 245/469 (52%), Positives = 320/469 (68%), Gaps = 2/469 (0%)
 Frame = -3

Query: 1535 RLISNLSIFFLFITNCLVSTPAFSENHSTAMRLELIHRHSLGGRPKTQLERINELLHSDS 1356
            R+ ++LS   L IT  L+ T A S    TA+RL+L HR +L   P   L RI +++ +D 
Sbjct: 3    RIKTSLSCLCL-ITTLLLLTAADS-TEDTAVRLKLAHRDTLWPNP---LSRIEDIIGADQ 57

Query: 1355 IRHRMISHKRQXXXXXXXRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIKVGTPAQKFM 1176
             RH +IS KR+                K  ++M + SG D  T QYF  ++VGTPA+KF 
Sbjct: 58   KRHSLISRKRKF---------------KGGVKMDLGSGIDYGTAQYFTEVRVGTPAKKFR 102

Query: 1175 LIADTGSDLTWMNCKYRCVGHNCRRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSL 996
            ++ DTGS+LTW+NC+YR  G    ++ +RRVF +E S +FKTV CF+  CKV+LMNLFSL
Sbjct: 103  VVVDTGSELTWVNCRYR--GRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSL 160

Query: 995  ASCPAPLTPCAYDYRYSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQ 816
            ++CP P TPC+YDYRY+DGSA  G+FA ET+TVGL NGRK +L  +L+GCS SF GQSFQ
Sbjct: 161  STCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQ 220

Query: 815  AADGVLGLGFSKYSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKT- 639
             ADGVLGL FS +SF + A   +  K SYCLVDHLS KN+SNYL FG   + T + +   
Sbjct: 221  GADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPG 280

Query: 638  RFTKLVLGVINPFYAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQ 459
            R T L L +I PFYA+N++GISIG+ ML IP++VWD ++ GGTILDSGTSLT+L   AY+
Sbjct: 281  RTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTGGGTILDSGTSLTLLAEAAYK 340

Query: 458  PVMDTXXXXXXXXXXXXLNIGPLEYCFNST-GFHDSLVPRLKIHFADGARFEPPVKSYVI 282
            PV+                  P+EYCF+ST GF++S +P+L  H   GARFEP  KSY++
Sbjct: 341  PVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYLV 400

Query: 281  DTADGVKCLGFVSANWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
            D A GVKCLGF+SA  P T+V+GNIMQQ++LWEFDL+ + LSF+PSTCT
Sbjct: 401  DAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFDLMASTLSFAPSTCT 449


>ref|XP_007049083.1| Eukaryotic aspartyl protease family protein, putative [Theobroma
            cacao] gi|508701344|gb|EOX93240.1| Eukaryotic aspartyl
            protease family protein, putative [Theobroma cacao]
          Length = 478

 Score =  458 bits (1179), Expect = e-126
 Identities = 236/487 (48%), Positives = 321/487 (65%), Gaps = 11/487 (2%)
 Frame = -3

Query: 1562 NITINMMKWRLISNLSIFFLFITNCLVSTPAFSENHSTAMRLELIHRHS--LG------- 1410
            +I++  +   L  N S FF    +  ++ P      +  +R +LIHRHS  LG       
Sbjct: 4    SISLVFLSLFLFFNHSFFFQAHASEAITPP------NEKVRFKLIHRHSPELGEDHGTTL 57

Query: 1409 GRPKTQLERINELLHSDSIRHRMISHKRQXXXXXXXRKAWERSSHKSTIQMPIRSGADDR 1230
            G P +  ERI +L+HSD+ R   IS +          K    S+    +++P+RS AD  
Sbjct: 58   GPPTSTRERIKQLVHSDNARLHTISQRLGPRRMTFEMKMMGSSN---LVELPMRSAADIG 114

Query: 1229 TGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHNCRRLH-HRRVFNSERSSTFK 1053
            TGQYFV  +VG+P +KF++IADTGS LTWM C Y+C   +  R   H R+F + +S TFK
Sbjct: 115  TGQYFVSFRVGSPPKKFIMIADTGSSLTWMRCSYKCKNFSMDRTKLHERIFYANQSRTFK 174

Query: 1052 TVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSATVGLFANETVTVGLANGRKT 873
             +PC SD+CKVEL   FSLA CP P+ PCAYDYRY+DG+  VG+F N+TV V L+ G+K 
Sbjct: 175  PIPCSSDVCKVELSQSFSLALCPTPMAPCAYDYRYADGTRVVGIFGNDTVKVRLSGGQKI 234

Query: 872  KLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFATKAAEKYNGKFSYCLVDHLSPKNVS 693
            K+ ++++GCSE+ +G +F   DGV+GLGF ++SFA KAA+++  KFSYCLVDHLSP N+ 
Sbjct: 235  KVTDVMVGCSEAIRG-NFHDIDGVMGLGFDQHSFAVKAAKEFGDKFSYCLVDHLSPSNLV 293

Query: 692  NYLTFGGHENKTFSP-SKTRFTKLVLGVINPFYAVNVMGISIGNIMLRIPSEVWDISSVG 516
            N+L FGG    T SP    +FT+L+LG++NP+YAVNV GIS+   ML IPS +WD+   G
Sbjct: 294  NFLVFGG---VTSSPLPNMQFTQLILGIVNPYYAVNVSGISVNGKMLDIPSYIWDVKGDG 350

Query: 515  GTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLNIGPLEYCFNSTGFHDSLVPRLK 336
            G I+DSG+SLT L  P +  V+              LN+GP +YCF++ GF +SL+P+L 
Sbjct: 351  GVIMDSGSSLTYLVKPLFDKVIAAFQAPLSKFKKLELNLGP-DYCFSAAGFEESLMPKLA 409

Query: 335  IHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVIGNIMQQDHLWEFDLVQNKLS 156
             HFADGA+  PPVKSYVID  + VKCLGF S +WPG SVIGNI+QQ+HLWEFDL+ ++L 
Sbjct: 410  FHFADGAKLVPPVKSYVIDAEEAVKCLGFSSTSWPGPSVIGNILQQNHLWEFDLLNSRLG 469

Query: 155  FSPSTCT 135
            F+ S+CT
Sbjct: 470  FAASSCT 476


>ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina]
            gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic
            proteinase nepenthesin-1-like [Citrus sinensis]
            gi|557524190|gb|ESR35557.1| hypothetical protein
            CICLE_v10004908mg [Citrus clementina]
          Length = 470

 Score =  456 bits (1174), Expect = e-125
 Identities = 234/448 (52%), Positives = 299/448 (66%), Gaps = 11/448 (2%)
 Frame = -3

Query: 1448 AMRLELIHRHS--LGGRPK-TQLERINELLHSDSIRHRMISHKRQXXXXXXXRKAWERSS 1278
            A+R+ELIHRHS  L   P  +++ER+ ELLH+D IR     +KR+              +
Sbjct: 31   AVRMELIHRHSPKLNNMPMMSEVERMKELLHNDIIRQ----NKRRGRRLRQTNNNNNNGA 86

Query: 1277 HKSTIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHNCRRL 1098
              S I+MP+++G D  TG YFV IKVGTP+QK  LI DTGS+ +W++C+Y C G +C + 
Sbjct: 87   SGSAIEMPLQAGRDYGTGMYFVEIKVGTPSQKLRLIVDTGSEFSWISCRYHC-GPSCTKK 145

Query: 1097 -----HHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSA 933
                   RRVF ++ SS+FKT+PC SDMCK E   LFSL  CP P +PCAYDYRY+DGSA
Sbjct: 146  GTIAGSRRRVFKADLSSSFKTIPCSSDMCKSEFARLFSLTFCPTPTSPCAYDYRYADGSA 205

Query: 932  TVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFA---TK 762
              G+F  E VT+GL NG KT++  +++GCS++ +GQ F  ADGVLGL + KYSFA   T 
Sbjct: 206  AKGIFGKERVTIGLENGGKTRIEEVVMGCSDTIQGQIFAEADGVLGLSYDKYSFAQKVTN 265

Query: 761  AAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPFYAVNVM 582
             +    GKF+YCLVDHLS KNVSNYL FG    +     + R    +LG+I P Y V+V 
Sbjct: 266  GSTFARGKFAYCLVDHLSHKNVSNYLIFGEESKR----MRMRMRYTLLGLIGPDYGVSVK 321

Query: 581  GISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLN 402
            GISIG +ML IPS+VWD +  GGT  DSGT+LT L  PAY+PV+               +
Sbjct: 322  GISIGGVMLNIPSQVWDFNRGGGTAFDSGTTLTFLAEPAYKPVVAALEMSLSRYQRLKRD 381

Query: 401  IGPLEYCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGTS 222
              P EYCFNSTGF +S VP+L  HFADGARFEP  KSY+I  A G++CLGFVSA WPG S
Sbjct: 382  -APFEYCFNSTGFDESSVPKLVFHFADGARFEPHTKSYIIRVAHGIRCLGFVSATWPGAS 440

Query: 221  VIGNIMQQDHLWEFDLVQNKLSFSPSTC 138
             IGNIMQQ++ WEFDL++++L F+PSTC
Sbjct: 441  AIGNIMQQNYFWEFDLLKDRLGFAPSTC 468


>ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum]
            gi|557108450|gb|ESQ48757.1| hypothetical protein
            EUTSA_v10020732mg [Eutrema salsugineum]
          Length = 444

 Score =  451 bits (1159), Expect = e-124
 Identities = 231/446 (51%), Positives = 296/446 (66%), Gaps = 1/446 (0%)
 Frame = -3

Query: 1472 AFSENHSTAMRLELIHRHSLGGRPKTQLERINELLHSDSIRHRMISHKRQXXXXXXXRKA 1293
            A      T +RLE+ HR +L     T   RI +++  D  RH +IS KR+        K 
Sbjct: 18   AADSTEDTVVRLEMAHRDTLW---PTAFRRIEDIIGEDQKRHSLISQKRKIKGGGGGAK- 73

Query: 1292 WERSSHKSTIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGH 1113
                       M + SG D    QYF  ++VGTPA++F ++ DTGS+LTW+NC++   G 
Sbjct: 74   -----------MALGSGFDYGAAQYFAEVRVGTPAKRFRVVVDTGSELTWVNCRFHGKGK 122

Query: 1112 NCRRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSA 933
                  +RRVF +E SS+F+ V C +  CKV+LMNLFSL++CP P TPC+YDYRY+DGSA
Sbjct: 123  E-----NRRVFRAEESSSFRKVGCLTQTCKVDLMNLFSLSNCPTPSTPCSYDYRYADGSA 177

Query: 932  TVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFATKAAE 753
              G+FA ET TVGL NGRK KL  +LIGCS SF G SF+ ADGVLGL  S YSF +KA  
Sbjct: 178  AQGVFAKETFTVGLTNGRKAKLRGLLIGCSSSFSGDSFRGADGVLGLALSDYSFTSKATN 237

Query: 752  KYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPFYAVNVMGIS 573
             + GKFSYCLVDHLS KNVSNYLTFG   + T + +  R T L L +I PFYA+N++GIS
Sbjct: 238  IFGGKFSYCLVDHLSNKNVSNYLTFGSSSSTTKTAASIRTTPLDLKLIPPFYAINIIGIS 297

Query: 572  IGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLNIGP 393
            IG+ ML IP++VWD ++ GGTILDSGTSLT L   AY+ V+                  P
Sbjct: 298  IGDDMLDIPTQVWDATAGGGTILDSGTSLTFLADAAYKAVVSGLERYLVGFKRVKPEGVP 357

Query: 392  LEYCFNST-GFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVI 216
            +EYCF++T GF++S +P+L  HF  GARFEP  +SYV+DT +GV+CLGFVS   P T+V+
Sbjct: 358  IEYCFDTTSGFNESKLPQLTFHFKGGARFEPHRRSYVVDTLEGVRCLGFVSTGSPATNVV 417

Query: 215  GNIMQQDHLWEFDLVQNKLSFSPSTC 138
            GNIMQQ++LWEFDLV + LSF+PSTC
Sbjct: 418  GNIMQQNYLWEFDLVASTLSFAPSTC 443


>ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA
            binding protein-like [Arabidopsis thaliana]
            gi|332641715|gb|AEE75236.1| aspartyl protease family
            protein [Arabidopsis thaliana]
          Length = 461

 Score =  446 bits (1147), Expect = e-122
 Identities = 241/456 (52%), Positives = 302/456 (66%), Gaps = 1/456 (0%)
 Frame = -3

Query: 1499 ITNCLVSTPAFSENHSTAMRLELIHRHSLGGRPKTQLERINELLHSDSIRHRMISHKRQX 1320
            IT  L+ T A S    T++RL+L HR +L  +P   L RI +++ +D  RH +IS KR  
Sbjct: 32   ITTLLLITVADSMK-DTSVRLKLAHRDTLLPKP---LSRIEDVIGADQKRHSLISRKRNS 87

Query: 1319 XXXXXXRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWM 1140
                              ++M + SG D  T QYF  I+VGTPA+KF ++ DTGS+LTW+
Sbjct: 88   TVG---------------VKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 132

Query: 1139 NCKYRCVGHNCRRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAY 960
            NC+YR  G +     +RRVF ++ S +FKTV C +  CKV+LMNLFSL +CP P TPC+Y
Sbjct: 133  NCRYRARGKD-----NRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 187

Query: 959  DYRYSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSK 780
            DYRY+DGSA  G+FA ET+TVGL NGR  +L   LIGCS SF GQSFQ ADGVLGL FS 
Sbjct: 188  DYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSD 247

Query: 779  YSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPF 600
            +SF + A   Y  KFSYCLVDHLS KNVSNYL FG   +++   +  R T L L  I PF
Sbjct: 248  FSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFG--SSRSTKTAFRRTTPLDLTRIPPF 305

Query: 599  YAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXX 420
            YA+NV+GIS+G  ML IPS+VWD +S GGTILDSGTSLT+L   AY+ V+          
Sbjct: 306  YAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVEL 365

Query: 419  XXXXLNIGPLEYCFNST-GFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVS 243
                    P+EYCF+ T GF+ S +P+L  H   GARFEP  KSY++D A GVKCLGFVS
Sbjct: 366  KRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVS 425

Query: 242  ANWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
            A  P T+VIGNIMQQ++LWEFDL+ + LSF+PS CT
Sbjct: 426  AGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461


>gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  446 bits (1147), Expect = e-122
 Identities = 241/456 (52%), Positives = 302/456 (66%), Gaps = 1/456 (0%)
 Frame = -3

Query: 1499 ITNCLVSTPAFSENHSTAMRLELIHRHSLGGRPKTQLERINELLHSDSIRHRMISHKRQX 1320
            IT  L+ T A S    T++RL+L HR +L  +P   L RI +++ +D  RH +IS KR  
Sbjct: 10   ITTLLLITVADSMK-DTSVRLKLAHRDTLLPKP---LSRIEDVIGADQKRHSLISRKRNS 65

Query: 1319 XXXXXXRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWM 1140
                              ++M + SG D  T QYF  I+VGTPA+KF ++ DTGS+LTW+
Sbjct: 66   TVG---------------VKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 110

Query: 1139 NCKYRCVGHNCRRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAY 960
            NC+YR  G +     +RRVF ++ S +FKTV C +  CKV+LMNLFSL +CP P TPC+Y
Sbjct: 111  NCRYRARGKD-----NRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 165

Query: 959  DYRYSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSK 780
            DYRY+DGSA  G+FA ET+TVGL NGR  +L   LIGCS SF GQSFQ ADGVLGL FS 
Sbjct: 166  DYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSD 225

Query: 779  YSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPF 600
            +SF + A   Y  KFSYCLVDHLS KNVSNYL FG   +++   +  R T L L  I PF
Sbjct: 226  FSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFG--SSRSTKTAFRRTTPLDLTRIPPF 283

Query: 599  YAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXX 420
            YA+NV+GIS+G  ML IPS+VWD +S GGTILDSGTSLT+L   AY+ V+          
Sbjct: 284  YAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVEL 343

Query: 419  XXXXLNIGPLEYCFNST-GFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVS 243
                    P+EYCF+ T GF+ S +P+L  H   GARFEP  KSY++D A GVKCLGFVS
Sbjct: 344  KRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVS 403

Query: 242  ANWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
            A  P T+VIGNIMQQ++LWEFDL+ + LSF+PS CT
Sbjct: 404  AGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439


>ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
            gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1
            precursor, putative [Ricinus communis]
          Length = 489

 Score =  420 bits (1080), Expect = e-115
 Identities = 221/456 (48%), Positives = 296/456 (64%), Gaps = 15/456 (3%)
 Frame = -3

Query: 1460 NHSTAMRLELIHRHS--------LGGRPKTQLERINELLHSDSIRHRMISHKRQXXXXXX 1305
            N+++ +  E+ H HS          G PK++L+   +LL SD+ R +MIS  R       
Sbjct: 38   NNNSGVWFEMFHMHSPKLKSQSKFLGPPKSRLDGTRQLLQSDNARRQMISSLRHGTRR-- 95

Query: 1304 XRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIKVGTPA-QKFMLIADTGSDLTWMNCKY 1128
              KA+E S    T Q+PI SGAD    QYFV I++GTP  QKF+L+ DTGSDLTWMNC+Y
Sbjct: 96   --KAFEVSH---TAQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEY 150

Query: 1127 RCVGHNCRRLHHRRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRY 948
             C        H  RVF +  SS+F+T+PC SD CK+EL + FSL  CP P  PC +DYRY
Sbjct: 151  WCKSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRY 210

Query: 947  SDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFA 768
             +G   +G+FANETVTVGL + +K +L ++LIGC+ESF  ++    DGV+GLG+ K+S A
Sbjct: 211  LNGPRAIGVFANETVTVGLNDHKKIRLFDVLIGCTESF-NETNGFPDGVMGLGYRKHSLA 269

Query: 767  TKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPFYAVN 588
             + AE +  KFSYCLVDHLS  N  N+L+FG  +       K + T+L+LG IN FY VN
Sbjct: 270  LRLAEIFGNKFSYCLVDHLSSSNHKNFLSFG--DIPEMKLPKMQHTELLLGYINAFYPVN 327

Query: 587  VMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXX 408
            V GIS+G  ML I S++W+++ VGG I+DSGTSLT+L   AY  V+D             
Sbjct: 328  VSGISVGGSMLSISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDA----LKPIFDKH 383

Query: 407  LNIGPLE------YCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFV 246
              + P+E      +CF   GF  + VPRL IHFADGA F+PPVKSY+ID A+G+KCLG +
Sbjct: 384  KKVVPIELPELNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGII 443

Query: 245  SANWPGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTC 138
             A++PG+S++GN+MQQ+HLWE+DL + KL F PS+C
Sbjct: 444  KADFPGSSILGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]
          Length = 449

 Score =  419 bits (1076), Expect = e-114
 Identities = 215/380 (56%), Positives = 264/380 (69%), Gaps = 4/380 (1%)
 Frame = -3

Query: 1262 QMPIRSGADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHNCRRLHHRRV 1083
            +MP+ +GAD    QY V  +VG+PAQ   LIADTGSDLTW  C Y C G  CRR    R+
Sbjct: 72   EMPMYAGADLGIAQYLVAFRVGSPAQSVALIADTGSDLTWTKCSYGC-GGGCRR-SSGRL 129

Query: 1082 FNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSATVGLFANETV 903
            F+++RS++FKTV C S  C V+L   FSL+ C  P  PCAYDYRY+DGS+  G+FA ETV
Sbjct: 130  FDADRSTSFKTVECSSTTCTVDLAGAFSLSRCSPPSDPCAYDYRYADGSSAEGIFAGETV 189

Query: 902  TVGLANGR-KTKLHNMLIGCSESFKGQSFQAADGVLGLGFSKYSFATKAAEKYNGKFSYC 726
             + LA GR K +L N+LIGC+++F G SFQ +DGVLGLG+S +SFA  AA ++  KFSYC
Sbjct: 190  ELKLAKGRGKARLQNVLIGCTKNFSGSSFQTSDGVLGLGYSNFSFAHAAAARFGDKFSYC 249

Query: 725  LVDHLSPKNVSNYLTF--GGHENKTFSPSKTRFTKLVLGVINPFYAVNVMGISIGNIMLR 552
            L+DHL+ KN S+Y+TF  G   + + S    R+T LVLGVI   YAVNV GISIG   LR
Sbjct: 250  LLDHLAAKNKSSYITFSSGRSISASISAGPIRYTDLVLGVIGSNYAVNVRGISIGGSWLR 309

Query: 551  IPSEVWD-ISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLNIGPLEYCFN 375
            IPS+ W+ +S  GG I+DSG+SLT L  PAY PV+              + IGP+E CFN
Sbjct: 310  IPSDTWNNLSGSGGVIIDSGSSLTALAPPAYAPVIAALNRSLARFGDPHVKIGPMECCFN 369

Query: 374  STGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVIGNIMQQD 195
            STGFH+S+VP+L IHFA G RFEPPVKSYVID A GV CLGFV A  PG SVIGNI+QQ+
Sbjct: 370  STGFHESVVPKLAIHFAGGTRFEPPVKSYVIDAAPGVVCLGFVQAASPGVSVIGNILQQN 429

Query: 194  HLWEFDLVQNKLSFSPSTCT 135
            H WEFDL   +L F+ S CT
Sbjct: 430  HWWEFDLGNRRLGFAASDCT 449


>ref|XP_007211847.1| hypothetical protein PRUPE_ppa004710mg [Prunus persica]
            gi|462407712|gb|EMJ13046.1| hypothetical protein
            PRUPE_ppa004710mg [Prunus persica]
          Length = 495

 Score =  410 bits (1053), Expect = e-111
 Identities = 223/497 (44%), Positives = 301/497 (60%), Gaps = 30/497 (6%)
 Frame = -3

Query: 1535 RLISNLSIFFLFITNCLVSTPAFSENHSTA-------MRLELIHRHSL--------GGRP 1401
            +L S   +FF+F+ + +        +H          MRLE+IHR+S         G  P
Sbjct: 2    KLRSGSILFFIFLLSAIHGLAFAHADHDEEDGDNGDEMRLEMIHRYSPHAKDHGVHGEIP 61

Query: 1400 KTQLERINELLHSDSIRHRMISHKRQXXXXXXXRKAWERSSH--------KSTIQMPIRS 1245
             TQ   I EL   D  R +M++ KRQ         +   S+         + ++ MP+ +
Sbjct: 62   PTQQALIQELHRHDVFRLQMMAQKRQQNGHDQGLNSSSSSNSTRRMDMQTRLSVTMPMNA 121

Query: 1244 GADDRTGQYFVGIKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHNCR----RLHHRRVFN 1077
            G D   GQY V +K+GTPAQKF +I  TGSDLTW+ C   C G +C     R+ H RVFN
Sbjct: 122  GWDYGIGQYLVKLKLGTPAQKFTVIPSTGSDLTWVRCGSHC-GKSCGIRKGRIDHSRVFN 180

Query: 1076 SERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSATVGLFANETVTV 897
            ++RSSTFK+V C S MC+ +L N  SL  CP PL+PC YDY Y +GS+ +G F  + V  
Sbjct: 181  TDRSSTFKSVTCSSKMCEFDLANFNSLNKCPRPLSPCRYDYSYVEGSSALGTFGTDIVRA 240

Query: 896  GLANGRKTKLHNMLIGCSESFKGQ-SFQAADGVLGLGFSKYSFATKAAEKYNGKFSYCLV 720
             L+NGR+ ++ ++LIGC+ES  G+ + + +DG+LGLGF KYSF TKAA KY GK SYCL+
Sbjct: 241  SLSNGRRNRMKDVLIGCTESIIGKGTAKGSDGILGLGFGKYSFTTKAALKYGGKVSYCLL 300

Query: 719  DHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVIN--PFYAVNVMGISIGNIMLRIP 546
            DH+SPKNV++YLTFG ++ K     K R+T+LV G  N   FY VN+ GIS+G  ML IP
Sbjct: 301  DHMSPKNVTSYLTFGDNK-KAVLQGKMRYTQLVFGNPNKGSFYGVNLQGISVGGKMLNIP 359

Query: 545  SEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLNIGPLEYCFNSTG 366
              +W+    GG ++DSG SLT LT PAY+PVM                    ++CF+  G
Sbjct: 360  LHIWNPKLGGGALVDSGMSLTFLTKPAYKPVMTALTMPLTKFRRLRSEEDDFDFCFDPRG 419

Query: 365  FHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVIGNIMQQDHLW 186
            + D LVP+L  HFA GA+F PPVKSYVID + G+KC+G +     G  +IGNI+QQ+HLW
Sbjct: 420  YRDRLVPKLVFHFAGGAKFAPPVKSYVIDVSPGMKCIGILPLA-EGACIIGNIIQQNHLW 478

Query: 185  EFDLVQNKLSFSPSTCT 135
            EF+LV+  L F+PSTCT
Sbjct: 479  EFNLVRKTLGFAPSTCT 495


>ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citrus clementina]
            gi|557531861|gb|ESR43044.1| hypothetical protein
            CICLE_v10013820mg [Citrus clementina]
          Length = 475

 Score =  407 bits (1046), Expect = e-111
 Identities = 219/465 (47%), Positives = 278/465 (59%), Gaps = 20/465 (4%)
 Frame = -3

Query: 1520 LSIFFLFITNCLVSTPAFSENHSTAMRLELIHRHS---------LGGRPKTQLERINELL 1368
            +S+F +F  +        S       R ELIHRHS             PK   ERI +L+
Sbjct: 12   ISLFLIFTLSLTRKAFLASTGKDPPPRFELIHRHSPQLSEHEATAYSPPKNLSERIRQLI 71

Query: 1367 HSDSIRHRMISHKRQXXXXXXXRKAWERSSHKST-------IQMPIRSGADDRTGQYFVG 1209
              D  R  MIS + +        +     SH  T       +++P+RSGAD   GQYFV 
Sbjct: 72   DGDIARQEMISRRLEDRRRRGRIRKASEISHHRTFNGTSNIVKIPLRSGADRGLGQYFVS 131

Query: 1208 IKVGTPAQKFMLIADTGSDLTWMNCKYRCVGHNCRR---LHHRRVFNSERSSTFKTVPCF 1038
             +VG+P QKF+LIADTGSDLTWM+C ++  G NC +       R+F ++ SSTFKT+PC 
Sbjct: 132  FRVGSPPQKFVLIADTGSDLTWMHCNHK--GENCPKDGLTPPNRMFQADASSTFKTIPCS 189

Query: 1037 SDMCKVELMNLFSLASCPAPLTPCAYDYRYSDGSATVGLFANETVTVGLANGRK-TKLHN 861
            S  CKV+L + FSL+ CP P+TPCAYDY Y DGS   G FANETVT G  + RK  +L  
Sbjct: 190  SRTCKVDLQDTFSLSMCPTPVTPCAYDYSYFDGSKVRGFFANETVTAGSIDRRKKVRLKE 249

Query: 860  MLIGCSESFKGQSFQAADGVLGLGFSKYSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLT 681
            + +GC++   G +F  ADGVLGLGF K SFA  AA+ ++ KFSYCLVDHLSP N +N+L 
Sbjct: 250  VTVGCTDWANG-NFHNADGVLGLGFGKNSFAATAAKLFDNKFSYCLVDHLSPSNFANFLN 308

Query: 680  FGGHENKTFSPSKTRFTKLVLGVINPFYAVNVMGISIGNIMLRIPSEVWDISSVGGTILD 501
            FG    +       + T+L+LG +NPFYAVNV GISI   ML +P E+W I   GG ILD
Sbjct: 309  FGNTSKQHIQ--NMQHTQLILGELNPFYAVNVSGISIAGKMLNVPPEMWHIHGAGGVILD 366

Query: 500  SGTSLTVLTMPAYQPVMDTXXXXXXXXXXXXLNIGPLEYCFNSTGFHDSLVPRLKIHFAD 321
            SGT+LT L  PAY   +                +GPL +C+N   F  + VP+  +HFAD
Sbjct: 367  SGTTLTFLGEPAYAAAVAALRAPLEKYKKLGHVLGPLRFCYNDPRFDMADVPQFVLHFAD 426

Query: 320  GARFEPPVKSYVIDTADGVKCLGFVSANWPGTSVIGNIMQQDHLW 186
            GA+F PP KSYVID   GVKC+GF SA WP  +VIGNIMQQ+HLW
Sbjct: 427  GAKFVPPKKSYVIDADVGVKCIGFASAGWPANTVIGNIMQQNHLW 471


>ref|XP_003532899.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Glycine
            max]
          Length = 507

 Score =  403 bits (1036), Expect = e-109
 Identities = 224/513 (43%), Positives = 304/513 (59%), Gaps = 42/513 (8%)
 Frame = -3

Query: 1547 MMKWRLISNLSIFFLFITNCLVSTPAFSENHSTAMRLELIHRH----SLGGRPKTQLERI 1380
            MM+W  I+  SI      + ++     S      MRLEL+HRH    S GG    Q+E +
Sbjct: 5    MMQWNTITKASILITITLHLILPVAVNS------MRLELVHRHHERFSGGGGDVDQVEAV 58

Query: 1379 NELLHSDSIRHRMISHKRQXXXXXXXRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIKV 1200
               ++ D +R + ++ +         RK  E ++  + ++MP+R+G DD  G+YF  +KV
Sbjct: 59   KGFVNRDGLRRQRMNQRWGVSNYDRRRKGLETTT-TTEVEMPMRAGRDDALGEYFTEVKV 117

Query: 1199 GTPAQKFMLIADTGSDLTWMNCKYRCV----------GHNCRRLHH-------------- 1092
            G+P Q+F L ADTGS+ TW NC  R             +  ++ HH              
Sbjct: 118  GSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRT 177

Query: 1091 ----------RRVFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAPLTPCAYDYRYSD 942
                      + VF   RS +F+ V C S  CK++L  LFSL+ CP P  PC YD  Y+D
Sbjct: 178  KKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYAD 237

Query: 941  GSATVGLFANETVTVGLANGRKTKLHNMLIGCSESFK-GQSF-QAADGVLGLGFSKYSFA 768
            GS+  G F  +T+TV L NG++ KL+N+ IGC++S + G +F +   G+LGLGF+K SF 
Sbjct: 238  GSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFI 297

Query: 767  TKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKLVLGVINPFYAVN 588
             KAA +Y  KFSYCLVDHLS +NVS+YLT GGH N      + + T+L+L    PFY VN
Sbjct: 298  DKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLL-GEIKRTELIL--FPPFYGVN 354

Query: 587  VMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDT-XXXXXXXXXXX 411
            V+GISIG  ML+IP +VWD +S GGT++DSGT+LT L +PAY+PV +             
Sbjct: 355  VVGISIGGQMLKIPPQVWDFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVT 414

Query: 410  XLNIGPLEYCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADGVKCLGFVSANW- 234
              + G L++CF++ GF DS+VPRL  HFA GARFEPPVKSY+ID A  VKC+G V  +  
Sbjct: 415  GEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGI 474

Query: 233  PGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
             G SVIGNIMQQ+HLWEFDL  N + F+PS CT
Sbjct: 475  GGASVIGNIMQQNHLWEFDLSTNTIGFAPSICT 507


>ref|XP_006598230.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Glycine
            max]
          Length = 521

 Score =  392 bits (1006), Expect = e-106
 Identities = 224/525 (42%), Positives = 307/525 (58%), Gaps = 54/525 (10%)
 Frame = -3

Query: 1547 MMKWRLISNLSIFFLFITNCLVSTPAFSENHSTAMRLELIHRH----SLGGRPKTQLERI 1380
            MM+W  I+  SI  + IT  L+   A +     +MRLEL+HRH    + GG    ++E +
Sbjct: 6    MMQWNTITKASIL-VTITLLLILPVAVN-----SMRLELVHRHHERFAGGGGDVDRVEAV 59

Query: 1379 NELLHSDSIRHRMISHK-RQXXXXXXXRKAWERSSHKSTIQMPIRSGADDRTGQYFVGIK 1203
               +  D +R + ++ +          RK +E ++  + ++MP+ SG DD  G+YF  +K
Sbjct: 60   KGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTTPAEVEMPMHSGRDDALGEYFAEVK 119

Query: 1202 VGTPAQKFMLIADTGSDLTWMNC------------------KYRCVGHNCRRLHHRR--- 1086
            VG+P Q+F L+ DTGS+ TW+NC                  K +   H+  + H +R   
Sbjct: 120  VGSPGQRFWLVVDTGSEFTWLNCVMQNATTTTTTRKNKTKKKQQQHHHHVHKHHSKRNNR 179

Query: 1085 ------------------------VFNSERSSTFKTVPCFSDMCKVELMNLFSLASCPAP 978
                                    VF   +S +F+ V C S  CKV+L  LFSL+ CP P
Sbjct: 180  TRTRRTRKKKVKSSKSNKSDPCKGVFCPHKSKSFEAVTCASRKCKVDLSELFSLSVCPKP 239

Query: 977  LTPCAYDYRYSDGSATVGLFANETVTVGLANGRKTKLHNMLIGCSES-FKGQSF-QAADG 804
              PC YD  Y+DGS+  G F  +++TVGL NG++ KL+N+ IGC++S   G +F +   G
Sbjct: 240  SDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGG 299

Query: 803  VLGLGFSKYSFATKAAEKYNGKFSYCLVDHLSPKNVSNYLTFGGHENKTFSPSKTRFTKL 624
            +LGLGF+K SF  KAA KY  KFSYCLVDHLS ++VS+ LT GGH N      + R T+L
Sbjct: 300  ILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIGGHHNAKLL-GEIRRTEL 358

Query: 623  VLGVINPFYAVNVMGISIGNIMLRIPSEVWDISSVGGTILDSGTSLTVLTMPAYQPVMDT 444
            +L    PFY VNV+GISIG  ML+IP +VWD ++ GGT++DSGT+LT L +PAY+ V + 
Sbjct: 359  IL--FPPFYGVNVVGISIGGQMLKIPPQVWDFNAEGGTLIDSGTTLTSLLLPAYEAVFEA 416

Query: 443  -XXXXXXXXXXXXLNIGPLEYCFNSTGFHDSLVPRLKIHFADGARFEPPVKSYVIDTADG 267
                          +   LE+CF++ GF DS+VPRL  HFA GARFEPPVKSY+ID A  
Sbjct: 417  LTKSLTKVKRVTGEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPL 476

Query: 266  VKCLGFVSANW-PGTSVIGNIMQQDHLWEFDLVQNKLSFSPSTCT 135
            VKC+G V  +   G SVIGNIMQQ+HLWEFDL  N + F+PSTCT
Sbjct: 477  VKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 521

