BLASTX nr result

ID: Cocculus22_contig00007227 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00007227
         (2290 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein ...   874   0.0  
ref|XP_006466523.1| PREDICTED: aspartic proteinase-like protein ...   866   0.0  
ref|XP_006426021.1| hypothetical protein CICLE_v10025144mg [Citr...   861   0.0  
ref|XP_007047435.1| Aspartyl protease family protein [Theobroma ...   843   0.0  
ref|XP_002306494.2| hypothetical protein POPTR_0005s18810g [Popu...   838   0.0  
ref|XP_007018558.1| Eukaryotic aspartyl protease family protein ...   827   0.0  
ref|XP_004509090.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   827   0.0  
ref|XP_006579920.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   823   0.0  
ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein ...   821   0.0  
ref|XP_006374851.1| hypothetical protein POPTR_0014s02040g [Popu...   821   0.0  
ref|XP_004289182.1| PREDICTED: aspartic proteinase-like protein ...   819   0.0  
gb|EXB28727.1| Aspartic proteinase-like protein 2 [Morus notabilis]   818   0.0  
ref|XP_007018559.1| Eukaryotic aspartyl protease family protein ...   816   0.0  
ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [S...   816   0.0  
ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor,...   816   0.0  
ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group] g...   812   0.0  
ref|XP_007208324.1| hypothetical protein PRUPE_ppa002682mg [Prun...   811   0.0  
ref|XP_007155866.1| hypothetical protein PHAVU_003G238400g [Phas...   809   0.0  
ref|XP_006472263.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   809   0.0  
ref|XP_006433596.1| hypothetical protein CICLE_v10000565mg [Citr...   807   0.0  

>ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
            gi|302142232|emb|CBI19435.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  874 bits (2259), Expect = 0.0
 Identities = 436/649 (67%), Positives = 506/649 (77%), Gaps = 19/649 (2%)
 Frame = +1

Query: 91   IFSAFLTLSFLQFAEISGDQEHDFLSHLRRGPSMVLPLCHSTLNSS--RMSFQERSFRRH 264
            + SA + LSF+     S  Q  +    +RR   M+ PL  ++  SS  R + +   +RRH
Sbjct: 8    LISAIVILSFVTIYSSSASQIPN--RGVRR--PMIFPLYFASPKSSGHRQAIEGSYWRRH 63

Query: 265  LQKVGNQLPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCG 444
            L+      PNARMRL+DDLL+NGYYTTRLWIGTPPQEFALIVD+GSTVTYVPCS CE CG
Sbjct: 64   LKSDPYHHPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCG 123

Query: 445  KHQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDIVSFGNQS 624
            KHQDPRFQPD SSTY PVKCN+DC CD +   C+YER+YAE+SSSSGVLGEDI+SFGNQS
Sbjct: 124  KHQDPRFQPDESSTYHPVKCNMDCNCDHDGVNCVYERRYAEMSSSSGVLGEDIISFGNQS 183

Query: 625  ALKPQRAVFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXX 804
             + PQRAVFGCENVETGDLYSQ ADGIMGLGRGQLSI+DQLV+K VI+DSFSLCY     
Sbjct: 184  EVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHV 243

Query: 805  XXXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGKHGTVLDS 984
                     I PP  MVFS SDP RSPYYNIELKE+HVAGKPLKL+   FD KHGTVLDS
Sbjct: 244  GGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDS 303

Query: 985  GTTYAYLPEEAFMAFKDAVI-ENLNLKQIHGPDPNYNDICFSGAGSDISELEKIFPAVEM 1161
            GTTYAYLPEEAF+AF+DA+I ++ NLKQIHGPDPNYNDICFSGAG D+S+L K FP V+M
Sbjct: 304  GTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDM 363

Query: 1162 VFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKI 1341
            VF  GQKLSL PENYLF+H+KVHGAYCLGIF+NG D TTLLGGIIVRNTLVTYDR+NEKI
Sbjct: 364  VFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRNG-DSTTLLGGIIVRNTLVTYDRENEKI 422

Query: 1342 GFYKTNCSDLWKRLHIP---------------EATSPTV-LDHSSTTGLPPSMAPSGAPN 1473
            GF+KTNCS+LWKRLHIP                A +P V  ++++T G+PP++APSG P 
Sbjct: 423  GFWKTNCSELWKRLHIPGAPAAAPIVPTPKSVSAPAPVVSYNNNTTVGMPPTVAPSGLPQ 482

Query: 1474 YNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESL 1653
               P EF+VGLITFD+S S+NYS++ P+ TELAEFIAHELE+N SQVH LNF  KGN S+
Sbjct: 483  EVLPGEFQVGLITFDMSFSVNYSNMKPNFTELAEFIAHELEINASQVHFLNFFSKGNHSV 542

Query: 1654 TRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQHF 1833
             RWAIFPA SA YISNSTAMSIIL+L +HR+ LP  FGSYQLVEWK+EP IKRTWW+QHF
Sbjct: 543  IRWAIFPAESATYISNSTAMSIILQLKEHRVHLPERFGSYQLVEWKVEPQIKRTWWEQHF 602

Query: 1834 LAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
              V++GVI++ ILG +  G+W VW+ RQ A   YKP+ A  PEQELQ L
Sbjct: 603  WTVVVGVIITLILGLSTFGVWFVWKWRQNAVGTYKPIGARVPEQELQQL 651


>ref|XP_006466523.1| PREDICTED: aspartic proteinase-like protein 2-like [Citrus sinensis]
          Length = 633

 Score =  866 bits (2237), Expect = 0.0
 Identities = 421/606 (69%), Positives = 488/606 (80%), Gaps = 2/606 (0%)
 Frame = +1

Query: 169  HLRRGPSMVLPLCHSTLNSSRMSFQERSFRRHLQKVG-NQLPNARMRLHDDLLANGYYTT 345
            H R  P+MVLPL  S  N SR     R   RHLQ+   N  PNARMRL+DDLL NGYYTT
Sbjct: 32   HGRTRPAMVLPLYLSQPNISRSISISR---RHLQRSHLNSHPNARMRLYDDLLLNGYYTT 88

Query: 346  RLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECD 525
            RLWIGTPPQ FALIVD+GSTVTYVPC+TCE CG HQDP+F+PDLSSTYQPVKCN+DC CD
Sbjct: 89   RLWIGTPPQTFALIVDTGSTVTYVPCATCEHCGDHQDPKFEPDLSSTYQPVKCNLDCNCD 148

Query: 526  RERSQCIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGI 705
            RER+QC+YER+YAE+SSSSGVLGEDI+SFGN+S LKPQRAVFGCENVETGDLYSQHADGI
Sbjct: 149  RERAQCVYERKYAEMSSSSGVLGEDIISFGNESDLKPQRAVFGCENVETGDLYSQHADGI 208

Query: 706  MGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSP 885
            +GLGRG LS++DQLVEKGVISDSFSLCY              ISPP  MVF+HSDP RSP
Sbjct: 209  IGLGRGDLSVVDQLVEKGVISDSFSLCYGGMDVGGGAMVLGGISPPKDMVFTHSDPVRSP 268

Query: 886  YYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NLK 1062
            YYNI+LK +HVAGKPL LN +VFDGKHGTVLDSGTTYAYLPE AF+AFKDA++  L +LK
Sbjct: 269  YYNIDLKVIHVAGKPLPLNPKVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAIMSELQSLK 328

Query: 1063 QIHGPDPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYC 1242
            QI GPDPNYNDICFSGA SD+S+L   FPAVEM FG GQKL LAPENYLFRHSKV GAYC
Sbjct: 329  QIRGPDPNYNDICFSGAPSDVSQLSDTFPAVEMAFGNGQKLLLAPENYLFRHSKVRGAYC 388

Query: 1243 LGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSPTVLDH 1422
            LGIFQNG+DPTTLLGGIIVRNTLV YDR++ KIGF+KTNCS+LW+RLHI  A SP +   
Sbjct: 389  LGIFQNGRDPTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSELWERLHITGALSP-IPSS 447

Query: 1423 SSTTGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVN 1602
            S        ++PS  PNY  P + ++G ITFD+ LSINYSDL PHI ELA+ IA EL+VN
Sbjct: 448  SEGKNSSTDLSPSEPPNYVLPGDLQIGRITFDMFLSINYSDLRPHIPELADSIAQELDVN 507

Query: 1603 TSQVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLV 1782
            TSQVHLLNF  KGN S   WA+FP+GSA YISN+TA+ II RL +HR+ +P  FG+Y+L+
Sbjct: 508  TSQVHLLNFMSKGNNSFIAWAVFPSGSANYISNATALRIISRLAEHRVHIPDTFGNYKLL 567

Query: 1783 EWKIEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPE 1962
            +W IEP +KRTWW++HFL V+L + +  ++G ++ GI  + RRR+Q+  +YKPVDAA PE
Sbjct: 568  QWNIEPKVKRTWWQEHFLMVVLAITIMMVVGLSVFGILFILRRRRQSVNSYKPVDAALPE 627

Query: 1963 QELQPL 1980
            QELQPL
Sbjct: 628  QELQPL 633


>ref|XP_006426021.1| hypothetical protein CICLE_v10025144mg [Citrus clementina]
            gi|557528011|gb|ESR39261.1| hypothetical protein
            CICLE_v10025144mg [Citrus clementina]
          Length = 633

 Score =  861 bits (2224), Expect = 0.0
 Identities = 419/606 (69%), Positives = 486/606 (80%), Gaps = 2/606 (0%)
 Frame = +1

Query: 169  HLRRGPSMVLPLCHSTLNSSRMSFQERSFRRHLQKVG-NQLPNARMRLHDDLLANGYYTT 345
            H R  P+MVLPL  S  N SR     R   RHLQ+   N  PNARMRL+DDLL NGYYTT
Sbjct: 32   HGRTRPAMVLPLYLSQPNISRSISISR---RHLQRSHPNSHPNARMRLYDDLLLNGYYTT 88

Query: 346  RLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECD 525
            RLWIGTPPQ FALIVD+GSTVTYVPC+TCE CG HQDP+F+PDLSSTYQPVKCN+ C CD
Sbjct: 89   RLWIGTPPQTFALIVDTGSTVTYVPCATCEHCGDHQDPKFEPDLSSTYQPVKCNLYCNCD 148

Query: 526  RERSQCIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGI 705
            RER+QC+YER+YAE+SSSSGVLGEDI+SFGN+S LKPQRAVFGCENVETGDLYSQHADGI
Sbjct: 149  RERAQCVYERKYAEMSSSSGVLGEDIISFGNESDLKPQRAVFGCENVETGDLYSQHADGI 208

Query: 706  MGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSP 885
            +GLGRG LS++DQLVEKGVISDSFSLCY              ISPP  MVF+HSDP RSP
Sbjct: 209  IGLGRGDLSVVDQLVEKGVISDSFSLCYGGMDVGGGAMVLGGISPPKDMVFTHSDPVRSP 268

Query: 886  YYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NLK 1062
            YYNI+LK +HVAGKPL LN +VFDGKHGTVLDSGTTYAYLPE AF+AFKDA++  L +LK
Sbjct: 269  YYNIDLKVIHVAGKPLPLNPKVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAIMSELQSLK 328

Query: 1063 QIHGPDPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYC 1242
            QI GPDPNYNDICFSGA SD+S+L   FPAVEM FG GQKL L+PENYLFRHSKV GAYC
Sbjct: 329  QIRGPDPNYNDICFSGAPSDVSQLSDTFPAVEMAFGNGQKLLLSPENYLFRHSKVRGAYC 388

Query: 1243 LGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSPTVLDH 1422
            LGIFQNG+DPTTLLGGIIVRNTLV YDR++ KIGF+KTNCS+LW+RLHI  A SP +   
Sbjct: 389  LGIFQNGRDPTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSELWERLHITGALSP-IPSS 447

Query: 1423 SSTTGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVN 1602
            S        ++PS  PNY  P + ++G ITFD+ LSINYSDL PHI ELA+ IA EL+VN
Sbjct: 448  SEGKNSSTDLSPSEPPNYVLPGDLQIGRITFDMFLSINYSDLRPHIPELADSIAQELDVN 507

Query: 1603 TSQVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLV 1782
            TSQVHLLNF  KGN S   WA+FP+GSA YISN+TA+ II RL +HR+ +P  FG+Y+L+
Sbjct: 508  TSQVHLLNFMSKGNNSFIAWAVFPSGSANYISNATALRIISRLAEHRVHIPDTFGNYKLL 567

Query: 1783 EWKIEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPE 1962
            +W IEP +KRTWW++HFL V+L + +  ++G ++ GI  + RRR Q+  +YKPVDAA PE
Sbjct: 568  QWNIEPQVKRTWWQEHFLMVVLAITIMMVVGLSVFGILFILRRRHQSVNSYKPVDAALPE 627

Query: 1963 QELQPL 1980
            QELQPL
Sbjct: 628  QELQPL 633


>ref|XP_007047435.1| Aspartyl protease family protein [Theobroma cacao]
            gi|508699696|gb|EOX91592.1| Aspartyl protease family
            protein [Theobroma cacao]
          Length = 647

 Score =  843 bits (2177), Expect = 0.0
 Identities = 417/606 (68%), Positives = 482/606 (79%), Gaps = 7/606 (1%)
 Frame = +1

Query: 184  PSMVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQL--PNARMRLHDDLLANGYYTTRLWI 357
            P+M+LPL     NSSR +F      RHL +  +    PNARMRL+DDLL NGYYTTRLWI
Sbjct: 43   PAMILPLFPFPKNSSR-TFSHSG--RHLLRSDSHSSHPNARMRLYDDLLLNGYYTTRLWI 99

Query: 358  GTPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERS 537
            GTPPQ FALIVD+GSTVTYVPC+TCEQCG+HQDP+FQPDLSSTYQPVKCN+DC CD +R 
Sbjct: 100  GTPPQRFALIVDTGSTVTYVPCATCEQCGRHQDPKFQPDLSSTYQPVKCNLDCSCDTDRV 159

Query: 538  QCIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLG 717
            QC YERQYAE+SSSSGVLGEDI+SFGNQS L PQRAVFGCEN ETGDLYSQHADGIMGLG
Sbjct: 160  QCTYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAVFGCENEETGDLYSQHADGIMGLG 219

Query: 718  RGQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNI 897
            RG LS++DQLVEKGVISDSFSLCY              IS P  MVFS+SDP RSPYYNI
Sbjct: 220  RGDLSVVDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGISSPPDMVFSYSDPERSPYYNI 279

Query: 898  ELKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NLKQIHG 1074
            +LK +HVAGK L LN  VFD K+GTVLDSGTTYAYLPE AF AFK+A+I+ L +LKQI G
Sbjct: 280  DLKAIHVAGKQLPLNPNVFDVKYGTVLDSGTTYAYLPEAAFAAFKNAIIKELTSLKQIRG 339

Query: 1075 PDPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIF 1254
            PDPNYNDICFSGA SD+SEL KIFP VEMVF   QKL LAPENYLFRHSKV G YCLGIF
Sbjct: 340  PDPNYNDICFSGASSDVSELSKIFPTVEMVFDNQQKLLLAPENYLFRHSKVRGGYCLGIF 399

Query: 1255 QNGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSPTVLDHS--- 1425
             N KDPTTLLGGIIVRNTLVTYDR++ KIGF+KTNCS+LW+RL I  A SP+    S   
Sbjct: 400  PNEKDPTTLLGGIIVRNTLVTYDREHLKIGFWKTNCSELWERLRINGAPSPSPSSSSGKD 459

Query: 1426 -STTGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVN 1602
             ST   PP+ AP G+ +Y  P E ++G IT D+SLSI+YS L PHI ELAEFIA EL+VN
Sbjct: 460  NSTVESPPTSAPDGSSHYAIPGEIQIGEITLDMSLSIDYSYLKPHINELAEFIAKELDVN 519

Query: 1603 TSQVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLV 1782
             SQVHLL+FT +GN SL  WAI P+GSA YISN  A+SII +L +HR++LP  FG+YQLV
Sbjct: 520  ASQVHLLDFTSEGNSSLVTWAIVPSGSATYISNVAAISIISQLAEHRVRLPDTFGNYQLV 579

Query: 1783 EWKIEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPE 1962
            +WK+EP++++TWW+QH+L V+L ++++ I+G +  G W++WRRRQQA   YKPVD A  E
Sbjct: 580  QWKVEPSVQQTWWQQHYLVVLLAIMITIIVGLSASGGWIIWRRRQQALKLYKPVDGAVSE 639

Query: 1963 QELQPL 1980
            QELQPL
Sbjct: 640  QELQPL 645


>ref|XP_002306494.2| hypothetical protein POPTR_0005s18810g [Populus trichocarpa]
            gi|550339265|gb|EEE93490.2| hypothetical protein
            POPTR_0005s18810g [Populus trichocarpa]
          Length = 641

 Score =  838 bits (2164), Expect = 0.0
 Identities = 409/603 (67%), Positives = 478/603 (79%), Gaps = 5/603 (0%)
 Frame = +1

Query: 187  SMVLPLCHSTLNSSRMSFQERSFRRHLQKVG-NQLPNARMRLHDDLLANGYYTTRLWIGT 363
            +M+LPL  S  N      +  S RR LQ+   N LPNA MRLHDDLL NGYYTTRLWIGT
Sbjct: 42   AMILPLFLSPPNPCT---KFSSTRRLLQRSNANALPNAHMRLHDDLLINGYYTTRLWIGT 98

Query: 364  PPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQC 543
            PPQ FALIVD+GS+VTYVPCS+CEQCG+HQDP+FQPDLSSTYQ VKCNIDC CD E+ QC
Sbjct: 99   PPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNIDCNCDDEKQQC 158

Query: 544  IYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRG 723
            +YERQYAE+S+SSGVLGEDI+SFGN SAL PQRAVFGCEN+ETGDLYSQHADGIMG+GRG
Sbjct: 159  VYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENMETGDLYSQHADGIMGMGRG 218

Query: 724  QLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIEL 903
             LSI+D LV+KGVI+DSFSLCY              ISPPS MVFS SDP RSPYYNI+L
Sbjct: 219  DLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPPSNMVFSQSDPVRSPYYNIDL 278

Query: 904  KELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NLKQIHGPD 1080
            KE+HVAGKPL LN  VFDGKHGT+LDSGTTYAYLPE AF++FKDA+++ L +LK I GPD
Sbjct: 279  KEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPD 338

Query: 1081 PNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQN 1260
            PNYNDICFSGAGSDIS+L   FPAVEMVFG GQKL L+PENYLFRHSKVHGAYCLGIFQN
Sbjct: 339  PNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQN 398

Query: 1261 GKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSPTVLD---HSST 1431
            GKDPTTLLGGI+VRNTLV YDR+N KIGF+KTNCS+LW+RL++  A  P       ++S 
Sbjct: 399  GKDPTTLLGGIVVRNTLVLYDRENSKIGFWKTNCSELWERLNVDGAPPPAPSSSNGNNSN 458

Query: 1432 TGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQ 1611
            T +PPS+APS   +Y  PDE ++G ITF++ L++NYSDL  HI+ELAE IA EL +N+SQ
Sbjct: 459  TEMPPSVAPSDQKHYGLPDEKKIGQITFEMMLNVNYSDLKLHISELAESIAQELGINSSQ 518

Query: 1612 VHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWK 1791
            V++LN   KGN S   WA+ P+GSA  ISN TA+SII R+ ++ L LP  FGSY L+ W+
Sbjct: 519  VYILNSMEKGNASYIEWAVVPSGSADCISNVTALSIIARVAEYHLHLPDTFGSYHLINWE 578

Query: 1792 IEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQEL 1971
            I+ + KRTWW+QHFL V+L   ++FI G    GIW +WR RQ+A   YKPVDA   EQEL
Sbjct: 579  IKASAKRTWWQQHFLLVVLASAVTFIFGLLALGIWFIWRHRQRALNPYKPVDAVVTEQEL 638

Query: 1972 QPL 1980
            QPL
Sbjct: 639  QPL 641


>ref|XP_007018558.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma
            cacao] gi|508723886|gb|EOY15783.1| Eukaryotic aspartyl
            protease family protein isoform 1 [Theobroma cacao]
          Length = 694

 Score =  827 bits (2137), Expect = 0.0
 Identities = 415/639 (64%), Positives = 490/639 (76%), Gaps = 6/639 (0%)
 Frame = +1

Query: 82   LFRIFSAFLTLSFLQFAEISGDQEHDFLSHLRRGPSMVLPLCHSTLNSSRMSFQERSFRR 261
            L R  S  +  S L F   + D +H    H RR   MVLPL  S+ N S     + + RR
Sbjct: 62   LHRTSSLLIVCSVLWFHLATVDAKH----HHRR--PMVLPLHLSSRNHSLHRHVD-NLRR 114

Query: 262  HLQK--VGNQLPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCE 435
            HLQ+      +PNARMRL+DDLL+NGYYTTRLWIGTPPQEFALIVD+GSTVTYVPCS+C 
Sbjct: 115  HLQQSEFSPSIPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSSCA 174

Query: 436  QCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDIVSFG 615
            QCGKHQDPRFQPDLSSTYQPVKCN  C CD E+ QC Y+R+YAE+SSSSGVLGED+VSFG
Sbjct: 175  QCGKHQDPRFQPDLSSTYQPVKCNPSCNCDDEQKQCTYDRRYAEMSSSSGVLGEDVVSFG 234

Query: 616  NQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXX 795
            N+S L PQRAVFGCEN+ETGDLYSQ ADGIMGLGRG+LSI+DQLV+K VI DSFSLCY  
Sbjct: 235  NESELVPQRAVFGCENMETGDLYSQRADGIMGLGRGRLSIVDQLVDKSVIGDSFSLCYGG 294

Query: 796  XXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGKHGTV 975
                        I+PP  MVFSHSDP RSPYYNIELKE+HVAGKPLKL+  +FDG+HGTV
Sbjct: 295  MDVGGGAMVLGNITPPPDMVFSHSDPFRSPYYNIELKEMHVAGKPLKLHPGIFDGRHGTV 354

Query: 976  LDSGTTYAYLPEEAFMAFKDAVIENLN-LKQIHGPDPNYNDICFSGAGSDISELEKIFPA 1152
            LDSGTTYAYLP+ AF+AF+DA+I  ++ LK++HGPDPNY+DICFS AG D S+L KIFP 
Sbjct: 355  LDSGTTYAYLPKAAFVAFRDAIIREVHFLKRVHGPDPNYDDICFSSAGRDFSQLAKIFPE 414

Query: 1153 VEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKN 1332
            VEMVF  G+KL L+PENYLF+H+KV GAYCLGIFQN  + TTLLGGI+VRNTLVTYDR N
Sbjct: 415  VEMVFNNGKKLLLSPENYLFQHTKVSGAYCLGIFQNA-EATTLLGGIVVRNTLVTYDRGN 473

Query: 1333 EKIGFYKTNCSDLWKRLHIPEATSPTVL---DHSSTTGLPPSMAPSGAPNYNFPDEFEVG 1503
            ++IGF+KTNCS+LW+R+  P A +P  L      +   +PP++APSG P    P  F +G
Sbjct: 474  DRIGFWKTNCSELWRRVQFPGAPAPAPLVSQSKDTNMEIPPALAPSGLPPNVLPGSFRIG 533

Query: 1504 LITFDISLSINYSDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESLTRWAIFPAGS 1683
             ITFD+S+S N S+L P+  ELA+ I+ ELEV+ SQVHLLN T KGN+ L RW IFPA S
Sbjct: 534  FITFDMSISANDSNLKPNFKELADLISQELEVDKSQVHLLNVTSKGNDFLVRWGIFPAAS 593

Query: 1684 AKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQHFLAVILGVIMS 1863
            A YISN+TA+SIILRL  HR+Q P  FG+Y+LVEW  EP  K TWW+ HFLA+ LG + +
Sbjct: 594  ANYISNTTALSIILRLRDHRMQFPERFGNYKLVEWNAEPQRKMTWWQHHFLALALGFVTT 653

Query: 1864 FILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
             ILG +  GIWLV RRRQQA +AY+PV A  PEQELQPL
Sbjct: 654  LILGLSAIGIWLVHRRRQQAISAYEPVGAPTPEQELQPL 692


>ref|XP_004509090.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cicer
            arietinum]
          Length = 634

 Score =  827 bits (2136), Expect = 0.0
 Identities = 410/627 (65%), Positives = 483/627 (77%), Gaps = 2/627 (0%)
 Frame = +1

Query: 106  LTLSFLQFAEISGDQEHDFLSHLRRGPSMVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQ 285
            L ++ L  + I+ D       H    P+MVLPL  +  NSS  +   R  R+       +
Sbjct: 12   LHIAILVPSSIAADTSFLRNRHHGSRPTMVLPLYLTAPNSSTSALDPR--RQLHGSESKR 69

Query: 286  LPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRF 465
             PNARMRLHDDLL NGYYTTRLWIGTPPQ FALIVD+GSTVTYVPCSTCEQCG+HQDP+F
Sbjct: 70   HPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKF 129

Query: 466  QPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRA 645
            QPDLSSTYQPVKC +DC CD +R QC+YERQYAE+S+SSGVLGED++SFGNQS L PQRA
Sbjct: 130  QPDLSSTYQPVKCTLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRA 189

Query: 646  VFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXX 825
            VFGCENVETGDLYSQHADGIMGLGRG LSIMDQLV+K V+SDSFSLCY            
Sbjct: 190  VFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVL 249

Query: 826  XXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYL 1005
              ISPPS MVF+HSDP RSPYYNI+LKE+HVAGK L LN  VFDGKHGTVLDSGTTYAYL
Sbjct: 250  GGISPPSDMVFAHSDPVRSPYYNIDLKEIHVAGKRLLLNPSVFDGKHGTVLDSGTTYAYL 309

Query: 1006 PEEAFMAFKDAVIENL-NLKQIHGPDPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQK 1182
            PEEAF+AFK+A+++ L +  QI GPDPNYND+CFSGAG D+S+L K FP V+MVFG G K
Sbjct: 310  PEEAFLAFKEAIVKELQSFNQISGPDPNYNDLCFSGAGIDVSQLSKSFPVVDMVFGNGHK 369

Query: 1183 LSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNC 1362
             SL+PENY+FRHSKV GAYCLGIFQNGKDPTTLLGGIIVRNTLV YDR+  KIGF+KTNC
Sbjct: 370  YSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 429

Query: 1363 SDLWKRLHIPEATSPTVLDHSSTTGLPPSMAPSGAPNYNF-PDEFEVGLITFDISLSINY 1539
            ++LW+RL I  A  P     +ST  L PS+APS  P +N  P EF +  IT  IS +I+Y
Sbjct: 430  AELWERLQISIA-PPNTEVKNSTKSLEPSVAPS-MPQHNVPPGEFRIAQITIVISFNISY 487

Query: 1540 SDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSI 1719
             D+ PH+TELA  IAHEL++NTSQVHLLNFT  GN SL RWAI     A   SN+TAMSI
Sbjct: 488  EDMKPHLTELAGLIAHELDINTSQVHLLNFTSAGNVSLARWAITSRPYADNFSNATAMSI 547

Query: 1720 ILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWL 1899
            I RL +H +QLP  F SY+LVEW +EP  K  WW+++++ V L  +++ +LG +I G++L
Sbjct: 548  IDRLAEHHMQLPDTFKSYKLVEWNLEPPSKSNWWQRYYMVVGLAFVLTLLLGLSIVGMFL 607

Query: 1900 VWRRRQQAFAAYKPVDAAFPEQELQPL 1980
            +W++RQQ+  +YKPVDAA PEQELQPL
Sbjct: 608  IWKKRQQSVHSYKPVDAAVPEQELQPL 634


>ref|XP_006579920.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Glycine
            max]
          Length = 635

 Score =  823 bits (2127), Expect = 0.0
 Identities = 409/630 (64%), Positives = 484/630 (76%), Gaps = 5/630 (0%)
 Frame = +1

Query: 106  LTLSFLQFAEISGDQEHDFLSHLRRGPSMVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQ 285
            L+L     A ++GD       H    PSM+LPL  S  NSS  +   R  R+       +
Sbjct: 9    LSLILFLIATVAGDTALLRNRHHGSRPSMLLPLYLSAPNSSTSALDPR--RQLTGSESKR 66

Query: 286  LPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRF 465
             PNARMRLHDDLL NGYYTTRLWIGTPPQ FALIVD+GSTVTYVPCSTCEQCG+HQDP+F
Sbjct: 67   HPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKF 126

Query: 466  QPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRA 645
            QP+ SSTYQPVKC IDC CD +R QC+YERQYAE+S+SSGVLGED++SFGNQS L PQRA
Sbjct: 127  QPESSSTYQPVKCTIDCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRA 186

Query: 646  VFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXX 825
            VFGCENVETGDLYSQHADGIMGLGRG LSIMDQLV+K VISDSFSLCY            
Sbjct: 187  VFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVL 246

Query: 826  XXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYL 1005
              ISPPS M F++SDP RSPYYNI+LKE+HVAGK L LN+ VFDGKHGTVLDSGTTYAYL
Sbjct: 247  GGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 306

Query: 1006 PEEAFMAFKDAVIENL-NLKQIHGPDPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQK 1182
            PE AF+AFKDA+++ L +LKQI GPDPNYNDICFSGAG+D+S+L K FP V+MVFG G K
Sbjct: 307  PEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHK 366

Query: 1183 LSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNC 1362
             SL+PENY+FRHSKV GAYCLGIFQNG D TTLLGGIIVRNTLV YDR+  KIGF+KTNC
Sbjct: 367  YSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 426

Query: 1363 SDLWKRLH---IPEATSPTVLDHSSTTGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSI 1533
            ++LW+RL     P    P     +S+  L PS+APS + +   P E ++  IT  IS +I
Sbjct: 427  AELWERLQTSIAPPPLPPNSGVRNSSEALEPSVAPSVSQHNASPGELKIAQITMVISFNI 486

Query: 1534 NYSDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAM 1713
            +Y D+ PHITELA   AH L+ NTSQVHLLNFT  GN+SL++WAI P   A YISN+TAM
Sbjct: 487  SYVDMKPHITELAGLFAHGLDTNTSQVHLLNFTSTGNDSLSKWAITPKPYAHYISNTTAM 546

Query: 1714 SIILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQH-FLAVILGVIMSFILGFTICG 1890
            +II RL +HR+QLP  FG+Y+L++W +EP  K  WW+QH FL V L ++++ +LG +I G
Sbjct: 547  NIIDRLAEHRIQLPSTFGNYKLIDWSVEPPSK-NWWQQHFFLVVSLAILITLLLGLSILG 605

Query: 1891 IWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
             +L+W++RQQ+  +YKPVDAA PEQELQPL
Sbjct: 606  TFLIWKKRQQSSHSYKPVDAAVPEQELQPL 635


>ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  821 bits (2121), Expect = 0.0
 Identities = 404/635 (63%), Positives = 484/635 (76%), Gaps = 4/635 (0%)
 Frame = +1

Query: 88   RIFSAFLTLSFLQFAEISGDQEHDFLSHLRRGPSMVLPLCHSTLNSSRMSFQERSFRRHL 267
            R  +  L+L  +    ++GD       H    P+M+LPL  S  NSS  +   R  R+  
Sbjct: 3    RALTHHLSLILILIVAVAGDANLLRNRHHGSRPAMLLPLYLSAPNSSTSALDPR--RQLT 60

Query: 268  QKVGNQLPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGK 447
                 + PNARMRLHDDLL NGYYTTRLWIGTPPQ FALIVD+GSTVTYVPCSTCEQCG+
Sbjct: 61   GSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGR 120

Query: 448  HQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDIVSFGNQSA 627
            HQDP+FQP+ SSTYQPVKC IDC CD +R QC+YERQYAE+S+SSGVLGED++SFGNQS 
Sbjct: 121  HQDPKFQPESSSTYQPVKCTIDCNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSE 180

Query: 628  LKPQRAVFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXX 807
            L PQRAVFGCENVETGDLYSQHADGIMGLGRG LSIMDQLV+K VISDSFSLCY      
Sbjct: 181  LAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVG 240

Query: 808  XXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSG 987
                    ISPPS M F++SDP RSPYYNI+LKE+HVAGK L LN+ VFDGKHGTVLDSG
Sbjct: 241  GGAMVLGGISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSG 300

Query: 988  TTYAYLPEEAFMAFKDAVIENL-NLKQIHGPDPNYNDICFSGAGSDISELEKIFPAVEMV 1164
            TTYAYLPE AF+AFKDA+++ L +LK+I GPDPNYNDICFSGAG D+S+L K FP V+MV
Sbjct: 301  TTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMV 360

Query: 1165 FGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIG 1344
            F  GQK +L+PENY+FRHSKV GAYCLG+FQNG D TTLLGGIIVRNTLV YDR+  KIG
Sbjct: 361  FENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIG 420

Query: 1345 FYKTNCSDLWKRLHI---PEATSPTVLDHSSTTGLPPSMAPSGAPNYNFPDEFEVGLITF 1515
            F+KTNC++LW+RL I   P    P     +S+  L PS+APS + +   P E ++  IT 
Sbjct: 421  FWKTNCAELWERLQISVAPPPLPPNSGVRNSSEALEPSVAPSVSQHNARPGELKIVQITM 480

Query: 1516 DISLSINYSDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESLTRWAIFPAGSAKYI 1695
             IS +I+Y D+ PHI ELA   AH L VNTSQVHLLNFT  GN+SL++WAI P   + YI
Sbjct: 481  VISFNISYVDMKPHIKELAGLFAHGLNVNTSQVHLLNFTSTGNDSLSKWAITPKPDSHYI 540

Query: 1696 SNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQHFLAVILGVIMSFILG 1875
            SN+TAM+II RL +HR+QLPG FG+Y+L++W +EP  K  WW+QHFL V L ++++ +LG
Sbjct: 541  SNTTAMNIIARLAEHRIQLPGTFGNYKLIDWSVEPPSK-NWWQQHFLVVSLAILITLLLG 599

Query: 1876 FTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
             +I G +L+W++RQQ+  +YKPVD   PEQELQPL
Sbjct: 600  LSILGTFLIWKKRQQSSHSYKPVDVVVPEQELQPL 634


>ref|XP_006374851.1| hypothetical protein POPTR_0014s02040g [Populus trichocarpa]
            gi|550323157|gb|ERP52648.1| hypothetical protein
            POPTR_0014s02040g [Populus trichocarpa]
          Length = 637

 Score =  821 bits (2120), Expect = 0.0
 Identities = 407/602 (67%), Positives = 475/602 (78%), Gaps = 6/602 (0%)
 Frame = +1

Query: 193  VLPLCHSTLNSS--RMSFQERSFRRHLQKVGNQLPNARMRLHDDLLANGYYTTRLWIGTP 366
            +LPL  S  N S  RM F     RRHLQ   ++LPNARMRL DDLL+NGYYTTRL+IGTP
Sbjct: 40   ILPLLLSIPNISAHRMPFDGHYSRRHLQN--SELPNARMRLFDDLLSNGYYTTRLFIGTP 97

Query: 367  PQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQCI 546
            PQEFALIVD+GSTVTYVPCS+CEQCGKHQDPRFQPDLSSTY+PVKCN  C CD E  QC 
Sbjct: 98   PQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCNPSCNCDDEGKQCT 157

Query: 547  YERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRGQ 726
            YER+YAE+SSSSGV+ ED+VSFGN+S LKPQRAVFGCENVETGDLYSQ ADGIMGLGRG+
Sbjct: 158  YERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVETGDLYSQRADGIMGLGRGR 217

Query: 727  LSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIELK 906
            LS++DQLV+KGVI DSFSLCY              ISPP  MVFSHS+P RSPYYNIELK
Sbjct: 218  LSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPYYNIELK 277

Query: 907  ELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NLKQIHGPDP 1083
            ELHVAGKPLKL  +VFD KHGTVLDSGTTYAY PE AF A KDA+++ + +LKQI GPDP
Sbjct: 278  ELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDP 337

Query: 1084 NYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNG 1263
            NY+DICFSGAG ++S L K+FP V MVFG+GQKLSL+PENYLFRH+KV GAYCLGIFQNG
Sbjct: 338  NYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNG 397

Query: 1264 KDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEA-TSPTVLDHSSTTG- 1437
             D TTLLGGI+VRNTLVTYDR+N+KIGF+KTNCS+LWK L +P    S  VL  SS    
Sbjct: 398  NDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCSELWKSLQVPGVPASAPVLSPSSNRSQ 457

Query: 1438 -LPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQV 1614
             +PP+ APS  P ++ P E  +G+I+FD+ +S N S+  P+ TE+AEFIAHELEV+  QV
Sbjct: 458  EMPPAQAPSSMPFFH-PGEIRIGIISFDMLISANNSNTKPNFTEVAEFIAHELEVDNLQV 516

Query: 1615 HLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKI 1794
            H+LNFT  GN  L +WAI PA SA YISN+TAM II +L++HRL  P  FGSY+LV+WK 
Sbjct: 517  HMLNFTSTGNNYLVKWAILPAESADYISNTTAMKIIQQLSEHRLHFPERFGSYELVKWKF 576

Query: 1795 EPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQ 1974
            EP   RTWW+QHF+AV +GV+++ ++     G+WLVW RRQ+A   Y PV A  PEQELQ
Sbjct: 577  EPQKNRTWWQQHFVAVTVGVVVTLVVSLLSIGLWLVW-RRQKALGTYVPVGAVGPEQELQ 635

Query: 1975 PL 1980
            PL
Sbjct: 636  PL 637


>ref|XP_004289182.1| PREDICTED: aspartic proteinase-like protein 2-like [Fragaria vesca
            subsp. vesca]
          Length = 649

 Score =  819 bits (2115), Expect = 0.0
 Identities = 407/645 (63%), Positives = 494/645 (76%), Gaps = 9/645 (1%)
 Frame = +1

Query: 73   PKSLFRIFSAFLTLSFLQFAEISGDQEHD-FLSHLRRGPSMVLPLCHSTLNSSRMSFQER 249
            P  L  +F+ F+T   + F  +S +      L H    P+MVLPL HST +SS  S    
Sbjct: 7    PAHLTVLFTFFVTF-IIDFIHVSANPSIPALLLHSLPVPAMVLPLYHSTPDSSSSSSPTT 65

Query: 250  -SFRRHLQ-KVGNQLPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPC 423
             + RRHLQ    +Q PNARM L+DDLL NGYYTTRLWIGTPPQ FALIVD+GSTVTYVPC
Sbjct: 66   FNSRRHLQGSETSQRPNARMSLYDDLLRNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPC 125

Query: 424  STCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDI 603
            ++C+QCG+HQDP+F P+ SSTYQ VKCNIDC CD ER  CIYERQYAE+SSSSGVLGED+
Sbjct: 126  ASCQQCGRHQDPKFDPESSSTYQAVKCNIDCSCDSERVNCIYERQYAEMSSSSGVLGEDV 185

Query: 604  VSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSL 783
            +SFGN+S LKPQRAVFGCENVETGDL+SQ ADGIMGLGRG LS++DQLVEKGVISDSFSL
Sbjct: 186  ISFGNRSDLKPQRAVFGCENVETGDLFSQPADGIMGLGRGDLSVVDQLVEKGVISDSFSL 245

Query: 784  CYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGK 963
            CY              IS P  MVF+ S+P RSPYYNI+LKE+HVAGK L+LN  +FDGK
Sbjct: 246  CYGGMDIGGGAMVLGGISTPEEMVFTQSNPGRSPYYNIDLKEIHVAGKQLQLNPSIFDGK 305

Query: 964  HGTVLDSGTTYAYLPEEAFMAFKDAVIENLN-LKQIHGPDPNYNDICFSGAGSDISELEK 1140
            HGTVLDSGTTYAYLPEEAF+AFK+A+++ LN LKQI GPDPNYNDICF+  GSD+S+L  
Sbjct: 306  HGTVLDSGTTYAYLPEEAFLAFKEAIMKELNSLKQISGPDPNYNDICFATDGSDVSKLSS 365

Query: 1141 IFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTY 1320
             FP V+M+FG G+K  L+PENYLFRHSKV GAYCLGIFQNGKDPTTLLGGI+VRNTLV Y
Sbjct: 366  TFPQVDMIFGNGKKFGLSPENYLFRHSKVRGAYCLGIFQNGKDPTTLLGGILVRNTLVMY 425

Query: 1321 DRKNEKIGFYKTNCSDLWKRLHI---PEATSPTVLDHSSTTGLPPSMAPSGAPNYNFPDE 1491
            DR+  KIGF+KTNCS+LW+RLH+   P    PT  +++ T    P +APSG+P Y    +
Sbjct: 426  DREQSKIGFWKTNCSELWERLHVSPSPLPMPPTSEENNPTAEARPPVAPSGSP-YVLSGD 484

Query: 1492 FEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESLTRWAIF 1671
              +G ITFD+SL+I+YSDL PHITEL EFIA ELEV+TSQV+++NF+  GN S+ +W I 
Sbjct: 485  LHIGHITFDMSLNISYSDLEPHITELGEFIAQELEVSTSQVNMVNFSPSGNHSIIKWDIT 544

Query: 1672 PAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQH--FLAVI 1845
            PA S +Y+SN+TA++II +L +HRLQ P  FGSYQLV WK+EP  KRTWWK+    + V+
Sbjct: 545  PADSTEYLSNTTAVAIISKLAEHRLQFPVMFGSYQLVGWKVEPKAKRTWWKKSHVVVVVV 604

Query: 1846 LGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
            + +++  ++G  + G+WL+WR RQ+    YKPVDA   E ELQPL
Sbjct: 605  VSILVVLVVGLLVSGLWLLWRHRQRIVHPYKPVDATVTEAELQPL 649


>gb|EXB28727.1| Aspartic proteinase-like protein 2 [Morus notabilis]
          Length = 647

 Score =  818 bits (2112), Expect = 0.0
 Identities = 397/610 (65%), Positives = 480/610 (78%), Gaps = 6/610 (0%)
 Frame = +1

Query: 169  HLRRGPSMVLPLCHSTLNSSRMSFQERS-FRRHLQKVGNQL-PNARMRLHDDLLANGYYT 342
            H +  P+ +LPL      +S  S    S  RRHLQ+   ++ P+ARM LHDDLL NGYYT
Sbjct: 38   HAQARPAFLLPLHRDFYKTSSSSSSSSSDLRRHLQRSPAEVRPSARMSLHDDLLLNGYYT 97

Query: 343  TRLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCEC 522
            TRLWIGTPPQ FALIVD+GS+VTYVPCSTC+QCGKHQDPRF P+LSSTYQPVKCNIDC C
Sbjct: 98   TRLWIGTPPQRFALIVDTGSSVTYVPCSTCDQCGKHQDPRFDPELSSTYQPVKCNIDCTC 157

Query: 523  DRERSQCIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADG 702
            D ++ QCIYERQYAE+S+SSGVL +D++SFGNQS L PQRA+FGCEN+ETGDLYSQHADG
Sbjct: 158  DNDQVQCIYERQYAEMSTSSGVLSDDLLSFGNQSELAPQRAIFGCENMETGDLYSQHADG 217

Query: 703  IMGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRS 882
            IMGLGRG LS++DQLV+KGVISDSFSLCY              ISPPSGMVF++SD  RS
Sbjct: 218  IMGLGRGDLSVVDQLVDKGVISDSFSLCYGGMDIGGGAMVLGGISPPSGMVFANSDAVRS 277

Query: 883  PYYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NL 1059
            PYYN++LKE+HVAGK L LN  VFDGKHGTVLDSGTTYAYLPE AF+AFK A+++ L +L
Sbjct: 278  PYYNVDLKEIHVAGKKLALNPMVFDGKHGTVLDSGTTYAYLPETAFLAFKTAIMKELQSL 337

Query: 1060 KQIHGPDPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAY 1239
            +QI GPDPNYNDICFSGA SD+S+L K FP V+MVFG GQKLSL+PENYLF+HSKV GAY
Sbjct: 338  RQIRGPDPNYNDICFSGAESDVSQLSKSFPMVDMVFGNGQKLSLSPENYLFQHSKVRGAY 397

Query: 1240 CLGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHI---PEATSPT 1410
            CLGIFQNG+DPTTLLGGIIVRNTLV YDR++ KIGF++TNCS+LW+RL I   P   SP 
Sbjct: 398  CLGIFQNGRDPTTLLGGIIVRNTLVMYDREHSKIGFWETNCSELWERLRISDSPPPMSPA 457

Query: 1411 VLDHSSTTGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHE 1590
              +++ T  +PP++AP+ AP+Y  P++ ++G I F +S +I+++DL PHITELA +IA E
Sbjct: 458  FDENNLTREVPPTIAPNEAPDYVLPEDRQIGQIRFQLSFNISHNDLKPHITELAGYIAPE 517

Query: 1591 LEVNTSQVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGS 1770
            L VN SQV LLNFT KGN SL  W+I PA SA  IS++TA SII RL +H +QLPG F +
Sbjct: 518  LHVNVSQVRLLNFTSKGNHSLIAWSIVPAVSAGSISDTTATSIIARLAEHGMQLPGTFSN 577

Query: 1771 YQLVEWKIEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDA 1950
            Y+LV W + P  KR WW Q +L  IL   ++ I+  +  G+W +WRRR++A  AYKPV+A
Sbjct: 578  YELVGWDVVPKEKRKWWHQGYLVAILASFVTLIISLSAFGMWFIWRRRREALNAYKPVNA 637

Query: 1951 AFPEQELQPL 1980
              PEQELQPL
Sbjct: 638  VVPEQELQPL 647


>ref|XP_007018559.1| Eukaryotic aspartyl protease family protein isoform 2 [Theobroma
            cacao] gi|508723887|gb|EOY15784.1| Eukaryotic aspartyl
            protease family protein isoform 2 [Theobroma cacao]
          Length = 635

 Score =  816 bits (2109), Expect = 0.0
 Identities = 413/639 (64%), Positives = 487/639 (76%), Gaps = 6/639 (0%)
 Frame = +1

Query: 82   LFRIFSAFLTLSFLQFAEISGDQEHDFLSHLRRGPSMVLPLCHSTLNSSRMSFQERSFRR 261
            L R  S  +  S L F   + D +H    H RR   MVLPL  S+ N S     + + RR
Sbjct: 6    LHRTSSLLIVCSVLWFHLATVDAKH----HHRR--PMVLPLHLSSRNHSLHRHVD-NLRR 58

Query: 262  HLQK--VGNQLPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCE 435
            HLQ+      +PNARMRL+DDLL+NGYYTTRLWIGTPPQEFALIVD+GSTVTYVPCS+C 
Sbjct: 59   HLQQSEFSPSIPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSSCA 118

Query: 436  QCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDIVSFG 615
            QCGKHQDPRFQPDLSSTYQPVKCN  C CD E+ QC Y+R+YAE+SSSSGVLGED+VSFG
Sbjct: 119  QCGKHQDPRFQPDLSSTYQPVKCNPSCNCDDEQKQCTYDRRYAEMSSSSGVLGEDVVSFG 178

Query: 616  NQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXX 795
            N+S L PQRAVFGCEN+ETGDLYSQ ADGIMGLGRG+LSI+DQLV+K VI DSFSLCY  
Sbjct: 179  NESELVPQRAVFGCENMETGDLYSQRADGIMGLGRGRLSIVDQLVDKSVIGDSFSLCYGG 238

Query: 796  XXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGKHGTV 975
                        I+PP  MVFSHSDP RSPYYNIELKE+HVAGKPLKL+  +FDG+HGTV
Sbjct: 239  MDVGGGAMVLGNITPPPDMVFSHSDPFRSPYYNIELKEMHVAGKPLKLHPGIFDGRHGTV 298

Query: 976  LDSGTTYAYLPEEAFMAFKDAVIENLN-LKQIHGPDPNYNDICFSGAGSDISELEKIFPA 1152
            LDSGTTYAYLP+ AF+AF+DA+I  ++ LK++HGPDPNY+DICFS AG D S+L KIFP 
Sbjct: 299  LDSGTTYAYLPKAAFVAFRDAIIREVHFLKRVHGPDPNYDDICFSSAGRDFSQLAKIFPE 358

Query: 1153 VEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKN 1332
            VEMVF  G+KL L+PENYLF   +V GAYCLGIFQN  + TTLLGGI+VRNTLVTYDR N
Sbjct: 359  VEMVFNNGKKLLLSPENYLF---QVSGAYCLGIFQNA-EATTLLGGIVVRNTLVTYDRGN 414

Query: 1333 EKIGFYKTNCSDLWKRLHIPEATSPTVL---DHSSTTGLPPSMAPSGAPNYNFPDEFEVG 1503
            ++IGF+KTNCS+LW+R+  P A +P  L      +   +PP++APSG P    P  F +G
Sbjct: 415  DRIGFWKTNCSELWRRVQFPGAPAPAPLVSQSKDTNMEIPPALAPSGLPPNVLPGSFRIG 474

Query: 1504 LITFDISLSINYSDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESLTRWAIFPAGS 1683
             ITFD+S+S N S+L P+  ELA+ I+ ELEV+ SQVHLLN T KGN+ L RW IFPA S
Sbjct: 475  FITFDMSISANDSNLKPNFKELADLISQELEVDKSQVHLLNVTSKGNDFLVRWGIFPAAS 534

Query: 1684 AKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQHFLAVILGVIMS 1863
            A YISN+TA+SIILRL  HR+Q P  FG+Y+LVEW  EP  K TWW+ HFLA+ LG + +
Sbjct: 535  ANYISNTTALSIILRLRDHRMQFPERFGNYKLVEWNAEPQRKMTWWQHHFLALALGFVTT 594

Query: 1864 FILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
             ILG +  GIWLV RRRQQA +AY+PV A  PEQELQPL
Sbjct: 595  LILGLSAIGIWLVHRRRQQAISAYEPVGAPTPEQELQPL 633


>ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
            gi|241926493|gb|EER99637.1| hypothetical protein
            SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  816 bits (2109), Expect = 0.0
 Identities = 399/602 (66%), Positives = 481/602 (79%), Gaps = 2/602 (0%)
 Frame = +1

Query: 181  GPSMVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQLPNARMRLHDDLLANGYYTTRLWIG 360
            GP + LPL  S  N+SR++    S RR L    +  PNARMRLHDDLL NGYYTTRL+IG
Sbjct: 42   GPPLFLPLTRSYPNASRLA---ASSRRGLGDGAH--PNARMRLHDDLLTNGYYTTRLYIG 96

Query: 361  TPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQ 540
            TPPQEFALIVDSGSTVTYVPC++CEQCG HQDPRFQPDLSS+Y PVKCN+DC CD ++ Q
Sbjct: 97   TPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVDCTCDSDKKQ 156

Query: 541  CIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGR 720
            C YERQYAE+SSSSGVLGEDIVSFG +S LKPQRAVFGCEN ETGDL+SQHADGIMGLGR
Sbjct: 157  CTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSETGDLFSQHADGIMGLGR 216

Query: 721  GQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIE 900
            GQLSIMDQLVEKGVISDSFSLCY              +  PS MVFSHSDP RSPYYNIE
Sbjct: 217  GQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAPSDMVFSHSDPLRSPYYNIE 276

Query: 901  LKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NLKQIHGP 1077
            LKE+HVAGK L+++SRVF+ KHGTVLDSGTTYAYLPE+AF+AFKDAV   + +LK+I GP
Sbjct: 277  LKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGP 336

Query: 1078 DPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQ 1257
            DPNY DICF+GAG ++S+L ++FP V+MVFG GQKLSL PENYLFRHSKV GAYCLG+FQ
Sbjct: 337  DPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQ 396

Query: 1258 NGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSPT-VLDHSSTT 1434
            NGKDPTTLLGGIIVRNTLVTYDR NEKIGF+KTNCS+LW+RLHI +A SP    D +S T
Sbjct: 397  NGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCSELWERLHISDAPSPAPSSDTNSET 456

Query: 1435 GLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQV 1614
             + P+ APS  P      EF+VGLIT D+S+++ Y +L PH+ ELAE IA ELE+++SQV
Sbjct: 457  DMSPAPAPSSLP------EFDVGLITVDMSINVTYPNLKPHLHELAELIAKELEIDSSQV 510

Query: 1615 HLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKI 1794
             ++N T +GN +L RW IFPA S   +SN+TAM II RLT+H +QLP N GSYQL+EW +
Sbjct: 511  RVMNITSQGNSTLIRWGIFPAESDNAMSNATAMGIIYRLTQHHVQLPENLGSYQLLEWNV 570

Query: 1795 EPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQ 1974
            +P  +R+W+++H ++++LG+++  ++  +   + LVWR++     AY+PVD+  PEQELQ
Sbjct: 571  QPLPRRSWFQEHVVSILLGILLVVLVTLSALLVVLVWRKKFSGQTAYRPVDSVAPEQELQ 630

Query: 1975 PL 1980
            PL
Sbjct: 631  PL 632


>ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223550828|gb|EEF52314.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 641

 Score =  816 bits (2109), Expect = 0.0
 Identities = 399/614 (64%), Positives = 469/614 (76%), Gaps = 6/614 (0%)
 Frame = +1

Query: 157  DFLSHLRRGPSMVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQLPNARMRLHDDLLANGY 336
            D  +H  R   + L L  S ++S R  F     RR L    + LPNA MRL+DDLL+NGY
Sbjct: 30   DIPNHNHRPMIIPLHLSTSNISSHRKPFTSNYHRRQLHN--SDLPNAHMRLYDDLLSNGY 87

Query: 337  YTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDC 516
            YTTRL+IGTPPQEFALIVD+GSTVTYVPCSTCEQCGKHQDPRFQP+ SSTY+P++CN  C
Sbjct: 88   YTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCNPSC 147

Query: 517  ECDRERSQCIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHA 696
             CD E  QC YER+YAE+SSSSG+L ED++SFGN+S L PQRA+FGCE VETG+L+SQ A
Sbjct: 148  NCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVETGELFSQRA 207

Query: 697  DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPS 876
            DGIMGLGRG LS++DQLV K V+ +SFSLCY              I PP  MVF+HSDP 
Sbjct: 208  DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNIPPPPDMVFAHSDPY 267

Query: 877  RSPYYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENLN 1056
            RS YYNIELKELHVAGK LKLN RVFDGKHGTVLDSGTTYAYLPEEAF+AFKDA+I+ + 
Sbjct: 268  RSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIK 327

Query: 1057 -LKQIHGPDPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHG 1233
             LKQIHGPDP+YNDICFSGAG D+S+L KIFP V MVFG GQKLSL+PENYLFRH+KV G
Sbjct: 328  FLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSG 387

Query: 1234 AYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLH-----IPEA 1398
            AYCLGIFQNGKDPTTLLGGI+VRNTLVTYDR N+KIGF+KTNCS+LWKRL      IP  
Sbjct: 388  AYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNCSELWKRLQSQSPGIPAP 447

Query: 1399 TSPTVLDHSSTTGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEF 1578
                    + +  + P+ APSG P    P EF +G+ITFD+ ++IN S   P++TE+AEF
Sbjct: 448  PPVVFSSGNKSESIAPTQAPSGLPPDFIPGEFRIGVITFDMLMNINNSAAKPNLTEVAEF 507

Query: 1579 IAHELEVNTSQVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPG 1758
            IAHEL+V+  QVH+LNFT +GN  L +W IFPA SA YISN+TAM+IIL+L  HRLQ P 
Sbjct: 508  IAHELQVDNLQVHMLNFTSQGNNYLVKWGIFPAESADYISNTTAMNIILQLRDHRLQFPE 567

Query: 1759 NFGSYQLVEWKIEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYK 1938
             FGSYQLVEW+I+P  + TWW +HF AV+ GV+   ++     GIW VWR RQ+A   Y+
Sbjct: 568  RFGSYQLVEWRIQPQRRPTWWHEHFFAVVAGVVTILLVSLLSIGIWTVWRHRQRALGTYE 627

Query: 1939 PVDAAFPEQELQPL 1980
            PV    PEQELQPL
Sbjct: 628  PVGGIVPEQELQPL 641


>ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
            gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast
            nucleoid DNA binding protein [Oryza sativa Japonica
            Group] gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza
            sativa Japonica Group]
          Length = 631

 Score =  812 bits (2098), Expect = 0.0
 Identities = 396/600 (66%), Positives = 478/600 (79%), Gaps = 1/600 (0%)
 Frame = +1

Query: 184  PSMVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQLPNARMRLHDDLLANGYYTTRLWIGT 363
            P +VLPL  +  N++R+     S RR L    N  PNARMRLHDDLL NGYYTTRL+IGT
Sbjct: 44   PPLVLPLTLAYPNATRLPAS--SARRGLGDGHN--PNARMRLHDDLLTNGYYTTRLYIGT 99

Query: 364  PPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQC 543
            P QEFALIVDSGSTVTYVPC+TCEQCG HQDPRFQPDLSSTY PVKCN+DC CD ERSQC
Sbjct: 100  PSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCNVDCTCDNERSQC 159

Query: 544  IYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRG 723
             YERQYAE+SSSSGVLGEDI+SFG +S LKPQRAVFGCEN ETGDL+SQHADGIMGLGRG
Sbjct: 160  TYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRG 219

Query: 724  QLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIEL 903
            QLSIMDQLVEKGVISDSFSLCY              +  P  MVFSHS+P RSPYYNIEL
Sbjct: 220  QLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIEL 279

Query: 904  KELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENLN-LKQIHGPD 1080
            KE+HVAGK L+L+ ++F+ KHGTVLDSGTTYAYLPE+AF+AFKDAV   +N LK+I GPD
Sbjct: 280  KEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD 339

Query: 1081 PNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQN 1260
            PNY DICF+GAG ++S+L ++FP V+MVFG GQKLSL+PENYLFRHSKV GAYCLG+FQN
Sbjct: 340  PNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQN 399

Query: 1261 GKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSPTVLDHSSTTGL 1440
            GKDPTTLLGGI+VRNTLVTYDR NEKIGF+KTNCS+LW+RLHI E  S    D  S   +
Sbjct: 400  GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSELWERLHISEVPSSAPSD--SEGDM 457

Query: 1441 PPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQVHL 1620
             P+ APSG P      EF+VGLIT D+S+++ Y +L PH+ ELAE IA EL++++ QV +
Sbjct: 458  APAPAPSGLP------EFDVGLITVDMSINVTYPNLKPHLHELAELIAKELDIDSRQVRV 511

Query: 1621 LNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIEP 1800
            +N T +GN +L RW IFPAG +  ++N+TAM II RLT+H +QLP N GSYQL+EW ++P
Sbjct: 512  MNVTSQGNSTLIRWGIFPAGPSNSMTNTTAMGIIYRLTQHHVQLPENLGSYQLLEWNVQP 571

Query: 1801 AIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
              KR+W++ H ++++LG+++  +L  +   + +VWR++ +  AAY+PVD+A PEQELQPL
Sbjct: 572  LSKRSWFRDHVVSILLGILLVVLLTLSALLVLIVWRKKFRGQAAYRPVDSAVPEQELQPL 631


>ref|XP_007208324.1| hypothetical protein PRUPE_ppa002682mg [Prunus persica]
            gi|462403966|gb|EMJ09523.1| hypothetical protein
            PRUPE_ppa002682mg [Prunus persica]
          Length = 645

 Score =  811 bits (2094), Expect = 0.0
 Identities = 398/604 (65%), Positives = 474/604 (78%), Gaps = 5/604 (0%)
 Frame = +1

Query: 184  PSMVLPLCHSTLNSSRMSFQERSFRRHLQKVGN-QLPNARMRLHDDLLANGYYTTRLWIG 360
            P+MVLPL  ST NSS  S    + RR LQ+  +   PNARMRL+DDLL NGYYTTRLWIG
Sbjct: 45   PAMVLPLYLSTPNSS--SRTSSNPRRLLQRSESLNRPNARMRLYDDLLRNGYYTTRLWIG 102

Query: 361  TPPQEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQ 540
            TPPQ FALIVD+GSTVTYVPC++CE CG+HQDP+F P+ SSTY+ VKCNIDC CD ++  
Sbjct: 103  TPPQRFALIVDTGSTVTYVPCASCEMCGRHQDPKFDPEESSTYKAVKCNIDCTCDSDKVN 162

Query: 541  CIYERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGR 720
            CIYERQYAE+S+SSGVLGED+VSFGNQS L PQRAVFGCEN+ETGDLYSQHADGIMGLGR
Sbjct: 163  CIYERQYAEMSTSSGVLGEDLVSFGNQSELAPQRAVFGCENLETGDLYSQHADGIMGLGR 222

Query: 721  GQLSIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIE 900
            G LS++DQLV+KGVISDSFSLCY               + PS MVF HS+P RSPYYN++
Sbjct: 223  GDLSVVDQLVDKGVISDSFSLCYGGMDIGGGSMVLGGFTTPSDMVFIHSNPVRSPYYNLD 282

Query: 901  LKELHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENL-NLKQIHGP 1077
            LKE+H+AGK L LN  VFDGKHGTVLDSGTTYAYLPE AF+AFKD +++ L +LKQI GP
Sbjct: 283  LKEIHIAGKRLSLNPSVFDGKHGTVLDSGTTYAYLPEAAFLAFKDGIMKELSSLKQIRGP 342

Query: 1078 DPNYNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQ 1257
            DPNYNDICFS   S++S     FP V+MVFG+G+KL+L+PENYLFRHSKV GAYCLG FQ
Sbjct: 343  DPNYNDICFSTDESEVSHPSDTFPTVDMVFGSGKKLTLSPENYLFRHSKVRGAYCLGFFQ 402

Query: 1258 NGKDPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLH---IPEATSPTVLDHSS 1428
            NGKDPTTLLGGI+VRNTLVTYDR+N KIGF+KTNCS+LW+RLH    P A  P   D +S
Sbjct: 403  NGKDPTTLLGGIVVRNTLVTYDRENSKIGFWKTNCSELWERLHQSVSPPAMPPASGDKNS 462

Query: 1429 TTGLPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTS 1608
            T G+ P++AP+GAP Y  P E ++G ITFD+SL+I+YSDL PHITELAEFIA ELEVNTS
Sbjct: 463  TPGVTPTLAPTGAPPYVLPGELQIGKITFDMSLNISYSDLKPHITELAEFIAQELEVNTS 522

Query: 1609 QVHLLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEW 1788
            QVH+L F   GN+SL  WA+FPA S + ISN+TA  I+ RL +H LQ P  FGSY+L+ W
Sbjct: 523  QVHMLKFAANGNDSLISWAVFPADSTESISNTTAAVIVARLAEHHLQFPVMFGSYELLGW 582

Query: 1789 KIEPAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQE 1968
            ++EP  KR+WW QH   VIL ++   ++  ++ G+  + R RQ+    YKPV+AA PEQE
Sbjct: 583  RVEPKEKRSWW-QHSYVVILSILGILVIALSVVGLLFLLRHRQRTVNPYKPVNAAVPEQE 641

Query: 1969 LQPL 1980
            LQPL
Sbjct: 642  LQPL 645


>ref|XP_007155866.1| hypothetical protein PHAVU_003G238400g [Phaseolus vulgaris]
            gi|561029220|gb|ESW27860.1| hypothetical protein
            PHAVU_003G238400g [Phaseolus vulgaris]
          Length = 638

 Score =  809 bits (2090), Expect = 0.0
 Identities = 408/637 (64%), Positives = 484/637 (75%), Gaps = 7/637 (1%)
 Frame = +1

Query: 91   IFSAFLTLS-FLQFAEISGDQEHDFLSHLRRGPSMVLPLCHSTLNSSRMSFQERSFRRHL 267
            +   F+TLS FLQ   I+GD       +    P+MVLPL  S+ NS+  +   R  R+  
Sbjct: 9    VLITFITLSLFLQL--IAGDSALLRNRYPGARPAMVLPLYLSSPNSTSSALDPR--RQLY 64

Query: 268  QKVGNQLPNARMRLHDDLLANGYYTTRLWIGTPPQEFALIVDSGSTVTYVPCSTCEQCGK 447
                 + PNARMRLHDDLL NGYYTTRLWIGTP Q FALIVD+GSTVTYVPCS+CEQCG+
Sbjct: 65   GSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPAQMFALIVDTGSTVTYVPCSSCEQCGR 124

Query: 448  HQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIYERQYAELSSSSGVLGEDIVSFGNQSA 627
            HQDP+FQPD SSTY+PVKC IDC CD +R QC+YERQYAE+S+SSGVLGEDI+SFGNQS 
Sbjct: 125  HQDPKFQPDSSSTYEPVKCTIDCNCDGDRIQCVYERQYAEMSTSSGVLGEDIISFGNQSD 184

Query: 628  LKPQRAVFGCENVETGDLYSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYXXXXXX 807
            L PQRAVFGCENVETGDLYSQHADGIMGLGRG LSIMDQLV+K VISDSFSLCY      
Sbjct: 185  LPPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVG 244

Query: 808  XXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKELHVAGKPLKLNSRVFDGKHGTVLDSG 987
                    ISPPS MVF +SDP RSPYYNI+LKE+HVAGK L LN+ +FDGKHGTVLDSG
Sbjct: 245  GGAMVLGGISPPSDMVFGYSDPVRSPYYNIDLKEIHVAGKQLPLNANIFDGKHGTVLDSG 304

Query: 988  TTYAYLPEEAFMAFKDAVIENL-NLKQIHGPDPNYNDICFSGAGSDISELEKIFPAVEMV 1164
            TTYAYLPE AF+AFKD +++ L +LKQI GPDPNYNDICFSGAG   S+L K FP V+MV
Sbjct: 305  TTYAYLPEAAFLAFKDTILKELQSLKQISGPDPNYNDICFSGAGIVASQLSKSFPVVDMV 364

Query: 1165 FGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIIVRNTLVTYDRKNEKIG 1344
            FG G K SLAPENY+FRHSKV GAYCLGIFQNGKDPTTLLGGI+VRNTLV YDR+ EKIG
Sbjct: 365  FGNGDKYSLAPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVMYDREQEKIG 424

Query: 1345 FYKTNCSDLWKRLHIPEATSPTVLDHSSTTGLP-----PSMAPSGAPNYNFPDEFEVGLI 1509
            F+KTNC++LW+RL   ++ +P+ L  +S    P     PS+APS + N   P E ++  I
Sbjct: 425  FWKTNCAELWERLQ--DSLAPSPLPPNSEVSNPTEASEPSVAPSLSQNNAPPGELKIAQI 482

Query: 1510 TFDISLSINYSDLVPHITELAEFIAHELEVNTSQVHLLNFTGKGNESLTRWAIFPAGSAK 1689
            T  IS +I Y D+ PHITELA   AHEL VNTSQVHLLNFT  GN+SL++WAI P   A 
Sbjct: 483  TMLISFNITYVDIKPHITELAGLFAHELNVNTSQVHLLNFTSSGNDSLSKWAITPKSDAH 542

Query: 1690 YISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIEPAIKRTWWKQHFLAVILGVIMSFI 1869
            YISN+TA +II RL +HR+QLP  +G+Y+L++W +E   K  WW+Q+F  + L  +++ +
Sbjct: 543  YISNTTATNIISRLAEHRVQLPDTYGNYELIDWSVEHPSK-NWWQQYFWVLGLAFLIALL 601

Query: 1870 LGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQPL 1980
            +G +I G +L+WR+RQQ    YKPVDAA  EQELQPL
Sbjct: 602  IGLSILGTFLIWRKRQQNLHTYKPVDAAVTEQELQPL 638


>ref|XP_006472263.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Citrus
            sinensis]
          Length = 638

 Score =  809 bits (2089), Expect = 0.0
 Identities = 386/601 (64%), Positives = 478/601 (79%), Gaps = 4/601 (0%)
 Frame = +1

Query: 190  MVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQLPNARMRLHDDLLANGYYTTRLWIGTPP 369
            +++PL  S+ NS+     +   R  LQ   ++ PNARMRL+DDLL+NGYYTTRL IGTPP
Sbjct: 37   LIIPLHLSSPNSTVHRHADDRRRHLLQAQLSRKPNARMRLYDDLLSNGYYTTRLNIGTPP 96

Query: 370  QEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIY 549
            Q+FALIVD+GSTVTYVPCSTC+ CG+HQDPRFQ ++S+TYQ +KCN DC CD +R  CIY
Sbjct: 97   QQFALIVDTGSTVTYVPCSTCQHCGRHQDPRFQTEMSNTYQALKCNPDCNCDNDRKDCIY 156

Query: 550  ERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRGQL 729
            ER+YAE+SSSSGVLG D++SFGN+S L PQRAVFGCEN+ETGDLY+Q ADGIMGLGRG+L
Sbjct: 157  ERRYAEMSSSSGVLGVDVISFGNESELVPQRAVFGCENLETGDLYTQRADGIMGLGRGRL 216

Query: 730  SIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKE 909
            S++DQLVEKGVISDSFSLCY              I+PP  MVFSHSDP RSPYYNIELKE
Sbjct: 217  SVVDQLVEKGVISDSFSLCYGGMDVGGGAMVLGGITPPPDMVFSHSDPFRSPYYNIELKE 276

Query: 910  LHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENLN-LKQIHGPDPN 1086
            L VAGKPLK++ R+FDG HGTVLDSGTTYAYLP  AF AFKDA+I+  + LK+I GPDPN
Sbjct: 277  LRVAGKPLKVSPRIFDGGHGTVLDSGTTYAYLPGHAFAAFKDALIKETHVLKRIRGPDPN 336

Query: 1087 YNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGK 1266
            Y+D+CFSGAG D+SEL K FP V+MVFG GQKL+L+PENYLFRH KV GAYCLGIFQN  
Sbjct: 337  YDDMCFSGAGRDVSELSKTFPQVDMVFGNGQKLTLSPENYLFRHMKVSGAYCLGIFQN-S 395

Query: 1267 DPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSP---TVLDHSSTTG 1437
            D TTLLGGI+VRNTLVTYDR N+K+GF+KTNCS+LW+RL +P   +P       + S+ G
Sbjct: 396  DSTTLLGGIVVRNTLVTYDRGNDKVGFWKTNCSELWRRLQLPSVPAPPPSISSSNDSSIG 455

Query: 1438 LPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQVH 1617
            +PP +AP G P    P  F++G+ITFD+S S+N S + P+ TEL+EFIAHEL+V+  +VH
Sbjct: 456  MPPRLAPDGLPLNVLPGAFQIGVITFDMSFSLNNSHMKPNFTELSEFIAHELQVDDIEVH 515

Query: 1618 LLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIE 1797
            LLNF+ KG++ L RW IFP  S  YISN+TA++IILRL +H +Q P  FGS+QLV+W IE
Sbjct: 516  LLNFSSKGHDYLVRWGIFPDESDNYISNTTALNIILRLREHHMQFPERFGSHQLVKWNIE 575

Query: 1798 PAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQP 1977
            P IK+TWW++H +AV++G++++ +LG +I G+W VW+RRQ+A   Y+PV A  PEQELQP
Sbjct: 576  PQIKQTWWQRHLVAVVVGIVVTLLLGLSILGLWSVWKRRQEASKTYQPVGAVVPEQELQP 635

Query: 1978 L 1980
            L
Sbjct: 636  L 636


>ref|XP_006433596.1| hypothetical protein CICLE_v10000565mg [Citrus clementina]
            gi|557535718|gb|ESR46836.1| hypothetical protein
            CICLE_v10000565mg [Citrus clementina]
          Length = 638

 Score =  807 bits (2085), Expect = 0.0
 Identities = 386/601 (64%), Positives = 479/601 (79%), Gaps = 4/601 (0%)
 Frame = +1

Query: 190  MVLPLCHSTLNSSRMSFQERSFRRHLQKVGNQLPNARMRLHDDLLANGYYTTRLWIGTPP 369
            +++PL  S+ NS+     +   R  LQ   ++ PNARMRL+DDLL+NGYYTTRL IGTPP
Sbjct: 37   LIIPLHLSSPNSTVHRHADDRRRHLLQAQLSRKPNARMRLYDDLLSNGYYTTRLNIGTPP 96

Query: 370  QEFALIVDSGSTVTYVPCSTCEQCGKHQDPRFQPDLSSTYQPVKCNIDCECDRERSQCIY 549
            Q+FALIVD+GSTVTYVPCSTC+ CG+HQDPRFQ ++S+TYQ +KCN DC CD +R +CIY
Sbjct: 97   QQFALIVDTGSTVTYVPCSTCQHCGRHQDPRFQTEMSNTYQALKCNPDCNCDNDRKECIY 156

Query: 550  ERQYAELSSSSGVLGEDIVSFGNQSALKPQRAVFGCENVETGDLYSQHADGIMGLGRGQL 729
            ER+YAE+S+SSGVLG D++SFGN+S L PQRAVFGCEN+ETGDLY+Q ADGIMGLGRG+L
Sbjct: 157  ERRYAEMSTSSGVLGVDVISFGNESELVPQRAVFGCENLETGDLYTQRADGIMGLGRGRL 216

Query: 730  SIMDQLVEKGVISDSFSLCYXXXXXXXXXXXXXXISPPSGMVFSHSDPSRSPYYNIELKE 909
            SI+DQLVEKGVISDSFSLCY              I+PP  MVFSHSDP RSPYYNIELKE
Sbjct: 217  SIVDQLVEKGVISDSFSLCYGGMDVGGGAMVLGGITPPPDMVFSHSDPFRSPYYNIELKE 276

Query: 910  LHVAGKPLKLNSRVFDGKHGTVLDSGTTYAYLPEEAFMAFKDAVIENLN-LKQIHGPDPN 1086
            L VAGKPLK++ R+FDG HGTVLDSGTTYAYLP  AF AFKDA+I+  + LK+I GPDPN
Sbjct: 277  LRVAGKPLKVSPRIFDGGHGTVLDSGTTYAYLPGHAFAAFKDALIKETHVLKRIRGPDPN 336

Query: 1087 YNDICFSGAGSDISELEKIFPAVEMVFGTGQKLSLAPENYLFRHSKVHGAYCLGIFQNGK 1266
            Y+DICFSGAG D+SEL K FP V+MVFG GQKL+L+PENYLFRH KV GAYCLGIFQN  
Sbjct: 337  YDDICFSGAGRDVSELSKTFPQVDMVFGNGQKLTLSPENYLFRHMKVSGAYCLGIFQN-S 395

Query: 1267 DPTTLLGGIIVRNTLVTYDRKNEKIGFYKTNCSDLWKRLHIPEATSP---TVLDHSSTTG 1437
            D TTLLGGI+VRNTLVTYDR N+K+GF+KTNCS+LW+RL +P   +P       + S+ G
Sbjct: 396  DSTTLLGGIVVRNTLVTYDRGNDKVGFWKTNCSELWRRLQLPSVPAPPPSISSSNDSSIG 455

Query: 1438 LPPSMAPSGAPNYNFPDEFEVGLITFDISLSINYSDLVPHITELAEFIAHELEVNTSQVH 1617
            +PP +AP G P    P  F++G+ITFD+S S+N S + P+ TEL+EFIAHEL+V+  +VH
Sbjct: 456  MPPRLAPDGLPLNVLPGAFQIGVITFDMSFSLNNSHMKPNFTELSEFIAHELQVDDIEVH 515

Query: 1618 LLNFTGKGNESLTRWAIFPAGSAKYISNSTAMSIILRLTKHRLQLPGNFGSYQLVEWKIE 1797
            LLNF+ KG++ L RW IFP  S  YISN+TA++IILRL +H +Q P  FGS+QLV+W IE
Sbjct: 516  LLNFSSKGHDYLVRWGIFPDESDNYISNTTALNIILRLREHHMQFPERFGSHQLVKWNIE 575

Query: 1798 PAIKRTWWKQHFLAVILGVIMSFILGFTICGIWLVWRRRQQAFAAYKPVDAAFPEQELQP 1977
            P IK+TWW+++ +AV++G++++ +LG +I G+W VW+RRQ+A   Y+PV A  PEQELQP
Sbjct: 576  PQIKQTWWQRNLVAVVVGIVVTLLLGLSILGLWSVWKRRQEASKTYQPVGAVVPEQELQP 635

Query: 1978 L 1980
            L
Sbjct: 636  L 636

