BLASTX nr result
ID: Forsythia23_contig00031338
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00031338
         (1187 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011095837.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   416   e-113
ref|XP_012849177.1| PREDICTED: aspartic proteinase nepenthesin-1...   346   2e-92
emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   314   9e-83
ref|XP_002265771.3| PREDICTED: aspartic proteinase nepenthesin-2...   311   8e-82
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              276   3e-71
ref|XP_012463657.1| PREDICTED: aspartic proteinase nepenthesin-1...   273   1e-70
gb|KHG15209.1| Asparticase nepenthesin-1 [Gossypium arboreum]         272   3e-70
ref|XP_007022806.1| Eukaryotic aspartyl protease family protein,...   262   3e-67
ref|XP_010092446.1| Aspartic proteinase nepenthesin-1 [Morus not...   261   9e-67
ref|XP_010064103.1| PREDICTED: aspartic proteinase CDR1 [Eucalyp...   258   6e-66
gb|KDO61509.1| hypothetical protein CISIN_1g046757mg [Citrus sin...   256   2e-65
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...   256   2e-65
ref|XP_004293837.1| PREDICTED: aspartic proteinase CDR1-like [Fr...   253   3e-64
ref|XP_007049083.1| Eukaryotic aspartyl protease family protein,...   252   3e-64
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...   251   7e-64
ref|XP_010260839.1| PREDICTED: aspartic proteinase nepenthesin-1...   248   8e-63
ref|XP_012486822.1| PREDICTED: aspartic proteinase nepenthesin-2...   247   1e-62
gb|KJB10346.1| hypothetical protein B456_001G196900 [Gossypium r...
247 1e-62 emb|CDX73806.1| BnaA03g32410D [Brassica napus] 246 2e-62 emb|CDY00795.1| BnaC03g37790D [Brassica napus] 245 4e-62 >ref|XP_011095837.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Sesamum indicum] Length = 488 Score = 416 bits (1070), Expect = e-113 Identities = 210/288 (72%), Positives = 237/288 (82%), Gaps = 6/288 (2%) Frame = -1 Query: 848 SQGHGESVGTKLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQK- 672 S GHG GTK ELIHRHH R +P TQI+RLRQLLHSDTIR IS ++RL++ Sbjct: 24 SWGHGNPGGTKFELIHRHHLER------KPATQIQRLRQLLHSDTIRLPEISHKVRLRQG 77 Query: 671 ----SRRRVLESPDATYYHPACTNSSRRAKHDN-VSGEMAMYSGADFGTGQYFVSFKVGS 507 SRR++ P+ T Y+PACTNSSRR+K+DN VSGEM M+SGAD+GTGQYFV F+VGS Sbjct: 78 HFDASRRQL---PEETAYYPACTNSSRRSKNDNNVSGEMPMHSGADYGTGQYFVRFRVGS 134 Query: 506 PARRFMLIADTGSDLTWMNCKYRCHGAKCRKKSRKRRVFRADHSSSFVTVPCSSRMCKIE 327 PA++ MLIADTGSDLTWMNCKYRC G +CRK S K RVF ADHSSSF TV CSS MCKI+ Sbjct: 135 PAQKLMLIADTGSDLTWMNCKYRCRGGRCRKSSNKGRVFLADHSSSFRTVHCSSSMCKID 194 Query: 326 LANLFSLARCPSPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSES 147 LANLFSLARCPSP PCAYDYRYSDGS+ALG+FANE VTF LTN K R+ NVLVGCSES Sbjct: 195 LANLFSLARCPSPMDPCAYDYRYSDGSAALGLFANEMVTFTLTNRRKTRLRNVLVGCSES 254 Query: 146 STGQSFQGADGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 + GQSFQGADGVMGLGYS+YSFA+KAA +FGGKFSYCLVDHLSP+NVS Sbjct: 255 TRGQSFQGADGVMGLGYSDYSFAVKAAKRFGGKFSYCLVDHLSPENVS 302 >ref|XP_012849177.1| PREDICTED: aspartic proteinase nepenthesin-1 [Erythranthe guttatus] gi|604314897|gb|EYU27603.1| hypothetical protein MIMGU_mgv1a004950mg [Erythranthe guttata] Length = 503 Score = 346 bits (887), Expect = 2e-92 Identities = 187/325 (57%), Positives = 230/325 (70%), Gaps = 17/325 (5%) Frame = -1 Query: 926 MKMYRQQRGXXXXXXXFYVVI-FLEKCSQGHGESVGT-KLELIHRHHFRRNQAN-GMQPM 756 M + +QRG + ++ + K ++G S G KLELIHRHH + + N QP+ Sbjct: 1 MVTHTRQRGFSLFIICLFTIVNYSLKFTEGIRVSDGAVKLELIHRHHLQGERRNVAAQPL 60 Query: 755 TQIERLRQLLHSDTIRQRSISERLRLQKS-----RRRVLESPDATYYHPACTN------S 609 ERLRQL+HSD +R R IS ++ L + RRRV E+ DA + PA TN S Sbjct: 
61 ---ERLRQLVHSDAVRLRGISLKVMLIQGGAGPVRRRVSETDDA--FIPASTNGGGGGGS 115 Query: 608 SRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLTWMNCKYRCHG 429 + + + NVSG++ + SGADFGTGQYFV F+VGSPA++ +LIADTGSDLTWMNCKYRC G Sbjct: 116 NNKEQFSNVSGQLPISSGADFGTGQYFVQFRVGSPAQKVVLIADTGSDLTWMNCKYRCRG 175 Query: 428 AK---CRKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPSPHTPCAYDYRY 258 CR+ S KRR+F AD SSSF TVPCSS C +LANLFSL RCPSP +PCAYDYRY Sbjct: 176 GGGGGCRRNSNKRRLFWADRSSSFRTVPCSSTTCTNDLANLFSLTRCPSPISPCAYDYRY 235 Query: 257 SDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGVMGLGYSNYSFA 78 SDGS+A G+F NETVT LTNG K R+HNVL+GCS SS+G +FQ ADGV+GLGYSNYS A Sbjct: 236 SDGSAAQGLFGNETVTLSLTNGRKTRLHNVLIGCSISSSGPTFQSADGVIGLGYSNYSLA 295 Query: 77 LKAAGKFGGKFSYCLVDHLSPQNVS 3 +KA+ F G FSYCLVDHLSP+N+S Sbjct: 296 VKASNLFRGIFSYCLVDHLSPKNIS 320 >emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera] Length = 449 Score = 314 bits (804), Expect = 9e-83 Identities = 161/276 (58%), Positives = 200/276 (72%), Gaps = 4/276 (1%) Frame = -1 Query: 818 KLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLESPDA 639 +LELIHRH Q G +P TQ++RL++L+HSD++RQ I +LR + RR + Sbjct: 2 RLELIHRHS---PQVMG-RPKTQLQRLKELVHSDSVRQLMILHKLRGGQIPRRKAKE--- 54 Query: 638 TYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLT 459 ++SS R D + E+ M+ AD+G GQYFV+FKVG+P+++FML+ADTGSDLT Sbjct: 55 -----VLSSSSGRGSDDAI--EVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLT 107 Query: 458 WMNCKYRCHGAKCRKKSRKR----RVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPS 291 WM+CKY C C + +R RVF A+ SSSF T+PC + MCKIEL +LFSL CP+ Sbjct: 108 WMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPT 167 Query: 290 PHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGV 111 P TPC YDYRYSDGS+ALG FANETVT L G K+++HNVL+GCSES GQSFQ ADGV Sbjct: 168 PLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGV 227 Query: 110 MGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 MGLGYS YSFA+KAA KFGGKFSYCLVDHLS +NVS Sbjct: 228 MGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKNVS 263 >ref|XP_002265771.3| PREDICTED: aspartic 
proteinase nepenthesin-2 [Vitis vinifera] Length = 489 Score = 311 bits (796), Expect = 8e-82 Identities = 160/276 (57%), Positives = 199/276 (72%), Gaps = 4/276 (1%) Frame = -1 Query: 818 KLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLESPDA 639 +LELIHRH Q G +P TQ++RL++L+HSD++RQ I +LR + RR + Sbjct: 42 RLELIHRHS---PQVMG-RPKTQLQRLKELVHSDSVRQLMILHKLRGGQIPRRKAKE--- 94 Query: 638 TYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLT 459 ++SS R D + E+ M+ AD+G GQY V+FKVG+P+++FML+ADTGSDLT Sbjct: 95 -----VLSSSSGRGSDDAI--EVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLT 147 Query: 458 WMNCKYRCHGAKCRKKSRKR----RVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPS 291 WM+CKY C C + +R RVF A+ SSSF T+PC + MCKIEL +LFSL CP+ Sbjct: 148 WMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPT 207 Query: 290 PHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGV 111 P TPC YDYRYSDGS+ALG FANETVT L G K+++HNVL+GCSES GQSFQ ADGV Sbjct: 208 PLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGV 267 Query: 110 MGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 MGLGYS YSFA+KAA KFGGKFSYCLVDHLS +NVS Sbjct: 268 MGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKNVS 303 >emb|CBI24128.3| unnamed protein product [Vitis vinifera] Length = 378 Score = 276 bits (705), Expect = 3e-71 Identities = 130/192 (67%), Positives = 153/192 (79%), Gaps = 4/192 (2%) Frame = -1 Query: 566 MYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLTWMNCKYRCHGAKCRKKSRKR---- 399 M+ AD+G GQY V+FKVG+P+++FML+ADTGSDLTWM+CKY C C + +R Sbjct: 1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60 Query: 398 RVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPSPHTPCAYDYRYSDGSSALGIFANE 219 RVF A+ SSSF T+PC + MCKIEL +LFSL CP+P TPC YDYRYSDGS+ALG FANE Sbjct: 61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120 Query: 218 TVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGVMGLGYSNYSFALKAAGKFGGKFSY 39 TVT L G K+++HNVL+GCSES GQSFQ ADGVMGLGYS YSFA+KAA KFGGKFSY Sbjct: 121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180 Query: 38 CLVDHLSPQNVS 3 CLVDHLS +NVS Sbjct: 181 
CLVDHLSHKNVS 192 >ref|XP_012463657.1| PREDICTED: aspartic proteinase nepenthesin-1 [Gossypium raimondii] gi|763814626|gb|KJB81478.1| hypothetical protein B456_013G147300 [Gossypium raimondii] Length = 473 Score = 273 bits (699), Expect = 1e-70 Identities = 148/288 (51%), Positives = 180/288 (62%), Gaps = 7/288 (2%) Frame = -1 Query: 845 QGHGESVGTKLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSR 666 Q +S LELIHRH Q P+TQ +RL LL+ D IR +S R Sbjct: 28 QHQHDSNSITLELIHRH---APQFTNNHPITQHQRLVDLLYHDIIRHGIMSHR------- 77 Query: 665 RRVLESPDATYYHPACTNSSRRAKHDN---VSGEMAMYSGADFGTGQYFVSFKVGSPARR 495 RRAK ++ S +M + SG DFG GQY SFKVG+P+++ Sbjct: 78 --------------------RRAKEEDPLTASIKMPLASGRDFGIGQYITSFKVGTPSQK 117 Query: 494 FMLIADTGSDLTWMNCKYRCHGA--KCRKKSR--KRRVFRADHSSSFVTVPCSSRMCKIE 327 F LI DTGSDLTW+ C+YRC C +K R ++RVF A SSSF VPC S MCK+E Sbjct: 118 FWLIVDTGSDLTWIRCRYRCSRGDRSCTRKGRINRKRVFHAPLSSSFSPVPCFSEMCKVE 177 Query: 326 LANLFSLARCPSPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSES 147 L NLFSL CP+P TPCAYDYRYSDGS+A+G+FANETV+ GLTNG K R+HNVL+GC++S Sbjct: 178 LMNLFSLTTCPTPITPCAYDYRYSDGSAAMGVFANETVSAGLTNGRKTRLHNVLIGCTDS 237 Query: 146 STGQSFQGADGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 G + Q DG+MGL + YSFA AA FGGKFSYCLVDHLS N + Sbjct: 238 FQGPTLQNVDGIMGLANTKYSFATNAAATFGGKFSYCLVDHLSHLNAT 285 >gb|KHG15209.1| Asparticase nepenthesin-1 [Gossypium arboreum] Length = 473 Score = 272 bits (696), Expect = 3e-70 Identities = 148/288 (51%), Positives = 179/288 (62%), Gaps = 7/288 (2%) Frame = -1 Query: 845 QGHGESVGTKLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSR 666 Q +S LELIHRH Q P+TQ +RL LL+ D IR +S R Sbjct: 28 QHQHDSNSITLELIHRH---APQFTNNNPITQHQRLVDLLYHDIIRHGIMSHR------- 77 Query: 665 RRVLESPDATYYHPACTNSSRRAKHDN---VSGEMAMYSGADFGTGQYFVSFKVGSPARR 495 RRAK ++ S +M + SG DFG GQY SFKVG+P+++ Sbjct: 78 --------------------RRAKEEDPLTASIKMPLASGRDFGIGQYITSFKVGTPSQK 117 Query: 494 FMLIADTGSDLTWMNCKYRCHGA--KCRKKSR--KRRVFRADHSSSFVTVPCSSRMCKIE 327 F LI DTGSDLTW+ C+YRC C K R ++RVF A SSSF VPC S MCK+E Sbjct: 
118 FWLIVDTGSDLTWIRCRYRCSRGDRSCTSKGRINRKRVFHAPLSSSFNPVPCFSEMCKVE 177 Query: 326 LANLFSLARCPSPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSES 147 L NLFSL CP+P TPCAYDYRYSDGS+A+G+FANETV+ GLTNG K R+HNVL+GC++S Sbjct: 178 LMNLFSLTTCPTPITPCAYDYRYSDGSAAMGVFANETVSAGLTNGRKTRLHNVLIGCTDS 237 Query: 146 STGQSFQGADGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 G + Q DG+MGL + YSFA AA FGGKFSYCLVDHLS N + Sbjct: 238 FQGPTLQNVDGIMGLANTKYSFATNAAATFGGKFSYCLVDHLSHLNAT 285 >ref|XP_007022806.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] gi|508722434|gb|EOY14331.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 473 Score = 262 bits (670), Expect = 3e-67 Identities = 142/286 (49%), Positives = 182/286 (63%), Gaps = 14/286 (4%) Frame = -1 Query: 818 KLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLESPDA 639 KLEL+HRH + + +P TQ ERL+ L+H D IR +RR+ E+P Sbjct: 24 KLELLHRHAPQLHA----RPKTQHERLKDLVHHDFIRH-----------NRRQAWETPKT 68 Query: 638 TYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLT 459 T + A N + +M + +G DFG GQY +FKVG+P+++F LI DTGSDLT Sbjct: 69 T---------TATASKTNAAIQMPLSAGRDFGIGQYVTTFKVGTPSQKFRLIVDTGSDLT 119 Query: 458 WMNCKYRC-HGAKCRKKSR---KRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPS 291 W+NC+YRC G C + R + RVFRA SSSF +PC S+MCK+EL NLFSL CP+ Sbjct: 120 WINCRYRCARGDNCTTQERGIKRGRVFRAHLSSSFRPIPCFSQMCKVELRNLFSLTICPT 179 Query: 290 PHTPCAYDYR----------YSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESST 141 P TPCAYDYR Y DGS A+G+FA E+VT GLTN R+H+VL+GCS+SS Sbjct: 180 PLTPCAYDYRFNSLKLVLNRYIDGSDAMGVFAKESVTVGLTNSRMARLHDVLIGCSDSSQ 239 Query: 140 GQSFQGADGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 G++ + DGV+GL S YSF KAA ++GGKFSYCLVDHLS N S Sbjct: 240 GRTVKNVDGVLGLANSKYSFVTKAAERWGGKFSYCLVDHLSHINAS 285 >ref|XP_010092446.1| Aspartic proteinase nepenthesin-1 [Morus notabilis] gi|587861358|gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis] Length = 464 Score = 261 bits (666), Expect = 9e-67 Identities = 142/279 (50%), Positives = 188/279 
(67%), Gaps = 6/279 (2%) Frame = -1 Query: 821 TKLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLESPD 642 T+LEL+HR+ + ++ P T +E+L + D +R R +S R R +E+ Sbjct: 24 TRLELLHRNSPKLSE-KWQIPETTMEKLIEFHRRDVLRHRMVSHR-------RMGIET-- 73 Query: 641 ATYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDL 462 A +++S A M M +GAD+G G+YFV VG+P +RFML+ADTGSDL Sbjct: 74 ------ASSSASSIA--------MPMNAGADYGVGEYFVHVTVGTPGQRFMLVADTGSDL 119 Query: 461 TWMNCKYRCHGAKC---RKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPS 291 TWM+C RC G +C + + RRVF AD SSSF T+PC S MCK+ELANLFSL++CP+ Sbjct: 120 TWMHC--RC-GRRCGTHKGRLNNRRVFHADRSSSFKTIPCLSEMCKVELANLFSLSKCPT 176 Query: 290 PHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTG---QSFQGA 120 P TPCAYDYRY +GSSA+G FANET++ L NG K ++ +VLVGC+ES G F+GA Sbjct: 177 PLTPCAYDYRYLEGSSAIGFFANETISVRLANGKKRKLRDVLVGCTESVQGAEESGFKGA 236 Query: 119 DGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 DGV+GLG+ N++F KAA FGGKFSYCLVDHLSP+N+S Sbjct: 237 DGVLGLGFGNHTFTRKAAQYFGGKFSYCLVDHLSPKNLS 275 >ref|XP_010064103.1| PREDICTED: aspartic proteinase CDR1 [Eucalyptus grandis] gi|629105951|gb|KCW71420.1| hypothetical protein EUGRSUZ_F04481 [Eucalyptus grandis] Length = 477 Score = 258 bits (659), Expect = 6e-66 Identities = 137/252 (54%), Positives = 173/252 (68%), Gaps = 2/252 (0%) Frame = -1 Query: 752 QIERLRQLLHSDTIRQRSISERLRLQKSRRRVLESPDATYYHPACTNSSRRAKHDNVSGE 573 Q++R+R+L+HSD +R R I Q +RR+V E P RR N+S Sbjct: 53 QMKRIRELVHSDILR-RGIMFSKHHQSTRRKVWEKP------------RRRTNCSNISIG 99 Query: 572 MAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLTWMNCKYRCHGAKCRKKSRKRRV 393 M + SG D+GTGQYFV VG+P ++ +LIADTGS+LTWMNCK HG +RR Sbjct: 100 MPISSGRDYGTGQYFVEVNVGTPPQKMLLIADTGSELTWMNCKR--HG--------RRRG 149 Query: 392 FRADHSSSFVTVPCSSRMCKIELANLFSLARCPSPHTPCAYDYRYSDGSSALGIFANETV 213 F++ SS+F TVPCSSR CKI+ +LFSLARCP+P TPC+YDYRYSDGS ALGIFA ETV Sbjct: 150 FQSTRSSTFKTVPCSSRTCKIDFMDLFSLARCPTPSTPCSYDYRYSDGSGALGIFARETV 209 Query: 212 TFGLTN--GTKVRIHNVLVGCSESSTGQSFQGADGVMGLGYSNYSFALKAAGKFGGKFSY 39 T +TN G ++ +V+VGC+ + GQ FQGADGV+GL 
YSNYSFA +A+ FGG FSY Sbjct: 210 TAEITNEKGRATKVEDVVVGCTLTLQGQGFQGADGVLGLAYSNYSFATRASHTFGGTFSY 269 Query: 38 CLVDHLSPQNVS 3 CLVDHLS + +S Sbjct: 270 CLVDHLSHKYLS 281 >gb|KDO61509.1| hypothetical protein CISIN_1g046757mg [Citrus sinensis] Length = 445 Score = 256 bits (654), Expect = 2e-65 Identities = 138/282 (48%), Positives = 177/282 (62%), Gaps = 7/282 (2%) Frame = -1 Query: 827 VGTKLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLES 648 V ++ELIHRH + N M M+++ER+++LLH+D IRQ K R R L Sbjct: 5 VAVRMELIHRHS---PKLNNMPMMSEVERMKELLHNDIIRQN---------KRRGRRLRQ 52 Query: 647 PDATYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGS 468 TN++ + EM + +G D+GTG YFV KVG+P+++ LI DTGS Sbjct: 53 ----------TNNNNNNGASGSAIEMPLQAGRDYGTGMYFVEIKVGTPSQKLRLIVDTGS 102 Query: 467 DLTWMNCKYRCHGAKCRKKSR----KRRVFRADHSSSFVTVPCSSRMCKIELANLFSLAR 300 + +W++C+Y C G C KK +RRVF+AD SSSF T+PCSS MCK E A LFSL Sbjct: 103 EFSWISCRYHC-GPSCTKKGTIAGSRRRVFKADLSSSFKTIPCSSDMCKSEFARLFSLTF 161 Query: 299 CPSPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGA 120 CP+P +PCAYDYRY+DGS+A GIF E VT GL NG K RI V++GCS++ GQ F A Sbjct: 162 CPTPTSPCAYDYRYADGSAAKGIFGKERVTIGLENGGKTRIEEVVMGCSDTIQGQIFAEA 221 Query: 119 DGVMGLGYSNYSFALKAAGK---FGGKFSYCLVDHLSPQNVS 3 DGV+GL Y YSFA K GKF+YCLVDHLS +NVS Sbjct: 222 DGVLGLSYDKYSFAQKVTNGSTFARGKFAYCLVDHLSHKNVS 263 >ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis] gi|557524190|gb|ESR35557.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] Length = 470 Score = 256 bits (654), Expect = 2e-65 Identities = 138/282 (48%), Positives = 177/282 (62%), Gaps = 7/282 (2%) Frame = -1 Query: 827 VGTKLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLES 648 V ++ELIHRH + N M M+++ER+++LLH+D IRQ K R R L Sbjct: 30 VAVRMELIHRHS---PKLNNMPMMSEVERMKELLHNDIIRQN---------KRRGRRLRQ 77 Query: 647 PDATYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGS 468 TN++ + 
EM + +G D+GTG YFV KVG+P+++ LI DTGS Sbjct: 78 ----------TNNNNNNGASGSAIEMPLQAGRDYGTGMYFVEIKVGTPSQKLRLIVDTGS 127 Query: 467 DLTWMNCKYRCHGAKCRKKSR----KRRVFRADHSSSFVTVPCSSRMCKIELANLFSLAR 300 + +W++C+Y C G C KK +RRVF+AD SSSF T+PCSS MCK E A LFSL Sbjct: 128 EFSWISCRYHC-GPSCTKKGTIAGSRRRVFKADLSSSFKTIPCSSDMCKSEFARLFSLTF 186 Query: 299 CPSPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGA 120 CP+P +PCAYDYRY+DGS+A GIF E VT GL NG K RI V++GCS++ GQ F A Sbjct: 187 CPTPTSPCAYDYRYADGSAAKGIFGKERVTIGLENGGKTRIEEVVMGCSDTIQGQIFAEA 246 Query: 119 DGVMGLGYSNYSFALKAAGK---FGGKFSYCLVDHLSPQNVS 3 DGV+GL Y YSFA K GKF+YCLVDHLS +NVS Sbjct: 247 DGVLGLSYDKYSFAQKVTNGSTFARGKFAYCLVDHLSHKNVS 288 >ref|XP_004293837.1| PREDICTED: aspartic proteinase CDR1-like [Fragaria vesca subsp. vesca] Length = 482 Score = 253 bits (645), Expect = 3e-64 Identities = 136/280 (48%), Positives = 169/280 (60%), Gaps = 8/280 (2%) Frame = -1 Query: 818 KLELIHRHHFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLESPDA 639 KLELIHRH R P TQ+E + +L D IR + IS R + Sbjct: 38 KLELIHRHSLRVEM-----PKTQLELIEELQRHDVIRHQMISRRRQ-------------- 78 Query: 638 TYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLT 459 ++H T R A S M + S DFG GQYFV KVG+P++RF+LIADTGSDLT Sbjct: 79 HHHHSIPTGLRRNALETAASIAMPLSSAWDFGAGQYFVQIKVGTPSQRFLLIADTGSDLT 138 Query: 458 WMNCKYRCHGAKC-----RKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCP 294 WM CKYRC KC K K++VFR SS+F +PCSS MCK EL FS CP Sbjct: 139 WMKCKYRCVADKCGLKRATMKKNKKKVFRPAQSSTFKIIPCSSEMCKFELE--FSRQECP 196 Query: 293 SPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSES---STGQSFQG 123 +P +PC YDYRY++ S ALG FANETV LTNG + R+++VL+GC+ES G S + Sbjct: 197 TPLSPCKYDYRYAESSGALGFFANETVRVPLTNGRRARLNDVLIGCTESIEGPKGASIRA 256 Query: 122 ADGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 DG++GLG+ +SF KAA G KFSYCLVDH+S +NVS Sbjct: 257 GDGILGLGFGKHSFVAKAASNLGDKFSYCLVDHMSNKNVS 296 >ref|XP_007049083.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] gi|508701344|gb|EOX93240.1| Eukaryotic aspartyl protease family 
protein, putative [Theobroma cacao] Length = 478 Score = 252 bits (644), Expect = 3e-64 Identities = 129/274 (47%), Positives = 173/274 (63%), Gaps = 3/274 (1%) Frame = -1 Query: 818 KLELIHRHHFRRNQANGMQ---PMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLES 648 + +LIHRH + +G P + ER++QL+HSD R +IS+RL Sbjct: 38 RFKLIHRHSPELGEDHGTTLGPPTSTRERIKQLVHSDNARLHTISQRL-----------G 86 Query: 647 PDATYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGS 468 P + SS E+ M S AD GTGQYFVSF+VGSP ++F++IADTGS Sbjct: 87 PRRMTFEMKMMGSSNLV-------ELPMRSAADIGTGQYFVSFRVGSPPKKFIMIADTGS 139 Query: 467 DLTWMNCKYRCHGAKCRKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPSP 288 LTWM C Y+C + R+F A+ S +F +PCSS +CK+EL+ FSLA CP+P Sbjct: 140 SLTWMRCSYKCKNFSMDRTKLHERIFYANQSRTFKPIPCSSDVCKVELSQSFSLALCPTP 199 Query: 287 HTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGVM 108 PCAYDYRY+DG+ +GIF N+TV L+ G K+++ +V+VGCSE+ G +F DGVM Sbjct: 200 MAPCAYDYRYADGTRVVGIFGNDTVKVRLSGGQKIKVTDVMVGCSEAIRG-NFHDIDGVM 258 Query: 107 GLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNV 6 GLG+ +SFA+KAA +FG KFSYCLVDHLSP N+ Sbjct: 259 GLGFDQHSFAVKAAKEFGDKFSYCLVDHLSPSNL 292 >ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella] gi|482566377|gb|EOA30566.1| hypothetical protein CARUB_v10013693mg [Capsella rubella] Length = 448 Score = 251 bits (641), Expect = 7e-64 Identities = 115/191 (60%), Positives = 148/191 (77%) Frame = -1 Query: 575 EMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLTWMNCKYRCHGAKCRKKSRKRR 396 +M + SG D+GT QYF +VG+PA++F ++ DTGS+LTW+NCKYR G + + RR Sbjct: 75 KMPLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCKYRGRG---KGRVENRR 131 Query: 395 VFRADHSSSFVTVPCSSRMCKIELANLFSLARCPSPHTPCAYDYRYSDGSSALGIFANET 216 VFRA+ S SF TV C ++ CK++L NLFSL+ CP+P TPC+YDYRY+DGS+A GIFA ET Sbjct: 132 VFRAEESKSFRTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGIFAKET 191 Query: 215 VTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGVMGLGYSNYSFALKAAGKFGGKFSYC 36 VT GLTNG K R+H +L+GCS S +GQSF+GADGV+GL +S++SF A FG KFSYC Sbjct: 192 VTVGLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGLAFSDFSFTSTATSLFGAKFSYC 
251 Query: 35 LVDHLSPQNVS 3 LVDHLSP+NVS Sbjct: 252 LVDHLSPKNVS 262 >ref|XP_010260839.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Nelumbo nucifera] Length = 481 Score = 248 bits (632), Expect = 8e-63 Identities = 136/276 (49%), Positives = 175/276 (63%), Gaps = 4/276 (1%) Frame = -1 Query: 818 KLELIHRH--HFRRNQANGMQPMTQIERLRQLLHSDTIRQRSISERLRLQKSRRRVLES- 648 + E+IHRH G+Q T++E++R+L+ D R + I R+ + RR+ E Sbjct: 28 RFEMIHRHSPELSGRLGAGLQK-TRLEQVRELVRLDEQRTQMIYHRIGQRTERRKDAEGG 86 Query: 647 PDATYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGS 468 D A T + +V M+SG+ G G YFV F+VG+PA+ +L+ADTGS Sbjct: 87 ADGQIGAAAWTGKVIGSSGASVP----MFSGSFAGEGLYFVPFRVGTPAQNVLLVADTGS 142 Query: 467 DLTWMNCKYRCHGAKCRKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPSP 288 DLTWMNC + C C +K +RR F AD SSSF T+PC SRMCK +LA +FSL CP P Sbjct: 143 DLTWMNCIHGCRN--CGRKVDRRRFFNADLSSSFTTIPCLSRMCKNDLAVMFSLTDCPKP 200 Query: 287 HTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQG-ADGV 111 PC YDY YS G SA G FANE+VT LTNG K++IH+VLVGC++++ GQ F DG+ Sbjct: 201 LNPCKYDYSYSSGQSAQGFFANESVTVRLTNGRKMKIHHVLVGCTQTTQGQKFSNVVDGI 260 Query: 110 MGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNVS 3 +GLGYS SFA K FG KFSYCLVDHLSP+NVS Sbjct: 261 LGLGYSPNSFATKVLQVFGSKFSYCLVDHLSPRNVS 296 >ref|XP_012486822.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Gossypium raimondii] Length = 490 Score = 247 bits (631), Expect = 1e-62 Identities = 130/281 (46%), Positives = 176/281 (62%), Gaps = 3/281 (1%) Frame = -1 Query: 839 HGESVGTKLELIHRHHFRRNQANGMQ---PMTQIERLRQLLHSDTIRQRSISERLRLQKS 669 HG+ K +LIHRH + +G P + ER++QL+HSDT R +IS RL ++ Sbjct: 42 HGK---VKFKLIHRHSPELGKMSGTTLGPPSSSRERIKQLIHSDTARLHAISHRLVPRRK 98 Query: 668 RRRVLESPDATYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFM 489 +V + N+ E+ M S AD GTGQYFVSF++GSP R+F+ Sbjct: 99 NFQV-----------------ETLRSSNLV-ELPMRSAADIGTGQYFVSFRIGSPPRKFI 140 Query: 488 LIADTGSDLTWMNCKYRCHGAKCRKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFS 309 +IADTGS +TWM CKY+C + R+F S +F+ +PC S MCK +LA FS Sbjct: 141 
MIADTGSTVTWMKCKYKCKTCFDDRIHHHERIFNPKTSRTFIPIPCLSSMCKQDLARSFS 200 Query: 308 LARCPSPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSF 129 L +C +PCAYD+RYSDG+ LGIF N+TV LTNG K+++ +V++GCSE+ G +F Sbjct: 201 LQKCHRSTSPCAYDFRYSDGTKVLGIFGNDTVIVRLTNGKKIKVPDVMIGCSETIFG-NF 259 Query: 128 QGADGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNV 6 DGVMGLG+ +SFA+KAA KFG KFSYCLVDHLSP ++ Sbjct: 260 HDIDGVMGLGFDQHSFAVKAAEKFGNKFSYCLVDHLSPSDL 300 >gb|KJB10346.1| hypothetical protein B456_001G196900 [Gossypium raimondii] Length = 480 Score = 247 bits (631), Expect = 1e-62 Identities = 130/281 (46%), Positives = 176/281 (62%), Gaps = 3/281 (1%) Frame = -1 Query: 839 HGESVGTKLELIHRHHFRRNQANGMQ---PMTQIERLRQLLHSDTIRQRSISERLRLQKS 669 HG+ K +LIHRH + +G P + ER++QL+HSDT R +IS RL ++ Sbjct: 32 HGK---VKFKLIHRHSPELGKMSGTTLGPPSSSRERIKQLIHSDTARLHAISHRLVPRRK 88 Query: 668 RRRVLESPDATYYHPACTNSSRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFM 489 +V + N+ E+ M S AD GTGQYFVSF++GSP R+F+ Sbjct: 89 NFQV-----------------ETLRSSNLV-ELPMRSAADIGTGQYFVSFRIGSPPRKFI 130 Query: 488 LIADTGSDLTWMNCKYRCHGAKCRKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFS 309 +IADTGS +TWM CKY+C + R+F S +F+ +PC S MCK +LA FS Sbjct: 131 MIADTGSTVTWMKCKYKCKTCFDDRIHHHERIFNPKTSRTFIPIPCLSSMCKQDLARSFS 190 Query: 308 LARCPSPHTPCAYDYRYSDGSSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSF 129 L +C +PCAYD+RYSDG+ LGIF N+TV LTNG K+++ +V++GCSE+ G +F Sbjct: 191 LQKCHRSTSPCAYDFRYSDGTKVLGIFGNDTVIVRLTNGKKIKVPDVMIGCSETIFG-NF 249 Query: 128 QGADGVMGLGYSNYSFALKAAGKFGGKFSYCLVDHLSPQNV 6 DGVMGLG+ +SFA+KAA KFG KFSYCLVDHLSP ++ Sbjct: 250 HDIDGVMGLGFDQHSFAVKAAEKFGNKFSYCLVDHLSPSDL 290 >emb|CDX73806.1| BnaA03g32410D [Brassica napus] Length = 412 Score = 246 bits (629), Expect = 2e-62 Identities = 114/202 (56%), Positives = 149/202 (73%) Frame = -1 Query: 608 SRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLTWMNCKYRCHG 429 S++ K + ++ + SG+D+G QYF +VG+PA+ F ++ DTGS+LTW+NC++R G Sbjct: 30 SQKRKTNGGGAKLPLRSGSDYGAAQYFADVRVGTPAKMFRVVVDTGSELTWVNCRFRGKG 89 Query: 428 
AKCRKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPSPHTPCAYDYRYSDG 249 + + +KRRVFRAD SSSF V C ++ CK++L NLFSL+ CP+P TPC+YDYRYSDG Sbjct: 90 ---KGREKKRRVFRADESSSFRQVGCLTQTCKVDLTNLFSLSNCPTPSTPCSYDYRYSDG 146 Query: 248 SSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGVMGLGYSNYSFALKA 69 S+A G+FA ET T GLTNG R+H +L+GCS S G SFQGADGV+GL S+YSF KA Sbjct: 147 SAAQGVFAKETFTVGLTNGRVARLHGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKA 206 Query: 68 AGKFGGKFSYCLVDHLSPQNVS 3 FGGKFSYCLVDH S +NVS Sbjct: 207 TNLFGGKFSYCLVDHRSHKNVS 228 >emb|CDY00795.1| BnaC03g37790D [Brassica napus] Length = 441 Score = 245 bits (626), Expect = 4e-62 Identities = 113/202 (55%), Positives = 149/202 (73%) Frame = -1 Query: 608 SRRAKHDNVSGEMAMYSGADFGTGQYFVSFKVGSPARRFMLIADTGSDLTWMNCKYRCHG 429 S++ K + ++ + SG+D+G QYF +VG+PA+ F ++ DTGS+LTW+NC++R G Sbjct: 60 SQKRKTNGGGAKLPLRSGSDYGAAQYFADVRVGTPAKMFRVVVDTGSELTWVNCRFRGKG 119 Query: 428 AKCRKKSRKRRVFRADHSSSFVTVPCSSRMCKIELANLFSLARCPSPHTPCAYDYRYSDG 249 R + +KRRVFRA+ SSSF V C ++ CK++L NLFSL+ CP+P TPC+YDYRY+DG Sbjct: 120 ---RGREKKRRVFRAEESSSFRQVGCLTQTCKVDLMNLFSLSNCPTPSTPCSYDYRYADG 176 Query: 248 SSALGIFANETVTFGLTNGTKVRIHNVLVGCSESSTGQSFQGADGVMGLGYSNYSFALKA 69 S+A G+FA ET T GLTNG R+H +L+GCS S G SFQGADGV+GL S+YSF KA Sbjct: 177 SAAQGVFAKETFTVGLTNGRVARLHGLLIGCSSSFNGDSFQGADGVLGLALSDYSFTSKA 236 Query: 68 AGKFGGKFSYCLVDHLSPQNVS 3 FGGKFSYCLVDH S +NVS Sbjct: 237 TNLFGGKFSYCLVDHRSHENVS 258