BLASTX nr result
ID: Cheilocostus21_contig00043880
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cheilocostus21_contig00043880 (719 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KMS94764.1| hypothetical protein BVRB_015510 [Beta vulgaris s... 107 6e-23 ref|XP_020101642.1| uncharacterized protein LOC109719402 [Ananas... 103 2e-21 ref|XP_009773064.1| PREDICTED: uncharacterized protein LOC104223... 103 2e-21 dbj|GAV90231.1| gag-asp_proteas domain-containing protein, parti... 98 6e-21 dbj|GAV84407.1| gag-asp_proteas domain-containing protein [Cepha... 100 6e-21 dbj|GAV86380.1| gag-asp_proteas domain-containing protein, parti... 96 8e-21 gb|OMO54028.1| Aspartic peptidase [Corchorus capsularis] 93 5e-20 ref|XP_020114727.1| uncharacterized protein LOC109728667 [Ananas... 98 1e-19 ref|XP_017227982.1| PREDICTED: uncharacterized protein LOC108203... 98 1e-19 ref|XP_015057527.1| PREDICTED: uncharacterized protein LOC107003... 97 2e-19 ref|XP_015084310.1| PREDICTED: uncharacterized protein LOC107027... 97 4e-19 dbj|GAV58890.1| gag-asp_proteas domain-containing protein [Cepha... 93 4e-19 ref|XP_010910816.1| PREDICTED: uncharacterized protein LOC105036... 96 4e-19 ref|XP_010910353.2| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 96 7e-19 ref|XP_010314143.1| PREDICTED: uncharacterized protein LOC104644... 92 7e-19 dbj|GAV91234.1| gag-asp_proteas domain-containing protein, parti... 92 1e-18 dbj|GAV59676.1| gag-asp_proteas domain-containing protein [Cepha... 94 1e-18 gb|EOY26319.1| Uncharacterized protein TCM_027814 [Theobroma cacao] 95 1e-18 dbj|GAV69881.1| gag-asp_proteas domain-containing protein [Cepha... 93 1e-18 dbj|GAV92048.1| gag-asp_proteas domain-containing protein [Cepha... 91 3e-18 >gb|KMS94764.1| hypothetical protein BVRB_015510 [Beta vulgaris subsp. vulgaris] Length = 638 Score = 107 bits (267), Expect = 6e-23 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 6/172 (3%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPK------D*FVTLNPMKINTITASTSHSPLAMSP 225 C+ CGG H+ RN K+ ++ ++ E + + +NP++ + +TS P Sbjct: 146 CYECGGLHYARNCPKKGKLGALMAEEPVKDSEDETPIRVNPLRFLNVARTTSFDPYG--- 202 Query: 226 KLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGIS 405 LLYVD++++G ++AMIDTGA+H FV + L L + + MK+VN G Sbjct: 203 GLLYVDVRVNGHSIKAMIDTGATHNFVSAEMAQNFNLPLTNCANWMKVVNSEALVAKGSV 262 Query: 406 YGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNE 561 G +++G W G+ + + LDDFE ILG D +R A+V+V+P+L G+ I +E Sbjct: 263 RGADMQVGNWEGKCDLLAITLDDFELILGNDFIRKAKVSVMPHLGGILIGDE 314 >ref|XP_020101642.1| uncharacterized protein LOC109719402 [Ananas comosus] Length = 1464 Score = 103 bits (257), Expect = 2e-21 Identities = 56/171 (32%), Positives = 92/171 (53%), Gaps = 5/171 (2%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPLAMSPK----- 228 C+ CG H+ R+ + +++ + E + + P K+ + + LA S K Sbjct: 352 CYVCGENHWARDCPNKKKLNAVQTEQGE--GSAEPTKMGALRLVNAVQGLASSSKPLNKE 409 Query: 229 LLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISY 408 LLY+D+ ++G +A++DTGA+H FV E IGL L ++K VN Q + G++ Sbjct: 410 LLYIDVTLNGRATRALVDTGATHNFVAEAEAKRIGLPLEKDASRIKAVNSEAQPVAGVAK 469 Query: 409 GVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNE 561 GV + +GPWRG + +DDF+ ILGMD L ++ +P+L L I +E Sbjct: 470 GVAIAVGPWRGTANFTAAPIDDFQVILGMDFLASSKAVPMPHLGALSIMDE 520 >ref|XP_009773064.1| PREDICTED: uncharacterized protein LOC104223337 [Nicotiana sylvestris] Length = 1124 Score = 103 bits (256), Expect = 2e-21 Identities = 57/182 (31%), Positives = 100/182 (54%), Gaps = 10/182 (5%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*--------FVTLNPMKINTITASTSHSPLAM 219 C++CGG H +N R+R++ ++ + +NP+++ + + + + Sbjct: 445 CWTCGGPHLAKNCPNRERVNALLATENNRDGEGQEVAAALVNPLQLLNVISLVNATSNET 504 Query: 220 SPK--LLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCI 393 +P L++++++I V AM+DTGA+H FV H GL + MK VN Q I Sbjct: 505 NPHSLLIHIEMRIGDKSVIAMVDTGATHTFVSANLVHKYGLKVTKCPSYMKTVNAKAQAI 564 Query: 394 CGISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLG 573 G++Y V + +G W+G+ ++ V+ L++FE ILG+D +R R +P+L G+ I NE G Sbjct: 565 VGMAYDVPMSVGNWKGKVNLMVIPLEEFEIILGIDFMRKHRFVPMPHLDGVMIMNEMSPG 624 Query: 574 FV 579 FV Sbjct: 625 FV 626 >dbj|GAV90231.1| gag-asp_proteas domain-containing protein, partial [Cephalotus follicularis] Length = 244 Score = 97.8 bits (242), Expect = 6e-21 Identities = 58/169 (34%), Positives = 99/169 (58%), Gaps = 8/169 (4%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIV------VEPKD*FVT-LNPMKI-NTITASTSHSPLAM 219 CF C G H ++ K+++++ +V VEP + T +NP+++ +TI T +P Sbjct: 11 CFLCNGPHQAKDCPKKEKLNVLVAEEIGNVEPNEGCPTRVNPLQLLSTIHEVTQATPF-- 68 Query: 220 SPKLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICG 399 P LLYV + ++ + + AMI+TGASH FV E +G+ + +K VN A + + G Sbjct: 69 -PGLLYVKVVLNSMEIYAMINTGASHNFVNEKIVGKLGVKVDKHTSNIKAVNTAARLVQG 127 Query: 400 ISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL 546 ++ V V++G WRG+ ++ ++ LDDF+ I G+D L +V +P+L GL Sbjct: 128 MTRDVSVQVGIWRGQLNLMIVLLDDFDVIFGIDFLTRNKVAPMPHLEGL 176 >dbj|GAV84407.1| gag-asp_proteas domain-containing protein [Cephalotus follicularis] Length = 375 Score = 100 bits (248), Expect = 6e-21 Identities = 67/236 (28%), Positives = 121/236 (51%), Gaps = 19/236 (8%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVT--LNPMKINTITASTSHSPLAMS---PK 228 CF C G H R+ ++++++T+V F P+++N I ++ +A + P Sbjct: 106 CFLCDGPHRARDCPRKEKLNTLVAGEMGCFEPDGEGPIRVNPIQLLSTIREVAQTKPFPG 165 Query: 229 LLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISY 408 LLYV + ++ + + AMIDTGASH FV E GL + ++K+VN Q + G++ Sbjct: 166 LLYVKVVLNSVEIYAMIDTGASHNFVNERIVGKFGLKIEKHTSKIKVVNADAQPVQGVAR 225 Query: 409 GVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGFVY-- 582 V++++G W+G+ ++ ++ LDDF+ I G+D L ++ +P+L GL E FV+ Sbjct: 226 DVLLQVGAWKGQLNLMIVSLDDFDVIFGIDFLTRSKAVPMPHLKGLMFMGENQPCFVFGN 285 Query: 583 -----YLGD-------STLARDDRDQKRDVFGPASVSTIYVVSVIGGDLYICIYIL 714 Y G+ ST+ D +K D T Y+ S++ Y+ + ++ Sbjct: 286 MMEDHYCGNKGKNSMLSTMQISDGLRKGD--------TTYIASLVELKPYVVVEVV 333 >dbj|GAV86380.1| gag-asp_proteas domain-containing protein, partial [Cephalotus follicularis] Length = 199 Score = 96.3 bits (238), Expect = 8e-21 Identities = 57/173 (32%), Positives = 96/173 (55%), Gaps = 1/173 (0%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPLAMSPK-LLYV 240 CF C G H R+ +R ++T++ + ++ T + ++ A+ P LLYV Sbjct: 1 CFICEGPHRARDCPRRCALNTMMAQRENGGNTDSEAPTRVAPLQLINALRAVPPSGLLYV 60 Query: 241 DIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISYGVIV 420 ++++ G V AM+DTGA+H F+ E +GL + ++K VN Q + G+++GV + Sbjct: 61 NLRVQGQQVSAMVDTGATHSFLAERMVTQLGLEVDKHGSRIKAVNSQAQAMAGMAHGVQI 120 Query: 421 ELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGFV 579 +G W G+ + V+ LDDF+ ILG + +V +IP+L G+ I NER FV Sbjct: 121 AMGEWAGKIDLMVVPLDDFDLILGNNFFITEKVLIIPHLCGIFITNERNSCFV 173 >gb|OMO54028.1| Aspartic peptidase [Corchorus capsularis] Length = 160 Score = 93.2 bits (230), Expect = 5e-20 Identities = 51/158 (32%), Positives = 87/158 (55%), Gaps = 1/158 (0%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPLAMSPK-LLYV 240 CF C G H +R+ KR ++S I E + +K+ +I S P K L+++ Sbjct: 3 CFLCEGPHRVRDSPKRSKLSAIAREEQQPEKEKETLKLGSILLSVE--PKKRRKKGLMFM 60 Query: 241 DIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISYGVIV 420 D++++G V A++DTGAS LFV E +GL + +K VN E G++ GV + Sbjct: 61 DMEVAGYKVNAIVDTGASDLFVSEGGAKKLGLKVDKGQGWIKTVNSKETPTMGVAQGVEL 120 Query: 421 ELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPY 534 +LG W G+ ++ V+ LDD++ ++G+D L ++P+ Sbjct: 121 KLGAWSGKDNIEVIPLDDYDFVVGLDFLDRINALLVPF 158 >ref|XP_020114727.1| uncharacterized protein LOC109728667 [Ananas comosus] Length = 601 Score = 97.8 bits (242), Expect = 1e-19 Identities = 56/171 (32%), Positives = 89/171 (52%), Gaps = 5/171 (2%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPLAMSPK----- 228 C+ CG H+ R+ K+ +++ I + K P KI + + LA S K Sbjct: 66 CYVCGKNHWARDCRKKKQVNAI--QTKQGEGNAEPTKIGALRLVNAVQGLASSSKPHNKE 123 Query: 229 LLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISY 408 L ++D+ ++G +A++DTGA+H FV E IGL L ++K VN Q + G++ Sbjct: 124 LSHIDVTLNGRTTRALVDTGATHNFVAEMEAKQIGLPLEKDAGRIKAVNSEAQPVAGVAN 183 Query: 409 GVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNE 561 GV + +GPWRG + +DDF+ IL MD L + +P+L L I +E Sbjct: 184 GVAIAMGPWRGNANFTAAPIDDFQVILSMDFLASWKAVPMPHLGALSIIDE 234 >ref|XP_017227982.1| PREDICTED: uncharacterized protein LOC108203519 [Daucus carota subsp. sativus] Length = 715 Score = 97.8 bits (242), Expect = 1e-19 Identities = 68/221 (30%), Positives = 116/221 (52%), Gaps = 3/221 (1%) Frame = +1 Query: 1 KKVASAPKQLMKVT--MLSRPTFCFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMK 174 KKV + KV+ +P C+ C G H +++ K++++S++ E + +NP++ Sbjct: 263 KKVQDGDQGKAKVSGGQTKKPLRCYICEGPHVMKDCPKKEKLSSLREESGEEPTRVNPLR 322 Query: 175 -INTITASTSHSPLAMSPKLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSA 351 +N + + S LLYV + I+G V AM+DTGA++ FV + +GL L ++ Sbjct: 323 MLNALQTKEA----VKSSSLLYVQVAINGHDVMAMVDTGATNNFVADRNVEFLGLALKAS 378 Query: 352 LDQMKMVNIAEQCICGISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIP 531 ++K VN Q I G S I +G W G+ ++ V+ +DDF+ ILG+D L A+ +V+P Sbjct: 379 TSRVKAVNYEAQLIKGSSQSDIT-VGSWTGKVNLFVVPVDDFDVILGIDFLLKAKASVMP 437 Query: 532 YLHGL*ICNERLLGFVYYLGDSTLARDDRDQKRDVFGPASV 654 ++ GL I + FV L RD +K ++ V Sbjct: 438 HIGGLMIEDASNPCFV----KGVLGRDSGKKKTELLSAMQV 474 >ref|XP_015057527.1| PREDICTED: uncharacterized protein LOC107003751 [Solanum pennellii] Length = 561 Score = 97.4 bits (241), Expect = 2e-19 Identities = 64/207 (30%), Positives = 107/207 (51%), Gaps = 20/207 (9%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVV-------EPKD*FVTL-NP--------MKINTI-TA 192 C++CGG H ++ R++++ ++ E ++ + NP M IN + Sbjct: 333 CWTCGGPHLAKSCPNREKVNALLAGNVNQREEDEEIVAAMANPLGLSFNHIMGINNVGEI 392 Query: 193 STSHSPLAMSPKLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMV 372 S++ +P A L+++++++ V AM+DTGA+H FV +GL L + +K V Sbjct: 393 SSTSNPHA---SLIHIEMKVKEQCVMAMVDTGATHTFVDVKIATKLGLKLSKSPSYVKTV 449 Query: 373 NIAEQCICGISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*I 552 N Q I G++YGV + G W G+H++ VM L DFE ILG+D LR + P+L G+ + Sbjct: 450 NAKAQAIVGMAYGVSMSTGSWVGKHNLMVMPLGDFEIILGIDFLRKFQFVPFPHLDGVMV 509 Query: 553 CNERLLGF---VYYLGDSTLARDDRDQ 624 N GF V+ GD +D+ Sbjct: 510 MNGSNAGFLKGVHPFGDINKVAKKKDK 536 >ref|XP_015084310.1| PREDICTED: uncharacterized protein LOC107027747 [Solanum pennellii] Length = 1099 Score = 96.7 bits (239), Expect = 4e-19 Identities = 60/189 (31%), Positives = 101/189 (53%), Gaps = 17/189 (8%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVV-------EPKD*FVTL-NP--------MKINTI-TA 192 C++CGG H ++ R++++ ++ E ++ + NP M IN + Sbjct: 333 CWTCGGPHLAKSCPNREKVNALLAGNVNQREEDEEIVAAMANPLGLSFNHIMGINNVGEI 392 Query: 193 STSHSPLAMSPKLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMV 372 S++ +P A L+++++++ V AM+DTGA+H FV +GL L + +K V Sbjct: 393 SSTSNPHA---SLIHIEMKVKEQCVMAMVDTGATHTFVDVKIATKLGLKLSKSPSYVKTV 449 Query: 373 NIAEQCICGISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*I 552 N Q I G++YGV + G W G+H++ VM L DFE ILG+D LR + P+L G+ + Sbjct: 450 NAKAQAIVGMAYGVSMSTGSWVGKHNLMVMPLGDFEIILGIDFLRKFQFVPFPHLDGVMV 509 Query: 553 CNERLLGFV 579 N GF+ Sbjct: 510 MNGSNAGFL 518 >dbj|GAV58890.1| gag-asp_proteas domain-containing protein [Cephalotus follicularis] Length = 262 Score = 93.2 bits (230), Expect = 4e-19 Identities = 60/180 (33%), Positives = 99/180 (55%), Gaps = 8/180 (4%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPL-------AMS 222 CF C G H R+ +R ++ ++ + + N + ++ A T SPL A+ Sbjct: 67 CFICEGPHRARDCPRRGALNAMMAQGE------NGVNADS-EAPTRVSPLQLINALRAVP 119 Query: 223 PK-LLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICG 399 P LLYV++++ G V AM+DTGA+H F+ E +GL + ++K VN Q + G Sbjct: 120 PSGLLYVNLRVQGQQVSAMVDTGATHSFLAERMVTQLGLRVDKHGSRIKTVNSQAQAVVG 179 Query: 400 ISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGFV 579 +++GV + +G W G+ + V+ LDDF+ ILG + +V ++P+L G+ I NER FV Sbjct: 180 MAHGVQIAMGEWAGKIDLMVVPLDDFDLILGNNFFVTEKVLIMPHLCGIFITNERNPCFV 239 >ref|XP_010910816.1| PREDICTED: uncharacterized protein LOC105036779 [Elaeis guineensis] Length = 637 Score = 96.3 bits (238), Expect = 4e-19 Identities = 54/173 (31%), Positives = 96/173 (55%), Gaps = 1/173 (0%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTIT-ASTSHSPLAMSPKLLYV 240 CF C G H ++ K++R++ +VVE +P +IN + + + ++ L+Y+ Sbjct: 280 CFICSGPHRAKDCPKKERLN-VVVEEGGRGDEASPSRINPLQLVNAMRAERSIPSDLMYM 338 Query: 241 DIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISYGVIV 420 +++ G + AM DTGA+H F+ E H +GL + + ++K VN + + GI+ GV V Sbjct: 339 KVRLGGKEMLAMADTGATHNFMTERIAHELGLKVTKSSSKIKAVNSVARDVAGIAAGVQV 398 Query: 421 ELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGFV 579 LG W G ++ LDDF+ ILG++ A+ ++P+L GL + +E F+ Sbjct: 399 SLGSWTGVLGFTIITLDDFDIILGIEFFVQAKAALLPHLGGLMLLHEEYPCFI 451 >ref|XP_010910353.2| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105036288 [Elaeis guineensis] Length = 1339 Score = 95.9 bits (237), Expect = 7e-19 Identities = 54/173 (31%), Positives = 95/173 (54%), Gaps = 1/173 (0%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITA-STSHSPLAMSPKLLYV 240 CF C G H ++ K++R++ VVE +P ++N + + + ++ LLY+ Sbjct: 307 CFICSGLHRAKDCPKKERLNA-VVEEDGRGDEASPSRVNPLQLINAMRAERSIPSGLLYM 365 Query: 241 DIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISYGVIV 420 +++ G + AM DTGA+H F+ E H +GL + + ++K VN + + GI+ GV V Sbjct: 366 KVRLGGKEMLAMADTGATHNFMTERTAHELGLKVTKSSSKIKAVNSVARDVTGIAAGVQV 425 Query: 421 ELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGFV 579 LG W G ++ LDDF+ ILG++ A+ ++P+L GL + +E F+ Sbjct: 426 SLGSWTGVLGFTIVTLDDFDIILGIEFFVQAKAALLPHLGGLMLLHEEYPCFI 478 >ref|XP_010314143.1| PREDICTED: uncharacterized protein LOC104644959 [Solanum lycopersicum] Length = 238 Score = 92.0 bits (227), Expect = 7e-19 Identities = 59/189 (31%), Positives = 98/189 (51%), Gaps = 17/189 (8%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVV-------EPKD*FVTL-NP--------MKINTI-TA 192 C++CGG H ++ R++++ ++ E ++ + NP M IN + Sbjct: 18 CWTCGGPHLAKSCPNREKVNALLAGNVYQREEDEEIVAAMANPLGLSFNHIMGINNVGEI 77 Query: 193 STSHSPLAMSPKLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMV 372 S + +P A L+Y+++++ V AM+DTGA H FV +GL L + +K V Sbjct: 78 SNTSNPHA---SLIYIEMKVKEQCVMAMVDTGAIHTFVDVKIATKLGLKLSKSPSYVKTV 134 Query: 373 NIAEQCICGISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*I 552 N Q I G++Y V + G W G+H++ VM L DFE ILG+D LR + P+L G+ + Sbjct: 135 NAKAQAIVGMAYSVSMSTGNWVGKHNLMVMPLGDFEIILGIDFLRKYQFVPFPHLDGVMV 194 Query: 553 CNERLLGFV 579 + GF+ Sbjct: 195 MSGSDAGFL 203 >dbj|GAV91234.1| gag-asp_proteas domain-containing protein, partial [Cephalotus follicularis] Length = 252 Score = 92.0 bits (227), Expect = 1e-18 Identities = 50/168 (29%), Positives = 95/168 (56%), Gaps = 1/168 (0%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPLAMSPK-LLYV 240 CF C G H R++ ++D ++ ++ + ++ T + ++ A+ P LLYV Sbjct: 72 CFICEGPHRARDYPRKDELNEMMDQGENGGNTDSEAPTRVAPLQLINALRAVPPSGLLYV 131 Query: 241 DIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISYGVIV 420 ++++ G V AM+DTGA+H F+ E +GL + + ++K VN Q + +++GV + Sbjct: 132 NLRVQGQQVSAMVDTGATHSFLAERMVTQLGLRVDTHGSRIKAVNSQAQAVASMAHGVQI 191 Query: 421 ELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNER 564 +G W G+ + V+ LD+F+ ILG + +V ++P++ G+ I NER Sbjct: 192 AMGEWAGKIDLMVVPLDNFDLILGNNFFVTEKVLIMPHICGIFITNER 239 >dbj|GAV59676.1| gag-asp_proteas domain-containing protein [Cephalotus follicularis] Length = 380 Score = 94.0 bits (232), Expect = 1e-18 Identities = 53/173 (30%), Positives = 93/173 (53%), Gaps = 1/173 (0%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPLAMSPK-LLYV 240 CF C G H R++ K+ +++ +V + + + ++ + P LLYV Sbjct: 67 CFICEGLHHARDYPKQGKLNVVVAQSESGGTIDSEAPTRVAPLQLINALRTVPPSDLLYV 126 Query: 241 DIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISYGVIV 420 + + G V AM+DTGA+H F+VE + L + Q+K VN Q + G+++G+ + Sbjct: 127 LMMVQGHQVSAMVDTGATHSFLVERMVDRLSLRVDKHTSQIKAVNSQAQVVAGMAHGLFI 186 Query: 421 ELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGFV 579 +G W G+ ++ V+ L+ F+ ILG D + +V ++P+L GL I NE+ FV Sbjct: 187 SMGSWEGKINLMVVPLEFFDLILGNDFISE-KVIMMPHLCGLFIMNEKSPNFV 238 >gb|EOY26319.1| Uncharacterized protein TCM_027814 [Theobroma cacao] Length = 1001 Score = 95.1 bits (235), Expect = 1e-18 Identities = 53/165 (32%), Positives = 90/165 (54%), Gaps = 4/165 (2%) Frame = +1 Query: 52 RPTFCFSCGGYHFIRNFLKRDRISTIVVEPK----D*FVTLNPMKINTITASTSHSPLAM 219 +P CF C G HF+R++ +R +++ I E + D V L M++ + + Sbjct: 328 KPKNCFLCEGPHFVRDYPQRAKLAAIASEDEEQQGDETVRLGSMQLGAVRKGGKQAK--- 384 Query: 220 SPKLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICG 399 L+Y D+ ++G V+A++DTGAS LFV E + L SA +K VN I G Sbjct: 385 --GLMYADMVVAGQHVEALVDTGASDLFVSEQGAAKLDLKADSAGGWVKTVNSKWVRIKG 442 Query: 400 ISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPY 534 I+ G+ V+LG W G + V+Q+DD+E ++G++ L + ++P+ Sbjct: 443 IAKGIDVQLGEWHGTEDIEVIQMDDYEVVMGLNFLERIQALLVPH 487 >dbj|GAV69881.1| gag-asp_proteas domain-containing protein [Cephalotus follicularis] Length = 341 Score = 93.2 bits (230), Expect = 1e-18 Identities = 55/173 (31%), Positives = 92/173 (53%), Gaps = 1/173 (0%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVVEPKD*FVTLNPMKINTITASTSHSPLAMSPK-LLYV 240 CF C G + R+ +R ++ ++ + ++ + ++ A+ P LLYV Sbjct: 67 CFICEGPYRARDCPRRGALNAMMAQGENGGDADSEAPTRVTPLQLINALRAVPPSGLLYV 126 Query: 241 DIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCICGISYGVIV 420 ++I G V AM+D GA+H F+ E + + L + ++K VN Q + G++YGV + Sbjct: 127 SMRIQGQQVSAMVDIGATHSFLAERMVNQLDLKVDKHGSRIKAVNSQAQAVAGMAYGVPI 186 Query: 421 ELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGFV 579 +G W G+ + V+ LDDF+ ILG D +V ++PYL GL I NE + FV Sbjct: 187 AMGEWAGKIDLMVVPLDDFDLILGNDFFISEKVIMMPYLSGLLIMNENIPCFV 239 >dbj|GAV92048.1| gag-asp_proteas domain-containing protein [Cephalotus follicularis] Length = 267 Score = 90.9 bits (224), Expect = 3e-18 Identities = 55/182 (30%), Positives = 101/182 (55%), Gaps = 9/182 (4%) Frame = +1 Query: 64 CFSCGGYHFIRNFLKRDRISTIVV------EPKD*FVTLNPMKINTITASTSHSPLAMS- 222 CF C G H R+ ++++++++V EP + P ++N I ++ +A + Sbjct: 67 CFLCDGPHRARDCPRKEKLNSLVAAEAGNSEPD----SEGPTRVNPIQLLSTVREVAQAK 122 Query: 223 --PKLLYVDIQISGLLVQAMIDTGASHLFVVEPATHHIGLCLFSALDQMKMVNIAEQCIC 396 P LLYV + ++ + + AMID+GASH FV E +GL + ++K VN + I Sbjct: 123 PFPGLLYVKVILNSVKIYAMIDSGASHNFVNERIVGKLGLKIEKHTSKIKAVNADARPIQ 182 Query: 397 GISYGVIVELGPWRGRHHVPVMQLDDFEAILGMDILRHARVTVIPYLHGL*ICNERLLGF 576 G++ V +++G W+G+ ++ ++ LDDF+ I G+D L + +P+L GL +E+ F Sbjct: 183 GVARDVPLQVGAWKGQLNLMIVPLDDFDVIFGIDFLVRNKAVPMPHLKGLMFVDEKQHCF 242 Query: 577 VY 582 V+ Sbjct: 243 VW 244