BLASTX nr result
ID: Cornus23_contig00022728
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00022728
         (927 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004368288.1| cysteine proteinase precursor, putative [Aca...  212   3e-52
dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]   205   4e-50
ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castella...  200   1e-48
ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] gi...  190   2e-45
gb|AAF75547.1| cruzipain [Trypanosoma cruzi]                         183   1e-43
gb|AAB41118.1| cruzipain [Trypanosoma cruzi]                         183   1e-43
gb|AAB41119.1| cruzipain [Trypanosoma cruzi]                         183   2e-43
ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL...  183   2e-43
gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marin...  183   2e-43
gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]          183   2e-43
gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]               182   3e-43
ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain C...  182   3e-43
sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cr...  182   3e-43
gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]                182   3e-43
ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL...  181   5e-43
gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]          181   5e-43
gb|AAF75546.1| cruzipain [Trypanosoma cruzi]                         180   1e-42
ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL...  180   1e-42
ref|XP_012755566.1| hypothetical protein SAMD00019534_046220 [Ac...  179   3e-42
ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureo...
179 3e-42 >ref|XP_004368288.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff] gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii str. Neff] Length = 330 Score = 212 bits (540), Expect = 3e-52 Identities = 122/298 (40%), Positives = 166/298 (55%), Gaps = 16/298 (5%) Frame = +2 Query: 71 YVAKYGLTYSAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEFESTYLGS 250 + A+YG +Y++EE+ R ++ N+ R L N A G F+ LT +EF++TYL Sbjct: 35 FAAQYGKSYASEEFGERLRIFRDNLDRIDALNSANTGARYGVNKFADLTPKEFKATYLKG 94 Query: 251 KLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAFSATETIE 430 T + +G P P+ +DW G ++P KDQGQCG WAFS TE IE Sbjct: 95 ARSAGQKKAAAT----AKLDMTG-PLPSQFDWRDKGAVTPTKDQGQCG--WAFSVTEAIE 147 Query: 431 SYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXF--PSRAYNYVKSAGGLDTEASYPYHAGN 604 S W+L+G+ V+L+P+QIV P AY YV AGGLDTE SYPY A + Sbjct: 148 SQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYEYVIKAGGLDTEESYPYTAED 207 Query: 605 GKAGTCAFKANSIGAKITGYTAING-------ESGIYRQASXSICVDASSWNSYKSGTLT 763 G+ CAFK +++GAKI+ +T I + G+ + SICVDASSW Y G +T Sbjct: 208 GQ---CAFKPSAVGAKISNWTYITTTKNETEMQYGLASRGPLSICVDASSWQYYIGGVIT 264 Query: 764 S-CGNSVDHCVQLTGYSNYPS------GYWNVRNSWGTGWGEAGYIHLAIGKNLCNLG 916 S C +S+DHCV +TGYS WN+RNSWG WG GY+++ G NLC +G Sbjct: 265 SLCEDSLDHCVMITGYSVQEGWDFMKYDVWNIRNSWGEDWGYGGYLYVQRGSNLCGVG 322 >dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. 
vulgare] Length = 341 Score = 205 bits (522), Expect = 4e-50 Identities = 127/310 (40%), Positives = 165/310 (53%), Gaps = 23/310 (7%) Frame = +2 Query: 56 ADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEFE 232 A + + A++ Y S EEY R + N+ R AKL ++ V G T F +T EF+ Sbjct: 30 AKFQEFTARFSKNYKSVEEYTTRYATFLDNLERVAKLNQDGR-GVFGVTKFMDMTPAEFK 88 Query: 233 STYLGSKLEFNVYDIPETNFTAQSIPASGAPNPT-NYDWSAAGCISPVKDQGQCGSCWAF 409 +TYLG K + P A+ P N T + DW G ++PVKDQ QCGSCWAF Sbjct: 89 ATYLGFKPDEMA---PPKAPVAR--PHRAKRNATGSVDWRTKGAVTPVKDQAQCGSCWAF 143 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYVKSAGGLDTEASYP 589 SATE IES W+LAG ++LSP+QIV + AY YV+SAGGLDT+A+YP Sbjct: 144 SATEQIESNWFLAGNELISLSPQQIVSCDTTDGGCGGGWTYTAYQYVQSAGGLDTDAAYP 203 Query: 590 YHAGNGKAGTCAFK-ANSIGAKITGY---------TAINGESGIYRQ-----ASXSICVD 724 Y +G G GTC S A+I+G+ + N + Q + S+CVD Sbjct: 204 YSSGAGVTGTCDNPLPASPAAQISGFGYAIPTCSDSCTNQDENSMAQYMQENSPLSVCVD 263 Query: 725 ASSWNSYKSGTLT-----SCGNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHL 886 A W Y SG +T S + +DHCVQ GY S YW VRNSW T WGE G+I L Sbjct: 264 AEPWQFYSSGIMTVDQCPSDFSGLDHCVQAVGYDATGSQPYWIVRNSWNTNWGEDGFIRL 323 Query: 887 AIGKNLCNLG 916 A+G N C +G Sbjct: 324 ALGTNTCGIG 333 >ref|XP_004335426.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff] gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. 
Neff] Length = 331 Score = 200 bits (509), Expect = 1e-48 Identities = 121/294 (41%), Positives = 160/294 (54%), Gaps = 15/294 (5%) Frame = +2 Query: 71 YVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQ-EENPLAVMGETIFSHLTQQEFESTYL 244 +V +YG +Y SAEE + R ++ N+ A L + G T F+ ++Q+EF+S L Sbjct: 37 FVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQSRVL 96 Query: 245 GSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDW-SAAGCISPVKDQGQCGSCWAFSATE 421 S P T + G P+ +DW + G ++PV DQGQCGSCWAFSATE Sbjct: 97 MSNPP-----PPPTEKPYRGPKFEGFTAPSTFDWRNKPGVVTPVYDQGQCGSCWAFSATE 151 Query: 422 TIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYVKSAGGLDTEASYPYHAG 601 IES W LAG LS +QIV FPS AY+YV A GLD A+YPY A Sbjct: 152 NIESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDALANYPYTA- 210 Query: 602 NGKAGTCAFKANSIGAKITGYTAINGESGIYRQAS-------XSICVDASSWNSYKSGT- 757 G+CAFK + + AKI+ +T +S ++ A+ S+CVDA SW SY G Sbjct: 211 --VGGSCAFKESQVVAKISSWTYTTTDSNEHQMANYLAQHGPISVCVDAESWPSYTGGVY 268 Query: 758 -LTSCGNSVDHCVQLTGY---SNYPSGYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 ++CG S+DHCV GY +N P YW +RNSWGT WG GY+HL G + C Sbjct: 269 RASACGTSIDHCVLAVGYNLTANPP--YWIIRNSWGTSWGLEGYMHLEFGTDAC 320 >ref|XP_013762925.1| cruzipain [Thecamonas trahens ATCC 50062] gi|906552699|gb|KNC45942.1| cruzipain [Thecamonas trahens ATCC 50062] Length = 394 Score = 190 bits (482), Expect = 2e-45 Identities = 121/300 (40%), Positives = 155/300 (51%), Gaps = 18/300 (6%) Frame = +2 Query: 68 TYVAKYGLTYSAEEYQFRETVYATNMVRAAKLQEENPLA----VMGETIFSHLTQQEFES 235 TY +Y + AE F V+ TN +AAKL+ N A G + F LT+ EF++ Sbjct: 29 TYKRQYASS-KAEAAAFE--VFKTNAEKAAKLEAANKAAGGDAKFGMSPFMDLTENEFKA 85 Query: 236 TYLGSK--LEFNVYDIPETNFTAQSIPASGAPNPTNYDWS--AAGCISPVKDQGQCGSCW 403 YL K +E ++P A ++ A P YDW I+PVK+QGQCGSCW Sbjct: 86 RYLMPKGAVEGGAAELPVLR--ASNVGAL----PKAYDWRDHKPAVITPVKNQGQCGSCW 139 Query: 404 AFSATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYVKSAGGLDTEAS 583 AFSA +ES W LAG V LS +Q+V AY+Y++ AGGL E Sbjct: 140 AFSAVSEVESMWALAGHELVVLSEQQVVDCDTTDDGCNGGDTISAYHYIEKAGGLVPEKD 199 Query: 584 
YPYHAGNGKAGTCAFKANSIGAKITGYTAINGES---------GIYRQASXSICVDASSW 736 YPY A +GK K +++ AKI GY S + SICVDASSW Sbjct: 200 YPYTARDGKCKDSVVKKDAV-AKIMGYNYATSPSTKNETQLAANLMSTGPVSICVDASSW 258 Query: 737 NSYKSGTLTSCGNSVDHCVQLTGYSNYPSG-YWNVRNSWGTGWGEAGYIHLAIGKNLCNL 913 +Y SG L+ CG +DHCVQ+TG+ S YW VRNSW T WG +GYI L G+N C L Sbjct: 259 QTYTSGILSHCGKQLDHCVQITGWGTSGSEMYWWVRNSWATSWGMSGYIQLKFGQNTCGL 318 >gb|AAF75547.1| cruzipain [Trypanosoma cruzi] Length = 467 Score = 183 bits (465), Expect = 1e-43 Identities = 108/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 ASQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E ++ GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 WSRYHNGAAHFAAAQ--ERARVPVNVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ I + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVEIPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWVIKNSWTTHWGEGGYIRIAKGSNQC 325 >gb|AAB41118.1| cruzipain [Trypanosoma cruzi] Length = 383 Score = 183 bits (465), Expect = 1e-43 Identities = 107/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 ASQFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTAFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y 
F E ++ GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVNVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQC 325 >gb|AAB41119.1| cruzipain [Trypanosoma cruzi] Length = 467 Score = 183 bits (464), Expect = 2e-43 Identities = 107/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 ASQFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTAFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E ++ GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVNVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQC 325 >ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener] gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi] 
Length = 467 Score = 183 bits (464), Expect = 2e-43 Identities = 107/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y + F E + GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAVHFAAAQ--ERARVPVKVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQC 325 >gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei] Length = 467 Score = 183 bits (464), Expect = 2e-43 Identities = 107/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 TSQFAEFKQKHGRVYKSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E ++ G P DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVNVEVVGVPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +ES W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVESQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNDAFEWIVQENDGAVYTEES 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA 
ITG+ + + + ++ VDA+SW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAANGPVAVAVDATSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYS-NYPSGYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY+ + P YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAPVPYWIIKNSWTTLWGEDGYIRIAKGSNQC 325 >gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi] Length = 467 Score = 183 bits (464), Expect = 2e-43 Identities = 107/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 ASQFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E ++ GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVNVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEGS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQC 325 >gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi] Length = 467 Score = 182 bits (462), Expect = 3e-43 Identities = 107/295 (36%), Positives = 151/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E + GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 
RSRYHNGAAHFAAAQ--ERARVPVKVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQC 325 >ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener] gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi] Length = 467 Score = 182 bits (462), Expect = 3e-43 Identities = 107/295 (36%), Positives = 151/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E + GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVKVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQC 325 >sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName: Full=Major 
cysteine proteinase; Flags: Precursor gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi] gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi] Length = 467 Score = 182 bits (462), Expect = 3e-43 Identities = 107/295 (36%), Positives = 151/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E + GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVKVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQC 325 >gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi] Length = 500 Score = 182 bits (462), Expect = 3e-43 Identities = 107/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 68 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 127 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y + F E + GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 128 RSRYHNGAVHFAAAQ--ERARVPVKVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 183 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 184 
SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDS 243 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 244 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 303 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 304 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGLNQC 358 >ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener] gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi] Length = 426 Score = 181 bits (460), Expect = 5e-43 Identities = 107/295 (36%), Positives = 151/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E + GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVKVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGLNQC 325 >gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi] Length = 467 Score = 181 bits (460), Expect = 5e-43 Identities = 107/295 (36%), Positives = 152/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E 
FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 ASQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E ++ GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVNVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVGVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW T WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEGGYIRVAKGSNQC 325 >gb|AAF75546.1| cruzipain [Trypanosoma cruzi] Length = 467 Score = 180 bits (457), Expect = 1e-42 Identities = 106/295 (35%), Positives = 151/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 ASQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E ++ GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVNVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFGWIVQENNGAVYTENS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW WGE GYI +A G N C Sbjct: 271 
GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTAQWGEDGYIRIAKGSNQC 325 >ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener] gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi] Length = 467 Score = 180 bits (457), Expect = 1e-42 Identities = 106/295 (35%), Positives = 151/295 (51%), Gaps = 10/295 (3%) Frame = +2 Query: 53 SADWATYVAKYGLTY-SAEEYQFRETVYATNMVRAAKLQEENPLAVMGETIFSHLTQQEF 229 ++ +A + K+G Y SA E FR +V+ N+ A NP A G T FS LT++EF Sbjct: 35 ASQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEF 94 Query: 230 ESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCWAF 409 S Y F E ++ GAP DW A G ++ VKDQGQCGSCWAF Sbjct: 95 RSRYHNGAAHFAAAQ--ERARVPVNVEVVGAPAAV--DWRARGAVTAVKDQGQCGSCWAF 150 Query: 410 SATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYV--KSAGGLDTEAS 583 SA +E W+LAG P LS + +V + A+ ++ ++ G + TE S Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDS 210 Query: 584 YPYHAGNGKAGTCAFKANSIGAKITGYTAINGESG-----IYRQASXSICVDASSWNSYK 748 YPY +G G + C +++GA ITG+ + + + ++ VDASSW +Y Sbjct: 211 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYT 270 Query: 749 SGTLTSC-GNSVDHCVQLTGYSNYPS-GYWNVRNSWGTGWGEAGYIHLAIGKNLC 907 G +TSC +DH V L GY++ + YW ++NSW WGE GYI +A G N C Sbjct: 271 GGVMTSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTAQWGEDGYIRIAKGSNQC 325 >ref|XP_012755566.1| hypothetical protein SAMD00019534_046220 [Acytostelium subglobosum LB1] gi|735856198|dbj|GAM21447.1| hypothetical protein SAMD00019534_046220 [Acytostelium subglobosum LB1] Length = 335 Score = 179 bits (454), Expect = 3e-42 Identities = 108/314 (34%), Positives = 154/314 (49%), Gaps = 26/314 (8%) Frame = +2 Query: 56 ADWATYVAKYGLTYSAEEYQFRETVYATNM--VRAAKLQE--ENPLAVMGETIFSHLTQQ 223 + + + KY Y+ EE+ +R V+ N+ +R L+ + A F+ LT Sbjct: 25 SQFRAFQTKYNKQYTNEEFTYRFGVFKNNLKVIRNMNLKSKAQKSTATFDVNAFADLTVD 84 Query: 224 EFESTYLGSKLEFNVYDIPETNFTAQSIPASGAPNPTNYDWSAAGCISPVKDQGQCGSCW 403 EF+ YL S + + P A + P TNYDW+ G ++PVK+QGQCGSCW Sbjct: 85 
EFKKYYLNSVVAERDMNAP----VAADVHVDAMP--TNYDWATLGAVTPVKNQGQCGSCW 138 Query: 404 AFSATETIESYWWLAGKPKVTLSPEQIV----------XXXXXXXXXXXXFPSRAYNYVK 553 +FSAT IE W+LAG LS + +V AY Y+ Sbjct: 139 SFSATGNIEGAWFLAGNNLTGLSEQNLVDCDHECMQYLGDHVCDQGCNGGLQPNAYEYIL 198 Query: 554 SAGGLDTEASYPYHAGNGKAGTCAFKANSIGAKITGYTAINGE-----SGIYRQASXSIC 718 GG+DTE SYPY G TC F A++IGAKI+ +T ++ S +Y +I Sbjct: 199 KNGGIDTEESYPYTGVTGT--TCNFDASNIGAKISSWTYVSSNETTMASYLYANGPLAIA 256 Query: 719 VDASSWNSYKSGT--LTSCGNSVDHCVQLTGY-----SNYPSGYWNVRNSWGTGWGEAGY 877 DA +W Y G CG+ +DH + +TG+ +N P YW V+NSWG WGE+GY Sbjct: 257 ADALTWQYYSGGVFDFKECGSVLDHGILITGFGVDTTNNEP--YWIVKNSWGADWGESGY 314 Query: 878 IHLAIGKNLCNLGS 919 + + GK LC L + Sbjct: 315 MRIIRGKGLCGLNT 328 >ref|XP_009032801.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens] gi|323457344|gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens] Length = 346 Score = 179 bits (454), Expect = 3e-42 Identities = 113/326 (34%), Positives = 162/326 (49%), Gaps = 43/326 (13%) Frame = +2 Query: 71 YVAKYGLTYSAEEYQFRETVYATNM-----VRAAKLQEENPLAVMGETIFSHLTQQEFES 235 YV Y T + E R T+++ N+ + A ++ E++ A G T F LT+ EF++ Sbjct: 27 YVKSYNSTEAEAE---RFTIFSANLRKTEALNAQRVDEDD--AEFGVTQFMDLTEAEFKA 81 Query: 236 TYLGSKLEFNVYDIPETNFTAQSIPAS--GAPNPTNYDWSA--AGCISPVKDQGQCGSCW 403 YL +P A+ + A+ G P + DW +G +S VKDQGQCGSCW Sbjct: 82 QYLNY--------VPSEQVLAEDVYAAPEGFAAPGSLDWRTKQSGVVSDVKDQGQCGSCW 133 Query: 404 AFSATETIESYWWLAGKPKVTLSPEQIVXXXXXXXXXXXXFPSRAYNYVKSAGGLDTEAS 583 AFSATE IES W LAG + +P+QIV AY YV+ AGG+ E++ Sbjct: 134 AFSATEQIESEWVLAGNDPLVFAPQQIVSCDKVDQGCNGGNTETAYAYVEKAGGMALESA 193 Query: 584 YPYHAG-NGKAGTCAFKANSIGAKITGYTAINGE---------------SGIYRQASXSI 715 YPY +G +G G C K + G + ++ + E + + SI Sbjct: 194 YPYKSGTSGNTGRCK-KFETAGGDVESFSYVVPECKKGKCNDQDEDKMAAALASHGPASI 252 Query: 716 CVDASSWNSYKSGTLTS--CG----NSVDHCVQLTGYSNYPSG------------YWNVR 841 CV+A +W +Y G +T+ CG N++DHCVQ+ GY+ Y WNVR Sbjct: 253 CVNAGAWQTYTKGVMTNLQCGSHAANALDHCVQVVGYTGYTGDAKACGKGLKDKCVWNVR 
312 Query: 842 NSWGTGWGEAGYIHLAIGKNLCNLGS 919 NSWGT WG GYI + +GKN C + + Sbjct: 313 NSWGTSWGYQGYIRVQMGKNACGIAN 338