BLASTX nr result
ID: Akebia24_contig00024554
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00024554
         (984 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_003632695.1| PREDICTED: OTU domain-containing protein At3...    254   4e-65
ref|XP_002323302.2| OTU-like cysteine protease family protein [P...    242   1e-61
ref|XP_006490038.1| PREDICTED: OTU domain-containing protein At3...    236   1e-59
ref|XP_006421489.1| hypothetical protein CICLE_v10005351mg [Citr...    236   1e-59
ref|XP_007028914.1| Cysteine proteinases superfamily protein iso...    226   1e-56
ref|XP_006381039.1| hypothetical protein POPTR_0006s05620g [Popu...    218   3e-54
ref|XP_002534273.1| cysteine-type peptidase, putative [Ricinus c...    218   4e-54
emb|CAN60311.1| hypothetical protein VITISV_002512 [Vitis vinifera]    216   1e-53
gb|EXC30911.1| OTU domain-containing protein [Morus notabilis]         209   1e-51
ref|XP_004497941.1| PREDICTED: OTU domain-containing protein At3...    204   4e-50
ref|XP_007145652.1| hypothetical protein PHAVU_007G257000g [Phas...    202   1e-49
ref|XP_006421488.1| hypothetical protein CICLE_v10005351mg [Citr...    201   5e-49
ref|XP_007202322.1| hypothetical protein PRUPE_ppa008123mg [Prun...    201   5e-49
ref|XP_004291162.1| PREDICTED: OTU domain-containing protein At3...    199   1e-48
ref|XP_006588483.1| PREDICTED: uncharacterized protein LOC100810...    198   3e-48
ref|NP_001242273.1| uncharacterized protein LOC100810338 [Glycin...    198   3e-48
ref|XP_007028913.1| Cysteine proteinases superfamily protein iso...    182   2e-43
ref|XP_007028911.1| Cysteine proteinases superfamily protein iso...    182   2e-43
ref|XP_007028917.1| Cysteine proteinases superfamily protein iso...    179   1e-42
ref|XP_004229848.1| PREDICTED: OTU domain-containing protein At3...    174   5e-41

>ref|XP_003632695.1| PREDICTED: OTU domain-containing protein At3g57810-like [Vitis vinifera] Length = 340 Score = 254 bits (649), Expect = 4e-65 Identities = 136/252 (53%), Positives = 170/252 (67%) Frame = +2 Query: 227 PITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTISRKSLC 406 PI+T A+++V + V RQM H+ + Q S S FYF++ +P+ +++S C Sbjct: 6 PISTCARNIVRLSGCVQRQMSSHICSLVSQGPSSSFSFYFYTGHSKPKNTFMSVSETFSC 65 Query: 407 SSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXX 586 SS IT TFQG C SG SK+R +S L VKSL+ S G KR L I L CQ+M Sbjct: 66 SS-ITAFHTFQGSCFYSGLSKRRGSSRSLTVKSLIGSRGPSKRSLNISLTCQNMNVRLLV 124 Query: 587 XXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDDRDSSS 766 Q ++ KI+ N GSVSW +G ASAG +F L VC+SSSEPV+ E++++K +D+ Sbjct: 125 PKQGVLPKIKCNVGSVSWPQGCASAGLMFALLVCYSSSEPVHAESAQKK----EDKKGEC 180 Query: 767 VGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELRARVAD 946 SHGKKVYTDYSITGIPGDGRCLFRSV HGACLRSGKP+PS S QRELADELRA V D Sbjct: 181 YTNSHGKKVYTDYSITGIPGDGRCLFRSVVHGACLRSGKPAPSASCQRELADELRAEVVD 240 Query: 947 EFVRRREESEWF 982 EF+RRR E+EWF Sbjct: 241 EFIRRRSETEWF 252 >ref|XP_002323302.2| OTU-like cysteine protease family protein [Populus trichocarpa] gi|550320875|gb|EEF05063.2| OTU-like cysteine protease family protein [Populus trichocarpa] Length = 342 Score = 242 bits (618), Expect = 1e-61 Identities = 127/257 (49%), Positives = 169/257 (65%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 M+V SPI+T K+VVH+ RV +QM + V + SCCF + + + + +++S Sbjct: 1 MIVCSPISTCVKNVVHLSSRV-QQMGSTILNVVSGGQTTSCCFSSYPGLSRSSYSRLSVS 59 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 + C S + T Q C S +KQR + VK +V S G KR I LPCQ M Sbjct: 60 KTFSCPSI--SYQTIQSNCFGSVLTKQRADLQSFSVKGVVRSRGPLKRQFNISLPCQIMN 117 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q +++KI N GS+SW +G + G IFGL VC+SSSEP + EA+ KN E D+ Sbjct: 
118 LRFSVSKQGVLSKINDNTGSISWSQGYPTTGIIFGLLVCYSSSEPTHAEAATHKNEEEDN 177 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 + S + +SHGK+VY DYSI GIPGDGRCLFRSVAHGAC+RSGKP+PSE+LQRELAD+LR Sbjct: 178 CNLSDIKFSHGKEVYRDYSIIGIPGDGRCLFRSVAHGACIRSGKPAPSENLQRELADDLR 237 Query: 932 ARVADEFVRRREESEWF 982 ++VADEF++RREE+EWF Sbjct: 238 SKVADEFIKRREETEWF 254 >ref|XP_006490038.1| PREDICTED: OTU domain-containing protein At3g57810-like [Citrus sinensis] Length = 341 Score = 236 bits (602), Expect = 1e-59 Identities = 123/257 (47%), Positives = 170/257 (66%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 M+V + I AK+VV++ R QM ++ GV + S SCCF+ S + + ++ Sbjct: 1 MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFHLCSGQSKKNYTGIS-- 58 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 +++ SS++ FQ C G +K R N L ++S + S G +KR++ I L C SM Sbjct: 59 -RTISSSSLNVLQPFQATCFSLGLTKPRCNLQPLTIRSFIGSRGSQKRHIEISLACHSMK 117 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q ++ K++ N G + W KG ASAG I GL VC+SSS+ + EA+ EK + +D Sbjct: 118 MRLLVPNQGVLPKLKLNAGPIDWPKGCASAGLICGLLVCYSSSK-AHAEAADEKEDGEED 176 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 D S+V YSHGKKVYTDYS+ GIPGDGRCLFR+VAHGACLR+GKP+PS S+QRELAD+LR Sbjct: 177 YDLSNVKYSHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADDLR 236 Query: 932 ARVADEFVRRREESEWF 982 A+VADEF++RREE+EWF Sbjct: 237 AKVADEFIKRREETEWF 253 >ref|XP_006421489.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] gi|557523362|gb|ESR34729.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] Length = 341 Score = 236 bits (601), Expect = 1e-59 Identities = 123/257 (47%), Positives = 170/257 (66%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 M+V + I AK+VV++ R QM ++ GV + S SCCFY S + + ++ Sbjct: 1 MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFYLCSGQSKKNYAGIS-- 58 Query: 392 
RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 +++ SS++ FQ C G +K R N L ++S + S G +KR++ I L C+SM Sbjct: 59 -RTISSSSLNVLQPFQATCFSPGLTKPRCNLRPLTIRSFIGSRGSQKRHIEISLACRSMK 117 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q ++ K++ N G + W KG ASAG I GL VC+SSS+ + EA+ EK + +D Sbjct: 118 MRLLVPSQGVLPKLKLNAGPIDWPKGCASAGLICGLLVCYSSSK-AHAEAADEKEDGEED 176 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 D S+V Y HGKKVYTDYS+ GIPGDGRCLFR+VAHGACLR+GKP+PS S+QRELAD+LR Sbjct: 177 YDLSNVKYLHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADDLR 236 Query: 932 ARVADEFVRRREESEWF 982 A+VADEF++RREE+EWF Sbjct: 237 AKVADEFIKRREETEWF 253 >ref|XP_007028914.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|590636687|ref|XP_007028915.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|590636690|ref|XP_007028916.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|508717519|gb|EOY09416.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|508717520|gb|EOY09417.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|508717521|gb|EOY09418.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] Length = 340 Score = 226 bits (576), Expect = 1e-56 Identities = 122/257 (47%), Positives = 165/257 (64%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 MMV SPI+T AK+VVH+ +G +C V S SC ++ +S + ++ +++S Sbjct: 1 MMVCSPISTCAKNVVHLRGHMGSSLC----SVISCQPSSSCYYFSYSGHPKTKYTDLSVS 56 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 + S A+ FQ C S S++ L+VK +S +KR L I P QSM Sbjct: 57 YTTSGSPAVGYR-AFQAGCFRS--SRRSRKLQSLVVKESISDKTKQKRQLEISWPGQSMK 113 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q + K + G +SW +G AS G +FGL VC+SSSEPV+ EA+ K ++ DD Sbjct: 114 MKFLLPKQGTLQKFKCTAGPISWSQGCASVGLVFGLLVCYSSSEPVHAEAAGAKEDKQDD 173 Query: 752 
RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 +SS +SHGKKVYTDYS+ GIPGDGRC+FRSVAHGACLRSGK +PSE +QRELAD+LR Sbjct: 174 CESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADDLR 233 Query: 932 ARVADEFVRRREESEWF 982 A+VADEF++RR+E+EWF Sbjct: 234 AKVADEFIKRRKETEWF 250 >ref|XP_006381039.1| hypothetical protein POPTR_0006s05620g [Populus trichocarpa] gi|550335541|gb|ERP58836.1| hypothetical protein POPTR_0006s05620g [Populus trichocarpa] Length = 338 Score = 218 bits (555), Expect = 3e-54 Identities = 122/257 (47%), Positives = 161/257 (62%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 M+V S I T K+VVH+ RV +QM + V + S S CF + + + +++S Sbjct: 1 MIVCSAINTCVKNVVHLSGRV-QQMGSTILNVVSRGQSTSRCFSLYPSRSRSNYSRLSVS 59 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 + C S + T C S KQR N L VK +V+S G KR I LP Q+M Sbjct: 60 KTFSCPSI--SFHTLHRNCFGSDSIKQRYNLVSLTVKGVVNSGGPLKRQFNISLPSQNMA 117 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 + ++ KI+ N GSVS + + G FGL VC+SSSEP + E++ KN E D Sbjct: 118 LRFSVSKRGLLAKIKGNVGSVSCSQRHTTTGIFFGLLVCYSSSEPTHAESATRKNKEEDI 177 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 +SS + +SHGK+VYTDYSI G+PGDGRCLFRSVAHGACLR GK +PSESLQRELAD+LR Sbjct: 178 CNSSDIKFSHGKEVYTDYSIIGVPGDGRCLFRSVAHGACLRFGKRAPSESLQRELADDLR 237 Query: 932 ARVADEFVRRREESEWF 982 + VADEF++RRE++EWF Sbjct: 238 SNVADEFIKRREDTEWF 254 >ref|XP_002534273.1| cysteine-type peptidase, putative [Ricinus communis] gi|223525596|gb|EEF28110.1| cysteine-type peptidase, putative [Ricinus communis] Length = 343 Score = 218 bits (554), Expect = 4e-54 Identities = 121/257 (47%), Positives = 164/257 (63%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 M+V SPI+T A+ VV++ + M + + S SCCF + ++IS Sbjct: 1 MIVCSPISTYARKVVYLSG-CAQHMGSTIFNMVSNGQSTSCCFCSCRAHLSKSYARLSIS 59 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 K+ 
S ++ T T SG +KQ + + VK L ++ G K++ + L Q++ Sbjct: 60 -KTFSSPSVGTCQTSNKNFSGSGSAKQSGSWQSITVKGLFNTRGPLKKHFNLSLAYQNLN 118 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 + M++KI+ N GS+SW + AS G I GL VC+SSSEP EA+ + +E D+ Sbjct: 119 MRFSLSKRGMLSKIKDNVGSISWAQECASTGLICGLLVCYSSSEPTRAEAAAREKDEEDN 178 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 D S V +SHGK+VYTDYSITGIPGDGRCLFRSVAHGA LR+GKP+PSESLQRELAD+LR Sbjct: 179 SDLSYVKFSHGKRVYTDYSITGIPGDGRCLFRSVAHGASLRTGKPAPSESLQRELADDLR 238 Query: 932 ARVADEFVRRREESEWF 982 ARVADEF+RRR+E+EWF Sbjct: 239 ARVADEFIRRRQETEWF 255 >emb|CAN60311.1| hypothetical protein VITISV_002512 [Vitis vinifera] Length = 806 Score = 216 bits (549), Expect = 1e-53 Identities = 110/181 (60%), Positives = 130/181 (71%) Frame = +2 Query: 440 GCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXXXXQEMITKIRW 619 G C SG SK+R +S L VKSL+ S G KR L I L CQ+M Q ++ KI+ Sbjct: 542 GSCFYSGLSKRRGSSRSLTVKSLIGSRGPSKRSLNISLTCQNMNVRLLVPKQGVLPKIKC 601 Query: 620 NRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDDRDSSSVGYSHGKKVYT 799 N GSVSW +G ASAG +F L VC+SSSEPV+ E++++K +D+ SHGKKVYT Sbjct: 602 NVGSVSWPQGCASAGLMFALLVCYSSSEPVHAESAQKK----EDKKGECYTNSHGKKVYT 657 Query: 800 DYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELRARVADEFVRRREESEW 979 DYSITGIPGDGRCLFRSV HGACLRSGKP+PS S QRELADELRA V DEF+RRR E+EW Sbjct: 658 DYSITGIPGDGRCLFRSVVHGACLRSGKPAPSASCQRELADELRAEVVDEFIRRRSETEW 717 Query: 980 F 982 F Sbjct: 718 F 718 >gb|EXC30911.1| OTU domain-containing protein [Morus notabilis] Length = 893 Score = 209 bits (533), Expect = 1e-51 Identities = 118/266 (44%), Positives = 159/266 (59%) Frame = +2 Query: 185 FVHTPGYVNMMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQ 364 F+ Y NM+V I K + + + +M + V + SCCF + + Sbjct: 555 FIRNSCYDNMIVCPSIGACTKSIACLSGNIQTEMGSKLCSVVSRRPYSSCCFCLYPGNSK 614 Query: 365 PRFFSVTISRKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLG 544 ++ +++S+ L +S S TFQ + S FS ++ L +K LVS+ ++R L Sbjct: 615 
TKYAHLSVSKNHLSNS----SPTFQKSFVSSCFSTEKGRLWSLALKDLVSAAEPQRRRLK 670 Query: 545 IPLPCQSMXXXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEAS 724 I L +M Q M+ KI + +AG + GL +C+SSS+P + E + Sbjct: 671 ISLANTAMSIRLLVPKQRMLVKIN-----------SGTAGLLGGLLICYSSSKPAHAEVA 719 Query: 725 REKNNENDDRDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESL 904 R ++ DD DSS V +SHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKP+PSESL Sbjct: 720 RSDDDSEDDCDSSYVKFSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPAPSESL 779 Query: 905 QRELADELRARVADEFVRRREESEWF 982 QRELAD LRARVADEF++RREE+EWF Sbjct: 780 QRELADNLRARVADEFIKRREETEWF 805 >ref|XP_004497941.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cicer arietinum] Length = 337 Score = 204 bits (519), Expect = 4e-50 Identities = 117/257 (45%), Positives = 155/257 (60%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 M + P++ + V + R M ++ G+ + S S F+ + ++I Sbjct: 1 MSICFPVSQSSISAVVVKGRTQLLMSSNICGLQSRGISCSFSSGFYPGKSGKNYVGLSIC 60 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 K CS+ + T +G L S SKQR ++ L S+VS H++ I L CQSM Sbjct: 61 TKPSCSTVM--GQTIRGGYLGSCCSKQRGSTQLF--NSIVSRKKHRE----ISLACQSMS 112 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q+M++K++ N G ++W + AS G IFGL VC SSEP + EA E NDD Sbjct: 113 MRLLVPKQKMLSKVKCNVGRINWPRSCASVGFIFGLFVCNLSSEPAHAEADYENRKRNDD 172 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 D ++V SHGK+VYTDYS+ GIPGDGRCLFRSVAHGA LRSGKP PSE QRELAD+LR Sbjct: 173 CDETNVKVSHGKQVYTDYSVIGIPGDGRCLFRSVAHGASLRSGKPPPSERFQRELADDLR 232 Query: 932 ARVADEFVRRREESEWF 982 A+VADEFV+RREE+EWF Sbjct: 233 AKVADEFVKRREETEWF 249 >ref|XP_007145652.1| hypothetical protein PHAVU_007G257000g [Phaseolus vulgaris] gi|561018842|gb|ESW17646.1| hypothetical protein PHAVU_007G257000g [Phaseolus vulgaris] Length = 339 Score = 202 bits (515), Expect = 1e-49 Identities = 115/230 (50%), Positives = 148/230 (64%), Gaps = 2/230 (0%) Frame = +2 Query: 299 
RGVAPQLTSKSCCFYFHSKVFQPRFFSVTISRKSLCSSAITTSLTFQGCCLESGFSKQRN 478 RG++ +S S F S++ +++ K CS+ + T +G L S SKQR Sbjct: 35 RGISTSFSSSS--FPGESEI---NHVDLSVCTKLSCSTVM--GQTIRGGFLGSCCSKQRG 87 Query: 479 NSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXXXXQEMITKIRWNRGSVSWLKGTAS 658 N+ SS+ +KRY I L CQS+ Q+++ K++ N G VSW +G AS Sbjct: 88 NTQFF------SSVVPRKRYHEISLACQSVNMRLFLPKQKLLHKVKRNFGPVSWPRGCAS 141 Query: 659 AGAIFGLSVCFSSSEPVYCEASREKNNENDDRDS--SSVGYSHGKKVYTDYSITGIPGDG 832 G IFGL VC SSSEP + E+ E N DD + S+V SHGKKVYTDYS+ GIPGDG Sbjct: 142 VGLIFGLLVCSSSSEPAHAESHSENENRKDDCNQYESNVKVSHGKKVYTDYSVIGIPGDG 201 Query: 833 RCLFRSVAHGACLRSGKPSPSESLQRELADELRARVADEFVRRREESEWF 982 RCLFRSV+ GACLRSGKP P+ES+QRELAD+LRARVADEF++RREE+EWF Sbjct: 202 RCLFRSVSRGACLRSGKPPPTESVQRELADDLRARVADEFIKRREETEWF 251 >ref|XP_006421488.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] gi|557523361|gb|ESR34728.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] Length = 311 Score = 201 bits (510), Expect = 5e-49 Identities = 113/257 (43%), Positives = 156/257 (60%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 M+V + I AK+VV++ R QM ++ GV + S SCCFY S + + ++ Sbjct: 1 MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFYLCSGQSKKNYAGIS-- 58 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 +++ SS++ FQ C G +K + ++ LV S G Sbjct: 59 -RTISSSSLNVLQPFQATCFSPGLTKPS-----MKMRLLVPSQG---------------- 96 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 ++ K++ N G + W KG ASAG I GL VC+SSS+ + EA+ EK + +D Sbjct: 97 ---------VLPKLKLNAGPIDWPKGCASAGLICGLLVCYSSSK-AHAEAADEKEDGEED 146 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 D S+V Y HGKKVYTDYS+ GIPGDGRCLFR+VAHGACLR+GKP+PS S+QRELAD+LR Sbjct: 147 YDLSNVKYLHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADDLR 206 Query: 932 ARVADEFVRRREESEWF 982 A+VADEF++RREE+EWF Sbjct: 207 AKVADEFIKRREETEWF 223 >ref|XP_007202322.1| hypothetical protein PRUPE_ppa008123mg [Prunus persica] 
gi|462397853|gb|EMJ03521.1| hypothetical protein PRUPE_ppa008123mg [Prunus persica] Length = 344 Score = 201 bits (510), Expect = 5e-49 Identities = 121/261 (46%), Positives = 153/261 (58%) Frame = +2 Query: 200 GYVNMMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFS 379 G+VN +V PI K+VV + QM + V + S SCC + + FS Sbjct: 9 GFVNTIVCPPINHSPKNVVCLSGCTQIQMGSKICSVVSRGASSSCCKGLQTGKTGTKIFS 68 Query: 380 VTISRKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPC 559 + +S+ + T G C FSK + V++ G K L I L C Sbjct: 69 LPLSK----NRPTNIGQTSHGNCFRFFFSKDSRSLT-------VNAGGPNKGSLEISLAC 117 Query: 560 QSMXXXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNN 739 + M Q M+ KI+ N G VSW +G ASAG IFGL VC + S P + EA+ + + Sbjct: 118 RGMNTRLLVPRQGMLPKIKCNVGPVSWPQGCASAGLIFGLLVC-NCSGPAHAEAAH-RED 175 Query: 740 ENDDRDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELA 919 E DD D S V +S GKKVYTDYSI GIPGDGRCLFRSVAHGA LR+GK +P+ESLQRELA Sbjct: 176 EEDDNDLSYVKFSRGKKVYTDYSIIGIPGDGRCLFRSVAHGAYLRAGKAAPAESLQRELA 235 Query: 920 DELRARVADEFVRRREESEWF 982 D+LRARVADEF++RREE+EWF Sbjct: 236 DDLRARVADEFIKRREETEWF 256 >ref|XP_004291162.1| PREDICTED: OTU domain-containing protein At3g57810-like [Fragaria vesca subsp. 
vesca] Length = 343 Score = 199 bits (506), Expect = 1e-48 Identities = 117/261 (44%), Positives = 158/261 (60%) Frame = +2 Query: 200 GYVNMMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFS 379 GYVN +V + I A +VV + + QM + V + S S C+ +F + Sbjct: 9 GYVNTVVGTHINQGANNVVCMSGCIEMQMGSKICSVVSRGASSSYCYRLQPGKSGNKFGT 68 Query: 380 VTISRKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPC 559 +++++ S T T G C S FS R NS +SL + ++ L I L C Sbjct: 69 LSLTK----SRPSETGQTPHGSCFRSCFSMDRGNS-----RSLTVNAKRTQKCLEISLAC 119 Query: 560 QSMXXXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNN 739 + M Q M+ KI+ N G +SW + AG +FGL +C +SSEP + E + + ++ Sbjct: 120 RGMKTRILVPRQGMLPKIKCNVGPMSWTQ-CGYAGLMFGLLIC-NSSEPAHAETTHKNDD 177 Query: 740 ENDDRDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELA 919 + DD D S YSHGKKV+TDYSI GIPGDGRCLFRSVAHGACLR+GK +PS+SLQRELA Sbjct: 178 KEDDGDLS---YSHGKKVHTDYSIIGIPGDGRCLFRSVAHGACLRAGKSAPSQSLQRELA 234 Query: 920 DELRARVADEFVRRREESEWF 982 D+LRARVADEF++RREE+EWF Sbjct: 235 DDLRARVADEFIKRREETEWF 255 >ref|XP_006588483.1| PREDICTED: uncharacterized protein LOC100810338 isoform X1 [Glycine max] Length = 339 Score = 198 bits (503), Expect = 3e-48 Identities = 108/203 (53%), Positives = 134/203 (66%), Gaps = 2/203 (0%) Frame = +2 Query: 380 VTISRKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPC 559 +++ K CS+ + T +G L S SKQR N SS+ +KRY I L C Sbjct: 57 LSVCTKLSCSTVM--GQTIRGGFLGSCCSKQRGNPRFF------SSVVPRKRYHEISLAC 108 Query: 560 QSMXXXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNN 739 Q++ Q M+ K++ N GSVSW +G AS G IFGL VC SSEP + E+ E N Sbjct: 109 QTINMRLLVPKQNMMRKVKCNLGSVSWPRGCASVGLIFGLLVCNLSSEPAHAESHSENEN 168 Query: 740 ENDDRDS--SSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRE 913 DD + S+V HGKKVYTDYS+ GIPGDGRCLFRSVA GACLRSGKP P+ES+QRE Sbjct: 169 RKDDCNEYESNVKVLHGKKVYTDYSVIGIPGDGRCLFRSVARGACLRSGKPPPNESIQRE 228 Query: 914 LADELRARVADEFVRRREESEWF 982 LAD+LRARVADEF++R+EE+EWF Sbjct: 229 LADDLRARVADEFIKRKEETEWF 251 >ref|NP_001242273.1| uncharacterized 
protein LOC100810338 [Glycine max] gi|255645865|gb|ACU23423.1| unknown [Glycine max] Length = 339 Score = 198 bits (503), Expect = 3e-48 Identities = 108/203 (53%), Positives = 134/203 (66%), Gaps = 2/203 (0%) Frame = +2 Query: 380 VTISRKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPC 559 +++ K CS+ + T +G L S SKQR N SS+ +KRY I L C Sbjct: 57 LSVCTKLSCSTVM--GQTIRGGFLGSCCSKQRGNPRFF------SSVVPRKRYHEISLAC 108 Query: 560 QSMXXXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNN 739 Q++ Q M+ K++ N GSVSW +G AS G IFGL VC SSEP + E+ E N Sbjct: 109 QTINMRLLVPKQNMMRKVKCNLGSVSWPRGCASVGLIFGLLVCNLSSEPAHAESHSENEN 168 Query: 740 ENDDRDS--SSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRE 913 DD + S+V HGKKVYTDYS+ GIPGDGRCLFRSVA GACLRSGKP P+ES+QRE Sbjct: 169 RKDDCNEYESNVKVLHGKKVYTDYSVIGIPGDGRCLFRSVARGACLRSGKPPPNESIQRE 228 Query: 914 LADELRARVADEFVRRREESEWF 982 LAD+LRARVADEF++R+EE+EWF Sbjct: 229 LADDLRARVADEFIKRKEETEWF 251 >ref|XP_007028913.1| Cysteine proteinases superfamily protein isoform 3 [Theobroma cacao] gi|508717518|gb|EOY09415.1| Cysteine proteinases superfamily protein isoform 3 [Theobroma cacao] Length = 324 Score = 182 bits (461), Expect = 2e-43 Identities = 107/257 (41%), Positives = 147/257 (57%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 MMV SPI+T AK+VVH+ +G +C V S SC ++ +S + ++ +++S Sbjct: 1 MMVCSPISTCAKNVVHLRGHMGSSLC----SVISCQPSSSCYYFSYSGHPKTKYTDLSVS 56 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 + S A+ FQ C S S++ L+VK +S +KR L I P QSM Sbjct: 57 YTTSGSPAVGYR-AFQAGCFRS--SRRSRKLQSLVVKESISDKTKQKRQLEISWPGQSMK 113 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q + K + G +SW + EA+ K ++ DD Sbjct: 114 MKFLLPKQGTLQKFKCTAGPISWSQ-----------------------EAAGAKEDKQDD 150 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 +SS +SHGKKVYTDYS+ GIPGDGRC+FRSVAHGACLRSGK +PSE +QRELAD+LR Sbjct: 151 
CESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADDLR 210 Query: 932 ARVADEFVRRREESEWF 982 A+VADEF++RR+E+EWF Sbjct: 211 AKVADEFIKRRKETEWF 227 >ref|XP_007028911.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] gi|590636674|ref|XP_007028912.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] gi|508717516|gb|EOY09413.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] gi|508717517|gb|EOY09414.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] Length = 317 Score = 182 bits (461), Expect = 2e-43 Identities = 107/257 (41%), Positives = 147/257 (57%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 MMV SPI+T AK+VVH+ +G +C V S SC ++ +S + ++ +++S Sbjct: 1 MMVCSPISTCAKNVVHLRGHMGSSLC----SVISCQPSSSCYYFSYSGHPKTKYTDLSVS 56 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 + S A+ FQ C S S++ L+VK +S +KR L I P QSM Sbjct: 57 YTTSGSPAVGYR-AFQAGCFRS--SRRSRKLQSLVVKESISDKTKQKRQLEISWPGQSMK 113 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q + K + G +SW + EA+ K ++ DD Sbjct: 114 MKFLLPKQGTLQKFKCTAGPISWSQ-----------------------EAAGAKEDKQDD 150 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 +SS +SHGKKVYTDYS+ GIPGDGRC+FRSVAHGACLRSGK +PSE +QRELAD+LR Sbjct: 151 CESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADDLR 210 Query: 932 ARVADEFVRRREESEWF 982 A+VADEF++RR+E+EWF Sbjct: 211 AKVADEFIKRRKETEWF 227 >ref|XP_007028917.1| Cysteine proteinases superfamily protein isoform 7 [Theobroma cacao] gi|508717522|gb|EOY09419.1| Cysteine proteinases superfamily protein isoform 7 [Theobroma cacao] Length = 291 Score = 179 bits (455), Expect = 1e-42 Identities = 106/256 (41%), Positives = 146/256 (57%) Frame = +2 Query: 212 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFFSVTIS 391 MMV SPI+T AK+VVH+ +G +C V S SC ++ +S + ++ +++S Sbjct: 1 
MMVCSPISTCAKNVVHLRGHMGSSLC----SVISCQPSSSCYYFSYSGHPKTKYTDLSVS 56 Query: 392 RKSLCSSAITTSLTFQGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 571 + S A+ FQ C S S++ L+VK +S +KR L I P QSM Sbjct: 57 YTTSGSPAVGYR-AFQAGCFRS--SRRSRKLQSLVVKESISDKTKQKRQLEISWPGQSMK 113 Query: 572 XXXXXXXQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 751 Q + K + G +SW + EA+ K ++ DD Sbjct: 114 MKFLLPKQGTLQKFKCTAGPISWSQ-----------------------EAAGAKEDKQDD 150 Query: 752 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 931 +SS +SHGKKVYTDYS+ GIPGDGRC+FRSVAHGACLRSGK +PSE +QRELAD+LR Sbjct: 151 CESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADDLR 210 Query: 932 ARVADEFVRRREESEW 979 A+VADEF++RR+E+EW Sbjct: 211 AKVADEFIKRRKETEW 226 >ref|XP_004229848.1| PREDICTED: OTU domain-containing protein At3g57810-like isoform 1 [Solanum lycopersicum] Length = 308 Score = 174 bits (441), Expect = 5e-41 Identities = 105/229 (45%), Positives = 133/229 (58%), Gaps = 7/229 (3%) Frame = +2 Query: 317 LTSKSCCFYFHSKVFQPRFFSVTISRKSLCSS----AITTSLTFQGC---CLESGFSKQR 475 +TS+ C S RF+ T K S+ I++S++F G C+ S + Sbjct: 1 MTSRFCNIVLQSPASSFRFYISTNPAKFTISAPRSNCISSSISFGGSQKRCIGYSNSNLK 60 Query: 476 NNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXXXXQEMITKIRWNRGSVSWLKGTA 655 N+ L V + S +K I Q++ Q K N G + Sbjct: 61 NSCRTLTVTTAAS---RRKACCDISFWSQNVNMRLFLRTQSKFRKFGCNSGK----RNHT 113 Query: 656 SAGAIFGLSVCFSSSEPVYCEASREKNNENDDRDSSSVGYSHGKKVYTDYSITGIPGDGR 835 SAG G VC S+SEPV+ EAS + D +SS+ GYSHGKKVYTDYS+ GIPGDGR Sbjct: 114 SAGLSIGFLVCCSASEPVHAEASG--GSMGDSCESSTTGYSHGKKVYTDYSVIGIPGDGR 171 Query: 836 CLFRSVAHGACLRSGKPSPSESLQRELADELRARVADEFVRRREESEWF 982 CLFRSVAHGAC+RSGKP P+E+LQRELADELRARVADEF++RREE+EWF Sbjct: 172 CLFRSVAHGACVRSGKPPPNENLQRELADELRARVADEFIKRREETEWF 220
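The per-hit statistics in the alignments above (Score, Expect, Identities, Positives, the optional Gaps field, and Frame) follow a regular layout even in this flattened form. The following is a minimal sketch, not part of the BLAST report itself, of how those fields might be pulled out with a regular expression; the names STATS_RE and parse_hit_stats are hypothetical and chosen only for illustration, and the pattern is written against the layout seen in this report rather than against every BLAST output variant.

```python
import re

# Hypothetical pattern for the statistics text as it appears in this report, e.g.
# "Score = 254 bits (649), Expect = 4e-65 Identities = 136/252 (53%),
#  Positives = 170/252 (67%) Frame = +2".
# The "Gaps" field is optional because only some hits report it.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hit_stats(text):
    """Return one dict per statistics block found in `text`."""
    hits = []
    for m in STATS_RE.finditer(text):
        evalue = m["evalue"]
        # Some legacy reports print very small E-values as "e-100";
        # prefix a 1 so float() can read them.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "align_len": length,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"]) if m["gaps"] else 0,
            "frame": int(m["frame"]),
            # Unrounded value; the report's own percentages look truncated.
            "pct_identity": 100.0 * ident / length,
        })
    return hits


# Two statistics blocks copied from the report above.
sample = (
    "Length = 340 Score = 254 bits (649), Expect = 4e-65 "
    "Identities = 136/252 (53%), Positives = 170/252 (67%) Frame = +2 "
    "Length = 339 Score = 202 bits (515), Expect = 1e-49 "
    "Identities = 115/230 (50%), Positives = 148/230 (64%), "
    "Gaps = 2/230 (0%) Frame = +2"
)

for hit in parse_hit_stats(sample):
    print(hit["bits"], hit["evalue"], round(hit["pct_identity"], 1), hit["gaps"])
```

For anything beyond a quick check, a dedicated BLAST parser or a tabular or XML output format would likely be more robust than a hand-written pattern.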