BLASTX nr result
ID: Akebia27_contig00021792
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00021792 (1082 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_003632695.1| PREDICTED: OTU domain-containing protein At3...   317   6e-84
ref|XP_002323302.2| OTU-like cysteine protease family protein [P...   301   3e-79
ref|XP_006490038.1| PREDICTED: OTU domain-containing protein At3...   294   5e-77
ref|XP_006421489.1| hypothetical protein CICLE_v10005351mg [Citr...   293   7e-77
ref|XP_007028914.1| Cysteine proteinases superfamily protein iso...   287   6e-75
ref|XP_002534273.1| cysteine-type peptidase, putative [Ricinus c...   279   1e-72
emb|CAN60311.1| hypothetical protein VITISV_002512 [Vitis vinifera]   278   2e-72
ref|XP_006381039.1| hypothetical protein POPTR_0006s05620g [Popu...   274   4e-71
gb|EXC30911.1| OTU domain-containing protein [Morus notabilis]        270   8e-70
ref|XP_004497941.1| PREDICTED: OTU domain-containing protein At3...   269   2e-69
ref|XP_007145652.1| hypothetical protein PHAVU_007G257000g [Phas...   268   2e-69
ref|XP_006588483.1| PREDICTED: uncharacterized protein LOC100810...   265   3e-68
ref|XP_004291162.1| PREDICTED: OTU domain-containing protein At3...   262   2e-67
ref|NP_001242273.1| uncharacterized protein LOC100810338 [Glycin...   261   3e-67
ref|XP_007202322.1| hypothetical protein PRUPE_ppa008123mg [Prun...   260   6e-67
ref|XP_006421488.1| hypothetical protein CICLE_v10005351mg [Citr...   258   2e-66
ref|XP_007028913.1| Cysteine proteinases superfamily protein iso...   243   1e-61
ref|XP_007028911.1| Cysteine proteinases superfamily protein iso...   243   1e-61
ref|XP_004229848.1| PREDICTED: OTU domain-containing protein At3...
224 4e-56 ref|XP_006339468.1| PREDICTED: OTU domain-containing protein At3... 219 2e-54 >ref|XP_003632695.1| PREDICTED: OTU domain-containing protein At3g57810-like [Vitis vinifera] Length = 340 Score = 317 bits (812), Expect = 6e-84 Identities = 164/284 (57%), Positives = 203/284 (71%) Frame = -3 Query: 852 PITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTISRKSLC 673 PI+T A+++V + V RQM H+ + Q S S FYF++ +P+ +++S C Sbjct: 6 PISTCARNIVRLSGCVQRQMSSHICSLVSQGPSSSFSFYFYTGHSKPKNTFMSVSETFSC 65 Query: 672 SSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXX 493 SS IT TF+G C SG SK+R +S L VKSL+ S G KR L I L CQ+M Sbjct: 66 SS-ITAFHTFQGSCFYSGLSKRRGSSRSLTVKSLIGSRGPSKRSLNISLTCQNMNVRLLV 124 Query: 492 XKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDDRDSSS 313 KQ ++ KI+ N GSVSW +G ASAG +F L VC+SSSEPV+ E++++K +D+ Sbjct: 125 PKQGVLPKIKCNVGSVSWPQGCASAGLMFALLVCYSSSEPVHAESAQKK----EDKKGEC 180 Query: 312 VGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELRARVAD 133 SHGKKVYTDYSITGIPGDGRCLFRSV HGACLRSGKP+PS S QRELADELRA V D Sbjct: 181 YTNSHGKKVYTDYSITGIPGDGRCLFRSVVHGACLRSGKPAPSASCQRELADELRAEVVD 240 Query: 132 EFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 EF+RRR E+EWF++GDFDTYVS++RKPHVWGGEPELFMASHVLQ Sbjct: 241 EFIRRRSETEWFIEGDFDTYVSQMRKPHVWGGEPELFMASHVLQ 284 >ref|XP_002323302.2| OTU-like cysteine protease family protein [Populus trichocarpa] gi|550320875|gb|EEF05063.2| OTU-like cysteine protease family protein [Populus trichocarpa] Length = 342 Score = 301 bits (771), Expect = 3e-79 Identities = 154/289 (53%), Positives = 201/289 (69%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 M+V SPI+T K+VVH+ RV +QM + V + SCCF + + + + +++S Sbjct: 1 MIVCSPISTCVKNVVHLSSRV-QQMGSTILNVVSGGQTTSCCFSSYPGLSRSSYSRLSVS 59 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 + C S + T + C S +KQR + VK +V S G KR I LPCQ M Sbjct: 60 KTFSCPSI--SYQTIQSNCFGSVLTKQRADLQSFSVKGVVRSRGPLKRQFNISLPCQIMN 117 Query: 507 
XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 KQ +++KI N GS+SW +G + G IFGL VC+SSSEP + EA+ KN E D+ Sbjct: 118 LRFSVSKQGVLSKINDNTGSISWSQGYPTTGIIFGLLVCYSSSEPTHAEAATHKNEEEDN 177 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 + S + +SHGK+VY DYSI GIPGDGRCLFRSVAHGAC+RSGKP+PSE+LQRELAD+LR Sbjct: 178 CNLSDIKFSHGKEVYRDYSIIGIPGDGRCLFRSVAHGACIRSGKPAPSENLQRELADDLR 237 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 ++VADEF++RREE+EWF++G+FDTYVSRIRKPHVWGGEPEL MASHVL+ Sbjct: 238 SKVADEFIKRREETEWFIEGNFDTYVSRIRKPHVWGGEPELLMASHVLK 286 >ref|XP_006490038.1| PREDICTED: OTU domain-containing protein At3g57810-like [Citrus sinensis] Length = 341 Score = 294 bits (752), Expect = 5e-77 Identities = 148/289 (51%), Positives = 200/289 (69%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 M+V + I AK+VV++ R QM ++ GV + S SCCF+ S + + ++ Sbjct: 1 MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFHLCSGQSKKNYTGIS-- 58 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 +++ SS++ F+ C G +K R N L ++S + S G +KR++ I L C SM Sbjct: 59 -RTISSSSLNVLQPFQATCFSLGLTKPRCNLQPLTIRSFIGSRGSQKRHIEISLACHSMK 117 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 Q ++ K++ N G + W KG ASAG I GL VC+SSS+ + EA+ EK + +D Sbjct: 118 MRLLVPNQGVLPKLKLNAGPIDWPKGCASAGLICGLLVCYSSSK-AHAEAADEKEDGEED 176 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 D S+V YSHGKKVYTDYS+ GIPGDGRCLFR+VAHGACLR+GKP+PS S+QRELAD+LR Sbjct: 177 YDLSNVKYSHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADDLR 236 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 A+VADEF++RREE+EWF++GDFD YVS+IRKPHVWGGEPEL MASHVL+ Sbjct: 237 AKVADEFIKRREETEWFIEGDFDLYVSQIRKPHVWGGEPELLMASHVLR 285 >ref|XP_006421489.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] gi|557523362|gb|ESR34729.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] Length = 341 Score = 293 bits (751), Expect = 
7e-77 Identities = 148/289 (51%), Positives = 200/289 (69%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 M+V + I AK+VV++ R QM ++ GV + S SCCFY S + + ++ Sbjct: 1 MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFYLCSGQSKKNYAGIS-- 58 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 +++ SS++ F+ C G +K R N L ++S + S G +KR++ I L C+SM Sbjct: 59 -RTISSSSLNVLQPFQATCFSPGLTKPRCNLRPLTIRSFIGSRGSQKRHIEISLACRSMK 117 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 Q ++ K++ N G + W KG ASAG I GL VC+SSS+ + EA+ EK + +D Sbjct: 118 MRLLVPSQGVLPKLKLNAGPIDWPKGCASAGLICGLLVCYSSSK-AHAEAADEKEDGEED 176 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 D S+V Y HGKKVYTDYS+ GIPGDGRCLFR+VAHGACLR+GKP+PS S+QRELAD+LR Sbjct: 177 YDLSNVKYLHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADDLR 236 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 A+VADEF++RREE+EWF++GDFD YVS+IRKPHVWGGEPEL MASHVL+ Sbjct: 237 AKVADEFIKRREETEWFIEGDFDLYVSQIRKPHVWGGEPELLMASHVLR 285 >ref|XP_007028914.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|590636687|ref|XP_007028915.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|590636690|ref|XP_007028916.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|508717519|gb|EOY09416.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|508717520|gb|EOY09417.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] gi|508717521|gb|EOY09418.1| Cysteine proteinases superfamily protein isoform 4 [Theobroma cacao] Length = 340 Score = 287 bits (734), Expect = 6e-75 Identities = 150/289 (51%), Positives = 198/289 (68%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 MMV SPI+T AK+VVH+ +G +C V S SC ++ +S + ++ +++S Sbjct: 1 MMVCSPISTCAKNVVHLRGHMGSSLC----SVISCQPSSSCYYFSYSGHPKTKYTDLSVS 56 Query: 687 
RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 + S A+ + GC S S++ + L+VK +S +KR L I P QSM Sbjct: 57 YTTSGSPAVGYRAFQAGCFRSSRRSRKLQS---LVVKESISDKTKQKRQLEISWPGQSMK 113 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 KQ + K + G +SW +G AS G +FGL VC+SSSEPV+ EA+ K ++ DD Sbjct: 114 MKFLLPKQGTLQKFKCTAGPISWSQGCASVGLVFGLLVCYSSSEPVHAEAAGAKEDKQDD 173 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 +SS +SHGKKVYTDYS+ GIPGDGRC+FRSVAHGACLRSGK +PSE +QRELAD+LR Sbjct: 174 CESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADDLR 233 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 A+VADEF++RR+E+EWFV+G+FD YVS+IRKPHVWGGEPELFMASHVLQ Sbjct: 234 AKVADEFIKRRKETEWFVEGNFDAYVSQIRKPHVWGGEPELFMASHVLQ 282 >ref|XP_002534273.1| cysteine-type peptidase, putative [Ricinus communis] gi|223525596|gb|EEF28110.1| cysteine-type peptidase, putative [Ricinus communis] Length = 343 Score = 279 bits (714), Expect = 1e-72 Identities = 148/289 (51%), Positives = 197/289 (68%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 M+V SPI+T A+ VV++ + M + + S SCCF + ++IS Sbjct: 1 MIVCSPISTYARKVVYLSG-CAQHMGSTIFNMVSNGQSTSCCFCSCRAHLSKSYARLSIS 59 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 K+ S ++ T T SG +KQ + + VK L ++ G K++ + L Q++ Sbjct: 60 -KTFSSPSVGTCQTSNKNFSGSGSAKQSGSWQSITVKGLFNTRGPLKKHFNLSLAYQNLN 118 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 K+ M++KI+ N GS+SW + AS G I GL VC+SSSEP EA+ + +E D+ Sbjct: 119 MRFSLSKRGMLSKIKDNVGSISWAQECASTGLICGLLVCYSSSEPTRAEAAAREKDEEDN 178 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 D S V +SHGK+VYTDYSITGIPGDGRCLFRSVAHGA LR+GKP+PSESLQRELAD+LR Sbjct: 179 SDLSYVKFSHGKRVYTDYSITGIPGDGRCLFRSVAHGASLRTGKPAPSESLQRELADDLR 238 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 ARVADEF+RRR+E+EWF++GDFDTYV+++RKPHVWGGEPELFMASHVL+ Sbjct: 239 
ARVADEFIRRRQETEWFIEGDFDTYVAQMRKPHVWGGEPELFMASHVLK 287 >emb|CAN60311.1| hypothetical protein VITISV_002512 [Vitis vinifera] Length = 806 Score = 278 bits (712), Expect = 2e-72 Identities = 139/213 (65%), Positives = 163/213 (76%) Frame = -3 Query: 639 GCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXXXKQEMITKIRW 460 G C SG SK+R +S L VKSL+ S G KR L I L CQ+M KQ ++ KI+ Sbjct: 542 GSCFYSGLSKRRGSSRSLTVKSLIGSRGPSKRSLNISLTCQNMNVRLLVPKQGVLPKIKC 601 Query: 459 NRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDDRDSSSVGYSHGKKVYT 280 N GSVSW +G ASAG +F L VC+SSSEPV+ E++++K +D+ SHGKKVYT Sbjct: 602 NVGSVSWPQGCASAGLMFALLVCYSSSEPVHAESAQKK----EDKKGECYTNSHGKKVYT 657 Query: 279 DYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELRARVADEFVRRREESEW 100 DYSITGIPGDGRCLFRSV HGACLRSGKP+PS S QRELADELRA V DEF+RRR E+EW Sbjct: 658 DYSITGIPGDGRCLFRSVVHGACLRSGKPAPSASCQRELADELRAEVVDEFIRRRSETEW 717 Query: 99 FVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 F++GDFDTYVS++RKPHVWGGEPELFMASHVLQ Sbjct: 718 FIEGDFDTYVSQMRKPHVWGGEPELFMASHVLQ 750 >ref|XP_006381039.1| hypothetical protein POPTR_0006s05620g [Populus trichocarpa] gi|550335541|gb|ERP58836.1| hypothetical protein POPTR_0006s05620g [Populus trichocarpa] Length = 338 Score = 274 bits (701), Expect = 4e-71 Identities = 147/289 (50%), Positives = 193/289 (66%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 M+V S I T K+VVH+ RV +QM + V + S S CF + + + +++S Sbjct: 1 MIVCSAINTCVKNVVHLSGRV-QQMGSTILNVVSRGQSTSRCFSLYPSRSRSNYSRLSVS 59 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 + C S + T C S KQR N L VK +V+S G KR I LP Q+M Sbjct: 60 KTFSCPSI--SFHTLHRNCFGSDSIKQRYNLVSLTVKGVVNSGGPLKRQFNISLPSQNMA 117 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 K+ ++ KI+ N GSVS + + G FGL VC+SSSEP + E++ KN E D Sbjct: 118 LRFSVSKRGLLAKIKGNVGSVSCSQRHTTTGIFFGLLVCYSSSEPTHAESATRKNKEEDI 177 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 +SS + +SHGK+VYTDYSI G+PGDGRCLFRSVAHGACLR GK 
+PSESLQRELAD+LR Sbjct: 178 CNSSDIKFSHGKEVYTDYSIIGVPGDGRCLFRSVAHGACLRFGKRAPSESLQRELADDLR 237 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 + VADEF++RRE++EWF++G+FD+YVS++RKPHVWGGEPEL MASHVL+ Sbjct: 238 SNVADEFIKRREDTEWFIEGNFDSYVSQMRKPHVWGGEPELLMASHVLK 286 >gb|EXC30911.1| OTU domain-containing protein [Morus notabilis] Length = 893 Score = 270 bits (690), Expect = 8e-70 Identities = 144/297 (48%), Positives = 190/297 (63%) Frame = -3 Query: 894 FVHTPGYVNMMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQ 715 F+ Y NM+V I K + + + +M + V + SCCF + + Sbjct: 555 FIRNSCYDNMIVCPSIGACTKSIACLSGNIQTEMGSKLCSVVSRRPYSSCCFCLYPGNSK 614 Query: 714 PRFVSVTISRKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLG 535 ++ +++S+ L +S+ T +F C FS ++ L +K LVS+ ++R L Sbjct: 615 TKYAHLSVSKNHLSNSSPTFQKSFVSSC----FSTEKGRLWSLALKDLVSAAEPQRRRLK 670 Query: 534 IPLPCQSMXXXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEAS 355 I L +M KQ M+ KI + +AG + GL +C+SSS+P + E + Sbjct: 671 ISLANTAMSIRLLVPKQRMLVKIN-----------SGTAGLLGGLLICYSSSKPAHAEVA 719 Query: 354 REKNNENDDRDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESL 175 R ++ DD DSS V +SHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKP+PSESL Sbjct: 720 RSDDDSEDDCDSSYVKFSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPAPSESL 779 Query: 174 QRELADELRARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVL 4 QRELAD LRARVADEF++RREE+EWFV+GDFDTYV+++RKPHVWGGEPELFMASHVL Sbjct: 780 QRELADNLRARVADEFIKRREETEWFVEGDFDTYVAQMRKPHVWGGEPELFMASHVL 836 >ref|XP_004497941.1| PREDICTED: OTU domain-containing protein At3g57810-like [Cicer arietinum] Length = 337 Score = 269 bits (687), Expect = 2e-69 Identities = 146/289 (50%), Positives = 189/289 (65%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 M + P++ + V + R M ++ G+ + S S F+ +V ++I Sbjct: 1 MSICFPVSQSSISAVVVKGRTQLLMSSNICGLQSRGISCSFSSGFYPGKSGKNYVGLSIC 60 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 K CS+ + T RG L S SKQR ++ L S+VS H++ I L CQSM Sbjct: 61 
TKPSCSTVM--GQTIRGGYLGSCCSKQRGSTQLF--NSIVSRKKHRE----ISLACQSMS 112 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 KQ+M++K++ N G ++W + AS G IFGL VC SSEP + EA E NDD Sbjct: 113 MRLLVPKQKMLSKVKCNVGRINWPRSCASVGFIFGLFVCNLSSEPAHAEADYENRKRNDD 172 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 D ++V SHGK+VYTDYS+ GIPGDGRCLFRSVAHGA LRSGKP PSE QRELAD+LR Sbjct: 173 CDETNVKVSHGKQVYTDYSVIGIPGDGRCLFRSVAHGASLRSGKPPPSERFQRELADDLR 232 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 A+VADEFV+RREE+EWF++GDFD+Y+S+IRKPHVWGGEPELF+ASHVLQ Sbjct: 233 AKVADEFVKRREETEWFIEGDFDSYISQIRKPHVWGGEPELFIASHVLQ 281 >ref|XP_007145652.1| hypothetical protein PHAVU_007G257000g [Phaseolus vulgaris] gi|561018842|gb|ESW17646.1| hypothetical protein PHAVU_007G257000g [Phaseolus vulgaris] Length = 339 Score = 268 bits (686), Expect = 2e-69 Identities = 145/262 (55%), Positives = 181/262 (69%), Gaps = 2/262 (0%) Frame = -3 Query: 780 RGVAPQLTSKSCCFYFHSKVFQPRFVSVTISRKSLCSSAITTSSTFRGCCLESGFSKQRN 601 RG++ +S S F S++ V +++ K CS+ + T RG L S SKQR Sbjct: 35 RGISTSFSSSS--FPGESEI---NHVDLSVCTKLSCSTVM--GQTIRGGFLGSCCSKQRG 87 Query: 600 NSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXXXKQEMITKIRWNRGSVSWLKGTAS 421 N+ SS+ +KRY I L CQS+ KQ+++ K++ N G VSW +G AS Sbjct: 88 NTQFF------SSVVPRKRYHEISLACQSVNMRLFLPKQKLLHKVKRNFGPVSWPRGCAS 141 Query: 420 AGAIFGLSVCFSSSEPVYCEASREKNNENDDRDS--SSVGYSHGKKVYTDYSITGIPGDG 247 G IFGL VC SSSEP + E+ E N DD + S+V SHGKKVYTDYS+ GIPGDG Sbjct: 142 VGLIFGLLVCSSSSEPAHAESHSENENRKDDCNQYESNVKVSHGKKVYTDYSVIGIPGDG 201 Query: 246 RCLFRSVAHGACLRSGKPSPSESLQRELADELRARVADEFVRRREESEWFVKGDFDTYVS 67 RCLFRSV+ GACLRSGKP P+ES+QRELAD+LRARVADEF++RREE+EWF++GDFDTY+S Sbjct: 202 RCLFRSVSRGACLRSGKPPPTESVQRELADDLRARVADEFIKRREETEWFIEGDFDTYIS 261 Query: 66 RIRKPHVWGGEPELFMASHVLQ 1 IRKPHVWGGEPELF+ASHVLQ Sbjct: 262 HIRKPHVWGGEPELFIASHVLQ 283 >ref|XP_006588483.1| PREDICTED: uncharacterized protein LOC100810338 isoform X1 [Glycine max] Length = 339 Score = 265 bits (676), Expect 
= 3e-68 Identities = 140/237 (59%), Positives = 168/237 (70%), Gaps = 2/237 (0%) Frame = -3 Query: 705 VSVTISRKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPL 526 V +++ K CS+ + T RG L S SKQR N SS+ +KRY I L Sbjct: 55 VGLSVCTKLSCSTVM--GQTIRGGFLGSCCSKQRGNPRFF------SSVVPRKRYHEISL 106 Query: 525 PCQSMXXXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREK 346 CQ++ KQ M+ K++ N GSVSW +G AS G IFGL VC SSEP + E+ E Sbjct: 107 ACQTINMRLLVPKQNMMRKVKCNLGSVSWPRGCASVGLIFGLLVCNLSSEPAHAESHSEN 166 Query: 345 NNENDDRDS--SSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQ 172 N DD + S+V HGKKVYTDYS+ GIPGDGRCLFRSVA GACLRSGKP P+ES+Q Sbjct: 167 ENRKDDCNEYESNVKVLHGKKVYTDYSVIGIPGDGRCLFRSVARGACLRSGKPPPNESIQ 226 Query: 171 RELADELRARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 RELAD+LRARVADEF++R+EE+EWFV+GDFDTYVS+IRKPHVWGGEPELF+ASHVLQ Sbjct: 227 RELADDLRARVADEFIKRKEETEWFVEGDFDTYVSQIRKPHVWGGEPELFIASHVLQ 283 >ref|XP_004291162.1| PREDICTED: OTU domain-containing protein At3g57810-like [Fragaria vesca subsp. vesca] Length = 343 Score = 262 bits (669), Expect = 2e-67 Identities = 146/293 (49%), Positives = 190/293 (64%) Frame = -3 Query: 879 GYVNMMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVS 700 GYVN +V + I A +VV + + QM + V + S S C+ +F + Sbjct: 9 GYVNTVVGTHINQGANNVVCMSGCIEMQMGSKICSVVSRGASSSYCYRLQPGKSGNKFGT 68 Query: 699 VTISRKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPC 520 +++++ S T T G C S FS R NS +SL + ++ L I L C Sbjct: 69 LSLTK----SRPSETGQTPHGSCFRSCFSMDRGNS-----RSLTVNAKRTQKCLEISLAC 119 Query: 519 QSMXXXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNN 340 + M +Q M+ KI+ N G +SW + AG +FGL +C +SSEP + E + + ++ Sbjct: 120 RGMKTRILVPRQGMLPKIKCNVGPMSWTQ-CGYAGLMFGLLIC-NSSEPAHAETTHKNDD 177 Query: 339 ENDDRDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELA 160 + DD D S YSHGKKV+TDYSI GIPGDGRCLFRSVAHGACLR+GK +PS+SLQRELA Sbjct: 178 KEDDGDLS---YSHGKKVHTDYSIIGIPGDGRCLFRSVAHGACLRAGKSAPSQSLQRELA 234 Query: 159 DELRARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 
D+LRARVADEF++RREE+EWFV+GDFDTYVS+IRKPHVWGGEPEL MASHVLQ Sbjct: 235 DDLRARVADEFIKRREETEWFVEGDFDTYVSQIRKPHVWGGEPELLMASHVLQ 287 >ref|NP_001242273.1| uncharacterized protein LOC100810338 [Glycine max] gi|255645865|gb|ACU23423.1| unknown [Glycine max] Length = 339 Score = 261 bits (668), Expect = 3e-67 Identities = 139/237 (58%), Positives = 167/237 (70%), Gaps = 2/237 (0%) Frame = -3 Query: 705 VSVTISRKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPL 526 V +++ K CS+ + T RG L S SKQR N SS+ +KRY I L Sbjct: 55 VGLSVCTKLSCSTVM--GQTIRGGFLGSCCSKQRGNPRFF------SSVVPRKRYHEISL 106 Query: 525 PCQSMXXXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREK 346 CQ++ KQ M+ K++ N GSVSW +G AS G IFGL VC SSEP + E+ E Sbjct: 107 ACQTINMRLLVPKQNMMRKVKCNLGSVSWPRGCASVGLIFGLLVCNLSSEPAHAESHSEN 166 Query: 345 NNENDDRDS--SSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQ 172 N DD + S+V HGKKVYTDYS+ GIPGDGRCLFRSVA GACLRSGKP P+ES+Q Sbjct: 167 ENRKDDCNEYESNVKVLHGKKVYTDYSVIGIPGDGRCLFRSVARGACLRSGKPPPNESIQ 226 Query: 171 RELADELRARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 RELAD+LRARVADEF++R+EE+EWFV+GDFDTYVS+IRKPHVWGGE ELF+ASHVLQ Sbjct: 227 RELADDLRARVADEFIKRKEETEWFVEGDFDTYVSQIRKPHVWGGESELFIASHVLQ 283 >ref|XP_007202322.1| hypothetical protein PRUPE_ppa008123mg [Prunus persica] gi|462397853|gb|EMJ03521.1| hypothetical protein PRUPE_ppa008123mg [Prunus persica] Length = 344 Score = 260 bits (665), Expect = 6e-67 Identities = 148/293 (50%), Positives = 185/293 (63%) Frame = -3 Query: 879 GYVNMMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVS 700 G+VN +V PI K+VV + QM + V + S SCC + + S Sbjct: 9 GFVNTIVCPPINHSPKNVVCLSGCTQIQMGSKICSVVSRGASSSCCKGLQTGKTGTKIFS 68 Query: 699 VTISRKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPC 520 + +S+ + T G C FSK + V++ G K L I L C Sbjct: 69 LPLSK----NRPTNIGQTSHGNCFRFFFSKDSRSLT-------VNAGGPNKGSLEISLAC 117 Query: 519 QSMXXXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNN 340 + M +Q M+ KI+ N G VSW +G ASAG IFGL VC + S P + EA+ + + Sbjct: 118 
RGMNTRLLVPRQGMLPKIKCNVGPVSWPQGCASAGLIFGLLVC-NCSGPAHAEAAH-RED 175 Query: 339 ENDDRDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELA 160 E DD D S V +S GKKVYTDYSI GIPGDGRCLFRSVAHGA LR+GK +P+ESLQRELA Sbjct: 176 EEDDNDLSYVKFSRGKKVYTDYSIIGIPGDGRCLFRSVAHGAYLRAGKAAPAESLQRELA 235 Query: 159 DELRARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 D+LRARVADEF++RREE+EWFV+GDFDTYVS+IR+PHVWGGEPELFMASHVL+ Sbjct: 236 DDLRARVADEFIKRREETEWFVEGDFDTYVSQIRRPHVWGGEPELFMASHVLK 288 >ref|XP_006421488.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] gi|557523361|gb|ESR34728.1| hypothetical protein CICLE_v10005351mg [Citrus clementina] Length = 311 Score = 258 bits (660), Expect = 2e-66 Identities = 138/289 (47%), Positives = 186/289 (64%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 M+V + I AK+VV++ R QM ++ GV + S SCCFY S + + ++ Sbjct: 1 MIVSTSICACAKNVVNLGGRFQGQMGGNICGVTYRGPSSSCCFYLCSGQSKKNYAGIS-- 58 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 +++ SS++ F+ C G +K + ++ LV S G Sbjct: 59 -RTISSSSLNVLQPFQATCFSPGLTKPS-----MKMRLLVPSQG---------------- 96 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 ++ K++ N G + W KG ASAG I GL VC+SSS+ + EA+ EK + +D Sbjct: 97 ---------VLPKLKLNAGPIDWPKGCASAGLICGLLVCYSSSK-AHAEAADEKEDGEED 146 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 D S+V Y HGKKVYTDYS+ GIPGDGRCLFR+VAHGACLR+GKP+PS S+QRELAD+LR Sbjct: 147 YDLSNVKYLHGKKVYTDYSVIGIPGDGRCLFRAVAHGACLRAGKPAPSVSIQRELADDLR 206 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 A+VADEF++RREE+EWF++GDFD YVS+IRKPHVWGGEPEL MASHVL+ Sbjct: 207 AKVADEFIKRREETEWFIEGDFDLYVSQIRKPHVWGGEPELLMASHVLR 255 >ref|XP_007028913.1| Cysteine proteinases superfamily protein isoform 3 [Theobroma cacao] gi|508717518|gb|EOY09415.1| Cysteine proteinases superfamily protein isoform 3 [Theobroma cacao] Length = 324 Score = 243 bits (619), Expect = 1e-61 Identities = 135/289 (46%), Positives = 
180/289 (62%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 MMV SPI+T AK+VVH+ +G +C V S SC ++ +S + ++ +++S Sbjct: 1 MMVCSPISTCAKNVVHLRGHMGSSLC----SVISCQPSSSCYYFSYSGHPKTKYTDLSVS 56 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 + S A+ + GC S S++ + L+VK +S +KR L I P QSM Sbjct: 57 YTTSGSPAVGYRAFQAGCFRSSRRSRKLQS---LVVKESISDKTKQKRQLEISWPGQSMK 113 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 KQ + K + G +SW + EA+ K ++ DD Sbjct: 114 MKFLLPKQGTLQKFKCTAGPISWSQ-----------------------EAAGAKEDKQDD 150 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 +SS +SHGKKVYTDYS+ GIPGDGRC+FRSVAHGACLRSGK +PSE +QRELAD+LR Sbjct: 151 CESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADDLR 210 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 A+VADEF++RR+E+EWFV+G+FD YVS+IRKPHVWGGEPELFMASHVLQ Sbjct: 211 AKVADEFIKRRKETEWFVEGNFDAYVSQIRKPHVWGGEPELFMASHVLQ 259 >ref|XP_007028911.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] gi|590636674|ref|XP_007028912.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] gi|508717516|gb|EOY09413.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] gi|508717517|gb|EOY09414.1| Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] Length = 317 Score = 243 bits (619), Expect = 1e-61 Identities = 135/289 (46%), Positives = 180/289 (62%) Frame = -3 Query: 867 MMVYSPITTPAKHVVHICDRVGRQMCCHVRGVAPQLTSKSCCFYFHSKVFQPRFVSVTIS 688 MMV SPI+T AK+VVH+ +G +C V S SC ++ +S + ++ +++S Sbjct: 1 MMVCSPISTCAKNVVHLRGHMGSSLC----SVISCQPSSSCYYFSYSGHPKTKYTDLSVS 56 Query: 687 RKSLCSSAITTSSTFRGCCLESGFSKQRNNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMX 508 + S A+ + GC S S++ + L+VK +S +KR L I P QSM Sbjct: 57 YTTSGSPAVGYRAFQAGCFRSSRRSRKLQS---LVVKESISDKTKQKRQLEISWPGQSMK 113 Query: 507 XXXXXXKQEMITKIRWNRGSVSWLKGTASAGAIFGLSVCFSSSEPVYCEASREKNNENDD 328 KQ + K + G +SW + EA+ K ++ DD Sbjct: 114 
MKFLLPKQGTLQKFKCTAGPISWSQ-----------------------EAAGAKEDKQDD 150 Query: 327 RDSSSVGYSHGKKVYTDYSITGIPGDGRCLFRSVAHGACLRSGKPSPSESLQRELADELR 148 +SS +SHGKKVYTDYS+ GIPGDGRC+FRSVAHGACLRSGK +PSE +QRELAD+LR Sbjct: 151 CESSHAKFSHGKKVYTDYSVIGIPGDGRCMFRSVAHGACLRSGKSAPSEHVQRELADDLR 210 Query: 147 ARVADEFVRRREESEWFVKGDFDTYVSRIRKPHVWGGEPELFMASHVLQ 1 A+VADEF++RR+E+EWFV+G+FD YVS+IRKPHVWGGEPELFMASHVLQ Sbjct: 211 AKVADEFIKRRKETEWFVEGNFDAYVSQIRKPHVWGGEPELFMASHVLQ 259 >ref|XP_004229848.1| PREDICTED: OTU domain-containing protein At3g57810-like isoform 1 [Solanum lycopersicum] Length = 308 Score = 224 bits (572), Expect = 4e-56 Identities = 129/261 (49%), Positives = 160/261 (61%), Gaps = 7/261 (2%) Frame = -3 Query: 762 LTSKSCCFYFHSKVFQPRFVSVTISRKSLCSS----AITTSSTFRGC---CLESGFSKQR 604 +TS+ C S RF T K S+ I++S +F G C+ S + Sbjct: 1 MTSRFCNIVLQSPASSFRFYISTNPAKFTISAPRSNCISSSISFGGSQKRCIGYSNSNLK 60 Query: 603 NNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXXXKQEMITKIRWNRGSVSWLKGTA 424 N+ L V + S +K I Q++ Q K N G + Sbjct: 61 NSCRTLTVTTAAS---RRKACCDISFWSQNVNMRLFLRTQSKFRKFGCNSGK----RNHT 113 Query: 423 SAGAIFGLSVCFSSSEPVYCEASREKNNENDDRDSSSVGYSHGKKVYTDYSITGIPGDGR 244 SAG G VC S+SEPV+ EAS + D +SS+ GYSHGKKVYTDYS+ GIPGDGR Sbjct: 114 SAGLSIGFLVCCSASEPVHAEASG--GSMGDSCESSTTGYSHGKKVYTDYSVIGIPGDGR 171 Query: 243 CLFRSVAHGACLRSGKPSPSESLQRELADELRARVADEFVRRREESEWFVKGDFDTYVSR 64 CLFRSVAHGAC+RSGKP P+E+LQRELADELRARVADEF++RREE+EWF++GDF+TYV++ Sbjct: 172 CLFRSVAHGACVRSGKPPPNENLQRELADELRARVADEFIKRREETEWFIEGDFNTYVAQ 231 Query: 63 IRKPHVWGGEPELFMASHVLQ 1 IR HVWGGEPEL MASHVLQ Sbjct: 232 IRNSHVWGGEPELLMASHVLQ 252 >ref|XP_006339468.1| PREDICTED: OTU domain-containing protein At3g57810-like [Solanum tuberosum] Length = 308 Score = 219 bits (557), Expect = 2e-54 Identities = 126/261 (48%), Positives = 158/261 (60%), Gaps = 7/261 (2%) Frame = -3 Query: 762 LTSKSCCFYFHSKVFQPRFVSVTISRKSLCSS----AITTSSTFRGC---CLESGFSKQR 604 +TS+ C S RF T K + S+ I++S F G C+ S + Sbjct: 1 MTSRFCNIVLQSPASSLRFYISTNPAKFIISAPRSNCISSSINFEGSQKRCVGYTNSNLK 60 Query: 
603 NNSDLLMVKSLVSSMGHKKRYLGIPLPCQSMXXXXXXXKQEMITKIRWNRGSVSWLKGTA 424 N+ L +L ++ + I Q++ Q K N G + Sbjct: 61 NSCRSL---TLTTAASRRNVCCDISFWSQNVNMRLFLPTQSKFHKFGCNSGQ----RNHT 113 Query: 423 SAGAIFGLSVCFSSSEPVYCEASREKNNENDDRDSSSVGYSHGKKVYTDYSITGIPGDGR 244 SAG G VC S+SEPV EAS + D +SS+ GYSHGKKVYTDYS+ GIPGDGR Sbjct: 114 SAGLFIGFLVCCSASEPVLAEASG--GSMGDSCESSTTGYSHGKKVYTDYSVIGIPGDGR 171 Query: 243 CLFRSVAHGACLRSGKPSPSESLQRELADELRARVADEFVRRREESEWFVKGDFDTYVSR 64 CLFRSVAHGAC+RSGKP P+E+LQR+LADELRARVADEF++RREE+EWF++GDF TYV++ Sbjct: 172 CLFRSVAHGACVRSGKPPPNENLQRQLADELRARVADEFIKRREETEWFIEGDFITYVAQ 231 Query: 63 IRKPHVWGGEPELFMASHVLQ 1 IR HVWGGEPEL MASHVLQ Sbjct: 232 IRNSHVWGGEPELLMASHVLQ 252