BLASTX nr result
ID: Glycyrrhiza32_contig00004487
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza32_contig00004487 (1220 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value XP_004496177.1 PREDICTED: OTU domain-containing protein At3g5781... 408 e-139 BAE71258.1 hypothetical protein [Trifolium pratense] 395 e-133 XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] ... 380 e-128 XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [... 374 e-125 XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [... 366 e-123 XP_003556279.1 PREDICTED: OTU domain-containing protein At3g5781... 365 e-122 XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [... 364 e-122 XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus... 362 e-121 XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [... 362 e-120 XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [... 360 e-120 XP_004142455.1 PREDICTED: OTU domain-containing protein At3g5781... 327 e-107 XP_016900257.1 PREDICTED: OTU domain-containing protein At3g5781... 325 e-106 XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [... 323 e-105 XP_010032108.1 PREDICTED: OTU domain-containing protein At3g5781... 316 e-103 OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius] 315 e-102 OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis] 314 e-102 XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [... 313 e-101 KHN37847.1 OTU domain-containing protein [Glycine soja] 310 e-101 GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum] 310 e-101 EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [T... 310 e-100 >XP_004496177.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cicer arietinum] Length = 313 Score = 408 bits (1048), Expect = e-139 Identities = 214/316 (67%), Positives = 233/316 (73%), Gaps = 7/316 (2%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASS---AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSCE 235 MLGVLCATR +PWIFSFLH+S+ AARLAH T A SS S RF ARR HSS+CE Sbjct: 1 MLGVLCATRSRPWIFSFLHSSASHHAARLAHCTVACSSLSTRFDATF--AARRHHSSACE 58 Query: 236 LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 415 L+ IWHAI PCGGDGFRRGVV V HDH+LKGEGSWNVAWDARPARWLH DSA Sbjct: 59 LQ-LGGGAASIWHAIRPCGGDGFRRGVVTVQHDHDLKGEGSWNVAWDARPARWLHRSDSA 117 Query: 416 WLLFGVCDCLXXXXXXXXXXXXXXXXXXXS---SEGREVKVAECDSKEQDDEVSSDYRVT 586 WLLFGVC CL + SEGRE+K AE D KE++DE+S+DYRVT Sbjct: 118 WLLFGVCACLAPPVIADVDLEAPPTPAINTDENSEGREMKYAEGD-KERNDELSADYRVT 176 Query: 587 GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 766 GVLADGRCLFRA+AHGACL NGE APNENRQ ELAD WF+EGD Sbjct: 177 GVLADGRCLFRAIAHGACLNNGEEAPNENRQRELADELRARVAEELLKRRKETEWFIEGD 236 Query: 767 FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEIS 946 FDAYV RI+Q + WGGEPELLMASHVLKTPI VFMRD SSIDLVNIAKYGEEY ND+EIS Sbjct: 237 FDAYVNRIRQTYVWGGEPELLMASHVLKTPIYVFMRDASSIDLVNIAKYGEEYMNDKEIS 296 Query: 947 INVLFHRYGHYDILET 994 INVLFHR+GHY+ILET Sbjct: 297 INVLFHRHGHYEILET 312 >BAE71258.1 hypothetical protein [Trifolium pratense] Length = 326 Score = 395 bits (1014), Expect = e-133 Identities = 212/330 (64%), Positives = 228/330 (69%), Gaps = 13/330 (3%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASSA-----ARLAHGTAYSSASPRFSRPGHDGARRQHSSSC 232 MLGVLCATR +PWIFSFLH SS+ ARLAH T SS+S P ARR HSS C Sbjct: 1 MLGVLCATRSRPWIFSFLHHSSSHHHHTARLAHITVASSSS---LSPTFFSARRNHSSQC 57 Query: 233 ELR-GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 409 +L+ IWHAIMPCGGDGF+RG VHHDHELKGEGSWNVAWDARPARWLH D Sbjct: 58 KLQISAGGGAASIWHAIMPCGGDGFQRGAFMVHHDHELKGEGSWNVAWDARPARWLHRSD 117 Query: 410 SAWLLFGVCDCL-------XXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVS 568 SAWLLFGV L SEG E+K AE D + +DE+S Sbjct: 118 SAWLLFGVRAWLAPPPVIVDVDPEVPLPTSVISPDEISRSEGLEIKDAESD--KPNDELS 175 Query: 569 SDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXX 748 SDYRVTGVLADGRCLFRALAHGACLKNGE APNENRQ ELAD Sbjct: 176 SDYRVTGVLADGRCLFRALAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETE 235 Query: 749 WFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYR 928 WF+EGDFD YV RIQQ F WGGEPELLMASHVLKTPI VFMRD +SIDLVNIAKYGEEY Sbjct: 236 WFIEGDFDTYVTRIQQSFVWGGEPELLMASHVLKTPIFVFMRDPNSIDLVNIAKYGEEYM 295 Query: 929 NDEEISINVLFHRYGHYDILETS*PKLPKK 1018 NDE ISINVLFHR+GHY++LET PKL +K Sbjct: 296 NDEGISINVLFHRHGHYELLETLCPKLSQK 325 >XP_013469378.1 OTU-like cysteine protease [Medicago truncatula] KEH43416.1 OTU-like cysteine protease [Medicago truncatula] Length = 305 Score = 380 bits (976), Expect = e-128 Identities = 197/315 (62%), Positives = 215/315 (68%), Gaps = 6/315 (1%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELR-- 241 MLGVLCATR +PWIFS H A RL+H T P ARR HS++C Sbjct: 1 MLGVLCATRSRPWIFSSHHHHHAFRLSHATVAPLTFP---------ARRHHSTACNNLQI 51 Query: 242 GXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 421 IWHAI PCGGDGFR G V +HHDHELKGEGSWNVAWDARPARWLH DSAWL Sbjct: 52 STGGGAASIWHAITPCGGDGFRTGGVMLHHDHELKGEGSWNVAWDARPARWLHRSDSAWL 111 Query: 422 LFGVCDCLXXXXXXXXXXXXXXXXXXX----SSEGREVKVAECDSKEQDDEVSSDYRVTG 589 LFGVC CL SSEGRE+K D E+DDE+++DYRVTG Sbjct: 112 LFGVCACLAPPVVLDVDPEAAAPTPAVFPNESSEGREMKDELSD--ERDDELNADYRVTG 169 Query: 590 VLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDF 769 VLADGRCLFRA+AHGACLKNGE APNE+RQ ELAD WF+EGDF Sbjct: 170 VLADGRCLFRAIAHGACLKNGEEAPNESRQRELADELRVKVAEELLNRRKETEWFIEGDF 229 Query: 770 DAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISI 949 D YV RIQQ + WGGEPELLMASHVLKTPI VFMRD SS+DLVNIAKYGEEY NDEEISI Sbjct: 230 DTYVTRIQQTYVWGGEPELLMASHVLKTPIYVFMRDASSMDLVNIAKYGEEYMNDEEISI 289 Query: 950 NVLFHRYGHYDILET 994 NVLFHR+GHY++LET Sbjct: 290 NVLFHRHGHYELLET 304 >XP_003536306.1 PREDICTED: uncharacterized protein LOC100793001 [Glycine max] KRH34730.1 hypothetical protein GLYMA_10G202000 [Glycine max] Length = 296 Score = 374 bits (959), Expect = e-125 Identities = 193/313 (61%), Positives = 216/313 (69%), Gaps = 1/313 (0%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHA-SSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRG 244 MLGVLCATRPKPW+ S +H +S RL H SASP RR+HS++C+L Sbjct: 1 MLGVLCATRPKPWLLSLVHVHASLPRLPHSPLSPSASPP--------PRRRHSTACKLFL 52 Query: 245 XXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLL 424 IWHAIMP G DG RRGVVAVH +LKGEGSWNVAWDARPARWLH PDSAWLL Sbjct: 53 SGGAAASIWHAIMPRGDDGLRRGVVAVH---DLKGEGSWNVAWDARPARWLHRPDSAWLL 109 Query: 425 FGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADG 604 FGVC CL S G D + ++DEVS+DYRVTGV ADG Sbjct: 110 FGVCACLAPPPGCVDADTNSAGIAVDESCGL------LDKEREEDEVSADYRVTGVPADG 163 Query: 605 RCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVK 784 RCLFRA+AHGACL+NGE AP+ENRQ ELAD WF+EGDFD Y++ Sbjct: 164 RCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQ 223 Query: 785 RIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFH 964 RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH Sbjct: 224 RIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFH 283 Query: 965 RYGHYDILETS*P 1003 YGHYDILET P Sbjct: 284 GYGHYDILETLRP 296 >XP_017413456.1 PREDICTED: uncharacterized protein LOC108324995 [Vigna angularis] KOM35649.1 hypothetical protein LR48_Vigan02g179900 [Vigna angularis] BAT94560.1 hypothetical protein VIGAN_08117300 [Vigna angularis var. angularis] Length = 290 Score = 366 bits (940), Expect = e-123 Identities = 194/310 (62%), Positives = 211/310 (68%), Gaps = 1/310 (0%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 247 MLGVLCATRPKPW+FS +HAS RL H + ASP RR HSS+C+L G Sbjct: 1 MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50 Query: 248 XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 427 IWHAIMP GDGFRRGVVAVH +LKGEGSWNVAWD RPARWLH DSAWLLF Sbjct: 51 AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107 Query: 428 GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 607 GVC CL S + KE +VS+DYRVTGV ADGR Sbjct: 108 GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159 Query: 608 CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 787 CLFRA+AHGACL+NGE AP+ENRQ ELAD WF+EGDFD YVKR Sbjct: 160 CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219 Query: 788 IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 964 IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+YRND EE SINVLFH Sbjct: 220 IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYRNDKEENSINVLFH 279 Query: 965 RYGHYDILET 994 YGHYDILE+ Sbjct: 280 GYGHYDILES 289 >XP_003556279.1 PREDICTED: OTU domain-containing protein At3g57810-like [Glycine max] KHN00921.1 OTU domain-containing protein [Glycine soja] KRG92054.1 hypothetical protein GLYMA_20G188400 [Glycine max] Length = 294 Score = 365 bits (936), Expect = e-122 Identities = 192/311 (61%), Positives = 213/311 (68%), Gaps = 2/311 (0%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 247 MLGVLCATR KPW+FS +HAS RL+H SASP RR+HS++C+L Sbjct: 1 MLGVLCATRSKPWLFSLVHAS-LPRLSHAPLSPSASPP--------PRRRHSTACKLFLS 51 Query: 248 XXXXXXIWHAIMPC--GGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWL 421 IWHAIMP DGFRRGVVA H ++KGEGSWNVAWDARPARWLH PDSAWL Sbjct: 52 AGGAASIWHAIMPRVNDDDGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWL 108 Query: 422 LFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLAD 601 LFGVC CL S D + ++ EVS+DYRVTGV AD Sbjct: 109 LFGVCACLAPPSSCVDADTNTDAIAVDES------CRLLDKEREEYEVSADYRVTGVPAD 162 Query: 602 GRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYV 781 GRCLFRA+AHGACL+NGE AP+ENRQ ELAD WF+EGDFD YV Sbjct: 163 GRCLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELMKRREETEWFIEGDFDTYV 222 Query: 782 KRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLF 961 +RIQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGEEYRND+EISINVLF Sbjct: 223 QRIQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEEYRNDKEISINVLF 282 Query: 962 HRYGHYDILET 994 H YGHYDILET Sbjct: 283 HGYGHYDILET 293 >XP_014512510.1 PREDICTED: uncharacterized protein LOC106771118 [Vigna radiata var. radiata] Length = 290 Score = 364 bits (934), Expect = e-122 Identities = 193/310 (62%), Positives = 210/310 (67%), Gaps = 1/310 (0%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCELRGX 247 MLGVLCATRPKPW+FS +HAS RL H + ASP RR HSS+C+L G Sbjct: 1 MLGVLCATRPKPWLFSLVHASPP-RLPHASVSLLASP---------PRRHHSSACKLFGS 50 Query: 248 XXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLF 427 IWHAIMP GDGFRRGVVAVH +LKGEGSWNVAWD RPARWLH DSAWLLF Sbjct: 51 AGGAGSIWHAIMPRSGDGFRRGVVAVH---DLKGEGSWNVAWDTRPARWLHRSDSAWLLF 107 Query: 428 GVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGR 607 GVC CL S + KE +VS+DYRVTGV ADGR Sbjct: 108 GVCACLAPPGCVDAVTDSDAVAADESCGVLD--------KELKVDVSADYRVTGVPADGR 159 Query: 608 CLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKR 787 CLFRA+AHGACL+NGE AP+ENRQ ELAD WF+EGDFD YVKR Sbjct: 160 CLFRAIAHGACLRNGEKAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYVKR 219 Query: 788 IQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND-EEISINVLFH 964 IQQP+ WGGEPELLMASHVLKTPISVFMRDT S+DLVNIAKYGE+Y ND EE SINVLFH Sbjct: 220 IQQPYVWGGEPELLMASHVLKTPISVFMRDTGSVDLVNIAKYGEDYMNDKEENSINVLFH 279 Query: 965 RYGHYDILET 994 YGHYDILE+ Sbjct: 280 GYGHYDILES 289 >XP_007143828.1 hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris] ESW15822.1 hypothetical protein PHAVU_007G105100g [Phaseolus vulgaris] Length = 305 Score = 362 bits (928), Expect = e-121 Identities = 197/322 (61%), Positives = 214/322 (66%), Gaps = 1/322 (0%) Frame = +2 Query: 32 NPAHDSHLCTSSMLGVLCATRPKPWIFSFLHASSAARLAHGTAYSSASPRFSRPGHDGAR 211 NPAHDS +S MLGVLCATRP+PW+FS +HAS RL H + SASP R Sbjct: 7 NPAHDSF--SSPMLGVLCATRPRPWLFSHVHAS-LPRLVHASVSLSASP---------PR 54 Query: 212 RQHSSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPAR 391 R HSS+C++ G IWHAIMP GD FRRGVV VH +LKGEGSWNVAWD RPAR Sbjct: 55 RHHSSACKIFGSAGGAASIWHAIMPRSGDRFRRGVVPVH---DLKGEGSWNVAWDTRPAR 111 Query: 392 WLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS 571 WLH PDSAWLLFGVC CL S +V+ A D + Sbjct: 112 WLHRPDSAWLLFGVCACLAPPGCVDVVTDFEAVAVDESCGVLKVE-ASAD--------YA 162 Query: 572 DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 751 DYRVTGV ADGRCLFRA+AHG CL+NGE AP+EN Q ELAD W Sbjct: 163 DYRVTGVPADGRCLFRAIAHGDCLRNGEKAPDENCQRELADELRAKVVDELLKRREETEW 222 Query: 752 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 931 F+EGDFD YVKRIQQPF WGGEPELLMASHVLKTPISVFMR T S+ LVNIAKYGEEYRN Sbjct: 223 FIEGDFDTYVKRIQQPFVWGGEPELLMASHVLKTPISVFMRATGSVGLVNIAKYGEEYRN 282 Query: 932 D-EEISINVLFHRYGHYDILET 994 D EE SINVLFH YGHYDILET Sbjct: 283 DKEENSINVLFHGYGHYDILET 304 >XP_016177333.1 PREDICTED: uncharacterized protein LOC107619558 [Arachis ipaensis] Length = 327 Score = 362 bits (929), Expect = e-120 Identities = 201/328 (61%), Positives = 217/328 (66%), Gaps = 21/328 (6%) Frame = +2 Query: 74 GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 235 GVLCATRPKPWI S LHAS S+ARL H A P F + ARR HSS+C Sbjct: 4 GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDARRHHSSACN 57 Query: 236 LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 388 G IWHAIMPCGG RGVVAVHH DHELKGEGSWNVAWDARPA Sbjct: 58 HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117 Query: 389 RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 550 RWLH PDSAWLLFGVC CL + EG+ VKV Sbjct: 118 RWLHRPDSAWLLFGVCACLAPPVSSVTDLEATPPATATVVNRDINPEGQGVKV------- 170 Query: 551 QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 730 D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD Sbjct: 171 --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228 Query: 731 XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 910 WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK Sbjct: 229 RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288 Query: 911 YGEEYRNDEEISINVLFHRYGHYDILET 994 YGEEYRN++++ INVLFH YGHYDILET Sbjct: 289 YGEEYRNEKDVCINVLFHGYGHYDILET 316 >XP_015941210.1 PREDICTED: uncharacterized protein LOC107466718 [Arachis duranensis] Length = 327 Score = 360 bits (925), Expect = e-120 Identities = 200/328 (60%), Positives = 217/328 (66%), Gaps = 21/328 (6%) Frame = +2 Query: 74 GVLCATRPKPWIFS--FLHAS---SAARLAHGTAYSSASPRFSRP-GHDGARRQHSSSCE 235 GVLCATRPKPWI S LHAS S+ARL H A P F + RR HSS+C Sbjct: 4 GVLCATRPKPWILSAAILHASLHHSSARLLH------APPLFPQLLRRTDTRRHHSSACN 57 Query: 236 LRGXXXXXXX--IWHAIMPCGGDGF------RRGVVAVHH-DHELKGEGSWNVAWDARPA 388 G IWHAIMPCGG RGVVAVHH DHELKGEGSWNVAWDARPA Sbjct: 58 HGGDFGGGGAASIWHAIMPCGGGAGSGKKLRHRGVVAVHHHDHELKGEGSWNVAWDARPA 117 Query: 389 RWLHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXX------SSEGREVKVAECDSKE 550 RWLH PDSAWLLFGVC CL ++EG+ VKV Sbjct: 118 RWLHRPDSAWLLFGVCACLAPPVSSVADLEATPPATATVVNRDMNTEGQGVKV------- 170 Query: 551 QDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXX 730 D +SSDYRVTGVLADGRCLFRA+AHGACL+NGEAAP+E RQ ELAD Sbjct: 171 --DGLSSDYRVTGVLADGRCLFRAIAHGACLRNGEAAPDERRQRELADELRAQVVEELMK 228 Query: 731 XXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAK 910 WF+EGDFD YVKRIQQP+ WGGEPELLMASHVLKTPISVFMRDTSS+ LVNIAK Sbjct: 229 RREETEWFIEGDFDTYVKRIQQPYVWGGEPELLMASHVLKTPISVFMRDTSSLSLVNIAK 288 Query: 911 YGEEYRNDEEISINVLFHRYGHYDILET 994 YGEEYRN++++ INVLFH YGHYDILET Sbjct: 289 YGEEYRNEKDMCINVLFHGYGHYDILET 316 >XP_004142455.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis sativus] KGN52210.1 hypothetical protein Csa_5G615810 [Cucumis sativus] Length = 313 Score = 327 bits (837), Expect = e-107 Identities = 178/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%) Frame = +2 Query: 68 MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 235 MLGVLCA RPKPWI +F+H S+ H + S S D +R HSS+C+ Sbjct: 1 MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56 Query: 236 LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 415 L G IWHAIMP G H HE KGEGSWNVAWDARPARWLH PDSA Sbjct: 57 LAGGGAAS--IWHAIMPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114 Query: 416 WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 595 WLLFGVC C+ + +EV + Q+DE S+DYRVTGVL Sbjct: 115 WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169 Query: 596 ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 775 ADGRCLFRA+AHGACL++GE AP+++RQ ELAD W++EGDFDA Sbjct: 170 ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229 Query: 776 YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 955 YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS L+NIAKYG+EY+ EE INV Sbjct: 230 YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQKGEESPINV 289 Query: 956 LFHRYGHYDILETS*PKLPKK 1018 LFH YGHYDILETS K+ K Sbjct: 290 LFHGYGHYDILETSSDKVSLK 310 >XP_016900257.1 PREDICTED: OTU domain-containing protein At3g57810-like [Cucumis melo] Length = 313 Score = 325 bits (832), Expect = e-106 Identities = 177/321 (55%), Positives = 204/321 (63%), Gaps = 4/321 (1%) Frame = +2 Query: 68 MLGVLCATRPKPWIF----SFLHASSAARLAHGTAYSSASPRFSRPGHDGARRQHSSSCE 235 MLGVLCA RPKPWI +F+H S+ H + S S D +R HSS+C+ Sbjct: 1 MLGVLCA-RPKPWILVSLSNFIHGSAVYHHHH---HQSRLLVQSPIQFDRRQRHHSSACK 56 Query: 236 LRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSA 415 L G IWHAI+P G H HE KGEGSWNVAWDARPARWLH PDSA Sbjct: 57 LAGGGAAS--IWHAILPSGAGSSSNLCRPAIHCHERKGEGSWNVAWDARPARWLHRPDSA 114 Query: 416 WLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVL 595 WLLFGVC C+ + +EV + Q+DE S+DYRVTGVL Sbjct: 115 WLLFGVCACIAPLDWVDASHEAVSL-----DQKKEVCESSGPEFNQNDESSADYRVTGVL 169 Query: 596 ADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDA 775 ADGRCLFRA+AHGACL++GE AP+++RQ ELAD W++EGDFDA Sbjct: 170 ADGRCLFRAIAHGACLRSGEEAPDDDRQRELADELRAKVVDELLKRRKETEWYIEGDFDA 229 Query: 776 YVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINV 955 YVKRIQQPF WGGEPELLMASHVLKTPISVFMR+ SS L+NIAKYG+EY+ EE INV Sbjct: 230 YVKRIQQPFVWGGEPELLMASHVLKTPISVFMRERSSDGLINIAKYGQEYQMGEESPINV 289 Query: 956 LFHRYGHYDILETS*PKLPKK 1018 LFH YGHYDILETS K+ K Sbjct: 290 LFHGYGHYDILETSSDKVSLK 310 >XP_019459096.1 PREDICTED: uncharacterized protein LOC109359045 [Lupinus angustifolius] OIW01500.1 hypothetical protein TanjilG_19426 [Lupinus angustifolius] Length = 319 Score = 323 bits (828), Expect = e-105 Identities = 179/318 (56%), Positives = 202/318 (63%), Gaps = 8/318 (2%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASSAARLAHGTA--YSSASPRFSRPGHDGARRQHSSSCELR 241 ML LC TRPKP S +A+ H +A + +S F PG DG RR HSS+C + Sbjct: 1 MLAALC-TRPKPSFLSSFFFQTASLHNHNSARFINGSSLHFYCPGGDGRRRHHSSACTIG 59 Query: 242 GXXXXXXXIWHAIMPCGGDGF----RRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPD 409 G IWH ++P R A+ H HEL+GEGSWN AWDARP+RWLH PD Sbjct: 60 GSCGGAASIWHVVLPERAGASICCDLRWRSALPH-HELRGEGSWNAAWDARPSRWLHRPD 118 Query: 410 SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGR-EVKVAECDSKEQDDEVSSDYRVT 586 SAWLLFGVC CL S G ++K CD + +EVSS YR+T Sbjct: 119 SAWLLFGVCACLAPPLLLADVNTEVPSAEHDSDGGGGDLKGPGCD---EQNEVSSAYRIT 175 Query: 587 GVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGD 766 GVLADGRCLFRA+AHGACL NGE AP+ENRQ ELAD WF+EGD Sbjct: 176 GVLADGRCLFRAIAHGACLMNGEEAPDENRQRELADELRAQVVEELMKRREETEWFIEGD 235 Query: 767 FDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEY-RNDEEI 943 FDAYV RIQQPF WGGEPELLMASHVLKTPISVFMRD SS DLVNIAKYGEEY ++EI Sbjct: 236 FDAYVTRIQQPFVWGGEPELLMASHVLKTPISVFMRDRSSGDLVNIAKYGEEYITKEKEI 295 Query: 944 SINVLFHRYGHYDILETS 997 +INVLFH YGHYDILE S Sbjct: 296 AINVLFHGYGHYDILEIS 313 >XP_010032108.1 PREDICTED: OTU domain-containing protein At3g57810 [Eucalyptus grandis] KCW51502.1 hypothetical protein EUGRSUZ_J01018 [Eucalyptus grandis] Length = 314 Score = 316 bits (809), Expect = e-103 Identities = 178/320 (55%), Positives = 200/320 (62%), Gaps = 11/320 (3%) Frame = +2 Query: 68 MLGVLCATRPKPWIFS--FLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHSSSC 232 MLGVLCA RPKPWI + F HAS+A S+A+ R RR HSSSC Sbjct: 1 MLGVLCA-RPKPWILASCFSHASAAHHCGRLAWVSAAAARLQLAADSPDRWRRRHHSSSC 59 Query: 233 ELRGXXXXXXX-----IWHAIMPCG-GDGFRRGVVAVHHDHELKGEGSWNVAWDARPARW 394 L G IWHAI+P G GD RR + +GEGSWNVAWDARPARW Sbjct: 60 RLGGASSCAHPCGVASIWHAILPSGEGDPPRR--MDQPRRPVFRGEGSWNVAWDARPARW 117 Query: 395 LHSPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSSD 574 LH PDSAWLLFGVC CL E +V + DS ++ S D Sbjct: 118 LHRPDSAWLLFGVCACLAPVDAAEPSREEVVP---------EARVEDRDSLDEAKRSSPD 168 Query: 575 YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 754 YRVTGVLADGRCLFRA+AH ACL+ GEAAP++NRQ ELAD W Sbjct: 169 YRVTGVLADGRCLFRAIAHCACLRKGEAAPDDNRQRELADELRAQVVAELLKRREETEWA 228 Query: 755 LEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRND 934 +EGDFDAY++RIQQP+ WGGEPELLMASHVLKTPISVFM D SS +LVN+AKYGEEYR D Sbjct: 229 IEGDFDAYIERIQQPYVWGGEPELLMASHVLKTPISVFMVDRSSGNLVNVAKYGEEYRKD 288 Query: 935 EEISINVLFHRYGHYDILET 994 EEI INVLFH YGHYDILE+ Sbjct: 289 EEIPINVLFHGYGHYDILES 308 >OMO50984.1 Ovarian tumor, otubain [Corchorus olitorius] Length = 327 Score = 315 bits (806), Expect = e-102 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 232 MLGVLCA PKPWI + L H +A H + P F+ D R R HS++C Sbjct: 1 MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFAHISADNRRCRHHSTAC 59 Query: 233 ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 412 L G IWHAI+PCGG G R V + E KGEGSWNVAWDARPARWLH PDS Sbjct: 60 RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119 Query: 413 AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 571 AWLLFGVC CL EG E+ S ++ +SS Sbjct: 120 AWLLFGVCACLAPMIEFVDVNPETDDKI----EGAELISINGLSADEKSSISSSPVAAPD 175 Query: 572 DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 751 +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD W Sbjct: 176 NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235 Query: 752 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 931 F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M SS +L+ IA YGEEY+ Sbjct: 236 FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295 Query: 932 DEEISINVLFHRYGHYDILET 994 D+E INVLFH YGHYDILE+ Sbjct: 296 DKETPINVLFHGYGHYDILES 316 >OMO98833.1 Ovarian tumor, otubain [Corchorus capsularis] Length = 327 Score = 314 bits (804), Expect = e-102 Identities = 171/321 (53%), Positives = 197/321 (61%), Gaps = 12/321 (3%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFL----HASSAARLAHGTAYSSASPRFSRPGHDGAR-RQHSSSC 232 MLGVLCA PKPWI + L H +A H + P F+ D R R HS++C Sbjct: 1 MLGVLCARPPKPWILNSLSLVAHGGGSAAHHHDSRLLHW-PHFADLSADNRRCRHHSTAC 59 Query: 233 ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 412 L G IWHAI+PCGG G R V + E KGEGSWNVAWDARPARWLH PDS Sbjct: 60 RLGGSDGGAASIWHAILPCGGSGRGRKREEVWKNVERKGEGSWNVAWDARPARWLHRPDS 119 Query: 413 AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS------- 571 AWLLFGVC CL EG E+ S ++ +SS Sbjct: 120 AWLLFGVCACLAPMIEFVDVNPETDDKI----EGTELISINGLSADEKSSISSSPVAAPD 175 Query: 572 DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXW 751 +Y+VTGVLADGRCLFRA+AHGACL++GE AP+E RQ ELAD W Sbjct: 176 NYKVTGVLADGRCLFRAIAHGACLRSGEEAPDETRQRELADELRAQVVNELLKRREETEW 235 Query: 752 FLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGEEYRN 931 F+EGDFDAYVK IQQP+ WGGEPELLMASHVLKTPISV+M SS +L+ IA YGEEY+ Sbjct: 236 FIEGDFDAYVKEIQQPYVWGGEPELLMASHVLKTPISVYMIHRSSRNLIKIADYGEEYQK 295 Query: 932 DEEISINVLFHRYGHYDILET 994 D+E INVLFH YGHYDILE+ Sbjct: 296 DKETPINVLFHGYGHYDILES 316 >XP_018845374.1 PREDICTED: uncharacterized protein LOC109009371 [Juglans regia] Length = 328 Score = 313 bits (803), Expect = e-101 Identities = 181/342 (52%), Positives = 207/342 (60%), Gaps = 33/342 (9%) Frame = +2 Query: 68 MLGVLCATRPKPWIF-----SFLHASSAARLAHGTAYSSASPRFSRPGHDG---ARRQHS 223 MLGVLCA RPKPWI SF+H S+A G S PG +G RR HS Sbjct: 1 MLGVLCA-RPKPWILTSLSSSFVHGSAAHHHITGLRQS--------PGFNGDLKPRRHHS 51 Query: 224 SSCELRGXXXXXXX-IWHAIMPCGGDGFRRGVVAVHHD---HELKGEGSWNVAWDARPAR 391 S+C + G IWHAIMPCG G ++ + E +GEGSWNVAWDARPAR Sbjct: 52 SACRIDGSFGGGAASIWHAIMPCGAAGHPSDLLLRRNAMLRRERRGEGSWNVAWDARPAR 111 Query: 392 WLHSPD-SAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKE----QD 556 WLH PD SAWLLFGVC CL E K+ CDS + ++ Sbjct: 112 WLHRPDYSAWLLFGVCACLAPLDFAFDDSPEAIVV--------EAKIEACDSIDSNANKN 163 Query: 557 DEV----------------SSDYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGEL 688 DE+ S+DYRVTGVLADGRCLFRALAHGAC ++GE AP+ENRQ EL Sbjct: 164 DEIDGFDAIYSNTSKPKEGSADYRVTGVLADGRCLFRALAHGACSRSGEEAPDENRQREL 223 Query: 689 ADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVF 868 AD WF+EGDFDAYV+RIQQPF WGGEPELLMASHVLKTPISVF Sbjct: 224 ADELRAQVVDELLKRRKETEWFIEGDFDAYVERIQQPFVWGGEPELLMASHVLKTPISVF 283 Query: 869 MRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILET 994 M++ SS LVNIAKYGEEYR +E+ INVLFH YGHYD+LE+ Sbjct: 284 MKNRSSGRLVNIAKYGEEYRKEEDSPINVLFHGYGHYDLLES 325 >KHN37847.1 OTU domain-containing protein [Glycine soja] Length = 234 Score = 310 bits (793), Expect = e-101 Identities = 154/236 (65%), Positives = 171/236 (72%) Frame = +2 Query: 296 DGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDSAWLLFGVCDCLXXXXXXXXXX 475 DGFRRGVVA H ++KGEGSWNVAWDARPARWLH PDSAWLLFGVC CL Sbjct: 8 DGFRRGVVAFH---DMKGEGSWNVAWDARPARWLHRPDSAWLLFGVCACLAPPSSCVDAD 64 Query: 476 XXXXXXXXXSSEGREVKVAECDSKEQDDEVSSDYRVTGVLADGRCLFRALAHGACLKNGE 655 S D + ++DEVS+DYRVTGV ADGRCLFRA+AHGACL+NGE Sbjct: 65 TNTDAIAVDES------CRLLDKEREEDEVSADYRVTGVPADGRCLFRAIAHGACLRNGE 118 Query: 656 AAPNENRQGELADXXXXXXXXXXXXXXXXXXWFLEGDFDAYVKRIQQPFAWGGEPELLMA 835 AP+ENRQ ELAD WF+EGDFD Y++RIQQP+ WGGEPELLMA Sbjct: 119 KAPDENRQRELADELRAKVVDELLKRREETEWFIEGDFDTYLQRIQQPYVWGGEPELLMA 178 Query: 836 SHVLKTPISVFMRDTSSIDLVNIAKYGEEYRNDEEISINVLFHRYGHYDILETS*P 1003 SHVLKTPISVFMRDT S++LVNIAKYGEEYRND++ISINVLFH YGHYDILET P Sbjct: 179 SHVLKTPISVFMRDTGSVELVNIAKYGEEYRNDKDISINVLFHGYGHYDILETLRP 234 >GAU40884.1 hypothetical protein TSUD_40590 [Trifolium subterraneum] Length = 266 Score = 310 bits (794), Expect = e-101 Identities = 165/272 (60%), Positives = 180/272 (66%), Gaps = 11/272 (4%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASS----AARLAHGT-AYSSASPRFSRPGHDGARRQHSSSC 232 MLGVLCATR +PWIFSFLH+SS ARLAH T + SS P FS ARR HSS C Sbjct: 1 MLGVLCATRSRPWIFSFLHSSSHHNHTARLAHATVSASSLCPTFS------ARRNHSSQC 54 Query: 233 ELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLHSPDS 412 +L+ IWHAIMPCGGDG ++G VHHDHELKGEGSWNVAWDARPARWLH DS Sbjct: 55 KLQISTGGAASIWHAIMPCGGDGLQQGGFMVHHDHELKGEGSWNVAWDARPARWLHRSDS 114 Query: 413 AWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEG---REVK-VAECDSKEQDDEVSS--D 574 AWLLFGVC CL E RE+K + + +S + DE+SS D Sbjct: 115 AWLLFGVCACLAPPVDVEAEVPPLTTSVISPDENYKRREIKDIKDAESDKPSDELSSEAD 174 Query: 575 YRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXXXXXWF 754 YRVTGVLADGRCLFRA+AHGACLKNGE APNENRQ ELAD WF Sbjct: 175 YRVTGVLADGRCLFRAIAHGACLKNGEEAPNENRQRELADELRAKVAEELLKRRKETEWF 234 Query: 755 LEGDFDAYVKRIQQPFAWGGEPELLMASHVLK 850 +EGDFD YV RIQQ F WGGEPELLMASHVLK Sbjct: 235 IEGDFDTYVTRIQQTFVWGGEPELLMASHVLK 266 >EOY19029.1 Cysteine proteinases superfamily protein isoform 1 [Theobroma cacao] Length = 327 Score = 310 bits (794), Expect = e-100 Identities = 171/325 (52%), Positives = 197/325 (60%), Gaps = 16/325 (4%) Frame = +2 Query: 68 MLGVLCATRPKPWIFSFLHASSAARLAHG--TAYSSASPRFSRPGH-------DGARRQH 220 MLGVLCA PKPWI + S + +AHG A+ S P H D R H Sbjct: 1 MLGVLCARPPKPWILN-----SLSLIAHGGLAAHHHDSRLVEWPTHFADLSADDRRCRHH 55 Query: 221 SSSCELRGXXXXXXXIWHAIMPCGGDGFRRGVVAVHHDHELKGEGSWNVAWDARPARWLH 400 S++C L G IWHAI+PCGG G R V + E KGEGSWNVAWDARPARWLH Sbjct: 56 STACRLGGSDGGAASIWHAILPCGGGGGGRRRGEVWKNVERKGEGSWNVAWDARPARWLH 115 Query: 401 SPDSAWLLFGVCDCLXXXXXXXXXXXXXXXXXXXSSEGREVKVAECDSKEQDDEVSS--- 571 PDSAWLLFGVC CL EG E+ + S ++ SS Sbjct: 116 RPDSAWLLFGVCACLAPMIEFVDVNPDADDKI----EGAELNLVSRLSADEKSSSSSSSV 171 Query: 572 ----DYRVTGVLADGRCLFRALAHGACLKNGEAAPNENRQGELADXXXXXXXXXXXXXXX 739 + +VTGVLADGRCLFRA+AHGACL++GE AP+EN Q ELAD Sbjct: 172 AAADNCKVTGVLADGRCLFRAIAHGACLRSGEDAPDENHQRELADELRAQVVNELLKRRE 231 Query: 740 XXXWFLEGDFDAYVKRIQQPFAWGGEPELLMASHVLKTPISVFMRDTSSIDLVNIAKYGE 919 WF+EGDFDAYVK IQQP+ WGGEPE+LMASHVLKTPISV+M SS +L IAKYGE Sbjct: 232 ETEWFIEGDFDAYVKEIQQPYVWGGEPEILMASHVLKTPISVYMIPRSSSNLTKIAKYGE 291 Query: 920 EYRNDEEISINVLFHRYGHYDILET 994 EY+ D+E INVLFH YGHYDILE+ Sbjct: 292 EYQKDKENPINVLFHGYGHYDILES 316