BLASTX nr result
ID: Perilla23_contig00006507
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00006507 (1113 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_012835924.1| PREDICTED: uncharacterized protein LOC105956...   431   e-118
ref|XP_009622353.1| PREDICTED: uncharacterized protein LOC104113...   340   1e-90
ref|XP_009783658.1| PREDICTED: uncharacterized protein LOC104232...   331   6e-88
ref|XP_004240360.1| PREDICTED: uncharacterized protein LOC101253...   325   4e-86
ref|XP_006361204.1| PREDICTED: uncharacterized protein LOC102589...   317   1e-83
ref|XP_008233625.1| PREDICTED: uncharacterized protein LOC103332...   308   4e-81
ref|XP_010264758.1| PREDICTED: uncharacterized protein LOC104602...   308   7e-81
ref|XP_007218741.1| hypothetical protein PRUPE_ppa009184mg [Prun...   308   7e-81
ref|XP_004142760.2| PREDICTED: uncharacterized protein LOC101214...   307   9e-81
ref|XP_008458875.1| PREDICTED: uncharacterized protein LOC103498...   306   2e-80
gb|KHG26784.1| Cruciferin BnC2 [Gossypium arboreum]                   305   4e-80
ref|XP_012089216.1| PREDICTED: uncharacterized protein LOC105647...   303   2e-79
ref|XP_006435534.1| hypothetical protein CICLE_v10032209mg [Citr...   302   4e-79
ref|XP_012460357.1| PREDICTED: uncharacterized protein LOC105780...   301   6e-79
gb|KDO69311.1| hypothetical protein CISIN_1g021888mg [Citrus sin...   301   8e-79
ref|XP_010256928.1| PREDICTED: uncharacterized protein LOC104597...   300   1e-78
ref|XP_008371702.1| PREDICTED: uncharacterized protein LOC103435...   300   1e-78
ref|XP_012456445.1| PREDICTED: uncharacterized protein LOC105777...   299   2e-78
ref|XP_007009333.1| Uncharacterized protein isoform 1 [Theobroma...
298 4e-78 ref|XP_003518643.1| PREDICTED: uncharacterized protein LOC100780... 298 5e-78 >ref|XP_012835924.1| PREDICTED: uncharacterized protein LOC105956617 [Erythranthe guttatus] gi|604334357|gb|EYU38441.1| hypothetical protein MIMGU_mgv1a011446mg [Erythranthe guttata] Length = 281 Score = 431 bits (1108), Expect = e-118 Identities = 217/283 (76%), Positives = 240/283 (84%), Gaps = 1/283 (0%) Frame = -3 Query: 1027 MMSRAPVAIGGGGMNLMVNNWLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESW 848 M + PVAIGGGGMNL+ NNWL R GSG DFSCRSTGSEHDLA MVSDFLEIGSAG++SW Sbjct: 1 MRNLTPVAIGGGGMNLVGNNWLHRRGSGVDFSCRSTGSEHDLAAMVSDFLEIGSAGSDSW 60 Query: 847 CSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSISETYRVEKPGAC-DAS 671 CSSD DSG SDLA DRIS+Y VDQYE DL MVVKSLILSISET ++EKP AC ++S Sbjct: 61 CSSDCDSGFSDLA---DRISMYKQPVDQYERDLTMVVKSLILSISETSQIEKPDACINSS 117 Query: 670 CILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFR 491 CI+YSLVKLL SSGYDAA+C +KWQ GK+PGGEHEFIDVIAH+N GGS+RYIIDIDFR Sbjct: 118 CIIYSLVKLLHSSGYDAALCKSKWQVFGKLPGGEHEFIDVIAHENKSGGSERYIIDIDFR 177 Query: 490 SHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSY 311 SHFQIARAVK YN++LSSLPAIYVGT+ KLKQLLQ+M EAA+ SL+QNSMP PPWRS SY Sbjct: 178 SHFQIARAVKSYNVVLSSLPAIYVGTLTKLKQLLQIMAEAAKYSLEQNSMPFPPWRSFSY 237 Query: 310 LEAKWESPCQRIVNSQADPLSSSHHHCIGLLQRLKSFVLSDIK 182 LEAKWESPCQR V + SSSH HCIGLL+RLKSFV SD K Sbjct: 238 LEAKWESPCQRFVTLNSAASSSSHQHCIGLLRRLKSFVGSDFK 280 >ref|XP_009622353.1| PREDICTED: uncharacterized protein LOC104113771 [Nicotiana tomentosiformis] Length = 330 Score = 340 bits (872), Expect = 1e-90 Identities = 186/303 (61%), Positives = 217/303 (71%), Gaps = 11/303 (3%) Frame = -3 Query: 1054 NSRKRRGPEMMSRAPVAIGGGG-------MNLMVNNWLKRSGSGGDFSCRSTGSEHDLAV 896 +S +RRGP + + GGGG + NWL R+GSGG S SE DLA Sbjct: 2 SSHERRGPVAVVGSGGGGGGGGGGGGGGELYWTAGNWLNRTGSGGGGGY-SHESEPDLAA 60 Query: 895 MVSDFLEIGSAGAESWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSI 716 MVSDFLE S GAES SSD DSG SDLA L DRISL+ HSVDQYESDL MVV SLILS+ Sbjct: 61 
MVSDFLESSSVGAESRYSSDNDSGFSDLALLADRISLHKHSVDQYESDLTMVVHSLILSL 120 Query: 715 SETYRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDN 536 E+ + KP C+ASCI +LVK LQS GYDAA+C+TKWQ GK+PGGEHE+I+VI+ N Sbjct: 121 GESCHLSKPETCNASCIRSNLVKFLQSCGYDAALCSTKWQGCGKIPGGEHEYIEVISRGN 180 Query: 535 GGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSL 356 G S+RYIIDIDFRSHF+IARAVK YN++LS LP +YVGT+ KLKQ LQ MVEAAR SL Sbjct: 181 -DGCSERYIIDIDFRSHFEIARAVKSYNVVLSCLPPVYVGTVRKLKQYLQTMVEAARCSL 239 Query: 355 DQNSMPLPPWRSLSYLEAKWESPCQRIVNSQAD----PLSSSHHHCIGLLQRLKSFVLSD 188 QNSMPLPPWRSL+YLEAKWES QR+ N Q P +SSH HC LL R+KS + S+ Sbjct: 240 KQNSMPLPPWRSLAYLEAKWESSSQRVANFQVQSSIGPSNSSHQHCTELLWRIKSSIGSE 299 Query: 187 IKS 179 I + Sbjct: 300 INA 302 >ref|XP_009783658.1| PREDICTED: uncharacterized protein LOC104232213 [Nicotiana sylvestris] Length = 321 Score = 331 bits (849), Expect = 6e-88 Identities = 181/296 (61%), Positives = 215/296 (72%), Gaps = 5/296 (1%) Frame = -3 Query: 1051 SRKRRGPEMMSRAPVAIGGGGMNLMVNNWLKRSGS-GGDFSCRSTGSEHDLAVMVSDFLE 875 S +RRGP + + GGG + NWL R+GS GG +S S E DLA MVSDFLE Sbjct: 3 SHERRGPVAVGSSG-GNGGGELFWTAGNWLNRTGSVGGGYSHES---EPDLAAMVSDFLE 58 Query: 874 IGSAGAESWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSISETYRVE 695 S GAES SSD DSG SDLA L DRISL+ HSVDQYESDL MVV SLILS+ E+ + Sbjct: 59 SSSVGAESRYSSDNDSGFSDLALLADRISLHKHSVDQYESDLTMVVHSLILSLGESCYLS 118 Query: 694 KPGACDASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKR 515 KP C+ASCI +LVKLLQS GY AA+C+TKWQ GK+PGGEHE+I+VI+H+N G S+R Sbjct: 119 KPETCNASCIRSNLVKLLQSCGYAAALCSTKWQGCGKIPGGEHEYIEVISHEN-DGCSER 177 Query: 514 YIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPL 335 YIIDIDFRSHF+IARAVK YN++L+ LP +YVGT+ KLK LQ MVEAA+ SL QNSMPL Sbjct: 178 YIIDIDFRSHFEIARAVKSYNVVLNCLPPVYVGTVRKLKLYLQAMVEAAKCSLKQNSMPL 237 Query: 334 PPWRSLSYLEAKWESPCQRIVNSQADP----LSSSHHHCIGLLQRLKSFVLSDIKS 179 PPWRSL+YLE+KWES QR+ N Q SSH HC LL R+KS + S+ K+ Sbjct: 238 PPWRSLAYLESKWESSSQRVSNFQVQSNIGHTKSSHQHCTELLWRIKSSIESESKA 293 >ref|XP_004240360.1| 
PREDICTED: uncharacterized protein LOC101253593 isoform X1 [Solanum lycopersicum] Length = 312 Score = 325 bits (833), Expect = 4e-86 Identities = 175/283 (61%), Positives = 204/283 (72%), Gaps = 3/283 (1%) Frame = -3 Query: 1018 RAPVAIGGGGMNLMVNNWLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSS 839 R PV + GGG NWL R G C S SE DLA MVSDFLE SAGAES CSS Sbjct: 7 RFPVVVNGGGGG---GNWLNRGDGVGSGGCYSHESEPDLAAMVSDFLESSSAGAESRCSS 63 Query: 838 DTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSISETYRVEKPGACDASCILY 659 D DSG SDLA L D ISLY +SVD+YESDL MVV SLILS++E++ KP C+ASCI Sbjct: 64 DNDSGYSDLALLADTISLYKNSVDRYESDLTMVVHSLILSMTESFHNGKPETCNASCIRS 123 Query: 658 SLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQ 479 LVKLLQS G++A +CA KWQ GK+PGGEHE+I+VI+H N G S+RYIID+DFRSHF+ Sbjct: 124 YLVKLLQSCGFNADMCAIKWQGCGKIPGGEHEYIEVISHGN-DGCSERYIIDLDFRSHFE 182 Query: 478 IARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAK 299 IARAVK YN++LS LP +YVGT+ KLK LQ MVEAA+ SL QNSMPLPPWRSL+YLEAK Sbjct: 183 IARAVKSYNVVLSCLPPVYVGTVTKLKLYLQAMVEAAKCSLKQNSMPLPPWRSLAYLEAK 242 Query: 298 WESP---CQRIVNSQADPLSSSHHHCIGLLQRLKSFVLSDIKS 179 WES V S +SSH HC LL R+KS + S+IK+ Sbjct: 243 WESSHKVANVQVQSSVSSSNSSHRHCTELLWRIKSCIGSEIKA 285 >ref|XP_006361204.1| PREDICTED: uncharacterized protein LOC102589081 [Solanum tuberosum] Length = 312 Score = 317 bits (812), Expect = 1e-83 Identities = 174/291 (59%), Positives = 206/291 (70%), Gaps = 4/291 (1%) Frame = -3 Query: 1039 RGPEMMSRAPVAIGGGGMNLMVNNWLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAG 860 R + S V GGGG +WL R G C S SE DLA MVSDFLE SAG Sbjct: 2 RNNKRRSLVVVDDGGGG-----GDWLNRGDGVGSGGCYSHESEPDLAAMVSDFLESSSAG 56 Query: 859 AESWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSISETYRVEKPGAC 680 AES SD D G SDLA L D ISLY +SVD+YESDL MVV SLILS++E++ + KP C Sbjct: 57 AESRYISDNDPGYSDLALLADTISLYKNSVDRYESDLTMVVHSLILSMTESFHIGKPETC 116 Query: 679 DASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDI 500 +ASCI LVKLLQS G++A +CATKWQ GK+PGGEHE+I+VI+H N G S+RYIID+ Sbjct: 117 
NASCIRSYLVKLLQSCGFNADMCATKWQGCGKIPGGEHEYIEVISHGN-DGCSERYIIDL 175 Query: 499 DFRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRS 320 DFRSHF+IARAVK YN++LS LP +YVGT+ KLK LQ MVEAA+ SL QNSMPLPPWRS Sbjct: 176 DFRSHFEIARAVKSYNVVLSCLPPVYVGTVTKLKLYLQAMVEAAKCSLKQNSMPLPPWRS 235 Query: 319 LSYLEAKWESPCQRIVN----SQADPLSSSHHHCIGLLQRLKSFVLSDIKS 179 L+YLEAKWES R+ N S +SSH HC LL R+KS + S+IK+ Sbjct: 236 LAYLEAKWES-SHRVANVQVQSSVSSSNSSHRHCTELLWRIKSCIGSEIKA 285 >ref|XP_008233625.1| PREDICTED: uncharacterized protein LOC103332654 [Prunus mume] Length = 303 Score = 308 bits (790), Expect = 4e-81 Identities = 162/270 (60%), Positives = 201/270 (74%), Gaps = 5/270 (1%) Frame = -3 Query: 967 WLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSSDTDSGLSDLAYLTDRIS 788 W+ R GG+FS S EHDLA+MV+DF E GSAGAESWCSSD+DS +SDLA+L D+I Sbjct: 13 WM-RGQIGGNFSHES---EHDLALMVTDFWENGSAGAESWCSSDSDSAISDLAHLADKIP 68 Query: 787 LYIHSVDQYESDLMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVC 611 Y SV QYE DL VV SLILSISE K G C+ASC+ +SLVKLL+ SGYDAAVC Sbjct: 69 FYKRSVAQYEKDLTSVVHSLILSISENDLHFVKSGPCNASCLKFSLVKLLRLSGYDAAVC 128 Query: 610 ATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLP 431 +WQ GKVPGG+HE+IDV+ ++N G S+R IID+DFRSHF+IARAV Y+ +L+SLP Sbjct: 129 VARWQGSGKVPGGDHEYIDVVNYNNSGS-SERLIIDLDFRSHFEIARAVHSYDRILNSLP 187 Query: 430 AIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQAD-- 257 +YVG++ ++KQ LQVMVEAARSSL QNSMPLPPWRSL+YL+AKW+SP QR N + Sbjct: 188 VVYVGSLTRMKQFLQVMVEAARSSLKQNSMPLPPWRSLAYLQAKWQSPYQREFNLDEENA 247 Query: 256 --PLSSSHHHCIGLLQRLKSFVLSDIKSGR 173 SS H C G L+ L+S + S+I++ R Sbjct: 248 NGAYSSDHKQCSGQLKMLQSLLQSEIEADR 277 >ref|XP_010264758.1| PREDICTED: uncharacterized protein LOC104602674, partial [Nelumbo nucifera] Length = 368 Score = 308 bits (788), Expect = 7e-81 Identities = 158/253 (62%), Positives = 196/253 (77%), Gaps = 5/253 (1%) Frame = -3 Query: 916 SEHDLAVMVSDFLEIGSAGAESWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVV 737 SEHDLAVMV+DFLE GS+GA+S CSSD+DSG SDLA+L +RISLY H VDQYESDL+ V Sbjct: 92 
SEHDLAVMVTDFLENGSSGADSRCSSDSDSGFSDLAHLAERISLYRHKVDQYESDLLSTV 151 Query: 736 KSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEF 560 S++LSI+ET K C+ASCI +SL KLL+S GYDAAVC +KWQ GKVPGG+HE+ Sbjct: 152 HSILLSINETDLHTVKSSPCNASCIRFSLAKLLRSYGYDAAVCVSKWQGSGKVPGGDHEY 211 Query: 559 IDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVM 380 IDV+ H + G S+R IIDIDFRSHF+IARAV+ Y+ +L S+P +Y+G+++KL+Q LQVM Sbjct: 212 IDVVIHRDTGN-SERLIIDIDFRSHFEIARAVESYSAVLKSIPVVYLGSLSKLRQFLQVM 270 Query: 379 VEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQA----DPLSSSHHHCIGLLQR 212 VEAARSSL QNSMPLPPWRS +YL+AKW S QR +N D S H CIG L+R Sbjct: 271 VEAARSSLKQNSMPLPPWRSFAYLQAKWHSAYQRKLNPDEQGIHDRSYSDHKQCIGHLRR 330 Query: 211 LKSFVLSDIKSGR 173 LKS + S+I++ R Sbjct: 331 LKSSLQSEIEAER 343 >ref|XP_007218741.1| hypothetical protein PRUPE_ppa009184mg [Prunus persica] gi|462415203|gb|EMJ19940.1| hypothetical protein PRUPE_ppa009184mg [Prunus persica] Length = 303 Score = 308 bits (788), Expect = 7e-81 Identities = 162/270 (60%), Positives = 200/270 (74%), Gaps = 5/270 (1%) Frame = -3 Query: 967 WLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSSDTDSGLSDLAYLTDRIS 788 W+ R GG+FS S EHDLA+MV+DF E GSAGAESWCSSD+DS LSDLA+L D+I Sbjct: 13 WM-RGQIGGNFSHES---EHDLALMVTDFWENGSAGAESWCSSDSDSALSDLAHLADKIP 68 Query: 787 LYIHSVDQYESDLMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVC 611 Y SV QYE DL VV SLILSISE K G C+ASC+ +SLVKLL+ SGYDAAVC Sbjct: 69 FYKRSVAQYEKDLTSVVHSLILSISENDLHFVKSGPCNASCLKFSLVKLLRLSGYDAAVC 128 Query: 610 ATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLP 431 +WQ GKVPGG+HE++DV+ +N G S+R IID+DFRSHF+IARAV Y+ +L+SLP Sbjct: 129 VARWQGSGKVPGGDHEYVDVVNFNNSGS-SERLIIDLDFRSHFEIARAVHSYDRILNSLP 187 Query: 430 AIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQAD-- 257 +YVG++ ++KQ LQVMVEAARSSL QNSMPLPPWRSL+YL+AKW+SP QR N + Sbjct: 188 VVYVGSLTRMKQFLQVMVEAARSSLKQNSMPLPPWRSLAYLQAKWQSPYQREFNLDEENA 247 Query: 256 --PLSSSHHHCIGLLQRLKSFVLSDIKSGR 173 SS H C G L+ L+S + S+I++ R Sbjct: 248 NGAYSSDHKQCSGQLKMLQSLLQSEIEADR 277 
>ref|XP_004142760.2| PREDICTED: uncharacterized protein LOC101214727 [Cucumis sativus] gi|700196138|gb|KGN51315.1| hypothetical protein Csa_5G517130 [Cucumis sativus] Length = 311 Score = 307 bits (787), Expect = 9e-81 Identities = 160/289 (55%), Positives = 211/289 (73%), Gaps = 10/289 (3%) Frame = -3 Query: 1009 VAIGGGGMNLMVNNWLK---RSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSS 839 V + GG + W+K R G G S SEHDLA+MVSDFLE GS G +SWCSS Sbjct: 5 VCVAGGDL------WVKVGARVGGVGQMGGFSHESEHDLALMVSDFLENGSGGGDSWCSS 58 Query: 838 DTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSISET-YRVEKPGACDASCIL 662 D+DSG+SDLA+L ++I Y + V QYESDL+ VV SL LS++E + K G C+ASCI Sbjct: 59 DSDSGVSDLAHLAEKIVFYKNPVSQYESDLLSVVHSLTLSMNEKDLNMNKAGPCNASCIR 118 Query: 661 YSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHF 482 + LVKLL+ SGYDAAVC T+WQ GKVPGG+HE+IDV+ + +G S+R I+DIDFRSHF Sbjct: 119 FVLVKLLRRSGYDAAVCTTRWQGAGKVPGGDHEYIDVVNYTSGS--SERLIVDIDFRSHF 176 Query: 481 QIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEA 302 +IARAV+ Y+ +L+SLP IYVG++ +LK LQ+MVEAA+SSL NSMPLPPWRSL+YL+A Sbjct: 177 EIARAVESYDRILNSLPVIYVGSLPRLKHFLQIMVEAAKSSLKLNSMPLPPWRSLAYLQA 236 Query: 301 KWESPCQRIVNSQ------ADPLSSSHHHCIGLLQRLKSFVLSDIKSGR 173 KW+SPCQR+++ + + + SH CIG L+RL+S + S+I++ R Sbjct: 237 KWQSPCQRMLHPEEQQQLGSRDMLMSHKQCIGHLKRLQSVLQSEIETDR 285 >ref|XP_008458875.1| PREDICTED: uncharacterized protein LOC103498149 [Cucumis melo] Length = 311 Score = 306 bits (784), Expect = 2e-80 Identities = 158/286 (55%), Positives = 210/286 (73%), Gaps = 7/286 (2%) Frame = -3 Query: 1009 VAIGGGGMNLMVNNWLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSSDTD 830 + + GG + + V + G G FS S EHDLA+MVSDFLE GS G ESWCSSD+D Sbjct: 5 ICVAGGDLWVKVGGRVGGVGQMGGFSHES---EHDLALMVSDFLENGSGGGESWCSSDSD 61 Query: 829 SGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSIS-ETYRVEKPGACDASCILYSL 653 SG+SDL +L ++I Y + V QYESDL+ VV SL LS++ + + K G C+ASCI + L Sbjct: 62 SGVSDLVHLAEKIVFYKNPVSQYESDLLSVVHSLTLSMNAKDLNMNKTGPCNASCIRFVL 121 Query: 652 
VKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIA 473 VKLL+ SGYDAAVC T+WQ GKVPGG+HE+IDV+ + +G S+R IIDIDFRSHF+IA Sbjct: 122 VKLLRRSGYDAAVCTTRWQGAGKVPGGDHEYIDVVNYTSGS--SERLIIDIDFRSHFEIA 179 Query: 472 RAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWE 293 RAV+ Y+ +L+SLP IYVG++++LK LQ+MVEAA+SSL NSMPLPPWRSL+YL+AKW+ Sbjct: 180 RAVESYDRILNSLPVIYVGSLSRLKHFLQIMVEAAKSSLKLNSMPLPPWRSLAYLQAKWQ 239 Query: 292 SPCQRIVNSQ------ADPLSSSHHHCIGLLQRLKSFVLSDIKSGR 173 SPCQR+++ + A + SH CIG L+RL+S + S+ ++ R Sbjct: 240 SPCQRMLHPEEQQQLGAKDMLMSHKQCIGHLKRLQSVLQSETETDR 285 >gb|KHG26784.1| Cruciferin BnC2 [Gossypium arboreum] Length = 307 Score = 305 bits (781), Expect = 4e-80 Identities = 160/255 (62%), Positives = 195/255 (76%), Gaps = 7/255 (2%) Frame = -3 Query: 916 SEHDLAVMVSDFLE--IGSAGAESWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMM 743 SEHDLA+MVSDFLE GSAGA+SWCSSD+DSG SDL +L D+IS Y H V QYE DL Sbjct: 30 SEHDLALMVSDFLENNAGSAGADSWCSSDSDSGFSDLIHLADKISYYKHPVGQYEIDLSS 89 Query: 742 VVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEH 566 VV SL+ SISET K G C+ASCI YSLVKLL+ SGYDAAVCA++WQ GKVPGG+H Sbjct: 90 VVHSLVFSISETDLHFVKSGQCNASCIRYSLVKLLRLSGYDAAVCASRWQGSGKVPGGDH 149 Query: 565 EFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQ 386 E+IDV+ ++NG S+R IIDIDFRSHF+IARAV Y +L+SLP +YVG++ +LKQLLQ Sbjct: 150 EYIDVVNYNNGC--SERLIIDIDFRSHFEIARAVDSYGRILNSLPVVYVGSLTRLKQLLQ 207 Query: 385 VMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQADPLS----SSHHHCIGLL 218 VMV+AARSSL QNSMP PPWRSL+YL+AKW SP QR +S S+H C G L Sbjct: 208 VMVDAARSSLKQNSMPFPPWRSLAYLQAKWHSPYQRKFTPDEHNISGNILSAHKQCNGHL 267 Query: 217 QRLKSFVLSDIKSGR 173 +RL+S + S++++ R Sbjct: 268 RRLQSSLQSELEAER 282 >ref|XP_012089216.1| PREDICTED: uncharacterized protein LOC105647655 isoform X1 [Jatropha curcas] gi|643739142|gb|KDP44956.1| hypothetical protein JCGZ_01456 [Jatropha curcas] Length = 305 Score = 303 bits (776), Expect = 2e-79 Identities = 158/264 (59%), Positives = 201/264 (76%), Gaps = 6/264 (2%) Frame = -3 Query: 946 
GGDFSCRSTGSEHDLAVMVSDFLEIG-SAGAESWCSSDTDSGLSDLAYLTDRISLYIHSV 770 GG FS S EHDLA+MVSDFLE G S+GA+SWCSSD++SGLSDL +L D+IS Y HSV Sbjct: 20 GGGFSHES---EHDLALMVSDFLENGGSSGADSWCSSDSESGLSDLHHLADKISFYRHSV 76 Query: 769 DQYESDLMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQN 593 Q+ESDL+ +V SL++SI ET + K G C+ASCI +SLVK L+ +GYDAAVCA++WQ Sbjct: 77 AQHESDLLSLVHSLVVSIKETDLHLVKSGPCNASCIRFSLVKHLRLAGYDAAVCASRWQG 136 Query: 592 DGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVGT 413 GKVPGG+HE++DV+ + N GG S+R +ID+DF+SHF+IARAV Y+ +L SLP IYVG+ Sbjct: 137 GGKVPGGDHEYVDVVTY-NSGGSSERLVIDVDFQSHFEIARAVDSYDRILKSLPVIYVGS 195 Query: 412 MAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQADPL----SS 245 M +LKQ LQVMVEAA+SSL QNSMPLPPWRSL+YL+AKW SP QR + SS Sbjct: 196 MTRLKQYLQVMVEAAKSSLKQNSMPLPPWRSLAYLQAKWHSPYQRHLTPDEQNFGSFNSS 255 Query: 244 SHHHCIGLLQRLKSFVLSDIKSGR 173 H C G L+RL+S + S+++ R Sbjct: 256 DHKQCSGHLKRLQSSLQSEMEEER 279 >ref|XP_006435534.1| hypothetical protein CICLE_v10032209mg [Citrus clementina] gi|568866283|ref|XP_006486486.1| PREDICTED: uncharacterized protein LOC102627296 [Citrus sinensis] gi|557537730|gb|ESR48774.1| hypothetical protein CICLE_v10032209mg [Citrus clementina] Length = 306 Score = 302 bits (773), Expect = 4e-79 Identities = 160/265 (60%), Positives = 199/265 (75%), Gaps = 5/265 (1%) Frame = -3 Query: 952 GSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSSDTDSGLSDLAYLTDRISLYIHS 773 G GG FS S EHDLA+MVSDFLE GSAG +S CSSD+DSG SDLA+L D+IS Y S Sbjct: 20 GIGGGFSHES---EHDLALMVSDFLENGSAGTDSLCSSDSDSGFSDLAHLADKISFYKRS 76 Query: 772 VDQYESDLMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQ 596 V QYE DL +VV SLILSI ET K C+ASCI + LVKLL+ SGYDAAVC+T+WQ Sbjct: 77 VPQYEMDLTLVVHSLILSIKETDLHAVKSDQCNASCIRFVLVKLLRLSGYDAAVCSTRWQ 136 Query: 595 NDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVG 416 GKVPGG+HE+IDV+ + N G S+R IIDIDFRS+F+IARAV Y+ +L SLP +YVG Sbjct: 137 GSGKVPGGDHEYIDVVNY-NTAGSSERLIIDIDFRSYFEIARAVDSYDRILKSLPVVYVG 195 Query: 415 
TMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQADPLSSS-- 242 ++ +LKQ LQVMV+AARSSL QNSMPLPPWRSL+YL+AKW+SP QR N ++++ Sbjct: 196 SLIRLKQFLQVMVDAARSSLSQNSMPLPPWRSLAYLQAKWQSPHQRDFNPDEQNITTTYS 255 Query: 241 --HHHCIGLLQRLKSFVLSDIKSGR 173 H C G L+RL+S + S++++ R Sbjct: 256 LDHKQCRGHLKRLQSSLHSEVEAER 280 >ref|XP_012460357.1| PREDICTED: uncharacterized protein LOC105780519 [Gossypium raimondii] gi|763809189|gb|KJB76091.1| hypothetical protein B456_012G070400 [Gossypium raimondii] Length = 307 Score = 301 bits (771), Expect = 6e-79 Identities = 157/255 (61%), Positives = 194/255 (76%), Gaps = 7/255 (2%) Frame = -3 Query: 916 SEHDLAVMVSDFLE--IGSAGAESWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMM 743 SEHDLA+MVSDFLE GSAGA+SWCSSD+DSG SDL +L D+IS Y H V QYE DL Sbjct: 30 SEHDLALMVSDFLENNAGSAGADSWCSSDSDSGFSDLIHLADKISYYKHPVGQYEIDLSS 89 Query: 742 VVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEH 566 VV SL+ SISET K G C+ SCI YSLVKLL+ SGYDAAVCA++WQ GK PGG+H Sbjct: 90 VVHSLVFSISETDLHFVKSGQCNTSCIRYSLVKLLRLSGYDAAVCASRWQGSGKFPGGDH 149 Query: 565 EFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQ 386 E+IDV+ ++NG S+R IIDIDFRSHF+IARAV Y +L+SLP +YVG++ +LKQLLQ Sbjct: 150 EYIDVVNYNNGC--SERLIIDIDFRSHFEIARAVDSYGRILNSLPVVYVGSLTRLKQLLQ 207 Query: 385 VMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQADPLS----SSHHHCIGLL 218 +MV+AARSSL QNSMP PPWRSL+YL+AKW SP QR + +S S+H C G L Sbjct: 208 LMVDAARSSLKQNSMPFPPWRSLAYLQAKWHSPYQRKFSPDEHDISGNILSAHKQCNGNL 267 Query: 217 QRLKSFVLSDIKSGR 173 +RL+S + S++++ R Sbjct: 268 RRLQSSLQSELEAER 282 >gb|KDO69311.1| hypothetical protein CISIN_1g021888mg [Citrus sinensis] Length = 306 Score = 301 bits (770), Expect = 8e-79 Identities = 160/265 (60%), Positives = 198/265 (74%), Gaps = 5/265 (1%) Frame = -3 Query: 952 GSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSSDTDSGLSDLAYLTDRISLYIHS 773 G GG FS S EHDLA+MVSDFLE GSAG +S CSSD+DSG SDLA+L D+IS Y S Sbjct: 20 GIGGGFSHES---EHDLALMVSDFLENGSAGTDSLCSSDSDSGFSDLAHLADKISFYKRS 76 Query: 772 
VDQYESDLMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQ 596 V QYE DL VV SLILSI ET K C+ASCI + LVKLL+ SGYDAAVC+T+WQ Sbjct: 77 VPQYEMDLTSVVHSLILSIKETDLHAVKSDQCNASCIRFVLVKLLRLSGYDAAVCSTRWQ 136 Query: 595 NDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVG 416 GKVPGG+HE+IDV+ + N G S+R IIDIDFRS+F+IARAV Y+ +L SLP +YVG Sbjct: 137 GSGKVPGGDHEYIDVVNY-NTAGSSERLIIDIDFRSYFEIARAVDSYDRILKSLPVVYVG 195 Query: 415 TMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQADPLSSS-- 242 ++ +LKQ LQVMV+AARSSL QNSMPLPPWRSL+YL+AKW+SP QR N ++++ Sbjct: 196 SLIRLKQFLQVMVDAARSSLSQNSMPLPPWRSLAYLQAKWQSPHQRDFNPDEQNITTTYS 255 Query: 241 --HHHCIGLLQRLKSFVLSDIKSGR 173 H C G L+RL+S + S++++ R Sbjct: 256 LDHKQCRGHLKRLQSSLHSEVEAER 280 >ref|XP_010256928.1| PREDICTED: uncharacterized protein LOC104597192 [Nelumbo nucifera] Length = 338 Score = 300 bits (769), Expect = 1e-78 Identities = 158/279 (56%), Positives = 203/279 (72%), Gaps = 12/279 (4%) Frame = -3 Query: 973 NNWLKRSGSGG-----DFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSSDTDSGLSDLA 809 N WL+ GSGG S SEHDL +MVSDFLE GS+GA+S CSSD+DS SD+A Sbjct: 36 NLWLRLVGSGGAPLVGKVDAFSHESEHDLGLMVSDFLENGSSGADSRCSSDSDSSFSDIA 95 Query: 808 YLTDRISLYIHSVDQYESDLMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSS 632 +L D+ISLY H VDQYES+L+ V SL+LSI+E K C+ASCI + L KLL+ S Sbjct: 96 HLADKISLYKHKVDQYESELLSTVHSLLLSINEKDLCFVKSSLCNASCIRFCLAKLLRLS 155 Query: 631 GYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYN 452 GYDAAVC++KWQ GKVPGG+HE+IDV+ H + G ++R IIDIDFRSHF+IARAV+ Y+ Sbjct: 156 GYDAAVCSSKWQGCGKVPGGDHEYIDVVTHHHSGN-TERLIIDIDFRSHFEIARAVESYD 214 Query: 451 LLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIV 272 +L+S+P +YVG+ +KLKQ LQV+VEAARSSL QNSMPLPPWRS +YL+AKW S QR + Sbjct: 215 AVLNSVPVVYVGSPSKLKQFLQVLVEAARSSLKQNSMPLPPWRSFAYLQAKWHSAYQRKL 274 Query: 271 N------SQADPLSSSHHHCIGLLQRLKSFVLSDIKSGR 173 N + + S H C+G L+RL+S + S+I++ R Sbjct: 275 NPDERSIPEPETCCSDHKQCMGHLKRLQSSIQSEIEAER 313 >ref|XP_008371702.1| PREDICTED: uncharacterized protein LOC103435083 [Malus domestica] Length 
= 304 Score = 300 bits (768), Expect = 1e-78 Identities = 158/270 (58%), Positives = 200/270 (74%), Gaps = 5/270 (1%) Frame = -3 Query: 967 WLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESWCSSDTDSGLSDLAYLTDRIS 788 W+ R GG FS S EHDLA+MV+DFLE GS GAESWCSSD+DS LSDL +L D+I Sbjct: 14 WM-RGQIGGGFSHES---EHDLALMVTDFLENGSVGAESWCSSDSDSALSDLGHLADKIP 69 Query: 787 LYIHSVDQYESDLMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVC 611 + SV QYESDL VV S ILSISE +K G C++SC+ ++LVKLL+ SGYDAAVC Sbjct: 70 FFKRSVGQYESDLTSVVHSSILSISENDLNFDKSGQCNSSCLKFALVKLLRLSGYDAAVC 129 Query: 610 ATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLP 431 A +WQ GKVPGG+HE+IDV+ ++N G S+R IID+DFRSHF+IARAV+ Y +L+SLP Sbjct: 130 AARWQGSGKVPGGDHEYIDVVNYNNLGS-SERLIIDLDFRSHFEIARAVQSYGRILNSLP 188 Query: 430 AIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQRIVNSQADPL 251 +YVG++ +LKQ LQVM EAARSSL QNSMPLPPWRSL+YL+AKW+SP QR N + + Sbjct: 189 VVYVGSLTRLKQYLQVMAEAARSSLKQNSMPLPPWRSLAYLQAKWQSPYQRQFNLEEQNV 248 Query: 250 SSS----HHHCIGLLQRLKSFVLSDIKSGR 173 + S H C L+ L+S + S+I++ R Sbjct: 249 NGSYNFDHKKCSAHLKMLQSLLQSEIEAER 278 >ref|XP_012456445.1| PREDICTED: uncharacterized protein LOC105777638 [Gossypium raimondii] gi|763806986|gb|KJB73924.1| hypothetical protein B456_011G261400 [Gossypium raimondii] Length = 310 Score = 299 bits (766), Expect = 2e-78 Identities = 157/258 (60%), Positives = 196/258 (75%), Gaps = 7/258 (2%) Frame = -3 Query: 925 STGSEHDLAVMVSDFLEI--GSAGAESWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESD 752 S SEHDLA+MVSDFLE GSAGA+SWCSSD++SG SDL +L D+IS Y HSV Y+ D Sbjct: 29 SNESEHDLALMVSDFLENNGGSAGADSWCSSDSESGFSDLIHLADKISYYKHSVCHYDMD 88 Query: 751 LMMVVKSLILSISET-YRVEKPGACDASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPG 575 L+ VV SLILS+ ET K G C+ASCI YSLVKLL+ SGYDAAVC ++WQ GKVPG Sbjct: 89 LLSVVHSLILSMGETDLHTVKSGPCNASCIRYSLVKLLRLSGYDAAVCVSRWQRSGKVPG 148 Query: 574 GEHEFIDVIAHDNGGGGSKRYIIDIDFRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQ 395 G+HE+IDV+ + NG S+R IIDIDFRSHF+IARAV Y+ +L+SLP +YVG++ +LKQ Sbjct: 149 
GDHEYIDVVNYSNGN--SERVIIDIDFRSHFEIARAVDSYDRILNSLPVVYVGSLTRLKQ 206 Query: 394 LLQVMVEAARSSLDQNSMPLPPWRSLSYLEAKWESPCQR----IVNSQADPLSSSHHHCI 227 LLQ+MVEAARSSL QNSMP PPWRSL+YL+AKW SP QR + + + SS H C Sbjct: 207 LLQLMVEAARSSLKQNSMPFPPWRSLAYLQAKWYSPYQRQFAPLEHDISGNSSSCHKQCK 266 Query: 226 GLLQRLKSFVLSDIKSGR 173 G L+RL+ + S++++ R Sbjct: 267 GHLRRLQPSLQSELEAER 284 >ref|XP_007009333.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508726246|gb|EOY18143.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 316 Score = 298 bits (764), Expect = 4e-78 Identities = 162/292 (55%), Positives = 207/292 (70%), Gaps = 7/292 (2%) Frame = -3 Query: 1027 MMSRAPVAIGGGGMNLMVNNWLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEI--GSAGAE 854 MM + G N N+ S + + S SEHDLA+MVSDFLE GSAG + Sbjct: 1 MMECSVWVAAGDHKNSNSNSNNNNSNNNNNLWGLSHESEHDLALMVSDFLENNGGSAGGD 60 Query: 853 SWCSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSISET-YRVEKPGACD 677 SWCSSD++SG SDL +L+D+IS Y H V QYE DL+ VV SLILS+SET K G C+ Sbjct: 61 SWCSSDSESGFSDLLHLSDKISYYKHPVGQYEIDLLSVVHSLILSVSETDLHFVKSGPCN 120 Query: 676 ASCILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDID 497 ASCI + LVKLL+ SGYDAAVCA++WQ GKVPGG+HE+IDVI ++NG S+R IIDID Sbjct: 121 ASCIRFFLVKLLRLSGYDAAVCASRWQGSGKVPGGDHEYIDVINYNNGS--SERLIIDID 178 Query: 496 FRSHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSL 317 FRSHF+IARAV Y+ +L+SLP +YVG++ +LKQLLQ+MV+AARSSL QNSMP PPWRSL Sbjct: 179 FRSHFEIARAVDSYDRILNSLPVVYVGSLTRLKQLLQLMVDAARSSLKQNSMPFPPWRSL 238 Query: 316 SYLEAKWESPCQRIVNSQADPL----SSSHHHCIGLLQRLKSFVLSDIKSGR 173 +YL+AKW+SP QR + SS H C G L+ L++ + S++++ R Sbjct: 239 AYLQAKWQSPYQRQFTPYEHDINGNVSSDHKQCNGHLKGLQASLQSELEAER 290 >ref|XP_003518643.1| PREDICTED: uncharacterized protein LOC100780208 [Glycine max] gi|947122251|gb|KRH70457.1| hypothetical protein GLYMA_02G091500 [Glycine max] Length = 308 Score = 298 bits (763), Expect = 5e-78 Identities = 162/290 (55%), Positives = 204/290 (70%), Gaps = 5/290 (1%) Frame = -3 Query: 1027 
MMSRAPVAIGGGGMNLMVNNWLKRSGSGGDFSCRSTGSEHDLAVMVSDFLEIGSAGAESW 848 M R VA GG + W++ G G S SEHDLA+MVSDFLE GS+GAESW Sbjct: 1 MDCRVCVATGG-------DLWVRVGGGGEIGGGFSHESEHDLALMVSDFLENGSSGAESW 53 Query: 847 CSSDTDSGLSDLAYLTDRISLYIHSVDQYESDLMMVVKSLILSISET-YRVEKPGACDAS 671 CSSD+DSG SD A L +RI + SV Q+ESDL+ VV SLI S++ET +V G C AS Sbjct: 54 CSSDSDSGHSDFAQLAERIQICKLSVAQHESDLLSVVHSLIRSMNETNLQVMNSGPCYAS 113 Query: 670 CILYSLVKLLQSSGYDAAVCATKWQNDGKVPGGEHEFIDVIAHDNGGGGSKRYIIDIDFR 491 CI + LVKL++ SGYDA VCA+KWQ GKVPGG+HE+ID+I DN G S+R I+DIDFR Sbjct: 114 CIRFYLVKLMRLSGYDAGVCASKWQGSGKVPGGDHEYIDIII-DNNSGSSERLIVDIDFR 172 Query: 490 SHFQIARAVKPYNLLLSSLPAIYVGTMAKLKQLLQVMVEAARSSLDQNSMPLPPWRSLSY 311 SHF+IARAV Y+ +L+SLP +YVG+ +LKQ L +M EA RSSL QNSMPLPPWRSL+Y Sbjct: 173 SHFEIARAVDSYDRILNSLPVVYVGSFTRLKQFLGIMEEATRSSLKQNSMPLPPWRSLAY 232 Query: 310 LEAKWESPCQRIVNSQADPLSSS----HHHCIGLLQRLKSFVLSDIKSGR 173 L+AKW+SP +R +S+ + +S S H C G L+RL+S + S I+ R Sbjct: 233 LQAKWQSPYERYTHSEGNNISDSDCFDHKQCCGHLKRLQSCLQSGIEIDR 282