BLASTX nr result
ID: Rehmannia30_contig00004423
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia30_contig00004423
         (3282 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_019196472.1| PREDICTED: uncharacterized protein LOC109190...   575   0.0
dbj|BAV56701.1| transposase [Ipomoea nil]                             372   e-111
ref|XP_019150668.1| PREDICTED: uncharacterized protein LOC109147...   370   e-111
ref|XP_024185677.1| uncharacterized protein LOC112190477 [Rosa c...   366   e-109
ref|XP_019177120.1| PREDICTED: uncharacterized protein LOC109172...   365   e-108
ref|XP_011102064.1| uncharacterized protein LOC105180111 isoform...   366   e-104
ref|XP_019174725.1| PREDICTED: uncharacterized protein LOC109170...   365   e-104
ref|XP_011102062.1| uncharacterized protein LOC105180111 isoform...   366   e-104
ref|XP_011084232.1| uncharacterized protein LOC105166542 isoform...   365   e-104
ref|XP_011084230.1| uncharacterized protein LOC105166542 isoform...   365   e-104
ref|XP_011102063.1| uncharacterized protein LOC105180111 isoform...   361   e-102
ref|XP_011084231.1| uncharacterized protein LOC105166542 isoform...   360   e-102
gb|PRQ17143.1| putative Ulp1 protease family catalytic domain, p...   342   2e-99
gb|PRQ17594.1| putative Ulp1 protease family catalytic domain, p...   342   2e-99
ref|XP_012837747.1| PREDICTED: uncharacterized protein LOC105958...   347   2e-97
ref|XP_012837746.1| PREDICTED: uncharacterized protein LOC105958...   347   2e-97
ref|XP_011076941.1| uncharacterized protein LOC105161066 isoform...
344 1e-96 ref|XP_020548914.1| uncharacterized protein LOC105161066 isoform... 344 2e-96 ref|XP_011076937.1| uncharacterized protein LOC105161066 isoform... 344 2e-96 gb|PRQ20360.1| putative Ulp1 protease family catalytic domain, p... 324 2e-93 >ref|XP_019196472.1| PREDICTED: uncharacterized protein LOC109190440 [Ipomoea nil] dbj|BAV56710.1| transposase [Ipomoea nil] Length = 677 Score = 575 bits (1482), Expect = 0.0 Identities = 324/759 (42%), Positives = 462/759 (60%), Gaps = 11/759 (1%) Frame = +3 Query: 888 MAALRKLKGGHDDVQKGKADVSSDASHEEDGDSMEID-------SRDSQPRVSTRGRTHM 1046 MA RK K + QKG+ +V SD H +G +++D S+++Q STRGRT M Sbjct: 1 MAGRRKKKIVQQE-QKGQ-EVHSDEEH--NGKEVQVDEEENMSGSQETQSTRSTRGRTQM 56 Query: 1047 YRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKEL 1226 ++LA+Q+A+G+K D++ N+LGQ +G AAELQSYIGVLARE VK+ +K+WKHVP+D+K+ Sbjct: 57 HKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVKLNFKTWKHVPQDIKDK 116 Query: 1227 IWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYGISQ 1406 IW++VNL + V I+KK CLSSA KWRQYKT LT F+W GYGI Sbjct: 117 IWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLNDEENLHKPPPGYGIMG 176 Query: 1407 EDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEINRA 1586 ++W+ FVISRMSE F KLSE+QK +R N+YPHRLAR+GYA LA EI +ELCDD E+NRA Sbjct: 177 DEWSQFVISRMSEDFKKLSEQQKVQRKQNLYPHRLARKGYARLASEISTELCDDDEVNRA 236 Query: 1587 ILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHAGR 1766 ILWKKGR +K GEIEGD LK KID YI+QK++G L+++G EDILT+ALE++EH GR Sbjct: 237 ILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPNEDILTQALESKEHGGR 296 Query: 1767 VRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQGDLIKEQNVRLEKLEAI 1946 VR IGGH+ PSTYFR+ G P ++N L+ + + + R+ KLE + Sbjct: 297 VRAIGGHVNPSTYFRLGKGMLP----NHEKNVLLRRQATVED----------RVAKLENL 342 Query: 1947 FIKKYDTDNDEKASCSVKPKHQSNKDEADFVILDRKVALEGSAIPLTFKSKNEVDAYGTI 2126 ++ +V K S +++ D K A++ S + F K ++D Sbjct: 343 VLQ------------NVAFKSSSIEEKGSCTAKDAKGAMKLSEEEIGFM-KQKLD----- 384 Query: 2127 VHADEPDNFLDDEPIPTNCMYIANNQAMTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLI 2306 F 
DD+ D++ +D L Sbjct: 385 --------FEDDD--------------------------------DEL--QFIDKEDVLE 402 Query: 2307 KLQNEKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEK 2486 K +KPS K ++ +S MP+ L++ Y K L +G+S+ I LD++VFG E Sbjct: 403 KQCKKKPSKEV-----KKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEEC 457 Query: 2487 VIHVDLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTD 2663 ++V D+T FC+L IS I VYIW+LYKKM ED ++ F F+ P H+GHVPTTRTD Sbjct: 458 TLYVHDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTD 517 Query: 2664 KSYLQQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRI 2843 K++L + + +RAR LADRL + + +L PCN+G+HWILTVI+ K+ V++ DPL RI Sbjct: 518 KNFLDKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINLSKDIVYLWDPLSHRI 577 Query: 2844 RDETWRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHE 3017 RD+ W+ VV +A+K+ +A G+KG+ K WE+++AP QPD QCGF+VM Y++ ++ Sbjct: 578 RDDDWKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIEN 637 Query: 3018 CIN-TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131 + +K S++ +F++ +Y +A ID VR EWA + ++ Sbjct: 638 MPDIDDKDSVQALFQQVEYDKAVIDLVRSEWADILSSYI 676 >dbj|BAV56701.1| transposase [Ipomoea nil] Length = 677 Score = 372 bits (954), Expect = e-111 Identities = 199/415 (47%), Positives = 268/415 (64%), Gaps = 8/415 (1%) Frame = +3 Query: 888 MAALRKLKGGHDDVQKGKADVSSDASHEEDGDSMEID-------SRDSQPRVSTRGRTHM 1046 MA RK K + QKG+ +V SD H +G +++D S+++Q STRGRT M Sbjct: 1 MAGRRKKKIVQQE-QKGQ-EVHSDEEH--NGKEVQVDEEENMSGSQETQSTRSTRGRTQM 56 Query: 1047 YRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKEL 1226 ++LA+Q+A+G+K D++ N+LGQ +G AAELQSYIGVLARE VK+ +K+WKHVP+D+K+ Sbjct: 57 HKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVKLNFKTWKHVPQDIKDK 116 Query: 1227 IWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYGISQ 1406 IW++VNL + V I+KK CLSSA KWRQYKT LT F+W GYGI Sbjct: 117 IWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLNDEENLHKPPPGYGIMG 176 Query: 1407 EDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEINRA 1586 ++W+ FVISRMSE F KLSE+QK RR N+YPHRLAR+GYA LA EI +ELCDD E+NRA Sbjct: 177 
DEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLASEISTELCDDDEVNRA 236 Query: 1587 ILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHAGR 1766 ILWKKGR +K GEIEGD LK KID YI+QK++G L+++G EDILT+ALE++EH GR Sbjct: 237 ILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPNEDILTQALESKEHGGR 296 Query: 1767 VRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQGDLIKEQNVRLEKLEAI 1946 VR IGGH+ PSTYFR+ G P K ++ + ++++ +LE L Sbjct: 297 VRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQATVEDRVAKLENLVLQ 345 Query: 1947 FIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSAIPLTFKSKNEV 2108 + + +EK SC+ K + K E + + +K+ E L F K +V Sbjct: 346 NVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDDDELQFIDKEDV 400 Score = 243 bits (620), Expect = 8e-65 Identities = 120/275 (43%), Positives = 182/275 (66%), Gaps = 4/275 (1%) Frame = +3 Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498 EK K K ++ +S MP+ L++ Y K L +G+S+ I LD++VFG E ++V Sbjct: 402 EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 461 Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675 D+T FC+L IS I VYIW+LYKKM ED ++ F F+ P H+GHVPTTRTDK++L Sbjct: 462 HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 521 Query: 2676 QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855 + + +RAR LADRL + + +L PCN+G+HWILTVI+ K+ V++ DPL RIRD+ Sbjct: 522 DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 581 Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026 W+ VV +A+K+ +A G+KG+ K WE+++AP QPD QCGF+VM Y++ ++ + Sbjct: 582 WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 641 Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131 +K S++ +F++ +Y +A ID VR EWA + ++ Sbjct: 642 DDKDSVQALFQQVEYDKAVIDLVRSEWADILSSYI 676 >ref|XP_019150668.1| PREDICTED: uncharacterized protein LOC109147518 [Ipomoea nil] dbj|BAV56708.1| transposase [Ipomoea nil] Length = 677 Score = 370 bits (950), Expect = e-111 Identities = 198/415 (47%), Positives = 267/415 (64%), Gaps = 8/415 (1%) Frame = +3 
Query: 888 MAALRKLKGGHDDVQKGKADVSSDASHEEDGDSMEID-------SRDSQPRVSTRGRTHM 1046 MA RK K + QKG+ +V D H +G +++D S+++Q STRGRT M Sbjct: 1 MAGRRKKKIVQQE-QKGQ-EVHGDEEH--NGKEVQVDEEENMSGSQETQSTRSTRGRTQM 56 Query: 1047 YRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKEL 1226 ++LA+Q+A+G+K D++ N+LGQ +G AAELQSYIGVLARE VK+ +K+WKHVP+D+K+ Sbjct: 57 HKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVKLNFKTWKHVPQDIKDK 116 Query: 1227 IWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYGISQ 1406 IW++VNL + V I+KK CLSSA KWRQYKT LT F+W GYGI Sbjct: 117 IWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLNDEENLHKPPPGYGIMG 176 Query: 1407 EDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEINRA 1586 ++W+ FVISRMSE F KLSE+QK RR N+YPHRLAR+GYA LA EI +ELCDD E+NRA Sbjct: 177 DEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLASEISTELCDDDEVNRA 236 Query: 1587 ILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHAGR 1766 ILWKKGR +K GEIEGD LK KID YI+QK++G L+++G EDILT+ALE++EH GR Sbjct: 237 ILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPNEDILTQALESKEHGGR 296 Query: 1767 VRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQGDLIKEQNVRLEKLEAI 1946 VR IGGH+ PSTYFR+ G P K ++ + ++++ +LE L Sbjct: 297 VRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQATVEDRVAKLENLVLQ 345 Query: 1947 FIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSAIPLTFKSKNEV 2108 + + +EK SC+ K + K E + + +K+ E L F K +V Sbjct: 346 NVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDDDELQFIDKEDV 400 Score = 244 bits (622), Expect = 4e-65 Identities = 121/275 (44%), Positives = 182/275 (66%), Gaps = 4/275 (1%) Frame = +3 Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498 EK K K ++ +S MP+ L++ Y K L +G+S+ I LD++VFG E ++V Sbjct: 402 EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 461 Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675 D+T FC+L IS I VYIW+LYKKM ED ++ F F+ P H+GHVPTTRTDK++L Sbjct: 462 HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 521 Query: 2676 
QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855 + + +RAR LADRL + + +L PCN+G+HWILTVI+ K+ V++ DPL RIRD+ Sbjct: 522 DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 581 Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026 W+ VV +A+K+ +A G+KG+ K WE+++AP QPD QCGF+VM Y++ ++ + Sbjct: 582 WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 641 Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131 +K S++ +F++ +Y +A ID VR EWA I ++ Sbjct: 642 DDKDSVQALFQQVEYDKAVIDLVRSEWADIISSYI 676 >ref|XP_024185677.1| uncharacterized protein LOC112190477 [Rosa chinensis] gb|PRQ48579.1| putative Ulp1 protease family catalytic domain, putative transposase, Ptta/En/Spm, plant [Rosa chinensis] Length = 725 Score = 366 bits (940), Expect = e-109 Identities = 236/723 (32%), Positives = 373/723 (51%), Gaps = 22/723 (3%) Frame = +3 Query: 1029 RGRTHMYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVP 1208 RGRT M R+ + RG K+ + N G GK AAE+ SYIGV+ R V I +SW V Sbjct: 29 RGRTSMERIVNRALRGKKSVVEFNPKGVPFGKAAAEMASYIGVIVRTTVPIIVESWPKVE 88 Query: 1209 KDVKELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXX 1388 KD+K IW+SV + + + P +K LSSA +KWRQ+K+ LT ++V Sbjct: 89 KDLKNEIWKSVEMAFVLAPRCRKMVLSSAANKWRQFKSELTTKYVLPYKDQPDALKDPPE 148 Query: 1389 GYG-ISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCD 1565 Y I Q+DW FV SR++ F KL EQKERR HR++R+GYA L E++ + + Sbjct: 149 EYDFIKQQDWEQFVKSRLTTDFQKLHMEQKERRGKLQNAHRMSRKGYAGLEAELKKTMNE 208 Query: 1566 DAEINRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALE 1745 D E++ A+LWKKGR +KNG I + + E A++D + + + +++D+ + L Sbjct: 209 D-ELDLAVLWKKGREDKNGNISHETVGEQAAEMDTLMNNEGDISNSNSRSSDDVRSMGLG 267 Query: 1746 NQEHAGRVRGIGGHITPSTYFRVLIGKKPVD---RRAEQRNELMEAKKLIAE---QGDLI 1907 EH+ RVR G + P+ L + +D + EQ+ EAK + E GD Sbjct: 268 TPEHSSRVRSAGECLMPNVSPPQLERESVLDEVRKMIEQQRLWFEAKISLLEAKISGDCP 327 Query: 1908 KEQNVRLEKLEAIFIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSAIPL 2084 L A +K CS K + N+ D F + R+ ++G + L Sbjct: 328 
ATSITLPTPLLA--------KPSKKGRCSGKTNVEDNEIDSEAFSFVGRREFMKGKSCKL 379 Query: 2085 TFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANNQAMTESTPLPVKIPMTRDCLD 2264 S N V ++GTI+ D ++ + P+ + +A + A+ E LP+ + + Sbjct: 380 AVGSINNVVSHGTIIEMDVANHKVHGVPLGEGNIRVAIDNALDEQALLPIPVTGELATVG 439 Query: 2265 DVVGNHVDSPTHLIKLQNEKP---STMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQD 2435 VG+HV P HL+KL NE+ S++K D N+ + +P+ L + Y + + D Sbjct: 440 QAVGSHVAWPKHLVKLMNEEERGNSSIKPRDLPNQDVI----LPKSLKLLYRYAERAMTD 495 Query: 2436 GKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYSIIVYIWHLYKKMKEDFVDNFL 2615 G+ IS+ +++ +FG K +++ D+ F E++ I I VY+ HLY +K+ + N + Sbjct: 496 GEPISVFMEEAIFGIAKTLNIFKEDVMQFMEMKEIPPRCITVYMRHLYDMLKQSNMANMV 555 Query: 2616 -FVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVI 2792 +DP I V +D R++ LA RL S +Q++L P N GYHW+LT+I Sbjct: 556 GLMDPSSIS-VGEGNSDH---------RSQVLATRLQQGSADQILLVPYNSGYHWMLTII 605 Query: 2793 DPYKETV--------HVLDPLGPRIRDETWRDVVNLALKLFNADKGRKGKKKPQWEVIRA 2948 KE + +DPL +R+E W+ VVN ++ FN + GR +K+P W+V+ Sbjct: 606 SEDKEVCYFMDPLQRYFMDPLRRSMREEEWKYVVNNGIRQFNIETGRGFRKQPLWKVLMG 665 Query: 2949 PIQPDEKQCGFFVMRYMREIL--HECINTNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQ 3122 P QP +CG++VMRYM+EI+ H+ K R K YT+ ++DEVR EW + Sbjct: 666 PKQPSNMECGYYVMRYMKEIIEGHDLSFATKWDGR---KLNAYTQTELDEVRCEWTDFVS 722 Query: 3123 DHM 3131 +++ Sbjct: 723 NYV 725 >ref|XP_019177120.1| PREDICTED: uncharacterized protein LOC109172424 [Ipomoea nil] Length = 786 Score = 365 bits (937), Expect = e-108 Identities = 185/371 (49%), Positives = 247/371 (66%), Gaps = 1/371 (0%) Frame = +3 Query: 999 SRDSQPRVSTRGRTHMYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVK 1178 S+++Q STRGRT M++LA+Q+A+G+K D++ N+LGQ +G AAELQSYIGVLARE VK Sbjct: 150 SQETQSTRSTRGRTQMHKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVK 209 Query: 1179 ITYKSWKHVPKDVKELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXX 1358 + +K+WKHVP+D+K+ IW++VNL + V I+KK CLSSA KWRQYKT LT F+W Sbjct: 210 LNFKTWKHVPQDIKDKIWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLN 269 Query: 1359 
XXXXXXXXXXGYGISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALA 1538 GYGI ++W+ FVISRMSE F KLSE+QK RR N+YPHRLAR+GYA LA Sbjct: 270 DEENLHKPPPGYGIMGDEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLA 329 Query: 1539 EEIESELCDDAEINRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGAT 1718 EI +ELCDD E+NRAILWKKGR +K GEIEGD LK KID YI+QK++G L+++G Sbjct: 330 SEISTELCDDDEVNRAILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPN 389 Query: 1719 EDILTKALENQEHAGRVRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQG 1898 EDILT+ALE++EH GRVR IGGH+ PSTYFR+ G P K ++ + Sbjct: 390 EDILTQALESKEHGGRVRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQ 438 Query: 1899 DLIKEQNVRLEKLEAIFIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSA 2075 ++++ +LE L + + +EK SC+ K + K E + + +K+ E Sbjct: 439 ATVEDRVAKLENLVLQNVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDD 498 Query: 2076 IPLTFKSKNEV 2108 L F K +V Sbjct: 499 DELQFIDKEDV 509 Score = 244 bits (622), Expect = 3e-64 Identities = 121/275 (44%), Positives = 182/275 (66%), Gaps = 4/275 (1%) Frame = +3 Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498 EK K K ++ +S MP+ L++ Y K L +G+S+ I LD++VFG E ++V Sbjct: 511 EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 570 Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675 D+T FC+L IS I VYIW+LYKKM ED ++ F F+ P H+GHVPTTRTDK++L Sbjct: 571 HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 630 Query: 2676 QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855 + + +RAR LADRL + + +L PCN+G+HWILTVI+ K+ V++ DPL RIRD+ Sbjct: 631 DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 690 Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026 W+ VV +A+K+ +A G+KG+ K WE+++AP QPD QCGF+VM Y++ ++ + Sbjct: 691 WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 750 Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131 +K S++ +F++ +Y +A ID VR EWA I ++ Sbjct: 751 DDKDSVQALFQQVEYDKAVIDLVRSEWADIISSYI 785 >ref|XP_011102064.1| uncharacterized protein 
LOC105180111 isoform X3 [Sesamum indicum] Length = 1254 Score = 366 bits (939), Expect = e-104 Identities = 202/301 (67%), Positives = 232/301 (77%), Gaps = 10/301 (3%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+ TLSQCL+LEP+V+ D Sbjct: 23 QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335 K RSGDL+RVLG S FG AHLKNSS G AVEELKRLRASVAD Sbjct: 83 PKNERSGDLKRVLGSSVGSSSEDNSFGAAHLKNSSPG---------AVEELKRLRASVAD 133 Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512 TC KAS RA Y E++SSKKQQQ N+M+TN+RS STLKIGSL+HR TE Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192 Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692 FGS+K DDRPK+VG++ RLRTSVAETRA+C SG LRQPL V+KERDLLKD+NAD D+V+ Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADHDMVE 252 Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872 K RR PAGGE +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312 Query: 873 R 875 R Sbjct: 313 R 313 >ref|XP_019174725.1| PREDICTED: uncharacterized protein LOC109170159 [Ipomoea nil] Length = 1211 Score = 365 bits (937), Expect = e-104 Identities = 185/371 (49%), Positives = 247/371 (66%), Gaps = 1/371 (0%) Frame = +3 Query: 999 SRDSQPRVSTRGRTHMYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVK 1178 S+++Q STRGRT M++LA+Q+A+G+K D++ N+LGQ +G AAELQSYIGVLARE VK Sbjct: 575 SQETQSTRSTRGRTQMHKLAMQRAQGLKKDVQFNELGQPIGDSAAELQSYIGVLAREKVK 634 Query: 1179 ITYKSWKHVPKDVKELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXX 1358 + +K+WKHVP+D+K+ IW++VNL + V I+KK CLSSA KWRQYKT LT F+W Sbjct: 635 LNFKTWKHVPQDIKDKIWDAVNLSFRVPAIFKKPCLSSANDKWRQYKTQLTNNFIWKRLN 694 Query: 1359 XXXXXXXXXXGYGISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALA 1538 GYGI ++W+ FVISRMSE F KLSE+QK RR 
N+YPHRLAR+GYA LA Sbjct: 695 DEENLHKPPPGYGIMGDEWSQFVISRMSEDFKKLSEQQKVRRKQNLYPHRLARKGYARLA 754 Query: 1539 EEIESELCDDAEINRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGAT 1718 EI +ELCDD E+NRAILWKKGR +K GEIEGD LK KID YI+QK++G L+++G Sbjct: 755 SEISTELCDDDEVNRAILWKKGRTSKQGEIEGDVLKTKFTKIDEYIQQKQDGLLQLQGPN 814 Query: 1719 EDILTKALENQEHAGRVRGIGGHITPSTYFRVLIGKKPVDRRAEQRNELMEAKKLIAEQG 1898 EDILT+ALE++EH GRVR IGGH+ PSTYFR+ G P K ++ + Sbjct: 815 EDILTQALESKEHGGRVRAIGGHVNPSTYFRLGKGMLP-----------NHEKNVLLRRQ 863 Query: 1899 DLIKEQNVRLEKLEAIFIKKYDTDNDEKASCSVKPKHQSNK-DEADFVILDRKVALEGSA 2075 ++++ +LE L + + +EK SC+ K + K E + + +K+ E Sbjct: 864 ATVEDRVAKLENLVLQNVAFKSSPIEEKGSCTAKDAKGAMKLSEEEIGFMKQKLDFEDDD 923 Query: 2076 IPLTFKSKNEV 2108 L F K +V Sbjct: 924 DELQFIDKEDV 934 Score = 244 bits (622), Expect = 1e-62 Identities = 121/275 (44%), Positives = 182/275 (66%), Gaps = 4/275 (1%) Frame = +3 Query: 2319 EKPSTMKVGDGKNKSKVVNSDMPRDLYMFYGCCKNVLQDGKSISITLDDDVFGTEKVIHV 2498 EK K K ++ +S MP+ L++ Y K L +G+S+ I LD++VFG E ++V Sbjct: 936 EKQCKKKPSKEVKKLELNSSSMPKSLWLLYCYYKRALGNGESLKIVLDENVFGEECTLYV 995 Query: 2499 DLSDITHFCELESISCYSIIVYIWHLYKKMKEDF-VDNFLFVDPYHIGHVPTTRTDKSYL 2675 D+T FC+L IS I VYIW+LYKKM ED ++ F F+ P H+GHVPTTRTDK++L Sbjct: 996 HDEDVTPFCQLMPISYTCIAVYIWYLYKKMMEDNKLEKFRFMQPCHVGHVPTTRTDKNFL 1055 Query: 2676 QQLMVARARFLADRLSNASINQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDET 2855 + + +RAR LADRL + + +L PCN+G+HWILTVI+ K+ V++ DPL RIRD+ Sbjct: 1056 DKQLESRARALADRLIDNPSSASLLVPCNVGFHWILTVINVSKDIVYLWDPLSHRIRDDD 1115 Query: 2856 WRDVVNLALKLFNA--DKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECIN- 3026 W+ VV +A+K+ +A G+KG+ K WE+++AP QPD QCGF+VM Y++ ++ + Sbjct: 1116 WKHVVEMAIKMVHAASGNGKKGRSKTAWEIVKAPRQPDSNQCGFYVMAYLKTLIENMPDI 1175 Query: 3027 TNKASLRTIFKKGDYTRAQIDEVRLEWAKCIQDHM 3131 +K S++ +F++ +Y +A ID VR EWA I ++ Sbjct: 1176 DDKDSVQALFQQVEYDKAVIDLVRSEWADIISSYI 1210 >ref|XP_011102062.1| uncharacterized protein LOC105180111 isoform X1 [Sesamum indicum] Length = 1301 Score = 366 bits (939), Expect = e-104 
Identities = 202/301 (67%), Positives = 232/301 (77%), Gaps = 10/301 (3%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+ TLSQCL+LEP+V+ D Sbjct: 23 QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335 K RSGDL+RVLG S FG AHLKNSS G AVEELKRLRASVAD Sbjct: 83 PKNERSGDLKRVLGSSVGSSSEDNSFGAAHLKNSSPG---------AVEELKRLRASVAD 133 Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512 TC KAS RA Y E++SSKKQQQ N+M+TN+RS STLKIGSL+HR TE Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192 Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692 FGS+K DDRPK+VG++ RLRTSVAETRA+C SG LRQPL V+KERDLLKD+NAD D+V+ Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADHDMVE 252 Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872 K RR PAGGE +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312 Query: 873 R 875 R Sbjct: 313 R 313 >ref|XP_011084232.1| uncharacterized protein LOC105166542 isoform X3 [Sesamum indicum] Length = 1254 Score = 365 bits (937), Expect = e-104 Identities = 201/301 (66%), Positives = 232/301 (77%), Gaps = 10/301 (3%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+ TLSQCL+LEP+V+ D Sbjct: 23 QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335 K RSGDL+RVLG S FG AH+KNSS G AVEELKRLRASVAD Sbjct: 83 PKNERSGDLKRVLGSSVGSSSEDNSFGAAHMKNSSPG---------AVEELKRLRASVAD 133 Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512 TC KAS RA Y E++SSKKQQQ N+M+TN+RS STLKIGSL+HR TE Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192 Query: 513 
FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692 FGS+K DDRPK+VG++ RLRTSVAETRA+C SG LRQPL V+KERDLLKD+NAD D+V+ Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADPDMVE 252 Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872 K RR PAGGE +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312 Query: 873 R 875 R Sbjct: 313 R 313 >ref|XP_011084230.1| uncharacterized protein LOC105166542 isoform X1 [Sesamum indicum] Length = 1301 Score = 365 bits (937), Expect = e-104 Identities = 201/301 (66%), Positives = 232/301 (77%), Gaps = 10/301 (3%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+ TLSQCL+LEP+V+ D Sbjct: 23 QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335 K RSGDL+RVLG S FG AH+KNSS G AVEELKRLRASVAD Sbjct: 83 PKNERSGDLKRVLGSSVGSSSEDNSFGAAHMKNSSPG---------AVEELKRLRASVAD 133 Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512 TC KAS RA Y E++SSKKQQQ N+M+TN+RS STLKIGSL+HR TE Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192 Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692 FGS+K DDRPK+VG++ RLRTSVAETRA+C SG LRQPL V+KERDLLKD+NAD D+V+ Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADPDMVE 252 Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSF 872 K RR PAGGE +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS + F Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDSRYGF 312 Query: 873 R 875 R Sbjct: 313 R 313 >ref|XP_011102063.1| uncharacterized protein LOC105180111 isoform X2 [Sesamum indicum] Length = 1297 Score = 361 bits (927), Expect = e-102 Identities = 200/296 (67%), Positives = 229/296 (77%), Gaps = 10/296 (3%) Frame = +3 Query: 3 
QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+ TLSQCL+LEP+V+ D Sbjct: 23 QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335 K RSGDL+RVLG S FG AHLKNSS G AVEELKRLRASVAD Sbjct: 83 PKNERSGDLKRVLGSSVGSSSEDNSFGAAHLKNSSPG---------AVEELKRLRASVAD 133 Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512 TC KAS RA Y E++SSKKQQQ N+M+TN+RS STLKIGSL+HR TE Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192 Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692 FGS+K DDRPK+VG++ RLRTSVAETRA+C SG LRQPL V+KERDLLKD+NAD D+V+ Sbjct: 193 FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADHDMVE 252 Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDS 860 K RR PAGGE +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDS 308 >ref|XP_011084231.1| uncharacterized protein LOC105166542 isoform X2 [Sesamum indicum] Length = 1297 Score = 360 bits (925), Expect = e-102 Identities = 199/296 (67%), Positives = 229/296 (77%), Gaps = 10/296 (3%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGY+VPTL+RSTSFR+ AD+RNFASGK+NSR SAT SG+ TLSQCL+LEP+V+ D Sbjct: 23 QNGQRGYAVPTLDRSTSFRDGADSRNFASGKANSRASATPSGEVTTLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFS---------FGDAHLKNSSSGTVDELKRLRAVEELKRLRASVAD 335 K RSGDL+RVLG S FG AH+KNSS G AVEELKRLRASVAD Sbjct: 83 PKNERSGDLKRVLGSSVGSSSEDNSFGAAHMKNSSPG---------AVEELKRLRASVAD 133 Query: 336 TCVKASDRAXXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTE 512 TC KAS RA Y E++SSKKQQQ N+M+TN+RS STLKIGSL+HR TE Sbjct: 134 TCFKASGRAKKLDDHLNKLNKYCEAVSSKKQQQRNDMITNERSG-STLKIGSLVHRNPTE 192 Query: 513 FGSKKLDDRPKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVK 692 FGS+K DDRPK+VG++ RLRTSVAETRA+C SG LRQPL V+KERDLLKD+NAD D+V+ Sbjct: 193 
FGSQKFDDRPKSVGLNKRLRTSVAETRAECRNSGALRQPLMVSKERDLLKDTNADPDMVE 252 Query: 693 GKRRRSPAGGESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDS 860 K RR PAGGE +KKMK K SVGAV SRSV++DGELKRTMHHKLP ESSLQS DS Sbjct: 253 EKIRRLPAGGEGWDKKMKRKRSVGAVFSRSVDSDGELKRTMHHKLPSESSLQSGDS 308 >gb|PRQ17143.1| putative Ulp1 protease family catalytic domain, putative transposase, Ptta/En/Spm, plant [Rosa chinensis] Length = 775 Score = 342 bits (877), Expect = 2e-99 Identities = 240/789 (30%), Positives = 394/789 (49%), Gaps = 41/789 (5%) Frame = +3 Query: 879 LDSMAALRKLKGGHDDVQKGKADVS---SDASHEEDGDSMEIDSRDSQPRVSTRGRTH-- 1043 + S +RK G +K + S +D EE +S+ ++ S V TRG+ + Sbjct: 1 MGSKKGIRKSPRGKKLKRKADLETSHPETDEVLEEKEESVSANTITSTESVKTRGKRNVV 60 Query: 1044 -MYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVK 1220 +Y++ ++KA G K + + G G+ LQSYIG+LAR V I SW V D+K Sbjct: 61 ALYKVLVKKALGKKFKVSYTETGNPNGRIRHTLQSYIGMLARTKVPINIVSWPEVDGDLK 120 Query: 1221 ELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYG- 1397 + +W V + V P KK L+SAG+KWR +KT LTR++V Y Sbjct: 121 DKLWLDVQETFKVAPESKKLVLTSAGTKWRAFKTMLTRKYVLPYLGKKKKLRKPPSQYNF 180 Query: 1398 ISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEI 1577 + +E W FV R +E F++L EQ ER Y HRL+R+GY L EE+ L + I Sbjct: 181 VGREPWKDFVKERTTEKFLQLHNEQSERVKKRKYHHRLSRKGYIGLEEELRKTLPEGEVI 240 Query: 1578 NRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEH 1757 +RAI+WKK R K+G+I+ + + KID+ +E+K +G+L+I G D+L++ALE EH Sbjct: 241 DRAIMWKKARQRKDGDID-EEARGVATKIDDLLEKKSKGELEISG-NSDVLSQALETPEH 298 Query: 1758 AGRVRGIGGHITPSTYF------RVLIGKKPVDRRAEQRN-ELMEAKKLIAEQGDLIKE- 1913 +GRVRG+GG + PSTYF R+ I K + R +R+ EL E KK++A Q +E Sbjct: 299 SGRVRGVGGFVNPSTYFKMPKQKRIRITKAELLARDRERDRELEETKKMLAAQQARTEEL 358 Query: 1914 QNVRLEKLEAIFIKK---------YDTDNDEKASCSVKPKHQSNKDEADFVILDRKVALE 2066 + ++ +LEA+ K + + + S K Q + ++ + KV E Sbjct: 359 LHKKIAQLEALITGKTPYTSPLNVHVVGENTISPISDKGSFQDIRRNTSNILDEPKVKQE 418 Query: 2067 -------------GSAIPLTFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANNQA 2207 G L + + + 
A+GT+ ++ + P+ C+ ++ + A Sbjct: 419 VDDCEVVPPPTEMGGTCELAVDTISNIVAFGTVFDEEDVSRVIHGVPMKEGCVRVSVDGA 478 Query: 2208 MTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLIKLQNEKPSTMKVGDGKN--KSKVVNSD 2381 + E LP + + + VG+HV P L+ + K K+ K +N Sbjct: 479 IQEEARLPFPVGDEMELVGQAVGSHVAWPEELVIRRVNKKKKRKMDFVKQLFDKAELNPF 538 Query: 2382 MPRDLYMFYGCCKNVL-QDGKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYSII 2558 +P+ + Y K ++ Q +SI LDD VFG +K + + ++ E+ I I Sbjct: 539 VPKRCKLLYKHAKTIMSQTNESIRTMLDDSVFGVQKQLFILTENVIDLLEMNKIGQGVIA 598 Query: 2559 VYIWHLYKKMKE-DFVDNFLFVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNASI 2735 Y+ +L++ ++E D +D F F+DP T ++S +L +RL Sbjct: 599 AYMANLHETLRERDELDTFGFIDP-----AATYMCERSEF-------VPYLVNRLKEGKG 646 Query: 2736 NQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDETWRDVVNLALKLFNADKGRKG 2915 +++ L N G HWILT+I +++ ++++DPL + + W + V A+K +NA+KGR Sbjct: 647 DRIFLMAYNPGEHWILTII--WEDEIYIVDPLPKPVHYKPWENAVINAVKTYNAEKGRVT 704 Query: 2916 KKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECINTNKASLRTIFKKGDYTRAQIDEV 3095 K + AP QP +CG++VMRYM++I+++ + +KG YT+ Q+DEV Sbjct: 705 KVPKLKLLPGAPKQPGGVECGYYVMRYMKDIINDDTLSFSTKWAVKDRKG-YTQQQLDEV 763 Query: 3096 RLEWAKCIQ 3122 R+E A +Q Sbjct: 764 RIEVADYLQ 772 >gb|PRQ17594.1| putative Ulp1 protease family catalytic domain, putative transposase, Ptta/En/Spm, plant [Rosa chinensis] Length = 775 Score = 342 bits (876), Expect = 2e-99 Identities = 239/789 (30%), Positives = 396/789 (50%), Gaps = 41/789 (5%) Frame = +3 Query: 879 LDSMAALRKLKGGHDDVQKGKADVS---SDASHEEDGDSMEIDSRDSQPRVSTRGRTH-- 1043 + S +RK G +K + S +D EE + + ++ S V TRG+ + Sbjct: 1 MGSKKGIRKSPRGKKLKRKADLETSQPETDEVLEEKEEFVSANTITSTESVKTRGKRNVV 60 Query: 1044 -MYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVK 1220 +Y++ ++KA G K + + G G+ LQSYIG+LAR V I SW V D+K Sbjct: 61 ALYKVLVKKALGKKFKVSYTETGNPNGRIRHTLQSYIGMLARTKVPINIVSWPEVDGDLK 120 Query: 1221 ELIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYG- 1397 + +W V + V P KK L+SAG+KWR +KT LTR++V Y Sbjct: 121 DKLWLDVQETFKVAPESKKLVLTSAGTKWRAFKTMLTRKYVLPYLGKKKKLRKPPSQYNF 180 Query: 1398 
ISQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEI 1577 + +E W FV R +E F++L EQ ER Y HRL+R+GY L EE+ L + I Sbjct: 181 VGREPWKDFVKERTTEKFLQLHNEQSERVKKRKYHHRLSRKGYIGLEEELRKTLPEGEVI 240 Query: 1578 NRAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEH 1757 +RAI+WKK R K+G+I+ + + KID+ +E+K +G+L+I G++ D+L++ALE EH Sbjct: 241 DRAIMWKKARQRKDGDID-EEARGVATKIDDLLEKKSKGELEISGSS-DVLSQALETPEH 298 Query: 1758 AGRVRGIGGHITPSTYF------RVLIGKKPVDRRAEQRN-ELMEAKKLIAEQGDLIKEQ 1916 +GRVRG+GG + PSTYF R+ I K + R +R+ EL E KK++A Q +E+ Sbjct: 299 SGRVRGVGGFVNPSTYFKMPKQKRIRITKAELLARDRERDRELEETKKMLAAQQARTEER 358 Query: 1917 -NVRLEKLEAIFIKK---------YDTDNDEKASCSVKPKHQSNKDEADFVILDRKVALE 2066 + ++ +LEA+ K + + + S K Q + ++ + KV E Sbjct: 359 LHKKIAQLEALITGKTPYTSPLNVHVVGENTISPISDKGSFQDIRRNTSNILDEPKVKQE 418 Query: 2067 -------------GSAIPLTFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANNQA 2207 G L + + + A+GT+ ++ + P+ C+ ++ + A Sbjct: 419 VDDCEVVPPPTEMGGTCELAVDTISNIVAFGTVFDEEDVSRVIHGVPMKEGCVRVSVDGA 478 Query: 2208 MTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLIKLQNEKPSTMKVGDGKN--KSKVVNSD 2381 + E LP + + + VG+HV P L+ + K K+ K +N Sbjct: 479 IQEEARLPFPVGDEMELVGQAVGSHVAWPEELVIRRVNKKKKRKMDFVKQLFDKAELNPF 538 Query: 2382 MPRDLYMFYGCCKNVL-QDGKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYSII 2558 +P+ + Y K ++ Q +SI LDD VFG +K + + ++ E+ I I Sbjct: 539 VPKRCKLLYKHAKTIMSQTNESIRTMLDDSVFGVQKQLFILTENVIDLLEMNKIGQGVIA 598 Query: 2559 VYIWHLYKKMKE-DFVDNFLFVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNASI 2735 Y+ +L++ ++E D +D F F+DP T ++S +L +RL Sbjct: 599 AYMANLHETLRERDELDTFGFIDP-----AATYMCERSEF-------VPYLVNRLKEGKG 646 Query: 2736 NQVVLAPCNIGYHWILTVIDPYKETVHVLDPLGPRIRDETWRDVVNLALKLFNADKGRKG 2915 +++ L N G HWILT+I +++ ++++DPL + + W + V A+K +NA+KGR Sbjct: 647 DRIFLMAYNPGEHWILTII--WEDEIYIVDPLPKPVHYKPWENAVINAVKTYNAEKGRVT 704 Query: 2916 KKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECINTNKASLRTIFKKGDYTRAQIDEV 3095 K + AP QP +CG++VMRYM++I+++ + +KG YT+ Q+DEV Sbjct: 705 KVPKLKLLPGAPKQPGGVECGYYVMRYMKDIINDDTLSFSTKWAVKDRKG-YTQQQLDEV 763 Query: 3096 RLEWAKCIQ 3122 R+E A 
+Q Sbjct: 764 RIEVADYLQ 772 >ref|XP_012837747.1| PREDICTED: uncharacterized protein LOC105958287 isoform X2 [Erythranthe guttata] Length = 1261 Score = 347 bits (889), Expect = 2e-97 Identities = 190/292 (65%), Positives = 224/292 (76%), Gaps = 1/292 (0%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGYS TL+RSTSFRE D++NF SGK+NSRGSA++SGD L+QCL+L+PV + D Sbjct: 23 QNGQRGYSAATLDRSTSFREGTDSKNFTSGKANSRGSASSSGDVTALTQCLMLDPVALCD 82 Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362 K+ RS +L+R+LGFS G +NS S + AVEELKRLRASVADTCVKAS RA Sbjct: 83 LKHPRSNELKRLLGFSVGSGSEENSFSAAHLKNTSPVAVEELKRLRASVADTCVKASGRA 142 Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539 + ES+SSKKQQQ NE+LTN+RSS S LK GSL+HR +EFG++K DDR Sbjct: 143 KKLDDHLSKLNKFVESVSSKKQQQRNEILTNERSSGSNLKSGSLMHRNPSEFGNQKFDDR 202 Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719 PKN G++ RLRTSVAETRA+C +GVLRQ L VTKERDLLKD +ADSDIV+ K RR PAG Sbjct: 203 PKNGGVNKRLRTSVAETRAECRNNGVLRQSLMVTKERDLLKDVSADSDIVEEKIRRLPAG 262 Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875 GE +KKMK K SVGAV SRSV+NDGELKRTMH+KL ESSLQSSDS SFR Sbjct: 263 GEGWDKKMKRKRSVGAVFSRSVDNDGELKRTMHNKLTNESSLQSSDSNLSFR 314 >ref|XP_012837746.1| PREDICTED: uncharacterized protein LOC105958287 isoform X1 [Erythranthe guttata] Length = 1262 Score = 347 bits (889), Expect = 2e-97 Identities = 190/292 (65%), Positives = 224/292 (76%), Gaps = 1/292 (0%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGYS TL+RSTSFRE D++NF SGK+NSRGSA++SGD L+QCL+L+PV + D Sbjct: 23 QNGQRGYSAATLDRSTSFREGTDSKNFTSGKANSRGSASSSGDVTALTQCLMLDPVALCD 82 Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362 K+ RS +L+R+LGFS G +NS S + AVEELKRLRASVADTCVKAS RA Sbjct: 83 LKHPRSNELKRLLGFSVGSGSEENSFSAAHLKNTSPVAVEELKRLRASVADTCVKASGRA 142 Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539 + 
ES+SSKKQQQ NE+LTN+RSS S LK GSL+HR +EFG++K DDR Sbjct: 143 KKLDDHLSKLNKFVESVSSKKQQQRNEILTNERSSGSNLKSGSLMHRNPSEFGNQKFDDR 202 Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719 PKN G++ RLRTSVAETRA+C +GVLRQ L VTKERDLLKD +ADSDIV+ K RR PAG Sbjct: 203 PKNGGVNKRLRTSVAETRAECRNNGVLRQSLMVTKERDLLKDVSADSDIVEEKIRRLPAG 262 Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875 GE +KKMK K SVGAV SRSV+NDGELKRTMH+KL ESSLQSSDS SFR Sbjct: 263 GEGWDKKMKRKRSVGAVFSRSVDNDGELKRTMHNKLTNESSLQSSDSNLSFR 314 >ref|XP_011076941.1| uncharacterized protein LOC105161066 isoform X3 [Sesamum indicum] Length = 1264 Score = 344 bits (883), Expect = 1e-96 Identities = 189/292 (64%), Positives = 219/292 (75%), Gaps = 1/292 (0%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGYS L RS+SFRE +++RN AS K NSRGSAT+SGD +LSQCL+LEP+V+ D Sbjct: 23 QNGQRGYSAQALGRSSSFREVSESRNLASAKLNSRGSATSSGDVPSLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362 KY RSGDLRRVLGFS G + +S AVEELKRLRASVADTCVKAS R Sbjct: 83 PKYLRSGDLRRVLGFSVGSNSEERNSPPV--------AVEELKRLRASVADTCVKASGRV 134 Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539 +FE+M KKQQQ NE+L N+RSS STLKIGS IHR +E S+K +DR Sbjct: 135 KKLDEHLNKLNKFFEAMPYKKQQQRNELLMNERSSGSTLKIGSQIHRNPSELASQKFEDR 194 Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719 PKN G++ RLRTSVAETRA+C +GVLRQPL TKERD+ KD+NADSD+V+ K RR PAG Sbjct: 195 PKN-GLNKRLRTSVAETRAECRNNGVLRQPLMATKERDMPKDNNADSDMVEEKNRRLPAG 253 Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875 GE +KKMK K SVGAV SRSV+NDGE+KRTMHHKL IESSLQSSDS H FR Sbjct: 254 GEGWDKKMKRKRSVGAVFSRSVDNDGEVKRTMHHKLTIESSLQSSDSIHGFR 305 >ref|XP_020548914.1| uncharacterized protein LOC105161066 isoform X2 [Sesamum indicum] Length = 1294 Score = 344 bits (883), Expect = 2e-96 Identities = 189/292 (64%), Positives = 219/292 (75%), Gaps = 1/292 (0%) Frame = +3 Query: 3 
QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGYS L RS+SFRE +++RN AS K NSRGSAT+SGD +LSQCL+LEP+V+ D Sbjct: 23 QNGQRGYSAQALGRSSSFREVSESRNLASAKLNSRGSATSSGDVPSLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362 KY RSGDLRRVLGFS G + +S AVEELKRLRASVADTCVKAS R Sbjct: 83 PKYLRSGDLRRVLGFSVGSNSEERNSPPV--------AVEELKRLRASVADTCVKASGRV 134 Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539 +FE+M KKQQQ NE+L N+RSS STLKIGS IHR +E S+K +DR Sbjct: 135 KKLDEHLNKLNKFFEAMPYKKQQQRNELLMNERSSGSTLKIGSQIHRNPSELASQKFEDR 194 Query: 540 PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719 PKN G++ RLRTSVAETRA+C +GVLRQPL TKERD+ KD+NADSD+V+ K RR PAG Sbjct: 195 PKN-GLNKRLRTSVAETRAECRNNGVLRQPLMATKERDMPKDNNADSDMVEEKNRRLPAG 253 Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875 GE +KKMK K SVGAV SRSV+NDGE+KRTMHHKL IESSLQSSDS H FR Sbjct: 254 GEGWDKKMKRKRSVGAVFSRSVDNDGEVKRTMHHKLTIESSLQSSDSIHGFR 305 >ref|XP_011076937.1| uncharacterized protein LOC105161066 isoform X1 [Sesamum indicum] ref|XP_011076938.1| uncharacterized protein LOC105161066 isoform X1 [Sesamum indicum] ref|XP_011076939.1| uncharacterized protein LOC105161066 isoform X1 [Sesamum indicum] Length = 1297 Score = 344 bits (883), Expect = 2e-96 Identities = 189/292 (64%), Positives = 219/292 (75%), Gaps = 1/292 (0%) Frame = +3 Query: 3 QSGQRGYSVPTLNRSTSFRERADNRNFASGKSNSRGSATTSGDGITLSQCLLLEPVVIDD 182 Q+GQRGYS L RS+SFRE +++RN AS K NSRGSAT+SGD +LSQCL+LEP+V+ D Sbjct: 23 QNGQRGYSAQALGRSSSFREVSESRNLASAKLNSRGSATSSGDVPSLSQCLMLEPIVMGD 82 Query: 183 KKYARSGDLRRVLGFSFGDAHLKNSSSGTVDELKRLRAVEELKRLRASVADTCVKASDRA 362 KY RSGDLRRVLGFS G + +S AVEELKRLRASVADTCVKAS R Sbjct: 83 PKYLRSGDLRRVLGFSVGSNSEERNSPPV--------AVEELKRLRASVADTCVKASGRV 134 Query: 363 XXXXXXXXXXXXYFESMSSKKQQQ-NEMLTNDRSSASTLKIGSLIHRITTEFGSKKLDDR 539 +FE+M KKQQQ NE+L N+RSS STLKIGS IHR +E S+K +DR Sbjct: 135 KKLDEHLNKLNKFFEAMPYKKQQQRNELLMNERSSGSTLKIGSQIHRNPSELASQKFEDR 194 Query: 540 
PKNVGMSNRLRTSVAETRADCWTSGVLRQPLPVTKERDLLKDSNADSDIVKGKRRRSPAG 719 PKN G++ RLRTSVAETRA+C +GVLRQPL TKERD+ KD+NADSD+V+ K RR PAG Sbjct: 195 PKN-GLNKRLRTSVAETRAECRNNGVLRQPLMATKERDMPKDNNADSDMVEEKNRRLPAG 253 Query: 720 GESREKKMKMKHSVGAVLSRSVNNDGELKRTMHHKLPIESSLQSSDSTHSFR 875 GE +KKMK K SVGAV SRSV+NDGE+KRTMHHKL IESSLQSSDS H FR Sbjct: 254 GEGWDKKMKRKRSVGAVFSRSVDNDGEVKRTMHHKLTIESSLQSSDSIHGFR 305 >gb|PRQ20360.1| putative Ulp1 protease family catalytic domain, putative transposase, Ptta/En/Spm, plant [Rosa chinensis] Length = 724 Score = 324 bits (830), Expect = 2e-93 Identities = 230/741 (31%), Positives = 363/741 (48%), Gaps = 48/741 (6%) Frame = +3 Query: 1044 MYRLAIQKARGVKTDIRLNKLGQAVGKEAAELQSYIGVLARENVKITYKSWKHVPKDVKE 1223 MY++ ++KA G K + G G LQSYIG+LAR V I SW +V D+K Sbjct: 1 MYKVLVKKALGKKFKVTYTDTGNLNGSIRHTLQSYIGMLARTKVPINIVSWPNVDGDLKN 60 Query: 1224 LIWESVNLYYNVDPIWKKACLSSAGSKWRQYKTNLTRQFVWXXXXXXXXXXXXXXGYG-I 1400 +W V + V P KK L+SAG+KWR +KT LTR++V Y + Sbjct: 61 KLWLDVKDTFKVAPESKKLVLTSAGTKWRAFKTMLTRKYVLPYLGKKKKLRKPPSQYAFV 120 Query: 1401 SQEDWNSFVISRMSESFMKLSEEQKERRNANIYPHRLARRGYAALAEEIESELCDDAEIN 1580 ++ W FV R +E +++L +Q ER Y HRL+R+GY L EE++ L + I+ Sbjct: 121 GRQPWRQFVKERTTEKWLELHNKQSERVRKRKYHHRLSRKGYIGLEEELKKTLPEGEVID 180 Query: 1581 RAILWKKGRANKNGEIEGDHLKETVAKIDNYIEQKKEGKLKIEGATEDILTKALENQEHA 1760 AI+WKK R K+G+ + + V KID+ +E+K +G+L+I G++ D+L++ALE EH+ Sbjct: 181 CAIMWKKARQRKDGD-RDEKARAVVTKIDDLLEKKSKGELEISGSS-DVLSQALETLEHS 238 Query: 1761 GRVRGIGGHITPSTYF------RVLIGKKPVDRRAEQRN-ELMEAKKLI-AEQGDLIKEQ 1916 GRVRG+GG I PSTYF R+ I K + R +R+ EL E KK++ A+Q + Sbjct: 239 GRVRGVGGFINPSTYFKLPKLKRIRITKADLLARDRERDRELEETKKMLTAQQAKAEELL 298 Query: 1917 NVRLEKLEAIFIKKYDTDNDEK------ASCSVKP------------KHQSNKDEA---- 2030 N R+ LE + K T N C + P +N DEA Sbjct: 299 NKRIAALEVMITGK--TPNTPPLNVHVLGDCRISPISDKGSIHDRTLNTSNNLDEAKVKE 356 Query: 2031 ---DFVILDRKVALEGSAIPLTFKSKNEVDAYGTIVHADEPDNFLDDEPIPTNCMYIANN 2201 D ++ + G L + N + A+GT+ ++ + + P+ C+ ++ + Sbjct: 357 
EVQDCEVVPPPTEM-GGTCELAVDTINNIVAFGTVFEDEDVNRMIHGVPLKEGCVRVSVD 415 Query: 2202 QAMTESTPLPVKIPMTRDCLDDVVGNHVDSPTHLIKLQNEKPSTMKVGDGKN--KSKVVN 2375 A+ LP + + +G+HV P L+ + K K+ K +N Sbjct: 416 GAIQAEARLPFLVEGEMGLVGQAIGSHVAWPEELVIRRVNKKKKRKMDFVKQLFDQAELN 475 Query: 2376 SDMPRDLYMFYGCCKNVL-QDGKSISITLDDDVFGTEKVIHVDLSDITHFCELESISCYS 2552 S +P+ + Y K ++ Q + IS LDD VFG K + + ++T E++ I Sbjct: 476 SFVPKRCKLLYKHAKTIMSQTSELISTVLDDKVFGLHKELFILTENVTDLLEMKKIGQGV 535 Query: 2553 IIVYIWHLYKKMKE-DFVDNFLFVDPYHIGHVPTTRTDKSYLQQLMVARARFLADRLSNA 2729 I Y+ HL++ + E D +D F F+DP T ++S +L DRL Sbjct: 536 IAAYMAHLHETLTERDELDTFTFIDP-----AATYNCERS-------GFGPYLVDRLKEG 583 Query: 2730 SINQVVLAPCNIG----------YHWILTVIDPYKETVHVLDPLGPRIRDETWRDVVNLA 2879 +++ P N G HWILT+I +++ V++LDPL + W V A Sbjct: 584 KADRIFFMPYNPGCIMWAMKYYKEHWILTII--WEDEVYILDPLPNPVHYTAWETAVMNA 641 Query: 2880 LKLFNADKGRKGKKKPQWEVIRAPIQPDEKQCGFFVMRYMREILHECINTNKASLRTIFK 3059 +K +NA+KGR K + P QP +CG++VMRYM++I+++ + + Sbjct: 642 VKSYNAEKGRANKVPKLRLLPGVPKQPGGIECGYYVMRYMKDIINDDTLSFSTKWAVKTR 701 Query: 3060 KGDYTRAQIDEVRLEWAKCIQ 3122 KG YT+ Q+DEVR+E A +Q Sbjct: 702 KG-YTQQQLDEVRMEVANYLQ 721