BLASTX nr result
ID: Rehmannia29_contig00011116
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia29_contig00011116 (1837 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PIN08153.1| hypothetical protein CDL12_19269 [Handroanthus im... 547 0.0 ref|XP_011090943.1| uncharacterized protein LOC105171501 [Sesamu... 528 e-175 ref|XP_011071455.1| uncharacterized protein LOC105156897 [Sesamu... 523 e-173 ref|XP_022865987.1| uncharacterized protein LOC111385805 [Olea e... 452 e-145 gb|KZV24014.1| hypothetical protein F511_08975 [Dorcoceras hygro... 427 e-136 emb|CDP19542.1| unnamed protein product [Coffea canephora] 371 e-115 gb|EYU43933.1| hypothetical protein MIMGU_mgv1a018763mg, partial... 363 e-113 ref|XP_019235744.1| PREDICTED: uncharacterized protein LOC109216... 366 e-112 emb|CBI26064.3| unnamed protein product, partial [Vitis vinifera] 366 e-112 ref|XP_012859056.1| PREDICTED: uncharacterized protein LOC105978... 360 e-111 ref|XP_009757778.1| PREDICTED: uncharacterized protein LOC104210... 361 e-110 ref|XP_009619501.1| PREDICTED: uncharacterized protein LOC104111... 358 e-110 ref|XP_024022662.1| uncharacterized protein LOC21406306 [Morus n... 351 e-109 gb|EOY06483.1| Uncharacterized protein TCM_021187 isoform 3 [The... 358 e-109 gb|EOY06482.1| Uncharacterized protein TCM_021187 isoform 2 [The... 358 e-109 gb|EOY06481.1| Uncharacterized protein TCM_021187 isoform 1 [The... 358 e-109 ref|XP_016542314.1| PREDICTED: uncharacterized protein LOC107842... 353 e-108 ref|XP_021291361.1| uncharacterized protein LOC110421952 isoform... 353 e-107 ref|XP_021291360.1| uncharacterized protein LOC110421952 isoform... 353 e-107 ref|XP_007035557.2| PREDICTED: uncharacterized protein LOC186034... 353 e-107 >gb|PIN08153.1| hypothetical protein CDL12_19269 [Handroanthus impetiginosus] Length = 844 Score = 547 bits (1409), Expect = 0.0 Identities = 325/615 (52%), Positives = 385/615 (62%), Gaps = 78/615 (12%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + VSK F DEKMS VDYVSSLKA+VGTN+LVE +GIG GK DLT MALEP + Sbjct: 238 EECKSALLEVSKTFGDEKMSLVDYVSSLKAMVGTNVLVEVVGIGKGKQDLTGMALEPPKS 297 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 Q IPVR EIP+GKACS LTT+EI+KFLSGDYRLSKARSNDLFWEA+WPRLLARGWHSEQ Sbjct: 298 NQAIPVRPEIPSGKACSSLTTTEIVKFLSGDYRLSKARSNDLFWEAVWPRLLARGWHSEQ 357 Query: 402 PESRGYI-GPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P+ +GYI G KH LVFLMPGVKKFSRRKL+KG+ YFDSVTDVL KVAK Sbjct: 358 PKDQGYIAGSKHCLVFLMPGVKKFSRRKLVKGEQYFDSVTDVLSKVAKEPGLIELDNEEV 417 Query: 579 NGYKKEEE---NELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG 749 +G K +E+ N T E KS+ED +DLPTK+R FYLQPRTPNR+T KF VVDTSLS G Sbjct: 418 DGNKNKEDDDSNLSTSEIKSDEDEDDLPTKERHFYLQPRTPNRNTHTIKFMVVDTSLSDG 477 Query: 750 KIRDLRTLPSEISNTLISLDFTEDRNQN--------------------------NKDVNH 851 K+R+LRTLPSEISN LIS D TED +++ K NH Sbjct: 478 KVRELRTLPSEISNFLISFDQTEDNDEDTVEENSDESNTIIASMRDNPRMSKSGGKVKNH 537 Query: 852 DDISSYQDSRTV------YPYP-KNNKDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISK 1010 DD SYQD+R V P K +D+ D K RKV K+ RK+K+ N D+IAPI+K Sbjct: 538 DDRPSYQDARPVCHDISKTSVPGKKQRDLYDDKKHRKVVKAPLKRKRKEGNADHIAPIAK 597 Query: 1011 RCRRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSGQDKLSST 1190 C+RLTA N+HEE + SS P N S C ++ RD NENLSS SS QDKLSST Sbjct: 598 SCQRLTA-NSHEE----MIQSSVGPTVQNGASSVCPEV-RDFNENLSSQVSSCQDKLSST 651 Query: 1191 S------------------------------------EDSQTHLSIDLNLPQYSPESSEN 1262 S E +T L ID+NLPQ S + Sbjct: 652 SSSKGSPSESVEFVTTSNIPAAETSTISNVHAAETLTESPETQLLIDINLPQVSQDFE-- 709 Query: 1263 GLLPTDSNNEQDNQSIKPDNNSLPKPS-----VNVHFITNPPRHSTRNPHLSIRALEAIA 1427 +S EQDN I+PDN+ LPK S I NP RHSTRN L+ +ALEA+ Sbjct: 710 -----NSTKEQDNHFIQPDNHHLPKSSEIEAAPEDQSIMNPRRHSTRNRPLTKKALEALV 764 Query: 1428 DGYLTVNRKQRGKITSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGNC 1607 +GYLTVNR+ + + T SH++L SRPS+R RG + P+ES +S AS IEE ENG SNS N Sbjct: 765 NGYLTVNRRPKRRDTKSHDNLGSRPSKRTRGVIGPSESTNSSMASHIEEVENGVSNSDND 824 Query: 1608 HIASEVGVFPEANEE 1652 + ++ V P A EE Sbjct: 825 NTLNKFQVLPNATEE 839 >ref|XP_011090943.1| uncharacterized protein LOC105171501 [Sesamum indicum] ref|XP_011090944.1| uncharacterized protein LOC105171501 [Sesamum indicum] Length = 861 Score = 528 bits (1359), Expect = e-175 Identities = 317/598 (53%), Positives = 377/598 (63%), Gaps = 61/598 (10%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + VSK F DEKMS DYVSSLKA+VG NILVEA+GIG GK DLT MALEPSR Sbjct: 272 EECRNALLEVSKTFGDEKMSLADYVSSLKAMVGMNILVEAVGIGKGKQDLTGMALEPSRS 331 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 QVIP R EIPTGKACS LTT+EIIKFLSGDYRLSKARSNDLFWEA+WPRLLARGWHSEQ Sbjct: 332 NQVIPARPEIPTGKACSSLTTTEIIKFLSGDYRLSKARSNDLFWEAVWPRLLARGWHSEQ 391 Query: 402 PESRGYI-GPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P+ +GY+ G KH LVFLMPGVKKFSRRKL+KGDHYFDSVTDVL KVAK Sbjct: 392 PKDQGYVAGSKHCLVFLMPGVKKFSRRKLVKGDHYFDSVTDVLSKVAKNPGLIELDAEED 451 Query: 579 NGYKKEEENELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGGKIR 758 + KK+E+ E T E+KS +D N LPT+Q+ YLQPRTPNRST + KFTVVDTSLS GK+R Sbjct: 452 HDSKKKEDYERTKERKSEQDDNHLPTRQQHCYLQPRTPNRSTAIIKFTVVDTSLSDGKVR 511 Query: 759 DLRTLPSEISNTLISLDFTEDRNQNNKDVNHD---------------------------- 854 +LRT+PSEISN I+ D ED + + D Sbjct: 512 ELRTVPSEISNAFIASDHIEDSDDDTPGETTDESDTSDTIMLDASVTDNVSLKTTESDDK 571 Query: 855 --------DIS-SYQDSRTVYPYP--------KNNKDVSDKTKSRKVSKSIPSRKQKQRN 983 D+S S QD+RTV P KN KD+ +S+KV+KS+ SRK KQ N Sbjct: 572 LFPGKKDQDVSVSCQDARTVNPDESATLLPDLKNTKDLQRNKQSKKVTKSLLSRKVKQGN 631 Query: 984 VDYIAPISKRCRRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVAS 1163 VD++AP++KR R L A +ETS G+ S T+PR +N S S +H ++ ENLSS Sbjct: 632 VDHMAPMNKRRRILNACRM-DETSSGLLPSWTAPRLENGMSSCSSSVH-EITENLSSQVG 689 Query: 1164 SGQDKLSSTS----------EDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQSIK 1313 QDKLSSTS E+ Q IDLN+PQ SPES G + T+SN +Q N S K Sbjct: 690 LCQDKLSSTSSSRGSPAESIENHQMQTLIDLNVPQVSPESENCGFM-TESNKDQGNTSKK 748 Query: 1314 PDNNSLP-----KPSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKITSS 1478 D+ LP + P RHSTRN + RALEA+ADGYLTVNRK++ Sbjct: 749 LDDRRLPISTAAEARCEQQSEVYPRRHSTRNRPPTTRALEALADGYLTVNRKRK------ 802 Query: 1479 HEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGNCHIASEVGVFPEANEE 1652 +RPS+ R + PNES +S SQ+EE ENG S SGNC+I + V EAN+E Sbjct: 803 ----VNRPSQHVRIVIGPNESTNSSVDSQMEEAENGVSESGNCNIFVKSQVPAEANDE 856 >ref|XP_011071455.1| uncharacterized protein LOC105156897 [Sesamum indicum] ref|XP_011071456.1| uncharacterized protein LOC105156897 [Sesamum indicum] Length = 884 Score = 523 bits (1348), Expect = e-173 Identities = 320/618 (51%), Positives = 381/618 (61%), Gaps = 86/618 (13%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F +EKM VDYVSSLKA+VG NILVEA+ IGTGK DLTRMALEP R Q IPVR E Sbjct: 269 VSKNFGEEKMLLVDYVSSLKALVGMNILVEAVAIGTGKQDLTRMALEPLRSNQAIPVRPE 328 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGYI-- 422 +PTGK CS LTT+EIIKFLSGDYRLSKARSNDLFWEA+WPRLLARGWHSEQP++ ++ Sbjct: 329 MPTGKRCSSLTTTEIIKFLSGDYRLSKARSNDLFWEAVWPRLLARGWHSEQPQNPRHVAG 388 Query: 423 GPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEE 602 KH LVFLMPG+KKFSRRKL+KG HYFDSVTDVLGKV K +G KKEE Sbjct: 389 SNKHCLVFLMPGIKKFSRRKLVKGYHYFDSVTDVLGKVRKEPGLIDLDNEETDGNKKEEG 448 Query: 603 NELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGGKIRDLRTLPSE 782 +E G+KK ED N PT+ R+ YLQPRT N S +FTVVDTSLS GK+R+LR LPSE Sbjct: 449 HERAGKKKLKEDENYRPTRHRRSYLQPRTSNCSMDDTRFTVVDTSLSDGKVRELRALPSE 508 Query: 783 ISNTL-ISLDFTE-------DRNQNNKDV---------------------------NHDD 857 SN + ISL T+ + N D HDD Sbjct: 509 TSNMMSISLVHTQGGAQVTLEENNGESDATNTITPDAYAADNPTAKTSRKTFPARKKHDD 568 Query: 858 ISSYQDSRTVY--------PYPKNNKDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKR 1013 SS+QD+ TVY P KN K + DK +SRKV K RKQ+Q +VDY APISKR Sbjct: 569 NSSFQDTHTVYPDISKTSGPDLKNKKGLIDKKQSRKVPKPHLRRKQEQGDVDYTAPISKR 628 Query: 1014 CRRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSGQDKLSSTS 1193 CRRLTAN +E DGV SS +PRS N S CS R+ NEN+SS QDKL S+S Sbjct: 629 CRRLTANGC-DEVRDGVIRSSIAPRSGNSTSY-CSSGTREFNENVSSQVRLCQDKLLSSS 686 Query: 1194 ------------------------------------EDSQTHLSIDLNLPQYSPESSENG 1265 E+++T IDLNLPQ SPE E+ Sbjct: 687 SSKGDKLLPTSSSKGSPHESIKCNPVSSIHAKEPSPENTRTPFLIDLNLPQLSPE-IEDY 745 Query: 1266 LLPTDSNNEQDNQSIKPDNNSLPKPS-----VNVHFITNPPRHSTRNPHLSIRALEAIAD 1430 + TD +Q++ SIKP+N+ L K S + + NP RHSTRN + RALEA+AD Sbjct: 746 SVATDMRMDQNDGSIKPENHCLSKSSDIEAGMELPSTVNPLRHSTRNRPPTTRALEAVAD 805 Query: 1431 GYLTVNRKQRGKITSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGNCH 1610 GYLTVNR++R + TSS ++ SR S+RAR V PN+SP+S AS IEE ENG SN+G + Sbjct: 806 GYLTVNRRRRSRDTSSRGNIASRRSQRARRVVAPNDSPNSSMASHIEEAENGVSNTGTNN 865 Query: 1611 IASEVGVFPEANEE*VWR 1664 + S+ + EAN E V R Sbjct: 866 MFSKFHIPTEANNESVPR 883 >ref|XP_022865987.1| uncharacterized protein LOC111385805 [Olea europaea var. sylvestris] ref|XP_022865988.1| uncharacterized protein LOC111385805 [Olea europaea var. sylvestris] Length = 863 Score = 452 bits (1163), Expect = e-145 Identities = 274/594 (46%), Positives = 356/594 (59%), Gaps = 59/594 (9%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + V K F + KMS DYV SLK +VGTNILV+A+GIG GK DLT MA E SR Sbjct: 272 EECQNALLEVCKTFGEGKMSLEDYVFSLKTMVGTNILVKAVGIGKGKQDLTGMAFEISRS 331 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 QVIP+R EIPTGKACS LT SEI+ FL+GDYRLSKARSNDLFWEA+WPRLLARGWHSE+ Sbjct: 332 NQVIPIRPEIPTGKACSALTPSEIVNFLTGDYRLSKARSNDLFWEAVWPRLLARGWHSEE 391 Query: 402 PESRGYIGPK-HSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P+++GY +SLVFL+PG+ KFSRRKL+KG+HYFD V DVL KVA+ Sbjct: 392 PKNQGYAAVSMYSLVFLVPGINKFSRRKLVKGEHYFDCVADVLSKVAREPGLLELENEED 451 Query: 579 NGYKKEEENELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGGKIR 758 K +EE + T E K +D ++ P +Q FYLQPRTP +T FTVVDTS + GK Sbjct: 452 EKNKNKEEYKWTSESKLLKDDDEPPIRQHHFYLQPRTPKWNTDAMTFTVVDTSSADGKPC 511 Query: 759 DLRTLPSEISNTLISLDFTEDRNQNNKD------------------VNHDDISSYQDSRT 884 LR+LP EISNT+IS + +EDRN + D N+ ++ + + + Sbjct: 512 KLRSLPFEISNTIISQNRSEDRNGDTHDEATDESDIVDTMLVDDSETNNTNLGTTKSNLE 571 Query: 885 VYPYPK-----------------NNKDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKR 1013 + P K N KD+ +SR + KS SRK K+ NVD +AP +KR Sbjct: 572 MLPGRKDCDTICQGSDISVTKLKNKKDLHQDKQSRNLVKSRLSRKLKRENVDNMAPSTKR 631 Query: 1014 CRRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSGQDKLSSTS 1193 R+LTA + +E TS+GV HS+ P +N CS H EN+S+ A S Q+KLS T Sbjct: 632 HRKLTACSGNE-TSNGVTHSTLVPSQENEIISLCSGSH-GFTENISAQAGSSQEKLSYTG 689 Query: 1194 EDS-----------------QTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQSIKPDN 1322 Q+H IDLNLPQ SP+ EN +L T+ E+D + +KPD+ Sbjct: 690 SSKGSPTGSVECTEPHLRHLQSHSLIDLNLPQVSPDL-ENAVLTTEIIKEEDERILKPDD 748 Query: 1323 NS-LP-----KPSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKITSSHE 1484 + LP +P+ N+H RHSTRN L+ +ALEA+A G+LT NRK++ K T+S E Sbjct: 749 HCPLPSTSGEQPNPNLH------RHSTRNRPLTAKALEALASGFLTTNRKRKNKDTTSRE 802 Query: 1485 DLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGNCHIASEVGVFPEAN 1646 +LT RP R A+ V NE + SQI+E ENG SNSG+ ++ + V + N Sbjct: 803 NLTPRPRRYAQDIVALNEFSNDTVTSQIQEGENGVSNSGDSNVRDKFQVLKDEN 856 >gb|KZV24014.1| hypothetical protein F511_08975 [Dorcoceras hygrometricum] Length = 873 Score = 427 bits (1099), Expect = e-136 Identities = 268/612 (43%), Positives = 346/612 (56%), Gaps = 75/612 (12%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + VSK+F DEKMS YV+SLKA++G + LV A+GIGTGK DLTRMA+EPSR Sbjct: 253 EECRSALLEVSKRFGDEKMSLAKYVASLKAMIGMSALVGAVGIGTGKQDLTRMAMEPSRS 312 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 Q + +R EIPTGKACS LTT EIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ Sbjct: 313 NQAVQMRPEIPTGKACSSLTTEEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 372 Query: 402 PESRGY-IGPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P+ + Y +G KH LVFL+PGV+KFSRRKL+KGDHYFDSVTDVL KVAK Sbjct: 373 PKDQVYAVGSKHCLVFLVPGVQKFSRRKLVKGDHYFDSVTDVLSKVAKEPELIELHTEED 432 Query: 579 NGYKKEEENELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGGKIR 758 K +E++ T + ED +D ++QR FYLQPRTP R + KFTVVDTSL GK+R Sbjct: 433 GTTKSQEDHRCTSGRGVEEDDSDQSSRQRHFYLQPRTPLRHSGATKFTVVDTSLPDGKLR 492 Query: 759 DLRTLPSEISNTLI-------------------------------SLDFTEDR------- 824 ++R+L +EISN L +D DR Sbjct: 493 EIRSLSTEISNILTYGKRTTVMDEDSSYESAYESETMSSVLLNSSVIDRVSDRPNKSGAE 552 Query: 825 -----NQNNKDVNHDDISSYQDSRTVYPYPKNNKDVSDKTKSRKVSKSIPSRKQKQRNVD 989 +N H D ++ K K++S +SR V + + SRK K+ N Sbjct: 553 MLPRKKENVGGTMHQDSHDASSDISLISL-KKKKNLSGNKESRNVVEPLLSRKPKKGNKP 611 Query: 990 YIAPISKRCRRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSG 1169 Y AP +K+ R+ T + +HEET D + TS R DN S CS IH E S+ Sbjct: 612 YSAPTAKQ-RKKTISGSHEETRDHKACTLTSTRLDNEISSCCSGIHGSA-EKFSTEMVPC 669 Query: 1170 QDKLSST-------------SEDSQTHLS-------------IDLNLPQYSPESSENGLL 1271 ++KL+ST + ++ TH + IDLN+PQ SENG+ Sbjct: 670 ENKLASTGSPNCSAAENVECNPNTSTHSTEFSQVNPHKSQTLIDLNMPQVF--QSENGIF 727 Query: 1272 PTDSNNEQDNQSIKPDNNSLPKPS-----VNVHFITNPPRHSTRNPHLSIRALEAIADGY 1436 T+S+ EQ+ +K D+ LPK S TN RH TRN + +ALEA A+GY Sbjct: 728 STESSKEQNTSILKSDDQPLPKVSPIQATSEQQSTTNSLRHGTRNRPPTTKALEARANGY 787 Query: 1437 LTVNRKQRGKITSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGNCHIA 1616 LTVNR+++ K TS ED +SRP +R R S S+I +++NG +SGN ++ Sbjct: 788 LTVNRRRKSKDTSWQEDPSSRPLQRNRVVSRDESSACVSVPSEIGKSQNGVVDSGNSNMF 847 Query: 1617 SEVGVFPEANEE 1652 E+ P AN++ Sbjct: 848 GELQAMPVANDK 859 >emb|CDP19542.1| unnamed protein product [Coffea canephora] Length = 805 Score = 371 bits (953), Expect = e-115 Identities = 244/581 (41%), Positives = 323/581 (55%), Gaps = 58/581 (9%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F + KMS +YV SLKA+VG ++LVE +GIG GK DLT MALEP R IP+R E Sbjct: 232 VSKTFVEGKMSLEEYVFSLKAMVGLSLLVEVVGIGKGKQDLTGMALEPVRSNHAIPMRPE 291 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 IPTGKACS LT++EI+KFL+GDYRLSKARS+DLFWEA+WPRLLARGWHSE+P+ GY G Sbjct: 292 IPTGKACSSLTSNEIVKFLTGDYRLSKARSSDLFWEAVWPRLLARGWHSEEPKDPGYAAG 351 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 K+SLVFL+PG+KKFSRR+L+KG+HYFDSV+DVL KVA +KEEE Sbjct: 352 SKNSLVFLVPGIKKFSRRRLVKGNHYFDSVSDVLSKVASEPGLIELENEVDESKRKEEEY 411 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG-KIRDLRTLPSE 782 E + ++K +G+D+P ++R+ YLQPRTP R + KFT+VDT L K+++LR L E Sbjct: 412 ECSRKRKL--EGDDMPNQRRRSYLQPRTPYRGSDGMKFTIVDTGLEDARKVKELRRLLRE 469 Query: 783 ISNTLISLDFTEDRNQNNKDVNHDDISSY----QDSRTVYPYPKNNKDVSDKTKSRKVSK 950 S +E + N+ D+ DD S DS + + K + D S+ + + Sbjct: 470 FS--------SEFNSGNSYDIIDDDSSEVSTEESDSPDMTLHNKGDNDTSNASNHLSNGE 521 Query: 951 SIPSRKQKQ-------RNVDY-IAPISKRCRRLTANNTHEETSDGVNHSSTSPRSDNRQS 1106 +P RK Q + Y + P KR R LTA N H ETS+ + + P+SD+ S Sbjct: 522 ILPDRKDLQIHAPTCENHASYDMNPAFKRARGLTACN-HLETSNVLTDRAILPKSDSELS 580 Query: 1107 CPCSDIHRDLNENLSSVASSGQDKLSSTS------------------------EDSQTHL 1214 SD+ RD EN+ + ++ DKLS ++ + SQ Sbjct: 581 SRGSDV-RDFAENVPPLVATPPDKLSLSNSSKGSPTESVEHDTVSCLVASDPQQSSQNPT 639 Query: 1215 SIDLNLPQYSPESSENGLLPTDSNNE----QDNQSIKPDNNSLPKPSVNVHFITNPPRHS 1382 IDLN+PQ P E G L TD+ E D PD + P+ N+ N R Sbjct: 640 LIDLNIPQV-PVDFETGSLRTDATTENPVDHDELERAPDKVN-PEHQANM----NLQRRG 693 Query: 1383 TRNPHLSIRALEAIADGYLTVNRKQRGKITSSHEDLTSRPSRRAR--------------- 1517 TR + RALEA+A GYLTVNR+++G S E++ SRPSRRAR Sbjct: 694 TRVRPPTTRALEALAHGYLTVNRRRKGSEARSRENMRSRPSRRARGAGQGVAFQSVHQLN 753 Query: 1518 -GGVHPNESPSSYTASQIEETENGASNSGNCHIASEVGVFP 1637 G V P P S + ++ T G N G + V P Sbjct: 754 LGSVDPRSEPGS---NSVDSTVQGGENVGKLQVQHAGNVTP 791 >gb|EYU43933.1| hypothetical protein MIMGU_mgv1a018763mg, partial [Erythranthe guttata] Length = 721 Score = 363 bits (933), Expect = e-113 Identities = 262/577 (45%), Positives = 329/577 (57%), Gaps = 33/577 (5%) Frame = +3 Query: 21 IFLKFCWEFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRM 200 IFL+ E C + VSK FA+EKMS DYVSSLK++VG NILVEA+ IG GK DLT Sbjct: 214 IFLRVSEE-CRNALLEVSKTFAEEKMSLADYVSSLKSMVGVNILVEAVAIGAGKRDLTGA 272 Query: 201 ALEPSRLTQ-VIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLL 377 +LEPSR + +RSEIPTGKACS LT +EI +FL G+YRLSKARSNDLFWEA+WPRLL Sbjct: 273 SLEPSRSSYPTAHIRSEIPTGKACSALTANEIARFLCGNYRLSKARSNDLFWEAVWPRLL 332 Query: 378 ARGWHSEQPESRGYIGPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXX 557 ARGWHSEQP++ SLVFL+PGV+KFS+RKL+KGD YFDSV DVL VAK Sbjct: 333 ARGWHSEQPKNH---TSNFSLVFLLPGVRKFSKRKLVKGDDYFDSVADVLSMVAKDPGLI 389 Query: 558 XXXXXXANGYKKEE----ENELTGE--KKSNEDGNDLPTKQRQFYLQPRTPNR-STRVAK 716 +K++ +NE G+ NE+ +D QR YLQPR P R S V K Sbjct: 390 QLENEEQEKDEKDDVSMTKNEGNGDVSMTKNEENDD----QRHCYLQPRNPKRKSAVVMK 445 Query: 717 FTVVDTSLSGGKIRDLRTLPSEISNTLISLDFTEDRNQNNKDVNHDDISSYQDSRTVYPY 896 FTVVDTS+S G++R+LR EIS+ I D D N +D Sbjct: 446 FTVVDTSMSNGRVRELR----EISSVPIGGD----------DGNDED-----------AL 480 Query: 897 PKNNKDVSDKTKSRKVSKSIPSRK-QKQRNVDY-IAPISKRCRRLTANNTHEETSDGVNH 1070 KN KD K +K+ KS RK +KQRN DY + P +KRCR T Sbjct: 481 EKNKKDFQGK---KKLPKSQVGRKTKKQRNEDYVVGPTTKRCRAQT-------------- 523 Query: 1071 SSTSPRSDNRQSCPCSDIHRDLNENLSS-VASSGQDKLSSTS-------EDSQTHLSIDL 1226 PCS H +++ENLSS V S+ DK S S E+ + IDL Sbjct: 524 -------------PCS--HEEVDENLSSQVGSANLDKPSCASSSKGSPVEEKTPQILIDL 568 Query: 1227 NLPQYSPESSENGLLPTDSNNEQDN--QSIKP---------DNNSLPK-PSVNVHFITNP 1370 NLPQ P+S N + D E+ Q+I P + L P+ V N Sbjct: 569 NLPQVCPDSEYNDSVKVDVEEEEGESLQNIPPAAEVAEVAAEEEPLQNIPAAEVAVNANQ 628 Query: 1371 PRHSTRNPHLSIRALEAIADGYLTVN-RKQRGKITSSHEDLTSRPSRRARGG-VHPNESP 1544 R+STRN ++R+L+A+A GYL VN RK++GK +S++D+ +P +R RGG V PNES Sbjct: 629 RRYSTRNQTPTMRSLQAVAHGYLAVNHRKRKGKEAASNDDV--KPCQRPRGGCVGPNEST 686 Query: 1545 SSYTASQIEETE-NGASNSGNCHIASEVGVFPEANEE 1652 SS ASQ+EE+ NGAS SGN S+V PE +EE Sbjct: 687 SSSAASQVEESSGNGASTSGN---ESQVPPPPENDEE 720 >ref|XP_019235744.1| PREDICTED: uncharacterized protein LOC109216072 [Nicotiana attenuata] gb|OIT25065.1| hypothetical protein A4A49_37455 [Nicotiana attenuata] Length = 857 Score = 366 bits (940), Expect = e-112 Identities = 249/595 (41%), Positives = 334/595 (56%), Gaps = 76/595 (12%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F + K+ +YV SLKA+VG N+L+EA+GIG K+DLT +ALEPS+ I RSE Sbjct: 259 VSKAFGEGKILLEEYVFSLKAMVGVNMLIEAVGIGKDKYDLTCVALEPSKSNHAI--RSE 316 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 +P GKACS LTT+E++KFL+GDYRLSKARSNDLFWEA+WPRLLARGWHSEQP++ Y Sbjct: 317 LPAGKACSSLTTNEVVKFLTGDYRLSKARSNDLFWEAVWPRLLARGWHSEQPKNLNYAAN 376 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 PK++LVFLMPGVKKFSRR L+KG HYFDSVTDVLGKVA K +E Sbjct: 377 PKNALVFLMPGVKKFSRR-LIKGIHYFDSVTDVLGKVASDPKLLELGVED-ECTKGKEGG 434 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG---KIRDLRTLP 776 + T E K +D DLPT+QR YLQPRTPNR T V KFTVVDTSLS G K+R+LR+LP Sbjct: 435 DWTDETKLEQD--DLPTRQRPCYLQPRTPNRCTDVMKFTVVDTSLSDGKPYKVRELRSLP 492 Query: 777 SEISNTL------------ISLDFTEDRNQNNKDVNHD------------------DISS 866 EIS+ L +S D ++ N +H +IS+ Sbjct: 493 VEISSKLSLGSHAEGSEEELSTDESDSVGTNKAKTDHHNSSRIFSNGETHSDEKGFEISA 552 Query: 867 YQDSRTVYPYP----------KNNKDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRC 1016 P+P KN K++ + + RK K+ +++ K+ NV ++API+++ Sbjct: 553 SSKKFQEVPHPAASTVPVNAWKNTKNICEDKQPRKAIKAHSNKRLKENNVHFVAPIAQKR 612 Query: 1017 RRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSGQDKLSSTS- 1193 RRLTA + E S + +S P Q + DL+ N +ASS +DK+SS+S Sbjct: 613 RRLTACSRGETNSSVMVNSLMVP--GREQEVRHTSSSNDLSLNNIQIASS-EDKVSSSSS 669 Query: 1194 -----------------------EDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQ 1304 E+ QT IDLN PQ P+S L+P + ++ N Sbjct: 670 SKSSPSQSAECASADHHVLKLPEEEPQTRAMIDLNEPQVPPDSEYEFLMPALTEDQSGNT 729 Query: 1305 SIKPDNNSLPKPSVNVHFI------TNPPRHSTRNPHLSIRALEAIADGYLTV-NRKQRG 1463 D + K S + N RH TRN + RALEA+A+G+LTV +R+Q+ Sbjct: 730 KRPDDVSGELKTSTQSASMEQQQPSLNSRRHGTRNRPPTTRALEALANGFLTVHSRRQKN 789 Query: 1464 KITSSHEDLT-SRPSRRARGGVHPNESPSSYTASQIEETENGASNSGNCHIASEV 1625 K S LT SR S++ GG+ + S +S SQ+EE E S +G ++ ++ Sbjct: 790 KEGGSRGKLTSSRSSQQTPGGMKTDFS-NSTVVSQMEEGEAAVSKAGESNMFGKI 843 >emb|CBI26064.3| unnamed protein product, partial [Vitis vinifera] Length = 847 Score = 366 bits (939), Expect = e-112 Identities = 240/560 (42%), Positives = 311/560 (55%), Gaps = 42/560 (7%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F + K+ +YVS+LKA VG NI +EA+GIG G+ DLT +ALEP + QV PVR E Sbjct: 255 VSKTFGEGKILLEEYVSTLKATVGMNIFIEAVGIGKGRQDLTGIALEPLKHNQVAPVRPE 314 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 +P GKACS LT EIIK L+GD+RLSKARS+DLFWEA+WPRLLARGWHSEQP Y G Sbjct: 315 MPIGKACSSLTPQEIIKCLTGDFRLSKARSSDLFWEAVWPRLLARGWHSEQPRGHNYAAG 374 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 K LVFL+PGVKKFSRRKL+KG HYFDSV+DVL KVA G K +EE+ Sbjct: 375 SKQPLVFLIPGVKKFSRRKLVKGSHYFDSVSDVLSKVASDPGLLEFEIEADEGNKSKEES 434 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG---KIRDLRTLP 776 LT E K ++D DL ++ YLQPRTPNR+ + KFTVVDTSL+ G K +++R+LP Sbjct: 435 GLTNETKLDKD--DLSDQRHHCYLQPRTPNRNVDIVKFTVVDTSLANGAKYKEKEVRSLP 492 Query: 777 SEISNTLISLDFTEDRNQNNKDVNHDDISSYQDSRTVYPYPKN-NKDVSDKTKSRKVSKS 953 E SNT S E+ +++ + D S+ + PK+ N ++ + K + K Sbjct: 493 FESSNTSTSSSHFEENDEDTSEELVVDESNSDSTSLPAKVPKSQNTNMYNAKKQSRAPKC 552 Query: 954 IPSRKQKQRNVDYIAPISKRCRRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRD 1133 RK K +Y+AP++KR RRLTA + ETS P +S C H Sbjct: 553 HLGRKMKPDMSNYLAPVTKRRRRLTA-CSRAETSQSTITFLVGPELKQEESGGCIGKHDS 611 Query: 1134 ----------LNENLSSVASSGQDK--------LSST-------SEDSQTHLSIDLNLPQ 1238 L E L S +SS +D LSS E+ Q IDLNLP Sbjct: 612 DEIIHCKVVPLTEKLCSSSSSCKDSRIDGREGMLSSNCSGAEHPREELQFRTMIDLNLPV 671 Query: 1239 YSPESSENGLLPTDSNNEQDNQSIKPDNNSLPKPSVNVHFITNPP-----RHSTRNPHLS 1403 + +L S + D S + D+ + K S+ V PP R STRN L+ Sbjct: 672 LPDAETGEPVLVASSERQDDQASKQADDPNALKTSIGVANSEQPPNMNSRRQSTRNRPLT 731 Query: 1404 IRALEAIADGYLTVNRKQRGKITS-SHEDLTSRPSRRARGGVHPNES-PSSYTASQIEET 1577 +ALEA+A G+L R++R + + EDL SRPSRRAR + ES + S+++E Sbjct: 732 TKALEALASGFLNTRRRRRKRTEAFPGEDLISRPSRRARCKMRVTESFGTGIMDSKVQEE 791 Query: 1578 ENGASNS-----GNCHIASE 1622 NG N HI SE Sbjct: 792 GNGVCNDNEDMFSKFHIRSE 811 >ref|XP_012859056.1| PREDICTED: uncharacterized protein LOC105978179 [Erythranthe guttata] ref|XP_012859066.1| PREDICTED: uncharacterized protein LOC105978179 [Erythranthe guttata] ref|XP_012859075.1| PREDICTED: uncharacterized protein LOC105978179 [Erythranthe guttata] Length = 742 Score = 360 bits (924), Expect = e-111 Identities = 262/591 (44%), Positives = 331/591 (56%), Gaps = 47/591 (7%) Frame = +3 Query: 21 IFLKFCWEFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRM 200 IFL+ E C + VSK FA+EKMS DYVSSLK++VG NILVEA+ IG GK DLT Sbjct: 214 IFLRVSEE-CRNALLEVSKTFAEEKMSLADYVSSLKSMVGVNILVEAVAIGAGKRDLTGA 272 Query: 201 ALEPSRLTQ-VIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLL 377 +LEPSR + +RSEIPTGKACS LT +EI +FL G+YRLSKARSNDLFWEA+WPRLL Sbjct: 273 SLEPSRSSYPTAHIRSEIPTGKACSALTANEIARFLCGNYRLSKARSNDLFWEAVWPRLL 332 Query: 378 ARGWHSEQPESRGYIGPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXX 557 ARGWHSEQP++ SLVFL+PGV+KFS+RKL+KGD YFDSV DVL VAK Sbjct: 333 ARGWHSEQPKNH---TSNFSLVFLLPGVRKFSKRKLVKGDDYFDSVADVLSMVAKDPGLI 389 Query: 558 XXXXXXANGYKKEE----ENELTGEKK-------------SNEDGNDLP---TKQRQFYL 677 +K++ +NE G+ N+D +D+ +QR YL Sbjct: 390 QLENEEQEKDEKDDVSMTKNEGNGDVSMTKNEENDDVSMIKNDDNDDVSITRQQQRHCYL 449 Query: 678 QPRTPNR-STRVAKFTVVDTSLSGGKIRDLRTLPSEISNTLISLDFTEDRNQNNKDVNHD 854 QPR P R S V KFTVVDTS+S G++R+LR EIS+ I D D N + Sbjct: 450 QPRNPKRKSAVVMKFTVVDTSMSNGRVRELR----EISSVPIGGD----------DGNDE 495 Query: 855 DISSYQDSRTVYPYPKNNKDVSDKTKSRKVSKSIPSRK-QKQRNVDY-IAPISKRCRRLT 1028 D KN KD K +K+ KS RK +KQRN DY + P +KRCR T Sbjct: 496 D-----------ALEKNKKDFQGK---KKLPKSQVGRKTKKQRNEDYVVGPTTKRCRAQT 541 Query: 1029 ANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSS-VASSGQDKLSSTS---- 1193 PCS H +++ENLSS V S+ DK S S Sbjct: 542 ---------------------------PCS--HEEVDENLSSQVGSANLDKPSCASSSKG 572 Query: 1194 ---EDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDN--QSIKP---------DNNSL 1331 E+ + IDLNLPQ P+S N + D E+ Q+I P + L Sbjct: 573 SPVEEKTPQILIDLNLPQVCPDSEYNDSVKVDVEEEEGESLQNIPPAAEVAEVAAEEEPL 632 Query: 1332 PK-PSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVN-RKQRGKITSSHEDLTSRPS 1505 P+ V N R+STRN ++R+L+A+A GYL VN RK++GK +S++D+ +P Sbjct: 633 QNIPAAEVAVNANQRRYSTRNQTPTMRSLQAVAHGYLAVNHRKRKGKEAASNDDV--KPC 690 Query: 1506 RRARGG-VHPNESPSSYTASQIEETE-NGASNSGNCHIASEVGVFPEANEE 1652 +R RGG V PNES SS ASQ+EE+ NGAS SGN S+V PE +EE Sbjct: 691 QRPRGGCVGPNESTSSSAASQVEESSGNGASTSGN---ESQVPPPPENDEE 738 >ref|XP_009757778.1| PREDICTED: uncharacterized protein LOC104210549 [Nicotiana sylvestris] ref|XP_009757779.1| PREDICTED: uncharacterized protein LOC104210549 [Nicotiana sylvestris] ref|XP_016435405.1| PREDICTED: uncharacterized protein LOC107761674 [Nicotiana tabacum] Length = 856 Score = 361 bits (926), Expect = e-110 Identities = 248/594 (41%), Positives = 332/594 (55%), Gaps = 75/594 (12%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F + K+ +YV SLKA+VG N+L+EA+GIG K+DLT +ALEPS+ I RSE Sbjct: 259 VSKAFGEGKILLEEYVFSLKAMVGVNMLIEAVGIGKDKYDLTCVALEPSKSNHAI--RSE 316 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 +P GKACS LTT+E++KFL+GDYRLSKARSNDLFWEA+WPRLLARGWHSEQP++ Y Sbjct: 317 LPAGKACSSLTTNEVVKFLTGDYRLSKARSNDLFWEAVWPRLLARGWHSEQPKNLNYAAN 376 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 PK++LVFLMPGVKKFSRR L+KG HYFDSVTDVLGKVA K +E Sbjct: 377 PKNALVFLMPGVKKFSRR-LIKGIHYFDSVTDVLGKVASDPKLLELDAED-ECTKGKEGR 434 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGGK---IRDLRTLP 776 + T E K +D DLPT+QR YLQPRTPNR T V KFTVVDTSLS GK +R+LR+LP Sbjct: 435 DWTDEAKLEQD--DLPTRQRPCYLQPRTPNRCTDVMKFTVVDTSLSDGKPYRVRELRSLP 492 Query: 777 SEISNTLISLDFTEDRNQ-----------NNKDVNHD------------------DISSY 869 EIS+ L E+ + NK NH+ +IS+ Sbjct: 493 VEISSKLSLGSHAEESEEELSSDESDSVGTNKAKNHNNSLRIFSNGETHSEEKGFEISAS 552 Query: 870 QDSRTVYPYP----------KNNKDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRCR 1019 P+P KN K++ + + RKV K+ +++ K+ NV ++API+++ R Sbjct: 553 SKKFQEVPHPAFSTVPVNASKNTKNICEDKQPRKVIKAHSNKRLKENNVHFVAPIAQKRR 612 Query: 1020 RLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSGQDKLSSTS-- 1193 RLTA + E S + +S P + S +L+ N +ASS +DK+SS+S Sbjct: 613 RLTACSRGETNSSVMVNSLMVPGREQEMRHTSSS--NELSLNNIPIASS-EDKVSSSSSS 669 Query: 1194 ----------------------EDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQS 1307 E Q IDLN PQ P+S L+P + ++ N Sbjct: 670 KSSPSQSTECASADHHVLKLPHEVPQNRTMIDLNEPQVPPDSEYEILMPALTEDQSGNMK 729 Query: 1308 IKPDNNSLPKPSVNVHFI------TNPPRHSTRNPHLSIRALEAIADGYLTV-NRKQRGK 1466 D + K S + + N RH TRN + RALEA+A+G+LTV +R+Q+ K Sbjct: 730 RPDDVSGELKTSTHSASMEQQQPSLNSRRHGTRNRPPTTRALEALANGFLTVHSRRQKSK 789 Query: 1467 ITSSHEDLT-SRPSRRARGGVHPNESPSSYTASQIEETENGASNSGNCHIASEV 1625 S T SR S++ GG+ + S +S SQ+EE E S G ++ ++ Sbjct: 790 EGGSRRKSTSSRSSQQTPGGMKTDFS-NSTVVSQMEEGEAVVSKGGESNMFGKI 842 >ref|XP_009619501.1| PREDICTED: uncharacterized protein LOC104111495 [Nicotiana tomentosiformis] ref|XP_009619508.1| PREDICTED: uncharacterized protein LOC104111495 [Nicotiana tomentosiformis] ref|XP_016449583.1| PREDICTED: uncharacterized protein LOC107774544 [Nicotiana tabacum] ref|XP_016449584.1| PREDICTED: uncharacterized protein LOC107774544 [Nicotiana tabacum] Length = 857 Score = 358 bits (920), Expect = e-110 Identities = 243/579 (41%), Positives = 323/579 (55%), Gaps = 75/579 (12%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F + K+ +YV SLKA+VG N+L+EA+GIG K+DLT +ALEPS+ I RSE Sbjct: 259 VSKAFGEGKILLEEYVFSLKAMVGVNMLIEAVGIGKDKYDLTCVALEPSKSNHAI--RSE 316 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 +P GKACS LTT+E++KFL+GDYRLSKARSNDLFWEA+WPRLLARGWHSEQP++ Y Sbjct: 317 LPAGKACSSLTTNEVVKFLTGDYRLSKARSNDLFWEAVWPRLLARGWHSEQPKNLNYAAN 376 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 PK++LVFLMPGVKKFSRR L+KG HYFDSVTDVLGKVA K +E Sbjct: 377 PKNALVFLMPGVKKFSRR-LIKGIHYFDSVTDVLGKVASDPKLLELDAED-ECTKGKEGR 434 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG---KIRDLRTLP 776 + T E K +D DLPT+QR YLQPRTPNR T V KFTVVDTSLS G K+R+L +LP Sbjct: 435 DWTDEAKLEQD--DLPTRQRPCYLQPRTPNRCTDVMKFTVVDTSLSDGKPYKVRELGSLP 492 Query: 777 SEISNTL------------ISLDFTEDRNQNNKDVNHD------------------DISS 866 +EIS+ L +S D ++ N +H+ +IS+ Sbjct: 493 AEISSKLSLGSHAEESEEELSTDESDSVGTNKAKTDHNNSSRIFSNGEPHSDEKGFEISA 552 Query: 867 YQDSRTVYPYP----------KNNKDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRC 1016 P+P KN K++ + + RKV K+ +++ K+ NV ++API+++ Sbjct: 553 SSKKFQEVPHPASSTVPVNASKNTKNICEDKQPRKVIKAHSNKRLKENNVHFVAPIAQKR 612 Query: 1017 RRLTANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSGQDKLSSTS- 1193 RRLTA + E S + +S P Q + DL+ N +ASS +DK+SS+S Sbjct: 613 RRLTACSRGETNSSVMVNSLMVP--GREQEVRHTSSSNDLSLNNIQIASS-EDKVSSSSS 669 Query: 1194 -----------------------EDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQ 1304 E+ QT IDLN PQ P+S L+P + + N Sbjct: 670 SKSSPSQSAECASADHHVLKLPEEEPQTRAMIDLNEPQVPPDSEYEILMPALTEDLSGNT 729 Query: 1305 SIKPDNNSLPKPSVNVHFI------TNPPRHSTRNPHLSIRALEAIADGYLTV-NRKQRG 1463 D + K S + + N RH TRN + RALEA+A+G+LTV +R+Q+ Sbjct: 730 KRPDDVSGELKTSTHSASMEQQQPSLNSRRHGTRNRPPTTRALEALANGFLTVHSRRQKS 789 Query: 1464 KITSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETE 1580 K S TS S + G + +S SQ+EE E Sbjct: 790 KEGGSKRKSTSSRSSQPTPGCMGTDFSNSTVVSQMEEGE 828 >ref|XP_024022662.1| uncharacterized protein LOC21406306 [Morus notabilis] Length = 606 Score = 351 bits (900), Expect = e-109 Identities = 236/602 (39%), Positives = 318/602 (52%), Gaps = 76/602 (12%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK++ + K+S +YV +LK+ G N LVEA+GIG GK DLT M ++ + QV+ VR E Sbjct: 20 VSKQYGEGKISLEEYVFTLKSTFGLNALVEAVGIGKGKQDLTGMVMDTPKSNQVVHVRPE 79 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 IP GKACS LT EI+ FL+GD+RLSKARS+DLFWEA+WPRLLARGWHSEQP + + G Sbjct: 80 IPIGKACSTLTPLEIVNFLTGDFRLSKARSSDLFWEAVWPRLLARGWHSEQPNNHSFTAG 139 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 KHSLVFL+PG+KKFSRRKL+KGDHYFDSV+DVL KVA GYK +EEN Sbjct: 140 SKHSLVFLLPGIKKFSRRKLVKGDHYFDSVSDVLSKVAS-----EPGLLEIEGYKIKEEN 194 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSG---GKIRDLRTLP 776 E K +++ D P +QR YL+PRTPNR+T KFTVVDTSL+ GK+R+LR+LP Sbjct: 195 GWNDETKLDQE--DFPDEQRHCYLKPRTPNRATDAMKFTVVDTSLANGRTGKVRELRSLP 252 Query: 777 SEISNTLISLDFTEDRNQNNKDVNHDDISSY----------QDSRTVYPYPKN-----NK 911 EI NT S +ED ++++ D + D SS D + P NK Sbjct: 253 VEIRNTCTSQSESEDDDEDSSDESADKSSSVNALSSDKDETSDLKAAVPKLSKSLSFANK 312 Query: 912 DVSDKTKSRKVSKSIP---------------------SRKQKQRNVDYIAPISKRCRRLT 1028 DV T S V IP S+K+ N +AP+ KR RRL Sbjct: 313 DVEHGTDSTIVPAKIPKDKHNDLCNGAQLKKGTKSKLSQKEGPENKIQLAPVMKRRRRLP 372 Query: 1029 ANNTHEETSDGVNHSSTSPRSDNRQSCPCSDIHRDLNENLSSVASSGQDKLSSTS----- 1193 + + + + N S SC + DL+EN+ S Q+KLSSTS Sbjct: 373 PPSRKDTSCNTTNSRVDSRLQQEASSCV---ENSDLSENMLSQVDPSQEKLSSTSSSRGC 429 Query: 1194 ---------------------EDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQSI 1310 E Q+ IDLN+P S ++ + ++ QD Q Sbjct: 430 SPITSAEGIPSSNHMGAEQPLEKPQSRTFIDLNMP-ISQDAETDEPFTKETTARQDQQRS 488 Query: 1311 KPDNNSLPKPSV---------NVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRG 1463 K +N P+ SV P R STRN L+ + LEA A G++ +K++ Sbjct: 489 KESDN--PQLSVKSSECAANSEQEANVGPRRQSTRNRPLTTKVLEAFACGFMDTKQKRKA 546 Query: 1464 KITSSHEDLTSRPSRRARGGVHPNESPSSYTAS-QIEETENGASNSGNCHIASEVGVFPE 1640 K ++L RPSRR R + P ES +S +E+ E +G+ + +++GV + Sbjct: 547 KDAFPRDNLKLRPSRRPRPRLSPQESFNSANVDFTMEQRETIQKTNGD--VFNKLGVSSQ 604 Query: 1641 AN 1646 N Sbjct: 605 TN 606 >gb|EOY06483.1| Uncharacterized protein TCM_021187 isoform 3 [Theobroma cacao] Length = 866 Score = 358 bits (918), Expect = e-109 Identities = 240/585 (41%), Positives = 318/585 (54%), Gaps = 64/585 (10%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + VSK F + K+ +YV +LKA VG N LV A+GIG GK DLT + LEP + Sbjct: 268 EECQNTLLEVSKAFGEGKIMLEEYVFTLKATVGLNSLVSAVGIGKGKEDLTGITLEPMKA 327 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 QV PVR EIP GKACS LT EII FL+G YRLSKARSNDLFWEA+WPRLLARGWHSEQ Sbjct: 328 NQVAPVRPEIPVGKACSALTPLEIINFLTGSYRLSKARSNDLFWEAVWPRLLARGWHSEQ 387 Query: 402 PESRGY-IGPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P S+GY G KHSLVFL+PGVKKFSRRKL+KGDHYFDSV+DVL +VA Sbjct: 388 PASQGYTAGSKHSLVFLIPGVKKFSRRKLVKGDHYFDSVSDVLSRVASDPGLLELEIGAD 447 Query: 579 NGYKKEEENELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG--- 749 G +EEN + D +DLP +QR YL+PR PNR V FTVVDTSL G Sbjct: 448 KGDSSKEEN------GTESDRDDLPNRQRHCYLKPRIPNRGADVMAFTVVDTSLDDGGKF 501 Query: 750 KIRDLRTLPSE--ISNTLISLDFTEDRNQNNKDVNHDDISSYQDSRTVYP---------Y 896 K+R+LR+LP E ISN+ S + T + + D+ S ++ + P Y Sbjct: 502 KVRELRSLPIEMNISNSSDSEESTSEELIDESDLADTSCSGRVETNGLKPTEINHDREVY 561 Query: 897 PKNN--------------------KDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRC 1016 P N KD K + K K+ PS++ K N + +AP++KRC Sbjct: 562 PDGNASNNKFPVDGQASTNVPAIPKDPKTKVCNGKAMKNQPSQRIKIDNKNNLAPVTKRC 621 Query: 1017 RRLTANNTHEETSDGVNHSSTSPRSDNRQSCPC-------SDIHRDLN---ENLSSVASS 1166 R+LTA + E G S SP +++ C ++I +++ + LSS +SS Sbjct: 622 RKLTACSRKETIQKG-KIISVSPGLKQKEASCCEGNPDGSAEIPSEVDPVEQQLSSASSS 680 Query: 1167 -------GQDKLSSTSEDS-QTHLS------IDLNLPQYSPESSENGLLPTDSNNEQDNQ 1304 G+ L ST + QTH+ IDLNLP ++ + + +E +N Sbjct: 681 KGSPTIRGEGILRSTCAGAEQTHVEHQHRTLIDLNLPVLLDGETDEPFMGEVTESEHENP 740 Query: 1305 SIKPDNNSLPK-----PSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKI 1469 S +P+N S P+ PS + N R STRN + +ALEA+A G+LT +K++ + Sbjct: 741 SRQPNNASQPEATCCMPSSELQPNMNARRQSTRNRPPTTKALEALACGFLTTTQKRKRRD 800 Query: 1470 TSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGN 1604 + E+ SR SRRA GG +E+ E + +GN Sbjct: 801 GFARENSLSRASRRAHGGAKVSENYGDGMVDFKAEVKGNGMCNGN 845 >gb|EOY06482.1| Uncharacterized protein TCM_021187 isoform 2 [Theobroma cacao] Length = 868 Score = 358 bits (918), Expect = e-109 Identities = 240/585 (41%), Positives = 318/585 (54%), Gaps = 64/585 (10%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + VSK F + K+ +YV +LKA VG N LV A+GIG GK DLT + LEP + Sbjct: 270 EECQNTLLEVSKAFGEGKIMLEEYVFTLKATVGLNSLVSAVGIGKGKEDLTGITLEPMKA 329 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 QV PVR EIP GKACS LT EII FL+G YRLSKARSNDLFWEA+WPRLLARGWHSEQ Sbjct: 330 NQVAPVRPEIPVGKACSALTPLEIINFLTGSYRLSKARSNDLFWEAVWPRLLARGWHSEQ 389 Query: 402 PESRGY-IGPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P S+GY G KHSLVFL+PGVKKFSRRKL+KGDHYFDSV+DVL +VA Sbjct: 390 PASQGYTAGSKHSLVFLIPGVKKFSRRKLVKGDHYFDSVSDVLSRVASDPGLLELEIGAD 449 Query: 579 NGYKKEEENELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG--- 749 G +EEN + D +DLP +QR YL+PR PNR V FTVVDTSL G Sbjct: 450 KGDSSKEEN------GTESDRDDLPNRQRHCYLKPRIPNRGADVMAFTVVDTSLDDGGKF 503 Query: 750 KIRDLRTLPSE--ISNTLISLDFTEDRNQNNKDVNHDDISSYQDSRTVYP---------Y 896 K+R+LR+LP E ISN+ S + T + + D+ S ++ + P Y Sbjct: 504 KVRELRSLPIEMNISNSSDSEESTSEELIDESDLADTSCSGRVETNGLKPTEINHDREVY 563 Query: 897 PKNN--------------------KDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRC 1016 P N KD K + K K+ PS++ K N + +AP++KRC Sbjct: 564 PDGNASNNKFPVDGQASTNVPAIPKDPKTKVCNGKAMKNQPSQRIKIDNKNNLAPVTKRC 623 Query: 1017 RRLTANNTHEETSDGVNHSSTSPRSDNRQSCPC-------SDIHRDLN---ENLSSVASS 1166 R+LTA + E G S SP +++ C ++I +++ + LSS +SS Sbjct: 624 RKLTACSRKETIQKG-KIISVSPGLKQKEASCCEGNPDGSAEIPSEVDPVEQQLSSASSS 682 Query: 1167 -------GQDKLSSTSEDS-QTHLS------IDLNLPQYSPESSENGLLPTDSNNEQDNQ 1304 G+ L ST + QTH+ IDLNLP ++ + + +E +N Sbjct: 683 KGSPTIRGEGILRSTCAGAEQTHVEHQHRTLIDLNLPVLLDGETDEPFMGEVTESEHENP 742 Query: 1305 SIKPDNNSLPK-----PSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKI 1469 S +P+N S P+ PS + N R STRN + +ALEA+A G+LT +K++ + Sbjct: 743 SRQPNNASQPEATCCMPSSELQPNMNARRQSTRNRPPTTKALEALACGFLTTTQKRKRRD 802 Query: 1470 TSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGN 1604 + E+ SR SRRA GG +E+ E + +GN Sbjct: 803 GFARENSLSRASRRAHGGAKVSENYGDGMVDFKAEVKGNGMCNGN 847 >gb|EOY06481.1| Uncharacterized protein TCM_021187 isoform 1 [Theobroma cacao] Length = 888 Score = 358 bits (918), Expect = e-109 Identities = 240/585 (41%), Positives = 318/585 (54%), Gaps = 64/585 (10%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + VSK F + K+ +YV +LKA VG N LV A+GIG GK DLT + LEP + Sbjct: 290 EECQNTLLEVSKAFGEGKIMLEEYVFTLKATVGLNSLVSAVGIGKGKEDLTGITLEPMKA 349 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 QV PVR EIP GKACS LT EII FL+G YRLSKARSNDLFWEA+WPRLLARGWHSEQ Sbjct: 350 NQVAPVRPEIPVGKACSALTPLEIINFLTGSYRLSKARSNDLFWEAVWPRLLARGWHSEQ 409 Query: 402 PESRGY-IGPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P S+GY G KHSLVFL+PGVKKFSRRKL+KGDHYFDSV+DVL +VA Sbjct: 410 PASQGYTAGSKHSLVFLIPGVKKFSRRKLVKGDHYFDSVSDVLSRVASDPGLLELEIGAD 469 Query: 579 NGYKKEEENELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG--- 749 G +EEN + D +DLP +QR YL+PR PNR V FTVVDTSL G Sbjct: 470 KGDSSKEEN------GTESDRDDLPNRQRHCYLKPRIPNRGADVMAFTVVDTSLDDGGKF 523 Query: 750 KIRDLRTLPSE--ISNTLISLDFTEDRNQNNKDVNHDDISSYQDSRTVYP---------Y 896 K+R+LR+LP E ISN+ S + T + + D+ S ++ + P Y Sbjct: 524 KVRELRSLPIEMNISNSSDSEESTSEELIDESDLADTSCSGRVETNGLKPTEINHDREVY 583 Query: 897 PKNN--------------------KDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRC 1016 P N KD K + K K+ PS++ K N + +AP++KRC Sbjct: 584 PDGNASNNKFPVDGQASTNVPAIPKDPKTKVCNGKAMKNQPSQRIKIDNKNNLAPVTKRC 643 Query: 1017 RRLTANNTHEETSDGVNHSSTSPRSDNRQSCPC-------SDIHRDLN---ENLSSVASS 1166 R+LTA + E G S SP +++ C ++I +++ + LSS +SS Sbjct: 644 RKLTACSRKETIQKG-KIISVSPGLKQKEASCCEGNPDGSAEIPSEVDPVEQQLSSASSS 702 Query: 1167 -------GQDKLSSTSEDS-QTHLS------IDLNLPQYSPESSENGLLPTDSNNEQDNQ 1304 G+ L ST + QTH+ IDLNLP ++ + + +E +N Sbjct: 703 KGSPTIRGEGILRSTCAGAEQTHVEHQHRTLIDLNLPVLLDGETDEPFMGEVTESEHENP 762 Query: 1305 SIKPDNNSLPK-----PSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKI 1469 S +P+N S P+ PS + N R STRN + +ALEA+A G+LT +K++ + Sbjct: 763 SRQPNNASQPEATCCMPSSELQPNMNARRQSTRNRPPTTKALEALACGFLTTTQKRKRRD 822 Query: 1470 TSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGN 1604 + E+ SR SRRA GG +E+ E + +GN Sbjct: 823 GFARENSLSRASRRAHGGAKVSENYGDGMVDFKAEVKGNGMCNGN 867 >ref|XP_016542314.1| PREDICTED: uncharacterized protein LOC107842798 [Capsicum annuum] Length = 863 Score = 353 bits (907), Expect = e-108 Identities = 232/588 (39%), Positives = 319/588 (54%), Gaps = 63/588 (10%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 V+K F + K+ +YV SL A++G ++L+EA+GIG GK+DLT M LEPSR VRSE Sbjct: 274 VNKAFGEGKILLEEYVFSLMAMIGVSMLIEAVGIGKGKYDLTCMTLEPSRSNYA--VRSE 331 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 +P GKAC+ LTT E+IKFL+GDYRLSKARSND+FWEA+WPRLLARGWHS +P++ Y Sbjct: 332 VPVGKACATLTTDEVIKFLTGDYRLSKARSNDIFWEAVWPRLLARGWHSLKPKNLNYAAN 391 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 PK+ VFL+P VKKFS RKL+KG+HYFDSVTDVLGKVA + E Sbjct: 392 PKNPYVFLLPDVKKFS-RKLVKGNHYFDSVTDVLGKVASDPKL----------LELNAEG 440 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG---KIRDLRTLP 776 E T E K D DLPT+QR YLQPRTPNR V KFTVVDTSLS G K+R+LR+LP Sbjct: 441 ECTDEIKLEHD--DLPTRQRPCYLQPRTPNRHMDVMKFTVVDTSLSDGKPYKLRELRSLP 498 Query: 777 SEISNTLISLDFTEDRNQ--------------NNKDVNHDD------------------- 857 +ISN L S + E+ + N + NH++ Sbjct: 499 VDISNKLSSGNKAEESEEESTDESDSVGTSVVNEAEENHNNSLKIISNGEMHSDEKGYKI 558 Query: 858 -ISSYQDSRTVYPY--PKNNKDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRCRRLT 1028 +SS + + + +P K K++ K RKV KS ++ K+ N D++API+KR RRLT Sbjct: 559 SVSSQKFASSSFPVIDSKKTKNICKDKKPRKVVKSHSFKRLKENNEDFVAPIAKRRRRLT 618 Query: 1029 ANNTHEETSDG---------VNHSSTS--------PRSDNRQSCPCSDIHRDLNENLSSV 1157 A + + + H+S+S P + + S+ + + Sbjct: 619 ACSRGSSMVNSLMVPGMEQEMRHTSSSNDLSPNNIPIASSEDKVSSSNSSKSSPSQSAEC 678 Query: 1158 ASSGQDKLSSTSEDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQSIKPDNNSLPK 1337 AS+ L + +T IDLN PQ P+S L+P ++ N D + K Sbjct: 679 ASADGHGLKLPDAERKTRTMIDLNEPQVPPDSEFEILMPALMEDKSGNMKSPDDVSGELK 738 Query: 1338 PSVNVHFI------TNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKITSSHEDLTSR 1499 + + N RHSTRN + R LEA+A+G+LTVN +Q+ K S TSR Sbjct: 739 TLTHSASMEQQQPSLNSRRHSTRNRPPTTRVLEALANGFLTVNTRQKSKEGGSKRKSTSR 798 Query: 1500 PSRRARGGVHPNESPSSYTASQIEETENGASNSGNCHIASEVGVFPEA 1643 SR+ G + +S SQ+EE ++ S G+ ++ ++ PE+ Sbjct: 799 SSRQTPDGTRVTDFSNSAVVSQMEEDKDAVSTGGDSNMFGKIQHPPES 846 >ref|XP_021291361.1| uncharacterized protein LOC110421952 isoform X2 [Herrania umbratica] Length = 875 Score = 353 bits (907), Expect = e-107 Identities = 235/560 (41%), Positives = 311/560 (55%), Gaps = 69/560 (12%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F + K+ +YV +LKA VG N LV A+GIG GK DLT M LEP + QV PVR E Sbjct: 281 VSKAFGEGKILLEEYVFTLKATVGLNALVSAVGIGKGKEDLTGMNLEPMKANQVAPVRPE 340 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 IP GKACS LT EII FL+G+YRLSKARSNDLFWEA+WPRLLARGWHSEQP S+GY G Sbjct: 341 IPVGKACSALTPLEIINFLTGNYRLSKARSNDLFWEAVWPRLLARGWHSEQPASQGYTAG 400 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 KHSLVFL+PGVKKFSRRKL+KGDHYFDS++DVL +VA G +EEN Sbjct: 401 SKHSLVFLIPGVKKFSRRKLVKGDHYFDSISDVLSRVASDPGLLELEIGADKGDSSKEEN 460 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG---KIRDLRTLP 776 + D DLP +QR YL+PR PNR V FTVVDTSL G K+R+LR+LP Sbjct: 461 ------GAESDREDLPNRQRHCYLKPRIPNRGADVMTFTVVDTSLDDGGKFKVRELRSLP 514 Query: 777 SEISNTLISLDFTEDRNQ----------------------NNKDVNHD-------DISSY 869 E++N D E ++ ++NHD + S+ Sbjct: 515 IEMNNCNSLGDSEESTSEELIDESDLADTSCSGRVETNGLKPSEINHDREVYPDGNASNN 574 Query: 870 ------QDSRTVYPYPKNNK-DVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRCRRLT 1028 Q S +V PK+ K V + + RK K+ P ++ K N + +AP++KR R+LT Sbjct: 575 KFSVDGQPSTSVPAIPKDPKTKVCNGMQPRKAMKNQPHQRIKNDNKNDLAPVTKRRRKLT 634 Query: 1029 ANNTHEETSDGVNHSSTSPRSDNRQSCPC-------SDIHRDLN---ENLSSVASS---- 1166 A N E T G S SP +++ C ++I +++ + LSS +SS Sbjct: 635 ACNRKETTQKG-KIISVSPGLKQKEASCCEGNPDGSAEIPSEVDPVEQQLSSASSSKGSP 693 Query: 1167 ---GQDKLSS-------TSEDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQSIKP 1316 G+ L S T E+ Q IDLNLP ++ + + E +N S +P Sbjct: 694 TIRGEGILRSTCAGAEQTHEELQHRTLIDLNLPVLLDGETDEPFMGEVTEREHENPSSQP 753 Query: 1317 DNNSLPK-----PSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKITSSH 1481 +N S P+ PS + N R STRN + +ALEA+A G+L+ +K++ + + Sbjct: 754 NNASQPEATSCMPSSELQPNMNARRQSTRNRPPTTKALEALACGFLSTTQKRKRRDGFAR 813 Query: 1482 EDLTSRPSRRARGGVHPNES 1541 E+ SRPSRRA GG +E+ Sbjct: 814 ENSLSRPSRRAHGGAKFSEN 833 >ref|XP_021291360.1| uncharacterized protein LOC110421952 isoform X1 [Herrania umbratica] Length = 877 Score = 353 bits (907), Expect = e-107 Identities = 235/560 (41%), Positives = 311/560 (55%), Gaps = 69/560 (12%) Frame = +3 Query: 69 VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRLTQVIPVRSE 248 VSK F + K+ +YV +LKA VG N LV A+GIG GK DLT M LEP + QV PVR E Sbjct: 283 VSKAFGEGKILLEEYVFTLKATVGLNALVSAVGIGKGKEDLTGMNLEPMKANQVAPVRPE 342 Query: 249 IPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQPESRGY-IG 425 IP GKACS LT EII FL+G+YRLSKARSNDLFWEA+WPRLLARGWHSEQP S+GY G Sbjct: 343 IPVGKACSALTPLEIINFLTGNYRLSKARSNDLFWEAVWPRLLARGWHSEQPASQGYTAG 402 Query: 426 PKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXANGYKKEEEN 605 KHSLVFL+PGVKKFSRRKL+KGDHYFDS++DVL +VA G +EEN Sbjct: 403 SKHSLVFLIPGVKKFSRRKLVKGDHYFDSISDVLSRVASDPGLLELEIGADKGDSSKEEN 462 Query: 606 ELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG---KIRDLRTLP 776 + D DLP +QR YL+PR PNR V FTVVDTSL G K+R+LR+LP Sbjct: 463 ------GAESDREDLPNRQRHCYLKPRIPNRGADVMTFTVVDTSLDDGGKFKVRELRSLP 516 Query: 777 SEISNTLISLDFTEDRNQ----------------------NNKDVNHD-------DISSY 869 E++N D E ++ ++NHD + S+ Sbjct: 517 IEMNNCNSLGDSEESTSEELIDESDLADTSCSGRVETNGLKPSEINHDREVYPDGNASNN 576 Query: 870 ------QDSRTVYPYPKNNK-DVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRCRRLT 1028 Q S +V PK+ K V + + RK K+ P ++ K N + +AP++KR R+LT Sbjct: 577 KFSVDGQPSTSVPAIPKDPKTKVCNGMQPRKAMKNQPHQRIKNDNKNDLAPVTKRRRKLT 636 Query: 1029 ANNTHEETSDGVNHSSTSPRSDNRQSCPC-------SDIHRDLN---ENLSSVASS---- 1166 A N E T G S SP +++ C ++I +++ + LSS +SS Sbjct: 637 ACNRKETTQKG-KIISVSPGLKQKEASCCEGNPDGSAEIPSEVDPVEQQLSSASSSKGSP 695 Query: 1167 ---GQDKLSS-------TSEDSQTHLSIDLNLPQYSPESSENGLLPTDSNNEQDNQSIKP 1316 G+ L S T E+ Q IDLNLP ++ + + E +N S +P Sbjct: 696 TIRGEGILRSTCAGAEQTHEELQHRTLIDLNLPVLLDGETDEPFMGEVTEREHENPSSQP 755 Query: 1317 DNNSLPK-----PSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKITSSH 1481 +N S P+ PS + N R STRN + +ALEA+A G+L+ +K++ + + Sbjct: 756 NNASQPEATSCMPSSELQPNMNARRQSTRNRPPTTKALEALACGFLSTTQKRKRRDGFAR 815 Query: 1482 EDLTSRPSRRARGGVHPNES 1541 E+ SRPSRRA GG +E+ Sbjct: 816 ENSLSRPSRRAHGGAKFSEN 835 >ref|XP_007035557.2| PREDICTED: uncharacterized protein LOC18603483 isoform X2 [Theobroma cacao] Length = 866 Score = 353 bits (906), Expect = e-107 Identities = 239/585 (40%), Positives = 317/585 (54%), Gaps = 64/585 (10%) Frame = +3 Query: 42 EFCFLVIW*VSKKFADEKMSFVDYVSSLKAIVGTNILVEAIGIGTGKHDLTRMALEPSRL 221 E C + VSK F + K+ +YV +LKA VG N LV A+GIG GK DLT + LEP + Sbjct: 268 EECQNTLLEVSKAFGEGKILLEEYVFTLKATVGLNSLVSAVGIGKGKEDLTGITLEPMKA 327 Query: 222 TQVIPVRSEIPTGKACSFLTTSEIIKFLSGDYRLSKARSNDLFWEAIWPRLLARGWHSEQ 401 QV PVR EIP GKACS LT EII FL+G YRLSKARSNDLFWEA+WPRLLARGWHSEQ Sbjct: 328 NQVAPVRPEIPVGKACSALTPLEIINFLTGSYRLSKARSNDLFWEAVWPRLLARGWHSEQ 387 Query: 402 PESRGY-IGPKHSLVFLMPGVKKFSRRKLLKGDHYFDSVTDVLGKVAKXXXXXXXXXXXA 578 P S+GY G KHSLVFL+PGVKKFSRRKL+KGDHYFDSV+DVL +VA Sbjct: 388 PASQGYTAGSKHSLVFLIPGVKKFSRRKLVKGDHYFDSVSDVLSRVASDPGLLELEIGAD 447 Query: 579 NGYKKEEENELTGEKKSNEDGNDLPTKQRQFYLQPRTPNRSTRVAKFTVVDTSLSGG--- 749 G +EEN + D +DLP +QR YL+PR PNR V FTVVDTSL G Sbjct: 448 KGDSSKEEN------GTESDRDDLPNRQRHCYLKPRIPNRGADVMTFTVVDTSLDDGGKF 501 Query: 750 KIRDLRTLPSE--ISNTLISLDFTEDRNQNNKDVNHDDISSYQDSRTVYP---------Y 896 K+R+LR+LP E ISN+ S + T + + D+ S ++ + P Y Sbjct: 502 KVRELRSLPIEMNISNSSDSEESTSEELIDESDLADTSCSGRVETNGLKPTEINHDREVY 561 Query: 897 PKNN--------------------KDVSDKTKSRKVSKSIPSRKQKQRNVDYIAPISKRC 1016 P N KD K + K K+ PS++ K N + +AP++KR Sbjct: 562 PDGNASNNKFPVDGQASTNVPAIPKDPKTKVCNGKAMKNQPSQRIKIDNKNNLAPVTKRR 621 Query: 1017 RRLTANNTHEETSDGVNHSSTSPRSDNRQSCPC-------SDIHRDLN---ENLSSVASS 1166 R+LTA + E G S SP +++ C ++I +++ + LSS +SS Sbjct: 622 RKLTACSRKETIQKG-KIISVSPGLKQKEASCCEGNPDGSAEIPSEVDPVEQQLSSASSS 680 Query: 1167 -------GQDKLSSTSEDS-QTHLS------IDLNLPQYSPESSENGLLPTDSNNEQDNQ 1304 G+ L ST + QTH+ IDLNLP ++ + + +E +N Sbjct: 681 KGSPTIRGEGILRSTCAGAEQTHVEHQHRTLIDLNLPVLLDGETDEPFMGEVTESEHENP 740 Query: 1305 SIKPDNNSLPK-----PSVNVHFITNPPRHSTRNPHLSIRALEAIADGYLTVNRKQRGKI 1469 S +P+N S P+ PS + N R STRN + +ALEA+A G+LT +K++ + Sbjct: 741 SRQPNNASQPEATCCMPSSELQPNMNARRQSTRNRPPTTKALEALACGFLTTTQKRKRRD 800 Query: 1470 TSSHEDLTSRPSRRARGGVHPNESPSSYTASQIEETENGASNSGN 1604 + E+ SR SRRA GG +E+ E + +GN Sbjct: 801 GFARENYLSRASRRAHGGAKVSENYGDGMVDFKAEVKGNGMCNGN 845