BLASTX nr result

ID: Chrysanthemum21_contig00019028 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00019028
         (2423 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_023753272.1| nuclear intron maturase 4, mitochondrial [La...  1116   0.0  
ref|XP_022022385.1| nuclear intron maturase 4, mitochondrial [He...  1113   0.0  
gb|PLY99240.1| hypothetical protein LSAT_6X112880 [Lactuca sativa]   1103   0.0  
ref|XP_018827568.1| PREDICTED: uncharacterized protein LOC108996...   976   0.0  
dbj|GAV86201.1| RVT_1 domain-containing protein/Intron_maturas2 ...   942   0.0  
ref|XP_021830183.1| nuclear intron maturase 4, mitochondrial iso...   941   0.0  
ref|XP_021830184.1| nuclear intron maturase 4, mitochondrial iso...   941   0.0  
ref|XP_020413650.1| uncharacterized protein LOC109947584 isoform...   936   0.0  
ref|XP_020413649.1| uncharacterized protein LOC109947584 isoform...   936   0.0  
gb|ONI20791.1| hypothetical protein PRUPE_2G034000 [Prunus persica]   936   0.0  
gb|ONI20792.1| hypothetical protein PRUPE_2G034000 [Prunus persica]   936   0.0  
ref|XP_019199271.1| PREDICTED: uncharacterized protein LOC109192...   936   0.0  
ref|XP_019199270.1| PREDICTED: uncharacterized protein LOC109192...   936   0.0  
ref|XP_008348277.1| PREDICTED: uncharacterized protein LOC103411...   934   0.0  
ref|XP_015891101.1| PREDICTED: uncharacterized protein LOC107425...   932   0.0  
gb|EOX92573.1| Intron maturase isoform 1 [Theobroma cacao] >gi|5...   931   0.0  
ref|XP_024166789.1| nuclear intron maturase 4, mitochondrial [Ro...   929   0.0  
ref|XP_017980919.1| PREDICTED: uncharacterized protein LOC186118...   928   0.0  
ref|XP_021300995.1| uncharacterized protein LOC110429335 isoform...   928   0.0  
ref|XP_021300992.1| uncharacterized protein LOC110429335 isoform...   928   0.0  

>ref|XP_023753272.1| nuclear intron maturase 4, mitochondrial [Lactuca sativa]
          Length = 795

 Score = 1116 bits (2886), Expect = 0.0
 Identities = 562/770 (72%), Positives = 643/770 (83%), Gaps = 7/770 (0%)
 Frame = +1

Query: 85   LLLGKRCLCHSVDPTLKXXXXXXXXXXXXXXDKMSLASNLAFLVEESSSVNERKPMSRLE 264
            LL GKR + HSVD +LK              D MSLASNLA LVEESS VNERKP +RLE
Sbjct: 32   LLTGKRYISHSVDSSLKSNNNGNNSNI----DTMSLASNLACLVEESSLVNERKPRTRLE 87

Query: 265  LKRYLELCIKKRVKDQFIDGKFHDVFEKVIANPYTLRYAYDSIRVNSNVNLSSECDAIQF 444
            LKR+LE  IKK VKDQF DGKFH++ EKVIANPYTL+ AYD IRVNSN++L SE D I F
Sbjct: 88   LKRFLESRIKKSVKDQFKDGKFHNLIEKVIANPYTLQDAYDIIRVNSNISLLSESDDINF 147

Query: 445  DSLAEELSSGSFDIKANIYSISTRGANKET----LVLPNLKLNIIQEAIRIALEVVYKPH 612
            DSLA+ELSSG+FDI +N+YSIST+GA KE     LVLPNLKL IIQEAIRIALEVVYKPH
Sbjct: 148  DSLAQELSSGNFDINSNVYSISTKGAKKEKQKEKLVLPNLKLTIIQEAIRIALEVVYKPH 207

Query: 613  YSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVLNRKMDTLILRKLISTMESKIEDPR 792
            YSKISHGCRSGRG SSA KYI K++ N+NWWFT+++N+K+D   L KLISTME+KI+DP+
Sbjct: 208  YSKISHGCRSGRGHSSALKYIRKQVSNSNWWFTVIVNKKVDDSTLSKLISTMETKIQDPK 267

Query: 793  LYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPIFMNIYLDNFDSEILNLSMKYEVLD 972
            LY++IH+MFD GV+NMEFGGFAKGHGLPQEG+LSP+ MNIYLD FD EILNLSMKYE LD
Sbjct: 268  LYSLIHSMFDVGVLNMEFGGFAKGHGLPQEGLLSPVLMNIYLDLFDHEILNLSMKYEALD 327

Query: 973  GEPDGSQSK--LRSWFKRQMGTIKKNEEVMSGIRVHCCRSMDEILVAVKGSKEVSLDFKS 1146
             + DGS+SK  LR WF+RQM   ++NE   SG+R+HCCR MDEIL+ +KGSKEVSL  KS
Sbjct: 328  SQHDGSKSKSKLRGWFRRQMS--QQNEGNTSGVRIHCCRLMDEILIVIKGSKEVSLTLKS 385

Query: 1147 EIETYMRESLYLDFDKTTDILPCLDPRGVKFIGNTVKKIVKDSSTVRAVHKLKEKVELYT 1326
            EIE ++RE L+L+ D  +DI PC DPRGVKF GN VKK ++++  VRAVHKLKEKVEL+ 
Sbjct: 386  EIENFIREFLHLEVDNKSDIFPCNDPRGVKFSGNIVKKSMRENPAVRAVHKLKEKVELFA 445

Query: 1327 LQKQEAWDEGTVRIGKKILGHGFKKVKESEIKQLADSNSVLSQVSRFRKSGMETDHWYKV 1506
            LQKQEAWDE  +RIGKK LGHGFKKVKESEIK LAD  SVLSQVSRFRK GMETDHWYKV
Sbjct: 446  LQKQEAWDECMIRIGKKCLGHGFKKVKESEIKHLADCTSVLSQVSRFRKPGMETDHWYKV 505

Query: 1507 LVKIWMQEAVVKYTETEESILSKFIAEKSLPQELKDSFYTFQNHAKNYVSSETKSTLDLL 1686
            L+KI MQ+   KYT+TEESILSK I E +LPQ+LKDSFYTFQNH KNYVSSET S L LL
Sbjct: 506  LLKIRMQDMDSKYTDTEESILSKLITENALPQDLKDSFYTFQNHVKNYVSSETSSVLTLL 565

Query: 1687 LDSGASPESVSVTEVVAPIKAIRMCLERYGITNSEGYPRACRMLVLLDHDHIIDWFSGIV 1866
             +S    ESVS+TEV+APIKAIRMCL+RYGITNSEG PRACRMLVLLDHDHIIDWFSG+V
Sbjct: 566  PES--ESESVSITEVLAPIKAIRMCLQRYGITNSEGIPRACRMLVLLDHDHIIDWFSGLV 623

Query: 1867 NRWLRWYRSCDNFTEVKLIITEQVRKSCIRTLATKYRLHELEIERKFDSDLSRIPLTEDF 2046
            +RW++WYR CDNF EVK II+ Q+RKSCIRTLATKYRLHE EIE+KFDSDLS IP TE+ 
Sbjct: 624  SRWVKWYRLCDNFNEVKHIISIQIRKSCIRTLATKYRLHETEIEKKFDSDLSGIPSTEEI 683

Query: 2047 ESEVIS-ETLVSSDDGLLYGIPHSGLCLLSLARMVSESRPCTCFVLGCRDAAPCVYTVHV 2223
            E+E +  +  +  D+GL+YGIP+SGLCL+SLAR+VSESRPC CFVLGCR  APCVYT+HV
Sbjct: 684  ENERLEYDERLECDEGLMYGIPYSGLCLVSLARIVSESRPCGCFVLGCRVDAPCVYTIHV 743

Query: 2224 MQRQKFPVWKTGFSSCIHPSINRRRIGLCNQHLKDLFLGHISLQSISFGA 2373
            MQRQKFPVWKTGFS+CIHPSIN RRIGLC QHLKDLF+G ISLQS+SFGA
Sbjct: 744  MQRQKFPVWKTGFSTCIHPSINGRRIGLCKQHLKDLFIGRISLQSVSFGA 793


>ref|XP_022022385.1| nuclear intron maturase 4, mitochondrial [Helianthus annuus]
 ref|XP_022022386.1| nuclear intron maturase 4, mitochondrial [Helianthus annuus]
 gb|OTF85144.1| putative intron maturase, type II family protein [Helianthus annuus]
          Length = 792

 Score = 1113 bits (2880), Expect = 0.0
 Identities = 564/793 (71%), Positives = 646/793 (81%), Gaps = 16/793 (2%)
 Frame = +1

Query: 37   VSRRQPIILDLHFTYYLLL---------GKRCLCHSVDPTLKXXXXXXXXXXXXXXDKMS 189
            + RR+ I    H  ++ L+         GKR + HSVD + K              DKMS
Sbjct: 2    ICRRKLIDRTTHLQWFHLISVNLSAVDTGKRYIGHSVDSSQKSINNGNITTI----DKMS 57

Query: 190  LASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIANPYT 369
            LASNLA LV+ESS V+ERKPM+RLELKRYLEL IKKRVKDQ+ +GKFHD+ EKVIANPYT
Sbjct: 58   LASNLACLVDESSPVDERKPMNRLELKRYLELRIKKRVKDQYKNGKFHDLIEKVIANPYT 117

Query: 370  LRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLVLPN 549
            L+ AYDS+RVNSNVNLSSECD I F SLAEEL+SG FDI +N+YSIS+R  N E LVLPN
Sbjct: 118  LQDAYDSMRVNSNVNLSSECDNINFTSLAEELASGKFDISSNVYSISSRRTNNEKLVLPN 177

Query: 550  LKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVLNRK 729
            LK N+IQEAIRIALEVVY+PH+SKISHGCR+GRGQSSA KYICKEI N +WWF L LN+K
Sbjct: 178  LKTNVIQEAIRIALEVVYRPHFSKISHGCRTGRGQSSALKYICKEIANPDWWFPLTLNKK 237

Query: 730  MDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPIFMN 909
            +D  IL KLISTMESKIEDP+LY+IIH+MF+AGV+N+EFG FAKGHGLPQEG+LSPI +N
Sbjct: 238  IDMRILSKLISTMESKIEDPKLYSIIHSMFNAGVLNIEFGAFAKGHGLPQEGVLSPILIN 297

Query: 910  IYLDNFDSEILNLSMKYEVLDGEPDGS--QSKLRSWFKRQMGTIKKNEEVMSGIRVHCCR 1083
            IYLD FD EILNLSMKYEVLD   DG   +SKLRSWF+RQMG  + N    SG+RVHCCR
Sbjct: 298  IYLDLFDHEILNLSMKYEVLDTSDDGQRDESKLRSWFRRQMGPTR-NGGYTSGVRVHCCR 356

Query: 1084 SMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKFIGNTVKKI 1263
             MDEILV V GSKEVS+  KSEIETY++ SLYLD D  T+IL C DPRGVKF G  +K+ 
Sbjct: 357  LMDEILVLVNGSKEVSMALKSEIETYVQGSLYLDLDSKTNILRCNDPRGVKFTGKIIKRT 416

Query: 1264 VKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEIKQLADSNS 1443
            V +S  VRAVHKLKEKVEL+ LQK+EAWDEG VRIGKK+LGHGFKKVKESEIK LAD NS
Sbjct: 417  VIESPAVRAVHKLKEKVELFALQKEEAWDEGMVRIGKKVLGHGFKKVKESEIKHLADYNS 476

Query: 1444 VLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLPQELKDSFY 1623
            VLS+VS FRK GM TDHWYKVL+K+WMQ+  +K  ETEE ILSK + EK+LPQ+LKDSFY
Sbjct: 477  VLSRVSCFRKPGMLTDHWYKVLLKVWMQDITLKKHETEEHILSKLLTEKALPQDLKDSFY 536

Query: 1624 TFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGITNSEGYPR 1803
             FQNH K YVSSE KST+ LL +S  S ESV +TEV +P+KAIRMCL+RYGITN EGYPR
Sbjct: 537  AFQNHVKRYVSSEAKSTVTLLPESTVSSESVYITEVSSPLKAIRMCLQRYGITNREGYPR 596

Query: 1804 ACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRTLATKYRLH 1983
            AC  LVLLDHDHIIDWF GIV RWLRWY  CDNF E+KL+ITE+VRKSCIRTLATKYRLH
Sbjct: 597  ACHTLVLLDHDHIIDWFHGIVTRWLRWYHLCDNFGEIKLLITEKVRKSCIRTLATKYRLH 656

Query: 1984 ELEIERKFDSDLSRIPLTEDFESEVISETLVSS-----DDGLLYGIPHSGLCLLSLARMV 2148
            E EIERKFDSDLS +P ++D+E + ++ T+V S     D+GLLYGIP+SGLCLLSLAR V
Sbjct: 657  EFEIERKFDSDLSVLPSSDDYEIDSVN-TIVESRDFYYDEGLLYGIPYSGLCLLSLARTV 715

Query: 2149 SESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGLCNQHLKD 2328
            S+SRPCTCFV+GCR AAPCVYT+HVMQRQKFPVW TGFSSCIH S++ +RIGLC QHLKD
Sbjct: 716  SKSRPCTCFVIGCRAAAPCVYTIHVMQRQKFPVWNTGFSSCIHLSMHGKRIGLCKQHLKD 775

Query: 2329 LFLGHISLQSISF 2367
            LF+GHISLQSI F
Sbjct: 776  LFIGHISLQSICF 788


>gb|PLY99240.1| hypothetical protein LSAT_6X112880 [Lactuca sativa]
          Length = 735

 Score = 1103 bits (2853), Expect = 0.0
 Identities = 550/737 (74%), Positives = 629/737 (85%), Gaps = 7/737 (0%)
 Frame = +1

Query: 184  MSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIANP 363
            MSLASNLA LVEESS VNERKP +RLELKR+LE  IKK VKDQF DGKFH++ EKVIANP
Sbjct: 1    MSLASNLACLVEESSLVNERKPRTRLELKRFLESRIKKSVKDQFKDGKFHNLIEKVIANP 60

Query: 364  YTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKET--- 534
            YTL+ AYD IRVNSN++L SE D I FDSLA+ELSSG+FDI +N+YSIST+GA KE    
Sbjct: 61   YTLQDAYDIIRVNSNISLLSESDDINFDSLAQELSSGNFDINSNVYSISTKGAKKEKQKE 120

Query: 535  -LVLPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFT 711
             LVLPNLKL IIQEAIRIALEVVYKPHYSKISHGCRSGRG SSA KYI K++ N+NWWFT
Sbjct: 121  KLVLPNLKLTIIQEAIRIALEVVYKPHYSKISHGCRSGRGHSSALKYIRKQVSNSNWWFT 180

Query: 712  LVLNRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGIL 891
            +++N+K+D   L KLISTME+KI+DP+LY++IH+MFD GV+NMEFGGFAKGHGLPQEG+L
Sbjct: 181  VIVNKKVDDSTLSKLISTMETKIQDPKLYSLIHSMFDVGVLNMEFGGFAKGHGLPQEGLL 240

Query: 892  SPIFMNIYLDNFDSEILNLSMKYEVLDGEPDGSQSK--LRSWFKRQMGTIKKNEEVMSGI 1065
            SP+ MNIYLD FD EILNLSMKYE LD + DGS+SK  LR WF+RQM   ++NE   SG+
Sbjct: 241  SPVLMNIYLDLFDHEILNLSMKYEALDSQHDGSKSKSKLRGWFRRQMS--QQNEGNTSGV 298

Query: 1066 RVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKFIG 1245
            R+HCCR MDEIL+ +KGSKEVSL  KSEIE ++RE L+L+ D  +DI PC DPRGVKF G
Sbjct: 299  RIHCCRLMDEILIVIKGSKEVSLTLKSEIENFIREFLHLEVDNKSDIFPCNDPRGVKFSG 358

Query: 1246 NTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEIKQ 1425
            N VKK ++++  VRAVHKLKEKVEL+ LQKQEAWDE  +RIGKK LGHGFKKVKESEIK 
Sbjct: 359  NIVKKSMRENPAVRAVHKLKEKVELFALQKQEAWDECMIRIGKKCLGHGFKKVKESEIKH 418

Query: 1426 LADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLPQE 1605
            LAD  SVLSQVSRFRK GMETDHWYKVL+KI MQ+   KYT+TEESILSK I E +LPQ+
Sbjct: 419  LADCTSVLSQVSRFRKPGMETDHWYKVLLKIRMQDMDSKYTDTEESILSKLITENALPQD 478

Query: 1606 LKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGITN 1785
            LKDSFYTFQNH KNYVSSET S L LL +S    ESVS+TEV+APIKAIRMCL+RYGITN
Sbjct: 479  LKDSFYTFQNHVKNYVSSETSSVLTLLPES--ESESVSITEVLAPIKAIRMCLQRYGITN 536

Query: 1786 SEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRTLA 1965
            SEG PRACRMLVLLDHDHIIDWFSG+V+RW++WYR CDNF EVK II+ Q+RKSCIRTLA
Sbjct: 537  SEGIPRACRMLVLLDHDHIIDWFSGLVSRWVKWYRLCDNFNEVKHIISIQIRKSCIRTLA 596

Query: 1966 TKYRLHELEIERKFDSDLSRIPLTEDFESEVIS-ETLVSSDDGLLYGIPHSGLCLLSLAR 2142
            TKYRLHE EIE+KFDSDLS IP TE+ E+E +  +  +  D+GL+YGIP+SGLCL+SLAR
Sbjct: 597  TKYRLHETEIEKKFDSDLSGIPSTEEIENERLEYDERLECDEGLMYGIPYSGLCLVSLAR 656

Query: 2143 MVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGLCNQHL 2322
            +VSESRPC CFVLGCR  APCVYT+HVMQRQKFPVWKTGFS+CIHPSIN RRIGLC QHL
Sbjct: 657  IVSESRPCGCFVLGCRVDAPCVYTIHVMQRQKFPVWKTGFSTCIHPSINGRRIGLCKQHL 716

Query: 2323 KDLFLGHISLQSISFGA 2373
            KDLF+G ISLQS+SFGA
Sbjct: 717  KDLFIGRISLQSVSFGA 733


>ref|XP_018827568.1| PREDICTED: uncharacterized protein LOC108996256 [Juglans regia]
 ref|XP_018827569.1| PREDICTED: uncharacterized protein LOC108996256 [Juglans regia]
 ref|XP_018827570.1| PREDICTED: uncharacterized protein LOC108996256 [Juglans regia]
 ref|XP_018827571.1| PREDICTED: uncharacterized protein LOC108996256 [Juglans regia]
 ref|XP_018827573.1| PREDICTED: uncharacterized protein LOC108996256 [Juglans regia]
 ref|XP_018827574.1| PREDICTED: uncharacterized protein LOC108996256 [Juglans regia]
          Length = 807

 Score =  976 bits (2523), Expect = 0.0
 Identities = 482/741 (65%), Positives = 586/741 (79%), Gaps = 10/741 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            KM+LA NLA +VEESS V+ERKP SR+ELKRY EL IKKRVK+Q++DGKF D+  KVIAN
Sbjct: 65   KMTLAMNLACVVEESSCVDERKPKSRMELKRYCELRIKKRVKEQYMDGKFQDLMTKVIAN 124

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TL+ AY+ IR+NSNV++S   D   F S+AEEL SGSFD+K N +SIST+GANKETLV
Sbjct: 125  PDTLQDAYNCIRLNSNVDISINNDRFDFSSMAEELCSGSFDVKVNTFSISTKGANKETLV 184

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LP L+L I+QEAIRI LEV+YKP++SKISHGCRSGRG SSA KYI KEI N +WWFT+ +
Sbjct: 185  LPTLRLKIVQEAIRIILEVIYKPYFSKISHGCRSGRGHSSALKYISKEISNPDWWFTVHI 244

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  +L KLIS ME KIEDP LYAIIH+MFDA V+N+EFGGF KGHGLPQEG+LS I
Sbjct: 245  NKKLDACVLAKLISIMEGKIEDPSLYAIIHSMFDAQVLNLEFGGFPKGHGLPQEGVLSAI 304

Query: 901  FMNIYLDNFDSEILNLSMKYEVLD----GEPDGSQSKLRSWFKRQM--GTIKKNEEVMSG 1062
             +NIYLD FD E   LSMKYE LD       DGS S LRSWF+RQ+    +    E   G
Sbjct: 305  LINIYLDLFDREFYRLSMKYEALDPSIHSNRDGSYSMLRSWFRRQLKDNDLNCQSENNIG 364

Query: 1063 IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKFI 1242
            IRVH CR MDEI  A+ GS+EV+L FKSEI  Y+R SL+LD D  T++LPC  P+ ++F+
Sbjct: 365  IRVHSCRFMDEIFFAISGSEEVALSFKSEILNYLRNSLHLDIDNQTELLPCEGPQEIRFL 424

Query: 1243 GNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEIK 1422
            G  V++ +K+S  V+AVHKLKEKVEL+ LQKQEAWD GT+RIGKK LGHG KKVKESEIK
Sbjct: 425  GYLVRRSIKESPAVKAVHKLKEKVELFALQKQEAWDAGTIRIGKKWLGHGLKKVKESEIK 484

Query: 1423 QLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLPQ 1602
             LADSNSVL Q+S  RK+GMETDHWYK L+KIWMQ+A  K  ++EE ILSK++AE SLPQ
Sbjct: 485  HLADSNSVLGQISHLRKAGMETDHWYKHLLKIWMQDAKAKAAKSEEIILSKYVAEPSLPQ 544

Query: 1603 ELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGIT 1782
            ELKDSFY FQ  A+ YVS+ET STL L+ +  +S +S + TE++AP+ AI+  L RYG+ 
Sbjct: 545  ELKDSFYEFQRCAEEYVSAETASTLALMPNYSSSCDSETTTEIIAPVNAIKKRLLRYGLA 604

Query: 1783 NSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRTL 1962
             ++GYPR   +L+L D+  IIDWFSG+V RWLRW+  CDN  EVKL+I++Q+RKSCIRTL
Sbjct: 605  TNDGYPRTTTLLILQDNIQIIDWFSGVVRRWLRWWSECDNVNEVKLLISDQLRKSCIRTL 664

Query: 1963 ATKYRLHELEIERKFDSDLSRIPLTEDFESEVISE----TLVSSDDGLLYGIPHSGLCLL 2130
            A KYR+HE EIE++FDS+LSRIP T++ E E+  E     +  +D+ L+YGI +SGLCLL
Sbjct: 665  AAKYRIHENEIEKRFDSELSRIPSTQEIEQEMAYEKSNNQVFDNDEALMYGISYSGLCLL 724

Query: 2131 SLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGLC 2310
            SLARMV+ESRPC CFV+GC   AP VYT+HVM+RQKFP WKTGFSSCIHPS+NRRRIGLC
Sbjct: 725  SLARMVTESRPCNCFVMGCPSPAPSVYTLHVMERQKFPGWKTGFSSCIHPSLNRRRIGLC 784

Query: 2311 NQHLKDLFLGHISLQSISFGA 2373
             QHLKDL+LG+ISLQSI FGA
Sbjct: 785  KQHLKDLYLGNISLQSIDFGA 805


>dbj|GAV86201.1| RVT_1 domain-containing protein/Intron_maturas2 domain-containing
            protein [Cephalotus follicularis]
          Length = 800

 Score =  942 bits (2435), Expect = 0.0
 Identities = 461/741 (62%), Positives = 583/741 (78%), Gaps = 10/741 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            KM+LA NLA +VEESS +++R+P SR+E+KR++ELC+KKRVK+Q+ +GKF ++ +KVIA+
Sbjct: 58   KMTLAENLASVVEESSGLDKRRPNSRMEMKRFIELCVKKRVKEQYTNGKFQNLMKKVIAH 117

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TL  AY+ IR+NSNVN++S  +++ F S+AEEL SGSFD+ AN +SIST+GA KE LV
Sbjct: 118  PQTLEDAYNCIRLNSNVNIASNDESVSFKSMAEELWSGSFDVNANTFSISTKGARKEVLV 177

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+KL I+QEAIRI +EVVYKPH+SKISHGCRSGRG S+A +YI KEI   +WWFTL+L
Sbjct: 178  LPNMKLKIVQEAIRIVMEVVYKPHFSKISHGCRSGRGHSTALRYISKEICGPDWWFTLLL 237

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            ++K+D  +L KLIS ME KIED  LYAII +MFDA V+N+EFGGF KGHGLPQEG+LSPI
Sbjct: 238  SKKLDACVLAKLISIMEDKIEDSNLYAIIQSMFDAQVLNLEFGGFPKGHGLPQEGVLSPI 297

Query: 901  FMNIYLDNFDSEILNLSMKYEVLDG----EPDGSQSKLRSWFKRQM--GTIKKNEEVMSG 1062
             +NIYLD  D E   LSMKYE L+     + DGS SKLRSWF+RQ+    +K   E  SG
Sbjct: 298  LINIYLDLSDREFYRLSMKYEALNPNFEIDRDGSHSKLRSWFRRQLKENDLKHTVEKNSG 357

Query: 1063 IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKFI 1242
             RV+CCR +DEI  AV GSK+V+L FKSEI  Y++ SL+LD D  T+ILPC  P+G++F+
Sbjct: 358  PRVYCCRFLDEIFFAVSGSKDVALGFKSEILNYLQNSLHLDVDNQTEILPCEGPQGIRFL 417

Query: 1243 GNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEIK 1422
            G  V+  V++S  +R VHKLKEKV+L+  QK EAW++GT+RIGKK L HG KKVKESEIK
Sbjct: 418  GTLVRTSVRESPAIRTVHKLKEKVKLFAFQKNEAWNDGTIRIGKKWLAHGLKKVKESEIK 477

Query: 1423 QLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLPQ 1602
             + DS+S LSQ+S FRK GM+TDHWYKVL+K+WMQ+   K  E+EE I SK++ E +LPQ
Sbjct: 478  HIVDSSSTLSQISSFRKVGMKTDHWYKVLLKVWMQDIEAKAVESEEFIFSKYVVEPALPQ 537

Query: 1603 ELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGIT 1782
            EL+DSFY FQ   + YVSSET STL L+ +  +S ESV+VTEV+API AI+  L RYG  
Sbjct: 538  ELRDSFYEFQKCTEKYVSSETASTLALMPNLSSSTESVTVTEVIAPIDAIKKRLLRYGFA 597

Query: 1783 NSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRTL 1962
             S+G+PRA  +L+L D+  IIDWFSGIV RWLRWY  C+NF+E+KLII + VRKSCIRTL
Sbjct: 598  TSQGFPRAVPLLILQDNIQIIDWFSGIVRRWLRWYSKCNNFSEIKLIICDLVRKSCIRTL 657

Query: 1963 ATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLCLL 2130
            A KYR+HE EIE++FD +L  IP T+D E E++ E L S    +D+ L+YGI +SGLC L
Sbjct: 658  AAKYRVHESEIEKRFDLELVMIPSTQDIEQELVYEILDSQAFDNDESLMYGIIYSGLCSL 717

Query: 2131 SLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGLC 2310
              AR+VS+SRPCTCFV+GC   AP VYT+HVM+RQKFP WKTGFSS IHPS+NRRR+GLC
Sbjct: 718  YQARIVSQSRPCTCFVMGCSAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRLGLC 777

Query: 2311 NQHLKDLFLGHISLQSISFGA 2373
             QH+KDL+LG+ISLQSI+FGA
Sbjct: 778  KQHVKDLYLGNISLQSINFGA 798


>ref|XP_021830183.1| nuclear intron maturase 4, mitochondrial isoform X1 [Prunus avium]
          Length = 879

 Score =  941 bits (2433), Expect = 0.0
 Identities = 467/745 (62%), Positives = 578/745 (77%), Gaps = 15/745 (2%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +M LA NLA LVEESS ++ER+P SR+ELKR LEL IKKRVK+Q+++GKF ++  KVI+N
Sbjct: 138  EMKLAENLANLVEESSHMDERRPKSRMELKRSLELRIKKRVKEQYVNGKFRNLMAKVISN 197

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TL+ AYD IR+NSN+N++   D   FDS+A+EL  GSFD+ AN +SIS +GA +E LV
Sbjct: 198  PETLQDAYDCIRLNSNINIAFNDDNTYFDSIAKELCCGSFDVNANTFSISKKGAREEVLV 257

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+ L +IQEAIRI LEVVYKP +SKISHG RSGRG S+A KYI KEI N +WWFTL++
Sbjct: 258  LPNINLRVIQEAIRIVLEVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTLLI 317

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLI+ ME K+EDP LYA+I +MF+A V+N+EFGGF KGHGLPQEG+LSPI
Sbjct: 318  NKKLDACILGKLITVMEDKVEDPSLYAMIQSMFNANVLNLEFGGFPKGHGLPQEGVLSPI 377

Query: 901  FMNIYLDNFDSEILNLSMKYEVL------DGEPDGSQSKLRSWFKRQMGTIKKNEEVMSG 1062
             MNIYL+ FD E   LSMKYE L      D +P   QSKLRSWF+R +   K N+   +G
Sbjct: 378  LMNIYLNQFDHEFYRLSMKYEALSPSLHYDQKP---QSKLRSWFRRHL---KGNDRGCAG 431

Query: 1063 -----IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPR 1227
                 IRVH CR MDEI  +V GSK+ +LDFKSE+  Y+++SL+LD D   ++L C  P 
Sbjct: 432  EESFSIRVHSCRFMDEIFFSVAGSKDAALDFKSEVLNYLQKSLHLDVDDRAELLSCQMPH 491

Query: 1228 GVKFIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVK 1407
            G+ F+G  V++ V++S   RAVHKLKEKV L+ LQK+EAW+ GTV IGKK LGHG KKVK
Sbjct: 492  GIHFLGTLVRRNVRESPATRAVHKLKEKVALFGLQKEEAWNAGTVSIGKKWLGHGLKKVK 551

Query: 1408 ESEIKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAE 1587
            ESEI+ LAD  SVLSQ+S FRKSGMETDHWYK L+KIWM++   K  E+E++ILSK++AE
Sbjct: 552  ESEIEHLADCRSVLSQISHFRKSGMETDHWYKHLLKIWMEDVKAKAAESEDAILSKYVAE 611

Query: 1588 KSLPQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLE 1767
             +LPQEL++SFY FQ   K YVSSET STL LL  + +S ESV +TE++AP+ AI+  L 
Sbjct: 612  PALPQELRNSFYEFQRQVKTYVSSETTSTLSLLPSAASSTESVIITEIIAPVNAIKKRLL 671

Query: 1768 RYGITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKS 1947
            RYG+T S+GYPR   +L+L D+D IIDWFSGIV RWLRWY  CDNF EVKL+I+  VRKS
Sbjct: 672  RYGLTTSDGYPRTSSLLILQDNDQIIDWFSGIVRRWLRWYAECDNFNEVKLLISNIVRKS 731

Query: 1948 CIRTLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHS 2115
            CIRTLA KYR+HE EI ++FD++LSRIP T++ E E+++ET       +D+ L YGI +S
Sbjct: 732  CIRTLAAKYRVHETEIAKRFDTELSRIPSTQEIEQEMVNETSDAQSYDNDEALTYGISYS 791

Query: 2116 GLCLLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRR 2295
            GLCLLSLARMVSESRPC CFV GC   AP VYT+HVM+RQKFP WKTGFSSCIHPS+NRR
Sbjct: 792  GLCLLSLARMVSESRPCNCFVNGCMAPAPSVYTLHVMERQKFPGWKTGFSSCIHPSLNRR 851

Query: 2296 RIGLCNQHLKDLFLGHISLQSISFG 2370
            R+GLC QHLKDL+LGHISLQSI+FG
Sbjct: 852  RLGLCKQHLKDLYLGHISLQSINFG 876


>ref|XP_021830184.1| nuclear intron maturase 4, mitochondrial isoform X2 [Prunus avium]
 ref|XP_021830185.1| nuclear intron maturase 4, mitochondrial isoform X2 [Prunus avium]
          Length = 838

 Score =  941 bits (2433), Expect = 0.0
 Identities = 467/745 (62%), Positives = 578/745 (77%), Gaps = 15/745 (2%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +M LA NLA LVEESS ++ER+P SR+ELKR LEL IKKRVK+Q+++GKF ++  KVI+N
Sbjct: 97   EMKLAENLANLVEESSHMDERRPKSRMELKRSLELRIKKRVKEQYVNGKFRNLMAKVISN 156

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TL+ AYD IR+NSN+N++   D   FDS+A+EL  GSFD+ AN +SIS +GA +E LV
Sbjct: 157  PETLQDAYDCIRLNSNINIAFNDDNTYFDSIAKELCCGSFDVNANTFSISKKGAREEVLV 216

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+ L +IQEAIRI LEVVYKP +SKISHG RSGRG S+A KYI KEI N +WWFTL++
Sbjct: 217  LPNINLRVIQEAIRIVLEVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTLLI 276

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLI+ ME K+EDP LYA+I +MF+A V+N+EFGGF KGHGLPQEG+LSPI
Sbjct: 277  NKKLDACILGKLITVMEDKVEDPSLYAMIQSMFNANVLNLEFGGFPKGHGLPQEGVLSPI 336

Query: 901  FMNIYLDNFDSEILNLSMKYEVL------DGEPDGSQSKLRSWFKRQMGTIKKNEEVMSG 1062
             MNIYL+ FD E   LSMKYE L      D +P   QSKLRSWF+R +   K N+   +G
Sbjct: 337  LMNIYLNQFDHEFYRLSMKYEALSPSLHYDQKP---QSKLRSWFRRHL---KGNDRGCAG 390

Query: 1063 -----IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPR 1227
                 IRVH CR MDEI  +V GSK+ +LDFKSE+  Y+++SL+LD D   ++L C  P 
Sbjct: 391  EESFSIRVHSCRFMDEIFFSVAGSKDAALDFKSEVLNYLQKSLHLDVDDRAELLSCQMPH 450

Query: 1228 GVKFIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVK 1407
            G+ F+G  V++ V++S   RAVHKLKEKV L+ LQK+EAW+ GTV IGKK LGHG KKVK
Sbjct: 451  GIHFLGTLVRRNVRESPATRAVHKLKEKVALFGLQKEEAWNAGTVSIGKKWLGHGLKKVK 510

Query: 1408 ESEIKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAE 1587
            ESEI+ LAD  SVLSQ+S FRKSGMETDHWYK L+KIWM++   K  E+E++ILSK++AE
Sbjct: 511  ESEIEHLADCRSVLSQISHFRKSGMETDHWYKHLLKIWMEDVKAKAAESEDAILSKYVAE 570

Query: 1588 KSLPQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLE 1767
             +LPQEL++SFY FQ   K YVSSET STL LL  + +S ESV +TE++AP+ AI+  L 
Sbjct: 571  PALPQELRNSFYEFQRQVKTYVSSETTSTLSLLPSAASSTESVIITEIIAPVNAIKKRLL 630

Query: 1768 RYGITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKS 1947
            RYG+T S+GYPR   +L+L D+D IIDWFSGIV RWLRWY  CDNF EVKL+I+  VRKS
Sbjct: 631  RYGLTTSDGYPRTSSLLILQDNDQIIDWFSGIVRRWLRWYAECDNFNEVKLLISNIVRKS 690

Query: 1948 CIRTLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHS 2115
            CIRTLA KYR+HE EI ++FD++LSRIP T++ E E+++ET       +D+ L YGI +S
Sbjct: 691  CIRTLAAKYRVHETEIAKRFDTELSRIPSTQEIEQEMVNETSDAQSYDNDEALTYGISYS 750

Query: 2116 GLCLLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRR 2295
            GLCLLSLARMVSESRPC CFV GC   AP VYT+HVM+RQKFP WKTGFSSCIHPS+NRR
Sbjct: 751  GLCLLSLARMVSESRPCNCFVNGCMAPAPSVYTLHVMERQKFPGWKTGFSSCIHPSLNRR 810

Query: 2296 RIGLCNQHLKDLFLGHISLQSISFG 2370
            R+GLC QHLKDL+LGHISLQSI+FG
Sbjct: 811  RLGLCKQHLKDLYLGHISLQSINFG 835


>ref|XP_020413650.1| uncharacterized protein LOC109947584 isoform X2 [Prunus persica]
 ref|XP_020413651.1| uncharacterized protein LOC109947584 isoform X2 [Prunus persica]
          Length = 817

 Score =  936 bits (2419), Expect = 0.0
 Identities = 465/742 (62%), Positives = 575/742 (77%), Gaps = 12/742 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +M LA NLA LV+ESS ++ER+P SR+ELKR LEL IKKRVK+Q+I+GKF ++  KVI+N
Sbjct: 76   EMKLAENLANLVKESSHMDERRPKSRMELKRSLELRIKKRVKEQYINGKFRNLMAKVISN 135

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TLR AYD IR+NSN+N +   D   FDS+A+EL  GSFD+ AN +SIS +GA +E LV
Sbjct: 136  PETLRDAYDCIRLNSNINTAFNDDNTSFDSIAKELGCGSFDVNANTFSISKKGAREEVLV 195

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+ L +IQEAIRI LEVVYKP +SKISHG RSGRG S+A KYI KEI N +WWFTL++
Sbjct: 196  LPNINLRVIQEAIRIVLEVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTLLI 255

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLI+ ME K+EDP LYA+I +MF+A V+N+EFGGF KGHGLPQEG+LS I
Sbjct: 256  NKKLDACILGKLITVMEDKVEDPSLYAMIQSMFNANVLNLEFGGFPKGHGLPQEGVLSSI 315

Query: 901  FMNIYLDNFDSEILNLSMKYEVLDGE---PDGSQSKLRSWFKRQMGTIKKNEEVMSG--- 1062
             MNIYL+ FD E   LSMKYE L         SQSKLRSWF+R++   K N+   +G   
Sbjct: 316  LMNIYLNQFDYEFYRLSMKYEALSPSLHSDQKSQSKLRSWFRRRL---KGNDLGCAGEES 372

Query: 1063 --IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVK 1236
              IRVH CR MDEI  +V GSK+ +LDFKSE+  Y+++SL+LD D   ++L C    G++
Sbjct: 373  FSIRVHSCRFMDEIFFSVAGSKDAALDFKSEVLNYLQKSLHLDVDDQAELLSCQMLHGIR 432

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++S   RAVHKLKEKV L+ LQK+EAW+ GTV IGKK LGHG KKVKESE
Sbjct: 433  FLGTLVRRNVRESPATRAVHKLKEKVALFGLQKEEAWNAGTVSIGKKWLGHGLKKVKESE 492

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            IK LAD  SVLS++S FRKSGMETDHWYK L+KIWM++   K  E+E++ILSK++AE +L
Sbjct: 493  IKHLADCRSVLSKISHFRKSGMETDHWYKHLLKIWMEDVNAKAAESEDAILSKYVAEPAL 552

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQEL++SFY FQ   K YVSSET STL LL  + +S ESV +TE++AP+ AI+  L RYG
Sbjct: 553  PQELRNSFYEFQRQVKTYVSSETTSTLSLLPSAASSTESVIITEIIAPVNAIKKRLLRYG 612

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T S+GYPR   +L+L D+D IIDWFSGIV RWLRWY  CDNF EVKL+I+  VRKSCIR
Sbjct: 613  LTTSDGYPRTSSLLILQDNDQIIDWFSGIVRRWLRWYAECDNFNEVKLLISNIVRKSCIR 672

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FD++LSRIP T++ E E+++ET       +D+ L YGI +SGLC
Sbjct: 673  TLAAKYRVHETEIEKRFDTELSRIPSTQEIEQEMVNETSDAQSYDNDEALTYGISYSGLC 732

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVSESRPC CFV GC   AP VYT+HVM+RQKFP W TGFSSCIHPS+NRRR+G
Sbjct: 733  LLSLARMVSESRPCNCFVNGCMAPAPSVYTLHVMERQKFPGWNTGFSSCIHPSLNRRRLG 792

Query: 2305 LCNQHLKDLFLGHISLQSISFG 2370
            LC QHLKDL+LGHISLQSI+FG
Sbjct: 793  LCKQHLKDLYLGHISLQSINFG 814


>ref|XP_020413649.1| uncharacterized protein LOC109947584 isoform X1 [Prunus persica]
 gb|ONI20794.1| hypothetical protein PRUPE_2G034000 [Prunus persica]
 gb|ONI20795.1| hypothetical protein PRUPE_2G034000 [Prunus persica]
          Length = 900

 Score =  936 bits (2419), Expect = 0.0
 Identities = 465/742 (62%), Positives = 575/742 (77%), Gaps = 12/742 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +M LA NLA LV+ESS ++ER+P SR+ELKR LEL IKKRVK+Q+I+GKF ++  KVI+N
Sbjct: 159  EMKLAENLANLVKESSHMDERRPKSRMELKRSLELRIKKRVKEQYINGKFRNLMAKVISN 218

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TLR AYD IR+NSN+N +   D   FDS+A+EL  GSFD+ AN +SIS +GA +E LV
Sbjct: 219  PETLRDAYDCIRLNSNINTAFNDDNTSFDSIAKELGCGSFDVNANTFSISKKGAREEVLV 278

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+ L +IQEAIRI LEVVYKP +SKISHG RSGRG S+A KYI KEI N +WWFTL++
Sbjct: 279  LPNINLRVIQEAIRIVLEVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTLLI 338

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLI+ ME K+EDP LYA+I +MF+A V+N+EFGGF KGHGLPQEG+LS I
Sbjct: 339  NKKLDACILGKLITVMEDKVEDPSLYAMIQSMFNANVLNLEFGGFPKGHGLPQEGVLSSI 398

Query: 901  FMNIYLDNFDSEILNLSMKYEVLDGE---PDGSQSKLRSWFKRQMGTIKKNEEVMSG--- 1062
             MNIYL+ FD E   LSMKYE L         SQSKLRSWF+R++   K N+   +G   
Sbjct: 399  LMNIYLNQFDYEFYRLSMKYEALSPSLHSDQKSQSKLRSWFRRRL---KGNDLGCAGEES 455

Query: 1063 --IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVK 1236
              IRVH CR MDEI  +V GSK+ +LDFKSE+  Y+++SL+LD D   ++L C    G++
Sbjct: 456  FSIRVHSCRFMDEIFFSVAGSKDAALDFKSEVLNYLQKSLHLDVDDQAELLSCQMLHGIR 515

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++S   RAVHKLKEKV L+ LQK+EAW+ GTV IGKK LGHG KKVKESE
Sbjct: 516  FLGTLVRRNVRESPATRAVHKLKEKVALFGLQKEEAWNAGTVSIGKKWLGHGLKKVKESE 575

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            IK LAD  SVLS++S FRKSGMETDHWYK L+KIWM++   K  E+E++ILSK++AE +L
Sbjct: 576  IKHLADCRSVLSKISHFRKSGMETDHWYKHLLKIWMEDVNAKAAESEDAILSKYVAEPAL 635

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQEL++SFY FQ   K YVSSET STL LL  + +S ESV +TE++AP+ AI+  L RYG
Sbjct: 636  PQELRNSFYEFQRQVKTYVSSETTSTLSLLPSAASSTESVIITEIIAPVNAIKKRLLRYG 695

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T S+GYPR   +L+L D+D IIDWFSGIV RWLRWY  CDNF EVKL+I+  VRKSCIR
Sbjct: 696  LTTSDGYPRTSSLLILQDNDQIIDWFSGIVRRWLRWYAECDNFNEVKLLISNIVRKSCIR 755

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FD++LSRIP T++ E E+++ET       +D+ L YGI +SGLC
Sbjct: 756  TLAAKYRVHETEIEKRFDTELSRIPSTQEIEQEMVNETSDAQSYDNDEALTYGISYSGLC 815

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVSESRPC CFV GC   AP VYT+HVM+RQKFP W TGFSSCIHPS+NRRR+G
Sbjct: 816  LLSLARMVSESRPCNCFVNGCMAPAPSVYTLHVMERQKFPGWNTGFSSCIHPSLNRRRLG 875

Query: 2305 LCNQHLKDLFLGHISLQSISFG 2370
            LC QHLKDL+LGHISLQSI+FG
Sbjct: 876  LCKQHLKDLYLGHISLQSINFG 897


>gb|ONI20791.1| hypothetical protein PRUPE_2G034000 [Prunus persica]
          Length = 902

 Score =  936 bits (2419), Expect = 0.0
 Identities = 465/742 (62%), Positives = 575/742 (77%), Gaps = 12/742 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +M LA NLA LV+ESS ++ER+P SR+ELKR LEL IKKRVK+Q+I+GKF ++  KVI+N
Sbjct: 159  EMKLAENLANLVKESSHMDERRPKSRMELKRSLELRIKKRVKEQYINGKFRNLMAKVISN 218

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TLR AYD IR+NSN+N +   D   FDS+A+EL  GSFD+ AN +SIS +GA +E LV
Sbjct: 219  PETLRDAYDCIRLNSNINTAFNDDNTSFDSIAKELGCGSFDVNANTFSISKKGAREEVLV 278

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+ L +IQEAIRI LEVVYKP +SKISHG RSGRG S+A KYI KEI N +WWFTL++
Sbjct: 279  LPNINLRVIQEAIRIVLEVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTLLI 338

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLI+ ME K+EDP LYA+I +MF+A V+N+EFGGF KGHGLPQEG+LS I
Sbjct: 339  NKKLDACILGKLITVMEDKVEDPSLYAMIQSMFNANVLNLEFGGFPKGHGLPQEGVLSSI 398

Query: 901  FMNIYLDNFDSEILNLSMKYEVLDGE---PDGSQSKLRSWFKRQMGTIKKNEEVMSG--- 1062
             MNIYL+ FD E   LSMKYE L         SQSKLRSWF+R++   K N+   +G   
Sbjct: 399  LMNIYLNQFDYEFYRLSMKYEALSPSLHSDQKSQSKLRSWFRRRL---KGNDLGCAGEES 455

Query: 1063 --IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVK 1236
              IRVH CR MDEI  +V GSK+ +LDFKSE+  Y+++SL+LD D   ++L C    G++
Sbjct: 456  FSIRVHSCRFMDEIFFSVAGSKDAALDFKSEVLNYLQKSLHLDVDDQAELLSCQMLHGIR 515

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++S   RAVHKLKEKV L+ LQK+EAW+ GTV IGKK LGHG KKVKESE
Sbjct: 516  FLGTLVRRNVRESPATRAVHKLKEKVALFGLQKEEAWNAGTVSIGKKWLGHGLKKVKESE 575

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            IK LAD  SVLS++S FRKSGMETDHWYK L+KIWM++   K  E+E++ILSK++AE +L
Sbjct: 576  IKHLADCRSVLSKISHFRKSGMETDHWYKHLLKIWMEDVNAKAAESEDAILSKYVAEPAL 635

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQEL++SFY FQ   K YVSSET STL LL  + +S ESV +TE++AP+ AI+  L RYG
Sbjct: 636  PQELRNSFYEFQRQVKTYVSSETTSTLSLLPSAASSTESVIITEIIAPVNAIKKRLLRYG 695

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T S+GYPR   +L+L D+D IIDWFSGIV RWLRWY  CDNF EVKL+I+  VRKSCIR
Sbjct: 696  LTTSDGYPRTSSLLILQDNDQIIDWFSGIVRRWLRWYAECDNFNEVKLLISNIVRKSCIR 755

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FD++LSRIP T++ E E+++ET       +D+ L YGI +SGLC
Sbjct: 756  TLAAKYRVHETEIEKRFDTELSRIPSTQEIEQEMVNETSDAQSYDNDEALTYGISYSGLC 815

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVSESRPC CFV GC   AP VYT+HVM+RQKFP W TGFSSCIHPS+NRRR+G
Sbjct: 816  LLSLARMVSESRPCNCFVNGCMAPAPSVYTLHVMERQKFPGWNTGFSSCIHPSLNRRRLG 875

Query: 2305 LCNQHLKDLFLGHISLQSISFG 2370
            LC QHLKDL+LGHISLQSI+FG
Sbjct: 876  LCKQHLKDLYLGHISLQSINFG 897


>gb|ONI20792.1| hypothetical protein PRUPE_2G034000 [Prunus persica]
          Length = 819

 Score =  936 bits (2419), Expect = 0.0
 Identities = 465/742 (62%), Positives = 575/742 (77%), Gaps = 12/742 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +M LA NLA LV+ESS ++ER+P SR+ELKR LEL IKKRVK+Q+I+GKF ++  KVI+N
Sbjct: 76   EMKLAENLANLVKESSHMDERRPKSRMELKRSLELRIKKRVKEQYINGKFRNLMAKVISN 135

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TLR AYD IR+NSN+N +   D   FDS+A+EL  GSFD+ AN +SIS +GA +E LV
Sbjct: 136  PETLRDAYDCIRLNSNINTAFNDDNTSFDSIAKELGCGSFDVNANTFSISKKGAREEVLV 195

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+ L +IQEAIRI LEVVYKP +SKISHG RSGRG S+A KYI KEI N +WWFTL++
Sbjct: 196  LPNINLRVIQEAIRIVLEVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTLLI 255

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLI+ ME K+EDP LYA+I +MF+A V+N+EFGGF KGHGLPQEG+LS I
Sbjct: 256  NKKLDACILGKLITVMEDKVEDPSLYAMIQSMFNANVLNLEFGGFPKGHGLPQEGVLSSI 315

Query: 901  FMNIYLDNFDSEILNLSMKYEVLDGE---PDGSQSKLRSWFKRQMGTIKKNEEVMSG--- 1062
             MNIYL+ FD E   LSMKYE L         SQSKLRSWF+R++   K N+   +G   
Sbjct: 316  LMNIYLNQFDYEFYRLSMKYEALSPSLHSDQKSQSKLRSWFRRRL---KGNDLGCAGEES 372

Query: 1063 --IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVK 1236
              IRVH CR MDEI  +V GSK+ +LDFKSE+  Y+++SL+LD D   ++L C    G++
Sbjct: 373  FSIRVHSCRFMDEIFFSVAGSKDAALDFKSEVLNYLQKSLHLDVDDQAELLSCQMLHGIR 432

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++S   RAVHKLKEKV L+ LQK+EAW+ GTV IGKK LGHG KKVKESE
Sbjct: 433  FLGTLVRRNVRESPATRAVHKLKEKVALFGLQKEEAWNAGTVSIGKKWLGHGLKKVKESE 492

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            IK LAD  SVLS++S FRKSGMETDHWYK L+KIWM++   K  E+E++ILSK++AE +L
Sbjct: 493  IKHLADCRSVLSKISHFRKSGMETDHWYKHLLKIWMEDVNAKAAESEDAILSKYVAEPAL 552

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQEL++SFY FQ   K YVSSET STL LL  + +S ESV +TE++AP+ AI+  L RYG
Sbjct: 553  PQELRNSFYEFQRQVKTYVSSETTSTLSLLPSAASSTESVIITEIIAPVNAIKKRLLRYG 612

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T S+GYPR   +L+L D+D IIDWFSGIV RWLRWY  CDNF EVKL+I+  VRKSCIR
Sbjct: 613  LTTSDGYPRTSSLLILQDNDQIIDWFSGIVRRWLRWYAECDNFNEVKLLISNIVRKSCIR 672

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FD++LSRIP T++ E E+++ET       +D+ L YGI +SGLC
Sbjct: 673  TLAAKYRVHETEIEKRFDTELSRIPSTQEIEQEMVNETSDAQSYDNDEALTYGISYSGLC 732

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVSESRPC CFV GC   AP VYT+HVM+RQKFP W TGFSSCIHPS+NRRR+G
Sbjct: 733  LLSLARMVSESRPCNCFVNGCMAPAPSVYTLHVMERQKFPGWNTGFSSCIHPSLNRRRLG 792

Query: 2305 LCNQHLKDLFLGHISLQSISFG 2370
            LC QHLKDL+LGHISLQSI+FG
Sbjct: 793  LCKQHLKDLYLGHISLQSINFG 814


>ref|XP_019199271.1| PREDICTED: uncharacterized protein LOC109192986 isoform X2 [Ipomoea
            nil]
          Length = 802

 Score =  936 bits (2418), Expect = 0.0
 Identities = 462/742 (62%), Positives = 572/742 (77%), Gaps = 11/742 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +MSLA NLA LVEE+S V E K MSR+E KR+LE  IKK+VK+Q+++GKF D+  K+I +
Sbjct: 59   RMSLAQNLASLVEETSIVKETKRMSRMEQKRFLERQIKKKVKEQYMNGKFKDLMAKLIGD 118

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
              TLR AYD IR+NSNV+L+ + D + F+++AEELS G FD+ AN YSI+TRG  KE L+
Sbjct: 119  HKTLRDAYDCIRLNSNVDLALDGDDLPFEAMAEELSLGHFDVSANTYSIATRGPKKEVLI 178

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+KL ++QEAIRI LEVVY+PH+SKISHGCRSGR   SA +YICK+++N NWWFTL +
Sbjct: 179  LPNVKLRVVQEAIRIVLEVVYRPHFSKISHGCRSGRSHLSALRYICKDMKNANWWFTLPV 238

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLIS ME KIEDP L+AII +MFD  V+N+EFGGF KGHGLPQEG LSPI
Sbjct: 239  NKKLDDCILTKLISVMEDKIEDPSLFAIIRSMFDCQVLNLEFGGFPKGHGLPQEGSLSPI 298

Query: 901  FMNIYLDNFDSEILNLSMKYEVLDGEP----DGSQSKLRSWFKRQM---GTIKKNEEVMS 1059
             MNIYLD FD E+  +SM+YE LD  P    D + SKLRSWF+RQM   G ++ N    S
Sbjct: 299  LMNIYLDLFDREMYRMSMRYEALDTGPKTEEDANLSKLRSWFRRQMNGGGAVEVNTGGYS 358

Query: 1060 GIRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKF 1239
            G+RVH CR MDEI +AV G KEV++  KSE + +++ SL+L+ D   DILPC    G++F
Sbjct: 359  GVRVHSCRFMDEIFIAVSGHKEVAVALKSETQDFLKNSLHLEVDNDIDILPCDGHNGIRF 418

Query: 1240 IGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEI 1419
            +G  VK+ + +S  ++AVHKLKEKV+L+ LQKQEAWD GT+RIGKK L HG KKVKESEI
Sbjct: 419  LGCVVKRSLIESPALKAVHKLKEKVKLFALQKQEAWDAGTIRIGKKCLAHGLKKVKESEI 478

Query: 1420 KQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLP 1599
            K LAD++S LS+VS FRK+GMETDHW+KVL+K+WMQ+   K  E +E ILSK + E SLP
Sbjct: 479  KHLADNSSTLSKVSHFRKAGMETDHWFKVLLKVWMQDINAKSAENQEFILSKHLVEPSLP 538

Query: 1600 QELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGI 1779
            QEL+DSFY FQ     Y+SSET STL LL ++ +S ES   T+++AP+  I+  L RYG+
Sbjct: 539  QELRDSFYEFQKCVDEYISSETASTLALLPNADSSSESAFRTQIIAPVNTIKKRLYRYGL 598

Query: 1780 TNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRT 1959
            TNS+G+P AC ML+ LDH+HIIDWFSGIV R LRWY   DNF EVK II  Q+R SCIRT
Sbjct: 599  TNSDGFPHACHMLIFLDHNHIIDWFSGIVKRLLRWYSEFDNFNEVKFIICNQLRMSCIRT 658

Query: 1960 LATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLCL 2127
            LA KYR+HE  IE+KFDS+LSRIP T++ E E+ +E   S    +DD L YGI +SG+CL
Sbjct: 659  LAVKYRIHEKVIEKKFDSELSRIPSTDEVELELTNEASTSQTSDNDDSLTYGISYSGICL 718

Query: 2128 LSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGL 2307
            LSLARMVS+SRPC CFV GC  AAP VYT+HVM+RQKFP WKTGFSSCIHPS++ RRIGL
Sbjct: 719  LSLARMVSQSRPCNCFVTGCMAAAPRVYTLHVMERQKFPGWKTGFSSCIHPSLHTRRIGL 778

Query: 2308 CNQHLKDLFLGHISLQSISFGA 2373
            CN HLKDLFLG+ISLQSI FG+
Sbjct: 779  CNDHLKDLFLGNISLQSIDFGS 800


>ref|XP_019199270.1| PREDICTED: uncharacterized protein LOC109192986 isoform X1 [Ipomoea
            nil]
          Length = 821

 Score =  936 bits (2418), Expect = 0.0
 Identities = 462/742 (62%), Positives = 572/742 (77%), Gaps = 11/742 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            +MSLA NLA LVEE+S V E K MSR+E KR+LE  IKK+VK+Q+++GKF D+  K+I +
Sbjct: 78   RMSLAQNLASLVEETSIVKETKRMSRMEQKRFLERQIKKKVKEQYMNGKFKDLMAKLIGD 137

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
              TLR AYD IR+NSNV+L+ + D + F+++AEELS G FD+ AN YSI+TRG  KE L+
Sbjct: 138  HKTLRDAYDCIRLNSNVDLALDGDDLPFEAMAEELSLGHFDVSANTYSIATRGPKKEVLI 197

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPN+KL ++QEAIRI LEVVY+PH+SKISHGCRSGR   SA +YICK+++N NWWFTL +
Sbjct: 198  LPNVKLRVVQEAIRIVLEVVYRPHFSKISHGCRSGRSHLSALRYICKDMKNANWWFTLPV 257

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL KLIS ME KIEDP L+AII +MFD  V+N+EFGGF KGHGLPQEG LSPI
Sbjct: 258  NKKLDDCILTKLISVMEDKIEDPSLFAIIRSMFDCQVLNLEFGGFPKGHGLPQEGSLSPI 317

Query: 901  FMNIYLDNFDSEILNLSMKYEVLDGEP----DGSQSKLRSWFKRQM---GTIKKNEEVMS 1059
             MNIYLD FD E+  +SM+YE LD  P    D + SKLRSWF+RQM   G ++ N    S
Sbjct: 318  LMNIYLDLFDREMYRMSMRYEALDTGPKTEEDANLSKLRSWFRRQMNGGGAVEVNTGGYS 377

Query: 1060 GIRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKF 1239
            G+RVH CR MDEI +AV G KEV++  KSE + +++ SL+L+ D   DILPC    G++F
Sbjct: 378  GVRVHSCRFMDEIFIAVSGHKEVAVALKSETQDFLKNSLHLEVDNDIDILPCDGHNGIRF 437

Query: 1240 IGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEI 1419
            +G  VK+ + +S  ++AVHKLKEKV+L+ LQKQEAWD GT+RIGKK L HG KKVKESEI
Sbjct: 438  LGCVVKRSLIESPALKAVHKLKEKVKLFALQKQEAWDAGTIRIGKKCLAHGLKKVKESEI 497

Query: 1420 KQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLP 1599
            K LAD++S LS+VS FRK+GMETDHW+KVL+K+WMQ+   K  E +E ILSK + E SLP
Sbjct: 498  KHLADNSSTLSKVSHFRKAGMETDHWFKVLLKVWMQDINAKSAENQEFILSKHLVEPSLP 557

Query: 1600 QELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGI 1779
            QEL+DSFY FQ     Y+SSET STL LL ++ +S ES   T+++AP+  I+  L RYG+
Sbjct: 558  QELRDSFYEFQKCVDEYISSETASTLALLPNADSSSESAFRTQIIAPVNTIKKRLYRYGL 617

Query: 1780 TNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRT 1959
            TNS+G+P AC ML+ LDH+HIIDWFSGIV R LRWY   DNF EVK II  Q+R SCIRT
Sbjct: 618  TNSDGFPHACHMLIFLDHNHIIDWFSGIVKRLLRWYSEFDNFNEVKFIICNQLRMSCIRT 677

Query: 1960 LATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLCL 2127
            LA KYR+HE  IE+KFDS+LSRIP T++ E E+ +E   S    +DD L YGI +SG+CL
Sbjct: 678  LAVKYRIHEKVIEKKFDSELSRIPSTDEVELELTNEASTSQTSDNDDSLTYGISYSGICL 737

Query: 2128 LSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGL 2307
            LSLARMVS+SRPC CFV GC  AAP VYT+HVM+RQKFP WKTGFSSCIHPS++ RRIGL
Sbjct: 738  LSLARMVSQSRPCNCFVTGCMAAAPRVYTLHVMERQKFPGWKTGFSSCIHPSLHTRRIGL 797

Query: 2308 CNQHLKDLFLGHISLQSISFGA 2373
            CN HLKDLFLG+ISLQSI FG+
Sbjct: 798  CNDHLKDLFLGNISLQSIDFGS 819


>ref|XP_008348277.1| PREDICTED: uncharacterized protein LOC103411417 [Malus domestica]
          Length = 820

 Score =  934 bits (2414), Expect = 0.0
 Identities = 464/744 (62%), Positives = 572/744 (76%), Gaps = 13/744 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIAN 360
            KM LA NLA LVEESS +NER+P  R++LKR LEL IKKRVK+Q+I+GKF D+  KVIAN
Sbjct: 78   KMKLAENLACLVEESSHINERRPKGRMQLKRCLELRIKKRVKEQYINGKFRDLMVKVIAN 137

Query: 361  PYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETLV 540
            P TL+ AYD IR+NSNV+++       FDS+AEE+  GSFD  AN +SIS RG   E LV
Sbjct: 138  PETLQDAYDCIRLNSNVDIALSDAKNSFDSMAEEMRHGSFDANANTFSISKRGVGNEVLV 197

Query: 541  LPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLVL 720
            LPNL L +IQEAIR+ LEVVYKP +SKISHG RSGRG S+A KYI KEI N +WWFT++L
Sbjct: 198  LPNLNLKVIQEAIRVVLEVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTVLL 257

Query: 721  NRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSPI 900
            N+K+D  IL +L+  ME KI DP L+ +I +MF A V+N+EFGGF KGHGLPQEGILSPI
Sbjct: 258  NKKLDACILGELLKAMEGKIVDPSLFDMIKSMFHANVLNLEFGGFPKGHGLPQEGILSPI 317

Query: 901  FMNIYLDNFDSEILNLSMKYEVL--DGEPD-GSQSKLRSWFKRQMGTIKKNEEV------ 1053
             MNIYLD FD E   LSMKYE L  D + D  SQSKLRSWF+R +   K N ++      
Sbjct: 318  LMNIYLDQFDREFYRLSMKYEALSLDSQNDQNSQSKLRSWFRRHL---KGNNDLGCAGEE 374

Query: 1054 MSGIRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGV 1233
                RVH CR MDEI  +  GSK+ +L+FKSE+  Y+++SL+L+ D  T++LPC  P G+
Sbjct: 375  SCSARVHSCRFMDEIFFSXSGSKDAALEFKSEVLNYLQKSLHLEVDDQTELLPCQKPHGI 434

Query: 1234 KFIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKES 1413
            +F+G  V++ V +S   +AVHKLKEKV L+ LQKQEAW+ GTV IGKK LGHG KKVKES
Sbjct: 435  RFLGTLVRRNVIESPATKAVHKLKEKVALFGLQKQEAWNVGTVHIGKKWLGHGLKKVKES 494

Query: 1414 EIKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKS 1593
            EIK LADS+SVL+Q+S FRK GMETDHWYK L+KIWMQ+   K  E+EE++LSK +AE +
Sbjct: 495  EIKHLADSSSVLNQISHFRKFGMETDHWYKHLLKIWMQDVNAKAEESEEAVLSKHVAEPA 554

Query: 1594 LPQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERY 1773
            LP+EL +SFY FQ   + YVSSET S L LL ++G+S ESV +TE++AP+ A++  L+RY
Sbjct: 555  LPEELTNSFYEFQRQVEKYVSSETSSILALLPNAGSSAESVVITEIIAPVNAVKKRLQRY 614

Query: 1774 GITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCI 1953
            G+T S+GYPR   +LVL D+D IIDWFSGIV RWLRWY  C NF EVKL+I++ VRKSCI
Sbjct: 615  GLTTSDGYPRTSSLLVLQDNDQIIDWFSGIVRRWLRWYAECVNFKEVKLLISDLVRKSCI 674

Query: 1954 RTLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHSGL 2121
            RTLA KYR+HE EIER+FD++LSRIP T++ E E++ ET       +D+ L+YGI +SGL
Sbjct: 675  RTLAAKYRVHENEIERRFDTELSRIPSTQEIEQEMVDETSDTQAFENDEALMYGISYSGL 734

Query: 2122 CLLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRI 2301
            C+LSLARMVSESRPC CFV GC  +AP VYT+HVM+RQKFP WKTGFSSCIHPS+NRRRI
Sbjct: 735  CVLSLARMVSESRPCNCFVFGCMASAPSVYTLHVMERQKFPGWKTGFSSCIHPSLNRRRI 794

Query: 2302 GLCNQHLKDLFLGHISLQSISFGA 2373
            GLC QHLKDL+LGH+SLQS+ FGA
Sbjct: 795  GLCKQHLKDLYLGHVSLQSVDFGA 818


>ref|XP_015891101.1| PREDICTED: uncharacterized protein LOC107425606 [Ziziphus jujuba]
 ref|XP_015891103.1| PREDICTED: uncharacterized protein LOC107425606 [Ziziphus jujuba]
 ref|XP_015891104.1| PREDICTED: uncharacterized protein LOC107425606 [Ziziphus jujuba]
          Length = 791

 Score =  932 bits (2410), Expect = 0.0
 Identities = 454/741 (61%), Positives = 582/741 (78%), Gaps = 10/741 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESS-SVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIA 357
            KM+LA +LA L+  +S  V+  KP SR+ELKRYLEL IKKRVK+Q+I+GKF  +   VIA
Sbjct: 43   KMTLAKDLACLIGNNSIEVDAGKPKSRMELKRYLELRIKKRVKEQYINGKFQGLVSNVIA 102

Query: 358  NPYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETL 537
            N  TL+ AY+ IR+NSNV+++   D I F+S+A+EL  G+FD+ AN +SIST+G  KE L
Sbjct: 103  NTKTLQDAYNCIRLNSNVDVAVGNDNICFESMAKELRCGNFDVNANTFSISTKGTRKEVL 162

Query: 538  VLPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLV 717
            VLPNL L ++QEAIRI LEVVY+PH+SKISHGCRSGRG  +A KY+ KEI + +WWFT++
Sbjct: 163  VLPNLMLKVVQEAIRIVLEVVYRPHFSKISHGCRSGRGHRTALKYVRKEISSPDWWFTVL 222

Query: 718  LNRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSP 897
            +N+K+D  IL KL+S ME KIEDP LY II +MF+A V+N+EFGGF KG GLPQEG+LSP
Sbjct: 223  INKKLDGCILDKLLSVMEEKIEDPSLYCIIRSMFNAQVLNLEFGGFPKGQGLPQEGVLSP 282

Query: 898  IFMNIYLDNFDSEILNLSMKYEVLDGEPDGSQ---SKLRSWFKRQMGTIKKN--EEVMSG 1062
            I MNIYLD FD E   L+MKYE LD + D +Q   SKLR WF+R + T  ++  +E +  
Sbjct: 283  ILMNIYLDLFDHEFYRLTMKYEALDRDIDTAQRSCSKLRGWFRRHLKTNDRSCVDEEIFN 342

Query: 1063 IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKFI 1242
            +++HCCR MDEIL AV GSK+V+LDF++EI  Y++ SLYLD D  T+I PC  PRG+ F+
Sbjct: 343  VKIHCCRLMDEILFAVSGSKDVALDFRTEILNYLKMSLYLDIDDQTEIFPCYGPRGINFL 402

Query: 1243 GNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEIK 1422
            G  +K+ VK+S  ++AVHKLKEKVEL+ +QKQ+AW+ GTVRIGKK L +G KKVKESEI+
Sbjct: 403  GTLIKRRVKESPRMKAVHKLKEKVELFAIQKQQAWNAGTVRIGKKWLAYGLKKVKESEIR 462

Query: 1423 QLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLPQ 1602
             LADSNSVLSQ+S  RKSGMETDHWYK L+KIWMQE  VK  E+E+SILSK+++E ++P+
Sbjct: 463  HLADSNSVLSQISHLRKSGMETDHWYKHLLKIWMQEVNVKAAESEDSILSKYVSEPAMPK 522

Query: 1603 ELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGIT 1782
            EL++SFY FQ  A+ Y+ SET STL LL +SG+S + +  TE++AP+ AI+ CL RYG+T
Sbjct: 523  ELRNSFYEFQKRAEEYIVSETASTLSLLPNSGSSVKPLIKTEIIAPVDAIKKCLLRYGLT 582

Query: 1783 NSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRTL 1962
             S+GYP A   L+LLD   IIDWFSGIV RWLRWY  CDNF+E+KL+I  Q+RKSC+RTL
Sbjct: 583  TSKGYPCASSSLILLDSSEIIDWFSGIVRRWLRWYGECDNFSEIKLLIAHQIRKSCVRTL 642

Query: 1963 ATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLCLL 2130
            A KYR+HE EIE KFD++LSRIP T+D E E++ ET  +    +D+ L+YGI +SGL LL
Sbjct: 643  AAKYRMHENEIENKFDTELSRIPSTQDIEQEMVDETSATQAYENDEALMYGISYSGLLLL 702

Query: 2131 SLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGLC 2310
            SL+RMVS+SRPC+CFV+GC   APCVY++HVM+RQKFP WK+GF S IHPS+NR+RIGLC
Sbjct: 703  SLSRMVSQSRPCSCFVMGCLAPAPCVYSLHVMERQKFPGWKSGFPSFIHPSLNRKRIGLC 762

Query: 2311 NQHLKDLFLGHISLQSISFGA 2373
             +HLKDL+LGHISLQSI F +
Sbjct: 763  KKHLKDLYLGHISLQSIDFSS 783


>gb|EOX92573.1| Intron maturase isoform 1 [Theobroma cacao]
 gb|EOX92574.1| Intron maturase isoform 1 [Theobroma cacao]
          Length = 801

 Score =  931 bits (2406), Expect = 0.0
 Identities = 466/743 (62%), Positives = 578/743 (77%), Gaps = 11/743 (1%)
 Frame = +1

Query: 178  DKMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIA 357
            +KM+LA +LA LVEESS  +ERK  SR+ELKR LEL +KKRVK+Q+++G FH++  KVIA
Sbjct: 57   EKMTLAKDLACLVEESSHQDERKAKSRMELKRSLELRVKKRVKEQYLNGNFHNLMAKVIA 116

Query: 358  NPYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETL 537
            NP TL+ AY+ IR+NSNV++S + D++ F S+AEEL  GSFD+KAN +S+STRGA+KE L
Sbjct: 117  NPATLQDAYNCIRLNSNVDISVKHDSVCFKSMAEELLEGSFDVKANTFSVSTRGASKEVL 176

Query: 538  VLPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLV 717
            VLPNLK+ I+QEAIRI LEVVYKPH+SKISHGCRSGR  S+A +YI KEI + +WWFTL+
Sbjct: 177  VLPNLKMRIVQEAIRIVLEVVYKPHFSKISHGCRSGRDHSTALRYISKEIASPSWWFTLI 236

Query: 718  LNRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSP 897
            LN+K+D+ IL KLIS ++ K+ED +L A I +MFDA V+N EFGGF KGHGLPQEG+LSP
Sbjct: 237  LNKKVDSSILAKLISKLQDKVEDNQLLATIQSMFDAQVLNFEFGGFPKGHGLPQEGVLSP 296

Query: 898  IFMNIYLDNFDSEILNLSMKYEVL----DGEPDGSQSKLRSWFKRQM--GTIKKNEEVMS 1059
            I MNIYL  FD E   LSM+YE L    D + D S SKLR+WF+RQ+    +K      S
Sbjct: 297  ILMNIYLHLFDQEFYRLSMRYEALHPGFDKDEDMSYSKLRNWFRRQLKENDVKYTVNDDS 356

Query: 1060 GIRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFD-KTTDILPCLDPRGVK 1236
              RVHCCR MDEI  A+ GSK+V+L FKSEI  + + SL LD D + T+ILPC +  G++
Sbjct: 357  SPRVHCCRFMDEIFFAISGSKDVALSFKSEIVDFFKNSLELDVDDEQTEILPCNESNGIR 416

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++    RAVHKLKEKV+L+  QKQ+AW+ GTV IG+K L HG KKVKESE
Sbjct: 417  FLGALVRRSVQEGPATRAVHKLKEKVKLFASQKQDAWNAGTVGIGRKWLAHGLKKVKESE 476

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            I+ LADS S LS++S FRK+GMETDHWYKVL KIWMQ+   K  E EESILSK + E +L
Sbjct: 477  IEHLADSGSTLSKISCFRKAGMETDHWYKVLTKIWMQDIKAKAAENEESILSKCVVEPAL 536

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQELK+S+Y F   A  YV SET +TL LL +S ++  SV++TE++AP+ AI+  L RYG
Sbjct: 537  PQELKESYYEFLKRANEYVYSETAATLALLPNSSSNAGSVAITEIIAPVNAIKKRLLRYG 596

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T SEGYPR   +LVL D+  IIDWFSGIV RWLRWYR CDNF E+KLII+  +RKSCIR
Sbjct: 597  LTTSEGYPRVVSLLVLQDNFQIIDWFSGIVCRWLRWYRECDNFNEIKLIISTILRKSCIR 656

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FDS+L RIP TE+ E E+  ET  S    SD+ L+YGI +SGLC
Sbjct: 657  TLAAKYRIHESEIEKQFDSELCRIPSTEEVEQELTYETSDSHSFDSDEALMYGISYSGLC 716

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVS+SRPC CFV+GC  AAP VYT+H M+RQKFP WKTGFSSCIHPS+N+RRIG
Sbjct: 717  LLSLARMVSQSRPCNCFVMGCSMAAPSVYTLHAMERQKFPGWKTGFSSCIHPSLNKRRIG 776

Query: 2305 LCNQHLKDLFLGHISLQSISFGA 2373
            LC +HLKDL+LGHISLQSI+FGA
Sbjct: 777  LCKKHLKDLYLGHISLQSINFGA 799


>ref|XP_024166789.1| nuclear intron maturase 4, mitochondrial [Rosa chinensis]
 gb|PRQ55368.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 832

 Score =  929 bits (2401), Expect = 0.0
 Identities = 464/742 (62%), Positives = 574/742 (77%), Gaps = 11/742 (1%)
 Frame = +1

Query: 181  KMSLASNLAFLVEESSSVNER-KPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIA 357
            KM LA NLA LV+ESS ++ER +P SR+ELKR LEL IKKRVK+Q+++GKF D+  KVIA
Sbjct: 92   KMKLAENLACLVDESSHIDERGRPRSRMELKRSLELRIKKRVKEQYLNGKFRDLMAKVIA 151

Query: 358  NPYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETL 537
            NP TL+ AYD IR+NSN+++        FDSLAEEL  G+FD+ AN +SIST+ A K+ L
Sbjct: 152  NPETLQDAYDCIRLNSNIDVVVNDKKTSFDSLAEELYLGNFDVNANTFSISTKNARKDVL 211

Query: 538  VLPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLV 717
            VLPN+ L +IQEAIRI LEVVYKPH+SKISHG RSGRG S+A KYI KE  +++WWF L+
Sbjct: 212  VLPNVNLKVIQEAIRIVLEVVYKPHFSKISHGYRSGRGHSTALKYISKETASSDWWFILL 271

Query: 718  LNRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSP 897
            +N+K+D  IL KLIS ME KIEDP LY +I +M  A V+N EFGGF KGHGLPQEG+LSP
Sbjct: 272  INKKLDACILAKLISVMEEKIEDPSLYGMIQSMLHANVLNFEFGGFPKGHGLPQEGVLSP 331

Query: 898  IFMNIYLDNFDSEILNLSMKYEVLDG---EPDGSQSKLRSWFKRQMGTIKKNEEVMSG-- 1062
            I MNIYLD FD E   LSMKYE L         SQS+LRSWF+R +   K N+    G  
Sbjct: 332  ILMNIYLDLFDREFYRLSMKYEALIPGLHTDQKSQSRLRSWFRRNL---KGNDLGCVGEE 388

Query: 1063 -IRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFDKTTDILPCLDPRGVKF 1239
              RVH CR MDEI  +V GSK+ +LDFKSE++ Y+++SL+L+ D  T++LP    +G++F
Sbjct: 389  SFRVHSCRFMDEIFFSVAGSKDAALDFKSEVQNYLQKSLHLEVDDQTELLPFQMSQGIRF 448

Query: 1240 IGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESEI 1419
            +G  +++ VK+S   +AVHKLKEKV L+ LQKQEAWD GTV IGKK LGH  KKVKESEI
Sbjct: 449  LGTLIRRNVKESPATKAVHKLKEKVALFGLQKQEAWDSGTVSIGKKWLGHALKKVKESEI 508

Query: 1420 KQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSLP 1599
            K LADS SVLSQ+S  RKSGMETDHWYK ++KIWMQ+   K  E+EE+ILSK++AE +LP
Sbjct: 509  KHLADSRSVLSQISHCRKSGMETDHWYKYMLKIWMQDVKAKAEESEEAILSKYVAEPALP 568

Query: 1600 QELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYGI 1779
            +EL++SFY FQ H + YVSSET STL LL ++G+S +SV VTE++AP+ AI+  L RYG+
Sbjct: 569  EELRNSFYEFQRHVEKYVSSETTSTLALLPNAGSSTDSVIVTEIIAPVIAIKKRLLRYGL 628

Query: 1780 TNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIRT 1959
               +GYPR C +LVL D+  IIDWFSGIV RWLRWY  CDNF EVKL+I++ VRKSCIRT
Sbjct: 629  LTRDGYPRTCSLLVLQDNVQIIDWFSGIVRRWLRWYAKCDNFNEVKLLISDLVRKSCIRT 688

Query: 1960 LATKYRLHELEIERKFDSDLSRIPLTEDFESEVISET----LVSSDDGLLYGIPHSGLCL 2127
            LA KYR+HE EIE++FD++LS IP T + E E++SET     + +D+ L+YGI +SGLCL
Sbjct: 689  LAAKYRVHEAEIEKRFDTELSSIPSTLEVEQEMVSETSDPQALENDEALMYGISYSGLCL 748

Query: 2128 LSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIGL 2307
            LSLARMVS+SRPC CFV+GC   AP VYT+HVM+RQKFP WKTGFSSCIHPS+NRRRIGL
Sbjct: 749  LSLARMVSQSRPCNCFVIGCTAPAPSVYTIHVMERQKFPGWKTGFSSCIHPSLNRRRIGL 808

Query: 2308 CNQHLKDLFLGHISLQSISFGA 2373
            C QHLK+L+LGH+SLQSI FGA
Sbjct: 809  CKQHLKNLYLGHMSLQSIDFGA 830


>ref|XP_017980919.1| PREDICTED: uncharacterized protein LOC18611882 [Theobroma cacao]
 ref|XP_017980921.1| PREDICTED: uncharacterized protein LOC18611882 [Theobroma cacao]
 ref|XP_017980924.1| PREDICTED: uncharacterized protein LOC18611882 [Theobroma cacao]
          Length = 801

 Score =  928 bits (2399), Expect = 0.0
 Identities = 465/743 (62%), Positives = 576/743 (77%), Gaps = 11/743 (1%)
 Frame = +1

Query: 178  DKMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIA 357
            +KM LA +LA LVEESS  +ERK  SR+ELKR LEL +KKRVK+Q+++G FH++  KVIA
Sbjct: 57   EKMMLAKDLACLVEESSHQDERKAKSRMELKRSLELRVKKRVKEQYLNGNFHNLMAKVIA 116

Query: 358  NPYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETL 537
            NP TL+ AY+ IR+NSNV++S + D++ F S+AEEL  GSFD+KAN +S+STRGA+KE L
Sbjct: 117  NPATLQDAYNCIRLNSNVDISVKHDSVCFKSMAEELLEGSFDVKANTFSVSTRGASKEVL 176

Query: 538  VLPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLV 717
            VLPNLK+ I+QEAIRI LEVVYKPH+SKISHGCRSGR  S+A +YI KEI + +WWFTL+
Sbjct: 177  VLPNLKMRIVQEAIRIVLEVVYKPHFSKISHGCRSGRDHSTALRYISKEIASPSWWFTLI 236

Query: 718  LNRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSP 897
            LN+K+D+ IL KLIS ++ K+ED +L A I +MFDA V+N EFGGF KGHGLPQEG+LSP
Sbjct: 237  LNKKVDSSILAKLISKLQDKVEDNQLLATIQSMFDAQVLNFEFGGFPKGHGLPQEGVLSP 296

Query: 898  IFMNIYLDNFDSEILNLSMKYEVL----DGEPDGSQSKLRSWFKRQM--GTIKKNEEVMS 1059
            I MNIYL  FD E   LSM+YE L    D + D S SKLR+WF+RQ+    +K      S
Sbjct: 297  ILMNIYLHLFDQEFYRLSMRYEALHPGFDKDEDMSYSKLRNWFRRQLKENDVKYTVNDDS 356

Query: 1060 GIRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFD-KTTDILPCLDPRGVK 1236
              RVHCCR MDEI  A+ GSK+V+L FKSEI  + + SL LD D + T+ILPC +  G++
Sbjct: 357  SPRVHCCRFMDEIFFAISGSKDVALSFKSEIVDFFKNSLELDVDDEQTEILPCNESNGIR 416

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++    RAVHKLKEKV+L+  QKQ+AW+ GTV IG+K L HG KKVKESE
Sbjct: 417  FLGALVRRSVQEGPATRAVHKLKEKVKLFASQKQDAWNAGTVGIGRKWLAHGLKKVKESE 476

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            I+ LADS S LS++S FRK+GMETDHWYKVL KIWMQ+   K  E EESILSK + E +L
Sbjct: 477  IEHLADSGSTLSKISCFRKAGMETDHWYKVLTKIWMQDIKAKAAENEESILSKCVVEPAL 536

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQELK+S+Y F   A  YV SET +TL LL +S ++  SV++TE++AP+ AI+  L RYG
Sbjct: 537  PQELKESYYEFLKRANEYVYSETAATLALLPNSSSNAGSVAITEIIAPVNAIKKRLLRYG 596

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T SEGYP    +LVL D+  IIDWFSGIV RWLRWYR CDNF E+KLII+  +RKSCIR
Sbjct: 597  LTTSEGYPHVVSLLVLQDNFQIIDWFSGIVCRWLRWYRECDNFNEIKLIISTILRKSCIR 656

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FDS+L RIP TE+ E E+  ET  S    SD+ L+YGI +SGLC
Sbjct: 657  TLAAKYRIHESEIEKQFDSELCRIPSTEEVEQELTYETSDSHSFDSDEALMYGISYSGLC 716

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVS+SRPC CFV+GC  AAP VYT+H M+RQKFP WKTGFSSCIHPS+N+RRIG
Sbjct: 717  LLSLARMVSQSRPCNCFVMGCSMAAPSVYTLHAMERQKFPGWKTGFSSCIHPSLNKRRIG 776

Query: 2305 LCNQHLKDLFLGHISLQSISFGA 2373
            LC +HLKDL+LGHISLQSI+FGA
Sbjct: 777  LCKKHLKDLYLGHISLQSINFGA 799


>ref|XP_021300995.1| uncharacterized protein LOC110429335 isoform X2 [Herrania umbratica]
          Length = 761

 Score =  928 bits (2398), Expect = 0.0
 Identities = 468/743 (62%), Positives = 574/743 (77%), Gaps = 11/743 (1%)
 Frame = +1

Query: 178  DKMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIA 357
            +KM LA +LA LVEESS+ +ERK  SR+ELKR LEL IKKRVK+Q+++GKFH++  KVIA
Sbjct: 17   EKMMLAKDLACLVEESSNQDERKAKSRMELKRSLELRIKKRVKEQYLNGKFHNLMAKVIA 76

Query: 358  NPYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETL 537
            NP TL+ AY+ IR+NSNV++S + D++ F SLAEEL  GSFD+KAN +S+STRGA+KE L
Sbjct: 77   NPATLQDAYNCIRLNSNVDISVKDDSVCFKSLAEELLEGSFDVKANTFSVSTRGASKEVL 136

Query: 538  VLPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLV 717
            VLPNLK+ I+QEAIRI LEVVYKPH+SKISHGCRSGR  S+A +YI KEI + +WWFTLV
Sbjct: 137  VLPNLKMRIVQEAIRIVLEVVYKPHFSKISHGCRSGRDHSTALRYISKEIASPSWWFTLV 196

Query: 718  LNRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSP 897
            LN+K+D+ IL KLIS +E K+ED +L   I +MFD   +N EFGGF KGHGLPQEG+LSP
Sbjct: 197  LNKKVDSSILAKLISKLEDKVEDNQLLGTIQSMFDGQALNFEFGGFPKGHGLPQEGVLSP 256

Query: 898  IFMNIYLDNFDSEILNLSMKYEVL----DGEPDGSQSKLRSWFKRQM--GTIKKNEEVMS 1059
            I MNIYL  FD E   LSM YE L    D + D S SKLR+WF+RQ+     K  +   S
Sbjct: 257  ILMNIYLHLFDQEFYRLSMTYEALHPGFDKDEDMSYSKLRNWFRRQLKENDAKYTDNDDS 316

Query: 1060 GIRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFD-KTTDILPCLDPRGVK 1236
              RVHCCR MDEI  A+ GSK+V+L FKSE+  + + SL LD D + T+ILPC +  G++
Sbjct: 317  SPRVHCCRFMDEIFFAISGSKDVALSFKSEVVDFFKNSLELDVDDEQTEILPCNESNGIR 376

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++    RAVHKLKEKV+L+  QKQ AW+ GTV IG+K L HG KKVKESE
Sbjct: 377  FLGALVRRSVQEGPATRAVHKLKEKVKLFASQKQHAWNAGTVGIGRKWLAHGLKKVKESE 436

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            I+ LADS S LS++S FRK+GMETDHWYKVL KIWMQ+   K  E EESILSK++ E +L
Sbjct: 437  IEHLADSGSTLSKISCFRKAGMETDHWYKVLTKIWMQDIKAKAAENEESILSKYVVEPAL 496

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQELK+S+Y F   A  YV SET +TL LL +S A+  SV++TE+VAP+ AI+  L RYG
Sbjct: 497  PQELKESYYEFLKRANEYVYSETVATLALLPNSSANTGSVAITEIVAPVNAIKKRLLRYG 556

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T+SEGYPR   +LVL D+  IIDWFSGIV RWLRWYR CDNF E+KLII+  +RKSCIR
Sbjct: 557  LTSSEGYPRVVSLLVLQDNFQIIDWFSGIVCRWLRWYRECDNFNEIKLIISTILRKSCIR 616

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FDS+L RIP TE+ E E   ET  S    SD+ L+YGI +SGLC
Sbjct: 617  TLAAKYRIHESEIEKQFDSELCRIPSTEEVEQEPTYETSDSHSFDSDEALMYGISYSGLC 676

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVS+SR C CFV+GC  AAP VYT+H M+RQKFP WKTGFSSCIHPS+N+RRIG
Sbjct: 677  LLSLARMVSQSRACNCFVMGCSMAAPSVYTLHAMERQKFPGWKTGFSSCIHPSLNKRRIG 736

Query: 2305 LCNQHLKDLFLGHISLQSISFGA 2373
            LC +HLKDL+LGHISLQSI+FGA
Sbjct: 737  LCKKHLKDLYLGHISLQSINFGA 759


>ref|XP_021300992.1| uncharacterized protein LOC110429335 isoform X1 [Herrania umbratica]
 ref|XP_021300993.1| uncharacterized protein LOC110429335 isoform X1 [Herrania umbratica]
 ref|XP_021300994.1| uncharacterized protein LOC110429335 isoform X1 [Herrania umbratica]
          Length = 802

 Score =  928 bits (2398), Expect = 0.0
 Identities = 468/743 (62%), Positives = 574/743 (77%), Gaps = 11/743 (1%)
 Frame = +1

Query: 178  DKMSLASNLAFLVEESSSVNERKPMSRLELKRYLELCIKKRVKDQFIDGKFHDVFEKVIA 357
            +KM LA +LA LVEESS+ +ERK  SR+ELKR LEL IKKRVK+Q+++GKFH++  KVIA
Sbjct: 58   EKMMLAKDLACLVEESSNQDERKAKSRMELKRSLELRIKKRVKEQYLNGKFHNLMAKVIA 117

Query: 358  NPYTLRYAYDSIRVNSNVNLSSECDAIQFDSLAEELSSGSFDIKANIYSISTRGANKETL 537
            NP TL+ AY+ IR+NSNV++S + D++ F SLAEEL  GSFD+KAN +S+STRGA+KE L
Sbjct: 118  NPATLQDAYNCIRLNSNVDISVKDDSVCFKSLAEELLEGSFDVKANTFSVSTRGASKEVL 177

Query: 538  VLPNLKLNIIQEAIRIALEVVYKPHYSKISHGCRSGRGQSSAFKYICKEIQNTNWWFTLV 717
            VLPNLK+ I+QEAIRI LEVVYKPH+SKISHGCRSGR  S+A +YI KEI + +WWFTLV
Sbjct: 178  VLPNLKMRIVQEAIRIVLEVVYKPHFSKISHGCRSGRDHSTALRYISKEIASPSWWFTLV 237

Query: 718  LNRKMDTLILRKLISTMESKIEDPRLYAIIHTMFDAGVINMEFGGFAKGHGLPQEGILSP 897
            LN+K+D+ IL KLIS +E K+ED +L   I +MFD   +N EFGGF KGHGLPQEG+LSP
Sbjct: 238  LNKKVDSSILAKLISKLEDKVEDNQLLGTIQSMFDGQALNFEFGGFPKGHGLPQEGVLSP 297

Query: 898  IFMNIYLDNFDSEILNLSMKYEVL----DGEPDGSQSKLRSWFKRQM--GTIKKNEEVMS 1059
            I MNIYL  FD E   LSM YE L    D + D S SKLR+WF+RQ+     K  +   S
Sbjct: 298  ILMNIYLHLFDQEFYRLSMTYEALHPGFDKDEDMSYSKLRNWFRRQLKENDAKYTDNDDS 357

Query: 1060 GIRVHCCRSMDEILVAVKGSKEVSLDFKSEIETYMRESLYLDFD-KTTDILPCLDPRGVK 1236
              RVHCCR MDEI  A+ GSK+V+L FKSE+  + + SL LD D + T+ILPC +  G++
Sbjct: 358  SPRVHCCRFMDEIFFAISGSKDVALSFKSEVVDFFKNSLELDVDDEQTEILPCNESNGIR 417

Query: 1237 FIGNTVKKIVKDSSTVRAVHKLKEKVELYTLQKQEAWDEGTVRIGKKILGHGFKKVKESE 1416
            F+G  V++ V++    RAVHKLKEKV+L+  QKQ AW+ GTV IG+K L HG KKVKESE
Sbjct: 418  FLGALVRRSVQEGPATRAVHKLKEKVKLFASQKQHAWNAGTVGIGRKWLAHGLKKVKESE 477

Query: 1417 IKQLADSNSVLSQVSRFRKSGMETDHWYKVLVKIWMQEAVVKYTETEESILSKFIAEKSL 1596
            I+ LADS S LS++S FRK+GMETDHWYKVL KIWMQ+   K  E EESILSK++ E +L
Sbjct: 478  IEHLADSGSTLSKISCFRKAGMETDHWYKVLTKIWMQDIKAKAAENEESILSKYVVEPAL 537

Query: 1597 PQELKDSFYTFQNHAKNYVSSETKSTLDLLLDSGASPESVSVTEVVAPIKAIRMCLERYG 1776
            PQELK+S+Y F   A  YV SET +TL LL +S A+  SV++TE+VAP+ AI+  L RYG
Sbjct: 538  PQELKESYYEFLKRANEYVYSETVATLALLPNSSANTGSVAITEIVAPVNAIKKRLLRYG 597

Query: 1777 ITNSEGYPRACRMLVLLDHDHIIDWFSGIVNRWLRWYRSCDNFTEVKLIITEQVRKSCIR 1956
            +T+SEGYPR   +LVL D+  IIDWFSGIV RWLRWYR CDNF E+KLII+  +RKSCIR
Sbjct: 598  LTSSEGYPRVVSLLVLQDNFQIIDWFSGIVCRWLRWYRECDNFNEIKLIISTILRKSCIR 657

Query: 1957 TLATKYRLHELEIERKFDSDLSRIPLTEDFESEVISETLVS----SDDGLLYGIPHSGLC 2124
            TLA KYR+HE EIE++FDS+L RIP TE+ E E   ET  S    SD+ L+YGI +SGLC
Sbjct: 658  TLAAKYRIHESEIEKQFDSELCRIPSTEEVEQEPTYETSDSHSFDSDEALMYGISYSGLC 717

Query: 2125 LLSLARMVSESRPCTCFVLGCRDAAPCVYTVHVMQRQKFPVWKTGFSSCIHPSINRRRIG 2304
            LLSLARMVS+SR C CFV+GC  AAP VYT+H M+RQKFP WKTGFSSCIHPS+N+RRIG
Sbjct: 718  LLSLARMVSQSRACNCFVMGCSMAAPSVYTLHAMERQKFPGWKTGFSSCIHPSLNKRRIG 777

Query: 2305 LCNQHLKDLFLGHISLQSISFGA 2373
            LC +HLKDL+LGHISLQSI+FGA
Sbjct: 778  LCKKHLKDLYLGHISLQSINFGA 800


Top