BLASTX nr result

ID: Chrysanthemum21_contig00008856 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00008856
         (1703 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVH88044.1| Poly A polymerase, head domain-containing protein...   827   0.0  
ref|XP_023733521.1| uncharacterized protein LOC111881359 isoform...   771   0.0  
ref|XP_023733520.1| uncharacterized protein LOC111881359 isoform...   769   0.0  
ref|XP_023733522.1| uncharacterized protein LOC111881359 isoform...   769   0.0  
gb|OTG17290.1| putative poly A polymerase, head domain-containin...   749   0.0  
ref|XP_021974381.1| uncharacterized protein LOC110869437 [Helian...   749   0.0  
ref|XP_019264330.1| PREDICTED: uncharacterized protein LOC109241...   665   0.0  
ref|XP_009604520.1| PREDICTED: uncharacterized protein LOC104099...   663   0.0  
ref|XP_016450657.1| PREDICTED: poly(A) polymerase I-like [Nicoti...   660   0.0  
ref|XP_009779993.1| PREDICTED: putative CCA tRNA nucleotidyltran...   659   0.0  
ref|XP_002266814.1| PREDICTED: uncharacterized protein LOC100259...   656   0.0  
ref|XP_006361912.1| PREDICTED: poly(A) polymerase I-like isoform...   650   0.0  
ref|XP_009604528.1| PREDICTED: uncharacterized protein LOC104099...   648   0.0  
ref|XP_023908023.1| uncharacterized protein LOC112019738 [Quercu...   645   0.0  
ref|XP_010249182.1| PREDICTED: uncharacterized protein LOC104591...   644   0.0  
ref|XP_009779994.1| PREDICTED: uncharacterized protein LOC104229...   643   0.0  
ref|XP_019181262.1| PREDICTED: uncharacterized protein LOC109176...   643   0.0  
ref|XP_021285979.1| uncharacterized protein LOC110417781 isoform...   639   0.0  
ref|XP_021285981.1| uncharacterized protein LOC110417781 isoform...   635   0.0  
ref|XP_007038104.2| PREDICTED: poly(A) polymerase I [Theobroma c...   637   0.0  

>gb|KVH88044.1| Poly A polymerase, head domain-containing protein, partial [Cynara
            cardunculus var. scolymus]
          Length = 579

 Score =  827 bits (2136), Expect = 0.0
 Identities = 409/491 (83%), Positives = 448/491 (91%), Gaps = 2/491 (0%)
 Frame = +1

Query: 1    ESVSVNNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRK 180
            ESVSV N+   +Q S+ARNVKGGNDDCKPH+WKK CSKELGI+TS+ITKPA+ VLNVLRK
Sbjct: 90   ESVSVRNE--DFQISTARNVKGGNDDCKPHQWKKSCSKELGIKTSKITKPAKFVLNVLRK 147

Query: 181  KGYDVYLVGGCVRDLVLKRTPKDFDILTTAELKEV--MRAFPRCEIVGRRFPICHVHVDD 354
            KGY+VYLVGGCVRDLVL+RTPKDFDILT+AELKEV  M AFPRCEIVGRRFPICHVHVDD
Sbjct: 148  KGYEVYLVGGCVRDLVLERTPKDFDILTSAELKEVPVMGAFPRCEIVGRRFPICHVHVDD 207

Query: 355  AIVEVSSFSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVY 534
            AIVEVSSFST GRKFGRKSK  +RKPSGC+ECD+IRW+NCMQRDFTINGLMFDPF++IVY
Sbjct: 208  AIVEVSSFSTTGRKFGRKSKPVLRKPSGCNECDYIRWRNCMQRDFTINGLMFDPFSKIVY 267

Query: 535  DYIGGMEDIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSL 714
            DYIGGMEDIQ+AKVRCI PANTSFVEDCARILRG+RIAARLGFR SRETSHF+KELSNSL
Sbjct: 268  DYIGGMEDIQRAKVRCIIPANTSFVEDCARILRGIRIAARLGFRLSRETSHFVKELSNSL 327

Query: 715  LRLDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMX 894
            LRLDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLP+QASYLVSHGFRRRDKRSNM 
Sbjct: 328  LRLDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPVQASYLVSHGFRRRDKRSNML 387

Query: 895  XXXXXXXXXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVD 1074
                        PDRPCHSCLWVS+LAFHEALV+QPRD LVI AFSIAVHSGGSLS+AVD
Sbjct: 388  LSLFASLDKLLAPDRPCHSCLWVSILAFHEALVDQPRDVLVIAAFSIAVHSGGSLSEAVD 447

Query: 1075 IAREISQPHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYP 1254
            IAREISQPHD SFHEISEPI  YSKD L+ EVI+L ASVK ALRRLTDE++VSQALI+YP
Sbjct: 448  IAREISQPHDMSFHEISEPI-CYSKDALVDEVIKLAASVKAALRRLTDEHYVSQALIKYP 506

Query: 1255 QAPQSDMVFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGR 1434
            QAPQSD+VFISW+LSLKV S+FEC+++ +NRR +PK GSEIDYDSLA+GRL EVR++FGR
Sbjct: 507  QAPQSDLVFISWSLSLKVCSMFECVKRVKNRRLIPKEGSEIDYDSLAIGRLQEVRNVFGR 566

Query: 1435 VVFDTVYPTKL 1467
            VVFDTVYP KL
Sbjct: 567  VVFDTVYPLKL 577


>ref|XP_023733521.1| uncharacterized protein LOC111881359 isoform X2 [Lactuca sativa]
          Length = 532

 Score =  771 bits (1992), Expect = 0.0
 Identities = 394/493 (79%), Positives = 434/493 (88%), Gaps = 4/493 (0%)
 Frame = +1

Query: 1    ESVSVNNDHNSYQTSSARNVKG-GNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLR 177
            ESVSV+N+ N  Q S+AR VKG GND+ KPHEWKKLCSKELG++TSRI KPA+ VLNVLR
Sbjct: 42   ESVSVHNEDN--QLSTARIVKGRGNDESKPHEWKKLCSKELGVKTSRIIKPAKFVLNVLR 99

Query: 178  KKGYDVYLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDA 357
            KKGY+VYLVGGCVRDL+LKRTPKDFDILT+AELKEVMRAFP CEIVGRRFPICHVHVDDA
Sbjct: 100  KKGYEVYLVGGCVRDLILKRTPKDFDILTSAELKEVMRAFPHCEIVGRRFPICHVHVDDA 159

Query: 358  IVEVSSFSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYD 537
            IVEVSSFST GR+    SK  +RKP GC+E DFIRW+NC+QRDFTINGLMFDPFARIVYD
Sbjct: 160  IVEVSSFSTTGRR----SKFSLRKPKGCNESDFIRWRNCVQRDFTINGLMFDPFARIVYD 215

Query: 538  YIGGMEDIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLL 717
            YIGGMEDIQKAKVRCI PAN SFVEDCARILRGVRIAARL F+FSRETSHF+KELS+SLL
Sbjct: 216  YIGGMEDIQKAKVRCIAPANISFVEDCARILRGVRIAARLRFQFSRETSHFVKELSDSLL 275

Query: 718  RLDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXX 897
            RLDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYL+SHGFRRRDKRSNM  
Sbjct: 276  RLDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLISHGFRRRDKRSNMLL 335

Query: 898  XXXXXXXXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDI 1077
                       PDRPCH CLWV +LAFHEALVE+ RD+LVIGAFSIAVH GGSLS+AVDI
Sbjct: 336  SLFGSLDKLVAPDRPCHCCLWVGILAFHEALVEEGRDSLVIGAFSIAVHGGGSLSEAVDI 395

Query: 1078 AREISQPHDTSFHE--ISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQY 1251
            A +IS P +TSFHE  IS   Y YSK ELM EV+RL ASVK ALRRLTDE+FVSQALI Y
Sbjct: 396  AMKISPP-ETSFHEVIISPTTYLYSKHELMEEVLRLAASVKAALRRLTDEHFVSQALINY 454

Query: 1252 PQAPQSDMVFISWALSLKVSSIFECIRKGRNRR-FVPKRGSEIDYDSLALGRLDEVRDIF 1428
            PQAPQS +VFISWALSLKV+SIF+C+++G+ RR F+PK+G+EIDY SLALGRLDEVR IF
Sbjct: 455  PQAPQSHLVFISWALSLKVNSIFDCVKRGKTRRTFLPKQGNEIDYQSLALGRLDEVRAIF 514

Query: 1429 GRVVFDTVYPTKL 1467
            GR+VFDT+YP+ L
Sbjct: 515  GRLVFDTLYPSNL 527


>ref|XP_023733520.1| uncharacterized protein LOC111881359 isoform X1 [Lactuca sativa]
 gb|PLY74045.1| hypothetical protein LSAT_8X147080 [Lactuca sativa]
          Length = 533

 Score =  770 bits (1987), Expect = 0.0
 Identities = 389/492 (79%), Positives = 429/492 (87%), Gaps = 3/492 (0%)
 Frame = +1

Query: 1    ESVSVNNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRK 180
            ESVSV+N+ N   T+    V  GND+ KPHEWKKLCSKELG++TSRI KPA+ VLNVLRK
Sbjct: 42   ESVSVHNEDNQLSTARIVKVGRGNDESKPHEWKKLCSKELGVKTSRIIKPAKFVLNVLRK 101

Query: 181  KGYDVYLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAI 360
            KGY+VYLVGGCVRDL+LKRTPKDFDILT+AELKEVMRAFP CEIVGRRFPICHVHVDDAI
Sbjct: 102  KGYEVYLVGGCVRDLILKRTPKDFDILTSAELKEVMRAFPHCEIVGRRFPICHVHVDDAI 161

Query: 361  VEVSSFSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDY 540
            VEVSSFST GR+    SK  +RKP GC+E DFIRW+NC+QRDFTINGLMFDPFARIVYDY
Sbjct: 162  VEVSSFSTTGRR----SKFSLRKPKGCNESDFIRWRNCVQRDFTINGLMFDPFARIVYDY 217

Query: 541  IGGMEDIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLR 720
            IGGMEDIQKAKVRCI PAN SFVEDCARILRGVRIAARL F+FSRETSHF+KELS+SLLR
Sbjct: 218  IGGMEDIQKAKVRCIAPANISFVEDCARILRGVRIAARLRFQFSRETSHFVKELSDSLLR 277

Query: 721  LDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXX 900
            LDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYL+SHGFRRRDKRSNM   
Sbjct: 278  LDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLISHGFRRRDKRSNMLLS 337

Query: 901  XXXXXXXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIA 1080
                      PDRPCH CLWV +LAFHEALVE+ RD+LVIGAFSIAVH GGSLS+AVDIA
Sbjct: 338  LFGSLDKLVAPDRPCHCCLWVGILAFHEALVEEGRDSLVIGAFSIAVHGGGSLSEAVDIA 397

Query: 1081 REISQPHDTSFHE--ISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYP 1254
             +IS P +TSFHE  IS   Y YSK ELM EV+RL ASVK ALRRLTDE+FVSQALI YP
Sbjct: 398  MKISPP-ETSFHEVIISPTTYLYSKHELMEEVLRLAASVKAALRRLTDEHFVSQALINYP 456

Query: 1255 QAPQSDMVFISWALSLKVSSIFECIRKGRNRR-FVPKRGSEIDYDSLALGRLDEVRDIFG 1431
            QAPQS +VFISWALSLKV+SIF+C+++G+ RR F+PK+G+EIDY SLALGRLDEVR IFG
Sbjct: 457  QAPQSHLVFISWALSLKVNSIFDCVKRGKTRRTFLPKQGNEIDYQSLALGRLDEVRAIFG 516

Query: 1432 RVVFDTVYPTKL 1467
            R+VFDT+YP+ L
Sbjct: 517  RLVFDTLYPSNL 528


>ref|XP_023733522.1| uncharacterized protein LOC111881359 isoform X3 [Lactuca sativa]
          Length = 530

 Score =  769 bits (1986), Expect = 0.0
 Identities = 393/492 (79%), Positives = 433/492 (88%), Gaps = 3/492 (0%)
 Frame = +1

Query: 1    ESVSVNNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRK 180
            ESVSV+N+ N  Q S+AR VKG ND+ KPHEWKKLCSKELG++TSRI KPA+ VLNVLRK
Sbjct: 42   ESVSVHNEDN--QLSTARIVKG-NDESKPHEWKKLCSKELGVKTSRIIKPAKFVLNVLRK 98

Query: 181  KGYDVYLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAI 360
            KGY+VYLVGGCVRDL+LKRTPKDFDILT+AELKEVMRAFP CEIVGRRFPICHVHVDDAI
Sbjct: 99   KGYEVYLVGGCVRDLILKRTPKDFDILTSAELKEVMRAFPHCEIVGRRFPICHVHVDDAI 158

Query: 361  VEVSSFSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDY 540
            VEVSSFST GR+    SK  +RKP GC+E DFIRW+NC+QRDFTINGLMFDPFARIVYDY
Sbjct: 159  VEVSSFSTTGRR----SKFSLRKPKGCNESDFIRWRNCVQRDFTINGLMFDPFARIVYDY 214

Query: 541  IGGMEDIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLR 720
            IGGMEDIQKAKVRCI PAN SFVEDCARILRGVRIAARL F+FSRETSHF+KELS+SLLR
Sbjct: 215  IGGMEDIQKAKVRCIAPANISFVEDCARILRGVRIAARLRFQFSRETSHFVKELSDSLLR 274

Query: 721  LDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXX 900
            LDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYL+SHGFRRRDKRSNM   
Sbjct: 275  LDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLISHGFRRRDKRSNMLLS 334

Query: 901  XXXXXXXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIA 1080
                      PDRPCH CLWV +LAFHEALVE+ RD+LVIGAFSIAVH GGSLS+AVDIA
Sbjct: 335  LFGSLDKLVAPDRPCHCCLWVGILAFHEALVEEGRDSLVIGAFSIAVHGGGSLSEAVDIA 394

Query: 1081 REISQPHDTSFHE--ISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYP 1254
             +IS P +TSFHE  IS   Y YSK ELM EV+RL ASVK ALRRLTDE+FVSQALI YP
Sbjct: 395  MKISPP-ETSFHEVIISPTTYLYSKHELMEEVLRLAASVKAALRRLTDEHFVSQALINYP 453

Query: 1255 QAPQSDMVFISWALSLKVSSIFECIRKGRNRR-FVPKRGSEIDYDSLALGRLDEVRDIFG 1431
            QAPQS +VFISWALSLKV+SIF+C+++G+ RR F+PK+G+EIDY SLALGRLDEVR IFG
Sbjct: 454  QAPQSHLVFISWALSLKVNSIFDCVKRGKTRRTFLPKQGNEIDYQSLALGRLDEVRAIFG 513

Query: 1432 RVVFDTVYPTKL 1467
            R+VFDT+YP+ L
Sbjct: 514  RLVFDTLYPSNL 525


>gb|OTG17290.1| putative poly A polymerase, head domain-containing protein
            [Helianthus annuus]
          Length = 432

 Score =  749 bits (1934), Expect = 0.0
 Identities = 366/431 (84%), Positives = 396/431 (91%)
 Frame = +1

Query: 184  GYDVYLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIV 363
            GY+VYLVGGCVRDLVLKRTPKDFDILT+AELKEVMRAFPRCEIVGRRFPICHVHVDDAIV
Sbjct: 2    GYEVYLVGGCVRDLVLKRTPKDFDILTSAELKEVMRAFPRCEIVGRRFPICHVHVDDAIV 61

Query: 364  EVSSFSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYI 543
            EVSSFST GRKFGRKSKS MRKPSGCDE DF+RW+NCMQRDFTINGLMFDPFARIVYDY+
Sbjct: 62   EVSSFSTIGRKFGRKSKSSMRKPSGCDEYDFVRWRNCMQRDFTINGLMFDPFARIVYDYV 121

Query: 544  GGMEDIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRL 723
            GGM+DIQKAKVRCI PA+TSF EDCARILRG+RIAARLGFRFSRETSH+LKE S+SLLRL
Sbjct: 122  GGMKDIQKAKVRCIIPASTSFAEDCARILRGIRIAARLGFRFSRETSHYLKEFSDSLLRL 181

Query: 724  DKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXX 903
            DKGRIH+EMNYMLAYGSAEASLRLLWKFGLLEILLP+QASYLVSHGFRRRDKRSNM    
Sbjct: 182  DKGRIHLEMNYMLAYGSAEASLRLLWKFGLLEILLPLQASYLVSHGFRRRDKRSNMLLSL 241

Query: 904  XXXXXXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAR 1083
                     PDRPCHSCLWVS+LAFHEALVEQPRDALVI AFSIAVH GGSLS+AVDIA+
Sbjct: 242  FASLDKLLAPDRPCHSCLWVSILAFHEALVEQPRDALVIAAFSIAVHRGGSLSEAVDIAK 301

Query: 1084 EISQPHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAP 1263
            +ISQPH+TSFHEIS+  Y+Y+KDELM EVI+L  SVK  LR+LTDEN VSQAL+QYPQAP
Sbjct: 302  QISQPHETSFHEISQSCYAYTKDELMDEVIKLADSVKSTLRKLTDENIVSQALVQYPQAP 361

Query: 1264 QSDMVFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVF 1443
            QSDMVFISWALSLKV S+F+C+++GRNRRFV  RG EIDY+SLALGRLDEVRDIFGRVVF
Sbjct: 362  QSDMVFISWALSLKVCSMFDCVKRGRNRRFVSNRGGEIDYESLALGRLDEVRDIFGRVVF 421

Query: 1444 DTVYPTKLAEQ 1476
            DTVYP +LA Q
Sbjct: 422  DTVYPIQLATQ 432


>ref|XP_021974381.1| uncharacterized protein LOC110869437 [Helianthus annuus]
          Length = 454

 Score =  749 bits (1934), Expect = 0.0
 Identities = 366/431 (84%), Positives = 396/431 (91%)
 Frame = +1

Query: 184  GYDVYLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIV 363
            GY+VYLVGGCVRDLVLKRTPKDFDILT+AELKEVMRAFPRCEIVGRRFPICHVHVDDAIV
Sbjct: 24   GYEVYLVGGCVRDLVLKRTPKDFDILTSAELKEVMRAFPRCEIVGRRFPICHVHVDDAIV 83

Query: 364  EVSSFSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYI 543
            EVSSFST GRKFGRKSKS MRKPSGCDE DF+RW+NCMQRDFTINGLMFDPFARIVYDY+
Sbjct: 84   EVSSFSTIGRKFGRKSKSSMRKPSGCDEYDFVRWRNCMQRDFTINGLMFDPFARIVYDYV 143

Query: 544  GGMEDIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRL 723
            GGM+DIQKAKVRCI PA+TSF EDCARILRG+RIAARLGFRFSRETSH+LKE S+SLLRL
Sbjct: 144  GGMKDIQKAKVRCIIPASTSFAEDCARILRGIRIAARLGFRFSRETSHYLKEFSDSLLRL 203

Query: 724  DKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXX 903
            DKGRIH+EMNYMLAYGSAEASLRLLWKFGLLEILLP+QASYLVSHGFRRRDKRSNM    
Sbjct: 204  DKGRIHLEMNYMLAYGSAEASLRLLWKFGLLEILLPLQASYLVSHGFRRRDKRSNMLLSL 263

Query: 904  XXXXXXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAR 1083
                     PDRPCHSCLWVS+LAFHEALVEQPRDALVI AFSIAVH GGSLS+AVDIA+
Sbjct: 264  FASLDKLLAPDRPCHSCLWVSILAFHEALVEQPRDALVIAAFSIAVHRGGSLSEAVDIAK 323

Query: 1084 EISQPHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAP 1263
            +ISQPH+TSFHEIS+  Y+Y+KDELM EVI+L  SVK  LR+LTDEN VSQAL+QYPQAP
Sbjct: 324  QISQPHETSFHEISQSCYAYTKDELMDEVIKLADSVKSTLRKLTDENIVSQALVQYPQAP 383

Query: 1264 QSDMVFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVF 1443
            QSDMVFISWALSLKV S+F+C+++GRNRRFV  RG EIDY+SLALGRLDEVRDIFGRVVF
Sbjct: 384  QSDMVFISWALSLKVCSMFDCVKRGRNRRFVSNRGGEIDYESLALGRLDEVRDIFGRVVF 443

Query: 1444 DTVYPTKLAEQ 1476
            DTVYP +LA Q
Sbjct: 444  DTVYPIQLATQ 454


>ref|XP_019264330.1| PREDICTED: uncharacterized protein LOC109241957 [Nicotiana attenuata]
 gb|OIT36511.1| hypothetical protein A4A49_06650 [Nicotiana attenuata]
          Length = 527

 Score =  665 bits (1715), Expect = 0.0
 Identities = 326/484 (67%), Positives = 399/484 (82%)
 Frame = +1

Query: 22   DHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYL 201
            ++ S+ TS   +VKG  D+  P +WKKL S+ELGI TS I KP R VLN L+KKG++VYL
Sbjct: 50   NNRSHNTS---HVKG--DESGP-KWKKLSSEELGISTSMIAKPTRVVLNGLKKKGFEVYL 103

Query: 202  VGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFS 381
            VGGCVRDL+L +TPKDFDI+T+AELKEV++ F RCEIVGRRFPICHVHVDD IVEVSSF+
Sbjct: 104  VGGCVRDLLLNKTPKDFDIITSAELKEVLKTFQRCEIVGRRFPICHVHVDDTIVEVSSFN 163

Query: 382  TNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDI 561
            T GR+F R S + +R+P+ CDE DFIRW+NC+ RDFTINGLMFDPFARIVYDY+GG+EDI
Sbjct: 164  TTGRRFKRNSYNVVRRPAMCDEADFIRWKNCLGRDFTINGLMFDPFARIVYDYLGGLEDI 223

Query: 562  QKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIH 741
            ++AKVRC+ PA+ SF+EDCARILRGVRIA RLGFRFSRET+HF+KEL++S+ RLDKGRI 
Sbjct: 224  RRAKVRCVIPASASFIEDCARILRGVRIAGRLGFRFSRETAHFVKELASSISRLDKGRIL 283

Query: 742  MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXX 921
            MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASY +S GFRRRDKRSNM          
Sbjct: 284  MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYFISQGFRRRDKRSNMLLTLFSTLDN 343

Query: 922  XXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPH 1101
               PDRPCHS LW+++LAFH+ALV++PRD LV+ AFS+AVH GGSLSD + +AR+ISQPH
Sbjct: 344  LLAPDRPCHSSLWIAILAFHKALVDRPRDPLVVAAFSVAVHCGGSLSDVLGVARKISQPH 403

Query: 1102 DTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVF 1281
            DT F E+ +     S + L+ E++ L   V+ ALR++TDE+F+SQAL +YPQAP+SD+VF
Sbjct: 404  DTRFSELLDFRIVESDEALLDEMMDLATYVEAALRKMTDEHFISQALTEYPQAPKSDLVF 463

Query: 1282 ISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYPT 1461
            I WALS KV +IFEC+R+G+ + F  KRGS+IDY+SLALG+L E+R IF RVVFDTV+P 
Sbjct: 464  IPWALSQKVDAIFECVRRGKEKGFRRKRGSKIDYESLALGKLHEIRHIFARVVFDTVFPP 523

Query: 1462 KLAE 1473
             L +
Sbjct: 524  HLKD 527


>ref|XP_009604520.1| PREDICTED: uncharacterized protein LOC104099282 isoform X1 [Nicotiana
            tomentosiformis]
 ref|XP_016492628.1| PREDICTED: poly(A) polymerase I-like isoform X1 [Nicotiana tabacum]
          Length = 527

 Score =  663 bits (1711), Expect = 0.0
 Identities = 325/486 (66%), Positives = 398/486 (81%)
 Frame = +1

Query: 16   NNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDV 195
            N  HN+       +VKG  D+  P +WKKL S+ELGI TS I KP R VLN L+KKG++V
Sbjct: 51   NRSHNT------SDVKG--DENAP-KWKKLSSEELGISTSMIAKPTRVVLNGLKKKGFEV 101

Query: 196  YLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSS 375
            YLVGGCVRDL+L +TPKDFDI+T+AELKEV++ F RCEIVGRRFPICHVHVDD IVEVSS
Sbjct: 102  YLVGGCVRDLLLNKTPKDFDIITSAELKEVLKTFQRCEIVGRRFPICHVHVDDTIVEVSS 161

Query: 376  FSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGME 555
            F+T GR+F R S + +R+P+ CDE DFIRW+NC+ RDFTINGLMFDPFARIVYDY+GG+E
Sbjct: 162  FNTTGRRFRRNSYNVVRRPATCDEADFIRWKNCLGRDFTINGLMFDPFARIVYDYLGGLE 221

Query: 556  DIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGR 735
            DI++AKVRC+ PA+ SFVEDCARILRGVRIA RLGFRFSRET+HF+KEL++S+ RLDKGR
Sbjct: 222  DIRRAKVRCVIPASASFVEDCARILRGVRIAGRLGFRFSRETAHFVKELASSISRLDKGR 281

Query: 736  IHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXX 915
            I MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASY +S GFRRRDKRSNM        
Sbjct: 282  ILMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYFISQGFRRRDKRSNMLLTLFSTL 341

Query: 916  XXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQ 1095
                 PDRPCHS LW+++LAFH+ALV++PRD LV+ AFS+AVH GGSLSD + +AR+ISQ
Sbjct: 342  DNLLAPDRPCHSSLWIAILAFHKALVDRPRDPLVVAAFSVAVHCGGSLSDVLGVARKISQ 401

Query: 1096 PHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDM 1275
            PHDT F E+ +     S + L+ E++ L   V+ ALR++TDE+F+S+AL +YPQAP+SD+
Sbjct: 402  PHDTRFSELLDFRIVESDEALLDEMMDLATYVEAALRKMTDEHFISRALTEYPQAPKSDL 461

Query: 1276 VFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVY 1455
            VFI WALS KV +IFEC+R+G+ + F  KRGS+IDY+SLALG+L E+R IF RVVFDT++
Sbjct: 462  VFIPWALSQKVDAIFECVRRGKEKGFRRKRGSKIDYESLALGKLHEIRHIFARVVFDTMF 521

Query: 1456 PTKLAE 1473
            P+ L +
Sbjct: 522  PSHLKD 527


>ref|XP_016450657.1| PREDICTED: poly(A) polymerase I-like [Nicotiana tabacum]
          Length = 527

 Score =  660 bits (1702), Expect = 0.0
 Identities = 324/486 (66%), Positives = 395/486 (81%)
 Frame = +1

Query: 16   NNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDV 195
            N  HN+       +VKG  D+  P +WKKL S+ELGI TS I KP R VLN L+KKG++V
Sbjct: 51   NRSHNT------SDVKG--DESGP-KWKKLSSEELGISTSMIAKPTRVVLNGLKKKGFEV 101

Query: 196  YLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSS 375
            YLVGGCVRDL+L +TPKDFDI+T+AELKEV++ F RCEIVGRRFPICHVHVDD IVEVSS
Sbjct: 102  YLVGGCVRDLLLNKTPKDFDIITSAELKEVLKTFQRCEIVGRRFPICHVHVDDTIVEVSS 161

Query: 376  FSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGME 555
            F+T GR+F R S + +R+P+ CDE DFIRW+NC+ RDFTINGLMFDPFARIVYDY+GG+E
Sbjct: 162  FNTTGRRFKRNSYNVVRRPAMCDEADFIRWKNCLGRDFTINGLMFDPFARIVYDYLGGLE 221

Query: 556  DIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGR 735
            DI++AKVRC+ PA+ SF+EDCARILRGVRIA RLGFRFSRET+HF+KEL++S+ +LDKGR
Sbjct: 222  DIRRAKVRCVIPASASFIEDCARILRGVRIAGRLGFRFSRETAHFVKELASSISKLDKGR 281

Query: 736  IHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXX 915
            I MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASY +S GFRRRDKRSNM        
Sbjct: 282  ILMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYFISQGFRRRDKRSNMLLSLFSTL 341

Query: 916  XXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQ 1095
                 PDRPCHS LW+++LAFH+ALV++PRD LV+ AFS AVH GGSLSD + +A +ISQ
Sbjct: 342  DNLLAPDRPCHSSLWIAILAFHKALVDRPRDPLVVAAFSAAVHCGGSLSDVLGVAGKISQ 401

Query: 1096 PHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDM 1275
            PHDT F E+ +     S + L+ E++ L   V+ ALR++TDE+F+SQAL +YPQAP+SD+
Sbjct: 402  PHDTRFSELLDFRIVESDEALLDEMMDLATYVEAALRKMTDEHFISQALTEYPQAPKSDL 461

Query: 1276 VFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVY 1455
            VFI WALS KV +IFEC+R+G+ + F  KRGS+IDY+SLALG+L E+R IF RVVFDTV+
Sbjct: 462  VFIPWALSQKVDAIFECVRRGKEKGFRQKRGSKIDYESLALGKLHEIRHIFARVVFDTVF 521

Query: 1456 PTKLAE 1473
            P  L +
Sbjct: 522  PPHLKD 527


>ref|XP_009779993.1| PREDICTED: putative CCA tRNA nucleotidyltransferase 2 isoform X1
            [Nicotiana sylvestris]
          Length = 527

 Score =  659 bits (1699), Expect = 0.0
 Identities = 324/486 (66%), Positives = 394/486 (81%)
 Frame = +1

Query: 16   NNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDV 195
            N  HN+       +VKG  D+  P +WKKL S+ELGI TS I KP R VLN L+KKG++V
Sbjct: 51   NRSHNT------SDVKG--DESGP-KWKKLSSEELGISTSMIAKPTRVVLNGLKKKGFEV 101

Query: 196  YLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSS 375
            YLVGGCVRDL+L +TPKDFDI+T+AELKEV++ F RCEIVGRRFPICHVHVDD IVEVSS
Sbjct: 102  YLVGGCVRDLLLNKTPKDFDIITSAELKEVLKTFQRCEIVGRRFPICHVHVDDTIVEVSS 161

Query: 376  FSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGME 555
            F+T GR+F R S + +R+P+ CDE DFIRW+NC+ RDFTINGLMFDPFARIVYDY+GG+E
Sbjct: 162  FNTTGRRFKRNSYNVVRRPAMCDEADFIRWKNCLGRDFTINGLMFDPFARIVYDYLGGLE 221

Query: 556  DIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGR 735
            DI++AKVRC+ PA+ SF+EDCARILRGVRIA RLGFRFSRET+HF+KEL++S+ +LDKGR
Sbjct: 222  DIRRAKVRCVIPASASFIEDCARILRGVRIAGRLGFRFSRETAHFVKELASSISKLDKGR 281

Query: 736  IHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXX 915
            I MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASY +S GFRRRDKRSNM        
Sbjct: 282  ILMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYFISQGFRRRDKRSNMLLSLFSTL 341

Query: 916  XXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQ 1095
                 PDRPCHS LW+++LAFH+ALV++PRD LV+ AFS AVH GGSLSD + +A +ISQ
Sbjct: 342  DNLLAPDRPCHSSLWIAILAFHKALVDRPRDPLVVAAFSAAVHCGGSLSDVLGVAGKISQ 401

Query: 1096 PHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDM 1275
            PHDT F E+ +     S + L+ E++ L   V+ ALR++TDE F+SQAL +YPQAP+SD+
Sbjct: 402  PHDTRFSELLDFRIVESDEALLDEMMDLATYVEAALRKMTDEYFISQALTEYPQAPKSDL 461

Query: 1276 VFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVY 1455
            VFI WALS KV +IFEC+R+G+ + F  KRGS+IDY+SLALG+L E+R IF RVVFDTV+
Sbjct: 462  VFIPWALSQKVDAIFECVRRGKEKGFRQKRGSKIDYESLALGKLHEIRHIFARVVFDTVF 521

Query: 1456 PTKLAE 1473
            P  L +
Sbjct: 522  PPHLKD 527


>ref|XP_002266814.1| PREDICTED: uncharacterized protein LOC100259104 isoform X1 [Vitis
            vinifera]
 emb|CBI35659.3| unnamed protein product, partial [Vitis vinifera]
          Length = 537

 Score =  656 bits (1693), Expect = 0.0
 Identities = 326/490 (66%), Positives = 394/490 (80%), Gaps = 1/490 (0%)
 Frame = +1

Query: 1    ESVSVNNDHNSY-QTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLR 177
            E V V  +   +  TS  R    GN   K  EWKKL SK+LGIRTS I KP R VLN L+
Sbjct: 41   EPVGVTKEEEPHWATSDGR----GNGASKAPEWKKLNSKDLGIRTSMIAKPTRYVLNGLK 96

Query: 178  KKGYDVYLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDA 357
            KKGY+VYLVGGCVRDL+LKRTPKDFDI+T+AELKEV+RAFPRCE+VG+RFPICHVHV+D 
Sbjct: 97   KKGYEVYLVGGCVRDLILKRTPKDFDIITSAELKEVLRAFPRCEVVGKRFPICHVHVNDT 156

Query: 358  IVEVSSFSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYD 537
            IVEVSSFST+G++ GRK    +R+P  CD+ D+IRW+NC+QRDFTINGLMFDP+ +IVYD
Sbjct: 157  IVEVSSFSTSGKRTGRKLDYILRRPPDCDDHDYIRWRNCLQRDFTINGLMFDPYTKIVYD 216

Query: 538  YIGGMEDIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLL 717
            Y+GGM+DI+KAKVR + PAN SFVEDCARILRGVRIAARLGFRF+++ +H ++ELS S+L
Sbjct: 217  YMGGMQDIKKAKVRTVIPANISFVEDCARILRGVRIAARLGFRFTKDIAHSVRELSCSVL 276

Query: 718  RLDKGRIHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXX 897
            RLDKGRI MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQA+YLVS GFRRRD+RSNM  
Sbjct: 277  RLDKGRILMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQAAYLVSQGFRRRDQRSNMLL 336

Query: 898  XXXXXXXXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDI 1077
                       PDRPCH+ LW+ +LAFH+ALV+QPR  +V+ AFS+AVH+GGSLS+AV+I
Sbjct: 337  SLFSNLDRLVAPDRPCHNSLWIGMLAFHKALVDQPRHPMVVAAFSLAVHNGGSLSEAVEI 396

Query: 1078 AREISQPHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQ 1257
            AR ISQPHD SF E+ EP    S + L+ E++ L ASVK AL ++TDE+FVSQA+ +YP+
Sbjct: 397  ARRISQPHDQSFSELLEPQDLDSDESLIDEIMDLAASVKSALMKMTDEHFVSQAMSKYPR 456

Query: 1258 APQSDMVFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRV 1437
            AP SD+VFIS A  L+ S IF+C++ G  + FVPK+GS IDY+ LALG L EVR +F R+
Sbjct: 457  APYSDLVFISLASFLRASKIFQCVQGGAEKGFVPKQGSRIDYEFLALGSLREVRHVFARI 516

Query: 1438 VFDTVYPTKL 1467
            VFDTVYP  L
Sbjct: 517  VFDTVYPLSL 526


>ref|XP_006361912.1| PREDICTED: poly(A) polymerase I-like isoform X1 [Solanum tuberosum]
          Length = 520

 Score =  650 bits (1676), Expect = 0.0
 Identities = 316/473 (66%), Positives = 383/473 (80%)
 Frame = +1

Query: 55   NVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYLVGGCVRDLVLK 234
            N   G  D    +WKKL S+ELGI TS I KP R VLN L+KKG++VYLVGGCVRDL+L 
Sbjct: 49   NASDGKGDGDAPKWKKLSSEELGISTSMIAKPTRLVLNGLKKKGFEVYLVGGCVRDLILN 108

Query: 235  RTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFSTNGRKFGRKSK 414
            RTPKDFDILT+AELKEV++ F RCEIVGRRFPICHVH+DD IVEVSSF+  GRKF R   
Sbjct: 109  RTPKDFDILTSAELKEVLKIFQRCEIVGRRFPICHVHIDDTIVEVSSFTATGRKFKRNGY 168

Query: 415  SPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDIQKAKVRCITPA 594
            + +R+P  C E DFIRW+NC+ RDFTINGLMFDPFA+IVYDY+GG+EDI++AKVRC+ PA
Sbjct: 169  NVVRRPPACSEADFIRWKNCLARDFTINGLMFDPFAKIVYDYLGGLEDIRRAKVRCVIPA 228

Query: 595  NTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIHMEMNYMLAYGS 774
            N SF+EDCARILRGVRIA RL FRF+RET+HF+KEL++S+ RLDKGRI +EMNYMLAYGS
Sbjct: 229  NASFIEDCARILRGVRIAGRLRFRFARETAHFIKELASSISRLDKGRILLEMNYMLAYGS 288

Query: 775  AEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXXXXXPDRPCHSC 954
            AEASLRLLWKFGLLEILLPIQASYL+S GFRRRD+RSNM             P+RPCHS 
Sbjct: 289  AEASLRLLWKFGLLEILLPIQASYLISQGFRRRDRRSNMLLTLFSTLDNLLAPNRPCHSS 348

Query: 955  LWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPHDTSFHEISEPI 1134
            LW+++LAFH+ALV++PRD LV+ AFSIAVH GGSLSD + I R+ISQPHDT F E+ +  
Sbjct: 349  LWIAILAFHKALVDRPRDPLVVAAFSIAVHCGGSLSDVLGIVRKISQPHDTRFPELLDQN 408

Query: 1135 YSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVFISWALSLKVSS 1314
               S + L+ E++ L   V+ AL+ +TDE+FVS+ALI+YPQAP+SD+VFISW L+ KVS+
Sbjct: 409  IE-SDEALLDEMMDLATYVEAALQEMTDEHFVSRALIEYPQAPKSDLVFISWTLAQKVSA 467

Query: 1315 IFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYPTKLAE 1473
            IFEC+R+G+ + F  KRG +IDY+SLALG+L EVR IF  VVFDTV+P  L +
Sbjct: 468  IFECVRRGKEKDFRRKRGRKIDYESLALGKLREVRHIFAMVVFDTVFPPHLKD 520


>ref|XP_009604528.1| PREDICTED: uncharacterized protein LOC104099282 isoform X2 [Nicotiana
            tomentosiformis]
 ref|XP_016492629.1| PREDICTED: poly(A) polymerase I-like isoform X2 [Nicotiana tabacum]
          Length = 520

 Score =  648 bits (1671), Expect = 0.0
 Identities = 321/486 (66%), Positives = 393/486 (80%)
 Frame = +1

Query: 16   NNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDV 195
            N  HN+       +VKG  D+  P +WKKL S+ELGI TS I KP R VLN L+KKG++V
Sbjct: 51   NRSHNT------SDVKG--DENAP-KWKKLSSEELGISTSMIAKPTRVVLNGLKKKGFEV 101

Query: 196  YLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSS 375
            YLVGGCVRDL+L +TPKDFDI+T+AELKEV++ F RCEIVGRRFPICHVHVDD IVEVSS
Sbjct: 102  YLVGGCVRDLLLNKTPKDFDIITSAELKEVLKTFQRCEIVGRRFPICHVHVDDTIVEVSS 161

Query: 376  FSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGME 555
            F+T GR+F R S + +R+P+ CDE DFIRW+NC+ RDFTINGLMFDPFARIVYDY+GG+E
Sbjct: 162  FNTTGRRFRRNSYNVVRRPATCDEADFIRWKNCLGRDFTINGLMFDPFARIVYDYLGGLE 221

Query: 556  DIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGR 735
            DI++AKVRC+ PA+ SFVEDCARILRGVRIA RLGFRFSRET+HF+KEL++S+ RLDKGR
Sbjct: 222  DIRRAKVRCVIPASASFVEDCARILRGVRIAGRLGFRFSRETAHFVKELASSISRLDKGR 281

Query: 736  IHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXX 915
            I MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQ       GFRRRDKRSNM        
Sbjct: 282  ILMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQ-------GFRRRDKRSNMLLTLFSTL 334

Query: 916  XXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQ 1095
                 PDRPCHS LW+++LAFH+ALV++PRD LV+ AFS+AVH GGSLSD + +AR+ISQ
Sbjct: 335  DNLLAPDRPCHSSLWIAILAFHKALVDRPRDPLVVAAFSVAVHCGGSLSDVLGVARKISQ 394

Query: 1096 PHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDM 1275
            PHDT F E+ +     S + L+ E++ L   V+ ALR++TDE+F+S+AL +YPQAP+SD+
Sbjct: 395  PHDTRFSELLDFRIVESDEALLDEMMDLATYVEAALRKMTDEHFISRALTEYPQAPKSDL 454

Query: 1276 VFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVY 1455
            VFI WALS KV +IFEC+R+G+ + F  KRGS+IDY+SLALG+L E+R IF RVVFDT++
Sbjct: 455  VFIPWALSQKVDAIFECVRRGKEKGFRRKRGSKIDYESLALGKLHEIRHIFARVVFDTMF 514

Query: 1456 PTKLAE 1473
            P+ L +
Sbjct: 515  PSHLKD 520


>ref|XP_023908023.1| uncharacterized protein LOC112019738 [Quercus suber]
          Length = 528

 Score =  645 bits (1665), Expect = 0.0
 Identities = 318/467 (68%), Positives = 378/467 (80%)
 Frame = +1

Query: 76   DCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYLVGGCVRDLVLKRTPKDFD 255
            D K  EWK+L SKELGI TS ITKP R VLN L++KGYDVYLVGGCVRDL+L+RTPKDFD
Sbjct: 59   DQKALEWKRLNSKELGISTSMITKPTRKVLNGLKRKGYDVYLVGGCVRDLILQRTPKDFD 118

Query: 256  ILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFSTNGRKFGRKSKSPMRKPS 435
            I+T+AELKEVMR F  CEIVG+RFPICHVHV+D IVEVSSFST+GRKF R       KP 
Sbjct: 119  IITSAELKEVMRTFSWCEIVGKRFPICHVHVEDNIVEVSSFSTSGRKFDRDLSDDFEKPV 178

Query: 436  GCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDIQKAKVRCITPANTSFVED 615
            GCDE D+IRW+NC+QRDFTINGLMFDP+ARIVYDYIGGMED++K+KV+ + PA+TSF ED
Sbjct: 179  GCDEKDYIRWKNCLQRDFTINGLMFDPYARIVYDYIGGMEDLRKSKVQTVIPASTSFQED 238

Query: 616  CARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIHMEMNYMLAYGSAEASLRL 795
            CARILR +R+AARLGFRFSRET+HF+K LS S++RLDK R+ MEMNYMLAYGSAEASLRL
Sbjct: 239  CARILRAIRVAARLGFRFSRETAHFVKNLSCSVIRLDKARLLMEMNYMLAYGSAEASLRL 298

Query: 796  LWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXXXXXPDRPCHSCLWVSLLA 975
            LWKFGLLEILLPIQA+Y V HGFRRRDKRSNM             PDRPCHS LWV +LA
Sbjct: 299  LWKFGLLEILLPIQAAYFVRHGFRRRDKRSNMLLSLFSNMDKLLAPDRPCHSSLWVGILA 358

Query: 976  FHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPHDTSFHEISEPIYSYSKDE 1155
            FH AL E PRD +V+ AFS+AVH+GG + +A++IA++I+ PHD SFHE+SEP    S+  
Sbjct: 359  FHTALSEFPRDPMVVAAFSLAVHNGGDILEAINIAKKITAPHDVSFHELSEPQILDSR-A 417

Query: 1156 LMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVFISWALSLKVSSIFECIRK 1335
            +  EV+ L ASVK AL ++TDE+F+SQA+  YPQAP SDMVFI   L L+V  IFEC+R 
Sbjct: 418  MTHEVMDLAASVKSALCKMTDEHFISQAMSGYPQAPYSDMVFIPLGLYLRVCRIFECVRD 477

Query: 1336 GRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYPTKLAEQ 1476
            G  + FV K+GS+IDY+SLALG + EVR IF R+VFDTVYP +L ++
Sbjct: 478  GAEKGFVAKQGSKIDYESLALGGMQEVRHIFARIVFDTVYPLRLNQE 524


>ref|XP_010249182.1| PREDICTED: uncharacterized protein LOC104591829 [Nelumbo nucifera]
          Length = 523

 Score =  644 bits (1662), Expect = 0.0
 Identities = 324/473 (68%), Positives = 376/473 (79%), Gaps = 3/473 (0%)
 Frame = +1

Query: 58   VKGGNDDCKP-HE--WKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYLVGGCVRDLV 228
            VKG N +    HE  WK L SK+LGI TS I KP R VLN LR+KGY+VYLVGGCVRDL+
Sbjct: 44   VKGENHELISFHEIGWKTLNSKDLGITTSMIAKPTRVVLNGLRRKGYEVYLVGGCVRDLI 103

Query: 229  LKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFSTNGRKFGRK 408
            L+RTPKDFDI+T+AEL+EVMR F RCEIVGRRFPICHVH+DD IVEVSSFST+GRK  R 
Sbjct: 104  LQRTPKDFDIITSAELREVMRTFSRCEIVGRRFPICHVHIDDTIVEVSSFSTSGRKCNRD 163

Query: 409  SKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDIQKAKVRCIT 588
                 ++P GCDE DF+RW+NC+QRDFTINGLMFDP+A IVYDY+GGMED++KAKVR I 
Sbjct: 164  FSYFFKRPPGCDEHDFVRWRNCLQRDFTINGLMFDPYANIVYDYMGGMEDLKKAKVRTII 223

Query: 589  PANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIHMEMNYMLAY 768
            PA  SF EDCARILR VRIAARLGFRF+RET+H +K+LS+S+L+LDKGRI MEMNYMLA+
Sbjct: 224  PATNSFQEDCARILRAVRIAARLGFRFTRETAHSVKDLSSSVLKLDKGRILMEMNYMLAF 283

Query: 769  GSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXXXXXPDRPCH 948
            GSAEAS RLLWKFGLLEILLPIQA+Y VS GFRRRDKR+NM             PDRPCH
Sbjct: 284  GSAEASFRLLWKFGLLEILLPIQAAYFVSQGFRRRDKRTNMLLSLFSNLDRLLAPDRPCH 343

Query: 949  SCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPHDTSFHEISE 1128
            S LWV +LAFH+ALV+QPRD LV+  F++AVHSGG L +AV+IAR ISQPHD SF E+SE
Sbjct: 344  SSLWVVILAFHKALVDQPRDPLVVATFALAVHSGGDLLEAVNIARRISQPHDLSFFELSE 403

Query: 1129 PIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVFISWALSLKV 1308
            P  S S + LM  V+ L ASVK AL  +TDE+FVSQA+ +YPQAP SD+VFI  AL LK 
Sbjct: 404  PQTSISDEALMDAVVDLAASVKAALSMMTDESFVSQAMAEYPQAPYSDLVFIPLALYLKA 463

Query: 1309 SSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYPTKL 1467
              IFECIR G+ + F PKRG +I+Y+ LALG L EVR +  RVVFDTVYP  L
Sbjct: 464  CRIFECIRVGKEKGFAPKRGRKINYEDLALGSLQEVRHLLARVVFDTVYPPNL 516


>ref|XP_009779994.1| PREDICTED: uncharacterized protein LOC104229113 isoform X2 [Nicotiana
            sylvestris]
          Length = 520

 Score =  643 bits (1659), Expect = 0.0
 Identities = 320/486 (65%), Positives = 389/486 (80%)
 Frame = +1

Query: 16   NNDHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDV 195
            N  HN+       +VKG  D+  P +WKKL S+ELGI TS I KP R VLN L+KKG++V
Sbjct: 51   NRSHNT------SDVKG--DESGP-KWKKLSSEELGISTSMIAKPTRVVLNGLKKKGFEV 101

Query: 196  YLVGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSS 375
            YLVGGCVRDL+L +TPKDFDI+T+AELKEV++ F RCEIVGRRFPICHVHVDD IVEVSS
Sbjct: 102  YLVGGCVRDLLLNKTPKDFDIITSAELKEVLKTFQRCEIVGRRFPICHVHVDDTIVEVSS 161

Query: 376  FSTNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGME 555
            F+T GR+F R S + +R+P+ CDE DFIRW+NC+ RDFTINGLMFDPFARIVYDY+GG+E
Sbjct: 162  FNTTGRRFKRNSYNVVRRPAMCDEADFIRWKNCLGRDFTINGLMFDPFARIVYDYLGGLE 221

Query: 556  DIQKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGR 735
            DI++AKVRC+ PA+ SF+EDCARILRGVRIA RLGFRFSRET+HF+KEL++S+ +LDKGR
Sbjct: 222  DIRRAKVRCVIPASASFIEDCARILRGVRIAGRLGFRFSRETAHFVKELASSISKLDKGR 281

Query: 736  IHMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXX 915
            I MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQ       GFRRRDKRSNM        
Sbjct: 282  ILMEMNYMLAYGSAEASLRLLWKFGLLEILLPIQ-------GFRRRDKRSNMLLSLFSTL 334

Query: 916  XXXXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQ 1095
                 PDRPCHS LW+++LAFH+ALV++PRD LV+ AFS AVH GGSLSD + +A +ISQ
Sbjct: 335  DNLLAPDRPCHSSLWIAILAFHKALVDRPRDPLVVAAFSAAVHCGGSLSDVLGVAGKISQ 394

Query: 1096 PHDTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDM 1275
            PHDT F E+ +     S + L+ E++ L   V+ ALR++TDE F+SQAL +YPQAP+SD+
Sbjct: 395  PHDTRFSELLDFRIVESDEALLDEMMDLATYVEAALRKMTDEYFISQALTEYPQAPKSDL 454

Query: 1276 VFISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVY 1455
            VFI WALS KV +IFEC+R+G+ + F  KRGS+IDY+SLALG+L E+R IF RVVFDTV+
Sbjct: 455  VFIPWALSQKVDAIFECVRRGKEKGFRQKRGSKIDYESLALGKLHEIRHIFARVVFDTVF 514

Query: 1456 PTKLAE 1473
            P  L +
Sbjct: 515  PPHLKD 520


>ref|XP_019181262.1| PREDICTED: uncharacterized protein LOC109176260 [Ipomoea nil]
 ref|XP_019181263.1| PREDICTED: uncharacterized protein LOC109176260 [Ipomoea nil]
          Length = 539

 Score =  643 bits (1659), Expect = 0.0
 Identities = 312/469 (66%), Positives = 379/469 (80%)
 Frame = +1

Query: 91   EWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYLVGGCVRDLVLKRTPKDFDILTTA 270
            EWKKL S+ELGIRTS I+K  R VLN L++KGYDVYLVGGCVRDL+LK+ PKDFD++T+A
Sbjct: 70   EWKKLSSEELGIRTSMISKSTRLVLNCLKQKGYDVYLVGGCVRDLILKKIPKDFDVITSA 129

Query: 271  ELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFSTNGRKFGRKSKSPMRKPSGCDEC 450
            ELKEV++ FPRCEIVGRRFPICHVHVDD IVEVSSF+T GRKFG  S   +R+P  C++ 
Sbjct: 130  ELKEVLKTFPRCEIVGRRFPICHVHVDDVIVEVSSFTTMGRKFGMNSYHAVRRPPKCNDA 189

Query: 451  DFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDIQKAKVRCITPANTSFVEDCARIL 630
            DF+RW+NC+ RDFTINGLMFDPFARI+YDY+GGMEDI++AKVR + PA+TSF+EDCARIL
Sbjct: 190  DFMRWKNCLGRDFTINGLMFDPFARIIYDYMGGMEDIRRAKVRSVIPASTSFMEDCARIL 249

Query: 631  RGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIHMEMNYMLAYGSAEASLRLLWKFG 810
            RGVRIA+RLGFRFSRET+HF++E ++S+LRLD+GRI MEMNYMLAYGSAEASLRLLWKFG
Sbjct: 250  RGVRIASRLGFRFSRETAHFVREFASSVLRLDRGRILMEMNYMLAYGSAEASLRLLWKFG 309

Query: 811  LLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXXXXXPDRPCHSCLWVSLLAFHEAL 990
            +LEILLPIQASYLVS GF+RRD RSNM             PDRPCH+ LW+++LAFH+AL
Sbjct: 310  ILEILLPIQASYLVSQGFKRRDTRSNMLLSLFSSLDNLLAPDRPCHTSLWIAILAFHKAL 369

Query: 991  VEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPHDTSFHEISEPIYSYSKDELMAEV 1170
             ++PRD LV+ AF I VH+ GS SDA+ I R+IS PHD  F E+S      S + L+ EV
Sbjct: 370  ADRPRDPLVVAAFCIVVHTSGSSSDALGIVRKISHPHDPRFSELSIDHDLKSDEVLLDEV 429

Query: 1171 IRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVFISWALSLKVSSIFECIRKGRNRR 1350
            + L A VK AL+++TDE FVSQALI+YP+AP SDMVFIS ALS KV +IFEC+R+G+   
Sbjct: 430  MNLAADVKSALKKMTDEYFVSQALIEYPEAPHSDMVFISQALSQKVCAIFECVRRGKQMG 489

Query: 1351 FVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYPTKLAEQ*SISNFS 1497
            +  K GS+IDY++L LGRL EVR +F  VVFDTVYP +     S+   S
Sbjct: 490  YAQKHGSKIDYEALTLGRLHEVRRVFAMVVFDTVYPLQTKRDNSVDKVS 538


>ref|XP_021285979.1| uncharacterized protein LOC110417781 isoform X1 [Herrania umbratica]
          Length = 528

 Score =  639 bits (1647), Expect = 0.0
 Identities = 313/479 (65%), Positives = 381/479 (79%)
 Frame = +1

Query: 22   DHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYL 201
            D +S  + S+       +D K  +WKKL S++LGI T+ I+KP R VLN L++KGY+VYL
Sbjct: 47   DKSSQSSDSSHTNNVSEEDSKLPQWKKLNSQDLGISTTNISKPTRKVLNGLKRKGYEVYL 106

Query: 202  VGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFS 381
            VGGCVRDL+LKRTPKDFDI+TTAEL+EV+ AF RCEIVGRRFPICHVH+ D IVEVSSFS
Sbjct: 107  VGGCVRDLILKRTPKDFDIITTAELREVVTAFSRCEIVGRRFPICHVHIGDTIVEVSSFS 166

Query: 382  TNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDI 561
            T+G+KFGR     + +P+GCDE DFIRW+NC+QRDFTINGLMFDP+ARI+YDY+GG+EDI
Sbjct: 167  TSGQKFGRSLNYKLGRPAGCDEKDFIRWRNCLQRDFTINGLMFDPYARIIYDYMGGIEDI 226

Query: 562  QKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIH 741
            +KAKVR + PA TSF EDCARILR +RIAARLGF FSRET+HF+K LS S+LRLDK RI 
Sbjct: 227  RKAKVRTVIPAGTSFQEDCARILRAIRIAARLGFSFSRETAHFIKNLSCSILRLDKSRIL 286

Query: 742  MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXX 921
            MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQA+Y VS+G RRRDKRSNM          
Sbjct: 287  MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQAAYFVSNGLRRRDKRSNMLLSLFSNLDR 346

Query: 922  XXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPH 1101
               PDRPCH  LWV +LAFH+AL+++P+D LV+ A+S++VH+GG + +AV+IA  I++ H
Sbjct: 347  LLAPDRPCHGSLWVGILAFHKALLDEPKDPLVVAAYSLSVHNGGDILEAVNIATRINKSH 406

Query: 1102 DTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVF 1281
            DTSFHE+SEP  +     L+ EV+ L ASVK  L ++TDE+FVSQA+  YPQAP SD+VF
Sbjct: 407  DTSFHELSEP-RNLENQTLINEVMDLAASVKSTLCKMTDEHFVSQAMSAYPQAPFSDLVF 465

Query: 1282 ISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYP 1458
            I  AL LKV  +FEC+R+G  + FV K+GS IDY+ LALG L E+R  F RVVFDTVYP
Sbjct: 466  IPLALYLKVCKVFECVREGAEKGFVAKQGSRIDYELLALGSLSELRHTFARVVFDTVYP 524


>ref|XP_021285981.1| uncharacterized protein LOC110417781 isoform X3 [Herrania umbratica]
          Length = 468

 Score =  635 bits (1638), Expect = 0.0
 Identities = 310/462 (67%), Positives = 375/462 (81%)
 Frame = +1

Query: 73   DDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYLVGGCVRDLVLKRTPKDF 252
            +D K  +WKKL S++LGI T+ I+KP R VLN L++KGY+VYLVGGCVRDL+LKRTPKDF
Sbjct: 4    EDSKLPQWKKLNSQDLGISTTNISKPTRKVLNGLKRKGYEVYLVGGCVRDLILKRTPKDF 63

Query: 253  DILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFSTNGRKFGRKSKSPMRKP 432
            DI+TTAEL+EV+ AF RCEIVGRRFPICHVH+ D IVEVSSFST+G+KFGR     + +P
Sbjct: 64   DIITTAELREVVTAFSRCEIVGRRFPICHVHIGDTIVEVSSFSTSGQKFGRSLNYKLGRP 123

Query: 433  SGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDIQKAKVRCITPANTSFVE 612
            +GCDE DFIRW+NC+QRDFTINGLMFDP+ARI+YDY+GG+EDI+KAKVR + PA TSF E
Sbjct: 124  AGCDEKDFIRWRNCLQRDFTINGLMFDPYARIIYDYMGGIEDIRKAKVRTVIPAGTSFQE 183

Query: 613  DCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIHMEMNYMLAYGSAEASLR 792
            DCARILR +RIAARLGF FSRET+HF+K LS S+LRLDK RI MEMNYMLAYGSAEASLR
Sbjct: 184  DCARILRAIRIAARLGFSFSRETAHFIKNLSCSILRLDKSRILMEMNYMLAYGSAEASLR 243

Query: 793  LLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXXXXXPDRPCHSCLWVSLL 972
            LLWKFGLLEILLPIQA+Y VS+G RRRDKRSNM             PDRPCH  LWV +L
Sbjct: 244  LLWKFGLLEILLPIQAAYFVSNGLRRRDKRSNMLLSLFSNLDRLLAPDRPCHGSLWVGIL 303

Query: 973  AFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPHDTSFHEISEPIYSYSKD 1152
            AFH+AL+++P+D LV+ A+S++VH+GG + +AV+IA  I++ HDTSFHE+SEP  +    
Sbjct: 304  AFHKALLDEPKDPLVVAAYSLSVHNGGDILEAVNIATRINKSHDTSFHELSEP-RNLENQ 362

Query: 1153 ELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVFISWALSLKVSSIFECIR 1332
             L+ EV+ L ASVK  L ++TDE+FVSQA+  YPQAP SD+VFI  AL LKV  +FEC+R
Sbjct: 363  TLINEVMDLAASVKSTLCKMTDEHFVSQAMSAYPQAPFSDLVFIPLALYLKVCKVFECVR 422

Query: 1333 KGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYP 1458
            +G  + FV K+GS IDY+ LALG L E+R  F RVVFDTVYP
Sbjct: 423  EGAEKGFVAKQGSRIDYELLALGSLSELRHTFARVVFDTVYP 464


>ref|XP_007038104.2| PREDICTED: poly(A) polymerase I [Theobroma cacao]
          Length = 528

 Score =  637 bits (1642), Expect = 0.0
 Identities = 314/479 (65%), Positives = 379/479 (79%)
 Frame = +1

Query: 22   DHNSYQTSSARNVKGGNDDCKPHEWKKLCSKELGIRTSRITKPARTVLNVLRKKGYDVYL 201
            D +S  + S+       +D K  +WKKL S++LGI T+ I+KP R VLN L++KGY+VYL
Sbjct: 47   DKSSQGSDSSHRNNVSEEDSKLPQWKKLNSQDLGISTTNISKPTRKVLNGLKRKGYEVYL 106

Query: 202  VGGCVRDLVLKRTPKDFDILTTAELKEVMRAFPRCEIVGRRFPICHVHVDDAIVEVSSFS 381
            VGGCVRDL+LKRTPKDFDI+TTAEL+EV+RAF RCEIVGRRFPICHVH+ D IVEVSSFS
Sbjct: 107  VGGCVRDLILKRTPKDFDIITTAELREVVRAFSRCEIVGRRFPICHVHIGDTIVEVSSFS 166

Query: 382  TNGRKFGRKSKSPMRKPSGCDECDFIRWQNCMQRDFTINGLMFDPFARIVYDYIGGMEDI 561
            T+G+KFGR     + +P+GCDE DFIRW+NC+QRDFTINGLMFDP+ARI+YDY+GG+EDI
Sbjct: 167  TSGQKFGRSLNYKLGRPAGCDEKDFIRWRNCLQRDFTINGLMFDPYARIIYDYMGGIEDI 226

Query: 562  QKAKVRCITPANTSFVEDCARILRGVRIAARLGFRFSRETSHFLKELSNSLLRLDKGRIH 741
            +KAKVR + PA TSF EDCARILR +RIAARLGF FSRET+HF+K LS S+LRLDK RI 
Sbjct: 227  RKAKVRTVIPAGTSFQEDCARILRAIRIAARLGFSFSRETAHFIKNLSCSILRLDKSRIL 286

Query: 742  MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQASYLVSHGFRRRDKRSNMXXXXXXXXXX 921
            MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQA+Y VS+G RRRDKRSNM          
Sbjct: 287  MEMNYMLAYGSAEASLRLLWKFGLLEILLPIQAAYFVSNGLRRRDKRSNMLLSLFSNLDR 346

Query: 922  XXXPDRPCHSCLWVSLLAFHEALVEQPRDALVIGAFSIAVHSGGSLSDAVDIAREISQPH 1101
               PDRPCH  LWV +LAFH+AL ++PRD LV+ A+S+ VH+GG + +AV+IA  I++ H
Sbjct: 347  LLAPDRPCHGSLWVGILAFHKALFDKPRDPLVVAAYSLVVHNGGDILEAVNIATRINKSH 406

Query: 1102 DTSFHEISEPIYSYSKDELMAEVIRLGASVKDALRRLTDENFVSQALIQYPQAPQSDMVF 1281
            DTSF E+SEP  +     L+ EV+ L ASVK  L ++TDE+FVSQA+  YPQAP SD+VF
Sbjct: 407  DTSFRELSEP-RNLENQTLINEVMDLAASVKSTLCKMTDEHFVSQAMSAYPQAPFSDLVF 465

Query: 1282 ISWALSLKVSSIFECIRKGRNRRFVPKRGSEIDYDSLALGRLDEVRDIFGRVVFDTVYP 1458
            I  AL LKV  +FEC+R+G  + FV K+GS IDY+ LALG L E+R  F RVVFDTVYP
Sbjct: 466  IPLALYLKVCKVFECVREGAEKGFVAKQGSRIDYELLALGSLSELRHTFARVVFDTVYP 524


Top