BLASTX nr result

ID: Perilla23_contig00007309 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00007309
         (1053 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011091205.1| PREDICTED: poly(A) RNA polymerase protein 1-...   293   2e-76
ref|XP_012843633.1| PREDICTED: uncharacterized protein LOC105963...   185   5e-44
ref|XP_011093188.1| PREDICTED: uncharacterized protein LOC105173...   146   3e-32
ref|XP_011069389.1| PREDICTED: poly(A) RNA polymerase cid11-like...   101   1e-18
ref|XP_009799247.1| PREDICTED: terminal uridylyltransferase 7 [N...    97   2e-17
ref|XP_009626263.1| PREDICTED: uncharacterized protein LOC104116...    96   6e-17
ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603...    87   2e-14
ref|XP_010106745.1| Poly(A) RNA polymerase cid11 [Morus notabili...    82   7e-13
ref|XP_007051995.1| Nucleotidyltransferase family protein isofor...    82   9e-13
ref|XP_007051994.1| Nucleotidyltransferase family protein isofor...    82   9e-13
ref|XP_007051993.1| Nucleotidyltransferase family protein isofor...    82   9e-13
ref|XP_007051992.1| Nucleotidyltransferase family protein isofor...    82   9e-13
ref|XP_007051991.1| Nucleotidyltransferase family protein isofor...    82   9e-13
gb|KJB49361.1| hypothetical protein B456_008G115100 [Gossypium r...    81   1e-12
gb|KJB49360.1| hypothetical protein B456_008G115100 [Gossypium r...    81   1e-12
ref|XP_012437624.1| PREDICTED: terminal uridylyltransferase 7-li...    81   1e-12
ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244...    76   4e-11
emb|CDP20563.1| unnamed protein product [Coffea canephora]             75   7e-11
ref|XP_010518183.1| PREDICTED: uncharacterized protein LOC104793...    65   9e-08

>ref|XP_011091205.1| PREDICTED: poly(A) RNA polymerase protein 1-like [Sesamum indicum]
          Length = 729

 Score =  293 bits (750), Expect = 2e-76
 Identities = 172/352 (48%), Positives = 210/352 (59%), Gaps = 20/352 (5%)
 Frame = +2

Query: 56   MNVRGGDAPPPNGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXAVAAVGPSIP 235
            MN RGGDAPPPNGGEFLLQLLRN  PPNS  P                  AVAAVGP+IP
Sbjct: 1    MNGRGGDAPPPNGGEFLLQLLRN--PPNSNPPT-PSPLHHPPSQTFSQDPAVAAVGPTIP 57

Query: 236  TYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXX----ARGFTHS 403
            ++PLPH+AFP SNG DF +          FA HNY                  + GF ++
Sbjct: 58   SFPLPHSAFP-SNGHDFLYRPANPSPSPPFAPHNYFQQNPNVKPNLNPDFSSPSAGFNYA 116

Query: 404  PPQFDLQPKLISPGDDARKLGFYGDNSKPIAANQPGQNLIFGSLNRDVLNNGTVNVLDQS 583
              QF+L    ISPGDD RKL  YGDNSK    +Q  QN+IFGSLNRD+L++   +VL QS
Sbjct: 117  LHQFNLHSNRISPGDDMRKLASYGDNSKSSVPHQQEQNIIFGSLNRDILHSDPASVLRQS 176

Query: 584  FYRMND----------------NRFPSSSIDVNENARGNPALLRGQKQERSTSSNDRLKQ 715
             YR N                 N F  +SID++ +ARGN  +LR  +QE S++S++R KQ
Sbjct: 177  LYRENRLGNSYQEEGLRMDRMLNGFRVNSIDIDGSARGNSTILRAYEQEVSSNSDNRRKQ 236

Query: 716  GDGGNHRVVAPPPGFSSNSKNVEQREYGYGRTSDLNGDKGMCNYGQLQQNDRIGNQLDLP 895
             D G++R VAPPPGFSSN KNV  RE+GYGRTS  + DKG  N+G L +NDR GNQLD P
Sbjct: 237  VDNGSYRAVAPPPGFSSNVKNVRNREFGYGRTSVHDLDKGKANFGDLCRNDRSGNQLDSP 296

Query: 896  GLPAGSSRQSASTLDIEESMKGLNVGDGERSEDSRRGVQAKLNNDDSQMDDL 1051
              PA SS QS S  DIEESM  L+  DGE  E+   G + K+N   +++ DL
Sbjct: 297  APPARSSLQSVSAFDIEESMMELHGKDGENGEELICGGRDKVNRGQTEIHDL 348


>ref|XP_012843633.1| PREDICTED: uncharacterized protein LOC105963736 [Erythranthe
            guttatus] gi|604321452|gb|EYU32028.1| hypothetical
            protein MIMGU_mgv1a001944mg [Erythranthe guttata]
          Length = 735

 Score =  185 bits (470), Expect = 5e-44
 Identities = 142/398 (35%), Positives = 180/398 (45%), Gaps = 66/398 (16%)
 Frame = +2

Query: 56   MNVRGGDAPPPNGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXAVAAVGPSIP 235
            MN RGGDAPP +GG+FLLQLLRN  PPN +H  L                AVAAVGP++P
Sbjct: 1    MNARGGDAPPASGGDFLLQLLRN--PPNFSH--LTPPSQQQTPDIFSQDPAVAAVGPTVP 56

Query: 236  TYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXXARG------FT 397
            T+PLP   FP SNG D  F          FA H Y                         
Sbjct: 57   TFPLPQGGFP-SNGTDLQFRQWKHSPVPPFAPHQYFQQNPIARPNLNPDFPSPPPPGELN 115

Query: 398  HSPPQFDLQPKLISPGDDARKLGFYGDNSKP----------------------------- 490
            ++P QF+LQ   ISPG+DARKL  YGDNS+P                             
Sbjct: 116  YAPHQFNLQSNRISPGEDARKLAPYGDNSRPSAAAHQQLQSNRIPLGEDARRLGVFGEIA 175

Query: 491  ---IAANQPGQN-LIFGSLNRDVLNNGTVNVLDQSFYRMND----------------NRF 610
               +A +Q  QN LIFGSLNRD+L     +VL QS + M+                 NRF
Sbjct: 176  TPSVAQHQREQNHLIFGSLNRDILQTDAGDVLHQSLHPMDKLGNSYLEEVLGMDRRMNRF 235

Query: 611  PSSSIDVNENARGNPALLRGQKQERSTSSNDRLKQGDGGNHRVVAPPPGFSSNSKNVEQR 790
            P +  +VN N+RGN            +S N+R  QGD G+HR +APP   S+N KNV  R
Sbjct: 236  PVN--EVNGNSRGN------------SSGNERRNQGDNGSHRALAPPGFSSNNMKNVGNR 281

Query: 791  EYGY-GRTSDLNGDKGMCNYGQLQQNDRIGNQLDLPGLPAGSSRQSASTLDIEESMKGLN 967
            E+GY  R  D   DKG  N G   +N  + N ++ PG                 SM G++
Sbjct: 282  EHGYVTRNPDNYVDKGKGNSGGSYKNGGVSNPINSPG-----------------SMMGIH 324

Query: 968  VGDGERSEDSRRG----------VQAKLNNDDSQMDDL 1051
            V DG + ++ R G           Q+K+N  + QM  L
Sbjct: 325  VEDGGKGKELRFGGQNNKNQGDRAQSKMNGIEDQMGSL 362


>ref|XP_011093188.1| PREDICTED: uncharacterized protein LOC105173206 [Sesamum indicum]
          Length = 685

 Score =  146 bits (368), Expect = 3e-32
 Identities = 123/346 (35%), Positives = 164/346 (47%), Gaps = 15/346 (4%)
 Frame = +2

Query: 56   MNVRGGDAPPP-NGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXAVAAVGPSI 232
            M+  GGD  P  NGGEFLL+LLR  KPP S  P                  AVAAVGP+I
Sbjct: 1    MDGGGGDVQPSGNGGEFLLRLLR--KPPVSHTPF----HSQPPSANISYDPAVAAVGPTI 54

Query: 233  PTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXXARGFTHSPPQ 412
            P +   + + PP                  FA HN+                    +  Q
Sbjct: 55   PAFSWNYDSSPP------------------FAPHNFFPQNPNLNPNC---------ASHQ 87

Query: 413  FDLQPKLISPGDDARKLGFYGDNSKPIAANQPGQNLIFGSLNRDVLN--NGTVNVLDQSF 586
            F LQ  +IS  D  R  G + +NS P   +QP  NLIF S N D  N  + +V    +S+
Sbjct: 88   FTLQSSVISVCDGTRNSGIFTENSGPGVVHQPELNLIFRSAN-DAGNALHESVGQRSKSW 146

Query: 587  YR-MND--------NRFPSSSIDVNENARGNPALLRGQKQERSTSSNDRLKQGDGGNHRV 739
               M D        N F  +S  +N N RGN  +    +QER+++S+ R K G  G +R 
Sbjct: 147  NAFMGDDFGKHRRLNGFQMNSNAINGNGRGNSCMSSAYEQERNSNSDARRKLGQSGIYRA 206

Query: 740  VAPPPGFSSNSKNVEQREYGYGRTSDLNGDKGMCNYGQL-QQNDRIGNQLDLPGLPAGSS 916
            VAPPPGFS+   NV         T+D N   G  N+G L +++ R+ NQLD PG P G S
Sbjct: 207  VAPPPGFSTKIMNV--------GTADHNATTGN-NFGDLHKKSSRLSNQLDSPGPPPGGS 257

Query: 917  RQSASTLDIEESMKGLNVGDGERSEDSRRGVQAKLNNDD--SQMDD 1048
             QS S  D EESM  L   DG+ +++ R G Q K++ D   +++DD
Sbjct: 258  PQSVSAFDTEESMLKLRGEDGDSAKELRHGGQDKISGDGVRNELDD 303


>ref|XP_011069389.1| PREDICTED: poly(A) RNA polymerase cid11-like [Sesamum indicum]
          Length = 536

 Score =  101 bits (251), Expect = 1e-18
 Identities = 62/152 (40%), Positives = 87/152 (57%), Gaps = 3/152 (1%)
 Frame = +2

Query: 602  NRFPSSSIDVNENARGNPALLRGQKQERSTSSNDRLKQGDGGNHRVVAPPPGFSSNSKNV 781
            N F  +S  +N N RGN  +    +QER+++S+ R K G  G +R VAPPPGFS+   NV
Sbjct: 12   NGFQMNSNAINGNGRGNSCMSSAYEQERNSNSDARRKLGQSGIYRAVAPPPGFSTKIMNV 71

Query: 782  EQREYGYGRTSDLNGDKGMCNYGQL-QQNDRIGNQLDLPGLPAGSSRQSASTLDIEESMK 958
                     T+D N   G  N+G L +++ R+ NQLD PG P G S QS S  D EESM 
Sbjct: 72   --------GTADHNATTGN-NFGDLHKKSSRLSNQLDSPGPPPGGSPQSVSAFDTEESML 122

Query: 959  GLNVGDGERSEDSRRGVQAKLNNDD--SQMDD 1048
             L   DG+ +++ R G Q K++ D   +++DD
Sbjct: 123  KLRGEDGDSAKELRHGGQDKISGDGVRNELDD 154


>ref|XP_009799247.1| PREDICTED: terminal uridylyltransferase 7 [Nicotiana sylvestris]
          Length = 766

 Score = 97.4 bits (241), Expect = 2e-17
 Identities = 118/418 (28%), Positives = 162/418 (38%), Gaps = 86/418 (20%)
 Frame = +2

Query: 56   MNVRGGDAPPP---------NGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXA 208
            MN  GGDA  P         NGGEFLLQLL+N   P+  HP                  A
Sbjct: 1    MNGGGGDAASPPLSSQSAAANGGEFLLQLLQNR--PHHHHP---QHQPPPELQTLPHDPA 55

Query: 209  VAAVGPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXXAR 388
            VAAVGPS+P   LP++  PP                  F  HN+               +
Sbjct: 56   VAAVGPSVPFPQLPYSHSPP-----------------LFVPHNF-------------FIQ 85

Query: 389  GFTHSP-PQFDLQPKLISP----------------------GDDARKLGFYGDNSKPI-A 496
            GF  +P P  +L P   SP                      G++   LG +G N+ P  +
Sbjct: 86   GFLQNPNPSHNLNPNFSSPPAHPSGFGQIQHAGGPLGFGSVGENMGNLGIFGGNAMPYNS 145

Query: 497  ANQPGQNLIFGSLNRD------VLNN-------GTVNVLDQSFY--RMNDNRFPSSSIDV 631
             ++  QNL FGSL RD      +LNN       G V    Q     R+ + R  +     
Sbjct: 146  THELDQNLTFGSLRRDIRGDLGILNNRLNGDLAGKVGNFAQKNQESRLGNVRMLNGVEGK 205

Query: 632  NENARGNP-------ALLRGQKQERSTSSNDRLKQG--------DGGNHRVVAPPPGFSS 766
             +N  G+          LRG +Q+ STS     + G          G+ R V PPPGFS 
Sbjct: 206  LDNVIGSGRKQHESLGNLRGLEQQNSTSGGGGGESGGLGWGRQFQSGSVRGVVPPPGFSG 265

Query: 767  NSK------NVEQREYGY----GRTSDLNGD---------KGMCNYGQLQQNDRIGNQLD 889
             ++      NV++ +  +     R   LN +         +   NY  +     I  QLD
Sbjct: 266  KARSIGFEHNVDKEKSNFVESNHRVIGLNHENERESKYLPRNGKNYAIVSDGRGIFRQLD 325

Query: 890  LPGLPAGSSRQSASTLDIEESMKGLNVGDGERSEDSRRGVQAKLN----NDDSQMDDL 1051
             PG  AGS   S    D+E+SM  L   + E  E++  G++ KL        S++DDL
Sbjct: 326  SPGPLAGSKLHSVLASDVEDSMLELQGEEAESGEETGSGMRDKLGRGSARGQSELDDL 383


>ref|XP_009626263.1| PREDICTED: uncharacterized protein LOC104116995 [Nicotiana
            tomentosiformis]
          Length = 766

 Score = 95.5 bits (236), Expect = 6e-17
 Identities = 120/419 (28%), Positives = 163/419 (38%), Gaps = 87/419 (20%)
 Frame = +2

Query: 56   MNVRGGDA--PP-------PNGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXA 208
            MN  GGDA  PP       PN GEFLLQLL+N   P+  +P                  A
Sbjct: 1    MNGVGGDAASPPLSSQSATPNSGEFLLQLLQNR--PHHHYP---QHQPQPELQTLPHDPA 55

Query: 209  VAAVGPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXXAR 388
            VAAVGPS+P   LP++  PP                  FA HN+               +
Sbjct: 56   VAAVGPSVPFSQLPYSHSPP-----------------LFAPHNF-------------FIQ 85

Query: 389  GFTHSP-PQFDLQPKLISP----------------------GDDARKLGFYGDNSKPI-A 496
            GF  +P P  +L P   SP                      G++   LG +  ++KP  +
Sbjct: 86   GFFQNPNPSHNLNPNFSSPPAHPSGFSQIQHAGGPLGFGSVGENMGNLGIFSADAKPYNS 145

Query: 497  ANQPGQNLIFGSLNRD------VLNN-------GTVNVLDQSFY--RMNDNRFPSSSIDV 631
             ++  QNL FGSL RD      +LNN       G V    Q     R+ + R  +     
Sbjct: 146  THELDQNLTFGSLRRDIRGDLGILNNRLNGDLAGKVGNFAQKNQESRLGNVRMLNGVEGK 205

Query: 632  NENARGNP-------ALLRGQKQERSTSSNDRLKQG--------DGGNHRVVAPPPGFSS 766
             +N  G+          LRG +Q+ STS     + G          G+ R V PPPGFS 
Sbjct: 206  LDNVIGSGRKQHESLGNLRGLEQQNSTSGGGGGESGGLGWGRQFPSGSVRGVVPPPGFSG 265

Query: 767  NSKN--------------VEQREYGYG------RTSDLNGDKGMCNYGQLQQNDRIGNQL 886
              ++              VE    G G      R S      G  NY  +     I  QL
Sbjct: 266  KPRSMGFEHNVDKEKSNFVELNHRGIGLNHKNERESKFLPRNGK-NYAIVSDGRGIFRQL 324

Query: 887  DLPGLPAGSSRQSASTLDIEESMKGLNVGDGERSEDSRRGVQAKL----NNDDSQMDDL 1051
            D PG PAGS   S    D+E+S+  L+  + E  E++  G++ KL    +   S++DDL
Sbjct: 325  DAPGPPAGSKLHSVLASDVEDSILELHGEEAESGEETGTGMRDKLGRGSSRGQSELDDL 383


>ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603223 [Solanum tuberosum]
          Length = 775

 Score = 87.4 bits (215), Expect = 2e-14
 Identities = 109/382 (28%), Positives = 142/382 (37%), Gaps = 63/382 (16%)
 Frame = +2

Query: 83   PPNGGEFLLQLLRNNKPPNSTH------PILXXXXXXXXXXXXXXXXAVAAVGPSIPTYP 244
            P NGGEFLLQLL+N+  P+  H      P                  AVAAVGPS+P  P
Sbjct: 14   PSNGGEFLLQLLQNH--PHQLHSQPQPLPQPLPPPLRPELQTLPHDPAVAAVGPSMPYPP 71

Query: 245  LPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXXARGFTHSPP----- 409
            L H    PS                 F  HN+                    SPP     
Sbjct: 72   LFHTPTNPS-------VLPYSHSPPLFVPHNFFVRGFLQNPNSSHTINPNFSSPPAPTGF 124

Query: 410  -QFDLQPKLI--SPGDDARKLGFYGDNSKPIAANQP-GQNLIFGSLNRDVLNNGTVNVLD 577
             QF     L   S G++   LG +G N+K   +N     NLIFGSL RD+   G V++L+
Sbjct: 125  SQFQHASPLGFGSVGENMGNLGIFGANAKASNSNNEFDHNLIFGSLRRDI--QGNVSMLN 182

Query: 578  QSFY-----------------RMNDNRFPSSSIDVNENARGNPAL----LRGQKQ----- 679
              F                  R+ + R  +      EN  G+       LRG +Q     
Sbjct: 183  DRFSDDLACKVGNFEQKNQESRLTNVRMLNGVEGKRENVIGSGRKQLGNLRGLEQQNRGG 242

Query: 680  ---ERSTSSNDRLKQGDGGNHRVVAPPPGFSS------------NSKN--VEQREYGYGR 808
               E  +    R +Q   G  R   PPPGFSS            N KN  VE    G G 
Sbjct: 243  GGGESESGGLGRGRQFHSGTVRGAVPPPGFSSKPRSRDFEHNVDNEKNNFVELNHRGIGL 302

Query: 809  TSDLNGD-----KGMCNYGQLQQNDRIGNQLDLPGLPAGSSRQSASTLDIEESMKGLNVG 973
                  +     +   NY     + R+  QLD P  PAGS   S    D+E+S   L+  
Sbjct: 303  NHKYERESKHLTRNGKNYAIGSDDQRVFRQLDSPVPPAGSKLHSVLGSDVEDSTLELHGE 362

Query: 974  DGERSEDSRRGVQAKLNNDDSQ 1039
            D E  E++  G++  L    +Q
Sbjct: 363  DAESGEETVSGMRNVLGRSSAQ 384


>ref|XP_010106745.1| Poly(A) RNA polymerase cid11 [Morus notabilis]
            gi|587924413|gb|EXC11712.1| Poly(A) RNA polymerase cid11
            [Morus notabilis]
          Length = 703

 Score = 82.0 bits (201), Expect = 7e-13
 Identities = 103/365 (28%), Positives = 151/365 (41%), Gaps = 42/365 (11%)
 Frame = +2

Query: 68   GGDAPPP-----NGGEFLLQLLRN----------NKPPNSTHPILXXXXXXXXXXXXXXX 202
            GG+AP P     NGGEFLL LL+            +PP    P                 
Sbjct: 4    GGNAPSPPTPAANGGEFLLSLLQKPQAAKSASPPPQPPPPQPPPPQSQQRQQPQQSLAVD 63

Query: 203  XAVAAVGPSIPTYPLPHAAFPPSNGGDFT----FXXXXXXXXXXFAQHNYXXXXXXXXXX 370
             AVAA GPS+P +P PH    PSNG D      +          FA + +          
Sbjct: 64   PAVAAGGPSVP-FPPPH--LWPSNGQDLLHPLHWPVHSLANPPPFAPNGFL--------- 111

Query: 371  XXXXARGFTHS--PPQFDLQPKLISPGDDARKLGFYGD-NSKPIAANQPGQNLIFGSLNR 541
                  GF HS  P QF  +    + G+D R+LGF G  NS P             +LN 
Sbjct: 112  ------GFPHSFFPNQFQGKQVSGNVGEDLRRLGFSGGVNSNP-------------NLNL 152

Query: 542  DVLNNGTVNVLDQSFYRMNDNRFPSSSIDVNE-----NARGNPALLRGQKQERSTSSNDR 706
            + + +G V   +Q  +++     PS  + + E     +A     L+   ++  S SS++ 
Sbjct: 153  NPI-HGIVQQKNQLEHKLKFGSLPSEIVIIPEALPKVDASNFNNLVDRSRRLSSNSSSNA 211

Query: 707  LKQGDGGNHRVVAPPPGFSSNSKNVEQREYGYGRTSDLNGD---------KGMCNYGQLQ 859
            ++QG+   H+   PPPGF S  K      +  G  + ++GD         + +   G   
Sbjct: 212  VRQGN-YEHQRTNPPPGFRSKPKRT-GLNHSIGGENSVSGDLMRTRDVLAEDIGIRGDGS 269

Query: 860  QNDRIGNQLDLPGLPAGSSRQSASTLDIEESMKGL-----NVGDGERSED-SRRGVQAKL 1021
            +   +  QLD PG P+GS+ +S    D+EESM  L      VG G   +D  +R V + L
Sbjct: 270  RGLELSAQLDRPGPPSGSNLRSVLASDVEESMMKLESDAVEVGGGHEIDDIGQRLVDSLL 329

Query: 1022 NNDDS 1036
              D+S
Sbjct: 330  IEDES 334


>ref|XP_007051995.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao]
            gi|508704256|gb|EOX96152.1| Nucleotidyltransferase family
            protein isoform 5 [Theobroma cacao]
          Length = 635

 Score = 81.6 bits (200), Expect = 9e-13
 Identities = 107/371 (28%), Positives = 142/371 (38%), Gaps = 55/371 (14%)
 Frame = +2

Query: 56   MNVRGGDAPPP---NGGEFLLQLLRN-------------NKPPNSTHPILXXXXXXXXXX 187
            M   GG+AP P   NGGEFLL LL+              ++    T P            
Sbjct: 1    MTGNGGEAPSPPAANGGEFLLSLLQKPQQHLQQQQSPLFSRATPVTIPQPQQQQQQQQQQ 60

Query: 188  XXXXXXAVAAVGPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXX 367
                  AVAAVGP++P  PL      PSNG D                            
Sbjct: 61   PLVIDPAVAAVGPTLPFRPLW-----PSNGRDLPGLWPQ--------------------- 94

Query: 368  XXXXXARGFTHSPPQ------FDLQPKLISPG-----------DDARKLGFYG-DNSKP- 490
                     T SPP       F L P   SPG           DD R+LG  G DN+K  
Sbjct: 95   ---------TLSPPLAPNFLGFPLSP-WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNH 144

Query: 491  -----IAANQPGQNLIFGSLNRDVLNNGTV------NVLDQSFYRMNDNRFPSSSIDVNE 637
                 +      Q L+FGS   D+    T       N+L+ S   +++ +  S    +N 
Sbjct: 145  VIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSKLNLSNQQLDSR---LNS 201

Query: 638  NARGNPALLRGQKQERSTSSNDRLKQGDGGNHRVVAP-----PPGFSSNSKNVE-QREYG 799
            N   +P +     Q R++    + +Q  G      +P     PPGF    +     R++G
Sbjct: 202  NPNTSPYVF----QHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKPRGGGGNRDFG 257

Query: 800  YGRTS-DLNGDKGMCNYGQLQQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLNV 970
              R   + N DK    Y Q   ++ +G   QLD PG PAGS+ QS S  DIEES+  L+ 
Sbjct: 258  NRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH- 316

Query: 971  GDGERSEDSRR 1003
             DG R   SRR
Sbjct: 317  SDGGRDRFSRR 327


>ref|XP_007051994.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma
            cacao] gi|508704255|gb|EOX96151.1| Nucleotidyltransferase
            family protein isoform 4, partial [Theobroma cacao]
          Length = 585

 Score = 81.6 bits (200), Expect = 9e-13
 Identities = 107/371 (28%), Positives = 142/371 (38%), Gaps = 55/371 (14%)
 Frame = +2

Query: 56   MNVRGGDAPPP---NGGEFLLQLLRN-------------NKPPNSTHPILXXXXXXXXXX 187
            M   GG+AP P   NGGEFLL LL+              ++    T P            
Sbjct: 1    MTGNGGEAPSPPAANGGEFLLSLLQKPQQHLQQQQSPLFSRATPVTIPQPQQQQQQQQQQ 60

Query: 188  XXXXXXAVAAVGPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXX 367
                  AVAAVGP++P  PL      PSNG D                            
Sbjct: 61   PLVIDPAVAAVGPTLPFRPLW-----PSNGRDLPGLWPQ--------------------- 94

Query: 368  XXXXXARGFTHSPPQ------FDLQPKLISPG-----------DDARKLGFYG-DNSKP- 490
                     T SPP       F L P   SPG           DD R+LG  G DN+K  
Sbjct: 95   ---------TLSPPLAPNFLGFPLSP-WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNH 144

Query: 491  -----IAANQPGQNLIFGSLNRDVLNNGTV------NVLDQSFYRMNDNRFPSSSIDVNE 637
                 +      Q L+FGS   D+    T       N+L+ S   +++ +  S    +N 
Sbjct: 145  VIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSKLNLSNQQLDSR---LNS 201

Query: 638  NARGNPALLRGQKQERSTSSNDRLKQGDGGNHRVVAP-----PPGFSSNSKNVE-QREYG 799
            N   +P +     Q R++    + +Q  G      +P     PPGF    +     R++G
Sbjct: 202  NPNTSPYVF----QHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKPRGGGGNRDFG 257

Query: 800  YGRTS-DLNGDKGMCNYGQLQQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLNV 970
              R   + N DK    Y Q   ++ +G   QLD PG PAGS+ QS S  DIEES+  L+ 
Sbjct: 258  NRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH- 316

Query: 971  GDGERSEDSRR 1003
             DG R   SRR
Sbjct: 317  SDGGRDRFSRR 327


>ref|XP_007051993.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma
            cacao] gi|508704254|gb|EOX96150.1| Nucleotidyltransferase
            family protein isoform 3, partial [Theobroma cacao]
          Length = 584

 Score = 81.6 bits (200), Expect = 9e-13
 Identities = 107/371 (28%), Positives = 142/371 (38%), Gaps = 55/371 (14%)
 Frame = +2

Query: 56   MNVRGGDAPPP---NGGEFLLQLLRN-------------NKPPNSTHPILXXXXXXXXXX 187
            M   GG+AP P   NGGEFLL LL+              ++    T P            
Sbjct: 1    MTGNGGEAPSPPAANGGEFLLSLLQKPQQHLQQQQSPLFSRATPVTIPQPQQQQQQQQQQ 60

Query: 188  XXXXXXAVAAVGPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXX 367
                  AVAAVGP++P  PL      PSNG D                            
Sbjct: 61   PLVIDPAVAAVGPTLPFRPLW-----PSNGRDLPGLWPQ--------------------- 94

Query: 368  XXXXXARGFTHSPPQ------FDLQPKLISPG-----------DDARKLGFYG-DNSKP- 490
                     T SPP       F L P   SPG           DD R+LG  G DN+K  
Sbjct: 95   ---------TLSPPLAPNFLGFPLSP-WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNH 144

Query: 491  -----IAANQPGQNLIFGSLNRDVLNNGTV------NVLDQSFYRMNDNRFPSSSIDVNE 637
                 +      Q L+FGS   D+    T       N+L+ S   +++ +  S    +N 
Sbjct: 145  VIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSKLNLSNQQLDSR---LNS 201

Query: 638  NARGNPALLRGQKQERSTSSNDRLKQGDGGNHRVVAP-----PPGFSSNSKNVE-QREYG 799
            N   +P +     Q R++    + +Q  G      +P     PPGF    +     R++G
Sbjct: 202  NPNTSPYVF----QHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKPRGGGGNRDFG 257

Query: 800  YGRTS-DLNGDKGMCNYGQLQQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLNV 970
              R   + N DK    Y Q   ++ +G   QLD PG PAGS+ QS S  DIEES+  L+ 
Sbjct: 258  NRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH- 316

Query: 971  GDGERSEDSRR 1003
             DG R   SRR
Sbjct: 317  SDGGRDRFSRR 327


>ref|XP_007051992.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao]
            gi|508704253|gb|EOX96149.1| Nucleotidyltransferase family
            protein isoform 2 [Theobroma cacao]
          Length = 621

 Score = 81.6 bits (200), Expect = 9e-13
 Identities = 107/371 (28%), Positives = 142/371 (38%), Gaps = 55/371 (14%)
 Frame = +2

Query: 56   MNVRGGDAPPP---NGGEFLLQLLRN-------------NKPPNSTHPILXXXXXXXXXX 187
            M   GG+AP P   NGGEFLL LL+              ++    T P            
Sbjct: 1    MTGNGGEAPSPPAANGGEFLLSLLQKPQQHLQQQQSPLFSRATPVTIPQPQQQQQQQQQQ 60

Query: 188  XXXXXXAVAAVGPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXX 367
                  AVAAVGP++P  PL      PSNG D                            
Sbjct: 61   PLVIDPAVAAVGPTLPFRPLW-----PSNGRDLPGLWPQ--------------------- 94

Query: 368  XXXXXARGFTHSPPQ------FDLQPKLISPG-----------DDARKLGFYG-DNSKP- 490
                     T SPP       F L P   SPG           DD R+LG  G DN+K  
Sbjct: 95   ---------TLSPPLAPNFLGFPLSP-WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNH 144

Query: 491  -----IAANQPGQNLIFGSLNRDVLNNGTV------NVLDQSFYRMNDNRFPSSSIDVNE 637
                 +      Q L+FGS   D+    T       N+L+ S   +++ +  S    +N 
Sbjct: 145  VIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSKLNLSNQQLDSR---LNS 201

Query: 638  NARGNPALLRGQKQERSTSSNDRLKQGDGGNHRVVAP-----PPGFSSNSKNVE-QREYG 799
            N   +P +     Q R++    + +Q  G      +P     PPGF    +     R++G
Sbjct: 202  NPNTSPYVF----QHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKPRGGGGNRDFG 257

Query: 800  YGRTS-DLNGDKGMCNYGQLQQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLNV 970
              R   + N DK    Y Q   ++ +G   QLD PG PAGS+ QS S  DIEES+  L+ 
Sbjct: 258  NRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH- 316

Query: 971  GDGERSEDSRR 1003
             DG R   SRR
Sbjct: 317  SDGGRDRFSRR 327


>ref|XP_007051991.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao]
            gi|508704252|gb|EOX96148.1| Nucleotidyltransferase family
            protein isoform 1 [Theobroma cacao]
          Length = 722

 Score = 81.6 bits (200), Expect = 9e-13
 Identities = 107/371 (28%), Positives = 142/371 (38%), Gaps = 55/371 (14%)
 Frame = +2

Query: 56   MNVRGGDAPPP---NGGEFLLQLLRN-------------NKPPNSTHPILXXXXXXXXXX 187
            M   GG+AP P   NGGEFLL LL+              ++    T P            
Sbjct: 1    MTGNGGEAPSPPAANGGEFLLSLLQKPQQHLQQQQSPLFSRATPVTIPQPQQQQQQQQQQ 60

Query: 188  XXXXXXAVAAVGPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXX 367
                  AVAAVGP++P  PL      PSNG D                            
Sbjct: 61   PLVIDPAVAAVGPTLPFRPLW-----PSNGRDLPGLWPQ--------------------- 94

Query: 368  XXXXXARGFTHSPPQ------FDLQPKLISPG-----------DDARKLGFYG-DNSKP- 490
                     T SPP       F L P   SPG           DD R+LG  G DN+K  
Sbjct: 95   ---------TLSPPLAPNFLGFPLSP-WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNH 144

Query: 491  -----IAANQPGQNLIFGSLNRDVLNNGTV------NVLDQSFYRMNDNRFPSSSIDVNE 637
                 +      Q L+FGS   D+    T       N+L+ S   +++ +  S    +N 
Sbjct: 145  VIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSKLNLSNQQLDSR---LNS 201

Query: 638  NARGNPALLRGQKQERSTSSNDRLKQGDGGNHRVVAP-----PPGFSSNSKNVE-QREYG 799
            N   +P +     Q R++    + +Q  G      +P     PPGF    +     R++G
Sbjct: 202  NPNTSPYVF----QHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKPRGGGGNRDFG 257

Query: 800  YGRTS-DLNGDKGMCNYGQLQQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLNV 970
              R   + N DK    Y Q   ++ +G   QLD PG PAGS+ QS S  DIEES+  L+ 
Sbjct: 258  NRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH- 316

Query: 971  GDGERSEDSRR 1003
             DG R   SRR
Sbjct: 317  SDGGRDRFSRR 327


>gb|KJB49361.1| hypothetical protein B456_008G115100 [Gossypium raimondii]
          Length = 571

 Score = 81.3 bits (199), Expect = 1e-12
 Identities = 106/364 (29%), Positives = 152/364 (41%), Gaps = 40/364 (10%)
 Frame = +2

Query: 68   GGDAP---PPNGGEFLLQLLRNNKPPNSTH-----PILXXXXXXXXXXXXXXXX------ 205
            GGDAP     NGGEFLL LL+N  PP         P+L                      
Sbjct: 5    GGDAPREPTANGGEFLLSLLQN--PPQQQQQQRQSPLLSRVTPMLIPQPLQQQQLQQQPL 62

Query: 206  ----AVAAVGPSIPTYPLPHAAFPPSNGGDF------TFXXXXXXXXXXFAQHNYXXXXX 355
                AVAAVG     +PL   ++P SNG D       T           F Q+ +     
Sbjct: 63   PLDPAVAAVGR---VFPLQSPSWP-SNGRDLSTPWAQTISSPLVPNFLAFPQNPWS---- 114

Query: 356  XXXXXXXXXARGFTHSPPQFDLQPKLISPGDDARKLGF--YGDNSKPIAAN--------Q 505
                     + G      + DL        DD R+LGF    +NS  +           Q
Sbjct: 115  ---------SSGNQFVGNRGDLN-------DDLRRLGFPSVDNNSNNLIQQKHPEQQQQQ 158

Query: 506  PGQNLIFGSLNRDVLNNGTVNVLDQSFYRMNDNRFPSSSIDVNENARGNPALLRGQKQER 685
              Q L+FGS   D+       +L +    +N N F  S++D+++ A  +P   + Q    
Sbjct: 159  QQQKLVFGSFPSDI------QILQKPEGLLNGNLFDKSNLDLSKPANSSPYAFQHQ---- 208

Query: 686  STSSNDRLKQGDGGNHR-VVAPPPGFSSNSKNVE-QREYGYGRTS-DLNGDKGMCNYGQL 856
              S   + +Q   GN+R  + PPPGFS   +     R++G  R   + N DK    Y QL
Sbjct: 209  --SERGKQQQHHVGNYRETLRPPPGFSGKPRGGGGSRDFGARRNHLEHNVDKLRAEYSQL 266

Query: 857  QQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLN-VGDGERSEDSRRGVQAKLNN 1027
              ++ +G   QLD PG PAGS+ QS +  DI+ES+  L+  G G+  E   R V++ L  
Sbjct: 267  SNDNEMGLRGQLDHPGPPAGSNLQSGT--DIKESLMELHRFGGGQVDEVGERIVESLLIE 324

Query: 1028 DDSQ 1039
            ++S+
Sbjct: 325  EESE 328


>gb|KJB49360.1| hypothetical protein B456_008G115100 [Gossypium raimondii]
          Length = 579

 Score = 81.3 bits (199), Expect = 1e-12
 Identities = 106/364 (29%), Positives = 152/364 (41%), Gaps = 40/364 (10%)
 Frame = +2

Query: 68   GGDAP---PPNGGEFLLQLLRNNKPPNSTH-----PILXXXXXXXXXXXXXXXX------ 205
            GGDAP     NGGEFLL LL+N  PP         P+L                      
Sbjct: 5    GGDAPREPTANGGEFLLSLLQN--PPQQQQQQRQSPLLSRVTPMLIPQPLQQQQLQQQPL 62

Query: 206  ----AVAAVGPSIPTYPLPHAAFPPSNGGDF------TFXXXXXXXXXXFAQHNYXXXXX 355
                AVAAVG     +PL   ++P SNG D       T           F Q+ +     
Sbjct: 63   PLDPAVAAVGR---VFPLQSPSWP-SNGRDLSTPWAQTISSPLVPNFLAFPQNPWS---- 114

Query: 356  XXXXXXXXXARGFTHSPPQFDLQPKLISPGDDARKLGF--YGDNSKPIAAN--------Q 505
                     + G      + DL        DD R+LGF    +NS  +           Q
Sbjct: 115  ---------SSGNQFVGNRGDLN-------DDLRRLGFPSVDNNSNNLIQQKHPEQQQQQ 158

Query: 506  PGQNLIFGSLNRDVLNNGTVNVLDQSFYRMNDNRFPSSSIDVNENARGNPALLRGQKQER 685
              Q L+FGS   D+       +L +    +N N F  S++D+++ A  +P   + Q    
Sbjct: 159  QQQKLVFGSFPSDI------QILQKPEGLLNGNLFDKSNLDLSKPANSSPYAFQHQ---- 208

Query: 686  STSSNDRLKQGDGGNHR-VVAPPPGFSSNSKNVE-QREYGYGRTS-DLNGDKGMCNYGQL 856
              S   + +Q   GN+R  + PPPGFS   +     R++G  R   + N DK    Y QL
Sbjct: 209  --SERGKQQQHHVGNYRETLRPPPGFSGKPRGGGGSRDFGARRNHLEHNVDKLRAEYSQL 266

Query: 857  QQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLN-VGDGERSEDSRRGVQAKLNN 1027
              ++ +G   QLD PG PAGS+ QS +  DI+ES+  L+  G G+  E   R V++ L  
Sbjct: 267  SNDNEMGLRGQLDHPGPPAGSNLQSGT--DIKESLMELHRFGGGQVDEVGERIVESLLIE 324

Query: 1028 DDSQ 1039
            ++S+
Sbjct: 325  EESE 328


>ref|XP_012437624.1| PREDICTED: terminal uridylyltransferase 7-like [Gossypium raimondii]
            gi|763782288|gb|KJB49359.1| hypothetical protein
            B456_008G115100 [Gossypium raimondii]
          Length = 694

 Score = 81.3 bits (199), Expect = 1e-12
 Identities = 106/364 (29%), Positives = 152/364 (41%), Gaps = 40/364 (10%)
 Frame = +2

Query: 68   GGDAP---PPNGGEFLLQLLRNNKPPNSTH-----PILXXXXXXXXXXXXXXXX------ 205
            GGDAP     NGGEFLL LL+N  PP         P+L                      
Sbjct: 5    GGDAPREPTANGGEFLLSLLQN--PPQQQQQQRQSPLLSRVTPMLIPQPLQQQQLQQQPL 62

Query: 206  ----AVAAVGPSIPTYPLPHAAFPPSNGGDF------TFXXXXXXXXXXFAQHNYXXXXX 355
                AVAAVG     +PL   ++P SNG D       T           F Q+ +     
Sbjct: 63   PLDPAVAAVGR---VFPLQSPSWP-SNGRDLSTPWAQTISSPLVPNFLAFPQNPWS---- 114

Query: 356  XXXXXXXXXARGFTHSPPQFDLQPKLISPGDDARKLGF--YGDNSKPIAAN--------Q 505
                     + G      + DL        DD R+LGF    +NS  +           Q
Sbjct: 115  ---------SSGNQFVGNRGDLN-------DDLRRLGFPSVDNNSNNLIQQKHPEQQQQQ 158

Query: 506  PGQNLIFGSLNRDVLNNGTVNVLDQSFYRMNDNRFPSSSIDVNENARGNPALLRGQKQER 685
              Q L+FGS   D+       +L +    +N N F  S++D+++ A  +P   + Q    
Sbjct: 159  QQQKLVFGSFPSDI------QILQKPEGLLNGNLFDKSNLDLSKPANSSPYAFQHQ---- 208

Query: 686  STSSNDRLKQGDGGNHR-VVAPPPGFSSNSKNVE-QREYGYGRTS-DLNGDKGMCNYGQL 856
              S   + +Q   GN+R  + PPPGFS   +     R++G  R   + N DK    Y QL
Sbjct: 209  --SERGKQQQHHVGNYRETLRPPPGFSGKPRGGGGSRDFGARRNHLEHNVDKLRAEYSQL 266

Query: 857  QQNDRIG--NQLDLPGLPAGSSRQSASTLDIEESMKGLN-VGDGERSEDSRRGVQAKLNN 1027
              ++ +G   QLD PG PAGS+ QS +  DI+ES+  L+  G G+  E   R V++ L  
Sbjct: 267  SNDNEMGLRGQLDHPGPPAGSNLQSGT--DIKESLMELHRFGGGQVDEVGERIVESLLIE 324

Query: 1028 DDSQ 1039
            ++S+
Sbjct: 325  EESE 328


>ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244121 [Solanum
            lycopersicum]
          Length = 775

 Score = 76.3 bits (186), Expect = 4e-11
 Identities = 103/392 (26%), Positives = 142/392 (36%), Gaps = 68/392 (17%)
 Frame = +2

Query: 68   GGDAPPP---------NGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXAVAAV 220
            GGDA  P         NGGEFLLQLL+N+     + P                  AVAAV
Sbjct: 5    GGDAASPPLSSQSTPSNGGEFLLQLLQNHPHQLHSQP---QPPLRPELQNLPHDPAVAAV 61

Query: 221  GPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXXARGFTH 400
            GPS+P  PL H    PS                 F  HN+                    
Sbjct: 62   GPSMPYPPLFHTPTNPS-------VLPYSHSPPLFVPHNFFIRGFLQNPNSGHTTNPNYS 114

Query: 401  SPP------QFDLQPKLI--SPGDDARKLGFYGDNSKPIAANQP-GQNLIFGSLNRDVLN 553
            SPP      Q+     L   S G++   LG +G N+K   +N     NLIFGSL   +  
Sbjct: 115  SPPAPSGFSQYHHASPLGFGSVGENMGNLGIFGANAKASNSNNEFDHNLIFGSLRSHI-- 172

Query: 554  NGTVNVLDQSF------------YRMNDNRFPSSSI---------DVNENARGNPALLRG 670
             G V++++  F             + +++R  +  +         +V  + R     LRG
Sbjct: 173  QGNVSMMNDRFSDDLASKVGNFEQKNHESRLANVRMLNGVEGKLENVIGSGRKQLGNLRG 232

Query: 671  QKQERSTSSNDRLKQGDGG----------NHRVVAPPPGFSS------------NSKN-- 778
             +Q+ S       +   GG            R V PPPGFSS            N KN  
Sbjct: 233  LEQQNSGGGGGESESESGGLGWGRQFHSGTVRGVVPPPGFSSKPRSRDFEHNVDNEKNNF 292

Query: 779  VEQREYGYGRTSDLNGD-----KGMCNYGQLQQNDRIGNQLDLPGLPAGSSRQSASTLDI 943
            VE    G G       +     +   NY     + R+  +LD P  PAGS   S    D+
Sbjct: 293  VELNHRGIGLNHKYERESKHLSRNGKNYAIGSDDQRVFRRLDSPVPPAGSKLHSVLASDV 352

Query: 944  EESMKGLNVGDGERSEDSRRGVQAKLNNDDSQ 1039
            E+S   L   D E  E++   ++  L    +Q
Sbjct: 353  EDSTLELRGEDAESGEETVSVMRDVLGRSSAQ 384


>emb|CDP20563.1| unnamed protein product [Coffea canephora]
          Length = 776

 Score = 75.5 bits (184), Expect = 7e-11
 Identities = 108/406 (26%), Positives = 155/406 (38%), Gaps = 78/406 (19%)
 Frame = +2

Query: 68   GGDAPPP-NGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXA--------VAAV 220
            G +APPP +GGEFLLQLL+  KP    H +                 A        VAAV
Sbjct: 5    GSEAPPPVDGGEFLLQLLQ--KPSYHHHQLQHAPSPQPQPQLPPPPPAQILPHDPAVAAV 62

Query: 221  GPSIPTYPLPHAAFPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXX-----A 385
            G ++P   L +   PP       +          F  H+Y                   +
Sbjct: 63   GRTLPFQQLFNGPVPP-------WPHPHHQSPPPFVPHSYFLQNPSRPNPGPNPNPNSNS 115

Query: 386  RGFTHSPPQFD--------LQPKLISPGDDARKLGFYGDNSKPIAA-NQPGQNLIFGSLN 538
               + SPP F+        +Q  L + G+D RKLG  G+ S P +A  Q  +++IFGSL 
Sbjct: 116  NFSSASPPGFNQGNFQQHNVQFNLSAMGNDIRKLGNLGNPSNPSSAYTQDPKDIIFGSLR 175

Query: 539  -RDVLNNGT----------------VNVLDQSFYRMN--DNRFPSSSIDVNENARGNPAL 661
              DVL NG                 V++L+ +  R+N  +  F  +S+D   ++ G    
Sbjct: 176  GNDVLLNGNFGENLLVSQQEKSKLGVSLLEGNVGRLNGFEVEFLGNSVDQRVHSSG---- 231

Query: 662  LRGQKQERSTSSNDRL----KQGDGGNHRVVAPPPGF---SSNSKNVEQREYGY------ 802
            LRG    R  +S+        QG   N    A P GF     + K  + R+ G+      
Sbjct: 232  LRGYGNFRGDTSSRGTGYWGSQGPDRNDHRAAVPSGFIGQQMSGKEFDNRKKGFEHGGQK 291

Query: 803  -----GRTSDLNG---------DKGMCNYGQLQQNDRIGNQLDLPGLPAGSSRQSASTLD 940
                 G +  L+G          +   N G    +  +  QLD PG P GS+ QS S  D
Sbjct: 292  VGSNFGESRFLSGKNEKERRFLSRKAGNDGDCLDDRGLSVQLDCPGSPPGSTLQSVSASD 351

Query: 941  IEESMK---------GLNVGDGERSEDSRRGVQAKLNNDDSQMDDL 1051
            +E+ M+         G   G+G R      G     +N     DDL
Sbjct: 352  VEDPMRTFHEEDSKGGKIFGNGRRKNSKEDG-----HNGHEDFDDL 392


>ref|XP_010518183.1| PREDICTED: uncharacterized protein LOC104793522 [Camelina sativa]
          Length = 767

 Score = 65.1 bits (157), Expect = 9e-08
 Identities = 94/353 (26%), Positives = 131/353 (37%), Gaps = 48/353 (13%)
 Frame = +2

Query: 80   PPPNGGEFLLQLLRNNKPPNSTHPILXXXXXXXXXXXXXXXXAVAAVGPSIPTYPLPHAA 259
            P  N G+F+L LL+ N  P+ +                    A+AAVGP++  +P P   
Sbjct: 13   PSDNVGDFILSLLQQNTRPSPSQQ--------QQGGPQHLDPAIAAVGPTVNPFP-PSIW 63

Query: 260  FPPSNGGDFTFXXXXXXXXXXFAQHNYXXXXXXXXXXXXXXARGFTHSP---PQFDLQPK 430
               SN G              F+   +                 FTH+P    QFD   +
Sbjct: 64   QSSSNNGPGGHHNPSSSWPLAFSPPPHPNLSPNLLGFPQ-----FTHNPFTANQFDSNQR 118

Query: 431  LISPGDDARKLGFYGDNSKPIAA------------NQPGQNLIFGSLNRDV------LNN 556
            L SP +DA +LGF G  + PI +                + L+FGS + D       L N
Sbjct: 119  L-SP-EDAYRLGFPGTGTHPIQSMIQQQQLPPPPPQSETRQLVFGSFSGDATQSLNGLRN 176

Query: 557  GTVNVLDQSFYRMNDNRFPSSSIDVNENARGNPALLRGQK-QERSTSSNDRLKQGDGGNH 733
            G  N++  S +     R    S+ +N N   N +  R     E     + R   G  GN+
Sbjct: 177  G--NLMYDSKHHEQLMRNHPQSVVLNPNTDPNLSHHRNHDVNELRGGHSGRGNWGPIGNN 234

Query: 734  ------RVVAPPPGFSSNSK---NVEQREYGYGRTSDLNGDKGMCNYGQLQ--------- 859
                      PPPGFSSN +   N+     G       N DK M  +             
Sbjct: 235  VRGFKSTPTPPPPGFSSNQRGWDNMNLASKGDDSFQRNNHDKAMWEHSSFNAEADRLRGL 294

Query: 860  --QND---RIGNQLDLPGLPAGSSRQSASTLDIEESMKGLNV---GDGERSED 994
              QN+    +  Q+D PG P G+S  S ST D   S+  LN    G  ER E+
Sbjct: 295  SIQNESKFNLSQQIDHPGPPKGTSLHSVSTADAANSLSMLNKEARGRIERKEE 347

