BLASTX nr result

ID: Atropa21_contig00029413 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00029413
         (719 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...   190   4e-46
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]         190   4e-46
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   182   1e-43
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   181   2e-43
ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256...   176   9e-42
emb|CAH66066.1| OSIGBa0092O07.1 [Oryza sativa Indica Group] gi|1...   125   2e-41
gb|ABA91307.1| retrotransposon protein, putative, Ty3-gypsy subc...   128   9e-41
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]    171   2e-40
gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]                  171   3e-40
gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobrom...   171   3e-40
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...   170   5e-40
gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa]              169   6e-40
gb|EOY32249.1| CCHC-type integrase [Theobroma cacao]                  169   8e-40
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   169   8e-40
gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]   169   1e-39
gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao]        166   7e-39
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   151   2e-34
gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative...   149   1e-33
gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobrom...   147   3e-33
emb|CAA73042.1| polyprotein [Ananas comosus]                          146   7e-33

>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score =  190 bits (482), Expect = 4e-46
 Identities = 103/147 (70%), Positives = 113/147 (76%), Gaps = 1/147 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            EGFVVYCDAS VGLGCVLMQ+ KVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL
Sbjct: 1303 EGFVVYCDASRVGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAAVVFALKIWRHYL 1362

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YGVHV++FT+HKSL+Y+F Q  LNL        L+          G ANVVADALSR SM
Sbjct: 1363 YGVHVDVFTDHKSLQYVFTQKDLNLRQRRWLEFLKDYDMSVHYHPGKANVVADALSRVSM 1422

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRL 280
             SL HV+   R MA+EVHRLA LGVRL
Sbjct: 1423 GSLAHVDIGDREMAREVHRLARLGVRL 1449



 Score =  104 bits (259), Expect = 3e-20
 Identities = 70/170 (41%), Positives = 91/170 (53%), Gaps = 8/170 (4%)
 Frame = -2

Query: 487  FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH---GE 317
            F   ++  R RRWLE LK+YD+ V  HPG         +V+ + L      +  H   G+
Sbjct: 1380 FTQKDLNLRQRRWLEFLKDYDMSVHYHPGKA-------NVVADALSRVSMGSLAHVDIGD 1432

Query: 316  GSTPISQFRSSTFGLE-----DSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHK 152
                    R +  G+      +  VV  +   SSLV E   KQ  D++LL+LK  + + K
Sbjct: 1433 REMAREVHRLARLGVRLEEVGNGGVVVVDGARSSLVDEVIAKQDLDSSLLELKALVKEGK 1492

Query: 151  TTAFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
               F QGGD G LRY+GRLCVP VDGLRE+I+ E H S YSIH GS KMY
Sbjct: 1493 VEVFSQGGD-GALRYQGRLCVPCVDGLREKILEEAHNSSYSIHPGSTKMY 1541


>gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]
          Length = 4543

 Score =  190 bits (482), Expect = 4e-46
 Identities = 99/149 (66%), Positives = 114/149 (76%), Gaps = 1/149 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            +GFVV+CDAS VGLGCVLMQ+DKVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL
Sbjct: 734  QGFVVHCDASRVGLGCVLMQNDKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYL 793

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YGVHV+IFT+HKSL+Y+  Q +LNL        L+  +       G ANVVAD+LSR SM
Sbjct: 794  YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSRLSM 853

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274
             S  H+EE  R + K+VHRLA LGVR  D
Sbjct: 854  GSTAHIEEGRRELTKDVHRLACLGVRFTD 882



 Score =  190 bits (482), Expect = 4e-46
 Identities = 99/149 (66%), Positives = 114/149 (76%), Gaps = 1/149 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            +GFVV+CDAS VGLGCVLMQ+DKVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL
Sbjct: 2244 QGFVVHCDASRVGLGCVLMQNDKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYL 2303

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YGVHV+IFT+HKSL+Y+  Q +LNL        L+  +       G ANVVAD+LSR SM
Sbjct: 2304 YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSRLSM 2363

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274
             S  H+EE  R + K+VHRLA LGVR  D
Sbjct: 2364 GSTAHIEEGRRELTKDVHRLACLGVRFTD 2392



 Score =  190 bits (482), Expect = 4e-46
 Identities = 99/149 (66%), Positives = 114/149 (76%), Gaps = 1/149 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            +GFVV+CDAS VGLGCVLMQ+DKVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL
Sbjct: 3754 QGFVVHCDASRVGLGCVLMQNDKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYL 3813

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YGVHV+IFT+HKSL+Y+  Q +LNL        L+  +       G ANVVAD+LSR SM
Sbjct: 3814 YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSRLSM 3873

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274
             S  H+EE  R + K+VHRLA LGVR  D
Sbjct: 3874 GSTAHIEEGRRELTKDVHRLACLGVRFTD 3902



 Score =  122 bits (306), Expect = 1e-25
 Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 8/166 (4%)
 Frame = -2

Query: 475  EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH-GEGSTPIS 299
            E+  R RRWLELLK+Y + +L HPG         +V+ + L      +  H  EG   ++
Sbjct: 815  ELNLRQRRWLELLKDYVLSILYHPGKA-------NVVADSLSRLSMGSTAHIEEGRRELT 867

Query: 298  Q--FRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140
            +   R +  G+  +D     +   NR ESSLV E K+KQ  D  LL+LK  + K +  AF
Sbjct: 868  KDVHRLACLGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAF 927

Query: 139  EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            EQGGD G LRY+GRLCVP VDGL+E+IM E H S YS+H GS KMY
Sbjct: 928  EQGGD-GALRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMY 972



 Score =  122 bits (306), Expect = 1e-25
 Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 8/166 (4%)
 Frame = -2

Query: 475  EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH-GEGSTPIS 299
            E+  R RRWLELLK+Y + +L HPG         +V+ + L      +  H  EG   ++
Sbjct: 2325 ELNLRQRRWLELLKDYVLSILYHPGKA-------NVVADSLSRLSMGSTAHIEEGRRELT 2377

Query: 298  Q--FRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140
            +   R +  G+  +D     +   NR ESSLV E K+KQ  D  LL+LK  + K +  AF
Sbjct: 2378 KDVHRLACLGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAF 2437

Query: 139  EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            EQGGD G LRY+GRLCVP VDGL+E+IM E H S YS+H GS KMY
Sbjct: 2438 EQGGD-GALRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMY 2482



 Score =  122 bits (306), Expect = 1e-25
 Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 8/166 (4%)
 Frame = -2

Query: 475  EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH-GEGSTPIS 299
            E+  R RRWLELLK+Y + +L HPG         +V+ + L      +  H  EG   ++
Sbjct: 3835 ELNLRQRRWLELLKDYVLSILYHPGKA-------NVVADSLSRLSMGSTAHIEEGRRELT 3887

Query: 298  Q--FRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140
            +   R +  G+  +D     +   NR ESSLV E K+KQ  D  LL+LK  + K +  AF
Sbjct: 3888 KDVHRLACLGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAF 3947

Query: 139  EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            EQGGD G LRY+GRLCVP VDGL+E+IM E H S YS+H GS KMY
Sbjct: 3948 EQGGD-GALRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMY 3992


>gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1515

 Score =  182 bits (461), Expect = 1e-43
 Identities = 95/149 (63%), Positives = 111/149 (74%), Gaps = 1/149 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            +G VVYCDAS +GLGCVLMQ+ KVIAYASRQLK HEKNYPTHDLELA V+FALK+W HYL
Sbjct: 952  QGLVVYCDASRIGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAVVVFALKLWRHYL 1011

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YGVHV+IFT+HKSL+Y+  Q +LNL        L+          G ANVVAD+LSR SM
Sbjct: 1012 YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYDLSILYHPGKANVVADSLSRLSM 1071

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274
             S  H+EE  R +AK++HRLA LGVR  D
Sbjct: 1072 GSTTHIEEGRRELAKDMHRLACLGVRFTD 1100



 Score =  125 bits (314), Expect = 1e-26
 Identities = 73/166 (43%), Positives = 96/166 (57%), Gaps = 8/166 (4%)
 Frame = -2

Query: 475  EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH---GEGSTP 305
            E+  R RRWLELLK+YD+ +L HPG         +V+ + L      +  H   G     
Sbjct: 1033 ELNLRQRRWLELLKDYDLSILYHPGKA-------NVVADSLSRLSMGSTTHIEEGRRELA 1085

Query: 304  ISQFRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140
                R +  G+  +D     +   ++ ESSL++E KEKQ  D  LL+LK  + K +  AF
Sbjct: 1086 KDMHRLACLGVRFTDSTEGGIAVTSKAESSLMSEVKEKQDQDPILLELKANVQKQRVLAF 1145

Query: 139  EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            EQGGD G LRY+GRLCVP VDGL+ER+M E H S YS+H GS KMY
Sbjct: 1146 EQGGD-GVLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMY 1190


>gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum]
          Length = 1602

 Score =  181 bits (459), Expect = 2e-43
 Identities = 95/149 (63%), Positives = 110/149 (73%), Gaps = 1/149 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            +G VVYCDAS +GLGCVLMQ+ KVIAYASRQLK HEKNYPTHDLELA V+FALK+W HYL
Sbjct: 958  QGLVVYCDASRIGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAVVVFALKLWRHYL 1017

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YGVHV+IFT+HKSL+Y+  Q  LNL        L+          G ANVVAD+LSR SM
Sbjct: 1018 YGVHVDIFTDHKSLQYVLTQKALNLRQRRWLELLKDYDLSILYHPGKANVVADSLSRLSM 1077

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274
             S  H+EE  R +AK++HRLA LGVR  D
Sbjct: 1078 GSTTHIEEGRRELAKDMHRLACLGVRFTD 1106



 Score =  123 bits (309), Expect = 5e-26
 Identities = 72/165 (43%), Positives = 95/165 (57%), Gaps = 8/165 (4%)
 Frame = -2

Query: 472  VEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH---GEGSTPI 302
            +  R RRWLELLK+YD+ +L HPG         +V+ + L      +  H   G      
Sbjct: 1040 LNLRQRRWLELLKDYDLSILYHPGKA-------NVVADSLSRLSMGSTTHIEEGRRELAK 1092

Query: 301  SQFRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFE 137
               R +  G+  +D     +   ++ ESSL++E KEKQ  D  LL+LK  + K +  AFE
Sbjct: 1093 DMHRLACLGVRFTDSTEGGIAVTSKAESSLMSEVKEKQDQDPILLELKANVQKQRVLAFE 1152

Query: 136  QGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            QGGD G LRY+GRLCVP VDGL+ER+M E H S YS+H GS KMY
Sbjct: 1153 QGGD-GVLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMY 1196


>ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256304 [Solanum
           lycopersicum]
          Length = 647

 Score =  176 bits (445), Expect = 9e-42
 Identities = 94/147 (63%), Positives = 111/147 (75%), Gaps = 1/147 (0%)
 Frame = -3

Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
           +G+V+YCDASGVGLGCVLMQH KVIAYASRQL+ HEKNY THDLELA VI A+KIW HYL
Sbjct: 417 DGYVIYCDASGVGLGCVLMQHGKVIAYASRQLRKHEKNYRTHDLELAVVIHAMKIWMHYL 476

Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTM-I*MFSAIQGANVVADALSR*SM 361
           YGVHV+I+T+HKSL+YIFKQ +LNL        L+   I +      AN+VADALSR SM
Sbjct: 477 YGVHVDIYTDHKSLQYIFKQKELNLRQRRWLELLKDYDIDILYHPGKANIVADALSRKSM 536

Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRL 280
            SL  V+ + R M  E+  L+SLGVRL
Sbjct: 537 GSLTDVQPERRDMVWEIQWLSSLGVRL 563


>emb|CAH66066.1| OSIGBa0092O07.1 [Oryza sativa Indica Group]
            gi|116309115|emb|CAH66220.1| OSIGBa0157N01.6 [Oryza
            sativa Indica Group]
          Length = 1445

 Score =  125 bits (314), Expect(2) = 2e-41
 Identities = 59/83 (71%), Positives = 66/83 (79%)
 Frame = -3

Query: 711  FVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLYG 532
            F VYCDAS  GLGCVLMQ  +V+AYASRQL+PH  NYPTHDLELAAV+ ALKIW HYL G
Sbjct: 875  FQVYCDASRQGLGCVLMQEGRVVAYASRQLRPHVTNYPTHDLELAAVVHALKIWRHYLIG 934

Query: 531  VHVNIFTNHKSLKYIFKQWKLNL 463
                ++T+HKSLKYIF Q  LNL
Sbjct: 935  NRCEVYTDHKSLKYIFTQPDLNL 957



 Score = 70.5 bits (171), Expect(2) = 2e-41
 Identities = 45/162 (27%), Positives = 69/162 (42%)
 Frame = -2

Query: 487  FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENHGEGST 308
            F   ++  R RRWLEL+K+YD+ +  HPG                               
Sbjct: 950  FTQPDLNLRQRRWLELIKDYDMGIHYHPGK-----------------------------A 980

Query: 307  PISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQGG 128
             +        G+ +   V       +LV + +  Q ND  + +LK+ +   K   F +  
Sbjct: 981  NVQDLEHLNLGIVEHGYVAALEARPTLVDQVRAAQVNDPKIAELKKNMRVGKAREFHED- 1039

Query: 127  DDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            + GT+    RLCVP+   L++ I+ E H + YSIH GS KMY
Sbjct: 1040 EHGTIWLGERLCVPDDKELKDLILTEAHQTQYSIHPGSTKMY 1081


>gb|ABA91307.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1284

 Score =  128 bits (321), Expect(2) = 9e-41
 Identities = 60/83 (72%), Positives = 69/83 (83%)
 Frame = -3

Query: 711  FVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLYG 532
            FV+YCDAS  GLG VLMQ  KV+AYASRQL+PHE+NYPTHDLELAAV+ AL IW HYL G
Sbjct: 838  FVIYCDASRQGLGGVLMQDGKVVAYASRQLRPHEENYPTHDLELAAVVHALNIWRHYLIG 897

Query: 531  VHVNIFTNHKSLKYIFKQWKLNL 463
             H +I+T+HK+LKYIF Q  LNL
Sbjct: 898  NHCDIYTDHKNLKYIFTQSDLNL 920



 Score = 65.9 bits (159), Expect(2) = 9e-41
 Identities = 50/164 (30%), Positives = 74/164 (45%), Gaps = 2/164 (1%)
 Frame = -2

Query: 487  FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENHGEGST 308
            F   ++  R RRWLEL+K+YD++V  HPG         +V+ + L      N    EG  
Sbjct: 959  FIQSDLNLRQRRWLELIKDYDLEVHYHPGKA-------NVVADALSRKSHCNHLEMEGMA 1011

Query: 307  PISQFRSSTFGLE--DSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQ 134
            P  +   +   L       V     +  L  + +E Q ++  + ++KE +       F  
Sbjct: 1012 PELKEELAQLNLHIVPRGQVNTLDIQPLLRTQIEEAQKDNEEIREVKERLAAGFAKEFST 1071

Query: 133  GGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
               D  L Y+ R+ VP   GLR  I+ E H S YS+H GS KMY
Sbjct: 1072 DEKD-VLWYKKRIYVPKQGGLRGLILKEAHESAYSLHPGSTKMY 1114


>gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1480

 Score =  171 bits (433), Expect = 2e-40
 Identities = 89/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%)
 Frame = -3

Query: 714  GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
            G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY
Sbjct: 938  GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 997

Query: 534  GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
            G    I+T+HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 998  GETCEIYTDHKSLKYIFQQRDLNLRQHRWMELLKDYDCTILYHPGKANVVADALSRKSMG 1057

Query: 357  SLEHVEEQTRTMAKEVHRLASLGVRL 280
            SL H+    R++ +E+H L  +GVRL
Sbjct: 1058 SLAHISIGRRSLVREIHSLGDIGVRL 1083



 Score = 78.6 bits (192), Expect = 2e-12
 Identities = 57/168 (33%), Positives = 81/168 (48%), Gaps = 6/168 (3%)
 Frame = -2

Query: 487  FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326
            FQ  ++  R  RW+ELLK+YD  +L HPG     ++  SR S  S+ H  +       E 
Sbjct: 1014 FQQRDLNLRQHRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 1073

Query: 325  HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146
            H  G   +    + T  L     +   R    L+   KE Q  D  +++  E     K  
Sbjct: 1074 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 1128

Query: 145  AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
             F +G  DG LRY  RL VP+ DGLR  I+ E H++ Y +H G++KMY
Sbjct: 1129 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGALKMY 1175


>gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]
          Length = 395

 Score =  171 bits (432), Expect = 3e-40
 Identities = 88/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%)
 Frame = -3

Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
           G+ V+CDASG+GLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY
Sbjct: 44  GYTVFCDASGIGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 103

Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
           G    I+T+HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 104 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 163

Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280
           SL H+    R++ +E+H L  +GVRL
Sbjct: 164 SLAHISIGRRSLVREIHSLGDIGVRL 189



 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 6/168 (3%)
 Frame = -2

Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326
           FQ  ++  R RRW+ELLK+YD  +L HPG     ++  SR S  S+ H  +       E 
Sbjct: 120 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 179

Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146
           H  G   +    + T  L     +   R    L+   KE Q  D  +++  E     K  
Sbjct: 180 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 234

Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            F +G  DG LRY  RL VP+ DGLR  I+ E H++ Y +H G+ KMY
Sbjct: 235 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 281


>gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1502

 Score =  171 bits (432), Expect = 3e-40
 Identities = 89/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%)
 Frame = -3

Query: 714  GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
            G++V+CDASGVGLGCVLMQH KVIAYASRQLK HE NYP HDLE+AA++FALKIW HYLY
Sbjct: 933  GYMVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEHNYPIHDLEMAAIVFALKIWRHYLY 992

Query: 534  GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
            G    I+T+HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 993  GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 1052

Query: 357  SLEHVEEQTRTMAKEVHRLASLGVRL 280
            SL H+    R++ +E+H L  +GVRL
Sbjct: 1053 SLAHISIGRRSLVREIHSLGDIGVRL 1078



 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 6/168 (3%)
 Frame = -2

Query: 487  FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326
            FQ  ++  R RRW+ELLK+YD  +L HPG     ++  SR S  S+ H  +       E 
Sbjct: 1009 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 1068

Query: 325  HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146
            H  G   +    + T  L     +   R    L+   KE Q  D  +++  E     K  
Sbjct: 1069 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 1123

Query: 145  AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
             F +G  DG LRY  RL VP+ DGLR  I+ E H++ Y +H G+ KMY
Sbjct: 1124 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 1170


>gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 878

 Score =  170 bits (430), Expect = 5e-40
 Identities = 89/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%)
 Frame = -3

Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
           G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY
Sbjct: 418 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 477

Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
           G    I+T+HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 478 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 537

Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280
           SL H+    R++ +E+H L  +GVRL
Sbjct: 538 SLAHIFIGRRSLVREIHSLGDIGVRL 563



 Score = 78.2 bits (191), Expect = 2e-12
 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 6/168 (3%)
 Frame = -2

Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326
           FQ  ++  R RRW+ELLK+YD  +L HPG     ++  SR S  S+ H  +       E 
Sbjct: 494 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHIFIGRRSLVREI 553

Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146
           H  G   +    + T  L     +   R    L+   KE Q  D  +++  E     K  
Sbjct: 554 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 608

Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            F +G  DG LRY  RL VP+ DGLR  I+ E H++ Y +H G+ KMY
Sbjct: 609 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 655


>gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa]
          Length = 2037

 Score =  169 bits (429), Expect = 6e-40
 Identities = 91/145 (62%), Positives = 102/145 (70%), Gaps = 1/145 (0%)
 Frame = -3

Query: 714  GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
            G+ VYCDAS VGLGCVLMQH KVIAYASRQLK HE+NYPTHDLE+ AVIFALKIW HYLY
Sbjct: 1787 GYTVYCDASRVGLGCVLMQHGKVIAYASRQLKKHEQNYPTHDLEMTAVIFALKIWRHYLY 1846

Query: 534  GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
            G    IFT+HKSLKYIF+Q  LNL        L+          G ANVVADALSR S  
Sbjct: 1847 GETCEIFTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTIHYHPGKANVVADALSRKSSG 1906

Query: 357  SLEHVEEQTRTMAKEVHRLASLGVR 283
            SL H++E  R + +E+H L   GVR
Sbjct: 1907 SLAHIQEVRRPLIRELHELVDEGVR 1931



 Score = 86.3 bits (212), Expect = 9e-15
 Identities = 61/171 (35%), Positives = 88/171 (51%), Gaps = 9/171 (5%)
 Frame = -2

Query: 487  FQAMEVEFRLRRWLELLKNYDIDVLCHPGSE------CGSRCS*SVIH--EKLRTC*RT- 335
            FQ  ++  R RRW+ELLK+YD  +  HPG           + S S+ H  E  R   R  
Sbjct: 1863 FQQRDLNLRQRRWMELLKDYDCTIHYHPGKANVVADALSRKSSGSLAHIQEVRRPLIREL 1922

Query: 334  NENHGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKH 155
            +E   EG     +F  S  G     ++   + +S L  + K  Q  D +LL+++  + + 
Sbjct: 1923 HELVDEGV----RFDLSEAGA----MIAHFQVKSDLFDKIKAAQKKDDSLLRIRNEVEQG 1974

Query: 154  KTTAFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            K   F  G DD  LRY+ RLCVP+VD LR  +M E H ++Y++H GS KMY
Sbjct: 1975 KAAGFVIGDDD-VLRYKDRLCVPDVDDLRRELMVEAHQTVYTVHPGSTKMY 2024


>gb|EOY32249.1| CCHC-type integrase [Theobroma cacao]
          Length = 282

 Score =  169 bits (428), Expect = 8e-40
 Identities = 88/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%)
 Frame = -3

Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
           G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY
Sbjct: 22  GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 81

Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
           G    I+T+HKSLKYIF+Q  L+L        L+          G ANVVADALSR SM 
Sbjct: 82  GETCEIYTDHKSLKYIFQQRDLDLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 141

Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280
           SL H+    R++ +E+H L  +GVRL
Sbjct: 142 SLAHISIGRRSLVREIHSLGDIGVRL 167



 Score = 80.5 bits (197), Expect = 5e-13
 Identities = 58/168 (34%), Positives = 82/168 (48%), Gaps = 6/168 (3%)
 Frame = -2

Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326
           FQ  +++ R RRW+ELLK+YD  +L HPG     ++  SR S  S+ H  +       E 
Sbjct: 98  FQQRDLDLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 157

Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146
           H  G   +    + T  L     +   R    L+   KE Q  D  +++  E     K  
Sbjct: 158 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFMIKALEDPQGRKGK 212

Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            F +G  DG LRY  RL VP+ DGLR  I+ E H++ Y +H G+ KMY
Sbjct: 213 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 259


>gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 666

 Score =  169 bits (428), Expect = 8e-40
 Identities = 88/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%)
 Frame = -3

Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
           G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP H+LE+AA++FALKIW HYLY
Sbjct: 135 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHNLEMAAIVFALKIWRHYLY 194

Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
           G    I+T+HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 195 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 254

Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280
           SL H+    R++ +E+H L  +GVRL
Sbjct: 255 SLAHISIGRRSLVREIHSLGDIGVRL 280



 Score = 80.5 bits (197), Expect = 5e-13
 Identities = 58/168 (34%), Positives = 83/168 (49%), Gaps = 6/168 (3%)
 Frame = -2

Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326
           FQ  ++  R RRW+ELLK+YD  +L HPG     ++  SR S  S+ H  +       E 
Sbjct: 211 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 270

Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146
           H  G   +    + T  L     +   R    L+ + KE Q  D  +++  E     K  
Sbjct: 271 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDKIKEAQSKDEFVIKALEDPQGRKGK 325

Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            F +G  DG LRY  RL VP+ DGLR +I+ E H++ Y +H G+ KMY
Sbjct: 326 MFTKG-TDGVLRYGTRLYVPDGDGLRRKILEEAHMAAYVVHPGATKMY 372


>gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]
          Length = 860

 Score =  169 bits (427), Expect = 1e-39
 Identities = 88/146 (60%), Positives = 104/146 (71%), Gaps = 1/146 (0%)
 Frame = -3

Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
           G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY
Sbjct: 252 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 311

Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
           G    I+ +HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 312 GETCEIYMDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 371

Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280
           SL H+    R++ +E+H L  +GVRL
Sbjct: 372 SLAHISIGRRSLVREIHSLGDIGVRL 397



 Score = 80.5 bits (197), Expect = 5e-13
 Identities = 58/168 (34%), Positives = 83/168 (49%), Gaps = 6/168 (3%)
 Frame = -2

Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326
           FQ  ++  R RRW+ELLK+YD  +L HPG     ++  SR S  S+ H  +       E 
Sbjct: 328 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 387

Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146
           H  G   +    + T     S ++   R    L+ + KE Q  D  +++  E     K  
Sbjct: 388 HSLGDIGVRLEVAET-----SALLAHFRVRPILMDKIKEAQSKDEFVIKALEDPQGRKGK 442

Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            F +G  DG LRY  RL VP+ DGLR  I+ E H++ Y +H G+ KMY
Sbjct: 443 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 489


>gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao]
          Length = 508

 Score =  166 bits (420), Expect = 7e-39
 Identities = 86/143 (60%), Positives = 102/143 (71%), Gaps = 1/143 (0%)
 Frame = -3

Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
           G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY
Sbjct: 355 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 414

Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
           G    I+T+HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 415 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 474

Query: 357 SLEHVEEQTRTMAKEVHRLASLG 289
           SL H+    R++ +E+H L  +G
Sbjct: 475 SLAHISIGRRSLVREIHSLGDIG 497


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  151 bits (382), Expect = 2e-34
 Identities = 80/125 (64%), Positives = 92/125 (73%), Gaps = 1/125 (0%)
 Frame = -3

Query: 714  GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
            G+ ++CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY
Sbjct: 841  GYTMFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 900

Query: 534  GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
            G    I+T+HKSLKYIF+Q  LNL        L+          G ANVVADALSR SM 
Sbjct: 901  GETCEIYTDHKSLKYIFQQRDLNLRQCRWMELLKDYDCTILYHPGKANVVADALSRKSMG 960

Query: 357  SLEHV 343
            SL H+
Sbjct: 961  SLAHI 965



 Score = 70.5 bits (171), Expect = 5e-10
 Identities = 52/162 (32%), Positives = 73/162 (45%)
 Frame = -2

Query: 487  FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENHGEGST 308
            FQ  ++  R  RW+ELLK+YD  +L HPG         +V+ + L               
Sbjct: 917  FQQRDLNLRQCRWMELLKDYDCTILYHPGKA-------NVVADALS-------------- 955

Query: 307  PISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQGG 128
                 R S   L    +V        L+ + KE Q  D  +++  E     K   F +G 
Sbjct: 956  -----RKSMGSLAHISIV-----RPILMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKG- 1004

Query: 127  DDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
             DG LRY  RL VP+ DGLR  I+ E H++ Y +H G+ KMY
Sbjct: 1005 TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 1046


>gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative [Theobroma cacao]
          Length = 1347

 Score =  149 bits (375), Expect = 1e-33
 Identities = 84/147 (57%), Positives = 100/147 (68%), Gaps = 1/147 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            +GFVVY DAS +GLGCVLMQ +KV+AYASRQLK HE NYPTHDLELAAV+FALKIW HYL
Sbjct: 725  KGFVVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKIWRHYL 784

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YG H  IFT+HKSLKY+  Q +LNL        ++    +     G ANVVADALSR S 
Sbjct: 785  YGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHPGKANVVADALSRKSS 844

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRL 280
             SL  ++         +  + SLGV+L
Sbjct: 845  SSLAALQS---CYFSALIEMKSLGVQL 868



 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 51/163 (31%), Positives = 81/163 (49%), Gaps = 5/163 (3%)
 Frame = -2

Query: 475  EVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*SVIHEKLRTC*RTNENHGEGS 311
            E+  R RRWLEL+K+YD+ +  HPG     ++  SR S S +   L++C      +    
Sbjct: 806  ELNLRQRRWLELIKDYDLVIDYHPGKANVVADALSRKSSSSL-AALQSC------YFSAL 858

Query: 310  TPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQG 131
              +          ED  V+       SL+ + K+ Q +D  L +  + +     + F + 
Sbjct: 859  IEMKSLGVQLRNGEDGSVLANFIVRPSLLNQIKDIQRSDDELRKEIQKLTDGGVSEF-RF 917

Query: 130  GDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            G+D  L +R R+CVP  + LR+ IM E H S Y+++ GS KMY
Sbjct: 918  GEDNVLMFRDRVCVPEGNQLRQTIMEEAHSSAYALNPGSTKMY 960


>gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1515

 Score =  147 bits (372), Expect = 3e-33
 Identities = 83/147 (56%), Positives = 100/147 (68%), Gaps = 1/147 (0%)
 Frame = -3

Query: 717  EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
            +GF+VY DAS +GLGCVLMQ +KV+AYASRQLK HE NYPTHDLELAAV+FALKIW HYL
Sbjct: 871  KGFIVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKIWRHYL 930

Query: 537  YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
            YG H  IFT+HKSLKY+  Q +LNL        ++    +     G ANVVADALSR S 
Sbjct: 931  YGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHLGKANVVADALSRKSS 990

Query: 360  KSLEHVEEQTRTMAKEVHRLASLGVRL 280
             SL  ++         +  + SLGV+L
Sbjct: 991  SSLAALQS---CYFPALIEMKSLGVQL 1014



 Score = 63.9 bits (154), Expect = 5e-08
 Identities = 49/163 (30%), Positives = 80/163 (49%), Gaps = 5/163 (3%)
 Frame = -2

Query: 475  EVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*SVIHEKLRTC*RTNENHGEGS 311
            E+  R RRWLEL+K+YD+ +  H G     ++  SR S S +   L++C      +    
Sbjct: 952  ELNLRQRRWLELIKDYDLVIDYHLGKANVVADALSRKSSSSL-AALQSC------YFPAL 1004

Query: 310  TPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQG 131
              +          ED  ++       SL+ + K+ Q +D  L +  + +     + F + 
Sbjct: 1005 IEMKSLGVQLRNGEDGSLLANFIVRPSLLNQIKDIQRSDDELRKEIQKLTDGGVSEF-RF 1063

Query: 130  GDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            G+D  L ++ R+CVP  + LR+ IM E H S Y++H GS KMY
Sbjct: 1064 GEDNVLMFKDRVCVPEGNQLRQAIMEEAHSSAYALHPGSTKMY 1106


>emb|CAA73042.1| polyprotein [Ananas comosus]
          Length = 871

 Score =  146 bits (368), Expect = 7e-33
 Identities = 84/140 (60%), Positives = 100/140 (71%), Gaps = 2/140 (1%)
 Frame = -3

Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535
           G+VVY DAS  GLGCVLMQ DKVIAYASRQLK +EKNYPTHDLELAAV+FALK+W HYLY
Sbjct: 312 GYVVYSDASLNGLGCVLMQDDKVIAYASRQLKEYEKNYPTHDLELAAVVFALKLWRHYLY 371

Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358
           G    ++T+HKSLKY+F Q +LNL        L+          G ANVVADALSR SM+
Sbjct: 372 GERCEVYTDHKSLKYLFTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALSRKSME 431

Query: 357 SLE-HVEEQTRTMAKEVHRL 301
           +L  HV  Q R + +++ RL
Sbjct: 432 NLAMHVVTQPR-LIEQMKRL 450



 Score = 79.7 bits (195), Expect = 9e-13
 Identities = 60/171 (35%), Positives = 85/171 (49%), Gaps = 9/171 (5%)
 Frame = -2

Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*SVIHEKLRTC*RTNENH 323
           F   E+  R RRWLELLK+YD+ +L HPG     ++  SR S   +   + T  R  E  
Sbjct: 388 FTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALSRKSMENLAMHVVTQPRLIEQM 447

Query: 322 G----EGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKH 155
                E  TP +  R  T  ++ +           L+   KEKQ +D  L ++K  +   
Sbjct: 448 KRLELEIVTPDTPMRLMTLVVQPT-----------LLDRIKEKQASDVELQKIKGKMVDG 496

Query: 154 KTTAFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
            T  F   GD G +R+RGR+CVP   G++E I+ E H + Y+IH G  KMY
Sbjct: 497 CTGDFTLDGD-GLMRFRGRICVPADSGIKEDILQEAHRAPYAIHPGGTKMY 546

