BLASTX nr result

ID: Atropa21_contig00038296 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00038296
         (743 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   295   8e-78
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   293   3e-77
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]           289   8e-76
gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum]    253   6e-65
ref|XP_006353601.1| PREDICTED: uncharacterized protein LOC102586...   234   2e-59
gb|ABI34333.1| Gag-pol polyprotein, putative [Solanum demissum]       211   2e-52
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]     210   5e-52
ref|XP_004237286.1| PREDICTED: uncharacterized protein LOC101250...   208   1e-51
ref|XP_004253493.1| PREDICTED: uncharacterized protein LOC101265...   202   7e-50
gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]         201   2e-49
gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]   201   3e-49
ref|XP_006347259.1| PREDICTED: uncharacterized protein LOC102584...   200   5e-49
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   200   5e-49
ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502...   196   7e-48
gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao]   192   1e-46
ref|XP_004250589.1| PREDICTED: uncharacterized protein LOC101263...   192   1e-46
gb|EOX93842.1| Uncharacterized protein TCM_002794 [Theobroma cacao]   191   2e-46
gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobrom...   190   5e-46
gb|EOY19083.1| DNA/RNA polymerases superfamily protein [Theobrom...   189   6e-46
gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobrom...   189   8e-46

>gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum]
          Length = 1602

 Score =  295 bits (756), Expect = 8e-78
 Identities = 143/234 (61%), Positives = 180/234 (76%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            LLDPG++LS+VTP +A KF + P+ + EPF VSTPVGES++A RVYR+C V I    T+ 
Sbjct: 484  LLDPGASLSFVTPYVANKFDVLPERLCEPFCVSTPVGESILAERVYRDCPVSINHKSTMV 543

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DL+EL+MVDFD+I+GMDWL +C A++DCRT++V FQFP EP+LEW  ++A  KGRFISYL
Sbjct: 544  DLIELDMVDFDVILGMDWLHACYASIDCRTRVVKFQFPSEPILEWSSSSAVPKGRFISYL 603

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            KA+K++SKG IYHL RV D  VE P  QS+P+V EF +VFPD L  +PPEREI+FGID++
Sbjct: 604  KARKLVSKGCIYHLARVNDSSVEIPYFQSVPIVREFPEVFPDDLPGIPPEREIDFGIDLI 663

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            PDT+PISIPPYRMAPA               GFIRP++SP GA V FVRKK+GS
Sbjct: 664  PDTRPISIPPYRMAPA----ELKELKDLLEKGFIRPSVSPWGAPVLFVRKKDGS 713


>gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1515

 Score =  293 bits (751), Expect = 3e-77
 Identities = 142/234 (60%), Positives = 180/234 (76%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            LLDPG++LS+VTP +A KF + P+ + EPF VSTPVGES++A RVYR+C V I    T+ 
Sbjct: 478  LLDPGASLSFVTPYVANKFDVLPERLCEPFCVSTPVGESILAERVYRDCPVSINHKSTMV 537

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DL+EL+MVDFD+I+GMDWL +C A++DCRT++V FQFP EP+LEW  ++A  KGRFISYL
Sbjct: 538  DLIELDMVDFDVILGMDWLHACYASIDCRTRVVKFQFPSEPILEWSSSSAVPKGRFISYL 597

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            KA+K++SKG IYHL RV D  VE P  QS+P+V EF +VFP+ L  +PPEREI+FGID++
Sbjct: 598  KARKLVSKGCIYHLARVNDSSVEIPYFQSVPIVREFPEVFPNDLPGIPPEREIDFGIDLI 657

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            PDT+PISIPPYRMAPA               GFIRP++SP GA V FVRKK+GS
Sbjct: 658  PDTRPISIPPYRMAPA----ELKELKDLLEKGFIRPSVSPWGAPVLFVRKKDGS 707


>gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]
          Length = 1554

 Score =  289 bits (739), Expect = 8e-76
 Identities = 141/234 (60%), Positives = 176/234 (75%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            LLDPG +LS+VT  +A KF + P+ + EPF VSTPVGES++A RVYR+C   I    T+A
Sbjct: 559  LLDPGVSLSFVTLYVANKFDVLPERLCEPFCVSTPVGESILAERVYRDCPDSINHKSTMA 618

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DLVEL+MVDFD+I+GM+WL +C A++DCRT++V FQFP EPV EW  ++A  KGRFISYL
Sbjct: 619  DLVELDMVDFDVILGMNWLHACYASLDCRTRVVKFQFPNEPVFEWSSSSAVPKGRFISYL 678

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            KA+K++SKG IYHL+RV D  VE P  QS+P+V EF  VFPD L  +PPEREI+FGID++
Sbjct: 679  KARKLVSKGCIYHLVRVHDSSVEIPHFQSVPIVREFPKVFPDDLPGIPPEREIDFGIDLI 738

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            PDT PISIPPYRMAP+               GFIRP++SP GA V FVRKK+GS
Sbjct: 739  PDTHPISIPPYRMAPSELKELKEQLKDLLDKGFIRPSVSPWGAPVLFVRKKDGS 792


>gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum]
          Length = 1487

 Score =  253 bits (645), Expect = 6e-65
 Identities = 119/196 (60%), Positives = 154/196 (78%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            LLDPG++LS+VTP +A KF +  + + EPF VSTPVGES++A RVY +C V I    T+A
Sbjct: 425  LLDPGASLSFVTPYVANKFDVLLERLCEPFCVSTPVGESILAERVYCDCPVSINHKSTMA 484

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DLV+L+MVDFD+I GMDWL +C  ++DCRT++V FQFP EPV+EW  ++   KG FISYL
Sbjct: 485  DLVDLDMVDFDVISGMDWLHACYTSLDCRTRVVKFQFPNEPVIEWSSSSVVPKGCFISYL 544

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            KA+K++SKG +YHL+RV D  V+ P  QS+P+V EF +VFPD L  +P EREIEFGI ++
Sbjct: 545  KARKLVSKGCVYHLVRVHDSSVKMPPFQSVPIVREFPEVFPDDLPGIPSEREIEFGIGLI 604

Query: 579  PDTQPISIPPYRMAPA 626
            PDT+PISIPPYRMAPA
Sbjct: 605  PDTRPISIPPYRMAPA 620


>ref|XP_006353601.1| PREDICTED: uncharacterized protein LOC102586067 [Solanum tuberosum]
          Length = 881

 Score =  234 bits (598), Expect = 2e-59
 Identities = 119/234 (50%), Positives = 154/234 (65%)
 Frame = +3

Query: 39  LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
           L+DPG+ LS+VTP +A KF +E +L+ E +EVSTP+G S+VAR+VYRNC           
Sbjct: 367 LIDPGATLSFVTPLVARKFHVESELLHESYEVSTPIGVSIVARKVYRNCPY--------- 417

Query: 219 DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
                                  A++DCRT+ V F+FP EPVLEW+     +KG+FIS +
Sbjct: 418 -----------------------ASIDCRTRKVKFRFPNEPVLEWESRDVVVKGKFISCI 454

Query: 399 KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
           KA ++ISKG +YH++RV DV+ + P ++SIPVVNEFLDVFP+ L  +PPEREI+ GID+L
Sbjct: 455 KAHRLISKGCLYHIVRVNDVESKVPPIESIPVVNEFLDVFPEDLPGVPPEREIDLGIDLL 514

Query: 579 PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
           PDTQPISIPPYRMAPA               GFIRP+ SP GA V FV+KK+GS
Sbjct: 515 PDTQPISIPPYRMAPAELKELKEQLKDLLEKGFIRPSHSPWGAPVLFVKKKDGS 568


>gb|ABI34333.1| Gag-pol polyprotein, putative [Solanum demissum]
          Length = 800

 Score =  211 bits (537), Expect = 2e-52
 Identities = 110/217 (50%), Positives = 142/217 (65%)
 Frame = +3

Query: 90  KFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLADLVELEMVDFDIIMGMD 269
           KF + P+++ EPF VSTPV + VV +RVYR+C + +    TL DLVELEM+DFD+I+GMD
Sbjct: 237 KFEIPPEVLVEPFSVSTPVYDLVVIKRVYRSCPISLSHRVTLVDLVELEMLDFDVILGMD 296

Query: 270 WLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYLKAKKMISKGYIYHLIRV 449
           WL +  A +DCR ++V FQFP EP+LEWKG     +G+F+S LKA+KMIS G IY L+RV
Sbjct: 297 WLHAYYAYIDCRIRVVRFQFPNEPILEWKGGNYISRGQFVSCLKARKMISNGCIYQLLRV 356

Query: 450 RDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVLPDTQPISIPPYRMAPAX 629
           RDV+ +  +L+S+PVVN F  VFPD              +  LP++QPISIPPYRMA   
Sbjct: 357 RDVESKTSSLESVPVVNVFPKVFPDDC------------LVFLPNSQPISIPPYRMASVE 404

Query: 630 XXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
                         GFIR +ISP G  V FVR K+GS
Sbjct: 405 LKELKEQLKDFLVKGFIRSSISPWGDPVLFVRNKDGS 441


>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]
          Length = 1771

 Score =  210 bits (534), Expect = 5e-52
 Identities = 108/234 (46%), Positives = 147/234 (62%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L DPGS  SYV    A + GM    + EP  VSTPVGES+V  ++ R+C V I    T  
Sbjct: 666  LFDPGSTFSYVFVYFAPRLGMRSASLTEPIHVSTPVGESLVVDQILRSCLVTIQGCDTRV 725

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DL+ L+MVDFD+I+GMDWL+   A +DC  K V    PG   + W+G  +      IS++
Sbjct: 726  DLILLDMVDFDVILGMDWLSPYHAVLDCYAKTVTLAMPGISPVLWQGAYSHTPTWIISFM 785

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            +A+++++ G + +L  VRDV  +  ++ S+PVV EF DVFP  L  LPP+R+I+F ID+ 
Sbjct: 786  RARRLVASGCLAYLAYVRDVSRDDSSVDSVPVVREFADVFPIDLPGLPPDRDIDFAIDLE 845

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            PDT+PISIPPYRMAPA               GFIRP++SP GA V FV+KK+G+
Sbjct: 846  PDTRPISIPPYRMAPAELRELSAQLEDLLGKGFIRPSVSPWGAPVLFVKKKDGT 899


>ref|XP_004237286.1| PREDICTED: uncharacterized protein LOC101250208 [Solanum
           lycopersicum]
          Length = 497

 Score =  208 bits (530), Expect = 1e-51
 Identities = 99/196 (50%), Positives = 138/196 (70%)
 Frame = +3

Query: 39  LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
           LLDPG   S+VT  +A KF + P ++ EPF +STPVG S+V  RVY+ C + + +     
Sbjct: 302 LLDPGPTSSFVTLLVAMKFDILPDILDEPFLISTPVGASMVVDRVYKGCPISLPNRVAFV 361

Query: 219 DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
           DL+EL M++ D+I GM+ L +C A++DCRT++V   FP EPV +WKG  +  KG  IS L
Sbjct: 362 DLIELHMLNIDVIFGMNRLHACFASIDCRTRVVKIPFPNEPVFQWKGGNSNPKGNIISCL 421

Query: 399 KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
           K+ K+I+KG IYH++R RD + E P ++S+PVV EF ++FPD L  +  EREI+F ID+L
Sbjct: 422 KSCKVIAKGSIYHIVRGRDFESEVPPIESVPVVREFPEIFPDDLPGISLEREIDFNIDLL 481

Query: 579 PDTQPISIPPYRMAPA 626
            +TQPISIPPYRM+ A
Sbjct: 482 SNTQPISIPPYRMSLA 497


>ref|XP_004253493.1| PREDICTED: uncharacterized protein LOC101265119 [Solanum
           lycopersicum]
          Length = 518

 Score =  202 bits (515), Expect = 7e-50
 Identities = 109/235 (46%), Positives = 144/235 (61%)
 Frame = +3

Query: 39  LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
           L DPGS  SYV+ S A    +  +L+  P  VSTPVGESVV  +VYR+C V     +T  
Sbjct: 238 LFDPGSTFSYVSSSFANGLNLHCELLDMPIRVSTPVGESVVVEKVYRSCVVNFVGSKTSV 297

Query: 219 DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
           DLV L M DF +I+GM  L+   A +DC  K V    PG   L W+G+  +   R +S+L
Sbjct: 298 DLVILAMDDFGVILGMTCLSPQFAILDCNAKTVTLAKPGTDPLVWEGDYTSNPVRIVSFL 357

Query: 399 KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
           +A+KMISKG +  L  ++D   + P ++S  VV EFLDVFP +L  +PP+R+I+F ID+ 
Sbjct: 358 RARKMISKGCLAFLAHLKDDTTQVPWIESFSVVREFLDVFPAELPGMPPDRDIDFCIDLE 417

Query: 579 PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGSF 743
           P T+PI IPPYRMAPA               GFIRP+ SP GA + FV+KK+GSF
Sbjct: 418 PGTRPIFIPPYRMAPAELSELKAQLQELLNKGFIRPSASPWGAPILFVKKKDGSF 472


>gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]
          Length = 654

 Score =  201 bits (512), Expect = 2e-49
 Identities = 103/234 (44%), Positives = 145/234 (61%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L DPG+  S+++P  A + G       E   VSTP+ E  VA   Y +C V +    T  
Sbjct: 343  LFDPGATHSFISPCFASRLGRGRVRREEQLVVSTPLKEIFVAEWEYESCVVRVKDKDTSV 402

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            +LV L+ +DFD+I+GM+WL+ C A+VDC  K+V F FPGEP    +G+ +      IS +
Sbjct: 403  NLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNLISVI 462

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
             A++++ +G I +L  V+D   +   +  + VV EF+DVFP++L  LPPERE+EF ID++
Sbjct: 463  SARRLLRQGCIGYLAVVKDSQAKIGDVTQVSVVKEFVDVFPEELPGLPPEREVEFCIDLI 522

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            PDT+PISIPPYRMAPA               GFIRP++SP GA V FV+KK+GS
Sbjct: 523  PDTRPISIPPYRMAPAELKELKDQLEDLLDKGFIRPSVSPWGAPVLFVKKKDGS 576


>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score =  201 bits (510), Expect = 3e-49
 Identities = 105/234 (44%), Positives = 147/234 (62%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L DPGS  SYV+   A +  M  + +  P  VSTPVGES+V  +V            T A
Sbjct: 502  LFDPGSTFSYVSVYYASRLSMMSEPLVAPLRVSTPVGESLVVDQV----------RDTRA 551

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DL+ L+MVDFD+I+GMDWL+   A +DC +K V    PG P + W+G+  +     IS++
Sbjct: 552  DLILLDMVDFDVILGMDWLSPYRAVLDCFSKTVTLAIPGIPPVVWQGSRGSTPVGVISFI 611

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            +A+++++ G + +L  VRDV  E P ++S+PVV +F+DVFP  L  LPPER+I+F I++ 
Sbjct: 612  RARRLVASGCLSYLAYVRDVSREVPPVESVPVVRDFIDVFPTDLPGLPPERDIDFPIELE 671

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            P T+PISIPPYRMAPA               GFIRP++SP GA V FV+KK+G+
Sbjct: 672  PGTRPISIPPYRMAPAELKELSVQLQDLLGKGFIRPSVSPWGAPVLFVKKKDGT 725


>ref|XP_006347259.1| PREDICTED: uncharacterized protein LOC102584611 [Solanum tuberosum]
          Length = 1107

 Score =  200 bits (508), Expect = 5e-49
 Identities = 104/234 (44%), Positives = 145/234 (61%)
 Frame = +3

Query: 39  LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
           L DPGS  SYV+     + GM  + + EP  VSTPVGES+V  ++ R+C V I    T  
Sbjct: 60  LFDPGSTFSYVSFYFVPRLGMRSESLAEPVHVSTPVGESLVVDQILRSCLVTIQCCDTRV 119

Query: 219 DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
           DL+ L+MVDFD+I+GMDWL+   A +D   K V    PG   + W+   +      IS++
Sbjct: 120 DLILLDMVDFDVILGMDWLSPYHAVLDFYAKTVTLAMPGISPVLWQSAYSHTPTGIISFM 179

Query: 399 KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
           +A+++++ G + +L  VRDV  E  ++ S+PVV EF DVFP  L  LPPER+I+F I++ 
Sbjct: 180 RARRLVASGCLAYLAYVRDVSREGSSVDSVPVVREFADVFPTDLPGLPPERDIDFSIELE 239

Query: 579 PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
           P T+PISIPPYRMAPA               GFIRP++SP G+ V FV+KK+G+
Sbjct: 240 PGTRPISIPPYRMAPAELRELSVQLEDLLGKGFIRPSVSPWGSPVLFVKKKDGT 293


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  200 bits (508), Expect = 5e-49
 Identities = 103/234 (44%), Positives = 145/234 (61%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L DPG+  S+++   A + G       E   VSTP+ E  VA   Y +C V +    T  
Sbjct: 362  LFDPGATHSFISTCFASRLGRGRVRREEQLVVSTPLKEIFVAEWEYESCVVRVKDKDTSV 421

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            +LV L+ +DFD+I+GM+WL+ C A+VDC  K+V F FPGEP    +G+ +      IS +
Sbjct: 422  NLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNLISVI 481

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
             A++++ +G I +L  V+D   +   +  + VV EF+DVFP++L SLPPERE+EF ID++
Sbjct: 482  SARRLLRQGCIGYLAVVKDSQAKIGDVTQVSVVKEFVDVFPEELPSLPPEREVEFCIDLI 541

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            PDT+PISIPPYRMAPA               GFIRP++SP GA V FV+KK+GS
Sbjct: 542  PDTRPISIPPYRMAPAELKELKDQLEDLLDKGFIRPSVSPWGAPVLFVKKKDGS 595


>ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502180 [Cicer arietinum]
          Length = 1235

 Score =  196 bits (498), Expect = 7e-48
 Identities = 102/234 (43%), Positives = 142/234 (60%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L D G+  S+V+   A + G     + EP  V+TPVG +++A+ VYR C + I       
Sbjct: 575  LFDLGATHSFVSSWFATRLGKCSSSLEEPLVVATPVGGNLLAKSVYRCCDITIDGKVFSV 634

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DLV ++++DFD+I+GMDWLA   A +DC  K+V F+ PG+ V  ++G    +    I  L
Sbjct: 635  DLVVIDLIDFDVILGMDWLAFHHATLDCHDKVVKFEIPGQSVFSFQGERCWVPHNQILAL 694

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
             A K++ +G   ++  VRD  V    L+ IP+  EF DVFP++L  LPP+REIEF ID++
Sbjct: 695  AASKLMRRGCQAYIALVRDTQVAEEKLEKIPIACEFPDVFPEELPGLPPDREIEFSIDLV 754

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            P+T PISIPPYRMAPA               GFIRP+ SP GA V FV+KK+GS
Sbjct: 755  PNTHPISIPPYRMAPAKLKELREQLQDLLDKGFIRPSSSPWGAPVLFVKKKDGS 808


>gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao]
          Length = 649

 Score =  192 bits (487), Expect = 1e-46
 Identities = 103/234 (44%), Positives = 138/234 (58%)
 Frame = +3

Query: 39  LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
           L+D GS+ SYV+ + A         + E   + TP+GE +V    YR+C V +   +   
Sbjct: 166 LIDSGSDRSYVSTTFASIADRNLSPLEEEIVIHTPLGEKLVRNSCYRDCGVRVGEEEFRG 225

Query: 219 DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
           DL+ LE++DFD+I+GMDWL +  ANVDC  K V  +      + + G    L    IS +
Sbjct: 226 DLIPLEILDFDLILGMDWLTAHRANVDCFRKEVVLRNSEGAEIVFVGKRRVLPSCVISAI 285

Query: 399 KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
           KA K++ KGY  +L  V D     P L+ +P+V+EF DVFPD L  LPP+RE+EF ID+L
Sbjct: 286 KASKLVQKGYPTYLAYVIDTSKGEPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLL 345

Query: 579 PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
           P T PISIPPYRMAPA               GFIRP+ISP GA V FV+KK+G+
Sbjct: 346 PGTAPISIPPYRMAPAELKELKVQLQELVDKGFIRPSISPWGAPVLFVKKKDGT 399


>ref|XP_004250589.1| PREDICTED: uncharacterized protein LOC101263780 [Solanum
           lycopersicum]
          Length = 508

 Score =  192 bits (487), Expect = 1e-46
 Identities = 93/196 (47%), Positives = 130/196 (66%)
 Frame = +3

Query: 39  LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
           L DPGS  SYV+   A + GM  + + EP  VSTP+GE +V  +V R+C V I  + T A
Sbjct: 249 LFDPGSTFSYVSIYFAPRLGMRSESLEEPVHVSTPIGEFLVVDQVLRSCLVTIQGYDTRA 308

Query: 219 DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
           DL+ L+M+DFD+I+GMDWL+   A +DC  K V    PG P + W+   +      IS++
Sbjct: 309 DLIMLDMIDFDVILGMDWLSPYHAVLDCYAKTVTLSMPGVPSVLWQAAYSHTPTGIISFI 368

Query: 399 KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
           +A++++S G + +L  +RDV  E P++ S+PVV E+ DVFP  L  LPPER+I+F ID+ 
Sbjct: 369 RARRLVSSGCLAYLAHIRDVSREGPSVDSVPVVREYADVFPTDLPCLPPERDIDFAIDLE 428

Query: 579 PDTQPISIPPYRMAPA 626
           P T+PISIPPYRMAPA
Sbjct: 429 PGTRPISIPPYRMAPA 444


>gb|EOX93842.1| Uncharacterized protein TCM_002794 [Theobroma cacao]
          Length = 509

 Score =  191 bits (486), Expect = 2e-46
 Identities = 99/234 (42%), Positives = 141/234 (60%)
 Frame = +3

Query: 39  LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
           L DP +  S+++   A + G       E   VSTP+ E  V    Y +C V +    T  
Sbjct: 124 LFDPSATHSFISLCFASRLGRGRVRREEQLVVSTPLKEIFVVEWEYESCVVRVQDKDTSV 183

Query: 219 DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
           +LV L+ +DFD+I+GM+WL+ C A+VDC  K+V F FPGEP    +G+ +      IS +
Sbjct: 184 NLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNLISVI 243

Query: 399 KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            A++++ +G I +L  V+D   +   +  + VV EF+DVFP++L  LPPERE+EF ID++
Sbjct: 244 SARRLLRQGCIGYLAVVKDSQAKIGDVTQVSVVKEFVDVFPEELPGLPPEREVEFCIDLI 303

Query: 579 PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
           PD +PISIPPYRMAPA               GFIRP++SP GA V FV+KK+GS
Sbjct: 304 PDIRPISIPPYRMAPAELKELKDQLEDLLDKGFIRPSVSPWGAPVLFVKKKDGS 357


>gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1336

 Score =  190 bits (482), Expect = 5e-46
 Identities = 101/234 (43%), Positives = 138/234 (58%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L+D GS+ SYV+ + A         + E   + TP+GE +V    YR+C V +   +   
Sbjct: 356  LIDSGSDRSYVSTTFASIAARNLSPLEEEIVIHTPLGEKLVRNSCYRDCGVRVGEEEFRG 415

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DL+ L+++DFD+I+GMDWL +  ANVDC  K V  +      + + G    L    IS +
Sbjct: 416  DLIPLKILDFDLILGMDWLTTHRANVDCFRKEVVLRNSEGAEIVFVGKHRVLPSCVISAI 475

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            KA K++ KGY  +L  V D     P L+ +P+V+EF DVFPD L  LPP+RE+EF ID+L
Sbjct: 476  KASKLVQKGYPTYLAYVIDTSKGEPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLL 535

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            P T PISIPPYRMAPA               GFIRP+ISP GA + FV+KK+G+
Sbjct: 536  PGTAPISIPPYRMAPAELKELKVQLQELVDKGFIRPSISPWGAPILFVKKKDGT 589


>gb|EOY19083.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 906

 Score =  189 bits (481), Expect = 6e-46
 Identities = 101/234 (43%), Positives = 136/234 (58%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L+D GS+ SYV+ + A         + E   V TP+GE ++    YR+C V +   +   
Sbjct: 439  LIDSGSDRSYVSTTFASITDRNLSPLEEEIVVHTPLGEQLIRNTCYRDCGVRVGEEEFRG 498

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DL+ LE++DFD+I+GMDWL +  ANVDC  K V  +      + + G    L    IS +
Sbjct: 499  DLIPLEILDFDLILGMDWLTAHRANVDCFRKEVVLRNSEGAEIVFVGERRVLPSYVISAI 558

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            K  K++ KGY  +L  V D     P L+ +P+V+EF DVFPD L  +PP RE+EF ID+L
Sbjct: 559  KVSKLVQKGYPTYLAYVIDTSKGEPKLEDVPIVSEFSDVFPDNLPRIPPNRELEFPIDLL 618

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            P T PISIPPYRMAPA               GFIRP+ISP GA V FV+KK+G+
Sbjct: 619  PSTVPISIPPYRMAPAELKELKAQLQDLVDKGFIRPSISPWGAPVLFVKKKDGT 672


>gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1537

 Score =  189 bits (480), Expect = 8e-46
 Identities = 101/234 (43%), Positives = 137/234 (58%)
 Frame = +3

Query: 39   LLDPGSNLSYVTPSIACKFGMEPKLIREPFEVSTPVGESVVARRVYRNCSVLICSHQTLA 218
            L+D GS+ SYV+ + A         + E   V TP+GE ++    YR+C V +   +   
Sbjct: 406  LIDSGSDRSYVSTTFASITDRNLSPLEEEIVVHTPLGEQLIRNTCYRDCGVRVGEEEFRG 465

Query: 219  DLVELEMVDFDIIMGMDWLASC*ANVDCRTKIV*FQFPGEPVLEWKGNTATLKGRFISYL 398
            DL+ LE++DFD+I+GMDWL +  AN+DC  K V  +      + + G    L    IS +
Sbjct: 466  DLIPLEILDFDLILGMDWLTTHRANLDCFRKEVVLRNSEGAEIVFVGERRVLPSCVISAI 525

Query: 399  KAKKMISKGYIYHLIRVRDVDVEPPTLQSIPVVNEFLDVFPDKLSSLPPEREIEFGIDVL 578
            KA K++ KGY  +L  V D     P L+ +P+V+EF DVFPD L  +PP RE+EF ID+L
Sbjct: 526  KASKLVQKGYPTYLAYVIDTSKGEPKLEDVPIVSEFPDVFPDDLPGIPPNRELEFPIDLL 585

Query: 579  PDTQPISIPPYRMAPAXXXXXXXXXXXXXXXGFIRPNISP*GASVFFVRKKNGS 740
            P T PISIPPYRMAPA               GFIRP+ISP GA V FV+KK+G+
Sbjct: 586  PGTAPISIPPYRMAPAELKELKAQLQDLVDKGFIRPSISPWGAPVLFVKKKDGT 639


Top