BLASTX nr result

ID: Papaver31_contig00051718 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00051718
         (989 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]   405   e-110
emb|CAN59997.1| hypothetical protein VITISV_020888 [Vitis vinifera]   397   e-108
gb|AIG55302.1| gag-pol, partial [Camellia sinensis]                   389   e-105
ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342...   389   e-105
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...   379   e-102
ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus ...   375   e-101
ref|XP_012567311.1| PREDICTED: uncharacterized protein LOC105851...   374   e-101
emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera]   374   e-101
emb|CAA73042.1| polyprotein [Ananas comosus]                          373   e-100
ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [The...   372   e-100
ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobrom...   367   6e-99
ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [The...   367   1e-98
ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417...   366   1e-98
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...   366   2e-98
ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634...   365   2e-98
ref|XP_007099710.1| Retrotransposon protein, Ty3-gypsy subclass,...   365   2e-98
ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [The...   365   2e-98
ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   364   7e-98
ref|XP_007028157.1| DNA/RNA polymerases superfamily protein [The...   364   7e-98
emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]   363   1e-97

>emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]
          Length = 1573

 Score =  405 bits (1040), Expect = e-110
 Identities = 194/335 (57%), Positives = 260/335 (77%), Gaps = 6/335 (1%)
 Frame = -1

Query: 989  EFNLQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSI----EEKETTVYACN 822
            +F L YHPGKANVVADALSRKS G L SLG   +++Y  +E   +    E +   +Y+ +
Sbjct: 1078 DFALHYHPGKANVVADALSRKSYGQLFSLGLREFEMYAVIEDFELCLVQEGRGPCLYSIS 1137

Query: 821  LIAQPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPG--VLR 648
              A+P+++Q IV+ Q + + L ++  +LV G   + W++  DG +R+ GRL VP    LR
Sbjct: 1138 --ARPMVIQRIVEAQVHDEFLEKVKAQLVAGEIDENWSMYEDGSVRFKGRLCVPKDVELR 1195

Query: 647  RKVMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSG 468
             +++ +AH++KY+IHPG +KMY+DL+RQF WSGMK+D+A++V+ C  CQQVK EH+RP+ 
Sbjct: 1196 NELLADAHRAKYTIHPGNTKMYQDLKRQFXWSGMKRDIAQFVANCQICQQVKAEHQRPAE 1255

Query: 467  LLQPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQ 288
            LLQPLP+ +WKW++ITMDFV GLPRT S ++ VWVIVDRLTKSAHFL +K TD++ +L++
Sbjct: 1256 LLQPLPIPKWKWDNITMDFVIGLPRTRSKKNGVWVIVDRLTKSAHFLAMKTTDSMNSLAK 1315

Query: 287  LYVKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQI 108
            LY++EIVRLHG+P+SIVSDRDP+FTS+FW+S Q+A+ T L  ST FHPQTDGQSERVIQI
Sbjct: 1316 LYIQEIVRLHGIPVSIVSDRDPKFTSQFWQSLQRALGTQLNFSTVFHPQTDGQSERVIQI 1375

Query: 107  LEDMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            LEDMLRAC LDF G+W D LPL EF+YNN YQ+SI
Sbjct: 1376 LEDMLRACVLDFGGNWADYLPLAEFAYNNXYQSSI 1410


>emb|CAN59997.1| hypothetical protein VITISV_020888 [Vitis vinifera]
          Length = 893

 Score =  397 bits (1020), Expect = e-108
 Identities = 191/335 (57%), Positives = 261/335 (77%), Gaps = 6/335 (1%)
 Frame = -1

Query: 989  EFNLQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEG----LSIEEKETTVYACN 822
            +F L YHPGKANVVADALSRK+ G L+SL    ++++  +E     L +E     +Y+  
Sbjct: 384  DFALHYHPGKANVVADALSRKNVGQLSSLELREFEMHAVIEDFELCLGLEGHGPCLYS-- 441

Query: 821  LIAQPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPGV--LR 648
            ++A+P+++Q IV+ Q + + L ++  +LV G   + W++  DG + + GRL VP    LR
Sbjct: 442  ILARPMVIQRIVEAQVHDEFLEKVKAQLVAGEIDENWSMYEDGSVWFKGRLCVPKDVGLR 501

Query: 647  RKVMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSG 468
             +++ +AHK+KY+IHPG +KMY+DL+RQFW +GMK+D+A++V+ C  CQQVK EH+RP+G
Sbjct: 502  NELLADAHKAKYTIHPGNTKMYQDLKRQFWCNGMKRDIAQFVANCQICQQVKAEHQRPAG 561

Query: 467  LLQPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQ 288
            LLQPLP+ EWKW++ITMDFV  LPRT S ++ VWVIVDRLTKSAHFL +K T+++ +L++
Sbjct: 562  LLQPLPIPEWKWDNITMDFVIRLPRTRSKKNGVWVIVDRLTKSAHFLAMKTTNSMNSLAK 621

Query: 287  LYVKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQI 108
            LY++EIVRLHG P+SIVSDRDP+FTS+FW+S Q+A+ T L  STAFHPQTDGQSERVIQI
Sbjct: 622  LYIQEIVRLHGKPVSIVSDRDPKFTSQFWQSLQRALGTQLNFSTAFHPQTDGQSERVIQI 681

Query: 107  LEDMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            LEDMLRAC LDF G+W D LPL EF+YNNSYQ+++
Sbjct: 682  LEDMLRACVLDFGGNWADYLPLAEFAYNNSYQSNL 716


>gb|AIG55302.1| gag-pol, partial [Camellia sinensis]
          Length = 923

 Score =  389 bits (1000), Expect = e-105
 Identities = 182/331 (54%), Positives = 242/331 (73%), Gaps = 2/331 (0%)
 Frame = -1

Query: 989  EFNLQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSIEEKETTVYAC--NLI 816
            +F L  HPGKANVVADALSRK+   +A +    W++  A+    +   E+   A   +++
Sbjct: 384  DFELHCHPGKANVVADALSRKTISDVACIAIREWEMLGALGEFDLLLGESVEAAALFSVV 443

Query: 815  AQPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPGVLRRKVM 636
            AQP LV  +++ Q+   ++  +  K+  G    G T+  +  +RY  RL VP   R +V+
Sbjct: 444  AQPTLVTRVLEAQRGDLEIESLREKISSGKVEKGLTVYPEQSVRYRDRLFVPESCREEVL 503

Query: 635  DEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQP 456
             E H S+ ++HPG +KMY+DL RQFWW GMK+DVA +VS+CLTCQQVK EH+RP+GLLQP
Sbjct: 504  GEFHHSRLAVHPGGTKMYQDLGRQFWWRGMKRDVAVFVSKCLTCQQVKAEHQRPAGLLQP 563

Query: 455  LPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYVK 276
            LP+AEWKWEHITMDFV GLPRT  G D +WV+VDRLTKSAHF+P++  D++  L+ LY++
Sbjct: 564  LPIAEWKWEHITMDFVVGLPRTQRGSDAIWVVVDRLTKSAHFIPMRVRDSMDHLADLYIR 623

Query: 275  EIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILEDM 96
            ++VRLHGVP++IVSDRDP FT++ W+S Q A+ T L  STA+HPQTDGQSER IQILEDM
Sbjct: 624  DVVRLHGVPVTIVSDRDPCFTARLWQSLQSALGTKLTFSTAYHPQTDGQSERTIQILEDM 683

Query: 95   LRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            LR C LDF G+W   LPL+EF+YNNS+Q+SI
Sbjct: 684  LRGCVLDFSGTWERHLPLVEFAYNNSFQSSI 714


>ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342989 [Prunus mume]
          Length = 1162

 Score =  389 bits (999), Expect = e-105
 Identities = 182/296 (61%), Positives = 229/296 (77%), Gaps = 2/296 (0%)
 Frame = -1

Query: 884  LYEAVEGLSIEEKETTVYACNLIAQPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTI 705
            L++ ++G+  +      Y  NL+  P L+ +I++ Q++     +I  K+  G  PDGW +
Sbjct: 495  LFDQLQGMHQDH----AYLYNLVTHPTLIGKIIEAQESDSVSQDIRTKITTGETPDGWNV 550

Query: 704  QSDGGLRYFGRLVVPGV--LRRKVMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVA 531
             +DGGLRY  RL VP +  LR++V+ E H S Y+IHPG +KMY DL+R FWW+GMK+D+ 
Sbjct: 551  HADGGLRYLDRLYVPEISDLRKEVLKEGHHSFYTIHPGGTKMYLDLKRNFWWNGMKRDIE 610

Query: 530  EYVSRCLTCQQVKIEHRRPSGLLQPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDR 351
            ++V++CLTCQQVK EH++PSG LQPLPVAEWKW+HITMDFVTGLPR+  GRD +WVIVDR
Sbjct: 611  KFVAKCLTCQQVKAEHQKPSGSLQPLPVAEWKWDHITMDFVTGLPRSPKGRDAIWVIVDR 670

Query: 350  LTKSAHFLPIKKTDTLGTLSQLYVKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTT 171
            LTKSAHFLP+K T++   L +LYV+EIVRLHG+P+SIVSDRD +FTSKFW S QKA+ T 
Sbjct: 671  LTKSAHFLPVKTTESTENLGKLYVREIVRLHGIPVSIVSDRDSKFTSKFWGSLQKALGTQ 730

Query: 170  LCLSTAFHPQTDGQSERVIQILEDMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            L  STAFHPQTDGQSER IQILEDMLRAC LDF GSW D L L EF+YNNSYQ+SI
Sbjct: 731  LNFSTAFHPQTDGQSERTIQILEDMLRACILDFGGSWEDHLILAEFAYNNSYQSSI 786


>ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508708318|gb|EOY00215.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1537

 Score =  379 bits (972), Expect = e-102
 Identities = 183/330 (55%), Positives = 241/330 (73%), Gaps = 4/330 (1%)
 Frame = -1

Query: 980  LQYHPGKANVVADALSRKSGGTLASL--GFYTWKLYEAVEGLSIEEKETTVYACNLIAQP 807
            + YHP KANVVADALSRKS  +LA+L   +++  L     G+ +   E      + + +P
Sbjct: 984  IDYHPRKANVVADALSRKSSSSLATLRSSYFSMLLEMKSLGIQLNNGEDGTLLASFVVRP 1043

Query: 806  VLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVP--GVLRRKVMD 633
             L+ +I + QK+ D L +  +KL DG   + + +  DG L    R+ VP    LRR +++
Sbjct: 1044 SLLNQIRELQKSDDWLKQEVQKLQDGKASE-FRLSDDGTLMLRDRICVPKDDQLRRAILE 1102

Query: 632  EAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQPL 453
            EAH S Y++HPG +KMY+ ++  +WW GM++D+AE+V++CLTCQQ+K EH++PSG LQPL
Sbjct: 1103 EAHYSAYALHPGSTKMYRTIKESYWWPGMERDIAEFVAKCLTCQQIKAEHQKPSGTLQPL 1162

Query: 452  PVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYVKE 273
             + EWKWEH+TMDFV GLPRT SG+D +WVIVDRLTKSAHFL I  T ++  L++LY+ E
Sbjct: 1163 SIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDE 1222

Query: 272  IVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILEDML 93
            IVRLHGVP+SIVSDRD RFTS+FW  FQ+A+ T L  STAFHPQTDGQSER IQ LEDML
Sbjct: 1223 IVRLHGVPVSIVSDRDLRFTSRFWPKFQEALGTKLRFSTAFHPQTDGQSERTIQTLEDML 1282

Query: 92   RACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            RAC +DF GSW   LPL+EF+YNNS+Q+SI
Sbjct: 1283 RACVIDFIGSWDRHLPLVEFAYNNSFQSSI 1312


>ref|XP_010111872.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis]
            gi|587945430|gb|EXC31837.1| Transposon Ty3-I Gag-Pol
            polyprotein [Morus notabilis]
          Length = 1088

 Score =  375 bits (963), Expect = e-101
 Identities = 186/329 (56%), Positives = 235/329 (71%), Gaps = 4/329 (1%)
 Frame = -1

Query: 977  QYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSIEEKETTVYAC--NLIAQPV 804
            +YHPGKANVVADALSRKS G L SL F  W     V    ++  E +  AC  N++A P 
Sbjct: 707  EYHPGKANVVADALSRKSHGVLTSLAFEDWNRLATVGSFDLQCYEDSNKACIFNIVATPT 766

Query: 803  LVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVV--PGVLRRKVMDE 630
            L Q + + Q + ++ +E+  +   G   +GW I  +G L   G+LVV     LR  V+ E
Sbjct: 767  LKQLVKQGQWHDEEHSEVWNQFQSGEQIEGWQISPEGFLIRKGKLVVLNDSDLRDAVLYE 826

Query: 629  AHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQPLP 450
            AH+SK+SIH G +KMY DL+RQ+WW GMK+DV  +V++C  C+QVK +H+RPSG LQPLP
Sbjct: 827  AHRSKFSIHLGSTKMYMDLKRQYWWRGMKRDVVNFVAKCSICKQVKADHQRPSGELQPLP 886

Query: 449  VAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYVKEI 270
            + +WKW+H+TMDFVTGLPRT  G D VWV+VDRLTK+AHF+PI+    +  L +LY++ I
Sbjct: 887  IPDWKWDHVTMDFVTGLPRTQEGYDAVWVVVDRLTKTAHFIPIRADYKVPKLCRLYIERI 946

Query: 269  VRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILEDMLR 90
            V LHGVP+SIVSDRD +FTSKFW+  Q A+ T L  STAFHPQTDGQSERVIQILED+LR
Sbjct: 947  VTLHGVPVSIVSDRDAQFTSKFWKGLQNALGTELRFSTAFHPQTDGQSERVIQILEDILR 1006

Query: 89   ACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            A  LDF+G W   LP  EF+YNNSYQASI
Sbjct: 1007 AYVLDFEGRWGKYLPNAEFAYNNSYQASI 1035


>ref|XP_012567311.1| PREDICTED: uncharacterized protein LOC105851235 [Cicer arietinum]
          Length = 1114

 Score =  374 bits (960), Expect = e-101
 Identities = 185/331 (55%), Positives = 240/331 (72%), Gaps = 7/331 (2%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVE-----GLSIEEKETTVYACNLIAQ 810
            YHPGKANVVADALSRKS G+LA +      + +  +     G+ +E   + ++  ++  +
Sbjct: 579  YHPGKANVVADALSRKSMGSLAHIAEVNRPIVKEFQKVVESGIQLELGHSRLFLAHVQIR 638

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPGV--LRRKVM 636
              +V +I + Q     L  +   + +G   D +++ SDG LR   RL VP V  LRRK++
Sbjct: 639  STIVDDIKEAQSQDPYLVNMVNNVQNGKISD-FSVDSDGVLRLKARLCVPNVGGLRRKIL 697

Query: 635  DEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQP 456
            +EAH S Y+IHPG +KMY+DLR  +WW GMK+DVA++VSRCL CQQVK EH++P+GLLQP
Sbjct: 698  EEAHHSSYTIHPGSNKMYQDLRELYWWEGMKRDVADFVSRCLVCQQVKAEHQKPAGLLQP 757

Query: 455  LPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYVK 276
            + + EWKWE I MDFVTGLPRT  G D+VWVI+DRLTKSAHFLP+K T T    ++LY+ 
Sbjct: 758  VEIPEWKWEGIAMDFVTGLPRTQKGYDSVWVIIDRLTKSAHFLPVKTTYTASQYAKLYLD 817

Query: 275  EIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILEDM 96
            +IV LHGVP+SI+SDR  +FT++FW+SFQ ++ T L LSTAFHPQTDGQSER IQILEDM
Sbjct: 818  KIVSLHGVPVSIISDRGAQFTAQFWKSFQTSLGTRLKLSTAFHPQTDGQSERTIQILEDM 877

Query: 95   LRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
             RAC LD  GSW   LPL+EF+YNNSYQ+SI
Sbjct: 878  FRACVLDLGGSWDQHLPLMEFAYNNSYQSSI 908


>emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera]
          Length = 1387

 Score =  374 bits (960), Expect = e-101
 Identities = 185/335 (55%), Positives = 249/335 (74%), Gaps = 6/335 (1%)
 Frame = -1

Query: 989  EFNLQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEG----LSIEEKETTVYACN 822
            +F L YHPGKANVVADALSRKS G L+SL    ++++  +E     L +E     +Y+ +
Sbjct: 857  DFALHYHPGKANVVADALSRKSVGQLSSLELREFEMHTVIEDFELCLGLEGHGPCLYSIS 916

Query: 821  LIAQPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPG--VLR 648
              A+P ++Q IV+ Q + + L ++  +LV G   + W++  DG +R+ GRL VP    LR
Sbjct: 917  --ARPXVIQRIVEAQVHDEFLEKVKTQLVAGEIDENWSMYEDGSVRFKGRLCVPKDVELR 974

Query: 647  RKVMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSG 468
             +++ +AH++KY+IHPG +K+           GMKKD+A++V+ C  CQQVK EH+RP+G
Sbjct: 975  NELLADAHRAKYTIHPGNTKI-----------GMKKDIAQFVANCQICQQVKAEHQRPAG 1023

Query: 467  LLQPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQ 288
            LLQPLP+ EWKW++ITMDFV GLPRT S ++ VW+IVDRLTKS HFL +K  D++ +L++
Sbjct: 1024 LLQPLPIPEWKWDNITMDFVIGLPRTRSKKNGVWMIVDRLTKSTHFLAMKTIDSMNSLAK 1083

Query: 287  LYVKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQI 108
            LY++EIVRLHG+P+SIVSDRDP+FTS+FW+S Q+ + T L  STAFHPQTDGQSERVIQI
Sbjct: 1084 LYIQEIVRLHGIPVSIVSDRDPKFTSQFWQSLQRTLGTQLNFSTAFHPQTDGQSERVIQI 1143

Query: 107  LEDMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            LEDMLRAC LDF G+W D LPL EF+YNNSYQ+SI
Sbjct: 1144 LEDMLRACVLDFGGNWADYLPLAEFAYNNSYQSSI 1178


>emb|CAA73042.1| polyprotein [Ananas comosus]
          Length = 871

 Score =  373 bits (958), Expect = e-100
 Identities = 188/333 (56%), Positives = 237/333 (71%), Gaps = 4/333 (1%)
 Frame = -1

Query: 989  EFNLQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSIE--EKETTVYACNLI 816
            +  + YHPGKANVVADALSRKS   LA       +L E ++ L +E    +T +    L+
Sbjct: 408  DLTILYHPGKANVVADALSRKSMENLAMHVVTQPRLIEQMKRLELEIVTPDTPMRLMTLV 467

Query: 815  AQPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPGV--LRRK 642
             QP L+  I + Q +  +L +I  K+VDG   D +T+  DG +R+ GR+ VP    ++  
Sbjct: 468  VQPTLLDRIKEKQASDVELQKIKGKMVDGCTGD-FTLDGDGLMRFRGRICVPADSGIKED 526

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            ++ EAH++ Y+IHPG +KMYKDL+  +WW G+KKDV E+V++CLTCQQVK EHR P+G L
Sbjct: 527  ILQEAHRAPYAIHPGGTKMYKDLKLLYWWPGIKKDVGEFVAKCLTCQQVKAEHRVPAGKL 586

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            Q LP+  WKWE ITMDFVTGLPR+ +G D +WVIVDRLTKSAHF+PI  T T   L+Q+Y
Sbjct: 587  QSLPIPVWKWEKITMDFVTGLPRSQAGHDAIWVIVDRLTKSAHFIPIHTTWTGERLAQVY 646

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            + EIVRLHGVP SIVSDRD RF S FW S Q A+ T L  STAFHPQ+DGQSER IQ LE
Sbjct: 647  LDEIVRLHGVPTSIVSDRDTRFVSHFWRSLQDALGTRLDFSTAFHPQSDGQSERTIQTLE 706

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLRAC +DF+G W   LP+ EF+YNNSYQASI
Sbjct: 707  DMLRACVIDFQGGWSQHLPMAEFAYNNSYQASI 739


>ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508779254|gb|EOY26510.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1290

 Score =  372 bits (954), Expect = e-100
 Identities = 179/330 (54%), Positives = 236/330 (71%), Gaps = 4/330 (1%)
 Frame = -1

Query: 980  LQYHPGKANVVADALSRKSGGTLASL--GFYTWKLYEAVEGLSIEEKETTVYACNLIAQP 807
            + YHPGKANVV DALSRKS  +LA+L   ++   L     G+ +   E      + + +P
Sbjct: 773  IDYHPGKANVVTDALSRKSSSSLATLRSSYFPMLLEMKSLGIQLNNGEDGTLLASFVVRP 832

Query: 806  VLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVP--GVLRRKVMD 633
             L+ +I + QK  D L +  +KL DG   + + +  DG L    R+ VP    LRR +++
Sbjct: 833  SLLNQIRELQKFDDWLKQEVQKLQDGEASE-FRLSDDGTLMLRDRICVPKDDQLRRAILE 891

Query: 632  EAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQPL 453
            EAH S Y++HPG +KMY+ ++  +WW GMK+D+AE+V++CL CQQ+K EH++ SG LQPL
Sbjct: 892  EAHSSAYALHPGSTKMYQTIKESYWWPGMKRDIAEFVAKCLICQQIKAEHQKSSGTLQPL 951

Query: 452  PVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYVKE 273
            P+ EWKWEH+TMDFV GLPRT SG+D +WVI+ RLTKSAHFL I  T ++  L++LY+ E
Sbjct: 952  PIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIMGRLTKSAHFLAIHSTYSIERLARLYIDE 1011

Query: 272  IVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILEDML 93
            +VRLHGVP+SIVSDRDPRFTS+FW  FQ+A+ T L  STAFHPQ DGQSER IQ LEDML
Sbjct: 1012 VVRLHGVPVSIVSDRDPRFTSRFWPKFQEALGTKLRFSTAFHPQIDGQSERTIQTLEDML 1071

Query: 92   RACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            RAC +DF  SW   LPL+EF+YNNS+Q+SI
Sbjct: 1072 RACVIDFIRSWDRHLPLVEFAYNNSFQSSI 1101


>ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobroma cacao]
            gi|508727367|gb|EOY19264.1| Uncharacterized protein
            TCM_044274 [Theobroma cacao]
          Length = 860

 Score =  367 bits (943), Expect = 6e-99
 Identities = 182/333 (54%), Positives = 236/333 (70%), Gaps = 9/333 (2%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVE-----GLSIEEKETTVYACNLIAQ 810
            YHPGKANVVADALSRKS G+LA +      L   +      G+ +E  ET+    +   +
Sbjct: 353  YHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLGDIGVRLEVAETSALLAHFRVR 412

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDG--WTIQSDGGLRYFGRLVVPGV--LRRK 642
            P+L+ +I + Q   +    + + L D  G  G  +T  +DG LRY  RL VP    LRR+
Sbjct: 413  PILMDKIKEAQSKDEF---VIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRE 469

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            +++EAH + Y +HPG +KMY+DL+  +WW G+K+DVAE+VS+CL CQQVK EH++P+GLL
Sbjct: 470  ILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLL 529

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            QPLPV EWKWEHI MDFVTGLPRT+ G D++W++VDRLTKSAHFLP+K T      +++Y
Sbjct: 530  QPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVY 589

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            V EIVRLHG+P+SIVSDR  +FTS+FW   Q+A+ T L  STAFHPQTDGQSER I+ LE
Sbjct: 590  VDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIKTLE 649

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLRAC +D    W   LPL+EF+YNNS+Q SI
Sbjct: 650  DMLRACVIDLGVKWEQYLPLVEFAYNNSFQTSI 682


>ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508774222|gb|EOY21478.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 878

 Score =  367 bits (941), Expect = 1e-98
 Identities = 183/333 (54%), Positives = 234/333 (70%), Gaps = 9/333 (2%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVE-----GLSIEEKETTVYACNLIAQ 810
            YHPGKANVVADALSRKS G+LA +      L   +      G+ +E  ET     +   +
Sbjct: 519  YHPGKANVVADALSRKSMGSLAHIFIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 578

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDG--WTIQSDGGLRYFGRLVVPGV--LRRK 642
            P+L+  I + Q   +    + + L D  G  G  +T  +DG LRY  RL VP    LRR+
Sbjct: 579  PILMDRIKEAQSKDEF---VIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRE 635

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            +++EAH + Y +HPG +KMY+DL+  +WW G+K+DVAE+VS+CL CQQVK EH++P+GLL
Sbjct: 636  ILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLL 695

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            QPLPV EWKWEHI MDFVTGLPRT+ G D++W++VDRLTKSAHFLP+K T      +++Y
Sbjct: 696  QPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVY 755

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            V EIVRLHG+P+SIVSDR  +FTS+FW   Q+A+ T L  STAFHPQTDGQSER IQ LE
Sbjct: 756  VDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLE 815

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLRAC +D    W   LPL+EF+YNNS+Q SI
Sbjct: 816  DMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSI 848


>ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417177 [Eucalyptus grandis]
          Length = 1753

 Score =  366 bits (940), Expect = 1e-98
 Identities = 182/330 (55%), Positives = 239/330 (72%), Gaps = 6/330 (1%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSIEEKETTVYACNLIA----QP 807
            YHPGKAN VADALSRKS  ++A +    W L E         K    +  NL+A    +P
Sbjct: 963  YHPGKANKVADALSRKS--SVAQMVLKEWGLIERARDSDF--KFEVGHLSNLVATLRIEP 1018

Query: 806  VLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPG--VLRRKVMD 633
             +  +I   Q+   D+ +I ++  +    D + I  DG LR+ GRLVVP    LR +++ 
Sbjct: 1019 EVQVKIRTLQQMDSDVQKILQEDAEKRKAD-FQISEDGTLRFQGRLVVPDDVELREEILS 1077

Query: 632  EAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQPL 453
            EAH+S YSIHPG +KMY++LR+ +WW GMK D+A++V++CLTCQQVK +H +P GLL+PL
Sbjct: 1078 EAHRSNYSIHPGSTKMYQNLRQHYWWCGMKADIAKHVAKCLTCQQVKAQHCKPGGLLRPL 1137

Query: 452  PVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYVKE 273
             + EWKWEHITMDFVTGLPR+  G D++WV+VDRLTKSAHF+ +++  +L  L+ LYV++
Sbjct: 1138 EIPEWKWEHITMDFVTGLPRSQRGNDSIWVVVDRLTKSAHFIAVRRDLSLDRLADLYVRQ 1197

Query: 272  IVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILEDML 93
            +VR+HGVP++I SDRDPRFT+ FW+S Q A+ T L  STA+HPQTDGQSER IQ LEDML
Sbjct: 1198 VVRMHGVPVTITSDRDPRFTAAFWKSLQSALGTKLQYSTAYHPQTDGQSERTIQTLEDML 1257

Query: 92   RACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            RAC LDFKGSW +QL L+EF+YNNSYQ SI
Sbjct: 1258 RACVLDFKGSWEEQLHLVEFAYNNSYQQSI 1287


>ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508779195|gb|EOY26451.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 679

 Score =  366 bits (939), Expect = 2e-98
 Identities = 182/333 (54%), Positives = 234/333 (70%), Gaps = 9/333 (2%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVE-----GLSIEEKETTVYACNLIAQ 810
            YHPGKANVVADALSRKS G+LA +      L   +      G+ +E  ET     +   +
Sbjct: 142  YHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 201

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDG--WTIQSDGGLRYFGRLVVPGV--LRRK 642
            P+L+  I + Q   +    + + L D  G  G  +T  +DG LRY  RL VP    LRR+
Sbjct: 202  PILMDRIKEAQSKDEF---VIKALEDPRGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRE 258

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            +++EAH + Y +HPG +KMY+DL+  +WW G+K+DVAE+VS+CL CQQVK EH++P+GLL
Sbjct: 259  ILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLL 318

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            QPLPV EWKWEHI MDFVTGLPRT+ G D++W++VD+LTKSAHFLP+K T      +++Y
Sbjct: 319  QPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDQLTKSAHFLPVKTTYGAAHYARVY 378

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            V EIVRLHG+P+SIVSDR  +FTS+FW   Q+A+ T L  STAFHPQTDGQSER IQ LE
Sbjct: 379  VDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLE 438

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLRAC +D    W   LPL+EF+YNNS+Q SI
Sbjct: 439  DMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSI 471


>ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634770 [Jatropha curcas]
          Length = 1963

 Score =  365 bits (938), Expect = 2e-98
 Identities = 188/332 (56%), Positives = 235/332 (70%), Gaps = 6/332 (1%)
 Frame = -1

Query: 980  LQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSIEEKETTV----YACNLIA 813
            + Y PGKANVVADALSRK    +A+L      L   +   SI  K  T+       NL  
Sbjct: 212  IDYQPGKANVVADALSRK---IIANLRTTALSLVHDLR--SINAKFETISDNWVLANLQV 266

Query: 812  QPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPGVL--RRKV 639
            +P+L+ +I       DD  +I  +  DG  PD +++  DG L +  RL VP  L  R  +
Sbjct: 267  KPLLIDQIRTAILTDDDYQKILSEAQDGKRPD-FSVSRDGLLLFRDRLYVPSDLDLRHLI 325

Query: 638  MDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQ 459
            + EAH S +++HPG +KMY+DL R +WW+GMKKD+AE+V++CLTCQQVK EH+ P+GL  
Sbjct: 326  LKEAHDSPFAMHPGATKMYRDLTRNYWWTGMKKDIAEFVAKCLTCQQVKAEHQVPAGLHH 385

Query: 458  PLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYV 279
            PL + EWKWE +TMDF+ GLP T    D VWVIVDRLTKSAHFLPI+   +L  L+++Y+
Sbjct: 386  PLQIPEWKWERVTMDFLMGLPLTQKKHDAVWVIVDRLTKSAHFLPIRSNYSLEKLAEMYI 445

Query: 278  KEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILED 99
             EIVRLHGVP+SIVSDRDPRFTS+FW S QKA+ T L  STAFHPQTDGQSER+IQILED
Sbjct: 446  GEIVRLHGVPVSIVSDRDPRFTSRFWASLQKALGTRLNFSTAFHPQTDGQSERIIQILED 505

Query: 98   MLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            MLRAC L+F+GSW + LPLIEF+YNNSYQ SI
Sbjct: 506  MLRACVLEFEGSWDNYLPLIEFAYNNSYQTSI 537


>ref|XP_007099710.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
            cacao] gi|508728428|gb|EOY20325.1| Retrotransposon
            protein, Ty3-gypsy subclass, putative [Theobroma cacao]
          Length = 460

 Score =  365 bits (938), Expect = 2e-98
 Identities = 182/333 (54%), Positives = 233/333 (69%), Gaps = 9/333 (2%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVE-----GLSIEEKETTVYACNLIAQ 810
            YHPGKANVVADALSRKS G+LA +      L   +      G+ +E  ET     +   +
Sbjct: 105  YHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 164

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDG--WTIQSDGGLRYFGRLVVPGV--LRRK 642
            P+L+  I + Q   +    + + L D  G  G  +T  +DG LRY  RL VP    LRR+
Sbjct: 165  PILMDRIKEAQSKDEF---VIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRE 221

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            +++EAH + Y +HPG +KMY+DL+  +WW G+K+DVAE+VS+CL CQQVK EH++P+GLL
Sbjct: 222  ILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLL 281

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            QPLPV EWKWEHI MDFVTGLPRT+ G D++W++VDRLTKSAHFLP+K T      +++Y
Sbjct: 282  QPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVY 341

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            V EIVRLHG+P+SIVSDR  +FTS+FW   Q+A+ T L   TAFHPQTDGQSER IQ LE
Sbjct: 342  VDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFITAFHPQTDGQSERTIQTLE 401

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLRAC +D    W   LPL+EF+YNNS+Q SI
Sbjct: 402  DMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSI 434


>ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508716781|gb|EOY08678.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 666

 Score =  365 bits (938), Expect = 2e-98
 Identities = 182/333 (54%), Positives = 233/333 (69%), Gaps = 9/333 (2%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVE-----GLSIEEKETTVYACNLIAQ 810
            YHPGKANVVADALSRKS G+LA +      L   +      G+ +E  ET     +   +
Sbjct: 236  YHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 295

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDG--WTIQSDGGLRYFGRLVVPGV--LRRK 642
            P+L+ +I + Q   +    + + L D  G  G  +T  +DG LRY  RL VP    LRRK
Sbjct: 296  PILMDKIKEAQSKDEF---VIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRK 352

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            +++EAH + Y +HPG +KMY+DL+  +WW G+K+DVAE+VS+CL CQQVK EH++P+GLL
Sbjct: 353  ILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLL 412

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            QPLPV EWKWEHI MDFVTGLPRT+ G D++W++VDRLTKSAHFL +K T      +++Y
Sbjct: 413  QPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLSVKTTYGAAQYARVY 472

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            V EIVRLHG+P+SIVSDR  +FTS+FW   Q+A+ T L  ST FHPQTDGQSER IQ LE
Sbjct: 473  VDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTTFHPQTDGQSERTIQTLE 532

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLRAC +D    W   LPL+EF+YNNS+Q SI
Sbjct: 533  DMLRACVIDLGVKWEQYLPLVEFAYNNSFQTSI 565


>ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103723366
            [Phoenix dactylifera]
          Length = 1246

 Score =  364 bits (934), Expect = 7e-98
 Identities = 183/333 (54%), Positives = 233/333 (69%), Gaps = 4/333 (1%)
 Frame = -1

Query: 989  EFNLQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSIE--EKETTVYACNLI 816
            + +++YHP KANVVADALSRKS     SL     ++ +  E + I+   K+      +L+
Sbjct: 751  DLSIKYHPEKANVVADALSRKSAVGSISLLTTQKQILKDFEMMQIDVITKDAGSMLTSLL 810

Query: 815  AQPVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVP--GVLRRK 642
             QP L++ I   Q+    L  +   +  G  P+   I  DG LR+  RL VP    L+R+
Sbjct: 811  VQPTLIERIKTAQQTDAHLCRLRNDVERGLRPE-LRIHPDGTLRFGCRLCVPKDADLKRE 869

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            +++EAH+S++SIHPG +KMY DLR  FWW+GMK+++A +V+RCL CQQVK EH+RP+GLL
Sbjct: 870  ILEEAHQSRFSIHPGSTKMYTDLREHFWWNGMKREIAGFVARCLVCQQVKAEHQRPAGLL 929

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            +PL + EWKWEHITMDFV GLPRT    D VWVIVDRLTKSAHFLP +   +L  L+Q Y
Sbjct: 930  EPLEIPEWKWEHITMDFVIGLPRTVRRNDAVWVIVDRLTKSAHFLPFRVGTSLDKLAQRY 989

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            + +IVRLHG P+SIVSDRDPRF S FW SFQ AM T L LSTA+HPQTDGQSER IQ LE
Sbjct: 990  IDDIVRLHGAPVSIVSDRDPRFVSGFWRSFQTAMGTDLRLSTAYHPQTDGQSERTIQTLE 1049

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLR C +D  G W D + L+EF+YNNSY +SI
Sbjct: 1050 DMLRTCTVDLGGCWDDHISLVEFAYNNSYHSSI 1082


>ref|XP_007028157.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508716762|gb|EOY08659.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 937

 Score =  364 bits (934), Expect = 7e-98
 Identities = 182/333 (54%), Positives = 233/333 (69%), Gaps = 9/333 (2%)
 Frame = -1

Query: 974  YHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVE-----GLSIEEKETTVYACNLIAQ 810
            +HPGKANVVADALSRKS G+LA +      L + +      G+ +E  ET     +   +
Sbjct: 372  HHPGKANVVADALSRKSMGSLAHISIGRRSLVKEIHSLGDIGVRLEVAETNALLAHFRVR 431

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDG--WTIQSDGGLRYFGRLVVPGV--LRRK 642
            P+L+  I + Q   +    + + L D  G  G  +T  +DG LRY  RL VP    LRR+
Sbjct: 432  PILMDRIKEAQSKDEF---VIKALEDPRGKKGKMFTKGTDGVLRYGTRLYVPDSDGLRRE 488

Query: 641  VMDEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLL 462
            +++EAH + Y IHPG +KMY+DL+  +WW G+K+DVAE+VS+CL CQQVK EH++P+GLL
Sbjct: 489  ILEEAHMAAYVIHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLL 548

Query: 461  QPLPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLY 282
            QPLPV EWKWEHI MDFVTGLPRT  G D++W++VDRLTKSAHFLP+K T      +++Y
Sbjct: 549  QPLPVPEWKWEHIAMDFVTGLPRTNGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVY 608

Query: 281  VKEIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILE 102
            V EIVRLHG+P+SIVSDR  +FTS+FW   Q+A+ T L  STAFHPQTDGQSE  IQ LE
Sbjct: 609  VDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSEWTIQTLE 668

Query: 101  DMLRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            DMLRAC +D    W   LPL+EF+YNNS+Q SI
Sbjct: 669  DMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSI 701


>emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]
          Length = 1313

 Score =  363 bits (932), Expect = 1e-97
 Identities = 180/331 (54%), Positives = 232/331 (70%), Gaps = 2/331 (0%)
 Frame = -1

Query: 989  EFNLQYHPGKANVVADALSRKSGGTLASLGFYTWKLYEAVEGLSIEEKETTVYACNLIAQ 810
            +F L YHPGK N VADALSRKS                                      
Sbjct: 817  DFALHYHPGKXNXVADALSRKS-------------------------------------- 838

Query: 809  PVLVQEIVKTQKNSDDLNEICRKLVDGTGPDGWTIQSDGGLRYFGRLVVPG--VLRRKVM 636
                  I + Q + + L ++   LV G   + W++  DG +R+ GRL VP    LR +++
Sbjct: 839  -----RIXEAQVHDEFLEKVKAXLVAGEIDENWSMYEDGSVRFKGRLCVPKDVELRNELL 893

Query: 635  DEAHKSKYSIHPGESKMYKDLRRQFWWSGMKKDVAEYVSRCLTCQQVKIEHRRPSGLLQP 456
             +AH++KY+IHPG +KMY+DL+RQFWWSGMK+D+A++V+    CQQVK EH+RP+GLLQP
Sbjct: 894  ADAHRAKYTIHPGNTKMYQDLKRQFWWSGMKRDIAQFVANFQICQQVKAEHQRPAGLLQP 953

Query: 455  LPVAEWKWEHITMDFVTGLPRTTSGRDTVWVIVDRLTKSAHFLPIKKTDTLGTLSQLYVK 276
            LP+ EWKW++ITMDFV GLPRT S ++ VWVIVD LTKSAHFL +K TD++ +L++LY++
Sbjct: 954  LPIPEWKWDNITMDFVIGLPRTRSKKNGVWVIVDCLTKSAHFLAMKTTDSMNSLAKLYIQ 1013

Query: 275  EIVRLHGVPLSIVSDRDPRFTSKFWESFQKAMDTTLCLSTAFHPQTDGQSERVIQILEDM 96
            EIVRLHG+ +SIVSDRDP+FTS+FW+S Q+A+ T L  +TAFHPQTDGQSERVIQILEDM
Sbjct: 1014 EIVRLHGILVSIVSDRDPKFTSQFWQSLQRALGTQLNFNTAFHPQTDGQSERVIQILEDM 1073

Query: 95   LRACALDFKGSWIDQLPLIEFSYNNSYQASI 3
            LRAC LDF G+W D LPL EF+YNNSYQ+SI
Sbjct: 1074 LRACVLDFGGNWADYLPLAEFAYNNSYQSSI 1104

