BLASTX nr result

ID: Rehmannia27_contig00032989 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00032989
         (1138 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prun...   359   e-121
ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634...   359   e-117
ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   342   e-117
emb|CAA73042.1| polyprotein [Ananas comosus]                          337   e-116
ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prun...   355   e-115
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...   346   e-115
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]   348   e-114
ref|XP_015075513.1| PREDICTED: uncharacterized protein LOC107019...   350   e-114
ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [The...   341   e-113
ref|XP_007219896.1| hypothetical protein PRUPE_ppb014768mg [Prun...   350   e-112
gb|KYP78784.1| Retrotransposable element Tf2, partial [Cajanus c...   343   e-112
ref|XP_012829796.1| PREDICTED: uncharacterized protein LOC105950...   365   e-112
ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [The...   331   e-112
ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobrom...   335   e-112
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...   338   e-112
ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [The...   335   e-111
gb|AAO19383.1| putative polyprotein [Oryza sativa Japonica Group...   335   e-111
gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]                 322   e-111
gb|AAV31278.1| putative polyprotein [Oryza sativa Japonica Group]     333   e-111
gb|ABG22001.1| retrotransposon protein, putative, Ty3-gypsy subc...   333   e-111

>ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica]
            gi|462395665|gb|EMJ01464.1| hypothetical protein
            PRUPE_ppa015000mg [Prunus persica]
          Length = 1493

 Score =  359 bits (922), Expect(2) = e-121
 Identities = 172/274 (62%), Positives = 205/274 (74%)
 Frame = +2

Query: 314  ILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXS 493
            ++ERI VAQL DP L +IR  V  G R DY I  DGAL  GT+L VP N          +
Sbjct: 1018 LVERIIVAQLGDPTLCRIRGEVESGSRKDYAIRGDGALVTGTRLHVPKNDYLKREILEEA 1077

Query: 494  HNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMI 673
            H + Y++HPGSTKM   L +++SW  MK D+A YV RCL CQQVKAE Q+PSGL+QPL I
Sbjct: 1078 HCSTYTMHPGSTKMYRTLREYYSWPHMKGDIAKYVSRCLICQQVKAERQKPSGLMQPLPI 1137

Query: 674  PQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIV 853
            P+WKWER+ MDFV  LP+T  G + IWVIVDRLTKS H LP+K+TY++ K A+L++ EIV
Sbjct: 1138 PEWKWERITMDFVFKLPRTSKGHDGIWVIVDRLTKSTHFLPIKETYSLTKLAKLFVDEIV 1197

Query: 854  RLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRA 1033
            RLHG P+SIVSDRD +FTS FWK L  AMGT+L FSTAFHPQTDGQSERTIQTLEDMLR+
Sbjct: 1198 RLHGAPVSIVSDRDARFTSRFWKCLQEAMGTRLQFSTAFHPQTDGQSERTIQTLEDMLRS 1257

Query: 1034 CVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            CV+    +WD  L LVEF+YNNSY +SI MAPYE
Sbjct: 1258 CVLQMKDSWDTHLALVEFAYNNSYHASIKMAPYE 1291



 Score =  104 bits (260), Expect(2) = e-121
 Identities = 54/100 (54%), Positives = 70/100 (70%), Gaps = 1/100 (1%)
 Frame = +3

Query: 3    RHYLYEAKYEIYMDHKSLK-FFTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
            RHYLY    +I+ DHKSLK FFTQ+ELNMRQR  LEL+KDYDC I Y+ G+ANVVADALS
Sbjct: 913  RHYLYGETCQIFTDHKSLKYFFTQRELNMRQRRWLELIKDYDCTIEYYPGRANVVADALS 972

Query: 180  RKGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTL 299
            RK  G++  L T   PLL +++   +E+   ++   L +L
Sbjct: 973  RKTTGSLTHLRTTYLPLLVELRKDGVELEMTQQGGILASL 1012


>ref|XP_012073065.1| PREDICTED: uncharacterized protein LOC105634770 [Jatropha curcas]
          Length = 1963

 Score =  359 bits (922), Expect(2) = e-117
 Identities = 163/274 (59%), Positives = 215/274 (78%)
 Frame = +2

Query: 314  ILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXS 493
            ++++I+ A L D    KI S   +GKR D+ ++ DG L    +L VP++          +
Sbjct: 270  LIDQIRTAILTDDDYQKILSEAQDGKRPDFSVSRDGLLLFRDRLYVPSDLDLRHLILKEA 329

Query: 494  HNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMI 673
            H++P+++HPG+TKM  DL++++ W GMK D+A +V +CLTCQQVKAEHQ P+GL  PL I
Sbjct: 330  HDSPFAMHPGATKMYRDLTRNYWWTGMKKDIAEFVAKCLTCQQVKAEHQVPAGLHHPLQI 389

Query: 674  PQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIV 853
            P+WKWERV MDF++GLP T+   +++WVIVDRLTKSAH LP++  Y++EK AE+YI EIV
Sbjct: 390  PEWKWERVTMDFLMGLPLTQKKHDAVWVIVDRLTKSAHFLPIRSNYSLEKLAEMYIGEIV 449

Query: 854  RLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRA 1033
            RLHGVP+SIVSDRDP+FTS FW SL +A+GT+L+FSTAFHPQTDGQSER IQ LEDMLRA
Sbjct: 450  RLHGVPVSIVSDRDPRFTSRFWASLQKALGTRLNFSTAFHPQTDGQSERIIQILEDMLRA 509

Query: 1034 CVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            CV++F G+WD +LPL+EF+YNNSYQ+SIGM PYE
Sbjct: 510  CVLEFEGSWDNYLPLIEFAYNNSYQTSIGMPPYE 543



 Score = 93.2 bits (230), Expect(2) = e-117
 Identities = 52/103 (50%), Positives = 68/103 (66%), Gaps = 1/103 (0%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLKFF-TQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY  K  I+ DHKSLK+  TQ+ELN+RQR  LEL+KDYDC+I+Y  GKANVVADALS
Sbjct: 168 RHYLYGEKCYIFTDHKSLKYLGTQRELNLRQRRWLELIKDYDCIIDYQPGKANVVADALS 227

Query: 180 RKGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLTLQ 308
           RK    I  L T    L+ D++S+  +  +   +  L  L ++
Sbjct: 228 RK---IIANLRTTALSLVHDLRSINAKFETISDNWVLANLQVK 267


>ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103723366
            [Phoenix dactylifera]
          Length = 1246

 Score =  342 bits (878), Expect(2) = e-117
 Identities = 162/275 (58%), Positives = 200/275 (72%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            +++ERIK AQ  D  L ++R+ V  G R +  I  DG LR G +L VP +          
Sbjct: 814  TLIERIKTAQQTDAHLCRLRNDVERGLRPELRIHPDGTLRFGCRLCVPKDADLKREILEE 873

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H + +SIHPGSTKM  DL +HF W GMK ++A +V RCL CQQVKAEHQRP+GLL+PL 
Sbjct: 874  AHQSRFSIHPGSTKMYTDLREHFWWNGMKREIAGFVARCLVCQQVKAEHQRPAGLLEPLE 933

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP+WKWE + MDFV+GLP+T    +++WVIVDRLTKSAH LP +   +++K A+ YI +I
Sbjct: 934  IPEWKWEHITMDFVIGLPRTVRRNDAVWVIVDRLTKSAHFLPFRVGTSLDKLAQRYIDDI 993

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            VRLHG P+SIVSDRDP+F S FW+S   AMGT L  STA+HPQTDGQSERTIQTLEDMLR
Sbjct: 994  VRLHGAPVSIVSDRDPRFVSGFWRSFQTAMGTDLRLSTAYHPQTDGQSERTIQTLEDMLR 1053

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
             C +D  G WD  + LVEF+YNNSY SSI MAPYE
Sbjct: 1054 TCTVDLGGCWDDHISLVEFAYNNSYHSSIQMAPYE 1088



 Score =  109 bits (273), Expect(2) = e-117
 Identities = 56/102 (54%), Positives = 74/102 (72%), Gaps = 1/102 (0%)
 Frame = +3

Query: 6    HYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALSR 182
            HYLY    E++ DHKSLK+ FTQKELNMRQR  LEL+KDYD  I YH  KANVVADALSR
Sbjct: 711  HYLYGEPCEVFTDHKSLKYIFTQKELNMRQRRWLELLKDYDLSIKYHPEKANVVADALSR 770

Query: 183  KGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLTLQ 308
            K      +L T QK +LKD + +++++++++  S LT+L +Q
Sbjct: 771  KSAVGSISLLTTQKQILKDFEMMQIDVITKDAGSMLTSLLVQ 812


>emb|CAA73042.1| polyprotein [Ananas comosus]
          Length = 871

 Score =  337 bits (864), Expect(2) = e-116
 Identities = 158/275 (57%), Positives = 204/275 (74%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            ++L+RIK  Q +D +L KI+  +V+G   D+ +  DG +R   ++ VP +          
Sbjct: 471  TLLDRIKEKQASDVELQKIKGKMVDGCTGDFTLDGDGLMRFRGRICVPADSGIKEDILQE 530

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H APY+IHPG TKM  DL   + W G+K DV  +V +CLTCQQVKAEH+ P+G LQ L 
Sbjct: 531  AHRAPYAIHPGGTKMYKDLKLLYWWPGIKKDVGEFVAKCLTCQQVKAEHRVPAGKLQSLP 590

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP WKWE++ MDFV GLP+++ G ++IWVIVDRLTKSAH +P+  T+  E+ A++Y+ EI
Sbjct: 591  IPVWKWEKITMDFVTGLPRSQAGHDAIWVIVDRLTKSAHFIPIHTTWTGERLAQVYLDEI 650

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            VRLHGVP SIVSDRD +F S FW+SL  A+GT+L FSTAFHPQ+DGQSERTIQTLEDMLR
Sbjct: 651  VRLHGVPTSIVSDRDTRFVSHFWRSLQDALGTRLDFSTAFHPQSDGQSERTIQTLEDMLR 710

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            ACVIDF G W + LP+ EF+YNNSYQ+SI MAP+E
Sbjct: 711  ACVIDFQGGWSQHLPMAEFAYNNSYQASIKMAPFE 745



 Score =  111 bits (278), Expect(2) = e-116
 Identities = 61/104 (58%), Positives = 76/104 (73%), Gaps = 2/104 (1%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY  + E+Y DHKSLK+ FTQKELN+RQR  LEL+KDYD  I YH GKANVVADALS
Sbjct: 367 RHYLYGERCEVYTDHKSLKYLFTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALS 426

Query: 180 RKGRGTIYALHTIQKP-LLKDMQSLELEIVSQEKSSYLTTLTLQ 308
           RK    + A+H + +P L++ M+ LELEIV+ +    L TL +Q
Sbjct: 427 RKSMENL-AMHVVTQPRLIEQMKRLELEIVTPDTPMRLMTLVVQ 469


>ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prunus persica]
            gi|462417788|gb|EMJ22433.1| hypothetical protein
            PRUPE_ppb019121mg [Prunus persica]
          Length = 552

 Score =  355 bits (912), Expect = e-115
 Identities = 170/297 (57%), Positives = 215/297 (72%)
 Frame = +2

Query: 245  ELRVGNCISREVELFDYPDLTTSILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGA 424
            +LRVG  +  +  L     +   ++ERI  AQ  DP +  +R  V  G R D  +  DGA
Sbjct: 55   KLRVGLHVDNQGALLATLHVRPVLVERILAAQSQDPLICTLRVEVANGDRTDCSVRNDGA 114

Query: 425  LRLGTKLAVPNNXXXXXXXXXXSHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQR 604
            L +G +L VPN+          +H + +++HPGSTKM H L +H+ W  MK ++A YV+R
Sbjct: 115  LMVGNRLYVPNDEALKREILEEAHESAFAMHPGSTKMYHTLREHYWWPFMKKEIAEYVRR 174

Query: 605  CLTCQQVKAEHQRPSGLLQPLMIPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSA 784
            CL CQQVKAE Q+PSGLLQPL IP+WKWER+ MDFV  LP+T+   + +WVIVDRLTKSA
Sbjct: 175  CLICQQVKAERQKPSGLLQPLPIPEWKWERITMDFVFKLPRTQSKHDGVWVIVDRLTKSA 234

Query: 785  HLLPVKKTYNMEKYAELYISEIVRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFST 964
            H LPV+  Y++ K A+++I EIVRLHGVP+SIVSDRDP+FTS FW  L+ A GTQL FST
Sbjct: 235  HFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRFWTKLNEAFGTQLQFST 294

Query: 965  AFHPQTDGQSERTIQTLEDMLRACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            AFHPQTDGQSERTIQTLEDMLRAC + F G WD+ LPL+EF+YNNSYQ SIGM+P++
Sbjct: 295  AFHPQTDGQSERTIQTLEDMLRACALQFRGDWDEKLPLMEFAYNNSYQVSIGMSPFD 351



 Score = 70.1 bits (170), Expect = 1e-09
 Identities = 36/72 (50%), Positives = 47/72 (65%)
 Frame = +3

Query: 84  MRQRGSLELVKDYDCVINYHQGKANVVADALSRKGRGTIYALHTIQKPLLKDMQSLELEI 263
           MRQR  LEL+KDYDC I +H G+ANVVADALSRK  G+I  L     PL+ +M+ L + +
Sbjct: 1   MRQRRWLELIKDYDCTIEHHPGRANVVADALSRKSSGSIAYLRGRYLPLMVEMRKLRVGL 60

Query: 264 VSQEKSSYLTTL 299
               + + L TL
Sbjct: 61  HVDNQGALLATL 72


>ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508708318|gb|EOY00215.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1537

 Score =  346 bits (888), Expect(2) = e-115
 Identities = 163/275 (59%), Positives = 208/275 (75%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            S+L +I+  Q +D  L +    + +GK  ++ ++ DG L L  ++ VP +          
Sbjct: 1044 SLLNQIRELQKSDDWLKQEVQKLQDGKASEFRLSDDGTLMLRDRICVPKDDQLRRAILEE 1103

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H + Y++HPGSTKM   + + + W GM+ D+A +V +CLTCQQ+KAEHQ+PSG LQPL 
Sbjct: 1104 AHYSAYALHPGSTKMYRTIKESYWWPGMERDIAEFVAKCLTCQQIKAEHQKPSGTLQPLS 1163

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP+WKWE V MDFV+GLP+T+ G ++IWVIVDRLTKSAH L +  TY++E+ A LYI EI
Sbjct: 1164 IPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDEI 1223

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            VRLHGVP+SIVSDRD +FTS FW     A+GT+L FSTAFHPQTDGQSERTIQTLEDMLR
Sbjct: 1224 VRLHGVPVSIVSDRDLRFTSRFWPKFQEALGTKLRFSTAFHPQTDGQSERTIQTLEDMLR 1283

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            ACVIDF G+WD+ LPLVEF+YNNS+QSSIGMAPYE
Sbjct: 1284 ACVIDFIGSWDRHLPLVEFAYNNSFQSSIGMAPYE 1318



 Score = 98.2 bits (243), Expect(2) = e-115
 Identities = 51/103 (49%), Positives = 71/103 (68%), Gaps = 1/103 (0%)
 Frame = +3

Query: 3    RHYLYEAKYEIYMDHKSLKFF-TQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
            RHYLY  +  I+ DHKSLK+  TQKELN+RQR  LEL+KDYD VI+YH  KANVVADALS
Sbjct: 940  RHYLYGERCRIFYDHKSLKYLLTQKELNLRQRQWLELIKDYDLVIDYHPRKANVVADALS 999

Query: 180  RKGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLTLQ 308
            RK   ++  L +    +L +M+SL +++ + E  + L +  ++
Sbjct: 1000 RKSSSSLATLRSSYFSMLLEMKSLGIQLNNGEDGTLLASFVVR 1042


>emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]
          Length = 1573

 Score =  348 bits (893), Expect(2) = e-114
 Identities = 165/275 (60%), Positives = 207/275 (75%), Gaps = 1/275 (0%)
 Frame = +2

Query: 314  ILERIKVAQLNDPKLVKIRSAVVEGKRDD-YMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            +++RI  AQ++D  L K+++ +V G+ D+ + +  DG++R   +L VP +          
Sbjct: 1142 VIQRIVEAQVHDEFLEKVKAQLVAGEIDENWSMYEDGSVRFKGRLCVPKDVELRNELLAD 1201

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H A Y+IHPG+TKM  DL + F W GMK D+A +V  C  CQQVKAEHQRP+ LLQPL 
Sbjct: 1202 AHRAKYTIHPGNTKMYQDLKRQFXWSGMKRDIAQFVANCQICQQVKAEHQRPAELLQPLP 1261

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP+WKW+ + MDFV+GLP+T+   N +WVIVDRLTKSAH L +K T +M   A+LYI EI
Sbjct: 1262 IPKWKWDNITMDFVIGLPRTRSKKNGVWVIVDRLTKSAHFLAMKTTDSMNSLAKLYIQEI 1321

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            VRLHG+P+SIVSDRDPKFTS FW+SL RA+GTQL+FST FHPQTDGQSER IQ LEDMLR
Sbjct: 1322 VRLHGIPVSIVSDRDPKFTSQFWQSLQRALGTQLNFSTVFHPQTDGQSERVIQILEDMLR 1381

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            ACV+DF G W  +LPL EF+YNN YQSSIGMAPYE
Sbjct: 1382 ACVLDFGGNWADYLPLAEFAYNNXYQSSIGMAPYE 1416



 Score = 94.7 bits (234), Expect(2) = e-114
 Identities = 47/100 (47%), Positives = 69/100 (69%), Gaps = 1/100 (1%)
 Frame = +3

Query: 6    HYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALSR 182
            HYLY  K+E+Y DHKSLK+ FTQK+LN RQR  +E ++DYD  ++YH GKANVVADALSR
Sbjct: 1038 HYLYGEKFEVYSDHKSLKYIFTQKDLNSRQRRWMETLEDYDFALHYHPGKANVVADALSR 1097

Query: 183  KGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLT 302
            K  G +++L   +  +   ++  EL +V + +   L +++
Sbjct: 1098 KSYGQLFSLGLREFEMYAVIEDFELCLVQEGRGPCLYSIS 1137


>ref|XP_015075513.1| PREDICTED: uncharacterized protein LOC107019601 [Solanum pennellii]
          Length = 1739

 Score =  350 bits (898), Expect(2) = e-114
 Identities = 164/272 (60%), Positives = 201/272 (73%)
 Frame = +2

Query: 320  ERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXSHN 499
            ++I+  Q +D KL  IR  V+ G+  + ++ +DG LR+G ++ VP            +H 
Sbjct: 1157 DQIRAHQFDDEKLCLIRDKVLRGEAKEAVLDSDGVLRIGGRICVPRTGDLIRLILEEAHC 1216

Query: 500  APYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMIPQ 679
            + YSIHPG+ KM HDLS+H+ W GMK D++ +V RCLTCQQVK EHQRP G+ Q + IP 
Sbjct: 1217 SRYSIHPGAAKMYHDLSQHYWWCGMKRDISDFVSRCLTCQQVKCEHQRPGGVSQRMPIPT 1276

Query: 680  WKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIVRL 859
            WKWER+ MDFVVGLP T  G++SIWV+VDRLTKSAH +PV+  Y  EK  ELYIS+IVRL
Sbjct: 1277 WKWERITMDFVVGLPTTVGGYDSIWVVVDRLTKSAHFIPVRVKYTAEKLVELYISQIVRL 1336

Query: 860  HGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRACV 1039
            HGVP+SI+SDR   FTS FWK+L   +GTQL  STAFHPQTDGQSERTIQ LEDMLRACV
Sbjct: 1337 HGVPVSIISDRGSLFTSHFWKALQHGLGTQLDMSTAFHPQTDGQSERTIQVLEDMLRACV 1396

Query: 1040 IDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            IDF   WD+ LPL EF+YNNSY SSI MAP+E
Sbjct: 1397 IDFGARWDRHLPLAEFAYNNSYHSSIQMAPFE 1428



 Score = 90.1 bits (222), Expect(2) = e-114
 Identities = 47/82 (57%), Positives = 61/82 (74%), Gaps = 3/82 (3%)
 Frame = +3

Query: 3    RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
            RHYLY    EI+ DH+SL++ F+Q++LN+RQR  LEL+KDYD  I YH GKANVVADALS
Sbjct: 1076 RHYLYGVHCEIFTDHRSLQYIFSQRDLNLRQRKWLELLKDYDVTILYHPGKANVVADALS 1135

Query: 180  RK--GRGTIYALHTIQKPLLKD 239
            RK    G++ AL   ++PL +D
Sbjct: 1136 RKTPSMGSLAALSIEERPLARD 1157


>ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508774222|gb|EOY21478.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 878

 Score =  341 bits (874), Expect(2) = e-113
 Identities = 160/274 (58%), Positives = 201/274 (73%)
 Frame = +2

Query: 314  ILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXS 493
            +++RIK AQ  D  ++K        K   +    DG LR GT+L VP+           +
Sbjct: 581  LMDRIKEAQSKDEFVIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILEEA 640

Query: 494  HNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMI 673
            H A Y +HPG+TKM  DL + + W G+K DVA +V +CL CQQVKAEHQ+P+GLLQPL +
Sbjct: 641  HMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPV 700

Query: 674  PQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIV 853
            P+WKWE + MDFV GLP+T  G++SIW++VDRLTKSAH LPVK TY   +YA +Y+ EIV
Sbjct: 701  PEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIV 760

Query: 854  RLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRA 1033
            RLHG+P+SIVSDR  +FTS FW  L  A+GT+L FSTAFHPQTDGQSERTIQTLEDMLRA
Sbjct: 761  RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRA 820

Query: 1034 CVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            CVID    W+++LPLVEF+YNNS+Q+SI MAP+E
Sbjct: 821  CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFE 854



 Score = 97.1 bits (240), Expect(2) = e-113
 Identities = 48/84 (57%), Positives = 63/84 (75%), Gaps = 1/84 (1%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY    EIY DHKSLK+ F Q++LN+RQR  +EL+KDYDC I YH GKANVVADALS
Sbjct: 473 RHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALS 532

Query: 180 RKGRGTIYALHTIQKPLLKDMQSL 251
           RK  G++  +   ++ L++++ SL
Sbjct: 533 RKSMGSLAHIFIGRRSLVREIHSL 556


>ref|XP_007219896.1| hypothetical protein PRUPE_ppb014768mg [Prunus persica]
            gi|462416358|gb|EMJ21095.1| hypothetical protein
            PRUPE_ppb014768mg [Prunus persica]
          Length = 602

 Score =  350 bits (897), Expect = e-112
 Identities = 173/317 (54%), Positives = 220/317 (69%)
 Frame = +2

Query: 185  GSRDYLCLAHDSKATTERHAELRVGNCISREVELFDYPDLTTSILERIKVAQLNDPKLVK 364
            GS  +L  A+       R  E+ +G  +++   +F    L   ++ER+ VAQL DP L +
Sbjct: 103  GSLAHLRTAYLPLLVELRKDEVELG--MTQRGGIFASLHLRPILVERVIVAQLGDPTLCR 160

Query: 365  IRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXSHNAPYSIHPGSTKMKHD 544
            IR  V  G R DY I  DGAL  GT+L VP N          +H + YS+HPGSTKM   
Sbjct: 161  IRGEVENGTRKDYAIRGDGALVTGTRLCVPKNDDLKREIMEEAHCSTYSMHPGSTKMYRT 220

Query: 545  LSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMIPQWKWERVGMDFVVGLP 724
            L +++SW  MK D+A +V +CL CQQVKAE Q+PSGL+QPL+IP+WKWER+ MDFV  LP
Sbjct: 221  LREYYSWPHMKGDIAKFVSKCLICQQVKAERQKPSGLMQPLLIPEWKWERITMDFVFKLP 280

Query: 725  KTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIVRLHGVPLSIVSDRDPKF 904
            +T  G + IWVIVDRLTKSAH +P+K+TY++ K A+L++ EIVRLHG P+SIVSDRD +F
Sbjct: 281  RTSNGHDGIWVIVDRLTKSAHFIPIKETYSLTKLAKLFVDEIVRLHGAPVSIVSDRDARF 340

Query: 905  TSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRACVIDFSGTWDKFLPLVE 1084
            TS FW+ L  A+GT L F+T FHPQT+GQSERTIQT EDMLR+CV+     WD  L LVE
Sbjct: 341  TSRFWRCLQEAIGTGLQFNTTFHPQTEGQSERTIQTQEDMLRSCVLQIKDAWDAHLALVE 400

Query: 1085 FSYNNSYQSSIGMAPYE 1135
            F+YNNSY +SI MAPYE
Sbjct: 401  FAYNNSYHASIQMAPYE 417



 Score =  100 bits (250), Expect = 1e-19
 Identities = 54/102 (52%), Positives = 69/102 (67%), Gaps = 1/102 (0%)
 Frame = +3

Query: 6   HYLYEAKYEIYMDHKSLK-FFTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALSR 182
           HYLY    +I+ D KSLK FFTQKELNMRQR  LEL+KDYD  I YH G+ANVVADALSR
Sbjct: 40  HYLYGETCQIFTDPKSLKYFFTQKELNMRQRRWLELIKDYDSTIEYHPGRANVVADALSR 99

Query: 183 KGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLTLQ 308
           K  G++  L T   PLL +++  E+E+   ++     +L L+
Sbjct: 100 KTTGSLAHLRTAYLPLLVELRKDEVELGMTQRGGIFASLHLR 141


>gb|KYP78784.1| Retrotransposable element Tf2, partial [Cajanus cajan]
          Length = 901

 Score =  343 bits (881), Expect(2) = e-112
 Identities = 161/278 (57%), Positives = 204/278 (73%)
 Frame = +2

Query: 302  LTTSILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXX 481
            ++  +L+ I+ AQL D  LV  R A+  G   ++++ +DG +R G ++ VP+        
Sbjct: 408  VSNDMLKEIRDAQLEDSFLVARREAIEGGSGGEFVLGSDGVVRFGDRVCVPSEATLRRLI 467

Query: 482  XXXSHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQ 661
                H +  S HPGSTKM  DL K F W  MK D+  +   CL CQ+ K EHQ+PSGLLQ
Sbjct: 468  LEEGHKSKLSFHPGSTKMYQDLKKMFWWPRMKRDIEEFASACLVCQKAKVEHQKPSGLLQ 527

Query: 662  PLMIPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYI 841
            PL IP+WKW+ + MDFVV LP+T  G +SIWVIVDRLTK AH LP+   Y++EK A+LYI
Sbjct: 528  PLSIPEWKWDSISMDFVVALPRTVGGHDSIWVIVDRLTKCAHFLPINIKYSLEKLAKLYI 587

Query: 842  SEIVRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLED 1021
            SEIVRLHGVP SIVSDRDP+FTS FW+SL +A+GTQL  S+A+HPQTDGQ+ERTIQ+LED
Sbjct: 588  SEIVRLHGVPSSIVSDRDPRFTSRFWESLQQALGTQLRLSSAYHPQTDGQTERTIQSLED 647

Query: 1022 MLRACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            +LRACV+D  G+WD FLPL+EF+YNNS+ SSIGMAPYE
Sbjct: 648  LLRACVLDQGGSWDSFLPLIEFTYNNSFHSSIGMAPYE 685



 Score = 92.8 bits (229), Expect(2) = e-112
 Identities = 53/109 (48%), Positives = 68/109 (62%), Gaps = 1/109 (0%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY +K+E++ DHK+LK+ F QKELNMRQR  LE +KDYD  ++YH GKANVVADALS
Sbjct: 308 RHYLYGSKFEVFSDHKNLKYLFDQKELNMRQRRWLEFLKDYDFELSYHPGKANVVADALS 367

Query: 180 RKGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLTLQRQFSKE 326
           RK    I +L   +  LL   + L L   +   S  L  + +     KE
Sbjct: 368 RKSL-HISSLMIREMDLLAQFRDLSLACETTSSSIRLGMIRVSNDMLKE 415


>ref|XP_012829796.1| PREDICTED: uncharacterized protein LOC105950954 [Erythranthe guttata]
          Length = 1316

 Score =  365 bits (936), Expect = e-112
 Identities = 186/319 (58%), Positives = 223/319 (69%), Gaps = 4/319 (1%)
 Frame = +2

Query: 191  RDYLC--LAHDSKATTERHAELRVGNCISREV--ELFDYPDLTTSILERIKVAQLNDPKL 358
            +DY C  L H SKA         V + +SR+    L   P L ++I    K AQ +D +L
Sbjct: 735  KDYDCEILYHPSKANV-------VADALSRKSMSALIIKPPLESTI----KSAQDHDDQL 783

Query: 359  VKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXSHNAPYSIHPGSTKMK 538
            VKIR  +  G+  ++ +T    L+   ++ +P N          +H  PYS HPG TKM 
Sbjct: 784  VKIREGLATGQNPNFSMTDGKILKFQGRICIPANKEIKGLILDEAHKTPYSCHPGETKMY 843

Query: 539  HDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMIPQWKWERVGMDFVVG 718
             DL K + W GMK D+A YV  CL CQQ+K EHQRP GLLQ   IP+WKWE V MDFV G
Sbjct: 844  QDLKKLYWWPGMKKDIAKYVSECLICQQIKTEHQRPGGLLQSNHIPEWKWESVTMDFVQG 903

Query: 719  LPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIVRLHGVPLSIVSDRDP 898
             PKT  G +SIWVIVDRLTKSAH LPVK T+++EK AELYI EIVRLHGVP+SI+SDRDP
Sbjct: 904  FPKTLKGSDSIWVIVDRLTKSAHFLPVKTTFSLEKLAELYIGEIVRLHGVPISIISDRDP 963

Query: 899  KFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRACVIDFSGTWDKFLPL 1078
            +FTS FWK LH AMGT+LSFSTA+HPQTDGQSERTI+TLEDMLRAC++DF G W+  LPL
Sbjct: 964  RFTSKFWKRLHEAMGTRLSFSTAYHPQTDGQSERTIKTLEDMLRACIMDFGGNWESRLPL 1023

Query: 1079 VEFSYNNSYQSSIGMAPYE 1135
            +EFSYNNS+QSSIGMAPYE
Sbjct: 1024 IEFSYNNSFQSSIGMAPYE 1042



 Score = 95.1 bits (235), Expect = 1e-17
 Identities = 53/85 (62%), Positives = 59/85 (69%), Gaps = 1/85 (1%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLK-FFTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY  K  I+ DHKSLK FFTQKELNMRQR  LELVKDYDC I YH  KANVVADALS
Sbjct: 697 RHYLYGEKCSIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANVVADALS 756

Query: 180 RKGRGTIYALHTIQKPLLKDMQSLE 254
           RK    +     I+ PL   ++S +
Sbjct: 757 RKSMSAL----IIKPPLESTIKSAQ 777


>ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508779254|gb|EOY26510.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1290

 Score =  331 bits (849), Expect(2) = e-112
 Identities = 156/275 (56%), Positives = 203/275 (73%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            S+L +I+  Q  D  L +    + +G+  ++ ++ DG L L  ++ VP +          
Sbjct: 833  SLLNQIRELQKFDDWLKQEVQKLQDGEASEFRLSDDGTLMLRDRICVPKDDQLRRAILEE 892

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H++ Y++HPGSTKM   + + + W GMK D+A +V +CL CQQ+KAEHQ+ SG LQPL 
Sbjct: 893  AHSSAYALHPGSTKMYQTIKESYWWPGMKRDIAEFVAKCLICQQIKAEHQKSSGTLQPLP 952

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP+WKWE V MDFV+GLP+T+ G ++IWVI+ RLTKSAH L +  TY++E+ A LYI E+
Sbjct: 953  IPEWKWEHVTMDFVLGLPRTQSGKDAIWVIMGRLTKSAHFLAIHSTYSIERLARLYIDEV 1012

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            VRLHGVP+SIVSDRDP+FTS FW     A+GT+L FSTAFHPQ DGQSERTIQTLEDMLR
Sbjct: 1013 VRLHGVPVSIVSDRDPRFTSRFWPKFQEALGTKLRFSTAFHPQIDGQSERTIQTLEDMLR 1072

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            ACVIDF  +WD+ LPLVEF+YNNS+QSSIGMA YE
Sbjct: 1073 ACVIDFIRSWDRHLPLVEFAYNNSFQSSIGMATYE 1107



 Score =  103 bits (256), Expect(2) = e-112
 Identities = 52/103 (50%), Positives = 72/103 (69%), Gaps = 1/103 (0%)
 Frame = +3

Query: 3    RHYLYEAKYEIYMDHKSLKFF-TQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
            RHYLY  +  I+ DHKSLK+  TQKELN+RQR  LEL+KDYD VI+YH GKANVV DALS
Sbjct: 729  RHYLYGERCRIFFDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHPGKANVVTDALS 788

Query: 180  RKGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLTLQ 308
            RK   ++  L +   P+L +M+SL +++ + E  + L +  ++
Sbjct: 789  RKSSSSLATLRSSYFPMLLEMKSLGIQLNNGEDGTLLASFVVR 831


>ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobroma cacao]
            gi|508727367|gb|EOY19264.1| Uncharacterized protein
            TCM_044274 [Theobroma cacao]
          Length = 860

 Score =  335 bits (859), Expect(2) = e-112
 Identities = 157/274 (57%), Positives = 200/274 (72%)
 Frame = +2

Query: 314  ILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXS 493
            ++++IK AQ  D  ++K        K   +    DG LR GT+L VP+           +
Sbjct: 415  LMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILEEA 474

Query: 494  HNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMI 673
            H A Y +HPG+TKM  DL + + W G+K DVA +V +CL CQQVKAEHQ+P+GLLQPL +
Sbjct: 475  HMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPV 534

Query: 674  PQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIV 853
            P+WKWE + MDFV GLP+T  G++SIW++VDRLTKSAH LPVK TY   +YA +Y+ EIV
Sbjct: 535  PEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDEIV 594

Query: 854  RLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRA 1033
            RLHG+P+SIVSDR  +FTS FW  L  A+GT+L FSTAFHPQTDGQSERTI+TLEDMLRA
Sbjct: 595  RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIKTLEDMLRA 654

Query: 1034 CVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            CVID    W+++LPLVEF+YNNS+Q+SI MA +E
Sbjct: 655  CVIDLGVKWEQYLPLVEFAYNNSFQTSIQMAAFE 688



 Score = 99.4 bits (246), Expect(2) = e-112
 Identities = 49/84 (58%), Positives = 64/84 (76%), Gaps = 1/84 (1%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY    EIYMDHKSLK+ F Q++LN+RQR  +EL+KDYDC I YH GKANVVADALS
Sbjct: 307 RHYLYGETCEIYMDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALS 366

Query: 180 RKGRGTIYALHTIQKPLLKDMQSL 251
           RK  G++  +   ++ L++++ SL
Sbjct: 367 RKSMGSLAHISIGRRSLVREIHSL 390


>ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508779195|gb|EOY26451.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 679

 Score =  338 bits (867), Expect(2) = e-112
 Identities = 159/274 (58%), Positives = 200/274 (72%)
 Frame = +2

Query: 314  ILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXS 493
            +++RIK AQ  D  ++K        K   +    DG LR GT+L VP+           +
Sbjct: 204  LMDRIKEAQSKDEFVIKALEDPRGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILEEA 263

Query: 494  HNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMI 673
            H A Y +HPG+TKM  DL + + W G+K DVA +V +CL CQQVKAEHQ+P+GLLQPL +
Sbjct: 264  HMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPV 323

Query: 674  PQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIV 853
            P+WKWE + MDFV GLP+T  G++SIW++VD+LTKSAH LPVK TY    YA +Y+ EIV
Sbjct: 324  PEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDQLTKSAHFLPVKTTYGAAHYARVYVDEIV 383

Query: 854  RLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRA 1033
            RLHG+P+SIVSDR  +FTS FW  L  A+GT+L FSTAFHPQTDGQSERTIQTLEDMLRA
Sbjct: 384  RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRA 443

Query: 1034 CVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            CVID    W+++LPLVEF+YNNS+Q+SI MAP+E
Sbjct: 444  CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFE 477



 Score = 95.5 bits (236), Expect(2) = e-112
 Identities = 47/84 (55%), Positives = 62/84 (73%), Gaps = 1/84 (1%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY    EIY DHKSLK+ F Q++ N+RQR  +EL+KDYDC I YH GKANVVADALS
Sbjct: 96  RHYLYGETCEIYTDHKSLKYIFQQRDFNLRQRRWMELLKDYDCTILYHPGKANVVADALS 155

Query: 180 RKGRGTIYALHTIQKPLLKDMQSL 251
           RK  G++  +   ++ L++++ SL
Sbjct: 156 RKSMGSLAHISIGRRSLVREIHSL 179


>ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508716781|gb|EOY08678.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 666

 Score =  335 bits (859), Expect(2) = e-111
 Identities = 157/274 (57%), Positives = 199/274 (72%)
 Frame = +2

Query: 314  ILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXXS 493
            ++++IK AQ  D  ++K        K   +    DG LR GT+L VP+           +
Sbjct: 298  LMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRKILEEA 357

Query: 494  HNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLMI 673
            H A Y +HPG+TKM  DL + + W G+K DVA +V +CL CQQVKAEHQ+P+GLLQPL +
Sbjct: 358  HMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPV 417

Query: 674  PQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEIV 853
            P+WKWE + MDFV GLP+T  G++SIW++VDRLTKSAH L VK TY   +YA +Y+ EIV
Sbjct: 418  PEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLSVKTTYGAAQYARVYVDEIV 477

Query: 854  RLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLRA 1033
            RLHG+P+SIVSDR  +FTS FW  L  A+GT+L FST FHPQTDGQSERTIQTLEDMLRA
Sbjct: 478  RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTTFHPQTDGQSERTIQTLEDMLRA 537

Query: 1034 CVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            CVID    W+++LPLVEF+YNNS+Q+SI MAP+E
Sbjct: 538  CVIDLGVKWEQYLPLVEFAYNNSFQTSIQMAPFE 571



 Score = 97.1 bits (240), Expect(2) = e-111
 Identities = 48/84 (57%), Positives = 63/84 (75%), Gaps = 1/84 (1%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY    EIY DHKSLK+ F Q++LN+RQR  +EL+KDYDC I YH GKANVVADALS
Sbjct: 190 RHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALS 249

Query: 180 RKGRGTIYALHTIQKPLLKDMQSL 251
           RK  G++  +   ++ L++++ SL
Sbjct: 250 RKSMGSLAHISIGRRSLVREIHSL 273


>gb|AAO19383.1| putative polyprotein [Oryza sativa Japonica Group]
            gi|108710558|gb|ABF98353.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 1770

 Score =  335 bits (860), Expect(2) = e-111
 Identities = 155/275 (56%), Positives = 202/275 (73%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            +++++++ AQ+NDP + +I+  +  GK   ++    G + LG ++ VP+N          
Sbjct: 1268 TLIDQVREAQINDPDIQEIKKNMRRGKAIGFLEDEQGTVWLGERICVPDNKDLKDAILKE 1327

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H+  YSIHPGSTKM  DL++ FSW  MK ++A YV  C  CQ+VKAEHQ+P+GLLQPL 
Sbjct: 1328 AHDTLYSIHPGSTKMYQDLTEGFSWASMKREIAEYVAVCDVCQRVKAEHQKPAGLLQPLK 1387

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP+WKWE +GMDF+ GLP+T  G +SIWVIVDRLTK AH +PVK TY+  + AELY++ I
Sbjct: 1388 IPEWKWEEIGMDFITGLPRTSSGHDSIWVIVDRLTKVAHFIPVKTTYSGSRLAELYMARI 1447

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            V LHGVP  IVSDR  +FTS FWK L   MG++L+FSTA+HPQTDGQ+ER  Q LEDMLR
Sbjct: 1448 VCLHGVPKKIVSDRGSQFTSNFWKKLQEEMGSKLNFSTAYHPQTDGQTERVNQVLEDMLR 1507

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            AC +DF G+WDK LP  EFSYNNSYQ+S+ MAPYE
Sbjct: 1508 ACALDFGGSWDKNLPYAEFSYNNSYQASLQMAPYE 1542



 Score = 95.5 bits (236), Expect(2) = e-111
 Identities = 51/91 (56%), Positives = 62/91 (68%), Gaps = 1/91 (1%)
 Frame = +3

Query: 3    RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
            RHYL+  + E+Y DHKSLK+ FTQ +LNMRQR  LEL+KDYD  I+YH GKANVVAD LS
Sbjct: 1167 RHYLFGTRTEVYTDHKSLKYIFTQPDLNMRQRRWLELIKDYDMGIHYHPGKANVVADTLS 1226

Query: 180  RKGRGTIYALHTIQKPLLKDMQSLELEIVSQ 272
            RKG         +   L K+ + L L IVS+
Sbjct: 1227 RKGYCNATEGRQLPLELCKEFERLNLGIVSR 1257


>gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]
          Length = 923

 Score =  322 bits (824), Expect(2) = e-111
 Identities = 157/279 (56%), Positives = 204/279 (73%), Gaps = 4/279 (1%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            ++ +RI  AQ NDP LV+ R     G+  ++ +++DG L    +L VP++          
Sbjct: 446  TLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSE 505

Query: 491  SHNAPYSIHPGSTKMKHDLS-KHFSWVG---MKNDVATYVQRCLTCQQVKAEHQRPSGLL 658
            +H++P+S+HPGST+   D+S     ++G   MK +VA +V +CL CQQVKA  Q+P+GLL
Sbjct: 506  AHSSPFSMHPGSTE---DVSGPEAGFIGGRNMKREVAEFVSKCLVCQQVKAPRQKPAGLL 562

Query: 659  QPLMIPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELY 838
            QPL IP+WKWE V MDF+ GLP+T  GF  IWV+VDRLTKSAH +P K TY   K+A+LY
Sbjct: 563  QPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLY 622

Query: 839  ISEIVRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLE 1018
            +SEIVRLHGVP+SIVSDRD +FTS FWK L  AMGT+L FSTAFHPQTDGQ+ER  Q LE
Sbjct: 623  MSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLE 682

Query: 1019 DMLRACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            DMLRAC ++F G+WD  L L+EF+YNNSYQ++IGMAP+E
Sbjct: 683  DMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFE 721



 Score =  109 bits (272), Expect(2) = e-111
 Identities = 62/103 (60%), Positives = 73/103 (70%), Gaps = 1/103 (0%)
 Frame = +3

Query: 3   RHYLYEAKYEIYMDHKSLK-FFTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
           RHYLY  K +I+ DHKSLK FFTQKELNMRQR  LELVKDYDC I YH GKANVVADALS
Sbjct: 343 RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALS 402

Query: 180 RKGRGTIYALHTIQKPLLKDMQSLELEIVSQEKSSYLTTLTLQ 308
           RK   +  AL T Q PL +D++  E+ ++    +  L  LT+Q
Sbjct: 403 RKVSHSA-ALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQ 444


>gb|AAV31278.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1727

 Score =  333 bits (855), Expect(2) = e-111
 Identities = 154/275 (56%), Positives = 200/275 (72%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            +++++++ AQ+NDP + +I+  +  GK   ++    G +RLG ++ VP+N          
Sbjct: 1166 TLIDQVREAQINDPDIQEIKKNMRRGKAIGFLEDEHGTVRLGERICVPDNKDLKDAILKE 1225

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H+  YSIHPGSTKM  DL + F W  MK ++A YV  C  CQ+VKAEHQ+P+ LLQPL 
Sbjct: 1226 AHDTLYSIHPGSTKMYQDLKERFWWASMKREIAEYVAVCDVCQRVKAEHQKPASLLQPLK 1285

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP+WKWE +GMDF+ GLP+T  G +SIWVIVDRLTK AH +PVK TY+  + AELY++ I
Sbjct: 1286 IPEWKWEEIGMDFITGLPRTSSGHDSIWVIVDRLTKVAHFIPVKTTYSGSRLAELYMARI 1345

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            V LHGVP  IVSDR  +FTS FWK L   MG++L+FSTA+HPQTDGQ+ER  Q LEDMLR
Sbjct: 1346 VCLHGVPKKIVSDRGSQFTSNFWKKLQEEMGSKLNFSTAYHPQTDGQTERVNQILEDMLR 1405

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            AC +DF G+WDK LP  EFSYNNSYQ+S+ MAPYE
Sbjct: 1406 ACALDFGGSWDKSLPYAEFSYNNSYQASLQMAPYE 1440



 Score = 97.1 bits (240), Expect(2) = e-111
 Identities = 52/91 (57%), Positives = 63/91 (69%), Gaps = 1/91 (1%)
 Frame = +3

Query: 3    RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
            RHYL+  + E+Y DHKSLK+ FTQ +LNMRQR  LEL+KDYD  I+YH GKANVVADALS
Sbjct: 1065 RHYLFGTRTEVYTDHKSLKYIFTQPDLNMRQRRWLELIKDYDMGIHYHPGKANVVADALS 1124

Query: 180  RKGRGTIYALHTIQKPLLKDMQSLELEIVSQ 272
            RKG         +   L K+ + L L IVS+
Sbjct: 1125 RKGYCNATEGRQLPLELCKEFERLNLGIVSR 1155


>gb|ABG22001.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1751

 Score =  333 bits (854), Expect(2) = e-111
 Identities = 154/275 (56%), Positives = 200/275 (72%)
 Frame = +2

Query: 311  SILERIKVAQLNDPKLVKIRSAVVEGKRDDYMITADGALRLGTKLAVPNNXXXXXXXXXX 490
            +++++++ AQ+NDP + +I+  +  GK   ++    G + LG ++ VP+N          
Sbjct: 1265 TLIDQVREAQINDPDIQEIKKNMRRGKAIGFLEDEQGTVWLGERICVPDNKDLKDAILQE 1324

Query: 491  SHNAPYSIHPGSTKMKHDLSKHFSWVGMKNDVATYVQRCLTCQQVKAEHQRPSGLLQPLM 670
            +H+  YSIHPGSTKM  DL + F W  MK ++A YV  C  CQ+VKAEHQ+P+GLLQPL 
Sbjct: 1325 AHDTLYSIHPGSTKMYQDLKERFWWASMKREIAEYVVVCDVCQRVKAEHQKPAGLLQPLK 1384

Query: 671  IPQWKWERVGMDFVVGLPKTKVGFNSIWVIVDRLTKSAHLLPVKKTYNMEKYAELYISEI 850
            IP+WKWE +GMDF+ GLP+T  G +SIWVIVDRLTK AH +PVK TY+  + AELY++ I
Sbjct: 1385 IPEWKWEEIGMDFITGLPRTSSGHDSIWVIVDRLTKVAHFIPVKTTYSRSRLAELYMARI 1444

Query: 851  VRLHGVPLSIVSDRDPKFTSAFWKSLHRAMGTQLSFSTAFHPQTDGQSERTIQTLEDMLR 1030
            V LHGVP  IVSDR  +FTS FWK L   MG++L+FSTA+HPQTDGQ+ER  Q LEDMLR
Sbjct: 1445 VCLHGVPKKIVSDRGSQFTSNFWKKLQEEMGSKLNFSTAYHPQTDGQTERVNQILEDMLR 1504

Query: 1031 ACVIDFSGTWDKFLPLVEFSYNNSYQSSIGMAPYE 1135
            AC +DF G+WDK LP  EFSYNNSYQ+S+ MAPYE
Sbjct: 1505 ACALDFGGSWDKNLPYAEFSYNNSYQASLQMAPYE 1539



 Score = 97.1 bits (240), Expect(2) = e-111
 Identities = 52/91 (57%), Positives = 63/91 (69%), Gaps = 1/91 (1%)
 Frame = +3

Query: 3    RHYLYEAKYEIYMDHKSLKF-FTQKELNMRQRGSLELVKDYDCVINYHQGKANVVADALS 179
            RHYL+  + E+Y DHKSLK+ FTQ +LNMRQR  LEL+KDYD  I+YH GKANVVADALS
Sbjct: 1164 RHYLFGTRTEVYTDHKSLKYIFTQPDLNMRQRRWLELIKDYDMGIHYHPGKANVVADALS 1223

Query: 180  RKGRGTIYALHTIQKPLLKDMQSLELEIVSQ 272
            RKG         +   L K+ + L L IVS+
Sbjct: 1224 RKGYCNATEGRQLPLELCKEFERLNLGIVSR 1254

