BLASTX nr result

ID: Sinomenium22_contig00019158

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00019158
         (1089 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007226950.1| hypothetical protein PRUPE_ppa025194mg, part...   162   2e-37
ref|XP_007219137.1| hypothetical protein PRUPE_ppa015965mg [Prun...   162   2e-37
ref|XP_007224256.1| hypothetical protein PRUPE_ppa018408mg, part...   162   3e-37
ref|XP_007203318.1| hypothetical protein PRUPE_ppa019964mg, part...   162   3e-37
ref|XP_007220363.1| hypothetical protein PRUPE_ppa016496mg, part...   129   2e-27
ref|XP_004305946.1| PREDICTED: uncharacterized protein LOC101303...   127   6e-27
ref|XP_007214321.1| hypothetical protein PRUPE_ppb019697mg [Prun...   112   4e-22
ref|XP_007225525.1| hypothetical protein PRUPE_ppa026504mg, part...    89   3e-16
gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sa...    72   4e-10
dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]        72   5e-10
gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japoni...    72   5e-10
gb|AAN09859.1| putative polyprotein [Oryza sativa Japonica Group]      71   7e-10
gb|ABG65972.1| retrotransposon protein, putative, Ty3-gypsy subc...    71   7e-10
ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [The...    71   9e-10
ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [The...    69   3e-09
ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...    69   3e-09
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...    69   5e-09
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                  68   6e-09
ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom...    68   8e-09
ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom...    68   8e-09

>ref|XP_007226950.1| hypothetical protein PRUPE_ppa025194mg, partial [Prunus persica]
           gi|462423886|gb|EMJ28149.1| hypothetical protein
           PRUPE_ppa025194mg, partial [Prunus persica]
          Length = 1347

 Score =  162 bits (411), Expect = 2e-37
 Identities = 84/150 (56%), Positives = 108/150 (72%), Gaps = 5/150 (3%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GSQKNLIS + V K+GLET PHPKPY LGWIQKD++L+I  QC F+FAIT++YIDE
Sbjct: 331 IIDPGSQKNLISEALVRKVGLETTPHPKPYPLGWIQKDVDLQITKQCTFKFAITNRYIDE 390

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVVR--KDRTCQKL---D 825
           +TCEVVP    Q    S YLW+RDAI+Y R +KY  +KDGK F +   K +    L   +
Sbjct: 391 VTCEVVPLDVCQVILGSPYLWDRDAIHYRRLRKYRLVKDGKEFHINACKPQATNNLLIDN 450

Query: 826 LVTTCQAQRMVNVCQKFVLLMVRPLEAEAG 915
           L+T  QA+R+VN C +FVLLM+RP +  +G
Sbjct: 451 LLTANQAKRLVNSCGRFVLLMIRPQDQSSG 480



 Score = 51.6 bits (122), Expect(2) = 1e-07
 Identities = 44/165 (26%), Positives = 70/165 (42%)
 Frame = +2

Query: 221 HEKGKCWKLHLELFPAK*KKDERGKRTMAADTTLNDKIELGLVEEANKSLSFMAIPKETA 400
           H K KCW LH EL P K +K+ +G+    A  T     EL  +++ + +L+ M  P +  
Sbjct: 250 HAKDKCWILHPELRP-KREKNNQGRNDRKATLTTQQAEELPELKQPDVTLTLMTRPADIE 308

Query: 401 SSSPALVYEKEELFIGWIQVKQDIIGASWILEVKRT*FQQAKFRSWGWRRCHILSHICWG 580
            +     Y +EELF   IQVKQ ++ A      ++    +A  R  G            G
Sbjct: 309 DT-----YNREELFHVNIQVKQSVVQAIIDPGSQKNLISEALVRKVGLETTPHPKPYPLG 363

Query: 581 GFKKTWSSKLTASVSSDLQSPANILMK*LARWCPLDICKVIFEVP 715
             +K    ++T   +         + +      PLD+C+VI   P
Sbjct: 364 WIQKDVDLQITKQCTFKFAITNRYIDEVTCEVVPLDVCQVILGSP 408



 Score = 32.0 bits (71), Expect(2) = 1e-07
 Identities = 21/62 (33%), Positives = 30/62 (48%)
 Frame = +1

Query: 22  DVYNKFVAGLHWQIQGEMHLYQAMNISQASGIALAIEQNNKAGAPRVPGNGKKRDGGQPS 201
           DV+ K+  GL   I+ E+ L+    I +A+  A+AIE  NK          KK D  +P 
Sbjct: 168 DVFMKYTGGLADYIRKELKLFTVDTIEEATVKAIAIEAKNKR-------TDKKDDRSKPV 220

Query: 202 KK 207
            K
Sbjct: 221 NK 222


>ref|XP_007219137.1| hypothetical protein PRUPE_ppa015965mg [Prunus persica]
            gi|462415599|gb|EMJ20336.1| hypothetical protein
            PRUPE_ppa015965mg [Prunus persica]
          Length = 1484

 Score =  162 bits (411), Expect = 2e-37
 Identities = 114/316 (36%), Positives = 163/316 (51%), Gaps = 24/316 (7%)
 Frame = +1

Query: 22   DVYNKFVAGLHWQIQGEMHLYQAMNISQASGIALAIEQNNKAGAPRVPGNGKKRDGGQPS 201
            DV+ K+  GL   I+ E+ L+    I +A+  A+AIE  NK          KK D  +P 
Sbjct: 249  DVFMKYTGGLADYIRKELKLFTVDTIEKATVKAIAIEAKNKR-------TDKKDDRSKPV 301

Query: 202  KKLL-----RRT*EGEVLEITPRTISCKVKER*EGKAYDGGGHNFE*QDRARTGRGGKQE 366
             K       +R+ EG+  ++      C+     + K +         +++   GR  ++ 
Sbjct: 302  NKTDWQKKGKRSKEGQTQKVY--CDHCQTSRHAKDKCWTLHPELRPKREKNNQGRNDRKA 359

Query: 367  SIFYGNTEG------DSFKFTCLS--------LREGRVVHRMDSG*AGYHWGIVDTGSQK 504
            ++     E            T ++             + H            I+D GSQK
Sbjct: 360  TLTTQQAEELPELKQPDVTLTLMTRPADIEDTYNREELFHVNIQVKQSVVQAIIDPGSQK 419

Query: 505  NLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDEITCEVVPP 684
            NLIS + V K+GLET PHPKPY LGWIQKD++L+I  QC F+FAIT++YIDE+TCEVVP 
Sbjct: 420  NLISEALVRKVGLETTPHPKPYPLGWIQKDVDLQITKQCTFKFAITNRYIDEVTCEVVPL 479

Query: 685  *YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVVR--KDRTCQKL---DLVTTCQAQ 849
               Q    S YLW+RDAI+Y R +KY  +KDGK F +   K +    L   +L+T  QA+
Sbjct: 480  DVCQVILGSPYLWDRDAIHYRRLRKYRLVKDGKEFHINACKPQATNNLLTDNLLTANQAK 539

Query: 850  RMVNVCQKFVLLMVRP 897
            R+VN C +FVLLM+RP
Sbjct: 540  RLVNSCGRFVLLMIRP 555


>ref|XP_007224256.1| hypothetical protein PRUPE_ppa018408mg, partial [Prunus persica]
           gi|462421192|gb|EMJ25455.1| hypothetical protein
           PRUPE_ppa018408mg, partial [Prunus persica]
          Length = 1440

 Score =  162 bits (409), Expect = 3e-37
 Identities = 84/150 (56%), Positives = 107/150 (71%), Gaps = 5/150 (3%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GSQKNLIS + V K+GLET PHPKPY LGWIQKD++L+I  QC F+FAIT++YIDE
Sbjct: 402 IIDPGSQKNLISEALVRKVGLETTPHPKPYPLGWIQKDVDLQITKQCTFKFAITNRYIDE 461

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVVR--KDRTCQKL---D 825
           +TCEVVP    Q    S YLW+RDAI+Y R +KY  +KDGK F +   K +    L   +
Sbjct: 462 VTCEVVPLDVCQVILGSPYLWDRDAIHYRRLRKYRLVKDGKEFHINACKHQATNNLLTDN 521

Query: 826 LVTTCQAQRMVNVCQKFVLLMVRPLEAEAG 915
           L+T  QA+R+VN C +FVLLM+RP +   G
Sbjct: 522 LLTANQAKRLVNSCGRFVLLMIRPQDQNIG 551



 Score = 51.6 bits (122), Expect(2) = 1e-07
 Identities = 44/165 (26%), Positives = 70/165 (42%)
 Frame = +2

Query: 221 HEKGKCWKLHLELFPAK*KKDERGKRTMAADTTLNDKIELGLVEEANKSLSFMAIPKETA 400
           H K KCW LH EL P K +K+ +G+    A  T     EL  +++ + +L+ M  P +  
Sbjct: 321 HAKDKCWILHPELRP-KREKNNQGRNDRKATLTTQQAEELPELKQPDVTLTLMTRPADIE 379

Query: 401 SSSPALVYEKEELFIGWIQVKQDIIGASWILEVKRT*FQQAKFRSWGWRRCHILSHICWG 580
            +     Y +EELF   IQVKQ ++ A      ++    +A  R  G            G
Sbjct: 380 DT-----YNREELFHVNIQVKQSVVQAIIDPGSQKNLISEALVRKVGLETTPHPKPYPLG 434

Query: 581 GFKKTWSSKLTASVSSDLQSPANILMK*LARWCPLDICKVIFEVP 715
             +K    ++T   +         + +      PLD+C+VI   P
Sbjct: 435 WIQKDVDLQITKQCTFKFAITNRYIDEVTCEVVPLDVCQVILGSP 479



 Score = 32.0 bits (71), Expect(2) = 1e-07
 Identities = 21/62 (33%), Positives = 30/62 (48%)
 Frame = +1

Query: 22  DVYNKFVAGLHWQIQGEMHLYQAMNISQASGIALAIEQNNKAGAPRVPGNGKKRDGGQPS 201
           DV+ K+  GL   I+ E+ L+    I +A+  A+AIE  NK          KK D  +P 
Sbjct: 239 DVFMKYTGGLADYIRKELKLFTVDTIEEATVKAIAIEAKNKR-------TDKKDDRSKPV 291

Query: 202 KK 207
            K
Sbjct: 292 NK 293


>ref|XP_007203318.1| hypothetical protein PRUPE_ppa019964mg, partial [Prunus persica]
           gi|462398849|gb|EMJ04517.1| hypothetical protein
           PRUPE_ppa019964mg, partial [Prunus persica]
          Length = 1488

 Score =  162 bits (409), Expect = 3e-37
 Identities = 84/150 (56%), Positives = 107/150 (71%), Gaps = 5/150 (3%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GSQKNLIS + V K+GLET PHPKPY LGWIQKD++L+I  QC F+FAIT++YIDE
Sbjct: 402 IIDPGSQKNLISEALVRKVGLETTPHPKPYPLGWIQKDVDLQITKQCTFKFAITNRYIDE 461

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVVR--KDRTCQKL---D 825
           +TCEVVP    Q    S YLW+RDAI+Y R +KY  +KDGK F +   K +    L   +
Sbjct: 462 VTCEVVPLDVCQVILGSPYLWDRDAIHYRRLRKYRLVKDGKEFHINACKPQATNNLLTDN 521

Query: 826 LVTTCQAQRMVNVCQKFVLLMVRPLEAEAG 915
           L+T  QA+R+VN C +FVLLM+RP +   G
Sbjct: 522 LLTANQAKRLVNSCGRFVLLMIRPQDQNIG 551



 Score = 51.6 bits (122), Expect(2) = 5e-07
 Identities = 44/165 (26%), Positives = 70/165 (42%)
 Frame = +2

Query: 221 HEKGKCWKLHLELFPAK*KKDERGKRTMAADTTLNDKIELGLVEEANKSLSFMAIPKETA 400
           H K KCW LH EL P K +K+ +G+    A  T     EL  +++ + +L+ M  P +  
Sbjct: 321 HAKDKCWILHPELRP-KREKNNQGRNDRKATLTTQQAEELPELKQPDVTLTLMTRPADIE 379

Query: 401 SSSPALVYEKEELFIGWIQVKQDIIGASWILEVKRT*FQQAKFRSWGWRRCHILSHICWG 580
            +     Y +EELF   IQVKQ ++ A      ++    +A  R  G            G
Sbjct: 380 DT-----YNREELFHVNIQVKQSVVQAIIDPGSQKNLISEALVRKVGLETTPHPKPYPLG 434

Query: 581 GFKKTWSSKLTASVSSDLQSPANILMK*LARWCPLDICKVIFEVP 715
             +K    ++T   +         + +      PLD+C+VI   P
Sbjct: 435 WIQKDVDLQITKQCTFKFAITNRYIDEVTCEVVPLDVCQVILGSP 479



 Score = 30.0 bits (66), Expect(2) = 5e-07
 Identities = 20/62 (32%), Positives = 29/62 (46%)
 Frame = +1

Query: 22  DVYNKFVAGLHWQIQGEMHLYQAMNISQASGIALAIEQNNKAGAPRVPGNGKKRDGGQPS 201
           DV+  +  GL   I+ E+ L+    I +A+  A+AIE  NK          KK D  +P 
Sbjct: 239 DVFMNYTGGLADYIRKELKLFTVDTIEEATVKAIAIEAKNKR-------TDKKDDRSKPV 291

Query: 202 KK 207
            K
Sbjct: 292 NK 293


>ref|XP_007220363.1| hypothetical protein PRUPE_ppa016496mg, partial [Prunus persica]
           gi|462416825|gb|EMJ21562.1| hypothetical protein
           PRUPE_ppa016496mg, partial [Prunus persica]
          Length = 373

 Score =  129 bits (325), Expect = 2e-27
 Identities = 63/102 (61%), Positives = 79/102 (77%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GSQKNLIS + V K+GLET PHPKPY LGWIQKD++L+I  QC F+FAIT++YI+E
Sbjct: 270 IIDPGSQKNLISEALVRKVGLETTPHPKPYPLGWIQKDVDLQITKQCTFKFAITNRYINE 329

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKN 786
           +TCEVVP    Q    S  LW+RDAI+Y R +KY  +KDGK+
Sbjct: 330 VTCEVVPLDVCQVILGSPSLWDRDAIHYRRLRKYRLVKDGKD 371



 Score = 53.1 bits (126), Expect(2) = 1e-07
 Identities = 44/166 (26%), Positives = 72/166 (43%)
 Frame = +2

Query: 221 HEKGKCWKLHLELFPAK*KKDERGKRTMAADTTLNDKIELGLVEEANKSLSFMAIPKETA 400
           H K KCW LH EL P K +K+ +G++   A  T     EL  +++ + +L+ M  P +  
Sbjct: 189 HAKDKCWILHPELRP-KREKNNQGRKDRKATLTTQQAEELPELKQPDVTLTLMTRPADIE 247

Query: 401 SSSPALVYEKEELFIGWIQVKQDIIGASWILEVKRT*FQQAKFRSWGWRRCHILSHICWG 580
            +     Y +EELF   IQVKQ ++ A      ++    +A  R  G            G
Sbjct: 248 DT-----YNREELFHVNIQVKQSVVQAIIDPGSQKNLISEALVRKVGLETTPHPKPYPLG 302

Query: 581 GFKKTWSSKLTASVSSDLQSPANILMK*LARWCPLDICKVIFEVPT 718
             +K    ++T   +         + +      PLD+C+VI   P+
Sbjct: 303 WIQKDVDLQITKQCTFKFAITNRYINEVTCEVVPLDVCQVILGSPS 348



 Score = 30.8 bits (68), Expect(2) = 1e-07
 Identities = 16/41 (39%), Positives = 24/41 (58%)
 Frame = +1

Query: 22  DVYNKFVAGLHWQIQGEMHLYQAMNISQASGIALAIEQNNK 144
           DV+ K+  GL   I+ E+ L+    I +A+  A+AIE  NK
Sbjct: 127 DVFMKYTGGLADYIRKELKLFTVDTIEEATVKAIAIEAKNK 167


>ref|XP_004305946.1| PREDICTED: uncharacterized protein LOC101303732 [Fragaria vesca
           subsp. vesca]
          Length = 458

 Score =  127 bits (320), Expect = 6e-27
 Identities = 100/276 (36%), Positives = 144/276 (52%), Gaps = 18/276 (6%)
 Frame = +1

Query: 25  VYNKFVAGLHWQIQGEMHLYQAMNISQASGIALAIE----QNNKAGAPRVPGNGKKRDGG 192
           VY K+V+GL+  I+ E+ L+   +I++AS  A+AIE    +    G  ++PGN K  + G
Sbjct: 61  VYMKYVSGLNEYIRKELRLFTVESIAEASVKAIAIESRLRKGEAKGEAKLPGN-KTNNSG 119

Query: 193 QPSKKLLRRT*EGEVLEITPRTISCKVKER*EGKAYDGGGHNFE*QDRARTGRGGKQESI 372
              ++  R   E E  E +  +  C        K +    H      R +  +  K+ ++
Sbjct: 120 VKKEEPKRDKNESEGKESSTCS-HCGATNHAVEKCWVKYPHLKPRGLRQQEAK--KKAAL 176

Query: 373 FYGNTE--GDSFKFTCLSLREGR--VVHRMDSG*AGYHW----------GIVDTGSQKNL 510
             G TE  G +   T L+L  G+  +  + D     +             IVD GSQKNL
Sbjct: 177 ITGPTEVPGMTEPNTRLNLMAGKTPIAEKEDPREQLFVVKLQVKTSLVDAIVDPGSQKNL 236

Query: 511 ISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDEITCEVVPP*Y 690
           IS + V KLGL+T+ HPKPY LGWI+K+  L +  QC F+FA+   YIDE+TC+VVP   
Sbjct: 237 ISEALVQKLGLKTVKHPKPYPLGWIRKEAGLSVVNQCTFKFALHESYIDEVTCDVVPLDV 296

Query: 691 MQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVVR 798
            Q    + YLW+R AIY  RAQKY   KD + +VVR
Sbjct: 297 CQVILGNPYLWDRYAIYDRRAQKYTLTKDERQYVVR 332


>ref|XP_007214321.1| hypothetical protein PRUPE_ppb019697mg [Prunus persica]
           gi|462410186|gb|EMJ15520.1| hypothetical protein
           PRUPE_ppb019697mg [Prunus persica]
          Length = 303

 Score =  112 bits (279), Expect = 4e-22
 Identities = 56/106 (52%), Positives = 73/106 (68%)
 Frame = +1

Query: 532 KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDEITCEVVPP*YMQGDF*S 711
           K+GL+T PHPK Y LGWIQKD++L I  QC F+FAIT++YIDE+TCEVVP    Q    S
Sbjct: 51  KVGLDTTPHPKLYPLGWIQKDVDLHITKQCTFKFAITNRYIDEVTCEVVPLDVCQVILGS 110

Query: 712 TYLWERDAIYY*RAQKYEFMKDGKNFVVRKDRTCQKLDLVTTCQAQ 849
            YLW+RDAI+Y R +KY  +KD K F +   +     +L+T  QA+
Sbjct: 111 PYLWDRDAIHYRRLRKYRLVKDAKEFHINAYKPQAIDNLLTANQAK 156


>ref|XP_007225525.1| hypothetical protein PRUPE_ppa026504mg, partial [Prunus persica]
            gi|462422461|gb|EMJ26724.1| hypothetical protein
            PRUPE_ppa026504mg, partial [Prunus persica]
          Length = 750

 Score = 88.6 bits (218), Expect(2) = 3e-16
 Identities = 64/184 (34%), Positives = 83/184 (45%), Gaps = 3/184 (1%)
 Frame = +1

Query: 481  IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
            I+D GSQKNLIS + V K+GL T  HPK Y LGWIQKD++L+I  QC F+FAIT++YIDE
Sbjct: 22   IIDPGSQKNLISEALVRKVGLNTTLHPKLYPLGWIQKDVDLQITKQCTFKFAITNRYIDE 81

Query: 661  ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVVRKDRTCQKLDLVTTC 840
             T                                                    +L+T  
Sbjct: 82   ET---------------------------------------------------YNLLTAN 90

Query: 841  QAQRMVNVCQKFVLLMVRPLEAEAG---ISATAFSYTGNDGXXXXXXXXXXXXXXXXNGL 1011
            QA+R+VN+C +FVLLM+R     +G   +S  + S T                     GL
Sbjct: 91   QAKRLVNLCGRFVLLMIRSQNQSSGAVTLSTLSLSPT-QCSDIGKLQKKFKDLFHDVQGL 149

Query: 1012 PPRR 1023
            PPRR
Sbjct: 150  PPRR 153



 Score = 24.3 bits (51), Expect(2) = 3e-16
 Identities = 11/21 (52%), Positives = 15/21 (71%)
 Frame = +2

Query: 419 VYEKEELFIGWIQVKQDIIGA 481
           +Y +EELF   IQVKQ ++ A
Sbjct: 1   MYNREELFHVNIQVKQSVVQA 21


>gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sativa Japonica Group]
           gi|15217296|gb|AAK92640.1|AC079634_1 Putative
           retroelement [Oryza sativa Japonica Group]
           gi|31431373|gb|AAP53161.1| retrotransposon protein,
           putative, Ty3-gypsy subclass [Oryza sativa Japonica
           Group]
          Length = 1708

 Score = 72.0 bits (175), Expect = 4e-10
 Identities = 37/105 (35%), Positives = 60/105 (57%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GS  NL SA  V KL L T PHP+PY + W+    ++K+    +  FAI S Y D 
Sbjct: 505 IIDRGSCNNLASAEMVEKLALSTQPHPQPYYIQWLNSSGKVKVTRLVRVHFAIGS-YHDS 563

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
           I C+VVP           + +++D++++ ++ +Y F+ +GK  V+
Sbjct: 564 INCDVVPMQACSIFLGRPWQFDKDSLHFGKSNQYSFVHNGKKLVL 608


>dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]
          Length = 1587

 Score = 71.6 bits (174), Expect = 5e-10
 Identities = 37/105 (35%), Positives = 60/105 (57%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GS  NL SA  V KL L T PHP+PY + W+    ++K+    +  FAI S Y D 
Sbjct: 505 IIDGGSCNNLASAEMVEKLALSTQPHPQPYYIQWLNSSGKVKVTRLVRVHFAIGS-YHDS 563

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
           I C+VVP           + +++D++++ ++ +Y F+ +GK  V+
Sbjct: 564 INCDVVPMQACSMLLGRPWQFDKDSLHFGKSNQYSFVHNGKKLVL 608


>gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japonica Group]
           gi|91795218|gb|ABE60890.1| putative polyprotein [Oryza
           sativa Japonica Group]
          Length = 1616

 Score = 71.6 bits (174), Expect = 5e-10
 Identities = 37/105 (35%), Positives = 60/105 (57%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GS  NL SA  V KL L T PHP+PY + W+    ++K+    +  FAI S Y D 
Sbjct: 505 IIDGGSCNNLASAEMVEKLALSTQPHPQPYYIQWLNSSGKVKVTRLVRVHFAIGS-YHDS 563

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
           I C+VVP           + +++D++++ ++ +Y F+ +GK  V+
Sbjct: 564 INCDVVPMQACSMLLGRPWQFDKDSLHFGKSNQYSFVHNGKKLVL 608


>gb|AAN09859.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 928

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 39/105 (37%), Positives = 58/105 (55%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GS KNL+S+  V KLGL T  HP PY + W+      K+   C+  F+I S Y D 
Sbjct: 506 IIDGGSCKNLLSSDLVKKLGLTTRTHPHPYHIQWLNDSGRAKVTQVCRVLFSIGS-YADS 564

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
           + C+VVP           +  + DA ++ R+ KY F+ +GK F++
Sbjct: 565 VDCDVVPMQACSLLLGCPWEHDNDATHHGRSNKYTFVHNGKKFIL 609


>gb|ABG65972.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
           Japonica Group]
          Length = 1315

 Score = 71.2 bits (173), Expect = 7e-10
 Identities = 39/105 (37%), Positives = 58/105 (55%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GS KNL+S+  V KLGL T  HP PY + W+      K+   C+  F+I S Y D 
Sbjct: 506 IIDGGSCKNLLSSDLVKKLGLTTRTHPHPYHIQWLNDSGRAKVTQVCRVLFSIGS-YADS 564

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
           + C+VVP           +  + DA ++ R+ KY F+ +GK F++
Sbjct: 565 VDCDVVPMQACSLLLGCPWEHDNDATHHGRSNKYTFVHNGKKFIL 609


>ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508716479|gb|EOY08376.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 558

 Score = 70.9 bits (172), Expect = 9e-10
 Identities = 35/105 (33%), Positives = 55/105 (52%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           ++D GS +N+IS   V KL L T  HP PY +GW++K  E+ +  QC  +F +     DE
Sbjct: 345 VIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGDNLDDE 404

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
             C+VVP           +L++ D ++  +   Y F KD K + +
Sbjct: 405 ALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKDNKRYTL 449


>ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508702148|gb|EOX94044.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 546

 Score = 69.3 bits (168), Expect = 3e-09
 Identities = 34/105 (32%), Positives = 56/105 (53%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           ++D GS +N+IS   V KL L T  HP PY +GW++K  E+ +  QC  +F + +   DE
Sbjct: 349 VIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGNNLDDE 408

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
             C+VVP           +L++ D ++  +   Y F K+ K + +
Sbjct: 409 ALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKNNKRYTL 453


>ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao]
           gi|508704828|gb|EOX96724.1| Gag-pol polyprotein,
           putative [Theobroma cacao]
          Length = 794

 Score = 68.9 bits (167), Expect = 3e-09
 Identities = 39/115 (33%), Positives = 64/115 (55%), Gaps = 2/115 (1%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D+GS +N+I+   V KL L+T  HP PY L W++K  E+K+  +C  +F+I ++Y DE
Sbjct: 393 IIDSGSCENVIANYMVKKLKLQTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDE 452

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVVR--KDRTCQK 819
           + C+V+P           + ++R A +      Y F+KDG   ++   K   C K
Sbjct: 453 VWCDVIPMDACHLLLGRPWQYDRRAHHDGYKNTYSFIKDGAKIMLTPLKPEDCPK 507


>ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508703673|gb|EOX95569.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 1452

 Score = 68.6 bits (166), Expect = 5e-09
 Identities = 34/105 (32%), Positives = 55/105 (52%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           ++D GS +N+IS   V KL L T  HP PY +GW++K  E+ +  QC  +F +     DE
Sbjct: 340 VIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGDNSDDE 399

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
             C+VVP           +L++ D ++  +   Y F K+ K + +
Sbjct: 400 ALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKNNKRYTL 444


>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score = 68.2 bits (165), Expect = 6e-09
 Identities = 34/96 (35%), Positives = 53/96 (55%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D GS  N+ S++ + KL L T  HP PY L W+ K  E+++D QC   F+I   Y DE
Sbjct: 403 IIDGGSCTNVASSTLIEKLSLPTQDHPSPYKLRWLNKGAEVRVDKQCLVTFSIGKNYSDE 462

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEF 768
             C+V+P           + ++RD++++ R   Y F
Sbjct: 463 ALCDVLPMDACHLLLGRPWEFDRDSVHHGRDNTYTF 498


>ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
           gi|508726763|gb|EOY18660.1| Uncharacterized protein
           TCM_043155 [Theobroma cacao]
          Length = 625

 Score = 67.8 bits (164), Expect = 8e-09
 Identities = 36/100 (36%), Positives = 57/100 (57%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D+GS +N+++   V KL L T  HP PY L W++K  E+K+  +C  +F I ++Y DE
Sbjct: 362 IIDSGSCENVVANYMVEKLKLPTEVHPHPYKLQWLRKGNEVKVTKRCCIQFFIRNKYEDE 421

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDG 780
           + C+V+P           + ++R A Y      Y F+KDG
Sbjct: 422 VWCDVIPMDACHLLLGRPWQYDRRAHYDGYKNTYSFIKDG 461


>ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
           gi|508718388|gb|EOY10285.1| Uncharacterized protein
           TCM_025656 [Theobroma cacao]
          Length = 505

 Score = 67.8 bits (164), Expect = 8e-09
 Identities = 35/105 (33%), Positives = 61/105 (58%)
 Frame = +1

Query: 481 IVDTGSQKNLISASQV*KLGLETMPHPKPYLLGWIQKDMELKIDCQCKFRFAITSQYIDE 660
           I+D+GS +N+I+   V KL L+T  HP PY L W++K  E+K+  +C  +F+I ++Y DE
Sbjct: 242 IIDSGSCENVIANYMVEKLKLQTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDE 301

Query: 661 ITCEVVPP*YMQGDF*STYLWERDAIYY*RAQKYEFMKDGKNFVV 795
           + C+++P           + ++R A +      Y F+KDG   ++
Sbjct: 302 VWCDIIPMDACHLLLGRPWQYDRRAHHDGYKNTYSFIKDGAKIML 346

