BLASTX nr result

ID: Mentha25_contig00017040 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00017040
         (1708 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_001591635.1| hypothetical protein SS1G_07081 [Sclerotinia...   113   9e-29
ref|XP_001598842.1| hypothetical protein SS1G_00931 [Sclerotinia...   112   2e-28
ref|XP_004912460.1| PREDICTED: retrotransposon-derived protein P...    84   2e-26
emb|CBN81178.1| Pol polyprotein [Dicentrarchus labrax]                 81   2e-26
gb|EKD04365.1| retrotransposon nucleocapsid protein [Trichosporo...   125   6e-26
gb|EKD00111.1| retrotransposon nucleocapsid protein [Trichosporo...   124   1e-25
gb|EXJ86300.1| hypothetical protein A1O1_06670 [Capronia coronat...   108   2e-25
emb|CCG85041.1| protein of unknown function [Taphrina deformans ...   111   2e-25
emb|CCG85028.1| protein of unknown function [Taphrina deformans ...    94   3e-24
emb|CCG85107.1| protein of unknown function [Taphrina deformans ...    96   3e-24
gb|EKG11343.1| Retrotransposon gag protein [Macrophomina phaseol...   117   2e-23
gb|AAR29046.2| gag-pol polyprotein [Aspergillus flavus]                99   4e-23
ref|XP_007431817.1| PREDICTED: retrotransposon-derived protein P...    98   1e-22
gb|EKG20520.1| Retrotransposon gag protein [Macrophomina phaseol...   113   2e-22
gb|EKG15822.1| Retrotransposon gag protein [Macrophomina phaseol...   113   2e-22
emb|CCG84995.1| protein of unknown function [Taphrina deformans ...    91   9e-22
gb|AAH87517.1| LOC496091 protein, partial [Xenopus laevis]            110   2e-21
emb|CCG85123.1| protein of unknown function [Taphrina deformans ...    95   3e-21
ref|XP_001818504.2| gag-pol polyprotein [Aspergillus oryzae RIB40]     96   5e-21
ref|XP_003189096.1| gag-pol polyprotein [Aspergillus oryzae RIB40]     96   5e-21

>ref|XP_001591635.1| hypothetical protein SS1G_07081 [Sclerotinia sclerotiorum 1980]
            gi|154704859|gb|EDO04598.1| hypothetical protein
            SS1G_07081 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1056

 Score =  113 bits (282), Expect(2) = 9e-29
 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 15/188 (7%)
 Frame = +2

Query: 524  PDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITG 703
            PD F GDRSK++ FVAQ  LYY  +   F T   +I +  SF RG+AF W+EPF + +  
Sbjct: 37   PDLFHGDRSKHRAFVAQADLYYAFNGHLFLTQMQRILWLISFFRGTAFNWIEPFFNDLMT 96

Query: 704  DVS---------------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYS 838
              +               F+ Y  F  G    F + D    AER +  L Q GS ++Y +
Sbjct: 97   KTTDGQLNENMKPETRRLFNTYESFRQGFDRAFGEVDPDHMAERALRQLKQTGSVTAYTA 156

Query: 839  QFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARR 1018
            +F     ++ W +D ++  ++  GLKD IKD L     P+ +SE     I +DN+   RR
Sbjct: 157  KFQQYAGRITWDDDYLRSQFY-EGLKDIIKDELARAPKPSNLSELIETSILIDNRFYERR 215

Query: 1019 CEKKKSTR 1042
             EKK  T+
Sbjct: 216  MEKKGITQ 223



 Score = 42.7 bits (99), Expect(2) = 9e-29
 Identities = 22/46 (47%), Positives = 23/46 (50%), Gaps = 11/46 (23%)
 Frame = +3

Query: 1158 DPMELDAVSR-----------KSYRRANNLCTYCGASGHWVRDCEK 1262
            DPMELD   R           K  RR NNLC  CG SGH  RDC +
Sbjct: 242  DPMELDGAERHQKPNGLSVEEKKRRRENNLCFTCGKSGHMSRDCSQ 287


>ref|XP_001598842.1| hypothetical protein SS1G_00931 [Sclerotinia sclerotiorum 1980]
            gi|154691790|gb|EDN91528.1| hypothetical protein
            SS1G_00931 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 433

 Score =  112 bits (280), Expect(2) = 2e-28
 Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 15/188 (7%)
 Frame = +2

Query: 524  PDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITG 703
            PD F GDRSK++ FVAQ  LYY  +   F T   +I +  SF RG+AF W+EPF + +  
Sbjct: 37   PDLFHGDRSKHRAFVAQADLYYAFNGHLFLTQMQRILWLISFFRGTAFNWIEPFFNDLMT 96

Query: 704  DVS---------------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYS 838
              +               F+ Y  F  G    F + D    AER +  L Q GS ++Y +
Sbjct: 97   KTTDGQLNENMKPETRRLFNTYESFRQGFDRAFGEVDPDHMAERALRQLKQTGSVTAYTA 156

Query: 839  QFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARR 1018
            +F     ++ W +D ++  ++  GLKD IKD L     P  +SE     I +DN+   RR
Sbjct: 157  KFQQYAGRITWDDDYLRSQFY-EGLKDIIKDELARAPKPPNLSELIETSILIDNRFYERR 215

Query: 1019 CEKKKSTR 1042
             EKK  T+
Sbjct: 216  MEKKGITQ 223



 Score = 42.7 bits (99), Expect(2) = 2e-28
 Identities = 22/46 (47%), Positives = 23/46 (50%), Gaps = 11/46 (23%)
 Frame = +3

Query: 1158 DPMELDAVSR-----------KSYRRANNLCTYCGASGHWVRDCEK 1262
            DPMELD   R           K  RR NNLC  CG SGH  RDC +
Sbjct: 242  DPMELDGAERHQKPNGLSVEEKKRRRENNLCFTCGKSGHMSRDCSQ 287


>ref|XP_004912460.1| PREDICTED: retrotransposon-derived protein PEG10-like [Xenopus
            (Silurana) tropicalis]
          Length = 566

 Score = 84.3 bits (207), Expect(3) = 2e-26
 Identities = 51/174 (29%), Positives = 81/174 (46%), Gaps = 1/174 (0%)
 Frame = +2

Query: 509  PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688
            P    P++F GDR  ++TF     L +   P+ +ST++ K+    S L G    W    +
Sbjct: 67   PNVAMPEKFSGDRKTFRTFTNACKLLFTLKPRMYSTEQIKVGVIISLLLGEPQSWAFHLM 126

Query: 689  DQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGS-CSSYYSQFVALIAQL 865
            +  T   S      F   L   + DP + A AE  + NL Q+      Y  +F    + +
Sbjct: 127  E--TRSTSLLTVDSFFQALAVLYDDPHRTAAAEASLRNLRQRSRPVEDYTVEFRKYASDV 184

Query: 866  GWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027
             W + ++K H FR GL DS+KD L     P ++ E   L I++D ++  R+ EK
Sbjct: 185  DWNQAALK-HQFRLGLSDSLKDELARVGVPASLDEIIHLSIQIDRRLRERKLEK 237



 Score = 47.8 bits (112), Expect(3) = 2e-26
 Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 2/92 (2%)
 Frame = +1

Query: 1438 AMIDSGATSLFVDAEFLXXXXXXXXXXXYPETLRVVDGRESCEGAIKHE-IELDIWLGD- 1611
            A++DSGA+  F+D++             +P  LRV DG     G I  E   L + L   
Sbjct: 345  AILDSGASGCFLDSQVAQIHRIPLKKKQFPVFLRVADGSPINSGPILLESTPLSLTLNKI 404

Query: 1612 HKERTLFQVTKLAEYPLILGKAWLDRHNPDID 1707
            H E   F +      P+I+G  WL RHNP I+
Sbjct: 405  HHEHLSFDIVSSPLSPVIIGLPWLRRHNPVIN 436



 Score = 36.2 bits (82), Expect(3) = 2e-26
 Identities = 29/102 (28%), Positives = 42/102 (41%), Gaps = 25/102 (24%)
 Frame = +3

Query: 1119 PDTPQPMTNLPVD------DPMELDAV------SRKSYRRANNLCTYCGASGHWVRDC-- 1256
            P  P P+ + P        +PM++ A+        +  RR  NLC YCG SGH +R C  
Sbjct: 248  PKAPPPIRSEPGPNSSDEMEPMQIGALRPALSPEERLRRRRLNLCLYCGLSGHVLRSCPT 307

Query: 1257 ----------EKLNSRDTRVAAAALSTD-SEKDLTLPALYQS 1349
                      + L  R   +   ++S     K L LPA+  S
Sbjct: 308  RPCKRSTYKTDTLKYRFAPLLTLSISLQLGNKTLNLPAILDS 349


>emb|CBN81178.1| Pol polyprotein [Dicentrarchus labrax]
          Length = 1618

 Score = 80.9 bits (198), Expect(3) = 2e-26
 Identities = 51/175 (29%), Positives = 89/175 (50%), Gaps = 2/175 (1%)
 Frame = +2

Query: 509  PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688
            P    P+ + GD      F+ Q +L +   P  +S+D +++ F  S L G A +W     
Sbjct: 229  PHVPTPERYAGDLGACGRFLLQCSLVFQQQPLTYSSDSTRVAFVISLLSGKAAQWATALW 288

Query: 689  DQITGDV-SFHKYSDFLAGLMAGFADPDQYATAEREIENLIQ-KGSCSSYYSQFVALIAQ 862
            ++ +    +F ++SD L  +   F  P +   A + + NL Q  GS + +  +F  L A+
Sbjct: 289  EKHSPICETFQRFSDELRKV---FDHPVRGREAAKRLLNLRQGSGSVAEFSVEFRVLAAE 345

Query: 863  LGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027
             GW E++++   F  GL + +KD L  +D   ++ E  +L I+LDN++  RR EK
Sbjct: 346  SGWDEEALQT-VFVHGLSEVMKDELAARDSAASLDELISLAIRLDNRLRERRREK 399



 Score = 43.9 bits (102), Expect(3) = 2e-26
 Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 1/91 (1%)
 Frame = +1

Query: 1438 AMIDSGATSLFVDAEFLXXXXXXXXXXXYPETLRVVDGRESCEGAIKHEIELDIWL-GDH 1614
            A++DSGA   F+D+ F+             + +  +DG+       +  + L + L G+H
Sbjct: 522  ALVDSGAEESFIDSAFVLQANIPTIKLPDNQPVNALDGKHLAN-ITRQTVPLTLILSGNH 580

Query: 1615 KERTLFQVTKLAEYPLILGKAWLDRHNPDID 1707
            +E     V      P++LG  WL  HNP  D
Sbjct: 581  REEISLLVISSPNTPVVLGYPWLKLHNPQFD 611



 Score = 43.1 bits (100), Expect(3) = 2e-26
 Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 4/87 (4%)
 Frame = +3

Query: 1149 PVDDPMELDAV----SRKSYRRANNLCTYCGASGHWVRDCEKLNSRDTRVAAAALSTDSE 1316
            PV++PM+L       + +  R  + LC YCG  GH++  C +L  ++T    A +S    
Sbjct: 437  PVEEPMQLGRTRLTQAERQRRMRSGLCIYCGQHGHFLAACPQLPKKETENGVAPVSQVPA 496

Query: 1317 KDLTLPALYQSKN*LH*RTHGYPLVTL 1397
                  +  Q K  LH + +  PL  L
Sbjct: 497  SSSAPLSRLQLKASLHWQLNSIPLTAL 523


>gb|EKD04365.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. asahii
            CBS 8904]
          Length = 1687

 Score =  125 bits (314), Expect = 6e-26
 Identities = 60/188 (31%), Positives = 105/188 (55%)
 Frame = +2

Query: 509  PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688
            P+   P+ F G R+K  TF+ Q+ +  G  P +F T+ SK+ +A S+LR +AF W +P++
Sbjct: 60   PKVSSPEYFSGQRNKVTTFITQVRMVIGLQPSRFPTENSKVLYAGSYLRDTAFLWFQPYV 119

Query: 689  DQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLG 868
                     + ++ F   L + F DPD+ ATAER++ NL Q+GS S+Y + F    A + 
Sbjct: 120  ASEKQPDWLNDFNLFCKELRSMFGDPDEVATAERQLYNLRQRGSASAYVADFTRYAALVN 179

Query: 869  WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTRII 1048
            W ++++   ++ RGLKD+IKD L   D P  +  +  + +++D ++  R  E+ +S    
Sbjct: 180  WNDEALCAQFY-RGLKDAIKDELARTDKPKDLKTYKDIAVRIDTRLFERHLERDRSKTFT 238

Query: 1049 KAIANFSS 1072
                 F++
Sbjct: 239  TTTTTFNN 246


>gb|EKD00111.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. asahii
            CBS 8904]
          Length = 1662

 Score =  124 bits (312), Expect = 1e-25
 Identities = 61/178 (34%), Positives = 101/178 (56%)
 Frame = +2

Query: 509  PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688
            P+   P+ F G R+K  TF+ Q+ +  G  P +F T+ SK+ +A SFL  +AF WL+P++
Sbjct: 58   PKVSSPEYFSGQRNKVTTFITQVRMVIGLQPSRFPTENSKVLYAGSFLCDTAFLWLQPYV 117

Query: 689  DQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLG 868
                     + ++ F   L + F DPD+ ATAER++ NL Q+GS S+Y + F    A + 
Sbjct: 118  ASDHPPAWLNDFNLFCKELRSMFGDPDEVATAERQLYNLRQRGSASAYVADFTRFAAVVN 177

Query: 869  WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042
            W ++++   ++ RGLKD IKD L   D P  +  +    +++D ++  R  EK +S +
Sbjct: 178  WNDEALCAQFY-RGLKDPIKDELARTDKPKDLKAYKETAVRIDTRLFERHNEKDRSVK 234


>gb|EXJ86300.1| hypothetical protein A1O1_06670 [Capronia coronata CBS 617.96]
          Length = 799

 Score =  108 bits (269), Expect(2) = 2e-25
 Identities = 63/185 (34%), Positives = 100/185 (54%), Gaps = 12/185 (6%)
 Frame = +2

Query: 512  EGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEP--- 682
            E  + D F+GD++K K F+ QL   +   P K+    S++ FAA  L+G+AF W EP   
Sbjct: 9    EEVKVDYFYGDKAKLKMFLVQLKAIFKLYPAKYPNPSSQVLFAALNLKGAAFAWFEPTMT 68

Query: 683  ---------FIDQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYY 835
                      +DQ T  + FH +++F   +   F   D+ ATAER + ++ Q+GS + YY
Sbjct: 69   DYLEANESSSLDQETRMI-FHSFANFEIKIKQVFGVADEEATAERMLHDVKQRGSTAQYY 127

Query: 836  SQFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEAR 1015
            + F  L  ++ W ED++   Y+ RGL D +KD +   + P T  +     I++DN++  R
Sbjct: 128  ALFKQLAKRVSWNEDALAAAYY-RGLSDQVKDRM--DEVPDTYKDMVDKSIEIDNRLYER 184

Query: 1016 RCEKK 1030
            R EKK
Sbjct: 185  RMEKK 189



 Score = 36.2 bits (82), Expect(2) = 2e-25
 Identities = 26/89 (29%), Positives = 36/89 (40%), Gaps = 28/89 (31%)
 Frame = +3

Query: 1140 TNLPVDDPMELDAVSR---------------------KSYRRANNLCTYCGASGHWVRDC 1256
            TN    DPM+LDA+ R                     +  R+  NLC  CG SGH  ++C
Sbjct: 205  TNYSYGDPMDLDAMERGRSSRPKGQRFGGFRSNGNKEREKRKKENLCYNCGKSGHRAKEC 264

Query: 1257 ----EKLNSRDTRVAAAALSTDS---EKD 1322
                ++L+  D      A   D+   EKD
Sbjct: 265  HAKAQQLHMMDDSAGIEAKKADTSMKEKD 293


>emb|CCG85041.1| protein of unknown function [Taphrina deformans PYCC 5710]
          Length = 309

 Score =  111 bits (277), Expect(2) = 2e-25
 Identities = 59/175 (33%), Positives = 104/175 (59%), Gaps = 2/175 (1%)
 Frame = +2

Query: 524  PDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITG 703
            PDEF G R K +TF+ Q+ L + ++P  FSTD  +   A S+LRG A++W+ P   ++  
Sbjct: 65   PDEFHGTRKKLETFLFQMELKFEAEPDVFSTDHRRTICAISYLRGEAYEWVIP-AQRLGL 123

Query: 704  DVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDS 883
            +  F  Y+ F   L+  F +P++    +R+I  L Q GSC++Y   F++L  +LGW +++
Sbjct: 124  EALFPTYTVFHESLVRAFGNPNELDNYKRKIRLLRQHGSCANYTRVFMSLCTRLGWNQEA 183

Query: 884  VKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK--KKSTR 1042
            ++  Y R+GL D++KD ++  +   ++ +     +  D ++EAR+ E+  KKS R
Sbjct: 184  LRSQY-RQGLSDAVKDQMIHVNTGVSLQDMIDQALLNDGRLEARQIERQNKKSLR 237



 Score = 33.1 bits (74), Expect(2) = 2e-25
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 4/43 (9%)
 Frame = +3

Query: 1161 PMELDAV----SRKSYRRANNLCTYCGASGHWVRDCEKLNSRD 1277
            PME+DAV    S+K  ++    C  C  +GH  RDC +  S++
Sbjct: 267  PMEIDAVEATTSKKQEQQRLGKCFTCNKTGHLARDCPEKRSKN 309


>emb|CCG85028.1| protein of unknown function [Taphrina deformans PYCC 5710]
          Length = 334

 Score = 94.4 bits (233), Expect(2) = 3e-24
 Identities = 73/229 (31%), Positives = 113/229 (49%), Gaps = 4/229 (1%)
 Frame = +2

Query: 368  ARIHPQIE--GPKITSSACENQQLVLATYFSLGMEVSSDSKPQIVSQAMPEGKRPDEFFG 541
            AR+  Q++  GP  T S     +L LA+  S     +    P        + K P+ F G
Sbjct: 27   ARLQHQVDSQGPATTPSQGVTPEL-LASLASAFTAAAPPRSPGYSGDNSLKLKEPEVFNG 85

Query: 542  DRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITGDV-SFH 718
             R   + F+A L L + ++ ++F  D SKI +A S LRG AF+ ++   +QI  DV +  
Sbjct: 86   GRKDLERFLAALLLKFSAERKRFPDDHSKITYAMSLLRGDAFEIVQ---NQIVQDVDNLG 142

Query: 719  KYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHY 898
             + DF + L   F DPD   TA  E+ENL Q+GS   Y ++F  L   +    D  K+ +
Sbjct: 143  SFQDFRSSLERAFGDPDSAQTAMLELENLRQRGSIVKYNAEFHRL-ENILHLNDIAKLAF 201

Query: 899  FRRGLKDSIKDNLVGK-DCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042
            +RRG+ D IK+ L  K +   T   F     +LD  +  R+ +++ S R
Sbjct: 202  YRRGVSDHIKNILSEKLEKYDTFEAFEKAVTQLDANLYVRKQDQRYSVR 250



 Score = 46.6 bits (109), Expect(2) = 3e-24
 Identities = 26/76 (34%), Positives = 34/76 (44%), Gaps = 12/76 (15%)
 Frame = +3

Query: 1089 HSQKSVENHAPDTPQPMTNLPVDDPMELDAV------------SRKSYRRANNLCTYCGA 1232
            H     +N  P  P  +T      PME+DAV              K YRR N+LC+YCG 
Sbjct: 256  HHHGRGDNTGPRGPAVITE-QTHTPMEIDAVISTPARRGPLSDKEKKYRRDNDLCSYCGG 314

Query: 1233 SGHWVRDCEKLNSRDT 1280
            SGH+   C +   + T
Sbjct: 315  SGHYANSCPEKKKKFT 330


>emb|CCG85107.1| protein of unknown function [Taphrina deformans PYCC 5710]
          Length = 334

 Score = 95.9 bits (237), Expect(2) = 3e-24
 Identities = 72/229 (31%), Positives = 115/229 (50%), Gaps = 4/229 (1%)
 Frame = +2

Query: 368  ARIHPQIE--GPKITSSACENQQLVLATYFSLGMEVSSDSKPQIVSQAMPEGKRPDEFFG 541
            AR+  Q++  GP +T +     +L LA+  S     +    P        + K P+ F G
Sbjct: 27   ARLQHQVDSQGPAVTPAQGVTPEL-LASLASAFTAAAPPRSPGYSGDNSLKLKEPEVFNG 85

Query: 542  DRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITGDV-SFH 718
             R   + F+A L L + ++ ++F  D SKI +A S LRG AF+ ++   +QI  DV +  
Sbjct: 86   GRKDLERFLAALLLKFSAERKRFPDDHSKITYAMSLLRGDAFEIVQ---NQIVQDVDNLG 142

Query: 719  KYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHY 898
             + DF + L   F DPD   TA  E+ENL Q+GS   Y ++F  L   +    D  K+ +
Sbjct: 143  SFQDFRSSLERAFGDPDSAQTAMLELENLRQRGSIVKYNAEFHRL-ENILHLNDIAKLAF 201

Query: 899  FRRGLKDSIKDNLVGK-DCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042
            +RRG+ D IK+ L+ K +   T   F     +LD  +  R+ +++ S R
Sbjct: 202  YRRGVSDHIKNILLEKLEKYDTFEAFEKAVTQLDANLYVRKQDQRYSVR 250



 Score = 44.7 bits (104), Expect(2) = 3e-24
 Identities = 24/68 (35%), Positives = 31/68 (45%), Gaps = 12/68 (17%)
 Frame = +3

Query: 1089 HSQKSVENHAPDTPQPMTNLPVDDPMELDAV------------SRKSYRRANNLCTYCGA 1232
            H     +N  P  P  +T      PM++DAV              K YRR N+LC+YCG 
Sbjct: 256  HHHGRGDNTGPRGPA-VTTKQTHTPMDIDAVISTPAHRGPLSDKEKKYRRDNDLCSYCGG 314

Query: 1233 SGHWVRDC 1256
            SGH+   C
Sbjct: 315  SGHYANSC 322


>gb|EKG11343.1| Retrotransposon gag protein [Macrophomina phaseolina MS6]
          Length = 634

 Score =  117 bits (292), Expect = 2e-23
 Identities = 65/182 (35%), Positives = 105/182 (57%), Gaps = 11/182 (6%)
 Frame = +2

Query: 521  RPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI-DQI 697
            +PD F+G+R K +TF++QL LY+  + + F TD  K+ FAA++LR +A +W EP++ D++
Sbjct: 87   KPDLFYGNRKKLQTFLSQLDLYFFFNSRDFPTDDKKVMFAATYLRDTAAQWFEPYLRDRM 146

Query: 698  TGDVS---------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVA 850
              +           F  Y  F+  +   F D D+   A R + N+ QK S + Y ++F  
Sbjct: 147  EKEPEARKEDTKKVFGSYKHFVTQIKQSFGDLDEVNKARRAVMNIHQKTSVADYTTEFQK 206

Query: 851  LIAQLG-WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027
              A L  W++ ++  HY+R GLK+ +KD L+ +D P T+ E   L IK DN++  R+ EK
Sbjct: 207  AAAYLDDWSDRALMDHYYR-GLKERVKDQLMTQDDPKTLDELIKLAIKCDNRLFERQSEK 265

Query: 1028 KK 1033
             K
Sbjct: 266  YK 267


>gb|AAR29046.2| gag-pol polyprotein [Aspergillus flavus]
          Length = 1998

 Score = 98.6 bits (244), Expect(2) = 4e-23
 Identities = 74/273 (27%), Positives = 127/273 (46%), Gaps = 20/273 (7%)
 Frame = +2

Query: 272  KEQPITPTAPIP----DSTPVKIQL-----LCTQAYSRGKQARIHPQIEGPKITSSACEN 424
            K+ P+  T P        T VK QL     + TQ  +  K+   + +IE  K+     E 
Sbjct: 9    KKTPVKSTPPAETDSESETTVKEQLKQMKSMITQLVNNAKEK--NQEIENLKVQLGEAER 66

Query: 425  QQLVLATYFS-LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDP 601
             +     + + L  +V + +    + +       P  F G RSK + F+ Q+ ++  ++ 
Sbjct: 67   IRNEQQDHIAQLDAQVGASAPKDAIGKVKLPKAEP--FDGTRSKLQAFLTQMNMHIHANR 124

Query: 602  QKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI----------TGDVSFHKYSDFLAGLMA 751
            +    +  K+ F ++ LRG+A+ W EP+I +           T    F    +    L  
Sbjct: 125  KNLIDEADKVIFISTHLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLER 184

Query: 752  GFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKD 931
             F D D  A AER++++L Q+GS S+Y ++F  +I+++ W E  V +  F  GLKD +KD
Sbjct: 185  TFGDVDAEAVAERKLKHLYQRGSASTYAAEFQQIISRMDWNE-KVYVSTFISGLKDHVKD 243

Query: 932  NLVGKDCPTTISEFAALCIKLDNQIEARRCEKK 1030
                 D P T++E     +K+DN+   R  EK+
Sbjct: 244  EFARIDRPATLNEAIDFAVKVDNRYHERLMEKR 276



 Score = 38.5 bits (88), Expect(2) = 4e-23
 Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 8/78 (10%)
 Frame = +3

Query: 1065 SHRP--QTMSHSQKS---VENHAPDTPQPMTNLPVDDPMELDAVSRKSY--RRANNLCTY 1223
            SHRP  Q  S+ Q+    V+++ P  P+PM     +   +   +S+K    RR   LC  
Sbjct: 285  SHRPKGQYKSNDQRERTGVKHNDPYGPKPMELDATEGQGQSKGISQKERERRRREKLCYN 344

Query: 1224 CGASGHWVRDC-EKLNSR 1274
            CG +GH  +DC +K NS+
Sbjct: 345  CGRAGHMSKDCRQKRNSQ 362


>ref|XP_007431817.1| PREDICTED: retrotransposon-derived protein PEG10-like [Python
            bivittatus]
          Length = 495

 Score = 97.8 bits (242), Expect(2) = 1e-22
 Identities = 60/194 (30%), Positives = 96/194 (49%), Gaps = 1/194 (0%)
 Frame = +2

Query: 455  LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIR 634
            +G+ V     P         G  P+++ G+  + +TF+AQ  L+    P +F TD++++ 
Sbjct: 44   IGLGVQPPPPPPPPVLLCSPGSMPEKYGGEVEQMRTFLAQCELFLDGRPGEFPTDQTRVA 103

Query: 635  FAASFLRGSAFKWLEPFIDQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQK 814
            F  S L+GSA KW  PFI+  T D   + Y +F+      F DP +  TA R+I  L Q 
Sbjct: 104  FVMSLLKGSAAKWATPFIE--TRDPMLNNYQNFVTAFRGHFGDPVRRLTACRDIRKLKQG 161

Query: 815  GS-CSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIK 991
            G     + + F +L+  + W E ++ I  FR GL   ++  +V +  P T+     LCI 
Sbjct: 162  GKPVRLFIADFKSLVGDVEWNEIAL-IDQFREGLDPELRSEMVKQGIPNTLDGLYQLCI- 219

Query: 992  LDNQIEARRCEKKK 1033
                +EAR  E K+
Sbjct: 220  ---MVEARLMELKQ 230



 Score = 37.4 bits (85), Expect(2) = 1e-22
 Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 17/89 (19%)
 Frame = +3

Query: 1125 TPQPM-----------TNLPVDDPMELDAVSR------KSYRRANNLCTYCGASGHWVRD 1253
            TPQP+           T     +PM+L A  R      +  RR  NLC YCG  GH +  
Sbjct: 239  TPQPLLATIPSPRAVTTEFTAGEPMQLGAAQRTMSGEERQRRRDLNLCFYCGTPGHMI-- 296

Query: 1254 CEKLNSRDTRVAAAALSTDSEKDLTLPAL 1340
              KLN  D+    + +   + +   LP +
Sbjct: 297  --KLNLIDSGTTMSFIDVQTVQKWQLPTV 323


>gb|EKG20520.1| Retrotransposon gag protein [Macrophomina phaseolina MS6]
          Length = 296

 Score =  113 bits (283), Expect = 2e-22
 Identities = 63/182 (34%), Positives = 104/182 (57%), Gaps = 11/182 (6%)
 Frame = +2

Query: 521  RPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI-DQI 697
            +PD F+G+R K +TF++QL LY+  + + F TD  ++ FAA++LR +A +W EP++ D++
Sbjct: 72   KPDLFYGNRKKLQTFLSQLDLYFFFNSRDFPTDDKRVMFAATYLRDTAAQWFEPYLRDRM 131

Query: 698  TGDVS---------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVA 850
              +           F  Y  F+  +     D D+   A R + N+ QK S + Y ++F  
Sbjct: 132  EKEPEARKENTKKVFGSYKHFVTQIKQSSGDLDEVNKARRAVMNIHQKTSVADYTTEFQK 191

Query: 851  LIAQLG-WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027
              A L  W++ ++  HY+R GLK+ +KD L+ +D P T+ E   L IK DN++  R+ EK
Sbjct: 192  AAAYLDDWSDRALMDHYYR-GLKERVKDQLITQDDPKTLDELIKLAIKCDNRLFKRQSEK 250

Query: 1028 KK 1033
             K
Sbjct: 251  YK 252


>gb|EKG15822.1| Retrotransposon gag protein [Macrophomina phaseolina MS6]
          Length = 296

 Score =  113 bits (283), Expect = 2e-22
 Identities = 64/182 (35%), Positives = 104/182 (57%), Gaps = 11/182 (6%)
 Frame = +2

Query: 521  RPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI-DQI 697
            +PD F+G+R K +TF++QL LY+  + + F TD  +I FAA++LR +  +W EP++ D+I
Sbjct: 72   KPDLFYGNRKKLQTFLSQLDLYFFFNSRDFPTDDKRIIFAATYLRDTTAQWFEPYLRDRI 131

Query: 698  TGDVS---------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVA 850
              +           F  Y  F+  +   F D D+   A R + N+ QK S + Y ++F  
Sbjct: 132  EKEPEARKKDTKKVFSSYKHFVTQIKQSFGDLDEVNKARRAVINIHQKTSVADYTTEFQK 191

Query: 851  LIAQLG-WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027
              A L  W++ ++  HY+R GLK+ +KD L+ +D P T+ +   L IK DN++  R+ EK
Sbjct: 192  AAAYLDDWSDRALMDHYYR-GLKERVKDQLMTQDDPKTLDKLIKLAIKCDNRLFKRQSEK 250

Query: 1028 KK 1033
             K
Sbjct: 251  YK 252


>emb|CCG84995.1| protein of unknown function [Taphrina deformans PYCC 5710]
          Length = 303

 Score = 91.3 bits (225), Expect(2) = 9e-22
 Identities = 56/177 (31%), Positives = 91/177 (51%), Gaps = 2/177 (1%)
 Frame = +2

Query: 518  KRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI 697
            K P  F G+ ++ +TFV+ L L + ++   F T+  K+ +A S LR  A + LEP+++  
Sbjct: 44   KDPAFFTGNPAELRTFVSGLQLKFYAEAISFDTEAKKVSYACSLLRDGAAQVLEPYLNNF 103

Query: 698  TGDV-SFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGS-CSSYYSQFVALIAQLGW 871
               + S   + DF   L   F DPD+  T ER++  L Q     + Y +QF+ L A LGW
Sbjct: 104  AQYMDSISSFEDFAKLLQTSFGDPDEKKTFERDLYRLFQNSDPVTVYTAQFLRLSAPLGW 163

Query: 872  TEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042
              ++++  Y   GL   +KD L  +  P   +E   +  K++ +  AR  E+K S R
Sbjct: 164  NNEALESRYL-YGLSKRVKDELTRRAPPRGRAELMQMASKINARFRARDLERKDSHR 219



 Score = 41.2 bits (95), Expect(2) = 9e-22
 Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 7/76 (9%)
 Frame = +3

Query: 1071 RPQTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSR-------KSYRRANNLCTYCG 1229
            R  T+  +Q+ V    P     + N  +  PM+LD   R       K  R  NNLC YCG
Sbjct: 227  RNPTVPSTQREVVAGNPRPSTSVNNRTI--PMDLDGTKRGPLSDAEKKRRYNNNLCLYCG 284

Query: 1230 ASGHWVRDCEKLNSRD 1277
             +GH + +C+  N ++
Sbjct: 285  QAGHQIDECKLRNRKN 300


>gb|AAH87517.1| LOC496091 protein, partial [Xenopus laevis]
          Length = 225

 Score =  110 bits (275), Expect = 2e-21
 Identities = 65/189 (34%), Positives = 100/189 (52%), Gaps = 2/189 (1%)
 Frame = +2

Query: 467  VSSDSKPQI-VSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAA 643
            V+   +PQ+  + A  +   PD+F GDR  ++ FV Q  L +   P KF  D  K+ +  
Sbjct: 33   VAQTQQPQVGATSAAIKMPVPDKFSGDRKMFRGFVNQCKLLFMLQPNKFQDDTLKVGWIL 92

Query: 644  SFLRGSAFKWLEPFIDQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQ-KGS 820
            + L G A  W  P I+Q +  +S   ++ FLA +   F DP++ ATAE  +  L Q   S
Sbjct: 93   TLLSGEALAWASPLIEQQSPLLS--NFNGFLAAMSVIFDDPNKIATAETTLLTLTQGSRS 150

Query: 821  CSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDN 1000
             + Y + F   +    W E + + H FRRGL +++KD L   D P  +S F  LCIK+D+
Sbjct: 151  VAEYAATFRRWVLDTSWNEAAQRFH-FRRGLSEAMKDELARVDAPDNLSSFVQLCIKIDS 209

Query: 1001 QIEARRCEK 1027
            ++  RR E+
Sbjct: 210  RLSERRKER 218


>emb|CCG85123.1| protein of unknown function [Taphrina deformans PYCC 5710]
          Length = 292

 Score = 95.1 bits (235), Expect(2) = 3e-21
 Identities = 56/177 (31%), Positives = 93/177 (52%), Gaps = 2/177 (1%)
 Frame = +2

Query: 518  KRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI 697
            K P  F GD ++ +TFV+ L L + ++   F T+  K+ +A S LR  A + LEP+++  
Sbjct: 44   KDPAFFTGDPAELRTFVSGLQLKFYAEAISFDTEAKKVSYACSLLRDGAAQVLEPYLNNF 103

Query: 698  TGDV-SFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGS-CSSYYSQFVALIAQLGW 871
               + S   + DF   L   F DPD+  T ER++  L Q     + Y +QF+ L A LGW
Sbjct: 104  AQYMDSISSFKDFAKLLQTSFGDPDEKKTFERDLYRLFQNSDPVTVYTAQFLRLSAPLGW 163

Query: 872  TEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042
             +++++  Y   GL + +KD L  +  P   +E   +  ++D +  AR  E++ S R
Sbjct: 164  NDEALESRYL-YGLSERVKDELTRRAPPRNRAELMQMASEIDARFRARDLERRDSHR 219



 Score = 35.8 bits (81), Expect(2) = 3e-21
 Identities = 22/64 (34%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
 Frame = +3

Query: 1071 RPQTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSR-------KSYRRANNLCTYCG 1229
            R  T S +++ V    P    P+ N  +  PM++D   R       K  R  NNLC YCG
Sbjct: 227  RNPTTSSTRREVVAGNPKPNTPVNNGTI--PMDVDGTRRGPLSDAEKKRRYDNNLCLYCG 284

Query: 1230 ASGH 1241
             +GH
Sbjct: 285  QAGH 288


>ref|XP_001818504.2| gag-pol polyprotein [Aspergillus oryzae RIB40]
          Length = 1941

 Score = 95.5 bits (236), Expect(2) = 5e-21
 Identities = 73/273 (26%), Positives = 125/273 (45%), Gaps = 20/273 (7%)
 Frame = +2

Query: 272  KEQPITPTAPIP----DSTPVKIQL-----LCTQAYSRGKQARIHPQIEGPKITSSACEN 424
            K+ P+  T P        T VK QL     + TQ  +  K+   + +IE  K+     E 
Sbjct: 9    KKTPVKNTPPAETDSESETTVKEQLKQMKNMITQLVNNAKEK--NQEIENLKVQLGEAER 66

Query: 425  QQLVLATYFS-LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDP 601
             +     + + L  +V + +    + +       P  F G RSK + F+ Q+ ++  ++ 
Sbjct: 67   IRSEQQDHIAQLDAQVGASAPKDAIGKVKLPKAEP--FDGTRSKLQAFLTQMNMHIHANR 124

Query: 602  QKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI----------TGDVSFHKYSDFLAGLMA 751
            +    +  K+ F ++ LRG+A+ W EP+I +           T    F    +    L  
Sbjct: 125  KNLIDEADKVIFISTHLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLER 184

Query: 752  GFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKD 931
             F D D  A AER+++ L Q+GS S+Y ++F  +I+++ W E  V +  F  GLK  +KD
Sbjct: 185  TFGDVDAEAVAERKLKQLYQRGSASTYAAEFQQIISRMDWNE-KVYVSTFISGLKGHVKD 243

Query: 932  NLVGKDCPTTISEFAALCIKLDNQIEARRCEKK 1030
                 D P T++E     +K+DN+   R  EK+
Sbjct: 244  EFARIDRPATLNEAIDFAVKVDNRYHERLMEKR 276



 Score = 34.3 bits (77), Expect(2) = 5e-21
 Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 14/86 (16%)
 Frame = +3

Query: 1065 SHRP--QTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSRKSY-----------RRA 1205
            SHRP  Q  S+ Q+       + P  +       PMELDA   +S            R+ 
Sbjct: 285  SHRPKGQYKSNDQRERTGAKHNDPYGLK------PMELDATEGQSQSRGISQKERERRKR 338

Query: 1206 NNLCTYCGASGHWVRDC-EKLNSRDT 1280
              LC  CG +GH  +DC +K NS  +
Sbjct: 339  EKLCYNCGKAGHMSKDCRQKRNSHQS 364


>ref|XP_003189096.1| gag-pol polyprotein [Aspergillus oryzae RIB40]
          Length = 1941

 Score = 95.5 bits (236), Expect(2) = 5e-21
 Identities = 73/273 (26%), Positives = 125/273 (45%), Gaps = 20/273 (7%)
 Frame = +2

Query: 272  KEQPITPTAPIP----DSTPVKIQL-----LCTQAYSRGKQARIHPQIEGPKITSSACEN 424
            K+ P+  T P        T VK QL     + TQ  +  K+   + +IE  K+     E 
Sbjct: 9    KKTPVKNTPPAETDSESETTVKEQLKQMKNMITQLVNNAKEK--NQEIENLKVQLGEAER 66

Query: 425  QQLVLATYFS-LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDP 601
             +     + + L  +V + +    + +       P  F G RSK + F+ Q+ ++  ++ 
Sbjct: 67   IRSEQQDHIAQLDAQVGASAPKDAIGKVKLPKAEP--FDGTRSKLQAFLTQMNMHIHANR 124

Query: 602  QKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI----------TGDVSFHKYSDFLAGLMA 751
            +    +  K+ F ++ LRG+A+ W EP+I +           T    F    +    L  
Sbjct: 125  KNLIDEADKVIFISTHLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLER 184

Query: 752  GFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKD 931
             F D D  A AER+++ L Q+GS S+Y ++F  +I+++ W E  V +  F  GLK  +KD
Sbjct: 185  TFGDVDAEAVAERKLKQLYQRGSASTYAAEFQQIISRMDWNE-KVYVSTFISGLKGHVKD 243

Query: 932  NLVGKDCPTTISEFAALCIKLDNQIEARRCEKK 1030
                 D P T++E     +K+DN+   R  EK+
Sbjct: 244  EFARIDRPATLNEAIDFAVKVDNRYHERLMEKR 276



 Score = 34.3 bits (77), Expect(2) = 5e-21
 Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 14/86 (16%)
 Frame = +3

Query: 1065 SHRP--QTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSRKSY-----------RRA 1205
            SHRP  Q  S+ Q+       + P  +       PMELDA   +S            R+ 
Sbjct: 285  SHRPKGQYKSNDQRERTGAKHNDPYGLK------PMELDATEGQSQSRGISQKERERRKR 338

Query: 1206 NNLCTYCGASGHWVRDC-EKLNSRDT 1280
              LC  CG +GH  +DC +K NS  +
Sbjct: 339  EKLCYNCGKAGHMSKDCRQKRNSHQS 364

