BLASTX nr result
ID: Mentha25_contig00017040
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00017040 (1708 letters)

Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score   E
Sequences producing significant alignments:                           (bits)  Value

ref|XP_001591635.1| hypothetical protein SS1G_07081 [Sclerotinia...     113   9e-29
ref|XP_001598842.1| hypothetical protein SS1G_00931 [Sclerotinia...     112   2e-28
ref|XP_004912460.1| PREDICTED: retrotransposon-derived protein P...      84   2e-26
emb|CBN81178.1| Pol polyprotein [Dicentrarchus labrax]                   81   2e-26
gb|EKD04365.1| retrotransposon nucleocapsid protein [Trichosporo...     125   6e-26
gb|EKD00111.1| retrotransposon nucleocapsid protein [Trichosporo...     124   1e-25
gb|EXJ86300.1| hypothetical protein A1O1_06670 [Capronia coronat...     108   2e-25
emb|CCG85041.1| protein of unknown function [Taphrina deformans ...     111   2e-25
emb|CCG85028.1| protein of unknown function [Taphrina deformans ...      94   3e-24
emb|CCG85107.1| protein of unknown function [Taphrina deformans ...      96   3e-24
gb|EKG11343.1| Retrotransposon gag protein [Macrophomina phaseol...     117   2e-23
gb|AAR29046.2| gag-pol polyprotein [Aspergillus flavus]                  99   4e-23
ref|XP_007431817.1| PREDICTED: retrotransposon-derived protein P...      98   1e-22
gb|EKG20520.1| Retrotransposon gag protein [Macrophomina phaseol...     113   2e-22
gb|EKG15822.1| Retrotransposon gag protein [Macrophomina phaseol...     113   2e-22
emb|CCG84995.1| protein of unknown function [Taphrina deformans ...      91   9e-22
gb|AAH87517.1| LOC496091 protein, partial [Xenopus laevis]              110   2e-21
emb|CCG85123.1| protein of unknown function [Taphrina deformans ...      95   3e-21
ref|XP_001818504.2| gag-pol polyprotein [Aspergillus oryzae RIB40]       96   5e-21
ref|XP_003189096.1| gag-pol polyprotein [Aspergillus oryzae RIB40]       96   5e-21

>ref|XP_001591635.1| hypothetical protein SS1G_07081 [Sclerotinia sclerotiorum 1980] gi|154704859|gb|EDO04598.1| hypothetical protein SS1G_07081 [Sclerotinia sclerotiorum 1980 UF-70] Length = 1056 Score = 113 bits (282), Expect(2) = 9e-29 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 15/188 (7%) Frame = +2 Query: 524 PDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITG 703 PD F GDRSK++ FVAQ LYY + F T +I + SF RG+AF W+EPF + + Sbjct: 37 PDLFHGDRSKHRAFVAQADLYYAFNGHLFLTQMQRILWLISFFRGTAFNWIEPFFNDLMT 96 Query: 704 DVS---------------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYS 838 + F+ Y F G F + D AER + L Q GS ++Y + Sbjct: 97 KTTDGQLNENMKPETRRLFNTYESFRQGFDRAFGEVDPDHMAERALRQLKQTGSVTAYTA 156 Query: 839 QFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARR 1018 +F ++ W +D ++ ++ GLKD IKD L P+ +SE I +DN+ RR Sbjct: 157 KFQQYAGRITWDDDYLRSQFY-EGLKDIIKDELARAPKPSNLSELIETSILIDNRFYERR 215 Query: 1019 CEKKKSTR 1042 EKK T+ Sbjct: 216 MEKKGITQ 223 Score = 42.7 bits (99), Expect(2) = 9e-29 Identities = 22/46 (47%), Positives = 23/46 (50%), Gaps = 11/46 (23%) Frame = +3 Query: 1158 DPMELDAVSR-----------KSYRRANNLCTYCGASGHWVRDCEK 1262 DPMELD R K RR NNLC CG SGH RDC + Sbjct: 242 DPMELDGAERHQKPNGLSVEEKKRRRENNLCFTCGKSGHMSRDCSQ 287
>ref|XP_001598842.1| hypothetical protein SS1G_00931 [Sclerotinia sclerotiorum 1980] gi|154691790|gb|EDN91528.1| hypothetical protein SS1G_00931 [Sclerotinia sclerotiorum 1980 UF-70] Length = 433 Score = 112 bits (280), Expect(2) = 2e-28 Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 15/188 (7%) Frame = +2 Query: 524 PDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITG 703 PD F GDRSK++ FVAQ LYY + F T +I + SF RG+AF W+EPF + + Sbjct: 37 PDLFHGDRSKHRAFVAQADLYYAFNGHLFLTQMQRILWLISFFRGTAFNWIEPFFNDLMT 96 Query: 704
DVS---------------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYS 838 + F+ Y F G F + D AER + L Q GS ++Y + Sbjct: 97 KTTDGQLNENMKPETRRLFNTYESFRQGFDRAFGEVDPDHMAERALRQLKQTGSVTAYTA 156 Query: 839 QFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARR 1018 +F ++ W +D ++ ++ GLKD IKD L P +SE I +DN+ RR Sbjct: 157 KFQQYAGRITWDDDYLRSQFY-EGLKDIIKDELARAPKPPNLSELIETSILIDNRFYERR 215 Query: 1019 CEKKKSTR 1042 EKK T+ Sbjct: 216 MEKKGITQ 223 Score = 42.7 bits (99), Expect(2) = 2e-28 Identities = 22/46 (47%), Positives = 23/46 (50%), Gaps = 11/46 (23%) Frame = +3 Query: 1158 DPMELDAVSR-----------KSYRRANNLCTYCGASGHWVRDCEK 1262 DPMELD R K RR NNLC CG SGH RDC + Sbjct: 242 DPMELDGAERHQKPNGLSVEEKKRRRENNLCFTCGKSGHMSRDCSQ 287 >ref|XP_004912460.1| PREDICTED: retrotransposon-derived protein PEG10-like [Xenopus (Silurana) tropicalis] Length = 566 Score = 84.3 bits (207), Expect(3) = 2e-26 Identities = 51/174 (29%), Positives = 81/174 (46%), Gaps = 1/174 (0%) Frame = +2 Query: 509 PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688 P P++F GDR ++TF L + P+ +ST++ K+ S L G W + Sbjct: 67 PNVAMPEKFSGDRKTFRTFTNACKLLFTLKPRMYSTEQIKVGVIISLLLGEPQSWAFHLM 126 Query: 689 DQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGS-CSSYYSQFVALIAQL 865 + T S F L + DP + A AE + NL Q+ Y +F + + Sbjct: 127 E--TRSTSLLTVDSFFQALAVLYDDPHRTAAAEASLRNLRQRSRPVEDYTVEFRKYASDV 184 Query: 866 GWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027 W + ++K H FR GL DS+KD L P ++ E L I++D ++ R+ EK Sbjct: 185 DWNQAALK-HQFRLGLSDSLKDELARVGVPASLDEIIHLSIQIDRRLRERKLEK 237 Score = 47.8 bits (112), Expect(3) = 2e-26 Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 2/92 (2%) Frame = +1 Query: 1438 AMIDSGATSLFVDAEFLXXXXXXXXXXXYPETLRVVDGRESCEGAIKHE-IELDIWLGD- 1611 A++DSGA+ F+D++ +P LRV DG G I E L + L Sbjct: 345 AILDSGASGCFLDSQVAQIHRIPLKKKQFPVFLRVADGSPINSGPILLESTPLSLTLNKI 404 Query: 1612 HKERTLFQVTKLAEYPLILGKAWLDRHNPDID 1707 H E F + P+I+G WL RHNP I+ Sbjct: 405 HHEHLSFDIVSSPLSPVIIGLPWLRRHNPVIN 436 Score = 36.2 bits (82), Expect(3) = 
2e-26 Identities = 29/102 (28%), Positives = 42/102 (41%), Gaps = 25/102 (24%) Frame = +3 Query: 1119 PDTPQPMTNLPVD------DPMELDAV------SRKSYRRANNLCTYCGASGHWVRDC-- 1256 P P P+ + P +PM++ A+ + RR NLC YCG SGH +R C Sbjct: 248 PKAPPPIRSEPGPNSSDEMEPMQIGALRPALSPEERLRRRRLNLCLYCGLSGHVLRSCPT 307 Query: 1257 ----------EKLNSRDTRVAAAALSTD-SEKDLTLPALYQS 1349 + L R + ++S K L LPA+ S Sbjct: 308 RPCKRSTYKTDTLKYRFAPLLTLSISLQLGNKTLNLPAILDS 349 >emb|CBN81178.1| Pol polyprotein [Dicentrarchus labrax] Length = 1618 Score = 80.9 bits (198), Expect(3) = 2e-26 Identities = 51/175 (29%), Positives = 89/175 (50%), Gaps = 2/175 (1%) Frame = +2 Query: 509 PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688 P P+ + GD F+ Q +L + P +S+D +++ F S L G A +W Sbjct: 229 PHVPTPERYAGDLGACGRFLLQCSLVFQQQPLTYSSDSTRVAFVISLLSGKAAQWATALW 288 Query: 689 DQITGDV-SFHKYSDFLAGLMAGFADPDQYATAEREIENLIQ-KGSCSSYYSQFVALIAQ 862 ++ + +F ++SD L + F P + A + + NL Q GS + + +F L A+ Sbjct: 289 EKHSPICETFQRFSDELRKV---FDHPVRGREAAKRLLNLRQGSGSVAEFSVEFRVLAAE 345 Query: 863 LGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027 GW E++++ F GL + +KD L +D ++ E +L I+LDN++ RR EK Sbjct: 346 SGWDEEALQT-VFVHGLSEVMKDELAARDSAASLDELISLAIRLDNRLRERRREK 399 Score = 43.9 bits (102), Expect(3) = 2e-26 Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 1/91 (1%) Frame = +1 Query: 1438 AMIDSGATSLFVDAEFLXXXXXXXXXXXYPETLRVVDGRESCEGAIKHEIELDIWL-GDH 1614 A++DSGA F+D+ F+ + + +DG+ + + L + L G+H Sbjct: 522 ALVDSGAEESFIDSAFVLQANIPTIKLPDNQPVNALDGKHLAN-ITRQTVPLTLILSGNH 580 Query: 1615 KERTLFQVTKLAEYPLILGKAWLDRHNPDID 1707 +E V P++LG WL HNP D Sbjct: 581 REEISLLVISSPNTPVVLGYPWLKLHNPQFD 611 Score = 43.1 bits (100), Expect(3) = 2e-26 Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 4/87 (4%) Frame = +3 Query: 1149 PVDDPMELDAV----SRKSYRRANNLCTYCGASGHWVRDCEKLNSRDTRVAAAALSTDSE 1316 PV++PM+L + + R + LC YCG GH++ C +L ++T A +S Sbjct: 437 PVEEPMQLGRTRLTQAERQRRMRSGLCIYCGQHGHFLAACPQLPKKETENGVAPVSQVPA 496 Query: 1317 KDLTLPALYQSKN*LH*RTHGYPLVTL 
1397 + Q K LH + + PL L Sbjct: 497 SSSAPLSRLQLKASLHWQLNSIPLTAL 523 >gb|EKD04365.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. asahii CBS 8904] Length = 1687 Score = 125 bits (314), Expect = 6e-26 Identities = 60/188 (31%), Positives = 105/188 (55%) Frame = +2 Query: 509 PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688 P+ P+ F G R+K TF+ Q+ + G P +F T+ SK+ +A S+LR +AF W +P++ Sbjct: 60 PKVSSPEYFSGQRNKVTTFITQVRMVIGLQPSRFPTENSKVLYAGSYLRDTAFLWFQPYV 119 Query: 689 DQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLG 868 + ++ F L + F DPD+ ATAER++ NL Q+GS S+Y + F A + Sbjct: 120 ASEKQPDWLNDFNLFCKELRSMFGDPDEVATAERQLYNLRQRGSASAYVADFTRYAALVN 179 Query: 869 WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTRII 1048 W ++++ ++ RGLKD+IKD L D P + + + +++D ++ R E+ +S Sbjct: 180 WNDEALCAQFY-RGLKDAIKDELARTDKPKDLKTYKDIAVRIDTRLFERHLERDRSKTFT 238 Query: 1049 KAIANFSS 1072 F++ Sbjct: 239 TTTTTFNN 246 >gb|EKD00111.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. 
asahii CBS 8904] Length = 1662 Score = 124 bits (312), Expect = 1e-25 Identities = 61/178 (34%), Positives = 101/178 (56%) Frame = +2 Query: 509 PEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI 688 P+ P+ F G R+K TF+ Q+ + G P +F T+ SK+ +A SFL +AF WL+P++ Sbjct: 58 PKVSSPEYFSGQRNKVTTFITQVRMVIGLQPSRFPTENSKVLYAGSFLCDTAFLWLQPYV 117 Query: 689 DQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLG 868 + ++ F L + F DPD+ ATAER++ NL Q+GS S+Y + F A + Sbjct: 118 ASDHPPAWLNDFNLFCKELRSMFGDPDEVATAERQLYNLRQRGSASAYVADFTRFAAVVN 177 Query: 869 WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042 W ++++ ++ RGLKD IKD L D P + + +++D ++ R EK +S + Sbjct: 178 WNDEALCAQFY-RGLKDPIKDELARTDKPKDLKAYKETAVRIDTRLFERHNEKDRSVK 234 >gb|EXJ86300.1| hypothetical protein A1O1_06670 [Capronia coronata CBS 617.96] Length = 799 Score = 108 bits (269), Expect(2) = 2e-25 Identities = 63/185 (34%), Positives = 100/185 (54%), Gaps = 12/185 (6%) Frame = +2 Query: 512 EGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEP--- 682 E + D F+GD++K K F+ QL + P K+ S++ FAA L+G+AF W EP Sbjct: 9 EEVKVDYFYGDKAKLKMFLVQLKAIFKLYPAKYPNPSSQVLFAALNLKGAAFAWFEPTMT 68 Query: 683 ---------FIDQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYY 835 +DQ T + FH +++F + F D+ ATAER + ++ Q+GS + YY Sbjct: 69 DYLEANESSSLDQETRMI-FHSFANFEIKIKQVFGVADEEATAERMLHDVKQRGSTAQYY 127 Query: 836 SQFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEAR 1015 + F L ++ W ED++ Y+ RGL D +KD + + P T + I++DN++ R Sbjct: 128 ALFKQLAKRVSWNEDALAAAYY-RGLSDQVKDRM--DEVPDTYKDMVDKSIEIDNRLYER 184 Query: 1016 RCEKK 1030 R EKK Sbjct: 185 RMEKK 189 Score = 36.2 bits (82), Expect(2) = 2e-25 Identities = 26/89 (29%), Positives = 36/89 (40%), Gaps = 28/89 (31%) Frame = +3 Query: 1140 TNLPVDDPMELDAVSR---------------------KSYRRANNLCTYCGASGHWVRDC 1256 TN DPM+LDA+ R + R+ NLC CG SGH ++C Sbjct: 205 TNYSYGDPMDLDAMERGRSSRPKGQRFGGFRSNGNKEREKRKKENLCYNCGKSGHRAKEC 264 Query: 1257 ----EKLNSRDTRVAAAALSTDS---EKD 1322 ++L+ D A D+ EKD Sbjct: 265 
HAKAQQLHMMDDSAGIEAKKADTSMKEKD 293 >emb|CCG85041.1| protein of unknown function [Taphrina deformans PYCC 5710] Length = 309 Score = 111 bits (277), Expect(2) = 2e-25 Identities = 59/175 (33%), Positives = 104/175 (59%), Gaps = 2/175 (1%) Frame = +2 Query: 524 PDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITG 703 PDEF G R K +TF+ Q+ L + ++P FSTD + A S+LRG A++W+ P ++ Sbjct: 65 PDEFHGTRKKLETFLFQMELKFEAEPDVFSTDHRRTICAISYLRGEAYEWVIP-AQRLGL 123 Query: 704 DVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDS 883 + F Y+ F L+ F +P++ +R+I L Q GSC++Y F++L +LGW +++ Sbjct: 124 EALFPTYTVFHESLVRAFGNPNELDNYKRKIRLLRQHGSCANYTRVFMSLCTRLGWNQEA 183 Query: 884 VKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK--KKSTR 1042 ++ Y R+GL D++KD ++ + ++ + + D ++EAR+ E+ KKS R Sbjct: 184 LRSQY-RQGLSDAVKDQMIHVNTGVSLQDMIDQALLNDGRLEARQIERQNKKSLR 237 Score = 33.1 bits (74), Expect(2) = 2e-25 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 4/43 (9%) Frame = +3 Query: 1161 PMELDAV----SRKSYRRANNLCTYCGASGHWVRDCEKLNSRD 1277 PME+DAV S+K ++ C C +GH RDC + S++ Sbjct: 267 PMEIDAVEATTSKKQEQQRLGKCFTCNKTGHLARDCPEKRSKN 309 >emb|CCG85028.1| protein of unknown function [Taphrina deformans PYCC 5710] Length = 334 Score = 94.4 bits (233), Expect(2) = 3e-24 Identities = 73/229 (31%), Positives = 113/229 (49%), Gaps = 4/229 (1%) Frame = +2 Query: 368 ARIHPQIE--GPKITSSACENQQLVLATYFSLGMEVSSDSKPQIVSQAMPEGKRPDEFFG 541 AR+ Q++ GP T S +L LA+ S + P + K P+ F G Sbjct: 27 ARLQHQVDSQGPATTPSQGVTPEL-LASLASAFTAAAPPRSPGYSGDNSLKLKEPEVFNG 85 Query: 542 DRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITGDV-SFH 718 R + F+A L L + ++ ++F D SKI +A S LRG AF+ ++ +QI DV + Sbjct: 86 GRKDLERFLAALLLKFSAERKRFPDDHSKITYAMSLLRGDAFEIVQ---NQIVQDVDNLG 142 Query: 719 KYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHY 898 + DF + L F DPD TA E+ENL Q+GS Y ++F L + D K+ + Sbjct: 143 SFQDFRSSLERAFGDPDSAQTAMLELENLRQRGSIVKYNAEFHRL-ENILHLNDIAKLAF 201 Query: 899 FRRGLKDSIKDNLVGK-DCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042 +RRG+ D IK+ 
L K + T F +LD + R+ +++ S R Sbjct: 202 YRRGVSDHIKNILSEKLEKYDTFEAFEKAVTQLDANLYVRKQDQRYSVR 250 Score = 46.6 bits (109), Expect(2) = 3e-24 Identities = 26/76 (34%), Positives = 34/76 (44%), Gaps = 12/76 (15%) Frame = +3 Query: 1089 HSQKSVENHAPDTPQPMTNLPVDDPMELDAV------------SRKSYRRANNLCTYCGA 1232 H +N P P +T PME+DAV K YRR N+LC+YCG Sbjct: 256 HHHGRGDNTGPRGPAVITE-QTHTPMEIDAVISTPARRGPLSDKEKKYRRDNDLCSYCGG 314 Query: 1233 SGHWVRDCEKLNSRDT 1280 SGH+ C + + T Sbjct: 315 SGHYANSCPEKKKKFT 330 >emb|CCG85107.1| protein of unknown function [Taphrina deformans PYCC 5710] Length = 334 Score = 95.9 bits (237), Expect(2) = 3e-24 Identities = 72/229 (31%), Positives = 115/229 (50%), Gaps = 4/229 (1%) Frame = +2 Query: 368 ARIHPQIE--GPKITSSACENQQLVLATYFSLGMEVSSDSKPQIVSQAMPEGKRPDEFFG 541 AR+ Q++ GP +T + +L LA+ S + P + K P+ F G Sbjct: 27 ARLQHQVDSQGPAVTPAQGVTPEL-LASLASAFTAAAPPRSPGYSGDNSLKLKEPEVFNG 85 Query: 542 DRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQITGDV-SFH 718 R + F+A L L + ++ ++F D SKI +A S LRG AF+ ++ +QI DV + Sbjct: 86 GRKDLERFLAALLLKFSAERKRFPDDHSKITYAMSLLRGDAFEIVQ---NQIVQDVDNLG 142 Query: 719 KYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHY 898 + DF + L F DPD TA E+ENL Q+GS Y ++F L + D K+ + Sbjct: 143 SFQDFRSSLERAFGDPDSAQTAMLELENLRQRGSIVKYNAEFHRL-ENILHLNDIAKLAF 201 Query: 899 FRRGLKDSIKDNLVGK-DCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042 +RRG+ D IK+ L+ K + T F +LD + R+ +++ S R Sbjct: 202 YRRGVSDHIKNILLEKLEKYDTFEAFEKAVTQLDANLYVRKQDQRYSVR 250 Score = 44.7 bits (104), Expect(2) = 3e-24 Identities = 24/68 (35%), Positives = 31/68 (45%), Gaps = 12/68 (17%) Frame = +3 Query: 1089 HSQKSVENHAPDTPQPMTNLPVDDPMELDAV------------SRKSYRRANNLCTYCGA 1232 H +N P P +T PM++DAV K YRR N+LC+YCG Sbjct: 256 HHHGRGDNTGPRGPA-VTTKQTHTPMDIDAVISTPAHRGPLSDKEKKYRRDNDLCSYCGG 314 Query: 1233 SGHWVRDC 1256 SGH+ C Sbjct: 315 SGHYANSC 322 >gb|EKG11343.1| Retrotransposon gag protein [Macrophomina phaseolina MS6] Length = 634 Score = 117 bits (292), Expect = 2e-23 Identities = 65/182 (35%), Positives = 
105/182 (57%), Gaps = 11/182 (6%) Frame = +2 Query: 521 RPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI-DQI 697 +PD F+G+R K +TF++QL LY+ + + F TD K+ FAA++LR +A +W EP++ D++ Sbjct: 87 KPDLFYGNRKKLQTFLSQLDLYFFFNSRDFPTDDKKVMFAATYLRDTAAQWFEPYLRDRM 146 Query: 698 TGDVS---------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVA 850 + F Y F+ + F D D+ A R + N+ QK S + Y ++F Sbjct: 147 EKEPEARKEDTKKVFGSYKHFVTQIKQSFGDLDEVNKARRAVMNIHQKTSVADYTTEFQK 206 Query: 851 LIAQLG-WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027 A L W++ ++ HY+R GLK+ +KD L+ +D P T+ E L IK DN++ R+ EK Sbjct: 207 AAAYLDDWSDRALMDHYYR-GLKERVKDQLMTQDDPKTLDELIKLAIKCDNRLFERQSEK 265 Query: 1028 KK 1033 K Sbjct: 266 YK 267 >gb|AAR29046.2| gag-pol polyprotein [Aspergillus flavus] Length = 1998 Score = 98.6 bits (244), Expect(2) = 4e-23 Identities = 74/273 (27%), Positives = 127/273 (46%), Gaps = 20/273 (7%) Frame = +2 Query: 272 KEQPITPTAPIP----DSTPVKIQL-----LCTQAYSRGKQARIHPQIEGPKITSSACEN 424 K+ P+ T P T VK QL + TQ + K+ + +IE K+ E Sbjct: 9 KKTPVKSTPPAETDSESETTVKEQLKQMKSMITQLVNNAKEK--NQEIENLKVQLGEAER 66 Query: 425 QQLVLATYFS-LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDP 601 + + + L +V + + + + P F G RSK + F+ Q+ ++ ++ Sbjct: 67 IRNEQQDHIAQLDAQVGASAPKDAIGKVKLPKAEP--FDGTRSKLQAFLTQMNMHIHANR 124 Query: 602 QKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI----------TGDVSFHKYSDFLAGLMA 751 + + K+ F ++ LRG+A+ W EP+I + T F + L Sbjct: 125 KNLIDEADKVIFISTHLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLER 184 Query: 752 GFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKD 931 F D D A AER++++L Q+GS S+Y ++F +I+++ W E V + F GLKD +KD Sbjct: 185 TFGDVDAEAVAERKLKHLYQRGSASTYAAEFQQIISRMDWNE-KVYVSTFISGLKDHVKD 243 Query: 932 NLVGKDCPTTISEFAALCIKLDNQIEARRCEKK 1030 D P T++E +K+DN+ R EK+ Sbjct: 244 EFARIDRPATLNEAIDFAVKVDNRYHERLMEKR 276 Score = 38.5 bits (88), Expect(2) = 4e-23 Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 8/78 (10%) Frame = +3 Query: 1065 SHRP--QTMSHSQKS---VENHAPDTPQPMTNLPVDDPMELDAVSRKSY--RRANNLCTY 1223 SHRP Q S+ 
Q+ V+++ P P+PM + + +S+K RR LC Sbjct: 285 SHRPKGQYKSNDQRERTGVKHNDPYGPKPMELDATEGQGQSKGISQKERERRRREKLCYN 344 Query: 1224 CGASGHWVRDC-EKLNSR 1274 CG +GH +DC +K NS+ Sbjct: 345 CGRAGHMSKDCRQKRNSQ 362 >ref|XP_007431817.1| PREDICTED: retrotransposon-derived protein PEG10-like [Python bivittatus] Length = 495 Score = 97.8 bits (242), Expect(2) = 1e-22 Identities = 60/194 (30%), Positives = 96/194 (49%), Gaps = 1/194 (0%) Frame = +2 Query: 455 LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIR 634 +G+ V P G P+++ G+ + +TF+AQ L+ P +F TD++++ Sbjct: 44 IGLGVQPPPPPPPPVLLCSPGSMPEKYGGEVEQMRTFLAQCELFLDGRPGEFPTDQTRVA 103 Query: 635 FAASFLRGSAFKWLEPFIDQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQK 814 F S L+GSA KW PFI+ T D + Y +F+ F DP + TA R+I L Q Sbjct: 104 FVMSLLKGSAAKWATPFIE--TRDPMLNNYQNFVTAFRGHFGDPVRRLTACRDIRKLKQG 161 Query: 815 GS-CSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIK 991 G + + F +L+ + W E ++ I FR GL ++ +V + P T+ LCI Sbjct: 162 GKPVRLFIADFKSLVGDVEWNEIAL-IDQFREGLDPELRSEMVKQGIPNTLDGLYQLCI- 219 Query: 992 LDNQIEARRCEKKK 1033 +EAR E K+ Sbjct: 220 ---MVEARLMELKQ 230 Score = 37.4 bits (85), Expect(2) = 1e-22 Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 17/89 (19%) Frame = +3 Query: 1125 TPQPM-----------TNLPVDDPMELDAVSR------KSYRRANNLCTYCGASGHWVRD 1253 TPQP+ T +PM+L A R + RR NLC YCG GH + Sbjct: 239 TPQPLLATIPSPRAVTTEFTAGEPMQLGAAQRTMSGEERQRRRDLNLCFYCGTPGHMI-- 296 Query: 1254 CEKLNSRDTRVAAAALSTDSEKDLTLPAL 1340 KLN D+ + + + + LP + Sbjct: 297 --KLNLIDSGTTMSFIDVQTVQKWQLPTV 323 >gb|EKG20520.1| Retrotransposon gag protein [Macrophomina phaseolina MS6] Length = 296 Score = 113 bits (283), Expect = 2e-22 Identities = 63/182 (34%), Positives = 104/182 (57%), Gaps = 11/182 (6%) Frame = +2 Query: 521 RPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI-DQI 697 +PD F+G+R K +TF++QL LY+ + + F TD ++ FAA++LR +A +W EP++ D++ Sbjct: 72 KPDLFYGNRKKLQTFLSQLDLYFFFNSRDFPTDDKRVMFAATYLRDTAAQWFEPYLRDRM 131 Query: 698 
TGDVS---------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVA 850 + F Y F+ + D D+ A R + N+ QK S + Y ++F Sbjct: 132 EKEPEARKENTKKVFGSYKHFVTQIKQSSGDLDEVNKARRAVMNIHQKTSVADYTTEFQK 191 Query: 851 LIAQLG-WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027 A L W++ ++ HY+R GLK+ +KD L+ +D P T+ E L IK DN++ R+ EK Sbjct: 192 AAAYLDDWSDRALMDHYYR-GLKERVKDQLITQDDPKTLDELIKLAIKCDNRLFKRQSEK 250 Query: 1028 KK 1033 K Sbjct: 251 YK 252 >gb|EKG15822.1| Retrotransposon gag protein [Macrophomina phaseolina MS6] Length = 296 Score = 113 bits (283), Expect = 2e-22 Identities = 64/182 (35%), Positives = 104/182 (57%), Gaps = 11/182 (6%) Frame = +2 Query: 521 RPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFI-DQI 697 +PD F+G+R K +TF++QL LY+ + + F TD +I FAA++LR + +W EP++ D+I Sbjct: 72 KPDLFYGNRKKLQTFLSQLDLYFFFNSRDFPTDDKRIIFAATYLRDTTAQWFEPYLRDRI 131 Query: 698 TGDVS---------FHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGSCSSYYSQFVA 850 + F Y F+ + F D D+ A R + N+ QK S + Y ++F Sbjct: 132 EKEPEARKKDTKKVFSSYKHFVTQIKQSFGDLDEVNKARRAVINIHQKTSVADYTTEFQK 191 Query: 851 LIAQLG-WTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEK 1027 A L W++ ++ HY+R GLK+ +KD L+ +D P T+ + L IK DN++ R+ EK Sbjct: 192 AAAYLDDWSDRALMDHYYR-GLKERVKDQLMTQDDPKTLDKLIKLAIKCDNRLFKRQSEK 250 Query: 1028 KK 1033 K Sbjct: 251 YK 252 >emb|CCG84995.1| protein of unknown function [Taphrina deformans PYCC 5710] Length = 303 Score = 91.3 bits (225), Expect(2) = 9e-22 Identities = 56/177 (31%), Positives = 91/177 (51%), Gaps = 2/177 (1%) Frame = +2 Query: 518 KRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI 697 K P F G+ ++ +TFV+ L L + ++ F T+ K+ +A S LR A + LEP+++ Sbjct: 44 KDPAFFTGNPAELRTFVSGLQLKFYAEAISFDTEAKKVSYACSLLRDGAAQVLEPYLNNF 103 Query: 698 TGDV-SFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGS-CSSYYSQFVALIAQLGW 871 + S + DF L F DPD+ T ER++ L Q + Y +QF+ L A LGW Sbjct: 104 AQYMDSISSFEDFAKLLQTSFGDPDEKKTFERDLYRLFQNSDPVTVYTAQFLRLSAPLGW 163 Query: 872 TEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042 ++++ Y GL +KD L + P +E + 
K++ + AR E+K S R Sbjct: 164 NNEALESRYL-YGLSKRVKDELTRRAPPRGRAELMQMASKINARFRARDLERKDSHR 219 Score = 41.2 bits (95), Expect(2) = 9e-22 Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 7/76 (9%) Frame = +3 Query: 1071 RPQTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSR-------KSYRRANNLCTYCG 1229 R T+ +Q+ V P + N + PM+LD R K R NNLC YCG Sbjct: 227 RNPTVPSTQREVVAGNPRPSTSVNNRTI--PMDLDGTKRGPLSDAEKKRRYNNNLCLYCG 284 Query: 1230 ASGHWVRDCEKLNSRD 1277 +GH + +C+ N ++ Sbjct: 285 QAGHQIDECKLRNRKN 300 >gb|AAH87517.1| LOC496091 protein, partial [Xenopus laevis] Length = 225 Score = 110 bits (275), Expect = 2e-21 Identities = 65/189 (34%), Positives = 100/189 (52%), Gaps = 2/189 (1%) Frame = +2 Query: 467 VSSDSKPQI-VSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAA 643 V+ +PQ+ + A + PD+F GDR ++ FV Q L + P KF D K+ + Sbjct: 33 VAQTQQPQVGATSAAIKMPVPDKFSGDRKMFRGFVNQCKLLFMLQPNKFQDDTLKVGWIL 92 Query: 644 SFLRGSAFKWLEPFIDQITGDVSFHKYSDFLAGLMAGFADPDQYATAEREIENLIQ-KGS 820 + L G A W P I+Q + +S ++ FLA + F DP++ ATAE + L Q S Sbjct: 93 TLLSGEALAWASPLIEQQSPLLS--NFNGFLAAMSVIFDDPNKIATAETTLLTLTQGSRS 150 Query: 821 CSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDN 1000 + Y + F + W E + + H FRRGL +++KD L D P +S F LCIK+D+ Sbjct: 151 VAEYAATFRRWVLDTSWNEAAQRFH-FRRGLSEAMKDELARVDAPDNLSSFVQLCIKIDS 209 Query: 1001 QIEARRCEK 1027 ++ RR E+ Sbjct: 210 RLSERRKER 218 >emb|CCG85123.1| protein of unknown function [Taphrina deformans PYCC 5710] Length = 292 Score = 95.1 bits (235), Expect(2) = 3e-21 Identities = 56/177 (31%), Positives = 93/177 (52%), Gaps = 2/177 (1%) Frame = +2 Query: 518 KRPDEFFGDRSKYKTFVAQLALYYGSDPQKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI 697 K P F GD ++ +TFV+ L L + ++ F T+ K+ +A S LR A + LEP+++ Sbjct: 44 KDPAFFTGDPAELRTFVSGLQLKFYAEAISFDTEAKKVSYACSLLRDGAAQVLEPYLNNF 103 Query: 698 TGDV-SFHKYSDFLAGLMAGFADPDQYATAEREIENLIQKGS-CSSYYSQFVALIAQLGW 871 + S + DF L F DPD+ T ER++ L Q + Y +QF+ L A LGW Sbjct: 104 AQYMDSISSFKDFAKLLQTSFGDPDEKKTFERDLYRLFQNSDPVTVYTAQFLRLSAPLGW 163 Query: 872 
TEDSVKIHYFRRGLKDSIKDNLVGKDCPTTISEFAALCIKLDNQIEARRCEKKKSTR 1042 +++++ Y GL + +KD L + P +E + ++D + AR E++ S R Sbjct: 164 NDEALESRYL-YGLSERVKDELTRRAPPRNRAELMQMASEIDARFRARDLERRDSHR 219 Score = 35.8 bits (81), Expect(2) = 3e-21 Identities = 22/64 (34%), Positives = 30/64 (46%), Gaps = 7/64 (10%) Frame = +3 Query: 1071 RPQTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSR-------KSYRRANNLCTYCG 1229 R T S +++ V P P+ N + PM++D R K R NNLC YCG Sbjct: 227 RNPTTSSTRREVVAGNPKPNTPVNNGTI--PMDVDGTRRGPLSDAEKKRRYDNNLCLYCG 284 Query: 1230 ASGH 1241 +GH Sbjct: 285 QAGH 288 >ref|XP_001818504.2| gag-pol polyprotein [Aspergillus oryzae RIB40] Length = 1941 Score = 95.5 bits (236), Expect(2) = 5e-21 Identities = 73/273 (26%), Positives = 125/273 (45%), Gaps = 20/273 (7%) Frame = +2 Query: 272 KEQPITPTAPIP----DSTPVKIQL-----LCTQAYSRGKQARIHPQIEGPKITSSACEN 424 K+ P+ T P T VK QL + TQ + K+ + +IE K+ E Sbjct: 9 KKTPVKNTPPAETDSESETTVKEQLKQMKNMITQLVNNAKEK--NQEIENLKVQLGEAER 66 Query: 425 QQLVLATYFS-LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDP 601 + + + L +V + + + + P F G RSK + F+ Q+ ++ ++ Sbjct: 67 IRSEQQDHIAQLDAQVGASAPKDAIGKVKLPKAEP--FDGTRSKLQAFLTQMNMHIHANR 124 Query: 602 QKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI----------TGDVSFHKYSDFLAGLMA 751 + + K+ F ++ LRG+A+ W EP+I + T F + L Sbjct: 125 KNLIDEADKVIFISTHLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLER 184 Query: 752 GFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKD 931 F D D A AER+++ L Q+GS S+Y ++F +I+++ W E V + F GLK +KD Sbjct: 185 TFGDVDAEAVAERKLKQLYQRGSASTYAAEFQQIISRMDWNE-KVYVSTFISGLKGHVKD 243 Query: 932 NLVGKDCPTTISEFAALCIKLDNQIEARRCEKK 1030 D P T++E +K+DN+ R EK+ Sbjct: 244 EFARIDRPATLNEAIDFAVKVDNRYHERLMEKR 276 Score = 34.3 bits (77), Expect(2) = 5e-21 Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 14/86 (16%) Frame = +3 Query: 1065 SHRP--QTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSRKSY-----------RRA 1205 SHRP Q S+ Q+ + P + PMELDA +S R+ Sbjct: 285 SHRPKGQYKSNDQRERTGAKHNDPYGLK------PMELDATEGQSQSRGISQKERERRKR 338 Query: 1206 NNLCTYCGASGHWVRDC-EKLNSRDT 
1280 LC CG +GH +DC +K NS + Sbjct: 339 EKLCYNCGKAGHMSKDCRQKRNSHQS 364 >ref|XP_003189096.1| gag-pol polyprotein [Aspergillus oryzae RIB40] Length = 1941 Score = 95.5 bits (236), Expect(2) = 5e-21 Identities = 73/273 (26%), Positives = 125/273 (45%), Gaps = 20/273 (7%) Frame = +2 Query: 272 KEQPITPTAPIP----DSTPVKIQL-----LCTQAYSRGKQARIHPQIEGPKITSSACEN 424 K+ P+ T P T VK QL + TQ + K+ + +IE K+ E Sbjct: 9 KKTPVKNTPPAETDSESETTVKEQLKQMKNMITQLVNNAKEK--NQEIENLKVQLGEAER 66 Query: 425 QQLVLATYFS-LGMEVSSDSKPQIVSQAMPEGKRPDEFFGDRSKYKTFVAQLALYYGSDP 601 + + + L +V + + + + P F G RSK + F+ Q+ ++ ++ Sbjct: 67 IRSEQQDHIAQLDAQVGASAPKDAIGKVKLPKAEP--FDGTRSKLQAFLTQMNMHIHANR 124 Query: 602 QKFSTDKSKIRFAASFLRGSAFKWLEPFIDQI----------TGDVSFHKYSDFLAGLMA 751 + + K+ F ++ LRG+A+ W EP+I + T F + L Sbjct: 125 KNLIDEADKVIFISTHLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLER 184 Query: 752 GFADPDQYATAEREIENLIQKGSCSSYYSQFVALIAQLGWTEDSVKIHYFRRGLKDSIKD 931 F D D A AER+++ L Q+GS S+Y ++F +I+++ W E V + F GLK +KD Sbjct: 185 TFGDVDAEAVAERKLKQLYQRGSASTYAAEFQQIISRMDWNE-KVYVSTFISGLKGHVKD 243 Query: 932 NLVGKDCPTTISEFAALCIKLDNQIEARRCEKK 1030 D P T++E +K+DN+ R EK+ Sbjct: 244 EFARIDRPATLNEAIDFAVKVDNRYHERLMEKR 276 Score = 34.3 bits (77), Expect(2) = 5e-21 Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 14/86 (16%) Frame = +3 Query: 1065 SHRP--QTMSHSQKSVENHAPDTPQPMTNLPVDDPMELDAVSRKSY-----------RRA 1205 SHRP Q S+ Q+ + P + PMELDA +S R+ Sbjct: 285 SHRPKGQYKSNDQRERTGAKHNDPYGLK------PMELDATEGQSQSRGISQKERERRKR 338 Query: 1206 NNLCTYCGASGHWVRDC-EKLNSRDT 1280 LC CG +GH +DC +K NS + Sbjct: 339 EKLCYNCGKAGHMSKDCRQKRNSHQS 364