BLASTX nr result

ID: Mentha25_contig00049815 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00049815
         (786 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002556819.1| Pc06g02160 [Penicillium chrysogenum Wisconsi...   162   2e-37
gb|AAZ28935.1| polyprotein [Phanerochaete chrysosporium RP-78]        159   1e-36
emb|CAI72292.1| putative polyprotein [Phytophthora infestans]         158   2e-36
gb|AAZ28936.1| polyprotein [Phanerochaete chrysosporium RP-78]        156   9e-36
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi...   151   3e-34
ref|XP_002488353.1| conserved hypothetical protein [Talaromyces ...   150   4e-34
gb|EUC59597.1| Gag-Pol polyprotein/retrotransposon, putative [Rh...   150   5e-34
ref|XP_001596107.1| hypothetical protein SS1G_02323 [Sclerotinia...   150   7e-34
ref|XP_969432.2| PREDICTED: similar to Copia protein (Gag-int-po...   149   9e-34
gb|EFA07743.1| hypothetical protein TcasGA2_TC002223 [Tribolium ...   149   9e-34
gb|EMR87315.1| putative retroelement pol poly protein [Botryotin...   149   1e-33
gb|EFA07744.1| hypothetical protein TcasGA2_TC002224 [Tribolium ...   148   2e-33
prf||1107279B ORF g                                                   148   3e-33
emb|CAD27357.1| hypothetical protein [Drosophila melanogaster]        148   3e-33
emb|CCU76267.1| Gag-Pol polyprotein [Blumeria graminis f. sp. ho...   148   3e-33
pir||PC1232 copia polyprotein - fruit fly (Drosophila simulans) ...   148   3e-33
dbj|BAA01703.1| unnamed protein product [Drosophila simulans]         148   3e-33
sp|P04146.3|COPIA_DROME RecName: Full=Copia protein; AltName: Fu...   148   3e-33
gb|EFN65994.1| Retrovirus-related Pol polyprotein from transposo...   148   3e-33
emb|CCE34911.1| uncharacterized protein CPUR_08850 [Claviceps pu...   147   3e-33

>ref|XP_002556819.1| Pc06g02160 [Penicillium chrysogenum Wisconsin 54-1255]
            gi|211581432|emb|CAP79209.1| Pc06g02160 [Penicillium
            chrysogenum Wisconsin 54-1255]
          Length = 1531

 Score =  162 bits (409), Expect = 2e-37
 Identities = 89/219 (40%), Positives = 131/219 (59%), Gaps = 1/219 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L ++SN TRPDI FAT  +A++ S P   H + V R++ YLK    K I+++        
Sbjct: 1263 LNFSSNQTRPDIAFATGYVARYASNPNQAHMDAVDRIFAYLKSDARKGIVYSD------- 1315

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
            + G   K F DSDFA    +R+S +G+V  + GGP+ W S+RQK++ATSTM+AEYIA  E
Sbjct: 1316 KHGLQLKGFVDSDFAGCEDSRKSTTGWVFTLAGGPISWSSQRQKTVATSTMDAEYIACAE 1375

Query: 426  ASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYHFTR 250
            A+K A+W+   + DL +    I T  + +Y DN +AL L  N    ++AKHIDV ++F R
Sbjct: 1376 AAKEAMWIRNFINDLHIPGIHIDT--VPLYIDNNAALKLTRNPEFHSRAKHIDVKHNFIR 1433

Query: 249  RCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
              V+   I+ + + TKD LAD+ TK L  S    ++ +L
Sbjct: 1434 EKVEEGLIDTQRVNTKDNLADVFTKALPRSTHEDLVKRL 1472


>gb|AAZ28935.1| polyprotein [Phanerochaete chrysosporium RP-78]
          Length = 1394

 Score =  159 bits (402), Expect = 1e-36
 Identities = 91/211 (43%), Positives = 126/211 (59%), Gaps = 1/211 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L YA+  TRPDI +A   L++F   P   HW  + RV RYLK T    II+   P  Q  
Sbjct: 1170 LMYAAVGTRPDISYAVQTLSQFCERPSTAHWTALKRVLRYLKGTAEWGIIYK-APEAQTT 1228

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
                +   +SD+D+ ++   ++S+SG+V ++GG PVCW S++QKS+A S+MEAEY+A   
Sbjct: 1229 PIEVVG--YSDADWGANPDDQKSISGYVFLLGGAPVCWASRKQKSVALSSMEAEYMAGST 1286

Query: 426  ASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTNST-KAKHIDVAYHFTR 250
            A+  A+W   LL +L  A      N   +Y DNQSALALA  T +  +AKHID+ YHF R
Sbjct: 1287 AASQALWCRMLLEELGFAQ----PNPTLLYMDNQSALALARNTGTQGRAKHIDIRYHFLR 1342

Query: 249  RCVKNSTINVEYIPTKDMLADILTKPLAHSK 157
              + +  I+V + P +D  ADI TKPLA  K
Sbjct: 1343 DKISSKEISVAHCPGEDNPADIFTKPLARQK 1373


>emb|CAI72292.1| putative polyprotein [Phytophthora infestans]
          Length = 1353

 Score =  158 bits (399), Expect = 2e-36
 Identities = 87/213 (40%), Positives = 128/213 (60%), Gaps = 1/213 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y +  TRPDI +  +QLA+F   P  +HW    RV +YLK T++  I++  G    G 
Sbjct: 1131 LMYITTCTRPDIAYVVTQLARFLEDPGTQHWKAAIRVLQYLKSTRHHGIVYKSGTSGFGT 1190

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
             +    + F+D+D+ S+I  RRSVSG ++MIG  PV ++SK Q+++A S+ EAEY+AL  
Sbjct: 1191 -QAVKAEAFTDADWGSNIDDRRSVSGVMVMIGNAPVVFKSKYQRTVALSSAEAEYMALSL 1249

Query: 426  ASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYHFTR 250
             ++  +W   +L+D+    E +G    +V+ DNQ A+ALA N     + KH+D+ +HF R
Sbjct: 1250 CTQEVLWTRAMLKDM--GHEQVGAT--QVWEDNQGAIALASNAGYHARTKHVDIRHHFIR 1305

Query: 249  RCVKNSTINVEYIPTKDMLADILTKPLAHSKAA 151
              V+ STI V YI TK  LAD+LTK L     A
Sbjct: 1306 ENVERSTIKVAYIDTKQQLADMLTKALGTKSLA 1338


>gb|AAZ28936.1| polyprotein [Phanerochaete chrysosporium RP-78]
          Length = 1511

 Score =  156 bits (394), Expect = 9e-36
 Identities = 91/219 (41%), Positives = 133/219 (60%), Gaps = 1/219 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L YA+ +TRPDI +A +QLA+F   P M+HWN + RVY YLK T++  ++          
Sbjct: 1300 LMYAAIATRPDIAYAVNQLARFAENPGMKHWNALRRVYAYLKGTRDLSLVLG-----GDA 1354

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
            R+G +   ++D+D  S  + R++VSG+  +IGG  V W SKRQ+ +A ST EAEY+AL  
Sbjct: 1355 RDGPLVG-YTDADGMS-TEGRQAVSGYAFLIGGA-VSWSSKRQEIVALSTSEAEYVALTH 1411

Query: 426  ASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTN-STKAKHIDVAYHFTR 250
            A+K A+W+   L ++      +    M++Y+DNQSA+ALA       ++KHID+ YHF R
Sbjct: 1412 AAKEALWLRNYLHEV----WQMPLQPMQLYSDNQSAIALARDDRYHARSKHIDIRYHFIR 1467

Query: 249  RCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
              +++  I V Y PT+DM+AD LTK L   KA      L
Sbjct: 1468 YHIEHGNITVTYCPTEDMVADTLTKALPSMKAKHFASSL 1506


>dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1499

 Score =  151 bits (381), Expect = 3e-34
 Identities = 79/201 (39%), Positives = 121/201 (60%), Gaps = 1/201 (0%)
 Frame = -1

Query: 768  STRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGVREGAMT 589
            ++RPDI +A+S L+++   P  +H     RV RY+K T        +G H + V +  + 
Sbjct: 1141 ASRPDIMYASSYLSRYMRSPLKQHLQEAKRVLRYVKGT------LTYGIHFKRVEKPELV 1194

Query: 588  KLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFEASKLAV 409
              FSDSD+A  ++ ++S SG+V  IG G  CW S +QK++A ST EAEYIA+  A+  A+
Sbjct: 1195 G-FSDSDWAGSVEDKKSTSGYVFTIGSGAFCWNSSKQKTVAQSTAEAEYIAVCSAANQAI 1253

Query: 408  WVTRLLRDLRVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYHFTRRCVKNS 232
            W+ RL+ ++    E     G++++ DN+SA+A+  N     + KHID+ YHF R   +N 
Sbjct: 1254 WLQRLVNEIGFKAE----KGIRIFCDNKSAIAIGKNPVQHRRTKHIDIKYHFVREAQQNG 1309

Query: 231  TINVEYIPTKDMLADILTKPL 169
             I +EY P +  +ADILTKPL
Sbjct: 1310 KIKLEYCPGELQIADILTKPL 1330


>ref|XP_002488353.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
            gi|218712171|gb|EED11597.1| conserved hypothetical
            protein [Talaromyces stipitatus ATCC 10500]
          Length = 1345

 Score =  150 bits (380), Expect = 4e-34
 Identities = 84/209 (40%), Positives = 125/209 (59%), Gaps = 5/209 (2%)
 Frame = -1

Query: 780  YASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGVRE 601
            Y    TRPD+ FA S+L+KF   P ++H   + RV RYL  T+N  I +        V  
Sbjct: 1127 YLMICTRPDLAFALSRLSKFVQKPGIKHAAALKRVLRYLAGTQNLGIAYCKSYSNDSVLY 1186

Query: 600  GAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFEAS 421
            G     +SDSDFA+D+  RRS SGF+ ++ GGP+ W+SK+Q  + +ST +AEY+ L  AS
Sbjct: 1187 G-----YSDSDFAADLNNRRSTSGFIFLLNGGPISWKSKQQSLVTSSTHDAEYVGLATAS 1241

Query: 420  KLAVWVTRLLRDL--RVADELIGTNGMKVYTDNQSALALANGTN---STKAKHIDVAYHF 256
               +W+ +L+  +  + A+  + +N   ++ DNQ A+A AN  +   ST++KHID+ +H 
Sbjct: 1242 YEVIWLRKLILAILPQYAEHTMPSN--TIHCDNQGAIATANQPSHSPSTRSKHIDIRFHV 1299

Query: 255  TRRCVKNSTINVEYIPTKDMLADILTKPL 169
             R  + N  I +EYI T +M ADILTK L
Sbjct: 1300 IREAIANGLIRLEYIRTTEMTADILTKAL 1328


>gb|EUC59597.1| Gag-Pol polyprotein/retrotransposon, putative [Rhizoctonia solani
           AG-3 Rhs1AP]
          Length = 497

 Score =  150 bits (379), Expect = 5e-34
 Identities = 83/207 (40%), Positives = 120/207 (57%), Gaps = 1/207 (0%)
 Frame = -1

Query: 786 LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
           L + S  TRPDI FA ++L +  S P  +HW  +  V RYL  T +  ++++        
Sbjct: 245 LNWLSLGTRPDIAFALARLGQAQSNPHPKHWQALTHVLRYLSGTLDMGLVYS------AK 298

Query: 606 REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
            +     + +DS FA  + TRRS SGFV+++GG  V W S++Q  + TS+ EAEYIA+  
Sbjct: 299 ADRPEPHMHTDSAFADCVDTRRSHSGFVVLVGGAAVAWSSRKQAIVTTSSTEAEYIAMGV 358

Query: 426 ASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTN-STKAKHIDVAYHFTR 250
           A+K A W+ RLL DL    E      +++Y DNQS+L LA     ST+ KH+DV YHF R
Sbjct: 359 AAKEAAWMKRLLLDL----EFPNNGPLRIYADNQSSLILATSEKLSTRTKHLDVQYHFVR 414

Query: 249 RCVKNSTINVEYIPTKDMLADILTKPL 169
           +  K      +++ TK  +AD+LTKPL
Sbjct: 415 QLAKMGICVFKWVSTKLNVADVLTKPL 441


>ref|XP_001596107.1| hypothetical protein SS1G_02323 [Sclerotinia sclerotiorum 1980]
            gi|154699731|gb|EDN99469.1| hypothetical protein
            SS1G_02323 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1519

 Score =  150 bits (378), Expect = 7e-34
 Identities = 92/221 (41%), Positives = 128/221 (57%), Gaps = 3/221 (1%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y    +RPDI F+  +L ++   P   + N + RV RYL+ T N R+   FGP   GV
Sbjct: 1302 LMYTMVYSRPDIAFSLGKLNQYMKDPAEFYMNQLRRVMRYLRTTINYRL--RFGPG--GV 1357

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
            R      ++SD+D+AS I  R+S SG V ++GGGPV W S++QK+++TST E+EY+A   
Sbjct: 1358 RN---LVVYSDADYASSIVDRKSTSGVVALLGGGPVFWMSRKQKAVSTSTTESEYVAQSI 1414

Query: 426  ASKLAVWVTRLLRDLRVADELIGTNGMKV--YTDNQSALALANGTNST-KAKHIDVAYHF 256
            A+K   W+ ++LRD+      I  NG KV    DNQ A+AL      T ++KHIDVAYH 
Sbjct: 1415 AAKQGQWLAQVLRDMGYR-HYISENGTKVDMKGDNQGAIALVKNAQLTDRSKHIDVAYHH 1473

Query: 255  TRRCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
             R   +   + V YIPT  M+AD LTKPL        ++ L
Sbjct: 1474 VRDLAEKGMLEVSYIPTDKMVADGLTKPLGKDAFRKFVEML 1514


>ref|XP_969432.2| PREDICTED: similar to Copia protein (Gag-int-pol protein) [Tribolium
            castaneum]
          Length = 1360

 Score =  149 bits (377), Expect = 9e-34
 Identities = 81/214 (37%), Positives = 129/214 (60%), Gaps = 4/214 (1%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y + +TRPDI F  SQL +FN+C    HW    RV RYLK T +  + F    H    
Sbjct: 1147 LTYLAMTTRPDIAFVVSQLGQFNNCYDEEHWKAAKRVMRYLKGTIHLGLSFR-ATHKP-- 1203

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
                  + + D+D+ +  + RRS +GF+ ++ G  + W +K+Q+++A ST EAEY+A+ E
Sbjct: 1204 -----IRAYVDADWGNCTEDRRSFTGFIFLLNGSAISWDTKKQRTVALSTTEAEYMAMAE 1258

Query: 426  ASKLAVWVTRLLRDL---RVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYH 259
             +K A+++ R +++L   ++AD       +K+Y DNQSA+ LA N     ++KHIDV +H
Sbjct: 1259 CAKEAIYLRRFIQELGFDKLAD-------VKIYCDNQSAIRLAENPVFHARSKHIDVRHH 1311

Query: 258  FTRRCVKNSTINVEYIPTKDMLADILTKPLAHSK 157
            F R  +++  +++E+IPT+  +AD LTK LA  K
Sbjct: 1312 FVREVLRDKQVSLEHIPTEQQVADFLTKGLAKQK 1345


>gb|EFA07743.1| hypothetical protein TcasGA2_TC002223 [Tribolium castaneum]
          Length = 1384

 Score =  149 bits (377), Expect = 9e-34
 Identities = 81/214 (37%), Positives = 129/214 (60%), Gaps = 4/214 (1%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y + +TRPDI F  SQL +FN+C    HW    RV RYLK T +  + F    H    
Sbjct: 1171 LTYLAMTTRPDIAFVVSQLGQFNNCYDEEHWKAAKRVMRYLKGTIHLGLSFR-ATHKP-- 1227

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
                  + + D+D+ +  + RRS +GF+ ++ G  + W +K+Q+++A ST EAEY+A+ E
Sbjct: 1228 -----IRAYVDADWGNCTEDRRSFTGFIFLLNGSAISWDTKKQRTVALSTTEAEYMAMAE 1282

Query: 426  ASKLAVWVTRLLRDL---RVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYH 259
             +K A+++ R +++L   ++AD       +K+Y DNQSA+ LA N     ++KHIDV +H
Sbjct: 1283 CAKEAIYLRRFIQELGFDKLAD-------VKIYCDNQSAIRLAENPVFHARSKHIDVRHH 1335

Query: 258  FTRRCVKNSTINVEYIPTKDMLADILTKPLAHSK 157
            F R  +++  +++E+IPT+  +AD LTK LA  K
Sbjct: 1336 FVREVLRDKQVSLEHIPTEQQVADFLTKGLAKQK 1369


>gb|EMR87315.1| putative retroelement pol poly protein [Botryotinia fuckeliana BcDW1]
          Length = 1553

 Score =  149 bits (376), Expect = 1e-33
 Identities = 86/210 (40%), Positives = 129/210 (61%), Gaps = 3/210 (1%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y    +RPDI F   +L+++   P   H + + RV RYL+ T N ++   FGP   GV
Sbjct: 1328 LMYTMVYSRPDIAFGLGKLSQYMKDPADFHMHQLRRVMRYLRTTINYKL--RFGPG--GV 1383

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
            R      ++SD+D+AS++  R+S SG V ++GGGPV W S++Q S++TST E+EYIA   
Sbjct: 1384 RN---LVIYSDADYASNVVDRKSTSGVVALLGGGPVFWMSRKQNSVSTSTTESEYIAQSI 1440

Query: 426  ASKLAVWVTRLLRDLRVADELIGTNG--MKVYTDNQSALALANGTNST-KAKHIDVAYHF 256
            A+K   W+ ++LRD+    + +  NG  +++  DNQ A+AL      T ++KHID+AYH 
Sbjct: 1441 AAKQGQWLAQILRDMGY-KQFVAENGSTVEMKGDNQGAIALVKNAQLTDRSKHIDIAYHH 1499

Query: 255  TRRCVKNSTINVEYIPTKDMLADILTKPLA 166
             R   +   +++ YIPT  M+AD LTKPLA
Sbjct: 1500 VRDLKQKGKVDISYIPTDKMVADGLTKPLA 1529


>gb|EFA07744.1| hypothetical protein TcasGA2_TC002224 [Tribolium castaneum]
          Length = 2378

 Score =  148 bits (374), Expect = 2e-33
 Identities = 81/214 (37%), Positives = 128/214 (59%), Gaps = 4/214 (1%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y + +TRPDI F  SQL +FN+C    HW    RV RYLK T +  + F    H    
Sbjct: 1131 LTYLAMTTRPDIAFVVSQLGQFNNCYDEEHWKAAKRVMRYLKGTIHLGLSFR-ATHKP-- 1187

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
                    + D+D+ +  + RRS +GF+ ++ G  + W +K+Q+++A ST EAEY+A+ E
Sbjct: 1188 -----IHAYVDADWGNCTEDRRSFTGFIFLLNGSAISWDTKKQRTVALSTTEAEYMAMAE 1242

Query: 426  ASKLAVWVTRLLRDL---RVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYH 259
             +K A+++ R +++L   ++AD       +K+Y DNQSA+ LA N     ++KHIDV +H
Sbjct: 1243 CAKEAIYLRRFIQELGFDKLAD-------VKIYCDNQSAIRLAENPVFHARSKHIDVRHH 1295

Query: 258  FTRRCVKNSTINVEYIPTKDMLADILTKPLAHSK 157
            F R  +++  +++E+IPT+  +AD LTK LA  K
Sbjct: 1296 FVREVLRDKQVSLEHIPTEQQVADFLTKGLAKQK 1329



 Score = 60.8 bits (146), Expect = 5e-07
 Identities = 33/96 (34%), Positives = 50/96 (52%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y + +TRPDI F  SQL +FN+C    HW    RV RYLK T +  + F    H    
Sbjct: 2290 LTYLAMTTRPDIAFVVSQLGQFNNCYDEEHWKAAKRVMRYLKGTIHLGLSFR-ATHKP-- 2346

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPV 499
                  + + D+D+ +  + RRS +GF+ ++ G  +
Sbjct: 2347 -----IRAYVDADWGNCTEDRRSFTGFIFLLNGSAI 2377


>prf||1107279B ORF g
          Length = 1410

 Score =  148 bits (373), Expect = 3e-33
 Identities = 87/220 (39%), Positives = 129/220 (58%), Gaps = 2/220 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y    TRPD+  A + L++++S      W  + RV RYLK T + ++IF      +  
Sbjct: 1189 LMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENK 1248

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVL-MIGGGPVCWQSKRQKSIATSTMEAEYIALF 430
              G     + DSD+A     R+S +G++  M     +CW +KRQ S+A S+ EAEY+ALF
Sbjct: 1249 IIG-----YVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALF 1303

Query: 429  EASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTNSTK-AKHIDVAYHFT 253
            EA + A+W+  LL  + +  E    N +K+Y DNQ  +++AN  +  K AKHID+ YHF 
Sbjct: 1304 EAVREALWLKFLLTSINIKLE----NPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFA 1359

Query: 252  RRCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
            R  V+N+ I +EYIPT++ LADI TKPL  ++   + DKL
Sbjct: 1360 REQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKL 1399


>emb|CAD27357.1| hypothetical protein [Drosophila melanogaster]
          Length = 1017

 Score =  148 bits (373), Expect = 3e-33
 Identities = 87/220 (39%), Positives = 129/220 (58%), Gaps = 2/220 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y    TRPD+  A + L++++S      W  + RV RYLK T + ++IF      +  
Sbjct: 796  LMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENK 855

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVL-MIGGGPVCWQSKRQKSIATSTMEAEYIALF 430
              G     + DSD+A     R+S +G++  M     +CW +KRQ S+A S+ EAEY+ALF
Sbjct: 856  IIG-----YVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALF 910

Query: 429  EASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTNSTK-AKHIDVAYHFT 253
            EA + A+W+  LL  + +  E    N +K+Y DNQ  +++AN  +  K AKHID+ YHF 
Sbjct: 911  EAVREALWLKFLLTSINIKLE----NPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFA 966

Query: 252  RRCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
            R  V+N+ I +EYIPT++ LADI TKPL  ++   + DKL
Sbjct: 967  REQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKL 1006


>emb|CCU76267.1| Gag-Pol polyprotein [Blumeria graminis f. sp. hordei DH14]
          Length = 1492

 Score =  148 bits (373), Expect = 3e-33
 Identities = 88/219 (40%), Positives = 128/219 (58%), Gaps = 1/219 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L + +  TRPDI FAT  L++F + P  +H     RV+RYL  T N+ I+     +T   
Sbjct: 1274 LNFLAIQTRPDIAFATGVLSRFLTNPSPQHMKACDRVFRYLAGTINRSIVLGGKGYTA-- 1331

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
                    +SDSD+A DI  RRS SGFV  +GGG V  QSKRQ ++A S+ EAEY  L +
Sbjct: 1332 -----LHGYSDSDYAGDISMRRSTSGFVFFLGGGAVSVQSKRQTTVALSSTEAEYYGLTK 1386

Query: 426  ASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYHFTR 250
            A+  A W+ + L +L        +  +K++ DNQS+LALA N     + KHI + +H+ R
Sbjct: 1387 AAMEASWIRQFLEELGNR-----SKSVKLFGDNQSSLALAENPEFHQRTKHIAIKHHYKR 1441

Query: 249  RCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
              V+N  I++ ++PT+DM+AD LTKPL   K    +++L
Sbjct: 1442 EQVQNGFIDLWFVPTEDMVADGLTKPLPTVKHQHFVEQL 1480


>pir||PC1232 copia polyprotein - fruit fly (Drosophila simulans) retrotransposon
            copia (fragments)
          Length = 787

 Score =  148 bits (373), Expect = 3e-33
 Identities = 87/220 (39%), Positives = 129/220 (58%), Gaps = 2/220 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y    TRPD+  A + L++++S      W  + RV RYLK T + ++IF      +  
Sbjct: 566  LMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKRNLAFENK 625

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVL-MIGGGPVCWQSKRQKSIATSTMEAEYIALF 430
              G     + DSD+A     R+S +G++  M     +CW +KRQ S+A S+ EAEY+ALF
Sbjct: 626  IIG-----YVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALF 680

Query: 429  EASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTNSTK-AKHIDVAYHFT 253
            EA + A+W+  LL  + +  E    N +K+Y DNQ  +++AN  +  K AKHID+ YHF 
Sbjct: 681  EAVREALWLKFLLTSINIKLE----NPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFA 736

Query: 252  RRCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
            R  V+N+ I +EYIPT++ LADI TKPL  ++   + DKL
Sbjct: 737  REQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKL 776


>dbj|BAA01703.1| unnamed protein product [Drosophila simulans]
          Length = 1409

 Score =  148 bits (373), Expect = 3e-33
 Identities = 87/220 (39%), Positives = 129/220 (58%), Gaps = 2/220 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y    TRPD+  A + L++++S      W  + RV RYLK T + ++IF      +  
Sbjct: 1188 LMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKRNLAFENK 1247

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVL-MIGGGPVCWQSKRQKSIATSTMEAEYIALF 430
              G     + DSD+A     R+S +G++  M     +CW +KRQ S+A S+ EAEY+ALF
Sbjct: 1248 IIG-----YVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALF 1302

Query: 429  EASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTNSTK-AKHIDVAYHFT 253
            EA + A+W+  LL  + +  E    N +K+Y DNQ  +++AN  +  K AKHID+ YHF 
Sbjct: 1303 EAVREALWLKFLLTSINIKLE----NPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFA 1358

Query: 252  RRCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
            R  V+N+ I +EYIPT++ LADI TKPL  ++   + DKL
Sbjct: 1359 REQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKL 1398


>sp|P04146.3|COPIA_DROME RecName: Full=Copia protein; AltName: Full=Gag-int-pol protein;
            Contains: RecName: Full=Copia VLP protein; Contains:
            RecName: Full=Copia protease gi|1491679|emb|CAA26444.1|
            31 KD polyprotein [Drosophila melanogaster]
            gi|19309876|emb|CAA28054.2| hypothetical protein
            [Drosophila melanogaster] gi|41058041|gb|AAR99086.1|
            SD14423p [Drosophila melanogaster]
          Length = 1409

 Score =  148 bits (373), Expect = 3e-33
 Identities = 87/220 (39%), Positives = 129/220 (58%), Gaps = 2/220 (0%)
 Frame = -1

Query: 786  LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
            L Y    TRPD+  A + L++++S      W  + RV RYLK T + ++IF      +  
Sbjct: 1188 LMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENK 1247

Query: 606  REGAMTKLFSDSDFASDIQTRRSVSGFVL-MIGGGPVCWQSKRQKSIATSTMEAEYIALF 430
              G     + DSD+A     R+S +G++  M     +CW +KRQ S+A S+ EAEY+ALF
Sbjct: 1248 IIG-----YVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALF 1302

Query: 429  EASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALANGTNSTK-AKHIDVAYHFT 253
            EA + A+W+  LL  + +  E    N +K+Y DNQ  +++AN  +  K AKHID+ YHF 
Sbjct: 1303 EAVREALWLKFLLTSINIKLE----NPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFA 1358

Query: 252  RRCVKNSTINVEYIPTKDMLADILTKPLAHSKAASILDKL 133
            R  V+N+ I +EYIPT++ LADI TKPL  ++   + DKL
Sbjct: 1359 REQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKL 1398


>gb|EFN65994.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Camponotus floridanus]
          Length = 239

 Score =  148 bits (373), Expect = 3e-33
 Identities = 83/211 (39%), Positives = 128/211 (60%), Gaps = 1/211 (0%)
 Frame = -1

Query: 786 LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
           L Y +++TRPDI FA S L ++N+C    HW    RV RYLK   +  + F       G 
Sbjct: 32  LTYLASTTRPDISFAVSNLGQYNNCFGANHWKAAKRVLRYLKGNIDVGLTF-------GS 84

Query: 606 REGAMTKLFSDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIALFE 427
             G++   F+D+D+ +  + RRS SG++ M+ GGPV W+S++Q+++A ST EAEY+AL E
Sbjct: 85  DSGSIVG-FADADWGNT-EDRRSFSGYIFMLNGGPVSWESRKQRTVALSTTEAEYMALTE 142

Query: 426 ASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYHFTR 250
           +SK A+++ R L +L   D     +G+ +Y DNQSA+ L  N     ++KHID+ +HF R
Sbjct: 143 SSKEAIFLRRFLIELGSND----LSGIIIYCDNQSAMKLTENPVYHGRSKHIDIRHHFIR 198

Query: 249 RCVKNSTINVEYIPTKDMLADILTKPLAHSK 157
             +     ++++I T+D  AD LTK L  +K
Sbjct: 199 EAIGRKEFHLKHISTEDQAADFLTKGLVKAK 229


>emb|CCE34911.1| uncharacterized protein CPUR_08850 [Claviceps purpurea 20.1]
          Length = 626

 Score =  147 bits (372), Expect = 3e-33
 Identities = 84/209 (40%), Positives = 124/209 (59%), Gaps = 3/209 (1%)
 Frame = -1

Query: 786 LGYASNSTRPDICFATSQLAKFNSCPFMRHWNGVCRVYRYLKETKNKRIIFNFGPHTQGV 607
           L Y    TRPDI FA S L++F S P   H + + RV+RYL  T++ ++++         
Sbjct: 413 LMYLMLGTRPDIAFAVSCLSRFMSNPTSTHNSAIKRVFRYLNATQDLQLVY--------- 463

Query: 606 REGAMTKLF--SDSDFASDIQTRRSVSGFVLMIGGGPVCWQSKRQKSIATSTMEAEYIAL 433
            +G +  L   +D+D+A DI TRRS SG++  +G G + W SKRQ ++A ST EAEY+  
Sbjct: 464 -KGPLRPLTGNTDADWAGDISTRRSTSGYIFSLGSGAISWSSKRQPTVALSTCEAEYMGQ 522

Query: 432 FEASKLAVWVTRLLRDLRVADELIGTNGMKVYTDNQSALALA-NGTNSTKAKHIDVAYHF 256
            +A+K A+W+ RLL +L        T    ++ DNQ A+ALA N  +  + KHID+ +HF
Sbjct: 523 TQAAKEAIWLKRLLGELLNEQPAAVT----IFGDNQGAIALAKNPQHHARTKHIDIQWHF 578

Query: 255 TRRCVKNSTINVEYIPTKDMLADILTKPL 169
            R       IN+E++P+ D +AD LTKPL
Sbjct: 579 VREKQIAGEINLEHVPSADQIADGLTKPL 607

