BLASTX nr result

ID: Chrysanthemum22_contig00036510 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00036510
         (793 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ADB85430.1| putative retrotransposon protein [Phyllostachys e...   358   e-114
dbj|BAA22288.1| polyprotein [Oryza australiensis]                     356   e-110
ref|WP_042516817.1| hypothetical protein [Lactobacillus brevis] ...   328   e-110
gb|ADB85429.1| putative retrotransposon protein [Phyllostachys e...   353   e-109
gb|AAC26250.1| contains similarity to reverse transcriptase (Pfa...   346   e-109
gb|KYP76577.1| Retrovirus-related Pol polyprotein from transposo...   328   e-109
gb|EXX50149.1| gag-pol fusion protein [Rhizophagus irregularis D...   350   e-108
emb|CAD39797.2| OSJNBa0071G03.10 [Oryza sativa Japonica Group]        343   e-108
gb|PKI72506.1| hypothetical protein CRG98_007109 [Punica granatum]    339   e-108
gb|PNX87842.1| retrotransposon protein putative Ty1-copia subcla...   331   e-108
gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus]    348   e-108
gb|AAP44605.1| putative polyprotein [Oryza sativa Japonica Group]     344   e-107
gb|ABF97047.1| retrotransposon protein, putative, Ty1-copia subc...   344   e-106
gb|AAX94813.1| retrotransposon protein, putative, Ty1-copia sub-...   338   e-106
gb|AMY96445.1| gag/pol protein [Momordica dioica]                     342   e-105
gb|PPZ05609.1| hypothetical protein C5P41_24865, partial [Escher...   332   e-104
gb|PPY93112.1| hypothetical protein C5P31_25365, partial [Escher...   330   e-104
gb|AAV85747.1| Integrase core domain, putative [Oryza sativa Jap...   335   e-103
gb|AAS01945.1| putative polyprotein [Oryza sativa Japonica Group]     330   e-102
gb|ABF97213.1| retrotransposon protein, putative, Ty1-copia subc...   330   e-102

>gb|ADB85430.1| putative retrotransposon protein [Phyllostachys edulis]
          Length = 896

 Score =  358 bits (918), Expect = e-114
 Identities = 180/264 (68%), Positives = 209/264 (79%)
 Frame = -1

Query: 793  EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614
            EIK FGF +N +EPCVY K SGS +  LILYVDDIL++GN IP L  VK+ L K FSMKD
Sbjct: 541  EIKRFGFIKNKEEPCVYMKVSGSTLVILILYVDDILLVGNDIPMLESVKSSLRKSFSMKD 600

Query: 613  LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCA 434
            LG+AAYILGI+IYRDRS+RLIGLSQ  YIDK+L RFNMQNSK+GFLPM     LS   C 
Sbjct: 601  LGDAAYILGIRIYRDRSKRLIGLSQEMYIDKVLNRFNMQNSKRGFLPMAHGINLSKNQCP 660

Query: 433  STPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKY 254
            +T  E  +M  +PYASA+GSIMYA+ CTRPDV++A ++TSRYQ +P   HW AVKNILKY
Sbjct: 661  TTTDERDKMSDIPYASAIGSIMYAMICTRPDVSYALSVTSRYQADPSEGHWTAVKNILKY 720

Query: 253  LRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTT 74
            LR TK++FLVYGG  D EL V G+ DASFQTDKDD +SQ+G+VF++NGGAV WKS KQ T
Sbjct: 721  LRRTKDVFLVYGG--DEELVVNGYTDASFQTDKDDYRSQSGFVFILNGGAVSWKSSKQET 778

Query: 73   VAMSITEAEYIAASEAAMEAVWIR 2
            VA S TEAEYIAASEAA E VWIR
Sbjct: 779  VADSTTEAEYIAASEAAKEGVWIR 802


>dbj|BAA22288.1| polyprotein [Oryza australiensis]
          Length = 1317

 Score =  356 bits (914), Expect = e-110
 Identities = 182/263 (69%), Positives = 207/263 (78%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            IK FGF +N +E CVY K SGS + FLILYVDDIL++GN IP L  VK+ L   FSMKDL
Sbjct: 962  IKGFGFIKNEEEACVYKKVSGSAIVFLILYVDDILLIGNDIPMLESVKSSLKNSFSMKDL 1021

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNM +SKKGFLPM     LS   C  
Sbjct: 1022 GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMHDSKKGFLPMSHGINLSKNQCPQ 1081

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  VPYASA+GSIMYA+ CTRPDV++A + TSRYQ +PG  HW AVKNILKYL
Sbjct: 1082 THDERNKMGMVPYASAIGSIMYAMLCTRPDVSYALSATSRYQSDPGEGHWTAVKNILKYL 1141

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+MFLVYGG  D  L V+G+ DASFQTDKDD +SQ+G+VF +NGGAV WKS KQ TV
Sbjct: 1142 RRTKDMFLVYGGEED--LVVSGYTDASFQTDKDDYRSQSGFVFCLNGGAVSWKSSKQDTV 1199

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S TEAEYIAASEAA EAVWI+
Sbjct: 1200 ADSTTEAEYIAASEAAKEAVWIK 1222


>ref|WP_042516817.1| hypothetical protein [Lactobacillus brevis]
 gb|KIO93795.1| hypothetical protein QP38_2416 [Lactobacillus brevis]
          Length = 283

 Score =  328 bits (842), Expect = e-110
 Identities = 167/260 (64%), Positives = 196/260 (75%)
 Frame = -1

Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
           IK+FGF Q + E C+Y K SGS V FLILYVDDIL++GN++  L  +K YL K FSMKDL
Sbjct: 26  IKAFGFIQVVGESCIYKKVSGSSVVFLILYVDDILLIGNNVEFLESIKDYLNKSFSMKDL 85

Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
           GEAAYILGIKIYRDRS+R+IGLSQS Y+D +LKRF M+ SKKGFLP+     LS   C +
Sbjct: 86  GEAAYILGIKIYRDRSKRVIGLSQSTYLDNVLKRFKMEQSKKGFLPVLQGTKLSKTQCPA 145

Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
           T  +   M+ VPYASA+GSIMYA+ CTRPDV+ A ++  R Q NP V HW AVKNILKYL
Sbjct: 146 TDEDREHMRSVPYASAIGSIMYAMMCTRPDVSLAISMAGRSQSNPAVHHWTAVKNILKYL 205

Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
           + TKEMFLVYGG  D EL V G+ DASF TD DD+KSQT YVF++N GAV W S KQ+ V
Sbjct: 206 KRTKEMFLVYGG--DEELAVKGYVDASFDTDPDDSKSQTRYVFILNEGAVSWCSSKQSVV 263

Query: 70  AMSITEAEYIAASEAAMEAV 11
           A S  EAEY+AASEAA E V
Sbjct: 264 ADSTCEAEYMAASEAAKEGV 283


>gb|ADB85429.1| putative retrotransposon protein [Phyllostachys edulis]
          Length = 1313

 Score =  353 bits (906), Expect = e-109
 Identities = 184/266 (69%), Positives = 210/266 (78%), Gaps = 2/266 (0%)
 Frame = -1

Query: 793  EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614
            EIK FGF +N +EPCVY K SGS +  LILYVDDIL++GN IP L  VKA L   FSMKD
Sbjct: 957  EIKRFGFVKNKEEPCVYMKVSGSTLVILILYVDDILLIGNDIPMLESVKASLKNSFSMKD 1016

Query: 613  LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGC- 437
            LGEAAYILGIKIYRDRSRRLIGLSQS YIDK+L RFNMQN+KKGFLPM  +HG+S     
Sbjct: 1017 LGEAAYILGIKIYRDRSRRLIGLSQSTYIDKVLIRFNMQNTKKGFLPM--SHGISPSKSQ 1074

Query: 436  -ASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNIL 260
              ST  E  RM  +PYASA+GSIMYA+ CTR DV++A ++TSRYQ +PG  HW AVKNIL
Sbjct: 1075 RPSTTDERDRMNGIPYASAIGSIMYAMICTRQDVSYALSVTSRYQADPGECHWTAVKNIL 1134

Query: 259  KYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQ 80
            KYLR TK+ FL+YGG  D EL V G+ DASFQTDKDD +SQ+G+VF++NGGAV WKS KQ
Sbjct: 1135 KYLRRTKDAFLIYGG--DEELVVNGYTDASFQTDKDDYRSQSGFVFILNGGAVSWKSSKQ 1192

Query: 79   TTVAMSITEAEYIAASEAAMEAVWIR 2
             TVA S T+AEYIAASEAA E VWIR
Sbjct: 1193 ETVADSTTKAEYIAASEAAKEGVWIR 1218


>gb|AAC26250.1| contains similarity to reverse transcriptase (Pfam: rvt.hmm, score
            19.29) [Arabidopsis thaliana]
 emb|CAB80804.1| putative retrotransposon protein [Arabidopsis thaliana]
          Length = 964

 Score =  346 bits (888), Expect = e-109
 Identities = 174/263 (66%), Positives = 203/263 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            IK F F +N +EPCVY K SGS V FL+LYVDDIL++GN IP L  VK +LG CFSMKD+
Sbjct: 609  IKEFDFIRNEEEPCVYKKTSGSAVAFLVLYVDDILLLGNDIPLLQSVKTWLGSCFSMKDM 668

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRDR  ++IGLSQ  YIDK+L RFNM +SKKGF+PM     LS   C S
Sbjct: 669  GEAAYILGIRIYRDRLNKIIGLSQDTYIDKVLHRFNMHDSKKGFIPMSHGITLSKTQCPS 728

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  RM K+PYASA+GSIMYA+  TRPDVA A ++TSRYQ +PG  HW  V+NI KYL
Sbjct: 729  THDERERMSKIPYASAIGSIMYAMLYTRPDVACALSMTSRYQSDPGESHWIVVRNIFKYL 788

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+ FLVYGG+   EL V+G+ DASFQTDKDD +SQ+G+ F +NGGAV WKS KQ+TV
Sbjct: 789  RRTKDKFLVYGGS--EELVVSGYTDASFQTDKDDFRSQSGFFFCLNGGAVSWKSTKQSTV 846

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S TEAEYIAASEAA E VWIR
Sbjct: 847  ADSTTEAEYIAASEAAKEVVWIR 869


>gb|KYP76577.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 368

 Score =  328 bits (842), Expect = e-109
 Identities = 162/263 (61%), Positives = 201/263 (76%)
 Frame = -1

Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
           IK F F +  +EPCVY + SGS + FL+LYVDDIL++GN IP L   K +L + FSMKDL
Sbjct: 58  IKKFDFVRCEEEPCVYKRVSGSTIIFLMLYVDDILLIGNDIPFLQSTKIWLSEQFSMKDL 117

Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
           GEAAYILGIKIYRDRS+R++GLSQS YID IL+R+NM+NSK+G+ P+     LS++ C  
Sbjct: 118 GEAAYILGIKIYRDRSKRMLGLSQSMYIDTILRRYNMENSKRGYFPIGTGVTLSNEDCPK 177

Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
           T  E  RM +VPYASAVG+IMY + CTRPDVAFA  + SRYQ NPG ++W+ VK ILKYL
Sbjct: 178 TLEERTRMNRVPYASAVGAIMYIMTCTRPDVAFAPGVVSRYQANPGEENWKVVKTILKYL 237

Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
           R T++ FL+YGG    EL + G+ DASF +DKDD+KS +GYVF + GGAV WKS KQ TV
Sbjct: 238 RRTQDQFLIYGG---TELMLKGYTDASFASDKDDSKSISGYVFTLYGGAVSWKSSKQATV 294

Query: 70  AMSITEAEYIAASEAAMEAVWIR 2
           A S TEAEYIAAS+A  EAVW++
Sbjct: 295 ADSTTEAEYIAASDATKEAVWMK 317


>gb|EXX50149.1| gag-pol fusion protein [Rhizophagus irregularis DAOM 197198w]
          Length = 1303

 Score =  350 bits (899), Expect = e-108
 Identities = 176/264 (66%), Positives = 212/264 (80%)
 Frame = -1

Query: 793  EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614
            ++K FGF+++ DE CVY KASGS VTFL+LYVDDIL+MGN IP+L +VKA+LGKCF+MKD
Sbjct: 944  KVKEFGFSRSEDESCVYVKASGSIVTFLVLYVDDILLMGNDIPTLQDVKAWLGKCFAMKD 1003

Query: 613  LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCA 434
            LGEAAYILGI+I RDR +RLIGLSQ  Y++K+LKRF+M+NSKKG LP+Q N  LS     
Sbjct: 1004 LGEAAYILGIRILRDRKKRLIGLSQGTYLEKVLKRFSMENSKKGELPIQSNAKLSKTQSP 1063

Query: 433  STPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKY 254
            ST  E+A M +VPYASAVGSIMYA+ CTRPDVAFA ++ SRYQ NPG  HW AVKNILKY
Sbjct: 1064 STDEEIAEMSRVPYASAVGSIMYAMTCTRPDVAFALSMVSRYQGNPGRAHWIAVKNILKY 1123

Query: 253  LRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTT 74
            LR TK M LV GG+    L V G+ DASFQTD+D  +SQ+G+VF++NGGAV WKS KQ T
Sbjct: 1124 LRRTKNMVLVLGGSD--TLRVEGYTDASFQTDRDSGRSQSGWVFLLNGGAVTWKSSKQET 1181

Query: 73   VAMSITEAEYIAASEAAMEAVWIR 2
            VA S  E+EYIAASEA+ EA W++
Sbjct: 1182 VADSTCESEYIAASEASKEAAWLK 1205


>emb|CAD39797.2| OSJNBa0071G03.10 [Oryza sativa Japonica Group]
          Length = 948

 Score =  343 bits (881), Expect = e-108
 Identities = 174/263 (66%), Positives = 203/263 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            +K  GF +N  EPCVY K SGS + FLILYVDDIL++GN IP L  VK  L   FSMKDL
Sbjct: 684  VKVLGFFKNEQEPCVYKKISGSALVFLILYVDDILLIGNDIPILESVKTLLKNSFSMKDL 743

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM  +  L    C  
Sbjct: 744  GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHDINLGKNQCPQ 803

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  +PYASA+GSIMYA+ CT PDV++A + T+RYQ +PG  HW AVKNILKYL
Sbjct: 804  TTDERNKMSVIPYASAIGSIMYAMLCTCPDVSYALSATNRYQSDPGESHWIAVKNILKYL 863

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R T++MFLVYGG    EL V G+ DASFQTDKDD +S++G+VF +NGG V WKS KQ TV
Sbjct: 864  RRTEDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSRSGFVFCLNGGVVSWKSSKQDTV 921

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S TEAEYIAASEAA +AVWI+
Sbjct: 922  ADSTTEAEYIAASEAAKDAVWIK 944


>gb|PKI72506.1| hypothetical protein CRG98_007109 [Punica granatum]
          Length = 783

 Score =  339 bits (870), Expect = e-108
 Identities = 174/262 (66%), Positives = 205/262 (78%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            IK FGF +N DEPCVY K SGS V FL+LYVDDIL++GN I SL  VK +LG+CFSMKDL
Sbjct: 439  IKEFGFIKNEDEPCVYKKVSGSVVIFLVLYVDDILLIGNDILSLQSVKTWLGRCFSMKDL 498

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEA Y+LGIKIYRDRS RL+GLSQSAYIDK+L RF+MQ+SKKG LPM     LS     S
Sbjct: 499  GEATYVLGIKIYRDRSNRLLGLSQSAYIDKVLWRFSMQDSKKGSLPMLHGISLSKAQSPS 558

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  R+ ++PYASA+GSIMYA+ CTR +V++  ++TSRYQ +PG +HW AVKNILKYL
Sbjct: 559  TREERDRINRIPYASAIGSIMYAMLCTRSNVSYTLSMTSRYQSDPGERHWIAVKNILKYL 618

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TKE+FLVYGG  + EL V G+ D SFQT+KDD++SQ+GYV  +NGGAV WKS KQ TV
Sbjct: 619  RRTKEIFLVYGG--EEELVVRGYTDVSFQTNKDDSRSQSGYVLCLNGGAVSWKSSKQETV 676

Query: 70   AMSITEAEYIAASEAAMEAVWI 5
            A S  EAEYIAAS AA EAV I
Sbjct: 677  ADSTIEAEYIAASNAAKEAVGI 698


>gb|PNX87842.1| retrotransposon protein putative Ty1-copia subclass, partial
            [Trifolium pratense]
          Length = 511

 Score =  331 bits (848), Expect = e-108
 Identities = 162/255 (63%), Positives = 198/255 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            I+ F F +  +EPCVY K SGS + FL+LYVDDIL+ GN IPS+   K +L + FSMKDL
Sbjct: 260  IEKFNFVKCEEEPCVYKKISGSSIIFLVLYVDDILLFGNDIPSMQSTKVWLSEQFSMKDL 319

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGIKIYRDRS+RL+GLSQS YID ILKR+NM+ SK+G+LP+ +   LS + C  
Sbjct: 320  GEAAYILGIKIYRDRSKRLLGLSQSMYIDTILKRYNMEKSKRGYLPVGMGVSLSRENCPK 379

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  RM +VPYASAVG+IMY + CTRPDVA+A  +TSRYQ NPG +HW+ VK ILKYL
Sbjct: 380  TLEERERMSRVPYASAVGAIMYTMTCTRPDVAYALGVTSRYQANPGEEHWKVVKTILKYL 439

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+ FL+YG   ++EL + G+ DASF +DKDD+KS +GYVF +NGGA+ WKS KQ TV
Sbjct: 440  RRTKDQFLIYG---NSELSLKGYTDASFASDKDDSKSISGYVFTLNGGAISWKSSKQATV 496

Query: 70   AMSITEAEYIAASEA 26
            A S TEAEYIAASEA
Sbjct: 497  ADSTTEAEYIAASEA 511


>gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus]
          Length = 1325

 Score =  348 bits (894), Expect = e-108
 Identities = 168/264 (63%), Positives = 213/264 (80%)
 Frame = -1

Query: 793  EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614
            ++K FGF +N DEPCVY KASGS ++FLILYVDDILI+GN+IP L E+K +LG CF+M+D
Sbjct: 966  KVKEFGFVKNEDEPCVYRKASGSAISFLILYVDDILIIGNNIPMLKEIKHWLGSCFAMQD 1025

Query: 613  LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCA 434
            LGEAAYILGIKIYR+RS+RL+GL+QS YID++++RF M+NSKKG +PM     L      
Sbjct: 1026 LGEAAYILGIKIYRNRSKRLLGLTQSTYIDQVMRRFKMENSKKGGVPMTKGTVLDKSQAP 1085

Query: 433  STPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKY 254
            S   E+ +M+ VPYASA+G IMYA+ CTRPDV++A ++TSRYQQNPGV HW AVKNILKY
Sbjct: 1086 SEDREIKQMEGVPYASAIGFIMYAMVCTRPDVSYALSMTSRYQQNPGVAHWTAVKNILKY 1145

Query: 253  LRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTT 74
            LR TK+MFL++GG  + EL V  + DASFQTD+D ++SQTG+VF +NGGAV WKS KQ+ 
Sbjct: 1146 LRRTKDMFLIFGG-VNEELTVKCYTDASFQTDRDTSRSQTGFVFTLNGGAVSWKSSKQSV 1204

Query: 73   VAMSITEAEYIAASEAAMEAVWIR 2
            VA S TE+EYIAAS+ A EA W++
Sbjct: 1205 VADSTTESEYIAASDVAKEAAWMK 1228


>gb|AAP44605.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1161

 Score =  344 bits (883), Expect = e-107
 Identities = 177/263 (67%), Positives = 203/263 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            +K+ GF +N +EPCVY K SGS + FLILYVDDIL++GN IP L  VK  L   FSMKDL
Sbjct: 824  VKALGFVKNEEEPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVKTSLKYSFSMKDL 883

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM     L    C  
Sbjct: 884  GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 943

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  +PYASA+GSIMYA+ CTR DV++A + TSRYQ + G  HW AVKNILKYL
Sbjct: 944  TTDERNKMSVIPYASAIGSIMYAMLCTRLDVSYALSATSRYQSDLGESHWIAVKNILKYL 1003

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+MFLVYG     EL V G+ DASFQTDKDD +SQ+G+VF +NGGAV WKS KQ TV
Sbjct: 1004 RRTKDMFLVYG--RQEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTV 1061

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S TEAEYIAASEAA EAVWI+
Sbjct: 1062 ADSTTEAEYIAASEAAKEAVWIK 1084


>gb|ABF97047.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
            Japonica Group]
          Length = 1248

 Score =  344 bits (883), Expect = e-106
 Identities = 177/263 (67%), Positives = 203/263 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            +K+ GF +N +EPCVY K SGS + FLILYVDDIL++GN IP L  VK  L   FSMKDL
Sbjct: 911  VKALGFVKNEEEPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVKTSLKYSFSMKDL 970

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM     L    C  
Sbjct: 971  GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 1030

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  +PYASA+GSIMYA+ CTR DV++A + TSRYQ + G  HW AVKNILKYL
Sbjct: 1031 TTDERNKMSVIPYASAIGSIMYAMLCTRLDVSYALSATSRYQSDLGESHWIAVKNILKYL 1090

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+MFLVYG     EL V G+ DASFQTDKDD +SQ+G+VF +NGGAV WKS KQ TV
Sbjct: 1091 RRTKDMFLVYG--RQEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTV 1148

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S TEAEYIAASEAA EAVWI+
Sbjct: 1149 ADSTTEAEYIAASEAAKEAVWIK 1171


>gb|AAX94813.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
            Japonica Group]
 gb|ABA93176.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
            Japonica Group]
          Length = 938

 Score =  338 bits (866), Expect = e-106
 Identities = 172/260 (66%), Positives = 199/260 (76%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            +K+ GF +N  EPCVY K SGS + FLILYVDDIL++ N IP L  VK  L   FSMKDL
Sbjct: 681  VKALGFVKNEQEPCVYKKISGSALVFLILYVDDILLIENDIPMLESVKTSLKNSFSMKDL 740

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IY+DRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM     L    C  
Sbjct: 741  GEAAYILGIRIYKDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 800

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  +PYASA+GSIMYA+ CTRPDV++A + TS+YQ +PG  HW A+KNILKYL
Sbjct: 801  TTNERNKMSVIPYASAIGSIMYAMLCTRPDVSYALSATSQYQSDPGESHWIALKNILKYL 860

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+MFLVYGG    EL V G+ DASFQ DKDD +SQ+G+VF +NGGAV WKS KQ TV
Sbjct: 861  RRTKDMFLVYGG--QEELVVNGYTDASFQIDKDDFRSQSGFVFYLNGGAVSWKSSKQDTV 918

Query: 70   AMSITEAEYIAASEAAMEAV 11
            A S TEAEYIAASEAA E V
Sbjct: 919  ADSTTEAEYIAASEAAKEVV 938


>gb|AMY96445.1| gag/pol protein [Momordica dioica]
          Length = 1313

 Score =  342 bits (878), Expect = e-105
 Identities = 172/263 (65%), Positives = 203/263 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            IK+FGF QN+DE CVY K SGS V FLILYVDDIL++GN +  L +VK +L   FSMKDL
Sbjct: 988  IKAFGFIQNVDESCVYKKISGSVVAFLILYVDDILLIGNDVEYLEDVKKWLNTSFSMKDL 1047

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEA YILGI+IYRDRS + IG+SQS YIDK+L RF MQ+SKKG LP +    LS + C  
Sbjct: 1048 GEAQYILGIRIYRDRSNKTIGMSQSTYIDKVLSRFKMQDSKKGLLPFRHGIHLSKEQCPK 1107

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            TP EV  M+ +PY+SA+GS+MYA+ CTRPDV +A +I SRYQ NPG  HW AVKNILKYL
Sbjct: 1108 TPQEVEDMRNIPYSSAIGSLMYAMLCTRPDVCYALSIVSRYQSNPGRDHWTAVKNILKYL 1167

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R T+ MFLVYGG  D +L V G+ D+SFQTDKDD+KSQ+G VF +NGGAV W+S KQT V
Sbjct: 1168 RRTRNMFLVYGG--DKDLAVKGYTDSSFQTDKDDSKSQSG-VFTLNGGAVSWRSSKQTCV 1224

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S  EAEY+AA EAA EAVWIR
Sbjct: 1225 ADSTCEAEYVAACEAAKEAVWIR 1247


>gb|PPZ05609.1| hypothetical protein C5P41_24865, partial [Escherichia coli]
          Length = 859

 Score =  332 bits (850), Expect = e-104
 Identities = 160/263 (60%), Positives = 205/263 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            IKSFGF +N DEPCVY K S S +TFL+LYVDDIL+MGN    L  +K +L   FSMKDL
Sbjct: 563  IKSFGFIKNEDEPCVYKKVSDSAITFLVLYVDDILLMGNDTGMLTTIKVWLSNTFSMKDL 622

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEA YILGI+IYRDR++R+IGLSQS Y++K+LKRFNM +SK+G LP++    LS +    
Sbjct: 623  GEATYILGIRIYRDRAKRIIGLSQSLYLEKVLKRFNMLDSKRGLLPVRHGIHLSKEMSPK 682

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            TP E  +M ++PYASA+GS+MYA+ CTRP++A+A ++TSRYQ NPG++HW A+KNILKYL
Sbjct: 683  TPEERDKMARIPYASAIGSLMYAMLCTRPNIAYAVSLTSRYQSNPGLEHWIAIKNILKYL 742

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK++FL+YGG    +L + G+ D+ FQ+D DD KS + YVF+ NGGAV WKS KQ+T 
Sbjct: 743  RRTKDLFLIYGG---GDLQLDGYTDSDFQSDIDDRKSTSRYVFICNGGAVSWKSFKQSTT 799

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S  EAEYIAAS+AA EAVWI+
Sbjct: 800  ADSTIEAEYIAASDAAKEAVWIK 822


>gb|PPY93112.1| hypothetical protein C5P31_25365, partial [Escherichia coli]
          Length = 813

 Score =  330 bits (847), Expect = e-104
 Identities = 161/263 (61%), Positives = 205/263 (77%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            IKSFGF +N DEPCVY K S S +TFL+LYVDDIL+MGN    L  +K +L   FSMKDL
Sbjct: 472  IKSFGFIKNEDEPCVYKKVSDSAITFLVLYVDDILLMGNDTGMLTTIKVWLSNTFSMKDL 531

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEA YILGI+IYRDR++R+IGLSQS Y++K+LKRFNM +SK+G L ++    LS +    
Sbjct: 532  GEATYILGIRIYRDRAKRIIGLSQSLYLEKVLKRFNMLDSKRGLLLVRHGIHLSKEMSPK 591

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            TP E  +M ++PYASA+GS+MYA+ CTRPD+A+A ++TSRYQ NPG++H  AVKNILKYL
Sbjct: 592  TPEERDKMARIPYASAIGSLMYAMLCTRPDIAYAISLTSRYQSNPGLEHSIAVKNILKYL 651

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK++FL+YGG    +L + G+ D+ FQ+D DD KS +GYVF+ NGGAV WKS KQ+T 
Sbjct: 652  RRTKDLFLIYGG---GDLQLDGYTDSDFQSDIDDRKSTSGYVFICNGGAVSWKSSKQSTT 708

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
            A S TEAEYIAAS+AA EA+WI+
Sbjct: 709  ADSTTEAEYIAASDAAKEAIWIK 731


>gb|AAV85747.1| Integrase core domain, putative [Oryza sativa Japonica Group]
 gb|AAX92956.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
            Japonica Group]
 gb|ABA92827.2| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
            Japonica Group]
          Length = 1184

 Score =  335 bits (858), Expect = e-103
 Identities = 172/263 (65%), Positives = 198/263 (75%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            +K+ GF +N +EPCVY K SGS + FLILYVDDIL++GN I  L  VK  L   FSMKDL
Sbjct: 833  VKALGFVRNEEEPCVYKKISGSALVFLILYVDDILLIGNDISMLESVKTSLKNSFSMKDL 892

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM     L    C  
Sbjct: 893  GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 952

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  +PYASA+GSIMYA+ CTRPDV++A + TSRYQ +PG  HW AVKNILKYL
Sbjct: 953  TTDERNKMSVIPYASAIGSIMYAMLCTRPDVSYALSATSRYQSDPGESHWIAVKNILKYL 1012

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+MFL YGG    EL V G+ DASFQ DKDD +SQ+G+VF +NGGAV WKS KQ  V
Sbjct: 1013 RRTKDMFLAYGG--QEELVVNGYTDASFQIDKDDFRSQSGFVFCLNGGAVSWKSSKQDIV 1070

Query: 70   AMSITEAEYIAASEAAMEAVWIR 2
              S TEAEYIAAS    EAVWI+
Sbjct: 1071 VDSTTEAEYIAAS----EAVWIK 1089


>gb|AAS01945.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1084

 Score =  330 bits (846), Expect = e-102
 Identities = 168/260 (64%), Positives = 196/260 (75%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            +K+ GF +N + PCVY K SGS + FLILYVDDIL++GN IP L  V   L   FSMKDL
Sbjct: 827  VKALGFVKNEEVPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVNISLKNSFSMKDL 886

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRD S+RLIGLS+S YIDK+LK FNMQ+SKKGFLPM     L+   C  
Sbjct: 887  GEAAYILGIRIYRDGSKRLIGLSESTYIDKVLKMFNMQDSKKGFLPMSHGINLNKNQCLQ 946

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  +PYASA+GSIMYA+ CTRPDV++  + TSRYQ +P   HW AVKNILKYL
Sbjct: 947  TTNEQNKMSVIPYASAIGSIMYAMLCTRPDVSYVLSATSRYQSDPSESHWIAVKNILKYL 1006

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+MFLVYGG    EL V G+ DASFQTDKDD +SQ+G+VF +NG AV WKS KQ T 
Sbjct: 1007 RRTKDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGSAVSWKSSKQDTA 1064

Query: 70   AMSITEAEYIAASEAAMEAV 11
            A S TEAEYIAAS+AA EAV
Sbjct: 1065 ANSTTEAEYIAASKAAKEAV 1084


>gb|ABF97213.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
            Japonica Group]
          Length = 1142

 Score =  330 bits (846), Expect = e-102
 Identities = 168/260 (64%), Positives = 196/260 (75%)
 Frame = -1

Query: 790  IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611
            +K+ GF +N + PCVY K SGS + FLILYVDDIL++GN IP L  V   L   FSMKDL
Sbjct: 885  VKALGFVKNEEVPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVNISLKNSFSMKDL 944

Query: 610  GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431
            GEAAYILGI+IYRD S+RLIGLS+S YIDK+LK FNMQ+SKKGFLPM     L+   C  
Sbjct: 945  GEAAYILGIRIYRDGSKRLIGLSESTYIDKVLKMFNMQDSKKGFLPMSHGINLNKNQCLQ 1004

Query: 430  TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251
            T  E  +M  +PYASA+GSIMYA+ CTRPDV++  + TSRYQ +P   HW AVKNILKYL
Sbjct: 1005 TTNEQNKMSVIPYASAIGSIMYAMLCTRPDVSYVLSATSRYQSDPSESHWIAVKNILKYL 1064

Query: 250  RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71
            R TK+MFLVYGG    EL V G+ DASFQTDKDD +SQ+G+VF +NG AV WKS KQ T 
Sbjct: 1065 RRTKDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGSAVSWKSSKQDTA 1122

Query: 70   AMSITEAEYIAASEAAMEAV 11
            A S TEAEYIAAS+AA EAV
Sbjct: 1123 ANSTTEAEYIAASKAAKEAV 1142


Top