BLASTX nr result
ID: Chrysanthemum22_contig00036510
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00036510 (793 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|ADB85430.1| putative retrotransposon protein [Phyllostachys e...  358   e-114
dbj|BAA22288.1| polyprotein [Oryza australiensis]                    356   e-110
ref|WP_042516817.1| hypothetical protein [Lactobacillus brevis] ...  328   e-110
gb|ADB85429.1| putative retrotransposon protein [Phyllostachys e...  353   e-109
gb|AAC26250.1| contains similarity to reverse transcriptase (Pfa...  346   e-109
gb|KYP76577.1| Retrovirus-related Pol polyprotein from transposo...  328   e-109
gb|EXX50149.1| gag-pol fusion protein [Rhizophagus irregularis D...  350   e-108
emb|CAD39797.2| OSJNBa0071G03.10 [Oryza sativa Japonica Group]       343   e-108
gb|PKI72506.1| hypothetical protein CRG98_007109 [Punica granatum]   339   e-108
gb|PNX87842.1| retrotransposon protein putative Ty1-copia subcla...  331   e-108
gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus]   348   e-108
gb|AAP44605.1| putative polyprotein [Oryza sativa Japonica Group]    344   e-107
gb|ABF97047.1| retrotransposon protein, putative, Ty1-copia subc...  344   e-106
gb|AAX94813.1| retrotransposon protein, putative, Ty1-copia sub-...  338   e-106
gb|AMY96445.1| gag/pol protein [Momordica dioica]                    342   e-105
gb|PPZ05609.1| hypothetical protein C5P41_24865, partial [Escher...  332   e-104
gb|PPY93112.1| hypothetical protein C5P31_25365, partial [Escher...  330   e-104
gb|AAV85747.1| Integrase core domain, putative [Oryza sativa Jap...
335 e-103 gb|AAS01945.1| putative polyprotein [Oryza sativa Japonica Group] 330 e-102 gb|ABF97213.1| retrotransposon protein, putative, Ty1-copia subc... 330 e-102 >gb|ADB85430.1| putative retrotransposon protein [Phyllostachys edulis] Length = 896 Score = 358 bits (918), Expect = e-114 Identities = 180/264 (68%), Positives = 209/264 (79%) Frame = -1 Query: 793 EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614 EIK FGF +N +EPCVY K SGS + LILYVDDIL++GN IP L VK+ L K FSMKD Sbjct: 541 EIKRFGFIKNKEEPCVYMKVSGSTLVILILYVDDILLVGNDIPMLESVKSSLRKSFSMKD 600 Query: 613 LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCA 434 LG+AAYILGI+IYRDRS+RLIGLSQ YIDK+L RFNMQNSK+GFLPM LS C Sbjct: 601 LGDAAYILGIRIYRDRSKRLIGLSQEMYIDKVLNRFNMQNSKRGFLPMAHGINLSKNQCP 660 Query: 433 STPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKY 254 +T E +M +PYASA+GSIMYA+ CTRPDV++A ++TSRYQ +P HW AVKNILKY Sbjct: 661 TTTDERDKMSDIPYASAIGSIMYAMICTRPDVSYALSVTSRYQADPSEGHWTAVKNILKY 720 Query: 253 LRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTT 74 LR TK++FLVYGG D EL V G+ DASFQTDKDD +SQ+G+VF++NGGAV WKS KQ T Sbjct: 721 LRRTKDVFLVYGG--DEELVVNGYTDASFQTDKDDYRSQSGFVFILNGGAVSWKSSKQET 778 Query: 73 VAMSITEAEYIAASEAAMEAVWIR 2 VA S TEAEYIAASEAA E VWIR Sbjct: 779 VADSTTEAEYIAASEAAKEGVWIR 802 >dbj|BAA22288.1| polyprotein [Oryza australiensis] Length = 1317 Score = 356 bits (914), Expect = e-110 Identities = 182/263 (69%), Positives = 207/263 (78%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IK FGF +N +E CVY K SGS + FLILYVDDIL++GN IP L VK+ L FSMKDL Sbjct: 962 IKGFGFIKNEEEACVYKKVSGSAIVFLILYVDDILLIGNDIPMLESVKSSLKNSFSMKDL 1021 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNM +SKKGFLPM LS C Sbjct: 1022 GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMHDSKKGFLPMSHGINLSKNQCPQ 1081 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M VPYASA+GSIMYA+ CTRPDV++A + TSRYQ +PG HW 
AVKNILKYL Sbjct: 1082 THDERNKMGMVPYASAIGSIMYAMLCTRPDVSYALSATSRYQSDPGEGHWTAVKNILKYL 1141 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+MFLVYGG D L V+G+ DASFQTDKDD +SQ+G+VF +NGGAV WKS KQ TV Sbjct: 1142 RRTKDMFLVYGGEED--LVVSGYTDASFQTDKDDYRSQSGFVFCLNGGAVSWKSSKQDTV 1199 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S TEAEYIAASEAA EAVWI+ Sbjct: 1200 ADSTTEAEYIAASEAAKEAVWIK 1222 >ref|WP_042516817.1| hypothetical protein [Lactobacillus brevis] gb|KIO93795.1| hypothetical protein QP38_2416 [Lactobacillus brevis] Length = 283 Score = 328 bits (842), Expect = e-110 Identities = 167/260 (64%), Positives = 196/260 (75%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IK+FGF Q + E C+Y K SGS V FLILYVDDIL++GN++ L +K YL K FSMKDL Sbjct: 26 IKAFGFIQVVGESCIYKKVSGSSVVFLILYVDDILLIGNNVEFLESIKDYLNKSFSMKDL 85 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGIKIYRDRS+R+IGLSQS Y+D +LKRF M+ SKKGFLP+ LS C + Sbjct: 86 GEAAYILGIKIYRDRSKRVIGLSQSTYLDNVLKRFKMEQSKKGFLPVLQGTKLSKTQCPA 145 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T + M+ VPYASA+GSIMYA+ CTRPDV+ A ++ R Q NP V HW AVKNILKYL Sbjct: 146 TDEDREHMRSVPYASAIGSIMYAMMCTRPDVSLAISMAGRSQSNPAVHHWTAVKNILKYL 205 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 + TKEMFLVYGG D EL V G+ DASF TD DD+KSQT YVF++N GAV W S KQ+ V Sbjct: 206 KRTKEMFLVYGG--DEELAVKGYVDASFDTDPDDSKSQTRYVFILNEGAVSWCSSKQSVV 263 Query: 70 AMSITEAEYIAASEAAMEAV 11 A S EAEY+AASEAA E V Sbjct: 264 ADSTCEAEYMAASEAAKEGV 283 >gb|ADB85429.1| putative retrotransposon protein [Phyllostachys edulis] Length = 1313 Score = 353 bits (906), Expect = e-109 Identities = 184/266 (69%), Positives = 210/266 (78%), Gaps = 2/266 (0%) Frame = -1 Query: 793 EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614 EIK FGF +N +EPCVY K SGS + LILYVDDIL++GN IP L VKA L FSMKD Sbjct: 957 EIKRFGFVKNKEEPCVYMKVSGSTLVILILYVDDILLIGNDIPMLESVKASLKNSFSMKD 1016 
Query: 613 LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGC- 437 LGEAAYILGIKIYRDRSRRLIGLSQS YIDK+L RFNMQN+KKGFLPM +HG+S Sbjct: 1017 LGEAAYILGIKIYRDRSRRLIGLSQSTYIDKVLIRFNMQNTKKGFLPM--SHGISPSKSQ 1074 Query: 436 -ASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNIL 260 ST E RM +PYASA+GSIMYA+ CTR DV++A ++TSRYQ +PG HW AVKNIL Sbjct: 1075 RPSTTDERDRMNGIPYASAIGSIMYAMICTRQDVSYALSVTSRYQADPGECHWTAVKNIL 1134 Query: 259 KYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQ 80 KYLR TK+ FL+YGG D EL V G+ DASFQTDKDD +SQ+G+VF++NGGAV WKS KQ Sbjct: 1135 KYLRRTKDAFLIYGG--DEELVVNGYTDASFQTDKDDYRSQSGFVFILNGGAVSWKSSKQ 1192 Query: 79 TTVAMSITEAEYIAASEAAMEAVWIR 2 TVA S T+AEYIAASEAA E VWIR Sbjct: 1193 ETVADSTTKAEYIAASEAAKEGVWIR 1218 >gb|AAC26250.1| contains similarity to reverse transcriptase (Pfam: rvt.hmm, score 19.29) [Arabidopsis thaliana] emb|CAB80804.1| putative retrotransposon protein [Arabidopsis thaliana] Length = 964 Score = 346 bits (888), Expect = e-109 Identities = 174/263 (66%), Positives = 203/263 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IK F F +N +EPCVY K SGS V FL+LYVDDIL++GN IP L VK +LG CFSMKD+ Sbjct: 609 IKEFDFIRNEEEPCVYKKTSGSAVAFLVLYVDDILLLGNDIPLLQSVKTWLGSCFSMKDM 668 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRDR ++IGLSQ YIDK+L RFNM +SKKGF+PM LS C S Sbjct: 669 GEAAYILGIRIYRDRLNKIIGLSQDTYIDKVLHRFNMHDSKKGFIPMSHGITLSKTQCPS 728 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E RM K+PYASA+GSIMYA+ TRPDVA A ++TSRYQ +PG HW V+NI KYL Sbjct: 729 THDERERMSKIPYASAIGSIMYAMLYTRPDVACALSMTSRYQSDPGESHWIVVRNIFKYL 788 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+ FLVYGG+ EL V+G+ DASFQTDKDD +SQ+G+ F +NGGAV WKS KQ+TV Sbjct: 789 RRTKDKFLVYGGS--EELVVSGYTDASFQTDKDDFRSQSGFFFCLNGGAVSWKSTKQSTV 846 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S TEAEYIAASEAA E VWIR Sbjct: 847 ADSTTEAEYIAASEAAKEVVWIR 869 >gb|KYP76577.1| 
Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 368 Score = 328 bits (842), Expect = e-109 Identities = 162/263 (61%), Positives = 201/263 (76%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IK F F + +EPCVY + SGS + FL+LYVDDIL++GN IP L K +L + FSMKDL Sbjct: 58 IKKFDFVRCEEEPCVYKRVSGSTIIFLMLYVDDILLIGNDIPFLQSTKIWLSEQFSMKDL 117 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGIKIYRDRS+R++GLSQS YID IL+R+NM+NSK+G+ P+ LS++ C Sbjct: 118 GEAAYILGIKIYRDRSKRMLGLSQSMYIDTILRRYNMENSKRGYFPIGTGVTLSNEDCPK 177 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E RM +VPYASAVG+IMY + CTRPDVAFA + SRYQ NPG ++W+ VK ILKYL Sbjct: 178 TLEERTRMNRVPYASAVGAIMYIMTCTRPDVAFAPGVVSRYQANPGEENWKVVKTILKYL 237 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R T++ FL+YGG EL + G+ DASF +DKDD+KS +GYVF + GGAV WKS KQ TV Sbjct: 238 RRTQDQFLIYGG---TELMLKGYTDASFASDKDDSKSISGYVFTLYGGAVSWKSSKQATV 294 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S TEAEYIAAS+A EAVW++ Sbjct: 295 ADSTTEAEYIAASDATKEAVWMK 317 >gb|EXX50149.1| gag-pol fusion protein [Rhizophagus irregularis DAOM 197198w] Length = 1303 Score = 350 bits (899), Expect = e-108 Identities = 176/264 (66%), Positives = 212/264 (80%) Frame = -1 Query: 793 EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614 ++K FGF+++ DE CVY KASGS VTFL+LYVDDIL+MGN IP+L +VKA+LGKCF+MKD Sbjct: 944 KVKEFGFSRSEDESCVYVKASGSIVTFLVLYVDDILLMGNDIPTLQDVKAWLGKCFAMKD 1003 Query: 613 LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCA 434 LGEAAYILGI+I RDR +RLIGLSQ Y++K+LKRF+M+NSKKG LP+Q N LS Sbjct: 1004 LGEAAYILGIRILRDRKKRLIGLSQGTYLEKVLKRFSMENSKKGELPIQSNAKLSKTQSP 1063 Query: 433 STPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKY 254 ST E+A M +VPYASAVGSIMYA+ CTRPDVAFA ++ SRYQ NPG HW AVKNILKY Sbjct: 1064 STDEEIAEMSRVPYASAVGSIMYAMTCTRPDVAFALSMVSRYQGNPGRAHWIAVKNILKY 1123 Query: 253 
LRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTT 74 LR TK M LV GG+ L V G+ DASFQTD+D +SQ+G+VF++NGGAV WKS KQ T Sbjct: 1124 LRRTKNMVLVLGGSD--TLRVEGYTDASFQTDRDSGRSQSGWVFLLNGGAVTWKSSKQET 1181 Query: 73 VAMSITEAEYIAASEAAMEAVWIR 2 VA S E+EYIAASEA+ EA W++ Sbjct: 1182 VADSTCESEYIAASEASKEAAWLK 1205 >emb|CAD39797.2| OSJNBa0071G03.10 [Oryza sativa Japonica Group] Length = 948 Score = 343 bits (881), Expect = e-108 Identities = 174/263 (66%), Positives = 203/263 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 +K GF +N EPCVY K SGS + FLILYVDDIL++GN IP L VK L FSMKDL Sbjct: 684 VKVLGFFKNEQEPCVYKKISGSALVFLILYVDDILLIGNDIPILESVKTLLKNSFSMKDL 743 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM + L C Sbjct: 744 GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHDINLGKNQCPQ 803 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M +PYASA+GSIMYA+ CT PDV++A + T+RYQ +PG HW AVKNILKYL Sbjct: 804 TTDERNKMSVIPYASAIGSIMYAMLCTCPDVSYALSATNRYQSDPGESHWIAVKNILKYL 863 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R T++MFLVYGG EL V G+ DASFQTDKDD +S++G+VF +NGG V WKS KQ TV Sbjct: 864 RRTEDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSRSGFVFCLNGGVVSWKSSKQDTV 921 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S TEAEYIAASEAA +AVWI+ Sbjct: 922 ADSTTEAEYIAASEAAKDAVWIK 944 >gb|PKI72506.1| hypothetical protein CRG98_007109 [Punica granatum] Length = 783 Score = 339 bits (870), Expect = e-108 Identities = 174/262 (66%), Positives = 205/262 (78%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IK FGF +N DEPCVY K SGS V FL+LYVDDIL++GN I SL VK +LG+CFSMKDL Sbjct: 439 IKEFGFIKNEDEPCVYKKVSGSVVIFLVLYVDDILLIGNDILSLQSVKTWLGRCFSMKDL 498 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEA Y+LGIKIYRDRS RL+GLSQSAYIDK+L RF+MQ+SKKG LPM LS S Sbjct: 499 
GEATYVLGIKIYRDRSNRLLGLSQSAYIDKVLWRFSMQDSKKGSLPMLHGISLSKAQSPS 558 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E R+ ++PYASA+GSIMYA+ CTR +V++ ++TSRYQ +PG +HW AVKNILKYL Sbjct: 559 TREERDRINRIPYASAIGSIMYAMLCTRSNVSYTLSMTSRYQSDPGERHWIAVKNILKYL 618 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TKE+FLVYGG + EL V G+ D SFQT+KDD++SQ+GYV +NGGAV WKS KQ TV Sbjct: 619 RRTKEIFLVYGG--EEELVVRGYTDVSFQTNKDDSRSQSGYVLCLNGGAVSWKSSKQETV 676 Query: 70 AMSITEAEYIAASEAAMEAVWI 5 A S EAEYIAAS AA EAV I Sbjct: 677 ADSTIEAEYIAASNAAKEAVGI 698 >gb|PNX87842.1| retrotransposon protein putative Ty1-copia subclass, partial [Trifolium pratense] Length = 511 Score = 331 bits (848), Expect = e-108 Identities = 162/255 (63%), Positives = 198/255 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 I+ F F + +EPCVY K SGS + FL+LYVDDIL+ GN IPS+ K +L + FSMKDL Sbjct: 260 IEKFNFVKCEEEPCVYKKISGSSIIFLVLYVDDILLFGNDIPSMQSTKVWLSEQFSMKDL 319 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGIKIYRDRS+RL+GLSQS YID ILKR+NM+ SK+G+LP+ + LS + C Sbjct: 320 GEAAYILGIKIYRDRSKRLLGLSQSMYIDTILKRYNMEKSKRGYLPVGMGVSLSRENCPK 379 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E RM +VPYASAVG+IMY + CTRPDVA+A +TSRYQ NPG +HW+ VK ILKYL Sbjct: 380 TLEERERMSRVPYASAVGAIMYTMTCTRPDVAYALGVTSRYQANPGEEHWKVVKTILKYL 439 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+ FL+YG ++EL + G+ DASF +DKDD+KS +GYVF +NGGA+ WKS KQ TV Sbjct: 440 RRTKDQFLIYG---NSELSLKGYTDASFASDKDDSKSISGYVFTLNGGAISWKSSKQATV 496 Query: 70 AMSITEAEYIAASEA 26 A S TEAEYIAASEA Sbjct: 497 ADSTTEAEYIAASEA 511 >gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus] Length = 1325 Score = 348 bits (894), Expect = e-108 Identities = 168/264 (63%), Positives = 213/264 (80%) Frame = -1 Query: 793 EIKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKD 614 ++K FGF +N DEPCVY KASGS 
++FLILYVDDILI+GN+IP L E+K +LG CF+M+D Sbjct: 966 KVKEFGFVKNEDEPCVYRKASGSAISFLILYVDDILIIGNNIPMLKEIKHWLGSCFAMQD 1025 Query: 613 LGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCA 434 LGEAAYILGIKIYR+RS+RL+GL+QS YID++++RF M+NSKKG +PM L Sbjct: 1026 LGEAAYILGIKIYRNRSKRLLGLTQSTYIDQVMRRFKMENSKKGGVPMTKGTVLDKSQAP 1085 Query: 433 STPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKY 254 S E+ +M+ VPYASA+G IMYA+ CTRPDV++A ++TSRYQQNPGV HW AVKNILKY Sbjct: 1086 SEDREIKQMEGVPYASAIGFIMYAMVCTRPDVSYALSMTSRYQQNPGVAHWTAVKNILKY 1145 Query: 253 LRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTT 74 LR TK+MFL++GG + EL V + DASFQTD+D ++SQTG+VF +NGGAV WKS KQ+ Sbjct: 1146 LRRTKDMFLIFGG-VNEELTVKCYTDASFQTDRDTSRSQTGFVFTLNGGAVSWKSSKQSV 1204 Query: 73 VAMSITEAEYIAASEAAMEAVWIR 2 VA S TE+EYIAAS+ A EA W++ Sbjct: 1205 VADSTTESEYIAASDVAKEAAWMK 1228 >gb|AAP44605.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1161 Score = 344 bits (883), Expect = e-107 Identities = 177/263 (67%), Positives = 203/263 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 +K+ GF +N +EPCVY K SGS + FLILYVDDIL++GN IP L VK L FSMKDL Sbjct: 824 VKALGFVKNEEEPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVKTSLKYSFSMKDL 883 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM L C Sbjct: 884 GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 943 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M +PYASA+GSIMYA+ CTR DV++A + TSRYQ + G HW AVKNILKYL Sbjct: 944 TTDERNKMSVIPYASAIGSIMYAMLCTRLDVSYALSATSRYQSDLGESHWIAVKNILKYL 1003 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+MFLVYG EL V G+ DASFQTDKDD +SQ+G+VF +NGGAV WKS KQ TV Sbjct: 1004 RRTKDMFLVYG--RQEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTV 1061 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S TEAEYIAASEAA EAVWI+ Sbjct: 1062 ADSTTEAEYIAASEAAKEAVWIK 1084 >gb|ABF97047.1| retrotransposon 
protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1248 Score = 344 bits (883), Expect = e-106 Identities = 177/263 (67%), Positives = 203/263 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 +K+ GF +N +EPCVY K SGS + FLILYVDDIL++GN IP L VK L FSMKDL Sbjct: 911 VKALGFVKNEEEPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVKTSLKYSFSMKDL 970 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM L C Sbjct: 971 GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 1030 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M +PYASA+GSIMYA+ CTR DV++A + TSRYQ + G HW AVKNILKYL Sbjct: 1031 TTDERNKMSVIPYASAIGSIMYAMLCTRLDVSYALSATSRYQSDLGESHWIAVKNILKYL 1090 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+MFLVYG EL V G+ DASFQTDKDD +SQ+G+VF +NGGAV WKS KQ TV Sbjct: 1091 RRTKDMFLVYG--RQEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGGAVSWKSSKQDTV 1148 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S TEAEYIAASEAA EAVWI+ Sbjct: 1149 ADSTTEAEYIAASEAAKEAVWIK 1171 >gb|AAX94813.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Group] gb|ABA93176.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 938 Score = 338 bits (866), Expect = e-106 Identities = 172/260 (66%), Positives = 199/260 (76%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 +K+ GF +N EPCVY K SGS + FLILYVDDIL++ N IP L VK L FSMKDL Sbjct: 681 VKALGFVKNEQEPCVYKKISGSALVFLILYVDDILLIENDIPMLESVKTSLKNSFSMKDL 740 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IY+DRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM L C Sbjct: 741 GEAAYILGIRIYKDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 800 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M +PYASA+GSIMYA+ CTRPDV++A + TS+YQ +PG HW A+KNILKYL Sbjct: 801 
TTNERNKMSVIPYASAIGSIMYAMLCTRPDVSYALSATSQYQSDPGESHWIALKNILKYL 860 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+MFLVYGG EL V G+ DASFQ DKDD +SQ+G+VF +NGGAV WKS KQ TV Sbjct: 861 RRTKDMFLVYGG--QEELVVNGYTDASFQIDKDDFRSQSGFVFYLNGGAVSWKSSKQDTV 918 Query: 70 AMSITEAEYIAASEAAMEAV 11 A S TEAEYIAASEAA E V Sbjct: 919 ADSTTEAEYIAASEAAKEVV 938 >gb|AMY96445.1| gag/pol protein [Momordica dioica] Length = 1313 Score = 342 bits (878), Expect = e-105 Identities = 172/263 (65%), Positives = 203/263 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IK+FGF QN+DE CVY K SGS V FLILYVDDIL++GN + L +VK +L FSMKDL Sbjct: 988 IKAFGFIQNVDESCVYKKISGSVVAFLILYVDDILLIGNDVEYLEDVKKWLNTSFSMKDL 1047 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEA YILGI+IYRDRS + IG+SQS YIDK+L RF MQ+SKKG LP + LS + C Sbjct: 1048 GEAQYILGIRIYRDRSNKTIGMSQSTYIDKVLSRFKMQDSKKGLLPFRHGIHLSKEQCPK 1107 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 TP EV M+ +PY+SA+GS+MYA+ CTRPDV +A +I SRYQ NPG HW AVKNILKYL Sbjct: 1108 TPQEVEDMRNIPYSSAIGSLMYAMLCTRPDVCYALSIVSRYQSNPGRDHWTAVKNILKYL 1167 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R T+ MFLVYGG D +L V G+ D+SFQTDKDD+KSQ+G VF +NGGAV W+S KQT V Sbjct: 1168 RRTRNMFLVYGG--DKDLAVKGYTDSSFQTDKDDSKSQSG-VFTLNGGAVSWRSSKQTCV 1224 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S EAEY+AA EAA EAVWIR Sbjct: 1225 ADSTCEAEYVAACEAAKEAVWIR 1247 >gb|PPZ05609.1| hypothetical protein C5P41_24865, partial [Escherichia coli] Length = 859 Score = 332 bits (850), Expect = e-104 Identities = 160/263 (60%), Positives = 205/263 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IKSFGF +N DEPCVY K S S +TFL+LYVDDIL+MGN L +K +L FSMKDL Sbjct: 563 IKSFGFIKNEDEPCVYKKVSDSAITFLVLYVDDILLMGNDTGMLTTIKVWLSNTFSMKDL 622 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEA YILGI+IYRDR++R+IGLSQS Y++K+LKRFNM 
+SK+G LP++ LS + Sbjct: 623 GEATYILGIRIYRDRAKRIIGLSQSLYLEKVLKRFNMLDSKRGLLPVRHGIHLSKEMSPK 682 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 TP E +M ++PYASA+GS+MYA+ CTRP++A+A ++TSRYQ NPG++HW A+KNILKYL Sbjct: 683 TPEERDKMARIPYASAIGSLMYAMLCTRPNIAYAVSLTSRYQSNPGLEHWIAIKNILKYL 742 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK++FL+YGG +L + G+ D+ FQ+D DD KS + YVF+ NGGAV WKS KQ+T Sbjct: 743 RRTKDLFLIYGG---GDLQLDGYTDSDFQSDIDDRKSTSRYVFICNGGAVSWKSFKQSTT 799 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S EAEYIAAS+AA EAVWI+ Sbjct: 800 ADSTIEAEYIAASDAAKEAVWIK 822 >gb|PPY93112.1| hypothetical protein C5P31_25365, partial [Escherichia coli] Length = 813 Score = 330 bits (847), Expect = e-104 Identities = 161/263 (61%), Positives = 205/263 (77%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 IKSFGF +N DEPCVY K S S +TFL+LYVDDIL+MGN L +K +L FSMKDL Sbjct: 472 IKSFGFIKNEDEPCVYKKVSDSAITFLVLYVDDILLMGNDTGMLTTIKVWLSNTFSMKDL 531 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEA YILGI+IYRDR++R+IGLSQS Y++K+LKRFNM +SK+G L ++ LS + Sbjct: 532 GEATYILGIRIYRDRAKRIIGLSQSLYLEKVLKRFNMLDSKRGLLLVRHGIHLSKEMSPK 591 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 TP E +M ++PYASA+GS+MYA+ CTRPD+A+A ++TSRYQ NPG++H AVKNILKYL Sbjct: 592 TPEERDKMARIPYASAIGSLMYAMLCTRPDIAYAISLTSRYQSNPGLEHSIAVKNILKYL 651 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK++FL+YGG +L + G+ D+ FQ+D DD KS +GYVF+ NGGAV WKS KQ+T Sbjct: 652 RRTKDLFLIYGG---GDLQLDGYTDSDFQSDIDDRKSTSGYVFICNGGAVSWKSSKQSTT 708 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 A S TEAEYIAAS+AA EA+WI+ Sbjct: 709 ADSTTEAEYIAASDAAKEAIWIK 731 >gb|AAV85747.1| Integrase core domain, putative [Oryza sativa Japonica Group] gb|AAX92956.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Group] gb|ABA92827.2| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa 
Japonica Group] Length = 1184 Score = 335 bits (858), Expect = e-103 Identities = 172/263 (65%), Positives = 198/263 (75%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 +K+ GF +N +EPCVY K SGS + FLILYVDDIL++GN I L VK L FSMKDL Sbjct: 833 VKALGFVRNEEEPCVYKKISGSALVFLILYVDDILLIGNDISMLESVKTSLKNSFSMKDL 892 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM L C Sbjct: 893 GEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSHGINLGKNQCPQ 952 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M +PYASA+GSIMYA+ CTRPDV++A + TSRYQ +PG HW AVKNILKYL Sbjct: 953 TTDERNKMSVIPYASAIGSIMYAMLCTRPDVSYALSATSRYQSDPGESHWIAVKNILKYL 1012 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+MFL YGG EL V G+ DASFQ DKDD +SQ+G+VF +NGGAV WKS KQ V Sbjct: 1013 RRTKDMFLAYGG--QEELVVNGYTDASFQIDKDDFRSQSGFVFCLNGGAVSWKSSKQDIV 1070 Query: 70 AMSITEAEYIAASEAAMEAVWIR 2 S TEAEYIAAS EAVWI+ Sbjct: 1071 VDSTTEAEYIAAS----EAVWIK 1089 >gb|AAS01945.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1084 Score = 330 bits (846), Expect = e-102 Identities = 168/260 (64%), Positives = 196/260 (75%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 +K+ GF +N + PCVY K SGS + FLILYVDDIL++GN IP L V L FSMKDL Sbjct: 827 VKALGFVKNEEVPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVNISLKNSFSMKDL 886 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRD S+RLIGLS+S YIDK+LK FNMQ+SKKGFLPM L+ C Sbjct: 887 GEAAYILGIRIYRDGSKRLIGLSESTYIDKVLKMFNMQDSKKGFLPMSHGINLNKNQCLQ 946 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M +PYASA+GSIMYA+ CTRPDV++ + TSRYQ +P HW AVKNILKYL Sbjct: 947 TTNEQNKMSVIPYASAIGSIMYAMLCTRPDVSYVLSATSRYQSDPSESHWIAVKNILKYL 1006 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+MFLVYGG EL V G+ DASFQTDKDD +SQ+G+VF +NG AV WKS KQ T 
Sbjct: 1007 RRTKDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGSAVSWKSSKQDTA 1064 Query: 70 AMSITEAEYIAASEAAMEAV 11 A S TEAEYIAAS+AA EAV Sbjct: 1065 ANSTTEAEYIAASKAAKEAV 1084 >gb|ABF97213.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1142 Score = 330 bits (846), Expect = e-102 Identities = 168/260 (64%), Positives = 196/260 (75%) Frame = -1 Query: 790 IKSFGFTQNLDEPCVYHKASGSHVTFLILYVDDILIMGNHIPSLNEVKAYLGKCFSMKDL 611 +K+ GF +N + PCVY K SGS + FLILYVDDIL++GN IP L V L FSMKDL Sbjct: 885 VKALGFVKNEEVPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVNISLKNSFSMKDL 944 Query: 610 GEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVNHGLSSQGCAS 431 GEAAYILGI+IYRD S+RLIGLS+S YIDK+LK FNMQ+SKKGFLPM L+ C Sbjct: 945 GEAAYILGIRIYRDGSKRLIGLSESTYIDKVLKMFNMQDSKKGFLPMSHGINLNKNQCLQ 1004 Query: 430 TPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNITSRYQQNPGVKHWQAVKNILKYL 251 T E +M +PYASA+GSIMYA+ CTRPDV++ + TSRYQ +P HW AVKNILKYL Sbjct: 1005 TTNEQNKMSVIPYASAIGSIMYAMLCTRPDVSYVLSATSRYQSDPSESHWIAVKNILKYL 1064 Query: 250 RATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAVDWKSKKQTTV 71 R TK+MFLVYGG EL V G+ DASFQTDKDD +SQ+G+VF +NG AV WKS KQ T Sbjct: 1065 RRTKDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGSAVSWKSSKQDTA 1122 Query: 70 AMSITEAEYIAASEAAMEAV 11 A S TEAEYIAAS+AA EAV Sbjct: 1123 ANSTTEAEYIAASKAAKEAV 1142
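
The per-alignment summary lines in this report (Score, Expect, Identities, Positives and, where present, Gaps) follow a fixed pattern, so they can be pulled out of the flattened text with a regular expression. The block below is a minimal Python sketch under stated assumptions, not part of the BLAST output: HSP_SUMMARY, parse_evalue and summarize are hypothetical helper names introduced for illustration, and the sample string is copied from the summaries of gb|ADB85430.1| and gb|ADB85429.1| above. This legacy format prints 1e-114 as "e-114", so the sketch prepends "1" before converting to a float.

```python
import re

# Pattern for one alignment summary as printed in this report, e.g.
# "Score = 358 bits (918), Expect = e-114 Identities = 180/264 (68%),
#  Positives = 209/264 (79%)" with an optional ", Gaps = 2/266 (0%)".
HSP_SUMMARY = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
)


def parse_evalue(text):
    """Convert an E-value string to float; "e-114" is read as 1e-114."""
    return float("1" + text if text.startswith("e") else text)


def summarize(report_text):
    """Return one dict per alignment summary found in report_text."""
    hits = []
    for m in HSP_SUMMARY.finditer(report_text):
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "evalue": parse_evalue(m["evalue"]),
            # Recomputed from the raw counts, one decimal place.
            "identity_pct": round(100 * ident / length, 1),
            # Gaps is omitted by BLAST when the alignment has none.
            "gaps": int(m["gaps"] or 0),
        })
    return hits


# Two summaries copied from the report above (flattened, as in the dump).
sample = (
    "Score = 358 bits (918), Expect = e-114 Identities = 180/264 (68%), "
    "Positives = 209/264 (79%) Frame = -1 "
    "Score = 353 bits (906), Expect = e-109 Identities = 184/266 (69%), "
    "Positives = 210/266 (78%), Gaps = 2/266 (0%) Frame = -1"
)

for hit in summarize(sample):
    print(hit)
```

Recomputing the identity percentage from the raw counts keeps one more digit than the rounded integer BLAST prints, which can help when ranking hits whose printed percentages tie.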