BLASTX nr result
ID: Chrysanthemum21_contig00039517
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00039517
         (788 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|WP_042516817.1| hypothetical protein [Lactobacillus brevis] ...   334  e-112
gb|ADB85430.1| putative retrotransposon protein [Phyllostachys e...   352  e-112
gb|PNX87842.1| retrotransposon protein putative Ty1-copia subcla...   337  e-110
gb|PKI72506.1| hypothetical protein CRG98_007109 [Punica granatum]    343  e-110
dbj|BAA22288.1| polyprotein [Oryza australiensis]                     350  e-108
gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus]    350  e-108
gb|ADB85429.1| putative retrotransposon protein [Phyllostachys e...   348  e-107
gb|AAC26250.1| contains similarity to reverse transcriptase (Pfa...   341  e-107
gb|KYP76577.1| Retrovirus-related Pol polyprotein from transposo...   323  e-107
emb|CAD39797.2| OSJNBa0071G03.10 [Oryza sativa Japonica Group]        339  e-106
gb|EXX50149.1| gag-pol fusion protein [Rhizophagus irregularis D...   345  e-106
gb|AAX94813.1| retrotransposon protein, putative, Ty1-copia sub-...   338  e-106
gb|AAV85747.1| Integrase core domain, putative [Oryza sativa Jap...   339  e-105
gb|AAP44605.1| putative polyprotein [Oryza sativa Japonica Group]     338  e-105
gb|ABF97047.1| retrotransposon protein, putative, Ty1-copia subc...   338  e-104
gb|PPZ05609.1| hypothetical protein C5P41_24865, partial [Escher...
329 e-103 gb|AMY96445.1| gag/pol protein [Momordica dioica] 337 e-103 gb|AAS01945.1| putative polyprotein [Oryza sativa Japonica Group] 332 e-103 gb|PPY93112.1| hypothetical protein C5P31_25365, partial [Escher... 327 e-103 gb|ABF97213.1| retrotransposon protein, putative, Ty1-copia subc... 332 e-103 >ref|WP_042516817.1| hypothetical protein [Lactobacillus brevis] gb|KIO93795.1| hypothetical protein QP38_2416 [Lactobacillus brevis] Length = 283 Score = 334 bits (857), Expect = e-112 Identities = 166/262 (63%), Positives = 197/262 (75%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WNKRFD IK+FGF Q + E C+Y K SGS V FL+LYVDDIL++GN++ L +K Sbjct: 15 SRSWNKRFDEVIKAFGFIQVVGESCIYKKVSGSSVVFLILYVDDILLIGNNVEFLESIKD 74 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 YL K FSMKDLGEAAYILGIKIYRDRS+R+IGLSQS Y+D +LKRF M+ SKKGFLP+ Sbjct: 75 YLNKSFSMKDLGEAAYILGIKIYRDRSKRVIGLSQSTYLDNVLKRFKMEQSKKGFLPVLQ 134 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS C +T + M+ VPYASA+GSIMYA+ CTRPDV+ A ++ R Q NP V H Sbjct: 135 GTKLSKTQCPATDEDREHMRSVPYASAIGSIMYAMMCTRPDVSLAISMAGRSQSNPAVHH 194 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYL+ TKEMFLVYGG D EL V G+ DASF TD DD+KSQT YVF++N GA Sbjct: 195 WTAVKNILKYLKRTKEMFLVYGG--DEELAVKGYVDASFDTDPDDSKSQTRYVFILNEGA 252 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V W S KQ+ +A S EAEY+A Sbjct: 253 VSWCSSKQSVVADSTCEAEYMA 274 >gb|ADB85430.1| putative retrotransposon protein [Phyllostachys edulis] Length = 896 Score = 352 bits (904), Expect = e-112 Identities = 175/262 (66%), Positives = 206/262 (78%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD EIK FGF +N +EPCVY K SGS + L+LYVDDIL++GN IP L VK+ Sbjct: 531 SRSWNIRFDEEIKRFGFIKNKEEPCVYMKVSGSTLVILILYVDDILLVGNDIPMLESVKS 590 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L K FSMKDLG+AAYILGI+IYRDRS+RLIGLSQ YIDK+L 
RFNMQNSK+GFLPM Sbjct: 591 SLRKSFSMKDLGDAAYILGIRIYRDRSKRLIGLSQEMYIDKVLNRFNMQNSKRGFLPMAH 650 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS C +T E +M +PYASA+GSIMYA+ CTRPDV++A ++TSRYQ +P H Sbjct: 651 GINLSKNQCPTTTDERDKMSDIPYASAIGSIMYAMICTRPDVSYALSVTSRYQADPSEGH 710 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK++FLVYGG D EL V G+ DASFQTDKDD +SQ+G+VF++NGGA Sbjct: 711 WTAVKNILKYLRRTKDVFLVYGG--DEELVVNGYTDASFQTDKDDYRSQSGFVFILNGGA 768 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S TEAEYIA Sbjct: 769 VSWKSSKQETVADSTTEAEYIA 790 >gb|PNX87842.1| retrotransposon protein putative Ty1-copia subclass, partial [Trifolium pratense] Length = 511 Score = 337 bits (864), Expect = e-110 Identities = 164/262 (62%), Positives = 201/262 (76%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RF+ I+ F F + +EPCVY K SGS + FLVLYVDDIL+ GN IPS+ K Sbjct: 249 SRSWNIRFNNTIEKFNFVKCEEEPCVYKKISGSSIIFLVLYVDDILLFGNDIPSMQSTKV 308 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +L + FSMKDLGEAAYILGIKIYRDRS+RL+GLSQS YID ILKR+NM+ SK+G+LP+ + Sbjct: 309 WLSEQFSMKDLGEAAYILGIKIYRDRSKRLLGLSQSMYIDTILKRYNMEKSKRGYLPVGM 368 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS + C T E RM +VPYASAVG+IMY + CTRPDVA+A +TSRYQ NPG +H Sbjct: 369 GVSLSRENCPKTLEERERMSRVPYASAVGAIMYTMTCTRPDVAYALGVTSRYQANPGEEH 428 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W+ VK ILKYLR TK+ FL+YG ++EL + G+ DASF +DKDD+KS +GYVF +NGGA Sbjct: 429 WKVVKTILKYLRRTKDQFLIYG---NSELSLKGYTDASFASDKDDSKSISGYVFTLNGGA 485 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 + WKS KQ T+A S TEAEYIA Sbjct: 486 ISWKSSKQATVADSTTEAEYIA 507 >gb|PKI72506.1| hypothetical protein CRG98_007109 [Punica granatum] Length = 783 Score = 343 bits (881), Expect = e-110 Identities = 173/262 (66%), Positives = 204/262 (77%) Frame = -1 Query: 788 
SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD IK FGF +N DEPCVY K SGS V FLVLYVDDIL++GN I SL VK Sbjct: 428 SRSWNLRFDDAIKEFGFIKNEDEPCVYKKVSGSVVIFLVLYVDDILLIGNDILSLQSVKT 487 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +LG+CFSMKDLGEA Y+LGIKIYRDRS RL+GLSQSAYIDK+L RF+MQ+SKKG LPM Sbjct: 488 WLGRCFSMKDLGEATYVLGIKIYRDRSNRLLGLSQSAYIDKVLWRFSMQDSKKGSLPMLH 547 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS ST E R+ ++PYASA+GSIMYA+ CTR +V++ ++TSRYQ +PG +H Sbjct: 548 GISLSKAQSPSTREERDRINRIPYASAIGSIMYAMLCTRSNVSYTLSMTSRYQSDPGERH 607 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TKE+FLVYGG + EL V G+ D SFQT+KDD++SQ+GYV +NGGA Sbjct: 608 WIAVKNILKYLRRTKEIFLVYGG--EEELVVRGYTDVSFQTNKDDSRSQSGYVLCLNGGA 665 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S EAEYIA Sbjct: 666 VSWKSSKQETVADSTIEAEYIA 687 >dbj|BAA22288.1| polyprotein [Oryza australiensis] Length = 1317 Score = 350 bits (899), Expect = e-108 Identities = 177/262 (67%), Positives = 203/262 (77%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD IK FGF +N +E CVY K SGS + FL+LYVDDIL++GN IP L VK+ Sbjct: 951 SRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLILYVDDILLIGNDIPMLESVKS 1010 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNM +SKKGFLPM Sbjct: 1011 SLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMHDSKKGFLPMSH 1070 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS C T E +M VPYASA+GSIMYA+ CTRPDV++A + TSRYQ +PG H Sbjct: 1071 GINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLCTRPDVSYALSATSRYQSDPGEGH 1130 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK+MFLVYGG D L V+G+ DASFQTDKDD +SQ+G+VF +NGGA Sbjct: 1131 WTAVKNILKYLRRTKDMFLVYGGEED--LVVSGYTDASFQTDKDDYRSQSGFVFCLNGGA 1188 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S TEAEYIA Sbjct: 1189 
VSWKSSKQDTVADSTTEAEYIA 1210 >gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus] Length = 1325 Score = 350 bits (897), Expect = e-108 Identities = 167/262 (63%), Positives = 211/262 (80%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD ++K FGF +N DEPCVY KASGS ++FL+LYVDDILI+GN+IP L E+K Sbjct: 956 SRSWNLRFDQKVKEFGFVKNEDEPCVYRKASGSAISFLILYVDDILIIGNNIPMLKEIKH 1015 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +LG CF+M+DLGEAAYILGIKIYR+RS+RL+GL+QS YID++++RF M+NSKKG +PM Sbjct: 1016 WLGSCFAMQDLGEAAYILGIKIYRNRSKRLLGLTQSTYIDQVMRRFKMENSKKGGVPMTK 1075 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 L S E+ +M+ VPYASA+G IMYA+ CTRPDV++A ++TSRYQQNPGV H Sbjct: 1076 GTVLDKSQAPSEDREIKQMEGVPYASAIGFIMYAMVCTRPDVSYALSMTSRYQQNPGVAH 1135 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK+MFL++GG + EL V + DASFQTD+D ++SQTG+VF +NGGA Sbjct: 1136 WTAVKNILKYLRRTKDMFLIFGG-VNEELTVKCYTDASFQTDRDTSRSQTGFVFTLNGGA 1194 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ+ +A S TE+EYIA Sbjct: 1195 VSWKSSKQSVVADSTTESEYIA 1216 >gb|ADB85429.1| putative retrotransposon protein [Phyllostachys edulis] Length = 1313 Score = 348 bits (892), Expect = e-107 Identities = 179/264 (67%), Positives = 207/264 (78%), Gaps = 2/264 (0%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD EIK FGF +N +EPCVY K SGS + L+LYVDDIL++GN IP L VKA Sbjct: 947 SRSWNIRFDEEIKRFGFVKNKEEPCVYMKVSGSTLVILILYVDDILLIGNDIPMLESVKA 1006 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGIKIYRDRSRRLIGLSQS YIDK+L RFNMQN+KKGFLPM Sbjct: 1007 SLKNSFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSTYIDKVLIRFNMQNTKKGFLPM-- 1064 Query: 428 NHGLSSQGC--ASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGV 255 +HG+S ST E RM +PYASA+GSIMYA+ CTR DV++A ++TSRYQ +PG Sbjct: 1065 SHGISPSKSQRPSTTDERDRMNGIPYASAIGSIMYAMICTRQDVSYALSVTSRYQADPGE 1124 Query: 
254 KHWQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVING 75 HW AVKNILKYLR TK+ FL+YGG D EL V G+ DASFQTDKDD +SQ+G+VF++NG Sbjct: 1125 CHWTAVKNILKYLRRTKDAFLIYGG--DEELVVNGYTDASFQTDKDDYRSQSGFVFILNG 1182 Query: 74 GAVDWKSKKQTTIAMSATEAEYIA 3 GAV WKS KQ T+A S T+AEYIA Sbjct: 1183 GAVSWKSSKQETVADSTTKAEYIA 1206 >gb|AAC26250.1| contains similarity to reverse transcriptase (Pfam: rvt.hmm, score 19.29) [Arabidopsis thaliana] emb|CAB80804.1| putative retrotransposon protein [Arabidopsis thaliana] Length = 964 Score = 341 bits (874), Expect = e-107 Identities = 170/262 (64%), Positives = 200/262 (76%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RF+ IK F F +N +EPCVY K SGS V FLVLYVDDIL++GN IP L VK Sbjct: 598 SRSWNLRFNEAIKEFDFIRNEEEPCVYKKTSGSAVAFLVLYVDDILLLGNDIPLLQSVKT 657 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +LG CFSMKD+GEAAYILGI+IYRDR ++IGLSQ YIDK+L RFNM +SKKGF+PM Sbjct: 658 WLGSCFSMKDMGEAAYILGIRIYRDRLNKIIGLSQDTYIDKVLHRFNMHDSKKGFIPMSH 717 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS C ST E RM K+PYASA+GSIMYA+ TRPDVA A ++TSRYQ +PG H Sbjct: 718 GITLSKTQCPSTHDERERMSKIPYASAIGSIMYAMLYTRPDVACALSMTSRYQSDPGESH 777 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W V+NI KYLR TK+ FLVYGG+ EL V+G+ DASFQTDKDD +SQ+G+ F +NGGA Sbjct: 778 WIVVRNIFKYLRRTKDKFLVYGGS--EELVVSGYTDASFQTDKDDFRSQSGFFFCLNGGA 835 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ+T+A S TEAEYIA Sbjct: 836 VSWKSTKQSTVADSTTEAEYIA 857 >gb|KYP76577.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 368 Score = 323 bits (828), Expect = e-107 Identities = 159/262 (60%), Positives = 198/262 (75%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN +F+ IK F F + +EPCVY + SGS + FL+LYVDDIL++GN IP L K Sbjct: 47 SRSWNIQFNKTIKKFDFVRCEEEPCVYKRVSGSTIIFLMLYVDDILLIGNDIPFLQSTKI 106 Query: 608 
YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +L + FSMKDLGEAAYILGIKIYRDRS+R++GLSQS YID IL+R+NM+NSK+G+ P+ Sbjct: 107 WLSEQFSMKDLGEAAYILGIKIYRDRSKRMLGLSQSMYIDTILRRYNMENSKRGYFPIGT 166 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS++ C T E RM +VPYASAVG+IMY + CTRPDVAFA + SRYQ NPG ++ Sbjct: 167 GVTLSNEDCPKTLEERTRMNRVPYASAVGAIMYIMTCTRPDVAFAPGVVSRYQANPGEEN 226 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W+ VK ILKYLR T++ FL+YGG EL + G+ DASF +DKDD+KS +GYVF + GGA Sbjct: 227 WKVVKTILKYLRRTQDQFLIYGG---TELMLKGYTDASFASDKDDSKSISGYVFTLYGGA 283 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S TEAEYIA Sbjct: 284 VSWKSSKQATVADSTTEAEYIA 305 >emb|CAD39797.2| OSJNBa0071G03.10 [Oryza sativa Japonica Group] Length = 948 Score = 339 bits (869), Expect = e-106 Identities = 170/262 (64%), Positives = 199/262 (75%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD +K GF +N EPCVY K SGS + FL+LYVDDIL++GN IP L VK Sbjct: 673 SRSWNIRFDEVVKVLGFFKNEQEPCVYKKISGSALVFLILYVDDILLIGNDIPILESVKT 732 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM Sbjct: 733 LLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSH 792 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 + L C T E +M +PYASA+GSIMYA+ CT PDV++A + T+RYQ +PG H Sbjct: 793 DINLGKNQCPQTTDERNKMSVIPYASAIGSIMYAMLCTCPDVSYALSATNRYQSDPGESH 852 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR T++MFLVYGG EL V G+ DASFQTDKDD +S++G+VF +NGG Sbjct: 853 WIAVKNILKYLRRTEDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSRSGFVFCLNGGV 910 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S TEAEYIA Sbjct: 911 VSWKSSKQDTVADSTTEAEYIA 932 >gb|EXX50149.1| gag-pol fusion protein [Rhizophagus irregularis DAOM 197198w] Length = 1303 Score = 345 bits (884), Expect = e-106 Identities = 174/262 (66%), Positives = 207/262 
(79%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN F ++K FGF+++ DE CVY KASGS VTFLVLYVDDIL+MGN IP+L +VKA Sbjct: 934 SRSWNLCFHEKVKEFGFSRSEDESCVYVKASGSIVTFLVLYVDDILLMGNDIPTLQDVKA 993 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +LGKCF+MKDLGEAAYILGI+I RDR +RLIGLSQ Y++K+LKRF+M+NSKKG LP+Q Sbjct: 994 WLGKCFAMKDLGEAAYILGIRILRDRKKRLIGLSQGTYLEKVLKRFSMENSKKGELPIQS 1053 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 N LS ST E+A M +VPYASAVGSIMYA+ CTRPDVAFA ++ SRYQ NPG H Sbjct: 1054 NAKLSKTQSPSTDEEIAEMSRVPYASAVGSIMYAMTCTRPDVAFALSMVSRYQGNPGRAH 1113 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK M LV GG+ L V G+ DASFQTD+D +SQ+G+VF++NGGA Sbjct: 1114 WIAVKNILKYLRRTKNMVLVLGGSD--TLRVEGYTDASFQTDRDSGRSQSGWVFLLNGGA 1171 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S E+EYIA Sbjct: 1172 VTWKSSKQETVADSTCESEYIA 1193 >gb|AAX94813.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Group] gb|ABA93176.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 938 Score = 338 bits (866), Expect = e-106 Identities = 169/262 (64%), Positives = 198/262 (75%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN FD +K+ GF +N EPCVY K SGS + FL+LYVDDIL++ N IP L VK Sbjct: 670 SRSWNIHFDEIVKALGFVKNEQEPCVYKKISGSALVFLILYVDDILLIENDIPMLESVKT 729 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGI+IY+DRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM Sbjct: 730 SLKNSFSMKDLGEAAYILGIRIYKDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSH 789 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 L C T E +M +PYASA+GSIMYA+ CTRPDV++A + TS+YQ +PG H Sbjct: 790 GINLGKNQCPQTTNERNKMSVIPYASAIGSIMYAMLCTRPDVSYALSATSQYQSDPGESH 849 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W A+KNILKYLR TK+MFLVYGG EL V G+ 
DASFQ DKDD +SQ+G+VF +NGGA Sbjct: 850 WIALKNILKYLRRTKDMFLVYGG--QEELVVNGYTDASFQIDKDDFRSQSGFVFYLNGGA 907 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S TEAEYIA Sbjct: 908 VSWKSSKQDTVADSTTEAEYIA 929 >gb|AAV85747.1| Integrase core domain, putative [Oryza sativa Japonica Group] gb|AAX92956.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa Japonica Group] gb|ABA92827.2| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1184 Score = 339 bits (870), Expect = e-105 Identities = 170/262 (64%), Positives = 197/262 (75%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD +K+ GF +N +EPCVY K SGS + FL+LYVDDIL++GN I L VK Sbjct: 822 SRSWNIRFDEVVKALGFVRNEEEPCVYKKISGSALVFLILYVDDILLIGNDISMLESVKT 881 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM Sbjct: 882 SLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSH 941 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 L C T E +M +PYASA+GSIMYA+ CTRPDV++A + TSRYQ +PG H Sbjct: 942 GINLGKNQCPQTTDERNKMSVIPYASAIGSIMYAMLCTRPDVSYALSATSRYQSDPGESH 1001 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK+MFL YGG EL V G+ DASFQ DKDD +SQ+G+VF +NGGA Sbjct: 1002 WIAVKNILKYLRRTKDMFLAYGG--QEELVVNGYTDASFQIDKDDFRSQSGFVFCLNGGA 1059 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ + S TEAEYIA Sbjct: 1060 VSWKSSKQDIVVDSTTEAEYIA 1081 >gb|AAP44605.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1161 Score = 338 bits (868), Expect = e-105 Identities = 172/262 (65%), Positives = 199/262 (75%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD +K+ GF +N +EPCVY K SGS + FL+LYVDDIL++GN IP L VK Sbjct: 813 SRSWNIRFDEVVKALGFVKNEEEPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVKT 872 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L 
FSMKDLGEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM Sbjct: 873 SLKYSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSH 932 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 L C T E +M +PYASA+GSIMYA+ CTR DV++A + TSRYQ + G H Sbjct: 933 GINLGKNQCPQTTDERNKMSVIPYASAIGSIMYAMLCTRLDVSYALSATSRYQSDLGESH 992 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK+MFLVYG EL V G+ DASFQTDKDD +SQ+G+VF +NGGA Sbjct: 993 WIAVKNILKYLRRTKDMFLVYG--RQEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGGA 1050 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S TEAEYIA Sbjct: 1051 VSWKSSKQDTVADSTTEAEYIA 1072 >gb|ABF97047.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1248 Score = 338 bits (868), Expect = e-104 Identities = 172/262 (65%), Positives = 199/262 (75%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD +K+ GF +N +EPCVY K SGS + FL+LYVDDIL++GN IP L VK Sbjct: 900 SRSWNIRFDEVVKALGFVKNEEEPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVKT 959 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGI+IYRDRS+RLIGLSQS YIDK+LKRFNMQ+SKKGFLPM Sbjct: 960 SLKYSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTYIDKVLKRFNMQDSKKGFLPMSH 1019 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 L C T E +M +PYASA+GSIMYA+ CTR DV++A + TSRYQ + G H Sbjct: 1020 GINLGKNQCPQTTDERNKMSVIPYASAIGSIMYAMLCTRLDVSYALSATSRYQSDLGESH 1079 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK+MFLVYG EL V G+ DASFQTDKDD +SQ+G+VF +NGGA Sbjct: 1080 WIAVKNILKYLRRTKDMFLVYG--RQEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGGA 1137 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T+A S TEAEYIA Sbjct: 1138 VSWKSSKQDTVADSTTEAEYIA 1159 >gb|PPZ05609.1| hypothetical protein C5P41_24865, partial [Escherichia coli] Length = 859 Score = 329 bits (843), Expect = e-103 Identities = 160/262 (61%), Positives = 201/262 (76%) Frame = -1 Query: 788 
SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD IKSFGF +N DEPCVY K S S +TFLVLYVDDIL+MGN L +K Sbjct: 552 SRSWNIRFDEAIKSFGFIKNEDEPCVYKKVSDSAITFLVLYVDDILLMGNDTGMLTTIKV 611 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +L FSMKDLGEA YILGI+IYRDR++R+IGLSQS Y++K+LKRFNM +SK+G LP++ Sbjct: 612 WLSNTFSMKDLGEATYILGIRIYRDRAKRIIGLSQSLYLEKVLKRFNMLDSKRGLLPVRH 671 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS + TP E +M ++PYASA+GS+MYA+ CTRP++A+A +LTSRYQ NPG++H Sbjct: 672 GIHLSKEMSPKTPEERDKMARIPYASAIGSLMYAMLCTRPNIAYAVSLTSRYQSNPGLEH 731 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W A+KNILKYLR TK++FL+YGG +L + G+ D+ FQ+D DD KS + YVF+ NGGA Sbjct: 732 WIAIKNILKYLRRTKDLFLIYGG---GDLQLDGYTDSDFQSDIDDRKSTSRYVFICNGGA 788 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ+T A S EAEYIA Sbjct: 789 VSWKSFKQSTTADSTIEAEYIA 810 >gb|AMY96445.1| gag/pol protein [Momordica dioica] Length = 1313 Score = 337 bits (863), Expect = e-103 Identities = 166/262 (63%), Positives = 200/262 (76%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD IK+FGF QN+DE CVY K SGS V FL+LYVDDIL++GN + L +VK Sbjct: 977 SRSWNIRFDEVIKAFGFIQNVDESCVYKKISGSVVAFLILYVDDILLIGNDVEYLEDVKK 1036 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 +L FSMKDLGEA YILGI+IYRDRS + IG+SQS YIDK+L RF MQ+SKKG LP + Sbjct: 1037 WLNTSFSMKDLGEAQYILGIRIYRDRSNKTIGMSQSTYIDKVLSRFKMQDSKKGLLPFRH 1096 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 LS + C TP EV M+ +PY+SA+GS+MYA+ CTRPDV +A ++ SRYQ NPG H Sbjct: 1097 GIHLSKEQCPKTPQEVEDMRNIPYSSAIGSLMYAMLCTRPDVCYALSIVSRYQSNPGRDH 1156 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR T+ MFLVYGG D +L V G+ D+SFQTDKDD+KSQ+G VF +NGGA Sbjct: 1157 WTAVKNILKYLRRTRNMFLVYGG--DKDLAVKGYTDSSFQTDKDDSKSQSG-VFTLNGGA 1213 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V W+S KQT +A S EAEY+A Sbjct: 1214 
VSWRSSKQTCVADSTCEAEYVA 1235 >gb|AAS01945.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1084 Score = 332 bits (852), Expect = e-103 Identities = 167/262 (63%), Positives = 195/262 (74%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD +K+ GF +N + PCVY K SGS + FL+LYVDDIL++GN IP L V Sbjct: 816 SRSWNIRFDEVVKALGFVKNEEVPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVNI 875 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGI+IYRD S+RLIGLS+S YIDK+LK FNMQ+SKKGFLPM Sbjct: 876 SLKNSFSMKDLGEAAYILGIRIYRDGSKRLIGLSESTYIDKVLKMFNMQDSKKGFLPMSH 935 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 L+ C T E +M +PYASA+GSIMYA+ CTRPDV++ + TSRYQ +P H Sbjct: 936 GINLNKNQCLQTTNEQNKMSVIPYASAIGSIMYAMLCTRPDVSYVLSATSRYQSDPSESH 995 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK+MFLVYGG EL V G+ DASFQTDKDD +SQ+G+VF +NG A Sbjct: 996 WIAVKNILKYLRRTKDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGSA 1053 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T A S TEAEYIA Sbjct: 1054 VSWKSSKQDTAANSTTEAEYIA 1075 >gb|PPY93112.1| hypothetical protein C5P31_25365, partial [Escherichia coli] Length = 813 Score = 327 bits (837), Expect = e-103 Identities = 161/261 (61%), Positives = 200/261 (76%) Frame = -1 Query: 785 RQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKAY 606 R WN RFD IKSFGF +N DEPCVY K S S +TFLVLYVDDIL+MGN L +K + Sbjct: 462 RSWNIRFDEAIKSFGFIKNEDEPCVYKKVSDSAITFLVLYVDDILLMGNDTGMLTTIKVW 521 Query: 605 LGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQVN 426 L FSMKDLGEA YILGI+IYRDR++R+IGLSQS Y++K+LKRFNM +SK+G L ++ Sbjct: 522 LSNTFSMKDLGEATYILGIRIYRDRAKRIIGLSQSLYLEKVLKRFNMLDSKRGLLLVRHG 581 Query: 425 HGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKHW 246 LS + TP E +M ++PYASA+GS+MYA+ CTRPD+A+A +LTSRYQ NPG++H Sbjct: 582 IHLSKEMSPKTPEERDKMARIPYASAIGSLMYAMLCTRPDIAYAISLTSRYQSNPGLEHS 641 Query: 245 
QAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGAV 66 AVKNILKYLR TK++FL+YGG +L + G+ D+ FQ+D DD KS +GYVF+ NGGAV Sbjct: 642 IAVKNILKYLRRTKDLFLIYGG---GDLQLDGYTDSDFQSDIDDRKSTSGYVFICNGGAV 698 Query: 65 DWKSKKQTTIAMSATEAEYIA 3 WKS KQ+T A S TEAEYIA Sbjct: 699 SWKSSKQSTTADSTTEAEYIA 719 >gb|ABF97213.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1142 Score = 332 bits (852), Expect = e-103 Identities = 167/262 (63%), Positives = 195/262 (74%) Frame = -1 Query: 788 SRQWNKRFDGEIKSFGFTQNLDEPCVYHKASGSHVTFLVLYVDDILIMGNHIPSLNEVKA 609 SR WN RFD +K+ GF +N + PCVY K SGS + FL+LYVDDIL++GN IP L V Sbjct: 874 SRSWNIRFDEVVKALGFVKNEEVPCVYKKISGSALVFLILYVDDILLIGNDIPMLESVNI 933 Query: 608 YLGKCFSMKDLGEAAYILGIKIYRDRSRRLIGLSQSAYIDKILKRFNMQNSKKGFLPMQV 429 L FSMKDLGEAAYILGI+IYRD S+RLIGLS+S YIDK+LK FNMQ+SKKGFLPM Sbjct: 934 SLKNSFSMKDLGEAAYILGIRIYRDGSKRLIGLSESTYIDKVLKMFNMQDSKKGFLPMSH 993 Query: 428 NHGLSSQGCASTPAEVARMKKVPYASAVGSIMYAVRCTRPDVAFAQNLTSRYQQNPGVKH 249 L+ C T E +M +PYASA+GSIMYA+ CTRPDV++ + TSRYQ +P H Sbjct: 994 GINLNKNQCLQTTNEQNKMSVIPYASAIGSIMYAMLCTRPDVSYVLSATSRYQSDPSESH 1053 Query: 248 WQAVKNILKYLRATKEMFLVYGGNPDAELDVTGFCDASFQTDKDDTKSQTGYVFVINGGA 69 W AVKNILKYLR TK+MFLVYGG EL V G+ DASFQTDKDD +SQ+G+VF +NG A Sbjct: 1054 WIAVKNILKYLRRTKDMFLVYGG--QEELVVNGYTDASFQTDKDDFRSQSGFVFCLNGSA 1111 Query: 68 VDWKSKKQTTIAMSATEAEYIA 3 V WKS KQ T A S TEAEYIA Sbjct: 1112 VSWKSSKQDTAANSTTEAEYIA 1133