BLASTX nr result
ID: Ophiopogon23_contig00011280
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ophiopogon23_contig00011280 (963 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PPD88818.1| hypothetical protein GOBAR_DD14224 [Gossypium bar... 210 1e-58 gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposo... 193 4e-57 gb|ABD63105.1| polyprotein-like, related [Asparagus officinalis] 194 8e-57 gb|PHT53674.1| hypothetical protein CQW23_08136 [Capsicum baccatum] 189 3e-54 gb|KYP58145.1| Retrovirus-related Pol polyprotein from transposo... 184 4e-53 gb|PHT55765.1| hypothetical protein CQW23_04251 [Capsicum baccatum] 172 4e-49 gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar... 173 6e-47 gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar... 173 6e-47 gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo... 159 1e-42 gb|KJB48829.1| hypothetical protein B456_008G089200 [Gossypium r... 151 7e-42 gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao] 152 1e-40 gb|KYP36635.1| Retrovirus-related Pol polyprotein from transposo... 155 1e-40 gb|KYP37021.1| Retrovirus-related Pol polyprotein from transposo... 152 6e-40 gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [... 147 2e-39 gb|ABO36622.1| copia LTR rider [Solanum lycopersicum] >gi|133711... 155 3e-38 gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo... 148 4e-38 gb|PNX71218.1| putative retrotransposon Ty1-copia subclass prote... 141 3e-37 dbj|GAU44261.1| hypothetical protein TSUD_400070 [Trifolium subt... 149 8e-37 gb|OMO83367.1| Integrase, catalytic core [Corchorus capsularis] 149 1e-36 gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposo... 142 3e-36 >gb|PPD88818.1| hypothetical protein GOBAR_DD14224 [Gossypium barbadense] Length = 718 Score = 210 bits (534), Expect = 1e-58 Identities = 110/268 (41%), Positives = 163/268 (60%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P ++ +WEELD+KAL+AIQLCL + VL V+ EKT+ +LW RL+ Y KSLANR++L Sbjct: 311 PENLNQTEWEELDEKALSAIQLCLTNTVLQDVLMEKTSFALWKRLETLYATKSLANRLVL 370 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRLF RM EG + HI++F +++N+L +EV I+DEDQA+LLLCSLPS+YKSF++ + Sbjct: 371 KQRLFTFRMNEGELLKDHISQFITLLNDLKNVEVHIDDEDQAMLLLCSLPSSYKSFKETL 430 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKSCS 266 IY GK+ L +V+ HLL++DK+D + + D +++ KK ++R C Sbjct: 431 IY-GKDKLSFEDVKGHLLSRDKLDNEFDLNSKADRQASVLVASKKREKR---------CR 480 Query: 265 FCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSVTDSLVS 86 +CK+ GHV+ ++ D GD+ ++ S +D+ Sbjct: 481 YCKKLGHVK--------ADYYKLRNKRAAESNEEDVVGANLVDEGGDDFLLVSTSDN-SK 531 Query: 85 NDDDWIVDSGCSKHISPKAETFSTYTSV 2 +WI+DSGCS H+ P E FSTY+SV Sbjct: 532 LTSEWILDSGCSFHMYPNREWFSTYSSV 559 >gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 690 Score = 193 bits (490), Expect(2) = 4e-57 Identities = 105/275 (38%), Positives = 159/275 (57%), Gaps = 7/275 (2%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P +M D W+ELD+KAL+AIQLCL+ EVL +V NE TAA+LW +L+ Y+ KSLAN++ L Sbjct: 45 PVNMTDEQWDELDEKALSAIQLCLSKEVLREVANETTAAALWLKLESLYMTKSLANKLRL 104 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 K+RL+ +RM EG P+ SH+ EF SII +L+ IE+KI+DED+A+LL+ SLPSTYK F++ + Sbjct: 105 KERLYTIRMVEGTPIQSHLNEFNSIIMDLENIEIKIDDEDKAVLLIVSLPSTYKHFKEIM 164 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKG-------KQRSSTN 287 +Y ++L +V+ +LL+K+K D + G+ S ++KG + +S Sbjct: 165 LYSNNDTLSFEDVKSNLLSKEKFDLDIHSEDKGEGLSVRGRTQEKGSTSNKKSRSKSRGR 224 Query: 286 SEVKSCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCS 107 K+C +CK+ GH E + D +V+ S Sbjct: 225 KSNKTCRYCKKFGH--------DISDCFILKKKQERQEKGKNPAEAANVETDSDGDVMIS 276 Query: 106 VTDSLVSNDDDWIVDSGCSKHISPKAETFSTYTSV 2 V+ S +WI+DSGC+ H+ P + F+T V Sbjct: 277 VSSDKRSK-TEWILDSGCTFHMCPYKDLFTTLEPV 310 Score = 58.2 bits (139), Expect(2) = 4e-57 Identities = 28/40 (70%), Positives = 33/40 (82%) Frame = -2 Query: 926 MSTAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807 MST KFDI+KFDG I FS W+V+M A+LTQ+GLKKAL G Sbjct: 1 MSTVTKFDIEKFDGKICFSIWKVQMKAVLTQNGLKKALDG 40 >gb|ABD63105.1| polyprotein-like, related [Asparagus officinalis] Length = 289 Score = 194 bits (494), Expect = 8e-57 Identities = 98/193 (50%), Positives = 135/193 (69%), Gaps = 6/193 (3%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P M + DW +L+D+ALT IQLCL +EV+ +V+ EKT A LWS+L+D YL KSL NR++L Sbjct: 37 PVTMTEEDWNQLEDRALTTIQLCLTNEVMQEVLTEKTTADLWSKLEDLYLMKSLTNRLLL 96 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRL+ L M EG P+ SHI EF S+I L KI+VKI DEDQALLLLCSL +YK FRD + Sbjct: 97 KQRLYTLWMSEGTPIKSHIGEFNSVITYLSKIDVKINDEDQALLLLCSLLPSYKHFRDTM 156 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKS-- 272 IY G++S+ I + +E+LLN +KI+K+LT S +GD + L+ + + ++R + KS Sbjct: 157 IY-GRDSIGIKDAKENLLNTEKINKELTTSESGDKADGLFVR-GRSEERDFGGNMYKSKH 214 Query: 271 ----CSFCKRTGH 245 C +CK+ H Sbjct: 215 RNLTCRYCKKKEH 227 >gb|PHT53674.1| hypothetical protein CQW23_08136 [Capsicum baccatum] Length = 707 Score = 189 bits (481), Expect(2) = 3e-54 Identities = 101/271 (37%), Positives = 154/271 (56%), Gaps = 4/271 (1%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P+ + WEE D+KAL+ IQLCL+ EVL +VINEKTAA +WS+L+ Y+ KSLAN++ L Sbjct: 43 PATTTEEQWEETDEKALSTIQLCLSREVLREVINEKTAAGIWSKLESLYMTKSLANKLRL 102 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 K+RLF LRM EG P+ SH+ EF SII +L+ ++V+I+DED+A+LL+ SLP +Y+ F++ + Sbjct: 103 KERLFTLRMSEGTPIQSHLGEFNSIIIDLENLDVEIDDEDKAVLLIVSLPPSYRHFKEIM 162 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKG----KQRSSTNSEV 278 +YG +L ++V+ +LL+K+K D Q+ +G+ + G Q S Sbjct: 163 LYGNNVTLSFDDVKSNLLSKEKFDTQIHSESSGEGLVVRGRNHEVGASGKNQSKSKGKNS 222 Query: 277 KSCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSVTD 98 K C +CK+ H + V E + S T+ Sbjct: 223 KYCRYCKKRNH----EISECYKLKNKQNREDKGKQPEKSAEASFVETESDGECFIASGTE 278 Query: 97 SLVSNDDDWIVDSGCSKHISPKAETFSTYTS 5 N +W++DSGC+ H+SP + F+TY S Sbjct: 279 QRSKN--EWVLDSGCTFHMSPNRDWFTTYES 307 Score = 52.0 bits (123), Expect(2) = 3e-54 Identities = 26/40 (65%), Positives = 33/40 (82%) Frame = -2 Query: 926 MSTAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807 MST KFDI+KFDG I+F+ W+V+M A+LTQ+GLKK L G Sbjct: 1 MST--KFDIEKFDGKISFAIWRVQMLAVLTQNGLKKVLSG 38 >gb|KYP58145.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 262 Score = 184 bits (467), Expect = 4e-53 Identities = 88/194 (45%), Positives = 132/194 (68%), Gaps = 7/194 (3%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P +M D W+ELD+KAL+AIQLCL+ EVL +V NE TAA+LW +L+ Y+ KSLAN++ L Sbjct: 21 PVNMTDEQWDELDEKALSAIQLCLSKEVLREVANETTAAALWLKLESLYMTKSLANKLRL 80 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 K+RL+ +RM EG P+ SH+ EF SII +L+ IE+KI+DED+A+LL+ SLPSTYK F++ + Sbjct: 81 KERLYTIRMVEGTPIQSHLNEFNSIIMDLENIEIKIDDEDKAVLLIVSLPSTYKHFKEIM 140 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKG-------KQRSSTN 287 +Y ++L +V+ +LL+K+K D + G+ S ++KG + +S Sbjct: 141 LYSNNDTLSFEDVKSNLLSKEKFDLDIHSEDKGEGLSVRGRTQEKGSTSNKKSRSKSRRR 200 Query: 286 SEVKSCSFCKRTGH 245 K+C +CK+ GH Sbjct: 201 KTNKTCRYCKKFGH 214 >gb|PHT55765.1| hypothetical protein CQW23_04251 [Capsicum baccatum] Length = 553 Score = 172 bits (436), Expect(2) = 4e-49 Identities = 83/182 (45%), Positives = 127/182 (69%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P+ + WEE+D+KAL+ IQLCL+ EVL +VINEKTAA +WS+L+ Y+ KSLAN++ L Sbjct: 43 PATTTEEQWEEMDEKALSIIQLCLSREVLREVINEKTAAGIWSKLESLYMTKSLANKLRL 102 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 K+RLF LRM EG P+ SH+ EF SII +L+ ++V+I+DED+A+LL+ SLP +Y+ F+ + Sbjct: 103 KERLFTLRMSEGTPIQSHLGEFNSIIIDLENLDVEIDDEDKAVLLIVSLPPSYRHFKKIM 162 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKSCS 266 +YG +L ++V+ +LL+K+K D Q+ +G+ L K + K +S ++ KS Sbjct: 163 LYGNNVTLSFDDVKSNLLSKEKFDTQIHSESSGE---GLVVKWRNHKVGASGKNQSKSRG 219 Query: 265 FC 260 C Sbjct: 220 LC 221 Score = 52.4 bits (124), Expect(2) = 4e-49 Identities = 23/37 (62%), Positives = 32/37 (86%) Frame = -2 Query: 917 AIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807 ++KFDI+KFDG I+F+ W+V+M A+LTQ+GLKK L G Sbjct: 2 SMKFDIEKFDGKISFAIWRVQMLAVLTQNGLKKVLSG 38 >gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense] Length = 2351 Score = 173 bits (438), Expect(2) = 6e-47 Identities = 100/274 (36%), Positives = 157/274 (57%), Gaps = 9/274 (3%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 PS + + +++ ++A +AI LCL DEVL +V +EKTA+ LW RL+ Y+ KSL NR+ L Sbjct: 547 PSTLSEEQKDDMLERAHSAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYL 606 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRL+ L+M+EG PV+ H+ +F SII +L+ I+ KI+DEDQA+++LCSLP +Y++F D + Sbjct: 607 KQRLYALKMEEGTPVSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTM 666 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGS--PNGDDSSALYAKEKKGKQRSSTNSEVKS 272 +Y G++ L + EV+ + L+ ++ K++TG N + + K K SS+ S +S Sbjct: 667 MY-GRDDLTLEEVK-NALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRS 724 Query: 271 -------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVV 113 C +CK+ GH++ AD D E+V Sbjct: 725 QSKKRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVAD--------ADSSSDAEIV 776 Query: 112 CSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11 +V+DS WI+D+G + HIS + FSTY Sbjct: 777 LAVSDSYAGG--RWILDTGATFHISTSKDAFSTY 808 Score = 44.3 bits (103), Expect(2) = 6e-47 Identities = 18/38 (47%), Positives = 28/38 (73%) Frame = -2 Query: 920 TAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807 ++ K+D++KF G +FS W+++M A+L Q GL KAL G Sbjct: 505 SSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSG 542 >gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense] Length = 1841 Score = 173 bits (438), Expect(2) = 6e-47 Identities = 100/274 (36%), Positives = 157/274 (57%), Gaps = 9/274 (3%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 PS + + +++ ++A +AI LCL DEVL +V +EKTA+ LW RL+ Y+ KSL NR+ L Sbjct: 568 PSTLSEEQKDDMLERAHSAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYL 627 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRL+ L+M+EG PV+ H+ +F SII +L+ I+ KI+DEDQA+++LCSLP +Y++F D + Sbjct: 628 KQRLYALKMEEGTPVSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTM 687 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGS--PNGDDSSALYAKEKKGKQRSSTNSEVKS 272 +Y G++ L + EV+ + L+ ++ K++TG N + + K K SS+ S +S Sbjct: 688 MY-GRDDLTLEEVK-NALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRS 745 Query: 271 -------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVV 113 C +CK+ GH++ AD D E+V Sbjct: 746 QSKKRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVAD--------ADSSSDAEIV 797 Query: 112 CSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11 +V+DS WI+D+G + HIS + FSTY Sbjct: 798 LAVSDSYAGG--RWILDTGATFHISTSKDAFSTY 829 Score = 44.3 bits (103), Expect(2) = 6e-47 Identities = 18/38 (47%), Positives = 28/38 (73%) Frame = -2 Query: 920 TAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807 ++ K+D++KF G +FS W+++M A+L Q GL KAL G Sbjct: 526 SSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSG 563 >gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 337 Score = 159 bits (402), Expect = 1e-42 Identities = 93/276 (33%), Positives = 154/276 (55%), Gaps = 8/276 (2%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 PS + D + ++L KA + I L L DEVL +V EK+AA +W +L+ Y+ KSL N++ L Sbjct: 43 PSTLSDKEKKDLLSKAHSTIILSLGDEVLREVAEEKSAAGIWLKLESLYMTKSLTNKLYL 102 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 K+RL L+M+EG+ + H++ FT + +L ++V+I++EDQA++LLCSLPS++++ D + Sbjct: 103 KKRLHQLKMEEGSSIKEHVSLFTKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTM 162 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAK--------EKKGKQRSST 290 ++ G+++L + EV+ L +++ K G D AL A+ + K K+RS Sbjct: 163 LF-GRDTLTLEEVKATLNSRELKKKITENKGEGGDPEALMARGRLEKRDSKSKNKRRSKY 221 Query: 289 NSEVKSCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVC 110 +E K+C +CK+ GH R VAD EV Sbjct: 222 KNE-KACYYCKKEGHFRKECPERKKKNNGKYNDESDIAV---------VADGYESAEV-- 269 Query: 109 SVTDSLVSNDDDWIVDSGCSKHISPKAETFSTYTSV 2 ++ S + ++WI+DSGCS H++P E FS+Y + Sbjct: 270 -LSISTKKHSEEWILDSGCSFHMTPNLEWFSSYKEI 304 >gb|KJB48829.1| hypothetical protein B456_008G089200 [Gossypium raimondii] Length = 164 Score = 151 bits (381), Expect(2) = 7e-42 Identities = 69/123 (56%), Positives = 95/123 (77%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P ++ WEELD KAL+ IQLCLA+ VL +V+ EKT+++LW RL+ Y KSLANR++L Sbjct: 42 PENLNKTKWEELDGKALSVIQLCLANTVLQEVLMEKTSSALWKRLETLYATKSLANRLVL 101 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQ LF RM EG + HI++F +++N+L K+EV I+DEDQA+LLLCSLP +YKSF++ + Sbjct: 102 KQHLFTFRMNEGEILRDHISQFITLLNDLKKVEVHIDDEDQAMLLLCSLPPSYKSFKEIL 161 Query: 445 IYG 437 IYG Sbjct: 162 IYG 164 Score = 49.3 bits (116), Expect(2) = 7e-42 Identities = 22/36 (61%), Positives = 27/36 (75%) Frame = -2 Query: 920 TAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKAL 813 T +F+I+KFDG NF+ WQVRM AIL Q GLKK + Sbjct: 2 TTTRFEIEKFDGETNFNLWQVRMMAILVQSGLKKVV 37 >gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao] Length = 277 Score = 152 bits (385), Expect = 1e-40 Identities = 79/194 (40%), Positives = 130/194 (67%), Gaps = 5/194 (2%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 PS++ D++ ++L +KA +AI L L+DEVL +V +E++AA++W +L+ Y+ KSL NR+ + Sbjct: 46 PSNLSDSEKDDLMEKAHSAILLTLSDEVLREVTDEESAAAMWFKLESIYITKSLTNRLYM 105 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRL+ L+M EG VN+HI EF +I +L I+VKIEDED AL+LLC LP +Y++F D + Sbjct: 106 KQRLYTLKMSEGTSVNTHIDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTM 165 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQ-----RSSTNSE 281 +Y G+++L +VR + LN ++ K++ G N + + L +GK+ + + ++ Sbjct: 166 LY-GRDTLTFEDVRAY-LNSKELKKKVGGIRNENQAEGLVVNRGRGKEKGLDKKGKSRAK 223 Query: 280 VKSCSFCKRTGHVR 239 K+C C + GH R Sbjct: 224 GKTCWNCGQKGHFR 237 >gb|KYP36635.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 364 Score = 155 bits (391), Expect = 1e-40 Identities = 95/271 (35%), Positives = 143/271 (52%), Gaps = 7/271 (2%) Frame = -3 Query: 802 SDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILK 623 S K + E ++KA + I L L+DEVL +V +E+TA+ LW +L+ Y+ KS+ N+++LK Sbjct: 56 SASKIEELAEQEEKAHSLILLSLSDEVLYEVADEETASGLWCKLEKLYMTKSICNKLLLK 115 Query: 622 QRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAII 443 +RLF L M+EG P+ H+ E S++ EL I+VKIEDED A++LL SLP +Y+SF +++ Sbjct: 116 RRLFGLHMKEGTPLKDHLDELNSVLMELRDIDVKIEDEDAAMILLASLPPSYESFVNSLS 175 Query: 442 YGGKESLKINEV------REHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSE 281 GKE + + EV RE L ++ GS +S K+KK K + TN Sbjct: 176 V-GKECITMEEVKSSLHSREFRLRASGNSEESNGSSLVVSNSGKNMKKKKDKSKRKTNVN 234 Query: 280 VKS-CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSV 104 K C++CK GH + + + E+V S+ Sbjct: 235 PKDICNYCKEPGHWKKDCPKKKGKPSAAVAK----------------EESTSENELVLSI 278 Query: 103 TDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11 D ++D WI+DSGCS H+ P F TY Sbjct: 279 ADQPQHSEDQWILDSGCSFHMCPNRTWFDTY 309 >gb|KYP37021.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 352 Score = 152 bits (385), Expect = 6e-40 Identities = 94/271 (34%), Positives = 142/271 (52%), Gaps = 7/271 (2%) Frame = -3 Query: 802 SDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILK 623 S K + E ++KA + I L L+DEVL +V +E+TA+ LW +L+ Y+ KS+ N+++LK Sbjct: 56 SASKIEELAEQEEKAHSLILLSLSDEVLYEVADEETASGLWCKLEKLYMTKSICNKLLLK 115 Query: 622 QRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAII 443 +RLF L M+EG P+ H+ E S++ EL I+VKIEDED A++LL LP +Y+SF +++ Sbjct: 116 RRLFGLHMKEGTPLKDHLDELNSVLMELRDIDVKIEDEDAAMILLAYLPPSYESFVNSLS 175 Query: 442 YGGKESLKINEV------REHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSE 281 GKE + + EV RE L ++ GS +S K+KK K + TN Sbjct: 176 V-GKECITMEEVKSSLHSREFRLRASGNSEESNGSSLVVSNSGKNMKKKKDKSKRKTNVN 234 Query: 280 VKS-CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSV 104 K C++CK GH + + + E+V S+ Sbjct: 235 PKDICNYCKEPGHWKKDCPKKKGKPSAAVAK----------------EESTSENELVLSI 278 Query: 103 TDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11 D ++D WI+DSGCS H+ P F TY Sbjct: 279 ADQPQHSEDQWILDSGCSFHMCPNRTWFDTY 309 >gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [Theobroma cacao] Length = 1029 Score = 147 bits (371), Expect(2) = 2e-39 Identities = 78/194 (40%), Positives = 126/194 (64%), Gaps = 5/194 (2%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 PS++ D + ++L KA + I L L+DEVL +V +E++AA++W +L+ Y+ KSL NR+ + Sbjct: 168 PSNLSDGEKDDLMKKAHSVILLALSDEVLREVTDEESAAAVWFKLESIYMTKSLTNRLYM 227 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRL+ L+M EG VN+HI EF +I +L I+VKIEDED AL+LLC LP +Y++F D + Sbjct: 228 KQRLYTLKMSEGTSVNTHIDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTM 287 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQ-----RSSTNSE 281 +Y G+++L +VR LN ++ K++ G N + + L +GK+ + + ++ Sbjct: 288 LY-GRDTLTFEDVRAS-LNFKELKKKVGGIRNENQAEGLVVNRGRGKEKGLDRKGKSRAK 345 Query: 280 VKSCSFCKRTGHVR 239 K+C C + GH R Sbjct: 346 GKTCWNCGQKGHFR 359 Score = 45.1 bits (105), Expect(2) = 2e-39 Identities = 22/43 (51%), Positives = 33/43 (76%), Gaps = 1/43 (2%) Frame = -2 Query: 932 IAMSTA-IKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807 +AM+T+ K++I+KF+G +FS W+V+M A+L Q GL KAL G Sbjct: 121 LAMATSSTKYEIEKFNGRNDFSLWRVKMRALLVQQGLLKALKG 163 >gb|ABO36622.1| copia LTR rider [Solanum lycopersicum] gb|ABO36636.1| copia LTR rider [Solanum lycopersicum] Length = 1307 Score = 155 bits (391), Expect = 3e-38 Identities = 89/265 (33%), Positives = 142/265 (53%), Gaps = 8/265 (3%) Frame = -3 Query: 772 LDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILKQRLFLLRMQE 593 L++KA + I LCLAD+V+ +V +E+TAA LW +L+ Y+ KSL N+++LKQRLF LRM E Sbjct: 52 LEEKAHSTIMLCLADDVITEVSDEETAAGLWLKLESLYMTKSLTNKLLLKQRLFGLRMAE 111 Query: 592 GAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAIIYGGKESLKIN 413 G + H+ + +++ EL I+VKIEDED AL+LL SLP ++++F + I GK+++ + Sbjct: 112 GTQLREHLEQLNTLLLELRNIDVKIEDEDAALILLVSLPMSFENFVQSFIV-GKDTVSLE 170 Query: 412 EVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKS--------CSFCK 257 EVR L +++ + + G+ S L+ +KG++ ++ S C++CK Sbjct: 171 EVRSALHSRE-LRHKANGTSTDIQPSGLFTSSRKGRKNGGKKNKPMSKGAKPDDVCNYCK 229 Query: 256 RTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSVTDSLVSNDD 77 GH + + EE + V D + D Sbjct: 230 EKGHWKFDCPKKKKQSEKQSVSAAV------------AEEDTNSEEDIALVADEHTHHSD 277 Query: 76 DWIVDSGCSKHISPKAETFSTYTSV 2 W++DSG S HI P+ E F+TY V Sbjct: 278 VWVLDSGASYHICPRREWFTTYEQV 302 >gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 780 Score = 148 bits (374), Expect(2) = 4e-38 Identities = 88/275 (32%), Positives = 144/275 (52%), Gaps = 12/275 (4%) Frame = -3 Query: 802 SDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILK 623 S + + + +KA +AI LCL D+ L +V EKTAA++W +L+ Y+ KSLA+R+ LK Sbjct: 45 SSLTQKEKTNMIEKARSAIILCLGDKALREVAREKTAAAMWLKLESLYMTKSLAHRLCLK 104 Query: 622 QRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAII 443 QRL+ +M E + +AEF I+++L+ IEV++EDED+ALLLL SLP Y+ F+DAI+ Sbjct: 105 QRLYSFKMTETKSIVDQLAEFNKILDDLENIEVQLEDEDKALLLLNSLPRNYEHFKDAIL 164 Query: 442 YGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAK---EKKGKQRSSTNSEVKS 272 YG ++ + ++EV+ + K+ + +Q + + S ++ EKKG+ + + KS Sbjct: 165 YGKEQDITLDEVQTSIRTKE-LQRQQDNKTDDNGESLNVSRGRSEKKGQSQKGKKARSKS 223 Query: 271 ---------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEE 119 C +C + GH + ++D + Sbjct: 224 KIGDRSKFKCFYCHKVGHFK----------KNCPERNRDQKSSADSADIAAISDGYESAD 273 Query: 118 VVCSVTDSLVSNDDDWIVDSGCSKHISPKAETFST 14 V+ T DW++DSGCS H+ PK + F T Sbjct: 274 VLVVTTS---QTQKDWVMDSGCSYHMCPKKDYFET 305 Score = 39.3 bits (90), Expect(2) = 4e-38 Identities = 16/35 (45%), Positives = 24/35 (68%) Frame = -2 Query: 911 KFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807 K+DI+KF G +F W+++M AIL Q G +A+ G Sbjct: 5 KYDIEKFSGENDFGLWRIKMEAILIQQGCAEAIKG 39 >gb|PNX71218.1| putative retrotransposon Ty1-copia subclass protein [Trifolium pratense] gb|PNY02521.1| putative retrotransposon Ty1-copia subclass protein [Trifolium pratense] Length = 257 Score = 141 bits (355), Expect(2) = 3e-37 Identities = 70/195 (35%), Positives = 114/195 (58%), Gaps = 8/195 (4%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P+DM D DW EL +KA I+LC++DEV+ +++ + + +L+ Y+ K+ NR+ Sbjct: 45 PTDMADDDWLELQEKAAGLIRLCVSDEVMYHILDLTSPKEVLDKLESQYISKTRMNRLFT 104 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 K RL+ L+M+EG+ + H+ F +II EL K+ VKI+DED A++LLCSLPS+YK + + Sbjct: 105 KMRLYSLKMREGSDLQQHVNTFNNIITELVKLGVKIDDEDSAIMLLCSLPSSYKHLVNTL 164 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNG--------DDSSALYAKEKKGKQRSST 290 IY GK+++ +N + LL+ ++ + + G D + K GK S Sbjct: 165 IY-GKDTISLNVITATLLSHSRMSQNVEVGTQGKGLYVKGSQDHGQIKGKADSGKMSKSK 223 Query: 289 NSEVKSCSFCKRTGH 245 N ++ C CK+ GH Sbjct: 224 NRKIAECYSCKQIGH 238 Score = 43.9 bits (102), Expect(2) = 3e-37 Identities = 17/40 (42%), Positives = 29/40 (72%) Frame = -2 Query: 932 IAMSTAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKAL 813 +A+ + +KF++++FDG NF W+ R+ +L Q GL+KAL Sbjct: 1 MAIDSGVKFEVERFDGTGNFRLWERRVKDLLAQQGLQKAL 40 >dbj|GAU44261.1| hypothetical protein TSUD_400070 [Trifolium subterraneum] Length = 635 Score = 149 bits (376), Expect = 8e-37 Identities = 96/279 (34%), Positives = 145/279 (51%), Gaps = 15/279 (5%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P+++ D + E+DDKAL+AI LCLAD+VL +V EK+AA++W++L Y+ KSLA++ L Sbjct: 185 PTNLSDTEKAEMDDKALSAIILCLADKVLREVAKEKSAAAMWAKLDKLYMTKSLAHKQCL 244 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQ+L+ RM E V+ ++EF II++L I+VKIEDEDQA LLC+LP + + DA+ Sbjct: 245 KQQLYFFRMVENKSVSEQLSEFNKIIDDLANIDVKIEDEDQAFHLLCALPKSLEHLNDAL 304 Query: 445 IYGGKESLKINEVREHL-------LNKDKIDKQLTG----SPNGDDSSALYAKEKKGKQR 299 IYG + ++ ++EV+ L LN+ KID G D+ K+ + K R Sbjct: 305 IYGKEGTITLDEVQAALRTKELIKLNELKIDDSGEGLNVTRGRSDNRGKGKGKKHRSKSR 364 Query: 298 SSTNSEVK-SCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDE 122 + + K C C GH + + V +E Sbjct: 365 AKGDGGSKFKCYHCHEPGHFK------------------KDCPQRRGSDSSSAQIAVSEE 406 Query: 121 EVVCSVTDSLVSN---DDDWIVDSGCSKHISPKAETFST 14 E S V++ + W++DSGCS HI P + F T Sbjct: 407 EGYESAGALTVTSWEPEKSWVMDSGCSYHICPSKKYFET 445 >gb|OMO83367.1| Integrase, catalytic core [Corchorus capsularis] Length = 785 Score = 149 bits (377), Expect = 1e-36 Identities = 87/275 (31%), Positives = 144/275 (52%), Gaps = 10/275 (3%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P D + +E++ KA +AI L L++EVL +V+ EK ASLW L D Y+KKSLANR+ Sbjct: 20 PEKSTDKEIKEINSKAHSAILLSLSNEVLREVVAEKDTASLWKALDDKYMKKSLANRLFQ 79 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRL+ +M E P+ H+ F II +L + VKIEDED AL+LL SLP ++++FRD + Sbjct: 80 KQRLYTFKMVENTPIKDHLDSFNRIILDLGGVRVKIEDEDLALILLFSLPRSFQNFRDTM 139 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKS-- 272 +Y G++++ + +V++ LL+K+ +K S + D + L + K++SS + +S Sbjct: 140 LY-GRDTIALKDVKDALLSKELQNKV---SADVDGEAGLIVTRGRNKEKSSGTTRFRSRS 195 Query: 271 --------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEV 116 C +C GH+R + + + DE Sbjct: 196 KSRVSRLRCFYCNEKGHLRKDCPDRKKGNSSEKMESNVKAMVAIVQEGSSLVETSDDEVG 255 Query: 115 VCSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11 +T S + + W++D+ S H++ F+T+ Sbjct: 256 TDVLTVSTTGSANTWVLDTSASYHMTFSRNLFTTF 290 >gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 325 Score = 142 bits (358), Expect = 3e-36 Identities = 91/275 (33%), Positives = 143/275 (52%), Gaps = 10/275 (3%) Frame = -3 Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626 P+ M D + + +KA + I L L DEVL +V E TAA +W L+D + KKSL NR+ Sbjct: 44 PTTMTDDVKKAMLEKAHSLILLSLTDEVLREVGEETTAAGMWKMLEDKFQKKSLTNRLYQ 103 Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446 KQRL+ L+M E V H+ F II +L I VK++DED A++LLCSLP +Y++F D + Sbjct: 104 KQRLYTLQMSENMSVRDHLDNFNRIILDLQSIGVKVDDEDLAIILLCSLPKSYENFIDTM 163 Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAK--------EKKGKQRSST 290 +Y G++S+ +N V++ L +K K+ +++ S N DD ++ KG RS++ Sbjct: 164 LY-GRDSITLNNVKDSLQSK-KLKRRVVSSSNVDDVGLTVSRGRSMERGNSSKGHTRSNS 221 Query: 289 NSEVKS--CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEV 116 S+ K C CK GH+R + D GD Sbjct: 222 LSKSKKVRCYKCKEVGHIRKNCPQLKKNRNSNASAAVVRSSATVSSESSDEGD-GGD--- 277 Query: 115 VCSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11 +T S + D W++D+G S H++ + F+++ Sbjct: 278 --VLTVSTIGFADTWVIDTGASYHMTFNRKLFNSF 310