BLASTX nr result

ID: Ophiopogon23_contig00011280 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon23_contig00011280
         (963 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PPD88818.1| hypothetical protein GOBAR_DD14224 [Gossypium bar...   210   1e-58
gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposo...   193   4e-57
gb|ABD63105.1| polyprotein-like, related [Asparagus officinalis]      194   8e-57
gb|PHT53674.1| hypothetical protein CQW23_08136 [Capsicum baccatum]   189   3e-54
gb|KYP58145.1| Retrovirus-related Pol polyprotein from transposo...   184   4e-53
gb|PHT55765.1| hypothetical protein CQW23_04251 [Capsicum baccatum]   172   4e-49
gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar...   173   6e-47
gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar...   173   6e-47
gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo...   159   1e-42
gb|KJB48829.1| hypothetical protein B456_008G089200 [Gossypium r...   151   7e-42
gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao]   152   1e-40
gb|KYP36635.1| Retrovirus-related Pol polyprotein from transposo...   155   1e-40
gb|KYP37021.1| Retrovirus-related Pol polyprotein from transposo...   152   6e-40
gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [...   147   2e-39
gb|ABO36622.1| copia LTR rider [Solanum lycopersicum] >gi|133711...   155   3e-38
gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo...   148   4e-38
gb|PNX71218.1| putative retrotransposon Ty1-copia subclass prote...   141   3e-37
dbj|GAU44261.1| hypothetical protein TSUD_400070 [Trifolium subt...   149   8e-37
gb|OMO83367.1| Integrase, catalytic core [Corchorus capsularis]       149   1e-36
gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposo...   142   3e-36

>gb|PPD88818.1| hypothetical protein GOBAR_DD14224 [Gossypium barbadense]
          Length = 718

 Score =  210 bits (534), Expect = 1e-58
 Identities = 110/268 (41%), Positives = 163/268 (60%)
 Frame = -3

Query: 805  PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
            P ++   +WEELD+KAL+AIQLCL + VL  V+ EKT+ +LW RL+  Y  KSLANR++L
Sbjct: 311  PENLNQTEWEELDEKALSAIQLCLTNTVLQDVLMEKTSFALWKRLETLYATKSLANRLVL 370

Query: 625  KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
            KQRLF  RM EG  +  HI++F +++N+L  +EV I+DEDQA+LLLCSLPS+YKSF++ +
Sbjct: 371  KQRLFTFRMNEGELLKDHISQFITLLNDLKNVEVHIDDEDQAMLLLCSLPSSYKSFKETL 430

Query: 445  IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKSCS 266
            IY GK+ L   +V+ HLL++DK+D +   +   D  +++    KK ++R         C 
Sbjct: 431  IY-GKDKLSFEDVKGHLLSRDKLDNEFDLNSKADRQASVLVASKKREKR---------CR 480

Query: 265  FCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSVTDSLVS 86
            +CK+ GHV+                              ++ D  GD+ ++ S +D+   
Sbjct: 481  YCKKLGHVK--------ADYYKLRNKRAAESNEEDVVGANLVDEGGDDFLLVSTSDN-SK 531

Query: 85   NDDDWIVDSGCSKHISPKAETFSTYTSV 2
               +WI+DSGCS H+ P  E FSTY+SV
Sbjct: 532  LTSEWILDSGCSFHMYPNREWFSTYSSV 559


>gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 690

 Score =  193 bits (490), Expect(2) = 4e-57
 Identities = 105/275 (38%), Positives = 159/275 (57%), Gaps = 7/275 (2%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P +M D  W+ELD+KAL+AIQLCL+ EVL +V NE TAA+LW +L+  Y+ KSLAN++ L
Sbjct: 45  PVNMTDEQWDELDEKALSAIQLCLSKEVLREVANETTAAALWLKLESLYMTKSLANKLRL 104

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           K+RL+ +RM EG P+ SH+ EF SII +L+ IE+KI+DED+A+LL+ SLPSTYK F++ +
Sbjct: 105 KERLYTIRMVEGTPIQSHLNEFNSIIMDLENIEIKIDDEDKAVLLIVSLPSTYKHFKEIM 164

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKG-------KQRSSTN 287
           +Y   ++L   +V+ +LL+K+K D  +     G+  S     ++KG       + +S   
Sbjct: 165 LYSNNDTLSFEDVKSNLLSKEKFDLDIHSEDKGEGLSVRGRTQEKGSTSNKKSRSKSRGR 224

Query: 286 SEVKSCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCS 107
              K+C +CK+ GH                              E    +   D +V+ S
Sbjct: 225 KSNKTCRYCKKFGH--------DISDCFILKKKQERQEKGKNPAEAANVETDSDGDVMIS 276

Query: 106 VTDSLVSNDDDWIVDSGCSKHISPKAETFSTYTSV 2
           V+    S   +WI+DSGC+ H+ P  + F+T   V
Sbjct: 277 VSSDKRSK-TEWILDSGCTFHMCPYKDLFTTLEPV 310



 Score = 58.2 bits (139), Expect(2) = 4e-57
 Identities = 28/40 (70%), Positives = 33/40 (82%)
 Frame = -2

Query: 926 MSTAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807
           MST  KFDI+KFDG I FS W+V+M A+LTQ+GLKKAL G
Sbjct: 1   MSTVTKFDIEKFDGKICFSIWKVQMKAVLTQNGLKKALDG 40


>gb|ABD63105.1| polyprotein-like, related [Asparagus officinalis]
          Length = 289

 Score =  194 bits (494), Expect = 8e-57
 Identities = 98/193 (50%), Positives = 135/193 (69%), Gaps = 6/193 (3%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P  M + DW +L+D+ALT IQLCL +EV+ +V+ EKT A LWS+L+D YL KSL NR++L
Sbjct: 37  PVTMTEEDWNQLEDRALTTIQLCLTNEVMQEVLTEKTTADLWSKLEDLYLMKSLTNRLLL 96

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           KQRL+ L M EG P+ SHI EF S+I  L KI+VKI DEDQALLLLCSL  +YK FRD +
Sbjct: 97  KQRLYTLWMSEGTPIKSHIGEFNSVITYLSKIDVKINDEDQALLLLCSLLPSYKHFRDTM 156

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKS-- 272
           IY G++S+ I + +E+LLN +KI+K+LT S +GD +  L+ +  + ++R    +  KS  
Sbjct: 157 IY-GRDSIGIKDAKENLLNTEKINKELTTSESGDKADGLFVR-GRSEERDFGGNMYKSKH 214

Query: 271 ----CSFCKRTGH 245
               C +CK+  H
Sbjct: 215 RNLTCRYCKKKEH 227


>gb|PHT53674.1| hypothetical protein CQW23_08136 [Capsicum baccatum]
          Length = 707

 Score =  189 bits (481), Expect(2) = 3e-54
 Identities = 101/271 (37%), Positives = 154/271 (56%), Gaps = 4/271 (1%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P+   +  WEE D+KAL+ IQLCL+ EVL +VINEKTAA +WS+L+  Y+ KSLAN++ L
Sbjct: 43  PATTTEEQWEETDEKALSTIQLCLSREVLREVINEKTAAGIWSKLESLYMTKSLANKLRL 102

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           K+RLF LRM EG P+ SH+ EF SII +L+ ++V+I+DED+A+LL+ SLP +Y+ F++ +
Sbjct: 103 KERLFTLRMSEGTPIQSHLGEFNSIIIDLENLDVEIDDEDKAVLLIVSLPPSYRHFKEIM 162

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKG----KQRSSTNSEV 278
           +YG   +L  ++V+ +LL+K+K D Q+    +G+         + G     Q  S     
Sbjct: 163 LYGNNVTLSFDDVKSNLLSKEKFDTQIHSESSGEGLVVRGRNHEVGASGKNQSKSKGKNS 222

Query: 277 KSCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSVTD 98
           K C +CK+  H                               + V      E  + S T+
Sbjct: 223 KYCRYCKKRNH----EISECYKLKNKQNREDKGKQPEKSAEASFVETESDGECFIASGTE 278

Query: 97  SLVSNDDDWIVDSGCSKHISPKAETFSTYTS 5
               N  +W++DSGC+ H+SP  + F+TY S
Sbjct: 279 QRSKN--EWVLDSGCTFHMSPNRDWFTTYES 307



 Score = 52.0 bits (123), Expect(2) = 3e-54
 Identities = 26/40 (65%), Positives = 33/40 (82%)
 Frame = -2

Query: 926 MSTAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807
           MST  KFDI+KFDG I+F+ W+V+M A+LTQ+GLKK L G
Sbjct: 1   MST--KFDIEKFDGKISFAIWRVQMLAVLTQNGLKKVLSG 38


>gb|KYP58145.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 262

 Score =  184 bits (467), Expect = 4e-53
 Identities = 88/194 (45%), Positives = 132/194 (68%), Gaps = 7/194 (3%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P +M D  W+ELD+KAL+AIQLCL+ EVL +V NE TAA+LW +L+  Y+ KSLAN++ L
Sbjct: 21  PVNMTDEQWDELDEKALSAIQLCLSKEVLREVANETTAAALWLKLESLYMTKSLANKLRL 80

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           K+RL+ +RM EG P+ SH+ EF SII +L+ IE+KI+DED+A+LL+ SLPSTYK F++ +
Sbjct: 81  KERLYTIRMVEGTPIQSHLNEFNSIIMDLENIEIKIDDEDKAVLLIVSLPSTYKHFKEIM 140

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKG-------KQRSSTN 287
           +Y   ++L   +V+ +LL+K+K D  +     G+  S     ++KG       + +S   
Sbjct: 141 LYSNNDTLSFEDVKSNLLSKEKFDLDIHSEDKGEGLSVRGRTQEKGSTSNKKSRSKSRRR 200

Query: 286 SEVKSCSFCKRTGH 245
              K+C +CK+ GH
Sbjct: 201 KTNKTCRYCKKFGH 214


>gb|PHT55765.1| hypothetical protein CQW23_04251 [Capsicum baccatum]
          Length = 553

 Score =  172 bits (436), Expect(2) = 4e-49
 Identities = 83/182 (45%), Positives = 127/182 (69%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P+   +  WEE+D+KAL+ IQLCL+ EVL +VINEKTAA +WS+L+  Y+ KSLAN++ L
Sbjct: 43  PATTTEEQWEEMDEKALSIIQLCLSREVLREVINEKTAAGIWSKLESLYMTKSLANKLRL 102

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           K+RLF LRM EG P+ SH+ EF SII +L+ ++V+I+DED+A+LL+ SLP +Y+ F+  +
Sbjct: 103 KERLFTLRMSEGTPIQSHLGEFNSIIIDLENLDVEIDDEDKAVLLIVSLPPSYRHFKKIM 162

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKSCS 266
           +YG   +L  ++V+ +LL+K+K D Q+    +G+    L  K +  K  +S  ++ KS  
Sbjct: 163 LYGNNVTLSFDDVKSNLLSKEKFDTQIHSESSGE---GLVVKWRNHKVGASGKNQSKSRG 219

Query: 265 FC 260
            C
Sbjct: 220 LC 221



 Score = 52.4 bits (124), Expect(2) = 4e-49
 Identities = 23/37 (62%), Positives = 32/37 (86%)
 Frame = -2

Query: 917 AIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807
           ++KFDI+KFDG I+F+ W+V+M A+LTQ+GLKK L G
Sbjct: 2   SMKFDIEKFDGKISFAIWRVQMLAVLTQNGLKKVLSG 38


>gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense]
          Length = 2351

 Score =  173 bits (438), Expect(2) = 6e-47
 Identities = 100/274 (36%), Positives = 157/274 (57%), Gaps = 9/274 (3%)
 Frame = -3

Query: 805  PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
            PS + +   +++ ++A +AI LCL DEVL +V +EKTA+ LW RL+  Y+ KSL NR+ L
Sbjct: 547  PSTLSEEQKDDMLERAHSAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYL 606

Query: 625  KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
            KQRL+ L+M+EG PV+ H+ +F SII +L+ I+ KI+DEDQA+++LCSLP +Y++F D +
Sbjct: 607  KQRLYALKMEEGTPVSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTM 666

Query: 445  IYGGKESLKINEVREHLLNKDKIDKQLTGS--PNGDDSSALYAKEKKGKQRSSTNSEVKS 272
            +Y G++ L + EV+ + L+  ++ K++TG    N +    +     K K  SS+ S  +S
Sbjct: 667  MY-GRDDLTLEEVK-NALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRS 724

Query: 271  -------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVV 113
                   C +CK+ GH++                                AD   D E+V
Sbjct: 725  QSKKRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVAD--------ADSSSDAEIV 776

Query: 112  CSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11
             +V+DS       WI+D+G + HIS   + FSTY
Sbjct: 777  LAVSDSYAGG--RWILDTGATFHISTSKDAFSTY 808



 Score = 44.3 bits (103), Expect(2) = 6e-47
 Identities = 18/38 (47%), Positives = 28/38 (73%)
 Frame = -2

Query: 920 TAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807
           ++ K+D++KF G  +FS W+++M A+L Q GL KAL G
Sbjct: 505 SSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSG 542


>gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense]
          Length = 1841

 Score =  173 bits (438), Expect(2) = 6e-47
 Identities = 100/274 (36%), Positives = 157/274 (57%), Gaps = 9/274 (3%)
 Frame = -3

Query: 805  PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
            PS + +   +++ ++A +AI LCL DEVL +V +EKTA+ LW RL+  Y+ KSL NR+ L
Sbjct: 568  PSTLSEEQKDDMLERAHSAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYL 627

Query: 625  KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
            KQRL+ L+M+EG PV+ H+ +F SII +L+ I+ KI+DEDQA+++LCSLP +Y++F D +
Sbjct: 628  KQRLYALKMEEGTPVSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTM 687

Query: 445  IYGGKESLKINEVREHLLNKDKIDKQLTGS--PNGDDSSALYAKEKKGKQRSSTNSEVKS 272
            +Y G++ L + EV+ + L+  ++ K++TG    N +    +     K K  SS+ S  +S
Sbjct: 688  MY-GRDDLTLEEVK-NALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRS 745

Query: 271  -------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVV 113
                   C +CK+ GH++                                AD   D E+V
Sbjct: 746  QSKKRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVAD--------ADSSSDAEIV 797

Query: 112  CSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11
             +V+DS       WI+D+G + HIS   + FSTY
Sbjct: 798  LAVSDSYAGG--RWILDTGATFHISTSKDAFSTY 829



 Score = 44.3 bits (103), Expect(2) = 6e-47
 Identities = 18/38 (47%), Positives = 28/38 (73%)
 Frame = -2

Query: 920 TAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807
           ++ K+D++KF G  +FS W+++M A+L Q GL KAL G
Sbjct: 526 SSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSG 563


>gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 337

 Score =  159 bits (402), Expect = 1e-42
 Identities = 93/276 (33%), Positives = 154/276 (55%), Gaps = 8/276 (2%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           PS + D + ++L  KA + I L L DEVL +V  EK+AA +W +L+  Y+ KSL N++ L
Sbjct: 43  PSTLSDKEKKDLLSKAHSTIILSLGDEVLREVAEEKSAAGIWLKLESLYMTKSLTNKLYL 102

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           K+RL  L+M+EG+ +  H++ FT  + +L  ++V+I++EDQA++LLCSLPS++++  D +
Sbjct: 103 KKRLHQLKMEEGSSIKEHVSLFTKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTM 162

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAK--------EKKGKQRSST 290
           ++ G+++L + EV+  L +++   K       G D  AL A+        + K K+RS  
Sbjct: 163 LF-GRDTLTLEEVKATLNSRELKKKITENKGEGGDPEALMARGRLEKRDSKSKNKRRSKY 221

Query: 289 NSEVKSCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVC 110
            +E K+C +CK+ GH R                               VAD     EV  
Sbjct: 222 KNE-KACYYCKKEGHFRKECPERKKKNNGKYNDESDIAV---------VADGYESAEV-- 269

Query: 109 SVTDSLVSNDDDWIVDSGCSKHISPKAETFSTYTSV 2
            ++ S   + ++WI+DSGCS H++P  E FS+Y  +
Sbjct: 270 -LSISTKKHSEEWILDSGCSFHMTPNLEWFSSYKEI 304


>gb|KJB48829.1| hypothetical protein B456_008G089200 [Gossypium raimondii]
          Length = 164

 Score =  151 bits (381), Expect(2) = 7e-42
 Identities = 69/123 (56%), Positives = 95/123 (77%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P ++    WEELD KAL+ IQLCLA+ VL +V+ EKT+++LW RL+  Y  KSLANR++L
Sbjct: 42  PENLNKTKWEELDGKALSVIQLCLANTVLQEVLMEKTSSALWKRLETLYATKSLANRLVL 101

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           KQ LF  RM EG  +  HI++F +++N+L K+EV I+DEDQA+LLLCSLP +YKSF++ +
Sbjct: 102 KQHLFTFRMNEGEILRDHISQFITLLNDLKKVEVHIDDEDQAMLLLCSLPPSYKSFKEIL 161

Query: 445 IYG 437
           IYG
Sbjct: 162 IYG 164



 Score = 49.3 bits (116), Expect(2) = 7e-42
 Identities = 22/36 (61%), Positives = 27/36 (75%)
 Frame = -2

Query: 920 TAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKAL 813
           T  +F+I+KFDG  NF+ WQVRM AIL Q GLKK +
Sbjct: 2   TTTRFEIEKFDGETNFNLWQVRMMAILVQSGLKKVV 37


>gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao]
          Length = 277

 Score =  152 bits (385), Expect = 1e-40
 Identities = 79/194 (40%), Positives = 130/194 (67%), Gaps = 5/194 (2%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           PS++ D++ ++L +KA +AI L L+DEVL +V +E++AA++W +L+  Y+ KSL NR+ +
Sbjct: 46  PSNLSDSEKDDLMEKAHSAILLTLSDEVLREVTDEESAAAMWFKLESIYITKSLTNRLYM 105

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           KQRL+ L+M EG  VN+HI EF  +I +L  I+VKIEDED AL+LLC LP +Y++F D +
Sbjct: 106 KQRLYTLKMSEGTSVNTHIDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTM 165

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQ-----RSSTNSE 281
           +Y G+++L   +VR + LN  ++ K++ G  N + +  L     +GK+     +  + ++
Sbjct: 166 LY-GRDTLTFEDVRAY-LNSKELKKKVGGIRNENQAEGLVVNRGRGKEKGLDKKGKSRAK 223

Query: 280 VKSCSFCKRTGHVR 239
            K+C  C + GH R
Sbjct: 224 GKTCWNCGQKGHFR 237


>gb|KYP36635.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 364

 Score =  155 bits (391), Expect = 1e-40
 Identities = 95/271 (35%), Positives = 143/271 (52%), Gaps = 7/271 (2%)
 Frame = -3

Query: 802 SDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILK 623
           S  K  +  E ++KA + I L L+DEVL +V +E+TA+ LW +L+  Y+ KS+ N+++LK
Sbjct: 56  SASKIEELAEQEEKAHSLILLSLSDEVLYEVADEETASGLWCKLEKLYMTKSICNKLLLK 115

Query: 622 QRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAII 443
           +RLF L M+EG P+  H+ E  S++ EL  I+VKIEDED A++LL SLP +Y+SF +++ 
Sbjct: 116 RRLFGLHMKEGTPLKDHLDELNSVLMELRDIDVKIEDEDAAMILLASLPPSYESFVNSLS 175

Query: 442 YGGKESLKINEV------REHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSE 281
             GKE + + EV      RE  L      ++  GS     +S    K+KK K +  TN  
Sbjct: 176 V-GKECITMEEVKSSLHSREFRLRASGNSEESNGSSLVVSNSGKNMKKKKDKSKRKTNVN 234

Query: 280 VKS-CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSV 104
            K  C++CK  GH +                                 +   + E+V S+
Sbjct: 235 PKDICNYCKEPGHWKKDCPKKKGKPSAAVAK----------------EESTSENELVLSI 278

Query: 103 TDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11
            D    ++D WI+DSGCS H+ P    F TY
Sbjct: 279 ADQPQHSEDQWILDSGCSFHMCPNRTWFDTY 309


>gb|KYP37021.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 352

 Score =  152 bits (385), Expect = 6e-40
 Identities = 94/271 (34%), Positives = 142/271 (52%), Gaps = 7/271 (2%)
 Frame = -3

Query: 802 SDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILK 623
           S  K  +  E ++KA + I L L+DEVL +V +E+TA+ LW +L+  Y+ KS+ N+++LK
Sbjct: 56  SASKIEELAEQEEKAHSLILLSLSDEVLYEVADEETASGLWCKLEKLYMTKSICNKLLLK 115

Query: 622 QRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAII 443
           +RLF L M+EG P+  H+ E  S++ EL  I+VKIEDED A++LL  LP +Y+SF +++ 
Sbjct: 116 RRLFGLHMKEGTPLKDHLDELNSVLMELRDIDVKIEDEDAAMILLAYLPPSYESFVNSLS 175

Query: 442 YGGKESLKINEV------REHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSE 281
             GKE + + EV      RE  L      ++  GS     +S    K+KK K +  TN  
Sbjct: 176 V-GKECITMEEVKSSLHSREFRLRASGNSEESNGSSLVVSNSGKNMKKKKDKSKRKTNVN 234

Query: 280 VKS-CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSV 104
            K  C++CK  GH +                                 +   + E+V S+
Sbjct: 235 PKDICNYCKEPGHWKKDCPKKKGKPSAAVAK----------------EESTSENELVLSI 278

Query: 103 TDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11
            D    ++D WI+DSGCS H+ P    F TY
Sbjct: 279 ADQPQHSEDQWILDSGCSFHMCPNRTWFDTY 309


>gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [Theobroma cacao]
          Length = 1029

 Score =  147 bits (371), Expect(2) = 2e-39
 Identities = 78/194 (40%), Positives = 126/194 (64%), Gaps = 5/194 (2%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           PS++ D + ++L  KA + I L L+DEVL +V +E++AA++W +L+  Y+ KSL NR+ +
Sbjct: 168 PSNLSDGEKDDLMKKAHSVILLALSDEVLREVTDEESAAAVWFKLESIYMTKSLTNRLYM 227

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           KQRL+ L+M EG  VN+HI EF  +I +L  I+VKIEDED AL+LLC LP +Y++F D +
Sbjct: 228 KQRLYTLKMSEGTSVNTHIDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTM 287

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQ-----RSSTNSE 281
           +Y G+++L   +VR   LN  ++ K++ G  N + +  L     +GK+     +  + ++
Sbjct: 288 LY-GRDTLTFEDVRAS-LNFKELKKKVGGIRNENQAEGLVVNRGRGKEKGLDRKGKSRAK 345

Query: 280 VKSCSFCKRTGHVR 239
            K+C  C + GH R
Sbjct: 346 GKTCWNCGQKGHFR 359



 Score = 45.1 bits (105), Expect(2) = 2e-39
 Identities = 22/43 (51%), Positives = 33/43 (76%), Gaps = 1/43 (2%)
 Frame = -2

Query: 932 IAMSTA-IKFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807
           +AM+T+  K++I+KF+G  +FS W+V+M A+L Q GL KAL G
Sbjct: 121 LAMATSSTKYEIEKFNGRNDFSLWRVKMRALLVQQGLLKALKG 163


>gb|ABO36622.1| copia LTR rider [Solanum lycopersicum]
 gb|ABO36636.1| copia LTR rider [Solanum lycopersicum]
          Length = 1307

 Score =  155 bits (391), Expect = 3e-38
 Identities = 89/265 (33%), Positives = 142/265 (53%), Gaps = 8/265 (3%)
 Frame = -3

Query: 772 LDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILKQRLFLLRMQE 593
           L++KA + I LCLAD+V+ +V +E+TAA LW +L+  Y+ KSL N+++LKQRLF LRM E
Sbjct: 52  LEEKAHSTIMLCLADDVITEVSDEETAAGLWLKLESLYMTKSLTNKLLLKQRLFGLRMAE 111

Query: 592 GAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAIIYGGKESLKIN 413
           G  +  H+ +  +++ EL  I+VKIEDED AL+LL SLP ++++F  + I  GK+++ + 
Sbjct: 112 GTQLREHLEQLNTLLLELRNIDVKIEDEDAALILLVSLPMSFENFVQSFIV-GKDTVSLE 170

Query: 412 EVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKS--------CSFCK 257
           EVR  L +++ +  +  G+      S L+   +KG++     ++  S        C++CK
Sbjct: 171 EVRSALHSRE-LRHKANGTSTDIQPSGLFTSSRKGRKNGGKKNKPMSKGAKPDDVCNYCK 229

Query: 256 RTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEVVCSVTDSLVSNDD 77
             GH +                                 +    EE +  V D    + D
Sbjct: 230 EKGHWKFDCPKKKKQSEKQSVSAAV------------AEEDTNSEEDIALVADEHTHHSD 277

Query: 76  DWIVDSGCSKHISPKAETFSTYTSV 2
            W++DSG S HI P+ E F+TY  V
Sbjct: 278 VWVLDSGASYHICPRREWFTTYEQV 302


>gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 780

 Score =  148 bits (374), Expect(2) = 4e-38
 Identities = 88/275 (32%), Positives = 144/275 (52%), Gaps = 12/275 (4%)
 Frame = -3

Query: 802 SDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIILK 623
           S +   +   + +KA +AI LCL D+ L +V  EKTAA++W +L+  Y+ KSLA+R+ LK
Sbjct: 45  SSLTQKEKTNMIEKARSAIILCLGDKALREVAREKTAAAMWLKLESLYMTKSLAHRLCLK 104

Query: 622 QRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAII 443
           QRL+  +M E   +   +AEF  I+++L+ IEV++EDED+ALLLL SLP  Y+ F+DAI+
Sbjct: 105 QRLYSFKMTETKSIVDQLAEFNKILDDLENIEVQLEDEDKALLLLNSLPRNYEHFKDAIL 164

Query: 442 YGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAK---EKKGKQRSSTNSEVKS 272
           YG ++ + ++EV+  +  K+ + +Q     + +  S   ++   EKKG+ +    +  KS
Sbjct: 165 YGKEQDITLDEVQTSIRTKE-LQRQQDNKTDDNGESLNVSRGRSEKKGQSQKGKKARSKS 223

Query: 271 ---------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEE 119
                    C +C + GH +                               ++D     +
Sbjct: 224 KIGDRSKFKCFYCHKVGHFK----------KNCPERNRDQKSSADSADIAAISDGYESAD 273

Query: 118 VVCSVTDSLVSNDDDWIVDSGCSKHISPKAETFST 14
           V+   T        DW++DSGCS H+ PK + F T
Sbjct: 274 VLVVTTS---QTQKDWVMDSGCSYHMCPKKDYFET 305



 Score = 39.3 bits (90), Expect(2) = 4e-38
 Identities = 16/35 (45%), Positives = 24/35 (68%)
 Frame = -2

Query: 911 KFDIQKFDGVINFSRWQVRMNAILTQHGLKKALLG 807
           K+DI+KF G  +F  W+++M AIL Q G  +A+ G
Sbjct: 5   KYDIEKFSGENDFGLWRIKMEAILIQQGCAEAIKG 39


>gb|PNX71218.1| putative retrotransposon Ty1-copia subclass protein [Trifolium
           pratense]
 gb|PNY02521.1| putative retrotransposon Ty1-copia subclass protein [Trifolium
           pratense]
          Length = 257

 Score =  141 bits (355), Expect(2) = 3e-37
 Identities = 70/195 (35%), Positives = 114/195 (58%), Gaps = 8/195 (4%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P+DM D DW EL +KA   I+LC++DEV+  +++  +   +  +L+  Y+ K+  NR+  
Sbjct: 45  PTDMADDDWLELQEKAAGLIRLCVSDEVMYHILDLTSPKEVLDKLESQYISKTRMNRLFT 104

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           K RL+ L+M+EG+ +  H+  F +II EL K+ VKI+DED A++LLCSLPS+YK   + +
Sbjct: 105 KMRLYSLKMREGSDLQQHVNTFNNIITELVKLGVKIDDEDSAIMLLCSLPSSYKHLVNTL 164

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNG--------DDSSALYAKEKKGKQRSST 290
           IY GK+++ +N +   LL+  ++ + +     G         D   +  K   GK   S 
Sbjct: 165 IY-GKDTISLNVITATLLSHSRMSQNVEVGTQGKGLYVKGSQDHGQIKGKADSGKMSKSK 223

Query: 289 NSEVKSCSFCKRTGH 245
           N ++  C  CK+ GH
Sbjct: 224 NRKIAECYSCKQIGH 238



 Score = 43.9 bits (102), Expect(2) = 3e-37
 Identities = 17/40 (42%), Positives = 29/40 (72%)
 Frame = -2

Query: 932 IAMSTAIKFDIQKFDGVINFSRWQVRMNAILTQHGLKKAL 813
           +A+ + +KF++++FDG  NF  W+ R+  +L Q GL+KAL
Sbjct: 1   MAIDSGVKFEVERFDGTGNFRLWERRVKDLLAQQGLQKAL 40


>dbj|GAU44261.1| hypothetical protein TSUD_400070 [Trifolium subterraneum]
          Length = 635

 Score =  149 bits (376), Expect = 8e-37
 Identities = 96/279 (34%), Positives = 145/279 (51%), Gaps = 15/279 (5%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P+++ D +  E+DDKAL+AI LCLAD+VL +V  EK+AA++W++L   Y+ KSLA++  L
Sbjct: 185 PTNLSDTEKAEMDDKALSAIILCLADKVLREVAKEKSAAAMWAKLDKLYMTKSLAHKQCL 244

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           KQ+L+  RM E   V+  ++EF  II++L  I+VKIEDEDQA  LLC+LP + +   DA+
Sbjct: 245 KQQLYFFRMVENKSVSEQLSEFNKIIDDLANIDVKIEDEDQAFHLLCALPKSLEHLNDAL 304

Query: 445 IYGGKESLKINEVREHL-------LNKDKIDKQLTG----SPNGDDSSALYAKEKKGKQR 299
           IYG + ++ ++EV+  L       LN+ KID    G        D+      K+ + K R
Sbjct: 305 IYGKEGTITLDEVQAALRTKELIKLNELKIDDSGEGLNVTRGRSDNRGKGKGKKHRSKSR 364

Query: 298 SSTNSEVK-SCSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDE 122
           +  +   K  C  C   GH +                             +     V +E
Sbjct: 365 AKGDGGSKFKCYHCHEPGHFK------------------KDCPQRRGSDSSSAQIAVSEE 406

Query: 121 EVVCSVTDSLVSN---DDDWIVDSGCSKHISPKAETFST 14
           E   S     V++   +  W++DSGCS HI P  + F T
Sbjct: 407 EGYESAGALTVTSWEPEKSWVMDSGCSYHICPSKKYFET 445


>gb|OMO83367.1| Integrase, catalytic core [Corchorus capsularis]
          Length = 785

 Score =  149 bits (377), Expect = 1e-36
 Identities = 87/275 (31%), Positives = 144/275 (52%), Gaps = 10/275 (3%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P    D + +E++ KA +AI L L++EVL +V+ EK  ASLW  L D Y+KKSLANR+  
Sbjct: 20  PEKSTDKEIKEINSKAHSAILLSLSNEVLREVVAEKDTASLWKALDDKYMKKSLANRLFQ 79

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           KQRL+  +M E  P+  H+  F  II +L  + VKIEDED AL+LL SLP ++++FRD +
Sbjct: 80  KQRLYTFKMVENTPIKDHLDSFNRIILDLGGVRVKIEDEDLALILLFSLPRSFQNFRDTM 139

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAKEKKGKQRSSTNSEVKS-- 272
           +Y G++++ + +V++ LL+K+  +K    S + D  + L     + K++SS  +  +S  
Sbjct: 140 LY-GRDTIALKDVKDALLSKELQNKV---SADVDGEAGLIVTRGRNKEKSSGTTRFRSRS 195

Query: 271 --------CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEV 116
                   C +C   GH+R                             + + +   DE  
Sbjct: 196 KSRVSRLRCFYCNEKGHLRKDCPDRKKGNSSEKMESNVKAMVAIVQEGSSLVETSDDEVG 255

Query: 115 VCSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11
              +T S   + + W++D+  S H++     F+T+
Sbjct: 256 TDVLTVSTTGSANTWVLDTSASYHMTFSRNLFTTF 290


>gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 325

 Score =  142 bits (358), Expect = 3e-36
 Identities = 91/275 (33%), Positives = 143/275 (52%), Gaps = 10/275 (3%)
 Frame = -3

Query: 805 PSDMKDADWEELDDKALTAIQLCLADEVLDQVINEKTAASLWSRLQDHYLKKSLANRIIL 626
           P+ M D   + + +KA + I L L DEVL +V  E TAA +W  L+D + KKSL NR+  
Sbjct: 44  PTTMTDDVKKAMLEKAHSLILLSLTDEVLREVGEETTAAGMWKMLEDKFQKKSLTNRLYQ 103

Query: 625 KQRLFLLRMQEGAPVNSHIAEFTSIINELDKIEVKIEDEDQALLLLCSLPSTYKSFRDAI 446
           KQRL+ L+M E   V  H+  F  II +L  I VK++DED A++LLCSLP +Y++F D +
Sbjct: 104 KQRLYTLQMSENMSVRDHLDNFNRIILDLQSIGVKVDDEDLAIILLCSLPKSYENFIDTM 163

Query: 445 IYGGKESLKINEVREHLLNKDKIDKQLTGSPNGDDSSALYAK--------EKKGKQRSST 290
           +Y G++S+ +N V++ L +K K+ +++  S N DD     ++          KG  RS++
Sbjct: 164 LY-GRDSITLNNVKDSLQSK-KLKRRVVSSSNVDDVGLTVSRGRSMERGNSSKGHTRSNS 221

Query: 289 NSEVKS--CSFCKRTGHVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXETHVADLVGDEEV 116
            S+ K   C  CK  GH+R                             +   D  GD   
Sbjct: 222 LSKSKKVRCYKCKEVGHIRKNCPQLKKNRNSNASAAVVRSSATVSSESSDEGD-GGD--- 277

Query: 115 VCSVTDSLVSNDDDWIVDSGCSKHISPKAETFSTY 11
              +T S +   D W++D+G S H++   + F+++
Sbjct: 278 --VLTVSTIGFADTWVIDTGASYHMTFNRKLFNSF 310


Top