BLASTX nr result

ID: Chrysanthemum22_contig00004087 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00004087
         (1372 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OTG30017.1| putative zinc finger, CCHC-type [Helianthus annuus]    431   e-135
gb|PNX96445.1| copia LTR rider [Trifolium pratense]                   375   e-114
gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo...   362   e-114
gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygro...   369   e-112
emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]   343   e-103
gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposo...   327   e-101
gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar...   330   6e-97
gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar...   330   6e-97
emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]        326   1e-96
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]   314   1e-94
emb|CAN70013.1| hypothetical protein VITISV_017116 [Vitis vinifera]   303   6e-90
gb|KYP65226.1| Retrovirus-related Pol polyprotein from transposo...   296   1e-89
dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri]      286   1e-86
emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera]   289   6e-85
dbj|GAU51472.1| hypothetical protein TSUD_95870 [Trifolium subte...   293   4e-84
gb|OMO83367.1| Integrase, catalytic core [Corchorus capsularis]       283   1e-83
gb|ABO36622.1| copia LTR rider [Solanum lycopersicum] >gi|133711...   282   1e-80
gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotranspo...   263   2e-80
gb|PKU72844.1| Retrovirus-related Pol polyprotein from transposo...   275   2e-79
gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo...   256   2e-78

>gb|OTG30017.1| putative zinc finger, CCHC-type [Helianthus annuus]
          Length = 1308

 Score =  431 bits (1108), Expect = e-135
 Identities = 228/463 (49%), Positives = 305/463 (65%), Gaps = 7/463 (1%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALE---VLPGDMDTATKGELNKKAH 174
            M   KF++EKFDG  DFGLWR+KMRALL+  G   AL     LP  +    K ++ +KAH
Sbjct: 1    MVSTKFELEKFDGKNDFGLWRVKMRALLVHQGIVDALAGEAKLPAGLTDKEKKDILEKAH 60

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA+ILSLG++VLREV+ ET+AAG+W KLE+LYMTKSLAN         TF + +G+++ +
Sbjct: 61   SAIILSLGDRVLREVSKETSAAGIWAKLESLYMTKSLANRLYLKKRLYTFQLASGKSLED 120

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
            H DEFNK++LDL NI+V  +DED            +EHFVDTL+YGR++L++E+V+A LN
Sbjct: 121  HTDEFNKVILDLENIDVSIDDEDKAIIFLASLPQTFEHFVDTLMYGRDSLSMEEVLAALN 180

Query: 535  SKEIKERSKAKGDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLIRNC 714
            SKE+K+RS AK + GEGL VRGR ++K+                 KCYIC SE+H  R+C
Sbjct: 181  SKELKKRSDAKEEIGEGLVVRGRPEQKSFKGKNTPRSKSKFKR--KCYICNSEKHFKRDC 238

Query: 715  PKNNRKKSNGFVKKDDQ---PSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTPRLD 885
            P   +KK      K      P SS   Y+ ++V++V   +   +WI+DSGGSYHMTP  +
Sbjct: 239  PDRFKKKKYDSGSKSQHGGSPDSSNDGYESADVLVVSKGNQDDNWILDSGGSYHMTPHRE 298

Query: 886  LLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTL 1065
               D    D G V LGD+R C+++G G V I+L++G+   L NVR+IPEL RN+ISLG  
Sbjct: 299  YFQDIEMQDMGTVKLGDDRTCRVQGQGTVVIKLENGTELKLVNVRFIPELTRNIISLGIF 358

Query: 1066 EKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEE-KDSLAQVW 1242
            EKEG ++ +++GK K+I GSMVI +GTRR N +Y LDG   +G VN SVE  K S A +W
Sbjct: 359  EKEGCSVSLKNGKAKIIKGSMVIFTGTRRGNNIYMLDGKVSQG-VNCSVERPKISDAVLW 417

Query: 1243 HKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            H+RLGHIS+ GL  L KQE+ G     +  FCE+C+LGKSHRV
Sbjct: 418  HRRLGHISDQGLNELKKQEVLGNFDGREAGFCEHCILGKSHRV 460


>gb|PNX96445.1| copia LTR rider [Trifolium pratense]
          Length = 1318

 Score =  375 bits (962), Expect = e-114
 Identities = 201/467 (43%), Positives = 285/467 (61%), Gaps = 11/467 (2%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALE---VLPGDMDTATKGELNKKAH 174
            M   K++IEKF G  DFGLWR+KM+ALL+Q GC  AL+    +  ++  A K  + +KAH
Sbjct: 1    MPSTKYEIEKFTGVNDFGLWRLKMKALLVQQGCLEALKGEAAMNAELTAAEKTNMIEKAH 60

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA++LSLG+KVLR+V+ ETTA+G+W KLE+LYMTKSL N         +F M   + ++E
Sbjct: 61   SAILLSLGDKVLRQVSKETTASGLWAKLESLYMTKSLVNRLYLKQALYSFKMVEDKVLAE 120

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
             +D FNK++LDL NI+VK +DED            + HF +TLLYGRE+LT E+V + L 
Sbjct: 121  QLDMFNKLILDLENIDVKIDDEDQALLLLCALPRSHAHFKETLLYGRESLTFEEVQSALY 180

Query: 535  SKEIKERSKAKGDD-GEGLFVRGRTDRKN----SHQXXXXXXXXXXXXXLKCYICQSEEH 699
            SK++ ER + K    GEGL V+G+  RKN                    ++CY C+ E H
Sbjct: 181  SKDLNERKEHKPSTVGEGLAVKGKFLRKNGKFDKKGKSQSKSYSDEVSGIRCYHCKKEGH 240

Query: 700  LIRNCP---KNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHM 870
              + CP   K++    N  + +DD        ++ S+V++V S+D+  +WIMDSG ++HM
Sbjct: 241  TRKVCPERLKDHGGNGNAAIVQDD--------FESSDVLVVSSSDSRKEWIMDSGCTWHM 292

Query: 871  TPRLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLI 1050
            TP  DL  +  + DGG VLLG+N+ CKI G+G VR +L D S  +L  VRY+P+LKRNL+
Sbjct: 293  TPNKDLFEELCDQDGGSVLLGNNKACKIAGVGSVRFKLHDESIRLLTEVRYVPDLKRNLL 352

Query: 1051 SLGTLEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEEKDSL 1230
            SLG  +K+GY  + +   ++V+ GS  +L G ++   +Y+L+   V G  N    +  S 
Sbjct: 353  SLGEFDKKGYVFQGEKSILRVMKGSKEVLRGVKKQG-LYTLEAEVVSGSTNVVSTKPLSK 411

Query: 1231 AQVWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
             ++WH RLGH+SE GL  L KQ L G   + KL FCE CV GKS RV
Sbjct: 412  TEIWHMRLGHVSERGLVELGKQNLLGGDKIEKLKFCEPCVFGKSCRV 458


>gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 780

 Score =  362 bits (929), Expect = e-114
 Identities = 199/466 (42%), Positives = 279/466 (59%), Gaps = 10/466 (2%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAAL---EVLPGDMDTATKGELNKKAH 174
            M   K+DIEKF G  DFGLWRIKM A+LIQ GC  A+   E +   +    K  + +KA 
Sbjct: 1    MGNTKYDIEKFSGENDFGLWRIKMEAILIQQGCAEAIKGEEKMSSSLTQKEKTNMIEKAR 60

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA+IL LG+K LREV  E TAA +W KLE+LYMTKSLA+         +F M   ++I +
Sbjct: 61   SAIILCLGDKALREVAREKTAAAMWLKLESLYMTKSLAHRLCLKQRLYSFKMTETKSIVD 120

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREA-LTLEDVMATL 531
             + EFNKI+ DL NIEV+ EDED            YEHF D +LYG+E  +TL++V  ++
Sbjct: 121  QLAEFNKILDDLENIEVQLEDEDKALLLLNSLPRNYEHFKDAILYGKEQDITLDEVQTSI 180

Query: 532  NSKEIKERSKAKGDD-GEGLFV-RGRTDRKNSHQXXXXXXXXXXXXX---LKCYICQSEE 696
             +KE++ +   K DD GE L V RGR+++K   Q                 KC+ C    
Sbjct: 181  RTKELQRQQDNKTDDNGESLNVSRGRSEKKGQSQKGKKARSKSKIGDRSKFKCFYCHKVG 240

Query: 697  HLIRNCPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTP 876
            H  +NCP+ NR + +     D    S G  Y+ ++V++V ++    DW+MDSG SYHM P
Sbjct: 241  HFKKNCPERNRDQKSSADSADIAAISDG--YESADVLVVTTSQTQKDWVMDSGCSYHMCP 298

Query: 877  RLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISL 1056
            + D        +GG VLLGD+  C+++GIG VR+++ D   ++L +VRY+P+LKRNLIS+
Sbjct: 299  KKDYFETLKLKEGGTVLLGDDHPCQVQGIGTVRLKMFDNREYILKDVRYVPDLKRNLISI 358

Query: 1057 GTLEKEGYTIKMQSGKIKVINGSMVILSGTR-RDNCVYSLDGHAVEGEVNASVEEKDSLA 1233
               +  GY  K Q G +K++NGS+VI  G + ++N ++ LDG  V    + +  + D   
Sbjct: 359  SMFDSLGYATKTQHGVLKILNGSLVIAKGNKDKNNGLFVLDGSTVMAHASIARNDIDK-T 417

Query: 1234 QVWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            ++WH RLGH+SE GL  L+KQ L     L KL+FCE+CVLGKSHRV
Sbjct: 418  KLWHLRLGHVSERGLIELEKQNLLKGDKLDKLEFCEHCVLGKSHRV 463


>gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygrometricum]
          Length = 1309

 Score =  369 bits (947), Expect = e-112
 Identities = 199/465 (42%), Positives = 289/465 (62%), Gaps = 9/465 (1%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALEVLPGDMDTATKG---ELNKKAH 174
            M+  KFD+EKF G+ DF LWRIKM+ALL+  G   AL   P D DT  K    E + KA 
Sbjct: 1    MSTTKFDLEKFTGSNDFSLWRIKMKALLVHTGLGGALNPEPQD-DTIDKKKIVETDSKAF 59

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA++L LG++VLREV  E +A  +WNKLE+LY+ +SLAN         T ++  G+ + +
Sbjct: 60   SAILLCLGDEVLREVAEEVSALSLWNKLESLYLKRSLANRLYLKKSLYTIHLEEGKDLKK 119

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
            H+DEFNKI+LDL N+++K  DED            YEHFVDT+LYG+E LT+ +V + LN
Sbjct: 120  HMDEFNKIILDLKNVDIKITDEDCAILMLSSLPRSYEHFVDTMLYGKETLTMAEVKSALN 179

Query: 535  SKEIKERSKAKGDD-GEGLFVRGRTDRKNS--HQXXXXXXXXXXXXXLKCYICQSEEHLI 705
            SKE+ ++++ K +  GEGL VRGRT ++ S   +             LKC++C  E H  
Sbjct: 180  SKELHKKNETKMESTGEGLNVRGRTYKRESRNEKGGKHRSQSRTRGKLKCFVCHKEGHFK 239

Query: 706  RNCPKNNRKKSNGFVKKDDQPSSSGSV---YDDSEVMMVMSADALLDWIMDSGGSYHMTP 876
            ++CP  +R+  N   +KD  P  +  V   Y+ +EV++V   +    W+MDSG S+HM P
Sbjct: 240  KDCP--DRRARNPERRKD--PGDAAVVSDGYESAEVLVVSRTNKQDCWVMDSGCSFHMCP 295

Query: 877  RLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISL 1056
                  + +E + G VLLG+NRECK+ GIG V +++ DG    +  VRY+P+L+RNL+S+
Sbjct: 296  IKSWFQNLVEEESGHVLLGNNRECKVMGIGSVLLKMHDGCVRTITEVRYVPDLRRNLLSI 355

Query: 1057 GTLEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEEKDSLAQ 1236
            G L+ +G+ +K++ G +KVI GS+ ++ G+ +DN +Y L+   V G  NA+V   +  A+
Sbjct: 356  GMLDSKGFNVKIEGGTMKVIKGSLTVMRGS-QDNGLYILEASTVTGSSNAAVGGANK-AR 413

Query: 1237 VWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            +WH RLGH+SE GL  L KQ L G+  +  L FC+ CVLGK  RV
Sbjct: 414  LWHLRLGHVSEKGLVELSKQNLLGRDKVDDLSFCDECVLGKCSRV 458


>emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]
          Length = 1208

 Score =  343 bits (881), Expect = e-103
 Identities = 195/459 (42%), Positives = 269/459 (58%), Gaps = 3/459 (0%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAAL---EVLPGDMDTATKGELNKKAH 174
            M  AKFD+EKF G  DFGL R+KMRALL+Q G + AL   + LP  M    K EL +KAH
Sbjct: 1    MGTAKFDVEKFTGKNDFGLXRLKMRALLVQQGLQDALLGEKNLPSTMQEKQKIELLEKAH 60

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA+ILSLG+ VLRE     +AA VW KLE+LYMTKSLAN         TF M  G +I  
Sbjct: 61   SAIILSLGDTVLREXAKAKSAAEVWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEX 120

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
            H+D FNKI+LDL NI++   DED            Y +  D ++YGR++LT ++V + L+
Sbjct: 121  HLDHFNKIILDLENIDITISDEDKAILLLTSLDASYTNMKDAIMYGRDSLTFDEVQSILH 180

Query: 535  SKEIKERSKAKGDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLIRNC 714
            ++E++++ ++K + GEGL +RGR++++                  KC+IC  E H  ++C
Sbjct: 181  ARELQKQEESKEESGEGLNIRGRSEKREKKGKNSKSRSKSKTKKFKCFICHKEGHFKKDC 240

Query: 715  PKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTPRLDLLF 894
            P    ++ N   K  ++ +   S Y        +   AL   ++   G          L 
Sbjct: 241  PD---RRQNTVKKTVNRWTRVRSGY--------LIQGALFTCVLSKLG----------LK 279

Query: 895  DFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTLEKE 1074
             F E DGG VLLG+N+ CKI G G VRI+  DG   VL +VRYIPELKRNLISLG L+K 
Sbjct: 280  TFKEADGGYVLLGNNKHCKILGTGTVRIKHYDGIERVLEDVRYIPELKRNLISLGMLDKS 339

Query: 1075 GYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEEKDSLAQVWHKRL 1254
            GYT K +   ++V  GS+ ++ GT + N +Y+L G  V G+V+  ++E     ++WH+RL
Sbjct: 340  GYTFKSEPNSLRVARGSLTVMKGTIK-NGLYTLIGQTVTGKVSTVLKEDVGTTKLWHQRL 398

Query: 1255 GHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            GHIS  GLQ L+KQ + G   L  L FCE+CV GK+ RV
Sbjct: 399  GHISHRGLQELEKQGVLGNYKLTDLPFCEHCVFGKATRV 437


>gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 690

 Score =  327 bits (839), Expect = e-101
 Identities = 193/464 (41%), Positives = 270/464 (58%), Gaps = 9/464 (1%)
 Frame = +1

Query: 7    TGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALE---VLPGDMDTATKGELNKKAHS 177
            T  KFDIEKFDG   F +W+++M+A+L Q+G + AL+     P +M      EL++KA S
Sbjct: 3    TVTKFDIEKFDGKICFSIWKVQMKAVLTQNGLKKALDGKAKKPVNMTDEQWDELDEKALS 62

Query: 178  AMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISEH 357
            A+ L L  +VLREV  ETTAA +W KLE+LYMTKSLAN         T  M  G  I  H
Sbjct: 63   AIQLCLSKEVLREVANETTAAALWLKLESLYMTKSLANKLRLKERLYTIRMVEGTPIQSH 122

Query: 358  IDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLY-GREALTLEDVMATLN 534
            ++EFN I++DL NIE+K +DED            Y+HF + +LY   + L+ EDV + L 
Sbjct: 123  LNEFNSIIMDLENIEIKIDDEDKAVLLIVSLPSTYKHFKEIMLYSNNDTLSFEDVKSNLL 182

Query: 535  SKEIKERSKAKGDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLK-CYICQSEEHLIRN 711
            SKE  +      D GEGL VRGRT  K S                K C  C+   H I +
Sbjct: 183  SKEKFDLDIHSEDKGEGLSVRGRTQEKGSTSNKKSRSKSRGRKSNKTCRYCKKFGHDISD 242

Query: 712  CPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSAD--ALLDWIMDSGGSYHMTPRLD 885
            C    +K+      K+   +++     D +VM+ +S+D  +  +WI+DSG ++HM P  D
Sbjct: 243  CFILKKKQERQEKGKNPAEAANVETDSDGDVMISVSSDKRSKTEWILDSGCTFHMCPYKD 302

Query: 886  LLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTL 1065
            L       D G VL+G++ +CKI GIG ++I+  DG+   L NVR+IP+LKRNLISLGTL
Sbjct: 303  LFTTLEPVDSGVVLMGNDTQCKIAGIGTIQIKTHDGTIKTLSNVRFIPDLKRNLISLGTL 362

Query: 1066 EKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGE--VNASVEEKDSLAQV 1239
            E  G     + G +KV  G++V+L   R  + +Y L G  V G   V++S+ +KD+  ++
Sbjct: 363  ESLGCKYSAEGGVLKVSKGAIVLLKANRIGS-LYILQGSIVTGSAAVSSSMSDKDA-TKL 420

Query: 1240 WHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            WH RLGH+SE G+ +L KQ L G + +GKL+FCE+CV GK  RV
Sbjct: 421  WHMRLGHMSEKGMHLLSKQGLLGNQGIGKLEFCEHCVFGKQKRV 464


>gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense]
          Length = 2351

 Score =  330 bits (845), Expect = 6e-97
 Identities = 180/465 (38%), Positives = 275/465 (59%), Gaps = 9/465 (1%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAAL---EVLPGDMDTATKGELNKKAH 174
            ++  K+D+EKF G   F LWRIKMRA+L+Q G   AL   + LP  +    K ++ ++AH
Sbjct: 504  VSSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERAH 563

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA++L LG++VLREV  E TA+G+W +LE+ YMTKSL N            M  G  +S+
Sbjct: 564  SAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVSQ 623

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
            H+D+FN I++DL NI+ K +DED            YE+FVDT++YGR+ LTLE+V   L+
Sbjct: 624  HLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALS 683

Query: 535  SKEIKERSKAK---GDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLI 705
            S E++++   K    ++GEGL  RGR+  K                 ++CY C+   H+ 
Sbjct: 684  SSELRKKITGKVVENNEGEGLVARGRSKAKGG-SSSKSHPRSQSKKRIQCYYCKKYGHMK 742

Query: 706  RNCPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMS-ADALLDWIMDSGGSYHMTPRL 882
             +CPK   K  +   + D    +      D+E+++ +S + A   WI+D+G ++H++   
Sbjct: 743  VDCPKRKEKSESQEQQNDRANVADADSSSDAEIVLAVSDSYAGGRWILDTGATFHISTSK 802

Query: 883  DLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGT 1062
            D  F   E   G VL+G++  C++ GIG VRI++ DG    L +VR+IPE+K+NLISL T
Sbjct: 803  D-AFSTYEKHSGSVLMGNDHACQVMGIGTVRIKMFDGIVRTLTDVRHIPEMKKNLISLST 861

Query: 1063 LEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEG--EVNASVEEKDSLAQ 1236
            L+K+G+    + G +KV +G++ ++ G   +  +Y LDG +V G   V++S +      +
Sbjct: 862  LDKKGFRYSAEGGVLKVFSGALTVIRG-NLERGLYFLDGSSVTGVAGVSSSDDLDSDTTK 920

Query: 1237 VWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            +WH RLGH+SE GL VL K+ L   +  GKL+FCE+CV GK  RV
Sbjct: 921  LWHMRLGHMSERGLSVLSKRGLLSGQCTGKLNFCEHCVFGKQTRV 965


>gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense]
          Length = 1841

 Score =  330 bits (845), Expect = 6e-97
 Identities = 180/465 (38%), Positives = 275/465 (59%), Gaps = 9/465 (1%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAAL---EVLPGDMDTATKGELNKKAH 174
            ++  K+D+EKF G   F LWRIKMRA+L+Q G   AL   + LP  +    K ++ ++AH
Sbjct: 525  VSSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERAH 584

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA++L LG++VLREV  E TA+G+W +LE+ YMTKSL N            M  G  +S+
Sbjct: 585  SAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVSQ 644

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
            H+D+FN I++DL NI+ K +DED            YE+FVDT++YGR+ LTLE+V   L+
Sbjct: 645  HLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALS 704

Query: 535  SKEIKERSKAK---GDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLI 705
            S E++++   K    ++GEGL  RGR+  K                 ++CY C+   H+ 
Sbjct: 705  SSELRKKITGKVVENNEGEGLVARGRSKAKGG-SSSKSHPRSQSKKRIQCYYCKKYGHMK 763

Query: 706  RNCPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMS-ADALLDWIMDSGGSYHMTPRL 882
             +CPK   K  +   + D    +      D+E+++ +S + A   WI+D+G ++H++   
Sbjct: 764  VDCPKRKEKSESQEQQNDRANVADADSSSDAEIVLAVSDSYAGGRWILDTGATFHISTSK 823

Query: 883  DLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGT 1062
            D  F   E   G VL+G++  C++ GIG VRI++ DG    L +VR+IPE+K+NLISL T
Sbjct: 824  D-AFSTYEKHSGSVLMGNDHACQVMGIGTVRIKMFDGIVRTLTDVRHIPEMKKNLISLST 882

Query: 1063 LEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEG--EVNASVEEKDSLAQ 1236
            L+K+G+    + G +KV +G++ ++ G   +  +Y LDG +V G   V++S +      +
Sbjct: 883  LDKKGFRYSAEGGVLKVFSGALTVIRG-NLERGLYFLDGSSVTGVAGVSSSDDLDSDTTK 941

Query: 1237 VWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            +WH RLGH+SE GL VL K+ L   +  GKL+FCE+CV GK  RV
Sbjct: 942  LWHMRLGHMSERGLSVLSKRGLLSGQCTGKLNFCEHCVFGKQTRV 986


>emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]
          Length = 1334

 Score =  326 bits (836), Expect = 1e-96
 Identities = 189/475 (39%), Positives = 274/475 (57%), Gaps = 20/475 (4%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALEVLPGDMDTAT-------KGELN 162
            M+  + +IEKF   GDF LW++KM+ALL+  G E+AL+    D++ +T       + ++ 
Sbjct: 1    MSLPRHEIEKFTIGGDFSLWKLKMKALLVHQGLESALD--EEDLEASTGSGIDDKRRQIQ 58

Query: 163  KKAHSAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGR 342
             +AHS +ILSLG+ +LRE++ E TA G+WNK+ETL M KSLA+         TF M  G 
Sbjct: 59   NRAHSTLILSLGDSILREISEEKTALGIWNKVETLCMKKSLAHRLFLKKRLYTFSMREGV 118

Query: 343  TISEHIDEFNKIVLDLANIE-VKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDV 519
            TI +HID FNKI+LDL  +E VK  DED            YE FVDT+LYGR  LTLEDV
Sbjct: 119  TIQDHIDTFNKIILDLEGVENVKICDEDKAFFLLSSLPKSYEGFVDTMLYGRTTLTLEDV 178

Query: 520  MATLNSKEIKERSKAKGDDGEGLFVR--GRTDRKNSHQ----XXXXXXXXXXXXXLKCYI 681
             A+L+SKEI++  + +  +GEGL  R   + D+KN +Q                  KC+ 
Sbjct: 179  KASLSSKEIQKNCELETSNGEGLMARTEKKKDQKNKNQGKGHGKNQETADKKKKKRKCFY 238

Query: 682  CQSEEHLIRNCPKNNRKKSNGFVKKDDQPSSSGS-VYDDSEVMMVMSADALLDWIMDSGG 858
            C+ E H IR+C +  +K+S          S  GS  Y  +++++  +++    W++DSG 
Sbjct: 239  CRKEGHYIRDCFEKKKKESQEKSGDAAVASDDGSDGYQSADLLVASNSNTKGQWVIDSGC 298

Query: 859  SYHMTPRLDLLFDFLECDGGRVLLGDNRECKIRGI-----GKVRIQLKDGSSFVLHNVRY 1023
            S+H+ P   L + +   DGGRVL+G+N  C I GI      +  ++L       LH VR+
Sbjct: 299  SFHLCPEKTLFYKYEAVDGGRVLMGNNNVCNIVGIWFCKRSRCLMELLRS----LHEVRH 354

Query: 1024 IPELKRNLISLGTLEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVN 1203
             P LKRNLISLG L+  GY  K + G ++V  G+ +++ G   +N +Y L G +V  +  
Sbjct: 355  APRLKRNLISLGMLDSLGYFFKSRIGGLEVRKGTEIVMKGV-NENGLYVLQGSSVPVQEG 413

Query: 1204 ASVEEKDSLAQVWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHR 1368
             S   ++   ++WH RLGH+S  GLQ L KQ L G   + +L+FCENC+ GKSHR
Sbjct: 414  VSAVSEEDRTKLWHLRLGHMSIKGLQELSKQGLLGGDRIQQLEFCENCIFGKSHR 468


>emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]
          Length = 894

 Score =  314 bits (805), Expect = 1e-94
 Identities = 182/460 (39%), Positives = 254/460 (55%), Gaps = 4/460 (0%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAAL---EVLPGDMDTATKGELNKKAH 174
            M   KFD+EKF G  DFGLWR+KMRALL+Q G + AL   + LP  M    K EL +KAH
Sbjct: 1    MGTVKFDVEKFTGKNDFGLWRLKMRALLVQQGLQDALLGEKNLPXTMQEKHKIELLEKAH 60

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
             A+ILSLG+  LREV    +AA +  KLE+LYMTKSLAN         TF M    +I E
Sbjct: 61   GAIILSLGDTXLREVAKAKSAAKLLLKLESLYMTKSLANRLHKXIKLYTFKMTPSMSIEE 120

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
            H+D FNKI+LDL NI++   +ED            Y +  + ++YGR+ LT ++V + L+
Sbjct: 121  HLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGRDILTFDEVQSILH 180

Query: 535  SKEIKERSKAKGDDGEGLFVRGRTDRKNSHQ-XXXXXXXXXXXXXLKCYICQSEEHLIRN 711
            ++E+ ++ ++K + GEGL +RG++ ++   +               KC+IC  E H  ++
Sbjct: 181  ARELHKQEESKEELGEGLNIRGKSKKREKKKGNNSKSRSKSKTKKFKCFICHKEGHFKKD 240

Query: 712  CPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTPRLDLL 891
            CP   +      + + D        YD++ V+ V   D+  +WI+DSG S+HM P     
Sbjct: 241  CPDMRQNTXKKTMNEGDATMILDG-YDNAGVLNVAEVDSGKEWILDSGCSFHMCPIKAWF 299

Query: 892  FDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTLEK 1071
             DF E +GG VLLG+N+ CKI G G V+I+  DG   VL ++RYIPELK NLISLG L+K
Sbjct: 300  EDFKEANGGHVLLGNNKHCKILGTGTVKIKHYDGIERVLEDIRYIPELKMNLISLGMLDK 359

Query: 1072 EGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEEKDSLAQVWHKR 1251
             GYT K +   ++V  GS+ ++                                   H+R
Sbjct: 360  LGYTFKSEPNSLRVARGSLTVMK----------------------------------HQR 385

Query: 1252 LGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            LGHIS  GLQ L+KQ + G   L  L FCE+ V GK+ RV
Sbjct: 386  LGHISHRGLQELEKQGVLGNYKLTYLPFCEHYVFGKATRV 425


>emb|CAN70013.1| hypothetical protein VITISV_017116 [Vitis vinifera]
          Length = 947

 Score =  303 bits (775), Expect = 6e-90
 Identities = 164/413 (39%), Positives = 238/413 (57%)
 Frame = +1

Query: 133  MDTATKGELNKKAHSAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXX 312
            M    K +L +KA SA+ILSLG+ +LREV      A +W KLE+LYMTKSLAN       
Sbjct: 91   MQEKEKTKLLEKAQSAIILSLGDTMLREVAKAKPTAELWLKLESLYMTKSLANRLHKKIK 150

Query: 313  XXTFYMPAGRTISEHIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYG 492
              TF +  G +I EH D FNKI+LDL NI++   +ED            Y +  + ++YG
Sbjct: 151  LYTFKITPGMSIEEHFDHFNKIILDLENIDITVSNEDKAILLLTSLDASYTNMKEAIMYG 210

Query: 493  REALTLEDVMATLNSKEIKERSKAKGDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLK 672
            R+++T ++V + L+ +E++++ ++K + GEGL +RGR D++                  K
Sbjct: 211  RDSMTFDEVQSILHPRELQKQEESKDESGEGLNIRGRYDKREKKCKNLKAKSKSNTKKFK 270

Query: 673  CYICQSEEHLIRNCPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDS 852
            C+IC  E H  ++C    +      V + D        YD ++V+ V   D+  +WI+DS
Sbjct: 271  CFICHKEGHFKKDCSDKRQNTIKKTVNEGDAAVILDG-YDSAKVLNVAEMDSGKEWILDS 329

Query: 853  GGSYHMTPRLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPE 1032
            G S+HM P      DF E +GG VLLG+N+ CKI G   VRI+  DG   VL  VRYIPE
Sbjct: 330  GCSFHMCPIKAWFEDFKEANGGHVLLGNNKHCKILGTSIVRIKHYDGIERVLEVVRYIPE 389

Query: 1033 LKRNLISLGTLEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASV 1212
            LKRNLISLG L+K GYT K +   ++V  GS+ ++ GT + N +Y+L G  + G+V+  +
Sbjct: 390  LKRNLISLGMLDKLGYTFKSKPNSLRVARGSLTVMKGTIK-NGLYTLIGQTMTGKVSIVL 448

Query: 1213 EEKDSLAQVWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            +E   + ++WH+RLGHI+   LQ   KQ + G   L  L FCE+CV  K+ RV
Sbjct: 449  KEDMGITKLWHQRLGHINHKRLQEPQKQGVLGNYKLTDLPFCEHCVFSKATRV 501


>gb|KYP65226.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 689

 Score =  296 bits (758), Expect = 1e-89
 Identities = 169/439 (38%), Positives = 254/439 (57%), Gaps = 18/439 (4%)
 Frame = +1

Query: 73   MRALLIQHGCEAALEVLPGDM--DTATKGE----LNKKAHSAMILSLGNKVLREVTGETT 234
            MRALL+  G    ++ L G+   + AT  E    + +KAHSA+ILSLG+KVLR+V+ E T
Sbjct: 1    MRALLVHQGL---VDALAGEAKAENATVDEERKKMQEKAHSAIILSLGDKVLRQVSKEKT 57

Query: 235  AAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISEHIDEFNKIVLDLANIEVKFE 414
            AAG+W+KLE+LYMTKSL N         +F M   + + E +D+FNK++LDL NI+V  +
Sbjct: 58   AAGIWSKLESLYMTKSLVNRLYLKQSLYSFKMNEDKPVGEQLDQFNKLILDLENIDVTID 117

Query: 415  DEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLNSKEIKERSKAKGD-DGEGLF 591
            DED            Y HF +T+L+GR+++++++V A +NSKE+ ER + K   +GEGL 
Sbjct: 118  DEDQALLLLCSLPRAYSHFKETMLFGRDSVSIDEVQAAINSKELNERKEKKPTVNGEGLT 177

Query: 592  VRGRTDRKNSH------QXXXXXXXXXXXXXLKCYICQSEEHLIRNCPK-----NNRKKS 738
             +G+T +K S       +             ++CY C+ E H  + CP+      N++K 
Sbjct: 178  AKGKTSKKYSKPDKKKPKPEKQKDGGESTFTIRCYHCKKEGHTRKVCPERLANGGNKEKG 237

Query: 739  NGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTPRLDLLFDFLECDGG 918
              +V   +        Y+ +E ++V   ++ L+WIMDSG S+HMTPR     +F +   G
Sbjct: 238  KYYV---NVVIVQDEGYESAEALVVSKDNSKLEWIMDSGCSWHMTPRRSWFENFADQADG 294

Query: 919  RVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTLEKEGYTIKMQS 1098
             VLLGDN+ CKI+GIG +R +  DG   VL +VRY+P+LKRNLISLG  +K+GY  + Q 
Sbjct: 295  LVLLGDNKPCKIKGIGSIRFRFHDGIERVLADVRYVPDLKRNLISLGEFDKKGYVFQGQE 354

Query: 1099 GKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEEKDSLAQVWHKRLGHISEAGL 1278
            G + V+   +V++ G  + N +YS+DG  + G    +  +  S  ++WHKRLGH      
Sbjct: 355  GILNVVKNYVVVMRGIMK-NGLYSVDGEVITGSAATASRKLPSKTELWHKRLGH------ 407

Query: 1279 QVLDKQELFGKKSLGKLDF 1335
               DK     K + G LD+
Sbjct: 408  ---DKFSTRQKNTKGILDY 423


>dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri]
          Length = 605

 Score =  286 bits (732), Expect = 1e-86
 Identities = 168/420 (40%), Positives = 236/420 (56%), Gaps = 31/420 (7%)
 Frame = +1

Query: 13   AKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAAL-------EVLPGDMDT---------- 141
            A+F++EKF G  DFGLW++KMRALL Q G    L        V+ G   T          
Sbjct: 3    ARFEVEKFTGDNDFGLWKMKMRALLTQQGLIEVLMVEDPPATVVAGTAPTGQEDAAAAAV 62

Query: 142  -----ATKGELNKKAHSAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXX 306
                 A K  L+ KAHS +ILSLG++VLR+V+ E+TA G+W KLE LYMTKSLAN     
Sbjct: 63   NAQAAAEKKILDSKAHSVIILSLGDRVLRQVSHESTALGLWKKLEELYMTKSLANRLYLK 122

Query: 307  XXXXTFYMPAGRTISEHIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLL 486
                +F M   + I E +D+F K++LDL NIEVK EDED            Y  F DTLL
Sbjct: 123  QALYSFKMIEEKAIDEQMDQFIKLILDLENIEVKIEDEDQALLLVCALPRSYNTFKDTLL 182

Query: 487  YGREALTLEDVMATLNSKEIKER--SKAKGDDGEGLFVRGRTDRKNSHQXXXXXXXXXXX 660
            YGRE LTL++V A L SK++  R  +KA G   E L+V+G+ + K +H+           
Sbjct: 183  YGRETLTLKEVQAALKSKQLNTRIDNKAVGSTSEALYVKGKGEEKKTHK----ERKNKSK 238

Query: 661  XXLKCYICQSEEHLIRNCPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALL-- 834
              +KC+ C  E H+ +NCPK  R K  G   +  + + +   Y+ ++V+ V   D  +  
Sbjct: 239  KKVKCFYCDEEGHMCKNCPKKERDK--GKKVEQGEAAMACESYESADVLAVTHEDQDVTK 296

Query: 835  -----DWIMDSGGSYHMTPRLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSS 999
                  W++DS  S+H+T     + DF  CDG  V +G+ ++ KI G G V+I+LK G  
Sbjct: 297  SEKSGKWLLDSASSFHVTCVKSWIKDFKGCDGCLVSVGEEKQYKILGFGTVKIRLKTGGV 356

Query: 1000 FVLHNVRYIPELKRNLISLGTLEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDG 1179
             +L NV++IP+L RNLIS+G L+ +G+     +G +KV  GS VI+SGT + N  Y + G
Sbjct: 357  RILRNVKFIPDLGRNLISVGLLDVQGFKCVAGNGVMKVFKGSKVIMSGTLQKNRTYHVTG 416


>emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera]
          Length = 950

 Score =  289 bits (740), Expect = 6e-85
 Identities = 168/449 (37%), Positives = 255/449 (56%), Gaps = 21/449 (4%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALE---VLPGDMDTATKGELNKKAH 174
            M+  KF++EKF+G+ DF LW++KM+ALL+Q  C  A+E    LP  +    K E+  +AH
Sbjct: 1    MSSQKFEVEKFNGSNDFTLWKLKMKALLVQQKCAQAIEGEETLPVGLTAVEKEEVVSRAH 60

Query: 175  SAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISE 354
            SA++LSL ++VLREV  ETTA G+W K E+ Y  KSL N         T  M  G  + +
Sbjct: 61   SAILLSLADEVLREVADETTAVGLWRKFESKYQKKSLTNRLYQKRQLHTLKMSEGMQVRD 120

Query: 355  HIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLN 534
            H++ FN+I+LDL  + VK E+ED            YE+FVDT++YGR++++  DV   L 
Sbjct: 121  HLNNFNRIILDLNGVGVKVEEEDQAMILLCSLPSSYENFVDTMMYGRBSISXNDVKDALQ 180

Query: 535  SKEIKE--RSKAKGDDGEGLFV-RGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLI 705
            SKE+++      +G    GL V RGR+  +N                ++C+  + + H  
Sbjct: 181  SKELQKLVSGSEEGSVETGLTVSRGRSMERNG--GGRSKSXSKSKAAMRCFHXKEKGHFR 238

Query: 706  RNCPKNNR---KKSNG-----FVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGS 861
            +NCP+  +     SNG       +KD +   S    +  +V+ V ++ +   WI+D+G S
Sbjct: 239  KNCPQRQKGIGXGSNGNAQVVVAQKDSEKQDSSDEGEGGDVLTVSTSSSAESWILDTGAS 298

Query: 862  YHMTPRLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKR 1041
            YHM    DL   F E +G  V LGD+ E  ++G G V+I++ DG    L N  Y+P L++
Sbjct: 299  YHMAYSRDLFTTFKEWNGS-VKLGDDGELGVKGSGSVQIKMYDGLVRTL-NAWYVPGLRK 356

Query: 1042 NLISLGTLEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNA----- 1206
            NLIS+GTL+K GYT     G ++V  G++V++ G R  + +Y+L G +V G         
Sbjct: 357  NLISVGTLDKNGYTFSGSGGVLRVSKGALVVMKG-RLQHGIYTLMGSSVLGTAAVSSSMA 415

Query: 1207 --SVEEKDSLAQVWHKRLGHISEAGLQVL 1287
              SVE+KD+  ++WH+RLGH+SE GL +L
Sbjct: 416  IDSVEKKDNCTELWHRRLGHMSEKGLSIL 444


>dbj|GAU51472.1| hypothetical protein TSUD_95870 [Trifolium subterraneum]
          Length = 1682

 Score =  293 bits (749), Expect = 4e-84
 Identities = 175/457 (38%), Positives = 247/457 (54%), Gaps = 3/457 (0%)
 Frame = +1

Query: 10   GAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALE---VLPGDMDTATKGELNKKAHSA 180
            G KFDIEKF G+ DFGLW++KMRA+L+Q  C  AL+    +P  +    K E+N KA S+
Sbjct: 2    GLKFDIEKFTGSNDFGLWKLKMRAVLVQQKCVEALKGPTQMPAHLSVYEKTEMNDKAVSS 61

Query: 181  MILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISEHI 360
            + L LG+KVLREV  E +A  +W KL+ LYMTKSLA           + M   +++ E +
Sbjct: 62   ITLCLGDKVLREVACEISAVMMWTKLDALYMTKSLARRQCLKERPYFYRMVENKSVVEQL 121

Query: 361  DEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLNSK 540
             EFNKI+ DLANI+V  EDED              H +   L     L ++D    LN  
Sbjct: 122  AEFNKIIDDLANIDVILEDEDKAF-----------HLLTKELTKLRDLKIDDSGECLNVA 170

Query: 541  EIKERSKAKGDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLIRNCPK 720
              +   K KG        +G+  R  S                KCY C    H  ++CP+
Sbjct: 171  RGRSEYKGKG--------KGKKHRSKSRPKGGGDSGGK----FKCYHCHEPGHFKKDCPQ 218

Query: 721  NNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTPRLDLLFDF 900
               +K  G      Q ++S   Y+ +  + V S +    W+MDSG S HM  R +     
Sbjct: 219  ---RKGGG--SSSAQIATSDEGYESAGALTVTSWEPEKIWVMDSGCSDHMCLRKEYFKTL 273

Query: 901  LECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTLEKEGY 1080
               +GG V LG+N+  K++G G +R+++ D   F+L NVRYIPELKRNLIS+   +  GY
Sbjct: 274  ELKEGGVVRLGNNKAGKVQGTGTIRLKMYDDRDFLLKNVRYIPELKRNLISISMFDGLGY 333

Query: 1081 TIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEEKDSLAQVWHKRLGH 1260
            + + + G I++ +G++VI  G++  N +Y L+G  V      ++ EK  + ++WH RLGH
Sbjct: 334  STRFEHGSIRISHGALVIAKGSKM-NGLYILEGSTVISNALVTIVEKADMTKLWHLRLGH 392

Query: 1261 ISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
            +SE GL  L KQ   GKK L KLDFC+NC LGK H+V
Sbjct: 393  VSERGLVELAKQGSLGKKILNKLDFCDNCTLGKQHKV 429


>gb|OMO83367.1| Integrase, catalytic core [Corchorus capsularis]
          Length = 785

 Score =  283 bits (723), Expect = 1e-83
 Identities = 173/451 (38%), Positives = 247/451 (54%), Gaps = 18/451 (3%)
 Frame = +1

Query: 73   MRALLIQHGCEAAL--EVLPGDMDTATKGELNKKAHSAMILSLGNKVLREVTGETTAAGV 246
            M+A++IQ  C  A+  E+LP         E+N KAHSA++LSL N+VLREV  E   A +
Sbjct: 1    MKAIMIQQNCAGAIDKEMLPEKSTDKEIKEINSKAHSAILLSLSNEVLREVVAEKDTASL 60

Query: 247  WNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISEHIDEFNKIVLDLANIEVKFEDEDX 426
            W  L+  YM KSLAN         TF M     I +H+D FN+I+LDL  + VK EDED 
Sbjct: 61   WKALDDKYMKKSLANRLFQKQRLYTFKMVENTPIKDHLDSFNRIILDLGGVRVKIEDEDL 120

Query: 427  XXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLNSKEIKERSKAKGDDGEGLFV-RGR 603
                       +++F DT+LYGR+ + L+DV   L SKE++ +  A  D   GL V RGR
Sbjct: 121  ALILLFSLPRSFQNFRDTMLYGRDTIALKDVKDALLSKELQNKVSADVDGEAGLIVTRGR 180

Query: 604  TDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLIRNCPKNNRKKSNGFVKKD-------- 759
               K+S               L+C+ C  + HL ++CP  +RKK N   K +        
Sbjct: 181  NKEKSSGTTRFRSRSKSRVSRLRCFYCNEKGHLRKDCP--DRKKGNSSEKMESNVKAMVA 238

Query: 760  --DQPSSSGSVYDD---SEVMMVMSADALLDWIMDSGGSYHMTPRLDLLFDFLECDGGRV 924
               + SS     DD   ++V+ V +  +   W++D+  SYHMT   +L   F E +G  V
Sbjct: 239  IVQEGSSLVETSDDEVGTDVLTVSTTGSANTWVLDTSASYHMTFSRNLFTTFKEWNGS-V 297

Query: 925  LLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTLEKEGYTIKMQSGK 1104
            +LGD     ++G G V+I+  DG +    +   +PEL RNLISLGTL+K+GY    ++G+
Sbjct: 298  MLGDKTTLTVKGSGSVQIKTHDG-TIRTFDAWLVPEL-RNLISLGTLDKQGYKYSGENGQ 355

Query: 1105 IKVINGSMVILSGTRRDNCVYSLDGHAVEGE--VNASVEEKDSLAQVWHKRLGHISEAGL 1278
            IKV  G+M IL G +  + +Y+L G++V GE  V+ S+ + +   ++WH RLGH+SE GL
Sbjct: 356  IKVSKGAMTILKG-KLQHGIYTLIGNSVIGEVAVSESLGDSNDRTELWHLRLGHMSEQGL 414

Query: 1279 QVLDKQELFGKKSLGKLDFCENCVLGKSHRV 1371
             +L K+ L      GKL  CE CVLGK   V
Sbjct: 415  SILSKRGLLDGSECGKLKCCETCVLGKQRGV 445


>gb|ABO36622.1| copia LTR rider [Solanum lycopersicum]
 gb|ABO36636.1| copia LTR rider [Solanum lycopersicum]
          Length = 1307

 Score =  282 bits (721), Expect = 1e-80
 Identities = 161/466 (34%), Positives = 252/466 (54%), Gaps = 11/466 (2%)
 Frame = +1

Query: 4    MTGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALEVLPGDMDTATKGELNKKAHSAM 183
            M+     I+KF G   F LW+IKMRALL Q G  A L      + T     L +KAHS +
Sbjct: 1    MSALNVKIDKFTGRNSFSLWQIKMRALLKQQGFWAPLSKDKNAVVTPEMAILEEKAHSTI 60

Query: 184  ILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISEHID 363
            +L L + V+ EV+ E TAAG+W KLE+LYMTKSL N            M  G  + EH++
Sbjct: 61   MLCLADDVITEVSDEETAAGLWLKLESLYMTKSLTNKLLLKQRLFGLRMAEGTQLREHLE 120

Query: 364  EFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLNSKE 543
            + N ++L+L NI+VK EDED            +E+FV + + G++ ++LE+V + L+S+E
Sbjct: 121  QLNTLLLELRNIDVKIEDEDAALILLVSLPMSFENFVQSFIVGKDTVSLEEVRSALHSRE 180

Query: 544  IKERSKAKGDD--GEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLIRNCP 717
            ++ ++     D    GLF   R  RKN  +             + C  C+ + H   +CP
Sbjct: 181  LRHKANGTSTDIQPSGLFTSSRKGRKNGGKKNKPMSKGAKPDDV-CNYCKEKGHWKFDCP 239

Query: 718  KNNRKKSNGFVKKDDQPSSSGSVYDD---SEVMMVMSADALLD----WIMDSGGSYHMTP 876
            K          K+ ++ S S +V ++   SE  + + AD        W++DSG SYH+ P
Sbjct: 240  KKK--------KQSEKQSVSAAVAEEDTNSEEDIALVADEHTHHSDVWVLDSGASYHICP 291

Query: 877  RLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISL 1056
            R +    + + DGG + + ++  CK+ G G ++I+  DGS   L+ VR++P + +NLISL
Sbjct: 292  RREWFTTYEQVDGGSISMANSSVCKVVGTGSIKIRTHDGSFCTLNEVRHVPLMTKNLISL 351

Query: 1057 GTLEKEGYTIKMQSGKIKVINGSMVILSGTRRDNCVYSLDGHAVEGEVNASVEE--KDSL 1230
              L+ +G++   + G ++V  GS +IL G  R   +Y L G  V G  + +  E  +  +
Sbjct: 352  SLLDSKGFSWSGKDGVLRVWKGSNLILKGVMR-GTLYFLQGSTVTGSAHVASSEFHQKDM 410

Query: 1231 AQVWHKRLGHISEAGLQVLDKQELFGKKSLGKLDFCENCVLGKSHR 1368
             ++WH RLGH+ E G+Q+L K++L     +  L+FCE+CV GK HR
Sbjct: 411  TKLWHIRLGHMGERGMQILSKEDLLAGHKVKSLEFCEHCVFGKLHR 456


>gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotransposon family protein
            [Trema orientalis]
          Length = 380

 Score =  263 bits (672), Expect = 2e-80
 Identities = 146/377 (38%), Positives = 212/377 (56%), Gaps = 16/377 (4%)
 Frame = +1

Query: 7    TGAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALEVLPGDMDTATKG----------E 156
            T  KFDIEKF G  DF LW++KM A+L+Q G E AL  L  D+    K           E
Sbjct: 3    TTTKFDIEKFTGKNDFELWKMKMEAILVQQGLEKAL--LSEDLTATDKESLAEMKKKIEE 60

Query: 157  LNKKAHSAMILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPA 336
            ++ KA+SA+ILSL ++VLR+V  E   +G+W KLE LY  K+L            F M  
Sbjct: 61   VSPKAYSAIILSLSDQVLRKVLREKIISGIWIKLEELYRAKTLPGRIYLKERFFGFKMDK 120

Query: 337  GRTISEHIDEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLED 516
             ++I E++D++ K+VLDL N+ +K +D+D             ++F +TL YGR+ +T+++
Sbjct: 121  SKSIEENLDDYTKLVLDLENLGIKVDDKDKAIILLNSLPRNLKNFKETLKYGRQTITVDE 180

Query: 517  VMATLNSKEIKERSKAKGDDGEGLFVRGRTDRKNSH------QXXXXXXXXXXXXXLKCY 678
            V   L SK +  +   K   GEGL +RGRT ++++H      Q             +K Y
Sbjct: 181  VQNALESKLLDMKGSEKNAQGEGLHIRGRTTKQDNHDGKGKSQSRSKSRGKKDYSKVKYY 240

Query: 679  ICQSEEHLIRNCPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGG 858
             C    H+ R  P+   K +    K D         Y+ SEV+ +  ++   +W+MDSG 
Sbjct: 241  HCNKNGHIRRLRPERQNKDAG---KLDGDAVIVDDGYESSEVLSISESENSKEWVMDSGC 297

Query: 859  SYHMTPRLDLLFDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDGSSFVLHNVRYIPELK 1038
            SYHM PR D   D+ E DGG+VL+G+N  CK+ GIG + I++ DG + +L NVR++PELK
Sbjct: 298  SYHMCPREDWFMDYQEVDGGKVLMGNNMACKVMGIGSISIRMFDGVTRILKNVRHVPELK 357

Query: 1039 RNLISLGTLEKEGYTIK 1089
            R+LISLGTL+K GY  K
Sbjct: 358  RSLISLGTLDKSGYGFK 374


>gb|PKU72844.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Dendrobium catenatum]
          Length = 993

 Score =  275 bits (703), Expect = 2e-79
 Identities = 154/444 (34%), Positives = 239/444 (53%), Gaps = 9/444 (2%)
 Frame = +1

Query: 67   IKMRALLIQHGCEAAL---EVLPGDMDTATKGELNKKAHSAMILSLGNKVLREVTGETTA 237
            +K+ A+LIQ G E AL     LP  M    K  + KKA S++IL L ++VLR+V+   T 
Sbjct: 1    MKLEAILIQQGVEKALLPESELPSTMSDQEKLSIQKKAFSSIILCLADQVLRKVSHVKTV 60

Query: 238  AGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISEHIDEFNKIVLDLANIEVKFED 417
            + +W KLE LY  K+L N          + M   ++I +++DEFNK++LDL N+EVK ED
Sbjct: 61   SELWKKLEELYRQKTLPNRIYLKEKFFGYKMDEAKSIDDNLDEFNKLILDLENLEVKIED 120

Query: 418  EDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLNSKEIKERSKAKGDDGEGLFVR 597
            ED              +F +TL YGRE +T+++V   L+SK +  +   K   GEGL VR
Sbjct: 121  EDKAIILLNSLPKSLRNFKETLKYGRETITVDEVQNALSSKILDMKISEKNHSGEGLHVR 180

Query: 598  GRTDRKNSHQXXXXXXXXXXXXX------LKCYICQSEEHLIRNCPKNNRKKSNGFVKKD 759
            GR+ ++ + Q                   +KC+ C    H+ R CP+ N K  +   +  
Sbjct: 181  GRSQKRGTSQKKWKSKSRSKSASKKDYKNVKCWQCNKTGHIRRFCPEKNPKDKS---QSQ 237

Query: 760  DQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTPRLDLLFDFLECDGGRVLLGDN 939
               +  G  YD ++V+ V                                    +LLG+N
Sbjct: 238  GDAAIVGENYDSADVLNVSD----------------------------------LLLGNN 263

Query: 940  RECKIRGIGKVRIQLKDGSSFVLHNVRYIPELKRNLISLGTLEKEGYTIKMQSGKIKVIN 1119
            + C + GIG + +++ DG   +L +VR++P+LKRNLISLGTL+  GY  + + G +++  
Sbjct: 264  KACDVVGIGSIAVKMHDGHVRILKDVRHVPDLKRNLISLGTLDDSGYIFRSERGLLRISK 323

Query: 1120 GSMVILSGTRRDNCVYSLDGHAVEGEVNASVEEKDSLAQVWHKRLGHISEAGLQVLDKQE 1299
            G++VI+ G +R N +Y L G  + GE + + ++     ++WH+RLGH+S+ GL  L KQ 
Sbjct: 324  GALVIMKGIKR-NGLYVLQGATLVGETHVTAKQNLDKTKLWHQRLGHLSDRGLIELQKQG 382

Query: 1300 LFGKKSLGKLDFCENCVLGKSHRV 1371
            LFG  S+ K+DFCE+C++GKSHR+
Sbjct: 383  LFGNDSIAKIDFCESCIIGKSHRL 406


>gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 337

 Score =  256 bits (655), Expect = 2e-78
 Identities = 135/334 (40%), Positives = 200/334 (59%), Gaps = 6/334 (1%)
 Frame = +1

Query: 10  GAKFDIEKFDGTGDFGLWRIKMRALLIQHGCEAALE---VLPGDMDTATKGELNKKAHSA 180
           G +FD+EKF G  DF L RIKM+ALL+  G + AL+    LP  +    K +L  KAHS 
Sbjct: 2   GTRFDVEKFTGENDFSLRRIKMQALLVHQGLDDALQGASKLPSTLSDKEKKDLLSKAHST 61

Query: 181 MILSLGNKVLREVTGETTAAGVWNKLETLYMTKSLANXXXXXXXXXTFYMPAGRTISEHI 360
           +ILSLG++VLREV  E +AAG+W KLE+LYMTKSL N            M  G +I EH+
Sbjct: 62  IILSLGDEVLREVAEEKSAAGIWLKLESLYMTKSLTNKLYLKKRLHQLKMEEGSSIKEHV 121

Query: 361 DEFNKIVLDLANIEVKFEDEDXXXXXXXXXXXXYEHFVDTLLYGREALTLEDVMATLNSK 540
             F K VLDL +++V+ ++ED            +E+ VDT+L+GR+ LTLE+V ATLNS+
Sbjct: 122 SLFTKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTMLFGRDTLTLEEVKATLNSR 181

Query: 541 EIKER---SKAKGDDGEGLFVRGRTDRKNSHQXXXXXXXXXXXXXLKCYICQSEEHLIRN 711
           E+K++   +K +G D E L  RGR ++++S                 CY C+ E H  + 
Sbjct: 182 ELKKKITENKGEGGDPEALMARGRLEKRDSKSKNKRRSKYKNEK--ACYYCKKEGHFRKE 239

Query: 712 CPKNNRKKSNGFVKKDDQPSSSGSVYDDSEVMMVMSADALLDWIMDSGGSYHMTPRLDLL 891
           CP+  +KK+NG    +   +     Y+ +EV+ + +     +WI+DSG S+HMTP L+  
Sbjct: 240 CPE-RKKKNNGKYNDESDIAVVADGYESAEVLSISTKKHSEEWILDSGCSFHMTPNLEWF 298

Query: 892 FDFLECDGGRVLLGDNRECKIRGIGKVRIQLKDG 993
             + E DGG+VL+G+N  C + GIG ++++++DG
Sbjct: 299 SSYKEIDGGKVLMGNNMVCNVIGIGTIKLKVQDG 332


Top