BLASTX nr result

ID: Ophiopogon25_contig00038455 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon25_contig00038455
         (1144 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PKU78070.1| Retrovirus-related Pol polyprotein from transposo...   375   e-124
gb|KYP66219.1| Retrovirus-related Pol polyprotein from transposo...   391   e-124
gb|KYP32069.1| Retrovirus-related Pol polyprotein from transposo...   385   e-122
ref|XP_020867873.1| uncharacterized protein LOC110224828 [Arabid...   385   e-122
dbj|GAU28864.1| hypothetical protein TSUD_293160 [Trifolium subt...   385   e-122
gb|KYP69041.1| Retrovirus-related Pol polyprotein from transposo...   391   e-121
gb|KYP44533.1| Retrovirus-related Pol polyprotein from transposo...   391   e-121
gb|KZV47435.1| hypothetical protein F511_22511, partial [Dorcoce...   378   e-120
gb|KYP66220.1| Retrovirus-related Pol polyprotein from transposo...   387   e-120
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...   382   e-118
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   383   e-118
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   382   e-118
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         380   e-117
gb|AIC77183.1| polyprotein [Gossypium barbadense]                     380   e-117
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A...   378   e-117
gb|KZV34378.1| Integrase, catalytic core domain containing prote...   351   e-113
gb|KZV33171.1| Integrase, catalytic core domain containing prote...   350   e-111
gb|PHT36714.1| hypothetical protein CQW23_24414 [Capsicum baccatum]   361   e-110
gb|PRQ55987.1| putative RNA-directed DNA polymerase [Rosa chinen...   341   e-110
gb|KYP57183.1| Retrovirus-related Pol polyprotein from transposo...   350   e-109

>gb|PKU78070.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Dendrobium catenatum]
          Length = 477

 Score =  375 bits (964), Expect = e-124
 Identities = 185/362 (51%), Positives = 243/362 (67%), Gaps = 2/362 (0%)
 Frame = +1

Query: 64   SFRGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHKNEDLNNFAEATNDVGNNST 243
            S RGRG+G        R  K  V+C+ C+KFGH+A EC      +N       +      
Sbjct: 60   STRGRGRGRSN----SRYEKSQVKCYNCNKFGHFAKECRAPKSKVNEKVNYVEEERKEDD 115

Query: 244  LLLA--NDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKGK 417
            +LL    ++   ++  WYLD+GASNHMCGK+ +F+EL + V GNVS GD SK+ V+GKG 
Sbjct: 116  ILLLAYKNNEKCEDGTWYLDTGASNHMCGKRSMFVELDETVGGNVSFGDDSKIEVKGKGN 175

Query: 418  IKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACVK 597
            I I  K+G  ++IS+VY++PNM+SNILS+GQLL KGY + ++NN+L+LK+  G  IA V 
Sbjct: 176  ILIRLKNGNHQFISNVYFVPNMRSNILSLGQLLEKGYDIHLKNNYLFLKDNIGTLIAKVP 235

Query: 598  MTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINLP 777
            M+RNRMF L++  +V +C +   ++ SW WHLRFGHLNF GL+LLS   MVRGLP I  P
Sbjct: 236  MSRNRMFLLNIQNDVAKCLKACYKDVSWLWHLRFGHLNFGGLELLSKKEMVRGLPCIKHP 295

Query: 778  NHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFSR 957
            + VCE C++ K  R  FP   S RA+  L+L+HTD+CGPI+P SLG + YF+ FIDDFSR
Sbjct: 296  DQVCEACLLGKHFRKSFPRESSSRAQKPLELIHTDVCGPIKPCSLGKSNYFLLFIDDFSR 355

Query: 958  KLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKRQ 1137
            K WVY LK+KS  F  FK FKA VE ESG K+  +RSDRG E+TS  FQE+C   GI+R 
Sbjct: 356  KTWVYFLKQKSEVFGIFKKFKAAVEKESGLKIKAMRSDRGGEFTSKEFQEFCEANGIRRS 415

Query: 1138 FT 1143
             T
Sbjct: 416  LT 417


>gb|KYP66219.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1033

 Score =  391 bits (1005), Expect = e-124
 Identities = 198/365 (54%), Positives = 256/365 (70%), Gaps = 5/365 (1%)
 Frame = +1

Query: 64   SFRGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHKN----EDLNNFAEATNDVG 231
            S RGRG+GN       R +K  ++C+ C+KFGHYASEC   N    E+  N+AE      
Sbjct: 194  SNRGRGRGNPN----SRYDKSRIKCYNCNKFGHYASECRAPNKNKVEEKANYAEERCQ-- 247

Query: 232  NNSTLLLA-NDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEG 408
             + TLLLA       +++ WYLDSGASNHMCGK+ +F+EL + V GNV+ GD SK+ VEG
Sbjct: 248  EDGTLLLAYKGQDKGEDNQWYLDSGASNHMCGKRSMFVELDESVKGNVAFGDESKVAVEG 307

Query: 409  KGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIA 588
            KG + I  K+G+ ++IS+VYY+P+MKSNILS+GQLL KGY +Q++NN+L +++     IA
Sbjct: 308  KGNVLIRLKNGEHQFISNVYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIA 367

Query: 589  CVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAI 768
             V MTRNRMF L++ ++  +C +   +++SW WHLRFGHLNF GL+LLS   MVRGLP I
Sbjct: 368  KVPMTRNRMFVLNIQSDGPQCLKMCYKDQSWLWHLRFGHLNFKGLELLSKKAMVRGLPCI 427

Query: 769  NLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDD 948
              PN VCEGC++ KQ RL FP     RA+  L+L+HTD+CGPI+P SLG + YF+ FIDD
Sbjct: 428  THPNQVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFIDD 487

Query: 949  FSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGI 1128
            FSRK WVY LKEKS  F  FK FKA VE ESG  +  LRSDRG E+TS  FQ+YC + GI
Sbjct: 488  FSRKTWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDNGI 547

Query: 1129 KRQFT 1143
            +RQ T
Sbjct: 548  RRQLT 552


>gb|KYP32069.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 954

 Score =  385 bits (989), Expect = e-122
 Identities = 194/363 (53%), Positives = 253/363 (69%), Gaps = 5/363 (1%)
 Frame = +1

Query: 70   RGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHKN----EDLNNFAEATNDVGNN 237
            RGRG+GN       R  K  ++C+ C+KFGHYAS+C   N    E+  N+AE       +
Sbjct: 197  RGRGRGNPN----SRYGKSRIKCYNCNKFGHYASKCRAPNKNKVEEKANYAEERCQ--ED 250

Query: 238  STLLLA-NDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKG 414
             TLLLA       +++ WYLDSGASNHMCGK+ +F+EL + V GNV+ GD SK+ VEGKG
Sbjct: 251  GTLLLAYKGQDKGEDNQWYLDSGASNHMCGKRSMFVELDESVKGNVAFGDESKVAVEGKG 310

Query: 415  KIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACV 594
             + I  K+G+ ++IS+VYY+P+MKSNILS+GQLL KGY +Q++NN+L +++     IA V
Sbjct: 311  NVLIRLKNGEHQFISNVYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIAKV 370

Query: 595  KMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINL 774
             MTRNRMF L++ ++  +C +   +++SW WHLRFGHLNF GL+LLS   MVRGLP I  
Sbjct: 371  PMTRNRMFVLNIQSDGPQCLKMCYKDQSWLWHLRFGHLNFKGLELLSKKAMVRGLPCITH 430

Query: 775  PNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFS 954
            PN VCEGC++ KQ RL FP     RA+  L+L+HTD+CGPI+P SLG + YF+ FIDDFS
Sbjct: 431  PNQVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLFFIDDFS 490

Query: 955  RKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKR 1134
            RK WVY LKEKS  F  FK FK  VE E+G  +  LRSDRG E+TS  FQ+YC + GI+R
Sbjct: 491  RKTWVYFLKEKSEVFENFKKFKDNVEKENGLLIKALRSDRGGEFTSKEFQKYCEDNGIRR 550

Query: 1135 QFT 1143
            Q T
Sbjct: 551  QLT 553


>ref|XP_020867873.1| uncharacterized protein LOC110224828 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 961

 Score =  385 bits (989), Expect = e-122
 Identities = 195/367 (53%), Positives = 255/367 (69%), Gaps = 7/367 (1%)
 Frame = +1

Query: 64   SFRGRGQGN-RGRYTPGRGNKKNVQCHKCHKFGHYASECW----HKNEDLNNFAEATNDV 228
            S RGRG+G+ + RY     +K +V+C+ C KFGHYASEC      K E+  N+ E    V
Sbjct: 260  SSRGRGRGSPKSRY-----DKSSVKCYNCGKFGHYASECKAPSNKKVEEKANYVE--EQV 312

Query: 229  GNNSTLLLAN--DDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPV 402
                 LL+A+       +N  WYLDSGASNHMCG K +F+EL + V GNV+LGD SK+ V
Sbjct: 313  QEEDMLLMASYKKGEHEENHKWYLDSGASNHMCGSKSMFVELDESVRGNVALGDESKMEV 372

Query: 403  EGKGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGR 582
            +GKGKI I  K+G  ++IS+VYYIP+MK+NILS+GQLL KGY +++++N+L +++     
Sbjct: 373  KGKGKILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL 432

Query: 583  IACVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLP 762
            I  V M++NRMF L++  ++ +C +   + ESW WHLRFGHLNF GLKLLS   MVRGLP
Sbjct: 433  ITKVSMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLKLLSKKEMVRGLP 492

Query: 763  AINLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFI 942
             IN PN VCEGC++ KQ ++ FP   S RA+  L+L+HTD+CGPI+P SLG + YF+ FI
Sbjct: 493  CINHPNQVCEGCLLGKQFKMSFPKESSTRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFI 552

Query: 943  DDFSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQ 1122
            DDFSRK WVY LKEKS  F  FK FKA VE ESG  +  +RSDRG E+TS  F +YC + 
Sbjct: 553  DDFSRKTWVYFLKEKSEVFENFKRFKAHVEKESGLTIKSMRSDRGGEFTSKEFLKYCEDN 612

Query: 1123 GIKRQFT 1143
            GI+RQ T
Sbjct: 613  GIRRQLT 619


>dbj|GAU28864.1| hypothetical protein TSUD_293160 [Trifolium subterraneum]
          Length = 951

 Score =  385 bits (988), Expect = e-122
 Identities = 195/366 (53%), Positives = 256/366 (69%), Gaps = 6/366 (1%)
 Frame = +1

Query: 64   SFRGRGQGN-RGRYTPGRGNKKNVQCHKCHKFGHYASECW----HKNEDLNNFAEATNDV 228
            S RG G+G+ + RY     +K  V+C+ C KFGHYASEC      K E+  N+ E  +  
Sbjct: 261  SSRGHGRGSPKPRY-----DKSRVKCYNCEKFGHYASECRAPSNRKVEEKANYVEEISQ- 314

Query: 229  GNNSTLLLANDDTSVQND-VWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVE 405
              + TLLLA+ D     D  WYLDSGASNHMCG++ +F+EL + V+GNV+ GD SK+ V+
Sbjct: 315  -EDGTLLLAHKDNERGGDNQWYLDSGASNHMCGRRSMFVELDESVNGNVAFGDESKVAVK 373

Query: 406  GKGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRI 585
            GKG + I  K+G  ++IS+VYY+PNMKSNILS+GQLL KGY +Q++NN+L +++ +   I
Sbjct: 374  GKGNVLIRLKNGDHQFISNVYYVPNMKSNILSLGQLLEKGYDIQLKNNNLSIRDHSNKFI 433

Query: 586  ACVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPA 765
            A V M+RNRMF L++  +V +C +   + E W WHLRFGHLNF GL+LLS   MVRGLP 
Sbjct: 434  AKVTMSRNRMFVLNIQNDVAQCLKMCYKEEPWLWHLRFGHLNFGGLELLSKKEMVRGLPY 493

Query: 766  INLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFID 945
            IN PN VCEGC++ KQ ++ FP   S RA+  L+L+H D+CGPI+P SLG + YF+ FID
Sbjct: 494  INHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHMDVCGPIKPRSLGKSNYFLLFID 553

Query: 946  DFSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQG 1125
            +FSRK WVY LKEKS  F  FK FKALVE ESG  +  +RSDRG E+TSN F +YC +  
Sbjct: 554  NFSRKTWVYFLKEKSEVFENFKKFKALVEKESGRVIKAIRSDRGGEFTSNDFLKYCEDND 613

Query: 1126 IKRQFT 1143
            I+RQ T
Sbjct: 614  IRRQLT 619


>gb|KYP69041.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1342

 Score =  391 bits (1005), Expect = e-121
 Identities = 198/365 (54%), Positives = 256/365 (70%), Gaps = 5/365 (1%)
 Frame = +1

Query: 64   SFRGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHKN----EDLNNFAEATNDVG 231
            S RGRG+GN       R +K  ++C+ C+KFGHYASEC   N    E+  N+AE      
Sbjct: 260  SNRGRGRGNPN----SRYDKSRIKCYNCNKFGHYASECRAPNKNKVEEKANYAEERCQ-- 313

Query: 232  NNSTLLLA-NDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEG 408
             + TLLLA       +++ WYLDSGASNHMCGK+ +F+EL + V GNV+ GD SK+ VEG
Sbjct: 314  EDGTLLLAYKGQDKGEDNQWYLDSGASNHMCGKRSMFVELDESVKGNVAFGDESKVAVEG 373

Query: 409  KGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIA 588
            KG + I  K+G+ ++IS+VYY+P+MKSNILS+GQLL KGY +Q++NN+L +++     IA
Sbjct: 374  KGNVLIRLKNGEHQFISNVYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIA 433

Query: 589  CVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAI 768
             V MTRNRMF L++ ++  +C +   +++SW WHLRFGHLNF GL+LLS   MVRGLP I
Sbjct: 434  KVPMTRNRMFVLNIQSDGPQCLKMCYKDQSWLWHLRFGHLNFKGLELLSKKAMVRGLPCI 493

Query: 769  NLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDD 948
              PN VCEGC++ KQ RL FP     RA+  L+L+HTD+CGPI+P SLG + YF+ FIDD
Sbjct: 494  THPNQVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFIDD 553

Query: 949  FSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGI 1128
            FSRK WVY LKEKS  F  FK FKA VE ESG  +  LRSDRG E+TS  FQ+YC + GI
Sbjct: 554  FSRKTWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDNGI 613

Query: 1129 KRQFT 1143
            +RQ T
Sbjct: 614  RRQLT 618


>gb|KYP44533.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1342

 Score =  391 bits (1005), Expect = e-121
 Identities = 198/365 (54%), Positives = 256/365 (70%), Gaps = 5/365 (1%)
 Frame = +1

Query: 64   SFRGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHKN----EDLNNFAEATNDVG 231
            S RGRG+GN       R +K  ++C+ C+KFGHYASEC   N    E+  N+AE      
Sbjct: 260  SNRGRGRGNPN----SRYDKSRIKCYNCNKFGHYASECRAPNKNKVEEKANYAEERCQ-- 313

Query: 232  NNSTLLLA-NDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEG 408
             + TLLLA       +++ WYLDSGASNHMCGK+ +F+EL + V GNV+ GD SK+ VEG
Sbjct: 314  EDGTLLLAYKGQDKGEDNQWYLDSGASNHMCGKRSMFVELDESVKGNVAFGDESKVAVEG 373

Query: 409  KGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIA 588
            KG + I  K+G+ ++IS+VYY+P+MKSNILS+GQLL KGY +Q++NN+L +++     IA
Sbjct: 374  KGNVLIRLKNGEHQFISNVYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIA 433

Query: 589  CVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAI 768
             V MTRNRMF L++ ++  +C +   +++SW WHLRFGHLNF GL+LLS   MVRGLP I
Sbjct: 434  KVPMTRNRMFVLNIQSDGPQCLKMCYKDQSWLWHLRFGHLNFKGLELLSKKAMVRGLPCI 493

Query: 769  NLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDD 948
              PN VCEGC++ KQ RL FP     RA+  L+L+HTD+CGPI+P SLG + YF+ FIDD
Sbjct: 494  THPNQVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFIDD 553

Query: 949  FSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGI 1128
            FSRK WVY LKEKS  F  FK FKA VE ESG  +  LRSDRG E+TS  FQ+YC + GI
Sbjct: 554  FSRKTWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDNGI 613

Query: 1129 KRQFT 1143
            +RQ T
Sbjct: 614  RRQLT 618


>gb|KZV47435.1| hypothetical protein F511_22511, partial [Dorcoceras hygrometricum]
          Length = 881

 Score =  378 bits (970), Expect = e-120
 Identities = 179/362 (49%), Positives = 252/362 (69%), Gaps = 4/362 (1%)
 Frame = +1

Query: 70   RGRGQGNR----GRYTPGRGNKKNVQCHKCHKFGHYASECWHKNEDLNNFAEATNDVGNN 237
            R RG+G R    GR T  R +K NV+C+ CHKFGHY+ EC +  E+ NNFA+ + +  N 
Sbjct: 209  RSRGRGKRPRGGGRQTQQRYDKSNVECYNCHKFGHYSYECRNNVEETNNFAKNSIEEVNP 268

Query: 238  STLLLANDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKGK 417
            + LL         ND WYLDSGAS+H+CG K+LF+EL + + G ++ GDSS++ V+G+G 
Sbjct: 269  TLLLACKTTQEKDNDKWYLDSGASSHICGNKDLFVELDESIGGKITFGDSSQVQVQGRGT 328

Query: 418  IKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACVK 597
            I    K+G  + IS+VYY+P+MKSN+LS+GQLL K Y++ +++  L +K+ +G R+  V 
Sbjct: 329  ILFRSKNGSHQLISNVYYVPDMKSNVLSLGQLLEKNYEISLKDKSLTMKDESG-RLIEVP 387

Query: 598  MTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINLP 777
            MT+NRM  L++ ++V  C +   ++ SW WH+R GHLNF  LKL+S   MV+GLP+I+ P
Sbjct: 388  MTKNRMLLLNIQSDVPMCLKSFFKDSSWLWHMRLGHLNFDSLKLMSKRKMVKGLPSIDHP 447

Query: 778  NHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFSR 957
            N +CEGCI+ KQ+R  F      RA+  L+L+H+D+CGPI+P SLG + YFI FIDDFSR
Sbjct: 448  NQLCEGCILGKQARKSFSKKSMTRAQHPLELIHSDVCGPIKPSSLGKSNYFIIFIDDFSR 507

Query: 958  KLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKRQ 1137
            K WVY +KEKS  F TFK FK +VE +SG+++  LRSDRG E+TSN F+++C + GI R 
Sbjct: 508  KTWVYFIKEKSEVFETFKKFKIMVEKQSGYQIQALRSDRGGEFTSNEFKKFCEDNGIHRP 567

Query: 1138 FT 1143
             T
Sbjct: 568  MT 569


>gb|KYP66220.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1331

 Score =  387 bits (993), Expect = e-120
 Identities = 195/365 (53%), Positives = 253/365 (69%), Gaps = 5/365 (1%)
 Frame = +1

Query: 64   SFRGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHKN----EDLNNFAEATNDVG 231
            S RGRG+GN       R +K  ++C+ C+KFGHYASEC   N    E+  N+AE      
Sbjct: 249  SNRGRGRGNPN----SRYDKSRIKCYNCNKFGHYASECRAPNKNKVEEKANYAEERCQ-- 302

Query: 232  NNSTLLLA-NDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEG 408
             + TLLLA       +++ WYLDSGASNHMCGK+ +F+EL + V GNV+ GD SK+ VEG
Sbjct: 303  EDGTLLLAYKGQDKGEDNQWYLDSGASNHMCGKRSMFVELDESVKGNVAFGDESKVAVEG 362

Query: 409  KGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIA 588
            KG + I  K+G+ ++IS++YY+P+MKSNILS+GQLL KGY +Q++NN+L +++     I 
Sbjct: 363  KGNVLIQLKNGEHQFISNIYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIT 422

Query: 589  CVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAI 768
             V M RNRMF L++ ++  +C +   +++SW WHLRFGHLNF GL LLS   MVRGLP I
Sbjct: 423  KVPMMRNRMFVLNIQSDGPQCLKMCYKDQSWLWHLRFGHLNFKGLDLLSKKAMVRGLPCI 482

Query: 769  NLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDD 948
              PN VCEGC++ KQ RL FP     RA+  L+L+HTD+CGPI+P SLG + YF+ FIDD
Sbjct: 483  THPNQVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFIDD 542

Query: 949  FSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGI 1128
            FSRK WVY LKEKS  F  FK FKA VE ESG  +  LRSDRG E+TS  FQ+YC + GI
Sbjct: 543  FSRKTWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDNGI 602

Query: 1129 KRQFT 1143
            +RQ T
Sbjct: 603  RRQLT 607


>gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana]
 gb|AAG50765.1|AC079131_10 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  382 bits (982), Expect = e-118
 Identities = 193/367 (52%), Positives = 255/367 (69%), Gaps = 7/367 (1%)
 Frame = +1

Query: 64   SFRGRGQGN-RGRYTPGRGNKKNVQCHKCHKFGHYASECW----HKNEDLNNFAEATNDV 228
            S RGRG+G+ + RY     +K +V+C+ C KFGHYASEC      K E+  N+ E    +
Sbjct: 261  SSRGRGKGHPKSRY-----DKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVE--EKI 313

Query: 229  GNNSTLLLAN--DDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPV 402
                 LL+A+   D   +N  WYLDSGASNHMCG+K +F EL + V GNV+LGD SK+ V
Sbjct: 314  QEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEV 373

Query: 403  EGKGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGR 582
            +GKG I I  K+G  ++IS+VYYIP+MK+NILS+GQLL KGY +++++N+L +++     
Sbjct: 374  KGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL 433

Query: 583  IACVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLP 762
            I  V M++NRMF L++  ++ +C +   + ESW WHLRFGHLNF GL+LLS   MVRGLP
Sbjct: 434  ITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 763  AINLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFI 942
             IN PN VCEGC++ KQ ++ FP   S RA+  L+L+HTD+CGPI+P SLG + YF+ FI
Sbjct: 494  CINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFI 553

Query: 943  DDFSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQ 1122
            DDFSRK WVY LKEKS  F  FK FKA VE ESG  +  +RSDRG E+TS  F +YC + 
Sbjct: 554  DDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDN 613

Query: 1123 GIKRQFT 1143
            GI+RQ T
Sbjct: 614  GIRRQLT 620


>gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  383 bits (983), Expect = e-118
 Identities = 193/367 (52%), Positives = 255/367 (69%), Gaps = 7/367 (1%)
 Frame = +1

Query: 64   SFRGRGQGN-RGRYTPGRGNKKNVQCHKCHKFGHYASECW----HKNEDLNNFAEATNDV 228
            S RGRG+G+ + RY     +K +V+C+ C KFGHYASEC      K E+  N+ E    +
Sbjct: 261  SSRGRGKGHPKSRY-----DKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVE--EKI 313

Query: 229  GNNSTLLLAN--DDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPV 402
                 LL+A+   D   +N  WYLDSGASNHMCG+K +F EL + V GNV+LGD SK+ V
Sbjct: 314  QEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEV 373

Query: 403  EGKGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGR 582
            +GKG I I  K+G  ++IS+VYYIP+MK+NILS+GQLL KGY +++++N+L +++     
Sbjct: 374  KGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL 433

Query: 583  IACVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLP 762
            I  V M++NRMF L++  ++ +C +   + ESW WHLRFGHLNF GL+LLS   MVRGLP
Sbjct: 434  ITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 763  AINLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFI 942
             IN PN VCEGC++ KQ ++ FP   S RA+  L+L+HTD+CGPI+P SLG + YF+ FI
Sbjct: 494  CINHPNQVCEGCLLGKQFKMSFPKESSSRAQKSLELIHTDVCGPIKPKSLGKSNYFLLFI 553

Query: 943  DDFSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQ 1122
            DDFSRK WVY LKEKS  F  FK FKA VE ESG  +  +RSDRG E+TS  F +YC + 
Sbjct: 554  DDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDN 613

Query: 1123 GIKRQFT 1143
            GI+RQ T
Sbjct: 614  GIRRQLT 620


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  382 bits (982), Expect = e-118
 Identities = 193/367 (52%), Positives = 255/367 (69%), Gaps = 7/367 (1%)
 Frame = +1

Query: 64   SFRGRGQGN-RGRYTPGRGNKKNVQCHKCHKFGHYASECW----HKNEDLNNFAEATNDV 228
            S RGRG+G+ + RY     +K +V+C+ C KFGHYASEC      K E+  N+ E    +
Sbjct: 261  SSRGRGKGHPKSRY-----DKSSVKCYNCGKFGHYASECKAPSNKKFEEKANYVE--EKI 313

Query: 229  GNNSTLLLAN--DDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPV 402
                 LL+A+   D   +N  WYLDSGASNHMCG+K +F EL + V GNV+LGD SK+ V
Sbjct: 314  QEEDMLLMASYKKDEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEV 373

Query: 403  EGKGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGR 582
            +GKG I I  K+G  ++IS+VYYIP+MK+NILS+GQLL KGY +++++N+L +++     
Sbjct: 374  KGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL 433

Query: 583  IACVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLP 762
            I  V M++NRMF L++  ++ +C +   + ESW WHLRFGHLNF GL+LLS   MVRGLP
Sbjct: 434  ITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 763  AINLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFI 942
             IN PN VCEGC++ KQ ++ FP   S RA+  L+L+HTD+CGPI+P SLG + YF+ FI
Sbjct: 494  CINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFI 553

Query: 943  DDFSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQ 1122
            DDFSRK WVY LKEKS  F  FK FKA VE ESG  +  +RSDRG E+TS  F +YC + 
Sbjct: 554  DDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDN 613

Query: 1123 GIKRQFT 1143
            GI+RQ T
Sbjct: 614  GIRRQLT 620


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  380 bits (977), Expect = e-117
 Identities = 192/367 (52%), Positives = 255/367 (69%), Gaps = 7/367 (1%)
 Frame = +1

Query: 64   SFRGRGQGN-RGRYTPGRGNKKNVQCHKCHKFGHYASECW----HKNEDLNNFAEATNDV 228
            S RGRG+G+ + RY     +K +V+C+ C KFGHYASEC      K E+  ++ E    +
Sbjct: 261  SSRGRGKGHPKSRY-----DKSSVKCYNCGKFGHYASECKAPSNKKFEEKAHYVE--EKI 313

Query: 229  GNNSTLLLAN--DDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPV 402
                 LL+A+   D   +N  WYLDSGASNHMCG+K +F EL + V GNV+LGD SK+ V
Sbjct: 314  QEEDMLLMASYKKDEQKENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEV 373

Query: 403  EGKGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGR 582
            +GKG I I  K+G  ++IS+VYYIP+MK+NILS+GQLL KGY +++++N+L +++     
Sbjct: 374  KGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNL 433

Query: 583  IACVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLP 762
            I  V M++NRMF L++  ++ +C +   + ESW WHLRFGHLNF GL+LLS   MVRGLP
Sbjct: 434  ITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 763  AINLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFI 942
             IN PN VCEGC++ KQ ++ FP   S RA+  L+L+HTD+CGPI+P SLG + YF+ FI
Sbjct: 494  CINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFI 553

Query: 943  DDFSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQ 1122
            DDFSRK WVY LKEKS  F  FK FKA VE ESG  +  +RSDRG E+TS  F +YC + 
Sbjct: 554  DDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDN 613

Query: 1123 GIKRQFT 1143
            GI+RQ T
Sbjct: 614  GIRRQLT 620


>gb|AIC77183.1| polyprotein [Gossypium barbadense]
          Length = 1369

 Score =  380 bits (977), Expect = e-117
 Identities = 188/364 (51%), Positives = 254/364 (69%), Gaps = 8/364 (2%)
 Frame = +1

Query: 76   RGQGNRGRYTPGRG----NKKNVQCHKCHKFGHYASEC--WHKNEDLNNFAEAT--NDVG 231
            RG+G+RGR   GRG    NK  VQC+ C+K+GH++ EC   HK ++ N+ A A   N+  
Sbjct: 277  RGRGSRGR---GRGRFQENKSQVQCYNCNKYGHFSYECRSTHKVDERNHVAVAAEGNEKV 333

Query: 232  NNSTLLLANDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGK 411
             +S  L   ++   +  VWYLD+GASNHMCG+KELF EL + VHG ++ GD+S   ++GK
Sbjct: 334  ESSVFLTYGENEDRKRSVWYLDNGASNHMCGRKELFTELDETVHGQITFGDNSHAEIKGK 393

Query: 412  GKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIAC 591
            GK+ I Q++G+ +YISDVYY+P +KSN++S+GQLL KGY+V M++  L ++N +G  +  
Sbjct: 394  GKVVITQRNGEKKYISDVYYVPALKSNLISLGQLLEKGYEVHMKDRSLAIRNKSGELVVR 453

Query: 592  VKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAIN 771
            V MTRNR+F L + +   +C +  ++NESW WHLR+GHL FSGLKLLS   MV GLP+IN
Sbjct: 454  VDMTRNRLFTLDIESGEVKCMKTDLKNESWLWHLRYGHLGFSGLKLLSKTNMVNGLPSIN 513

Query: 772  LPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDF 951
             P+ +CE C+  KQ R  F  GKS RA   L++VHTDI GP +  SLGGNRY++TFIDD+
Sbjct: 514  HPDQLCEACVKGKQHRQKFEVGKSRRARRPLEIVHTDISGPYDIESLGGNRYYLTFIDDY 573

Query: 952  SRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIK 1131
            SRK WVY LK KS A   FK FKA+VE +SG  L ILRSDRG EYT+  ++ +C++ GI 
Sbjct: 574  SRKCWVYFLKAKSEALEKFKEFKAMVEKQSGRYLKILRSDRGGEYTAKLYESFCKDHGII 633

Query: 1132 RQFT 1143
             Q T
Sbjct: 634  HQLT 637


>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1272

 Score =  378 bits (970), Expect = e-117
 Identities = 191/367 (52%), Positives = 251/367 (68%), Gaps = 7/367 (1%)
 Frame = +1

Query: 64   SFRGRGQGN-RGRYTPGRGNKKNVQCHKCHKFGHYASECWHKNEDLNNFAEATN----DV 228
            S RGRG+G+ + RY     +K +V+C+ C KFGHYASEC  K      F E  N     +
Sbjct: 261  SSRGRGKGHPKSRY-----DKSSVKCYNCGKFGHYASEC--KAPSNKKFKEKANYVEEKI 313

Query: 229  GNNSTLLLAN--DDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPV 402
                 LL+A+   D   +N  WYLDSGASNHMCG+K +F EL + V GNV+LGD SK+ V
Sbjct: 314  QEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDESVRGNVALGDESKMEV 373

Query: 403  EGKGKIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGR 582
            +GKG I I  K+G  ++IS+VYYIP+MK+NILS+GQLL KGY +++++N+L +++     
Sbjct: 374  KGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDKESNL 433

Query: 583  IACVKMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLP 762
            I  V M++NRMF L++  ++ +C +   + ESW WHLRFGHLNF GL+LLS   MVRGLP
Sbjct: 434  ITKVPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 763  AINLPNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFI 942
             IN PN VCEGC++  Q ++ FP   S RA+  L+L+HTD+CGPI+P SLG + YF+ FI
Sbjct: 494  CINHPNQVCEGCLLGNQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFI 553

Query: 943  DDFSRKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQ 1122
            DDFSRK WVY LKEKS  F  FK FKA VE ESG  +  +RSD G E+TS  F +YC + 
Sbjct: 554  DDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDSGGEFTSKEFLKYCEDN 613

Query: 1123 GIKRQFT 1143
            GI+RQ T
Sbjct: 614  GIRRQLT 620


>gb|KZV34378.1| Integrase, catalytic core domain containing protein [Dorcoceras
            hygrometricum]
          Length = 578

 Score =  351 bits (900), Expect = e-113
 Identities = 177/360 (49%), Positives = 229/360 (63%), Gaps = 2/360 (0%)
 Frame = +1

Query: 70   RGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHK--NEDLNNFAEATNDVGNNST 243
            RGRG+GN  R T  RG K  ++C+ CHK+GHY+ EC     NE+ N       D    S 
Sbjct: 124  RGRGRGNYER-TDDRGKKSQIECYSCHKYGHYSWECPSNMDNEEANLVENREYDA-EQSL 181

Query: 244  LLLANDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKGKIK 423
            LL   D++      WYLD+GASNHM G KE F+EL     G VS GD++K+ +EGKG I 
Sbjct: 182  LLALKDESKSNASTWYLDNGASNHMTGDKEKFVELDTSQKGFVSFGDNTKVKIEGKGTIL 241

Query: 424  IYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACVKMT 603
               K+G  + +SDV Y+P + SNILSIGQLL + YK+ M +  LW+++++   IA V MT
Sbjct: 242  FEAKNGSHKVLSDVCYVPKLTSNILSIGQLLERNYKIYMADRTLWIRDSDSNLIAKVSMT 301

Query: 604  RNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINLPNH 783
            +NRMF L L      C +  +++ SW+WH+RFGHLNF GLK L    MV+G+P I+ P+ 
Sbjct: 302  KNRMFLLDLKDCGPMCLKSFVQDPSWKWHMRFGHLNFGGLKALGDHKMVKGIPKIDHPDQ 361

Query: 784  VCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFSRKL 963
            +CE C+  K  R  FP     RA   LQLVH D+CGPI+P S G + YF+ FIDDFSRK 
Sbjct: 362  LCEACLFGKHPRKSFPKQSLSRAIKPLQLVHADVCGPIKPQSFGKSCYFVLFIDDFSRKT 421

Query: 964  WVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKRQFT 1143
            WVY LK KS AF  FK FKALVE ESG+++  LR+DRG E+TSN F  +C   GI+R  T
Sbjct: 422  WVYFLKYKSEAFDAFKKFKALVEKESGYEIKALRTDRGGEFTSNEFNSFCELHGIRRPLT 481


>gb|KZV33171.1| Integrase, catalytic core domain containing protein [Dorcoceras
            hygrometricum]
          Length = 702

 Score =  350 bits (898), Expect = e-111
 Identities = 175/360 (48%), Positives = 228/360 (63%), Gaps = 2/360 (0%)
 Frame = +1

Query: 70   RGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHK--NEDLNNFAEATNDVGNNST 243
            RGRG+GN  R T  RG K  ++C+ CHK+GHY+ EC     NE+ N       D    S 
Sbjct: 124  RGRGRGNYER-TDDRGKKSQIECYSCHKYGHYSWECPSNMDNEEANLVENREYDA-EQSL 181

Query: 244  LLLANDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKGKIK 423
             L  ND++      WYLD+G +NHM G KE F+EL     G VS GD++K+ +EGK  I 
Sbjct: 182  FLALNDESKSNASTWYLDNGVNNHMTGDKEKFVELDTSQKGFVSFGDNTKVKIEGKVTIL 241

Query: 424  IYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACVKMT 603
               K+G  + +SDVYY+P + SNILSIGQLL + YK+ ME+  LW+++++   IA V MT
Sbjct: 242  FEAKNGSHKVLSDVYYVPKLTSNILSIGQLLERNYKIYMEDRTLWIRDSDSNLIARVSMT 301

Query: 604  RNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINLPNH 783
            +N MF L L      C +  +++ SW+WH+RFGHLNF GLK L    MV+G+P I+ P+ 
Sbjct: 302  KNNMFQLDLKDCGPMCLKSFVQDPSWKWHMRFGHLNFGGLKALGDHKMVKGIPKIDHPDQ 361

Query: 784  VCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFSRKL 963
            +CE C+ SK  R  FP     RA   LQLVH D+CGPI+P S G + YF+ FIDDFSRK 
Sbjct: 362  LCEACLFSKHPRKSFPKKSLSRAIKPLQLVHADVCGPIKPQSFGKSCYFVLFIDDFSRKT 421

Query: 964  WVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKRQFT 1143
            WVY LK KS AF  FK FK LVE ESG+++  LR+DRG E+TSN F  +C   GI+R  T
Sbjct: 422  WVYFLKYKSEAFDAFKKFKTLVEKESGYEIKALRTDRGGEFTSNEFNSFCELHGIRRPLT 481


>gb|PHT36714.1| hypothetical protein CQW23_24414 [Capsicum baccatum]
          Length = 1427

 Score =  361 bits (927), Expect = e-110
 Identities = 181/363 (49%), Positives = 248/363 (68%), Gaps = 7/363 (1%)
 Frame = +1

Query: 76   RGQGNRGRYTPGRGN-----KKNVQCHKCHKFGHYASECWH--KNEDLNNFAEATNDVGN 234
            RG+G RGR + GRG      K  VQC+ C K+G+Y+ +C    K E+ ++ A   N+ G 
Sbjct: 575  RGRGARGR-SRGRGRSQHGFKSQVQCYNCDKYGYYSYKCRSAPKQEERSHVAAIENENGE 633

Query: 235  NSTLLLANDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKG 414
            +   L    D     +VWYLD+ ASNHMCG+ ELF+EL + V+G V+ GD S++ V+GKG
Sbjct: 634  SRIFLTYKGDQGSNRNVWYLDNCASNHMCGRMELFVELDESVNGRVTFGDDSQIDVKGKG 693

Query: 415  KIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACV 594
            K+ I QK+G+ +YI+DVYY+P +KSNI+SIGQL   GY+V +++  L L+N N   ++ V
Sbjct: 694  KVMITQKNGEKKYITDVYYVPALKSNIISIGQLCELGYEVTIKDCSLTLRNKNREVVSKV 753

Query: 595  KMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINL 774
             MTRN +F + + +   +C +  I+++SW WHLR+GHL FSGLKLL+   MV GLP IN 
Sbjct: 754  DMTRNHLFTIDIESGEVKCMKISIKDDSWLWHLRYGHLGFSGLKLLAKENMVNGLPKINP 813

Query: 775  PNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFS 954
            P+H+CE CI  KQ R  F  GKS RA   L++VH+D+ GP +  SLGGNRY++TFIDDFS
Sbjct: 814  PDHLCEACIKGKQHRQSFEVGKSRRARKPLEIVHSDLAGPFDIPSLGGNRYYLTFIDDFS 873

Query: 955  RKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKR 1134
            R+ WVY LKEKS     FK FKA+VE +SG+ + ILRSDRG EYT+N F+++ +E GI  
Sbjct: 874  RRSWVYILKEKSETLDKFKEFKAMVEKQSGYYVKILRSDRGGEYTANLFEDFVKEHGIIH 933

Query: 1135 QFT 1143
            Q T
Sbjct: 934  QLT 936


>gb|PRQ55987.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 526

 Score =  341 bits (874), Expect = e-110
 Identities = 176/363 (48%), Positives = 226/363 (62%), Gaps = 6/363 (1%)
 Frame = +1

Query: 73   GRGQGNRGRYTPGRGN--KKNVQCHKCHKFGHYASECW----HKNEDLNNFAEATNDVGN 234
            GRG+G RG +  GRG+  K NV+C++CH +GH+ SEC     +   +  NFAE       
Sbjct: 164  GRGRG-RGGFDGGRGSFDKSNVECYRCHGYGHFKSECTSNLHYGRGEKANFAEKEE---K 219

Query: 235  NSTLLLANDDTSVQNDVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKG 414
               +LL         DVWYLDSG SNHMCG K LF +  +     + LG+ +K+ V GKG
Sbjct: 220  EEEILLMAYHEGTNTDVWYLDSGCSNHMCGNKYLFSDYDEFFKDTMKLGNDAKMTVVGKG 279

Query: 415  KIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACV 594
             IK+ +  G    I DV+Y+P++KSN++SIGQL  KGY + M      + +   G IA V
Sbjct: 280  NIKL-KIGGHVVKICDVFYVPDLKSNLISIGQLQEKGYTIIMRKGCCQIMHPEKGLIAQV 338

Query: 595  KMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINL 774
             MT NRMFPLH+  ++  C+   + + SW WH+R+GHLNF+ L+ L    +V GLP I  
Sbjct: 339  TMTTNRMFPLHIQHDIQTCYTMQMSDASWLWHMRYGHLNFNCLRTLQQRSLVTGLPHITC 398

Query: 775  PNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFS 954
               VCE C+I KQ R PFP   +WRA+  LQLVH DICGPI PVS G  RYFITF DDFS
Sbjct: 399  STRVCEECVIGKQHRDPFPKAGAWRAKTVLQLVHLDICGPIHPVSNGNKRYFITFTDDFS 458

Query: 955  RKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKR 1134
            RK WVY +++KS AF  FK FK LVE ESG ++ ILR DRG EY S+AF  +C   GI+R
Sbjct: 459  RKTWVYFMEQKSEAFGVFKSFKTLVEKESGKEIKILRYDRGGEYNSSAFMSFCASYGIRR 518

Query: 1135 QFT 1143
            Q T
Sbjct: 519  QLT 521


>gb|KYP57183.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 884

 Score =  350 bits (898), Expect = e-109
 Identities = 176/363 (48%), Positives = 234/363 (64%), Gaps = 3/363 (0%)
 Frame = +1

Query: 64   SFRGRGQGNRGRYTPGRGNKKNVQCHKCHKFGHYASECWHKNEDLN-NFAEATNDVGNNS 240
            +FRGRG+G RGR+   + +K  V+C+ CHK GHY  EC  K ++   N  E   ++    
Sbjct: 206  NFRGRGRG-RGRHA--QFDKTRVECYHCHKLGHYQYECPDKEKETKVNLVEFEGEM---- 258

Query: 241  TLLLANDDTSVQN--DVWYLDSGASNHMCGKKELFMELAKGVHGNVSLGDSSKLPVEGKG 414
             LL+A  D    +  D WYLDSG SNHMCG K LF  + +     V LGD+S + V GKG
Sbjct: 259  -LLMAYIDKKENSSGDTWYLDSGCSNHMCGNKSLFYNMDETFRETVKLGDNSCISVMGKG 317

Query: 415  KIKIYQKSGKPEYISDVYYIPNMKSNILSIGQLLGKGYKVQMENNHLWLKNANGGRIACV 594
             IK + K+     IS+V+YIP++KSN++S+GQL  +GY + ++ +   + +   G I   
Sbjct: 318  DIKFHMKNNTVHTISNVFYIPDLKSNLISMGQLQERGYIIIIQQSRCQIHHPEKGLIVDA 377

Query: 595  KMTRNRMFPLHLNTEVDRCFQGVIENESWRWHLRFGHLNFSGLKLLSTAGMVRGLPAINL 774
            KMT NRMFP+H+  ++ +CF   +++ +W WHLR+GHL+F GLK L    MV GLP IN 
Sbjct: 378  KMTANRMFPMHIQYDIQKCFSTRVQDPTWLWHLRYGHLSFKGLKTLHEKNMVEGLPKINC 437

Query: 775  PNHVCEGCIISKQSRLPFPSGKSWRAEAHLQLVHTDICGPIEPVSLGGNRYFITFIDDFS 954
            P  +CE CI+ KQ R  FP GK+WRA+  LQLVH+DICGPI P S G  RYFI FIDD S
Sbjct: 438  PTEICEDCIVGKQHRDSFPHGKAWRAQQILQLVHSDICGPINPTSNGNKRYFIIFIDDHS 497

Query: 955  RKLWVYPLKEKSAAFITFKHFKALVEAESGHKLLILRSDRGVEYTSNAFQEYCREQGIKR 1134
            RK WVY L+EKS AF+ FK FK+ VE ESG  + ILR+DRG E+ S+ F  +C   GI+R
Sbjct: 498  RKTWVYFLQEKSEAFLIFKSFKSRVEKESGKYIQILRTDRGGEFNSHNFASFCELHGIQR 557

Query: 1135 QFT 1143
            Q T
Sbjct: 558  QLT 560


Top