BLASTX nr result

ID: Cheilocostus21_contig00056904 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00056904
         (801 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAA69272.1| lectin receptor kinase, partial [Arabidopsis tha...   199   7e-56
gb|ACN78973.1| copia-type polyprotein [Glycine max] >gi|22501615...   202   1e-55
gb|AAF16534.1|AC013482_8 T26F17.17 [Arabidopsis thaliana]             201   5e-55
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         199   2e-54
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   199   2e-54
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A...   199   3e-54
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   199   3e-54
ref|XP_013583262.1| PREDICTED: LOW QUALITY PROTEIN: copia protei...   197   7e-54
gb|KZV30699.1| hypothetical protein F511_19492 [Dorcoceras hygro...   187   4e-52
ref|XP_020867873.1| uncharacterized protein LOC110224828 [Arabid...   189   3e-51
gb|KZV28520.1| hypothetical protein F511_15600 [Dorcoceras hygro...   183   1e-50
gb|KYP66219.1| Retrovirus-related Pol polyprotein from transposo...   187   3e-50
gb|KYP66220.1| Retrovirus-related Pol polyprotein from transposo...   187   3e-50
gb|KYP69041.1| Retrovirus-related Pol polyprotein from transposo...   187   3e-50
gb|KYP44533.1| Retrovirus-related Pol polyprotein from transposo...   187   3e-50
gb|KZV47435.1| hypothetical protein F511_22511, partial [Dorcoce...   176   2e-46
gb|KYP42300.1| Retrovirus-related Pol polyprotein from transposo...   165   4e-46
gb|AIC77183.1| polyprotein [Gossypium barbadense]                     175   6e-46
gb|PHT36714.1| hypothetical protein CQW23_24414 [Capsicum baccatum]   166   7e-43
gb|PRQ34009.1| putative RNA-directed DNA polymerase [Rosa chinen...   157   4e-42

>emb|CAA69272.1| lectin receptor kinase, partial [Arabidopsis thaliana]
          Length = 623

 Score =  199 bits (505), Expect = 7e-56
 Identities = 111/268 (41%), Positives = 153/268 (57%), Gaps = 1/268 (0%)
 Frame = -1

Query: 801 PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
           P EAW  RKP + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN +T
Sbjct: 177 PQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDT 236

Query: 621 KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGE 445
           K     R   F      +    +  Y+F P  +E++ E T E       P   EP++   
Sbjct: 237 KKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPEPTRE------EPPSEEPTTPPT 290

Query: 444 SASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASK 265
           S +  Q +            R PR R++  +Y+ T    ++T  CL  + EP+++ +A +
Sbjct: 291 SPTSSQIEESSS-------ERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQEAIE 343

Query: 264 DYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKS 85
                           KN T ELT+LP GH  IGVKWVYK KKNS+G++ERYKARL+ K 
Sbjct: 344 KKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKG 403

Query: 84  YKQRQGTNYNEVFAHVAHLDTIRLIISM 1
           Y QR G +Y+E+FA VA L+T+RLIIS+
Sbjct: 404 YSQRAGIDYDEIFAPVARLETVRLIISL 431


>gb|ACN78973.1| copia-type polyprotein [Glycine max]
 gb|ACN78980.1| copia-type polyprotein [Glycine max]
          Length = 1042

 Score =  202 bits (515), Expect = 1e-55
 Identities = 119/271 (43%), Positives = 161/271 (59%), Gaps = 4/271 (1%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW   KP + HL+VF S AYAH+ D    KLD++S K++F+GYD+ +KGYKLYN N 
Sbjct: 371  PQEAWSGVKPRVDHLRVFGSIAYAHVPDQGRFKLDDRSEKHVFIGYDASSKGYKLYNPNN 430

Query: 621  -KSRYKQRC*F**RNRMESARRDRCYDFLPSSDEEEKED-TLEISTP*LSP--QCNEPSS 454
             K+   +   F          ++  YDF P  +E ++E  T   STP LSP    NE SS
Sbjct: 431  GKTIVSRDVEFYEEGTWNWEEKEDTYDFFPYFEEIDEEALTPNDSTPALSPTPSTNEASS 490

Query: 453  SGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFD 274
            S E +S                 R  R RN+  +Y ET  IND+   CL VD++PLN+ +
Sbjct: 491  SSEGSSS---------------ERPRRMRNIQELYDETEVINDL--FCLFVDSKPLNFDE 533

Query: 273  ASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLI 94
            A KD R             KN T EL++LP+GH  IGVKWV+K KKN++G++ER+KARL+
Sbjct: 534  AMKDKRWRQAMEEEIKAIEKNNTWELSSLPKGHEAIGVKWVFKIKKNAKGEVERHKARLV 593

Query: 93   TKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
             K YKQ+   +Y+EVFA VA ++TIRL+IS+
Sbjct: 594  AKGYKQQYEVDYDEVFAPVARMETIRLLISL 624


>gb|AAF16534.1|AC013482_8 T26F17.17 [Arabidopsis thaliana]
          Length = 1291

 Score =  201 bits (511), Expect = 5e-55
 Identities = 113/268 (42%), Positives = 153/268 (57%), Gaps = 1/268 (0%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN +T
Sbjct: 619  PQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDT 678

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGE 445
            K     R   F      +    +  Y+F P  +E+E E T E       P   EP++   
Sbjct: 679  KKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTRE------EPPSEEPTTRPT 732

Query: 444  SASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASK 265
            S +  Q +            R PR R++  +Y+ T    ++T  CL  + EP+++ +A +
Sbjct: 733  SLTSSQIEESSS-------ERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQEAIE 785

Query: 264  DYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKS 85
                            KN T ELT+LP GH  IGVKWVYK KKNS+G++ERYKARL+ K 
Sbjct: 786  KKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKG 845

Query: 84   YKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 846  YSQRAGIDYDEVFAPVARLETVRLIISL 873


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  199 bits (507), Expect = 2e-54
 Identities = 113/268 (42%), Positives = 152/268 (56%), Gaps = 1/268 (0%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN +T
Sbjct: 680  PQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDT 739

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGE 445
            K     R   F      +    +  Y+F P  +E+E E T E       P   EP++   
Sbjct: 740  KKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTRE------EPPSEEPTTPPT 793

Query: 444  SASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASK 265
            S +  Q +            R PR R++  +Y+ T    ++T  CL  + EP+++  A +
Sbjct: 794  SPTSSQIEESSS-------ERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQKAIE 846

Query: 264  DYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKS 85
                            KN T ELT+LP GH  IGVKWVYK KKNS+G++ERYKARL+ K 
Sbjct: 847  KKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKG 906

Query: 84   YKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 907  YSQRVGIDYDEVFAPVARLETVRLIISL 934


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  199 bits (507), Expect = 2e-54
 Identities = 113/268 (42%), Positives = 152/268 (56%), Gaps = 1/268 (0%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN +T
Sbjct: 680  PQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDT 739

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGE 445
            K     R   F      +    +  Y+F P  +E+E E T E       P   EP++   
Sbjct: 740  KKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTRE------EPPSEEPTTPPT 793

Query: 444  SASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASK 265
            S +  Q +            R PR R++  +Y+ T    ++T  CL  + EP+++  A +
Sbjct: 794  SPTSSQIEESSS-------ERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQKAIE 846

Query: 264  DYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKS 85
                            KN T ELT+LP GH  IGVKWVYK KKNS+G++ERYKARL+ K 
Sbjct: 847  KKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKG 906

Query: 84   YKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 907  YSQRVGIDYDEVFAPVARLETVRLIISL 934


>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1272

 Score =  199 bits (505), Expect = 3e-54
 Identities = 111/268 (41%), Positives = 153/268 (57%), Gaps = 1/268 (0%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN +T
Sbjct: 680  PQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDT 739

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGE 445
            K     R   F      +    +  Y+F P  +E++ E T E       P   EP++   
Sbjct: 740  KKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPEPTRE------EPPSEEPTTPPT 793

Query: 444  SASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASK 265
            S +  Q +            R PR R++  +Y+ T    ++T  CL  + EP+++ +A +
Sbjct: 794  SPTSSQIEESSS-------ERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQEAIE 846

Query: 264  DYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKS 85
                            KN T ELT+LP GH  IGVKWVYK KKNS+G++ERYKARL+ K 
Sbjct: 847  KKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKG 906

Query: 84   YKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            Y QR G +Y+E+FA VA L+T+RLIIS+
Sbjct: 907  YSQRAGIDYDEIFAPVARLETVRLIISL 934


>gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  199 bits (505), Expect = 3e-54
 Identities = 113/268 (42%), Positives = 153/268 (57%), Gaps = 1/268 (0%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RK  + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN +T
Sbjct: 680  PQEAWSGRKSGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDT 739

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGE 445
            K     R   F      +    +  Y+F P  +E+E E T E       P   EP++   
Sbjct: 740  KKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTRE------EPPSEEPTTPPT 793

Query: 444  SASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASK 265
            S +  Q +            R PR R++  +Y+ T    ++T  CL  + EP+++ +A +
Sbjct: 794  SPTSSQIEESSS-------ERTPRFRSIQELYEVTENQENLTLFCLFAECEPMDFQEAIE 846

Query: 264  DYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKS 85
                            KN T ELT+LP GH TIGVKWVYK KKNS+G++ERYKARL+ K 
Sbjct: 847  KKTWRNAMDEEIKSIQKNDTWELTSLPNGHKTIGVKWVYKAKKNSKGEVERYKARLVAKG 906

Query: 84   YKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 907  YIQRAGIDYDEVFAPVARLETVRLIISL 934


>ref|XP_013583262.1| PREDICTED: LOW QUALITY PROTEIN: copia protein [Brassica oleracea var.
            oleracea]
          Length = 1184

 Score =  197 bits (502), Expect = 7e-54
 Identities = 117/269 (43%), Positives = 154/269 (57%), Gaps = 2/269 (0%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EA   RKP + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN  T
Sbjct: 569  PQEAXSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPET 628

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEI-STP*LSPQCNEPSSSG 448
            K     R   F      +    +  Y+F PS +E+  E   E  +TP  SP     SS G
Sbjct: 629  KKTIINRNVIFDEEGEWDWRSNNEDYNFFPSFEEDNVEQPREEPTTPPTSPTT---SSQG 685

Query: 447  ESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDAS 268
            + +S                 R PR R+L +IY+ T   +++T  CL  D EP+N+ +A 
Sbjct: 686  DESSS---------------ERTPRFRSLQDIYEVTENQDNLTLFCLFADCEPMNFEEAK 730

Query: 267  KDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITK 88
            +                KN T +L +LP GH  IGVKWVYK KKNS+G++ERYKARL+ K
Sbjct: 731  EKKSXRSAMDEEIKSIQKNDTWKLASLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAK 790

Query: 87   SYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
             Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 791  CYSQRAGIDYDEVFAPVARLETVRLIISL 819


>gb|KZV30699.1| hypothetical protein F511_19492 [Dorcoceras hygrometricum]
          Length = 536

 Score =  187 bits (475), Expect = 4e-52
 Identities = 106/275 (38%), Positives = 157/275 (57%), Gaps = 8/275 (2%)
 Frame = -1

Query: 801 PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
           P EAW  + P + HL++F S AYA + + +  KLD++S K +F+GY+  +KGYKL++ ++
Sbjct: 67  PQEAWSGQTPGVHHLRIFGSIAYAQVPEQERSKLDDRSRKLVFIGYNENSKGYKLFSPDS 126

Query: 621 KS-------RYKQRC*F**RNRMESARRDRCYDFLPSSDEE-EKEDTLEISTP*LSPQCN 466
           +         + +   +  R++ E+      YD  P  DEE + E  +E   P   P   
Sbjct: 127 RRIVISRDVEFDEDATWNWRSKTENDS----YDIYPYFDEETDMEQEVEQQDPTPPPSSG 182

Query: 465 EPSSSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPL 286
             ++ G S+                  + P+ R+LA+IY ET  I+ +   CLL DAEPL
Sbjct: 183 LSNTPGSSSGE----------------KTPKYRSLADIYNETQAIDGMNLFCLLADAEPL 226

Query: 285 NYFDASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYK 106
           ++ +A KD +             KN T ELT+LP+ H  IGVKWVYK KKN+ G++ERYK
Sbjct: 227 SFDEAEKDEKWRRAMDEEIHAIVKNDTWELTSLPKNHQVIGVKWVYKAKKNANGEVERYK 286

Query: 105 ARLITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
           ARL+ K YKQ+ G +Y+EVFA VA L+TIRL+IS+
Sbjct: 287 ARLVAKGYKQKHGVDYDEVFAPVARLETIRLLISL 321


>ref|XP_020867873.1| uncharacterized protein LOC110224828 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 961

 Score =  189 bits (481), Expect = 3e-51
 Identities = 113/274 (41%), Positives = 152/274 (55%), Gaps = 7/274 (2%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP + HL+VF S A+AH+ D    KLD+KS KYIF+GYD+ +KGYKLYN +T
Sbjct: 679  PQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDT 738

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTL------EISTP*LSPQCNE 463
            K     R   F      +    +  Y+F P  +E++ E T       E +TP  SP    
Sbjct: 739  KKTIISRNVVFDEEEEWDWKSNEDDYNFFPHFEEDDSELTRDEPPREEPTTPPTSPT--- 795

Query: 462  PSSSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLN 283
             SS GE +S                 R    R+L  +Y+ T   +++T  CL  + EP++
Sbjct: 796  -SSQGEESSS---------------ERTLHFRSLQELYEVTENQDNLTLFCLFAECEPMD 839

Query: 282  YFDASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKA 103
            + +A +                KN T EL +LP GH  IGVKWVYK KKNS+G++ERYKA
Sbjct: 840  FQEAIEKKTWRNAMDEEIKAIKKNDTWELASLPNGHKAIGVKWVYKAKKNSKGEVERYKA 899

Query: 102  RLITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            RL+ K Y QR   +Y+EVFA VA L+T+RLIIS+
Sbjct: 900  RLVAKGYSQRARIDYDEVFAPVARLETVRLIISL 933


>gb|KZV28520.1| hypothetical protein F511_15600 [Dorcoceras hygrometricum]
          Length = 539

 Score =  183 bits (465), Expect = 1e-50
 Identities = 103/275 (37%), Positives = 155/275 (56%), Gaps = 8/275 (2%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P E W  + P + HL++F S AYA + + +  KLD++S K +F+GY+  +KGYKL++ ++
Sbjct: 242  PQETWSGQTPGVHHLRIFGSIAYAQVPEQERSKLDDRSRKLVFIGYNENSKGYKLFSPDS 301

Query: 621  KS-------RYKQRC*F**RNRMESARRDRCYDFLPSSDEE-EKEDTLEISTP*LSPQCN 466
            +         + +   +  R++ E+      YD  P  DEE + E  +E   P   P   
Sbjct: 302  RRIVISRDVEFDEDATWNWRSKTENDS----YDIFPYFDEETDMEQEVEQQDPTPPPSSG 357

Query: 465  EPSSSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPL 286
              ++ G S+                  + P+ R+LA+IY ET  I+ +   CLL DAEPL
Sbjct: 358  LSNTPGSSSGE----------------KTPKYRSLADIYNETQAIDGMNLFCLLADAEPL 401

Query: 285  NYFDASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYK 106
            ++ +A KD +             KN T ELT+LP+ H  IGVKW+YK KKN+ G++ERYK
Sbjct: 402  SFDEAEKDEKWRRAMDEEIHAIVKNDTWELTSLPKNHQVIGVKWMYKAKKNANGEVERYK 461

Query: 105  ARLITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
             RL+ K YKQ+ G +Y+EVFA VA L+TIRL+IS+
Sbjct: 462  TRLVAKGYKQKHGVDYDEVFAPVARLETIRLLISL 496


>gb|KYP66219.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1033

 Score =  187 bits (475), Expect = 3e-50
 Identities = 110/272 (40%), Positives = 153/272 (56%), Gaps = 5/272 (1%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP I HL+VF S A+ H+ D    KLD+KS KYIF+GYD+ +KGYKLYN ++
Sbjct: 612  PQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPDS 671

Query: 621  -KSRYKQRC*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEI----STP*LSPQCNEPS 457
             K+   +   F      + +     + F P  +E++ E   +     +TP  SP  N   
Sbjct: 672  RKTIISRNVVFDEEGEWDWSTNCEDHTFFPCVEEDDVEQQQQPQETPTTPPTSP--NTTL 729

Query: 456  SSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYF 277
               ES+S                 R PR R+L  IY+ T  ++++T  CL  D EP+N+ 
Sbjct: 730  QDYESSSE----------------RMPRFRSLQEIYEATENLDNVTLFCLFADCEPMNFQ 773

Query: 276  DASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARL 97
            +A                  KN T EL +LP+ H+ IGVKWVYK KK+S+G+++RYKARL
Sbjct: 774  EAIGKKSWRNAMDEEIEAIKKNDTWELVSLPKEHTAIGVKWVYKAKKDSKGEVQRYKARL 833

Query: 96   ITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            + K Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 834  VAKGYSQRAGIDYDEVFAPVARLETVRLIISL 865


>gb|KYP66220.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1331

 Score =  187 bits (475), Expect = 3e-50
 Identities = 110/272 (40%), Positives = 153/272 (56%), Gaps = 5/272 (1%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP I HL+VF S A+ H+ D    KLD+KS KYIF+GYD+ +KGYKLYN ++
Sbjct: 667  PQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPDS 726

Query: 621  -KSRYKQRC*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEI----STP*LSPQCNEPS 457
             K+   +   F      + +     + F P  +E++ E   +     +TP  SP  N   
Sbjct: 727  RKTIISRNVVFDEEGEWDWSTNCEDHTFFPCVEEDDVEQQQQPQETPTTPPTSP--NTTL 784

Query: 456  SSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYF 277
               ES+S                 R PR R+L  IY+ T  ++++T  CL  D EP+N+ 
Sbjct: 785  QDYESSSE----------------RMPRFRSLQEIYEATENLDNVTLFCLFADCEPMNFQ 828

Query: 276  DASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARL 97
            +A                  KN T EL +LP+ H+ IGVKWVYK KK+S+G+++RYKARL
Sbjct: 829  EAIGKKSWRNAMDEEIEAIKKNDTWELVSLPKEHTAIGVKWVYKAKKDSKGEVQRYKARL 888

Query: 96   ITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            + K Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 889  VAKGYSQRAGIDYDEVFAPVARLETVRLIISL 920


>gb|KYP69041.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1342

 Score =  187 bits (475), Expect = 3e-50
 Identities = 110/272 (40%), Positives = 153/272 (56%), Gaps = 5/272 (1%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP I HL+VF S A+ H+ D    KLD+KS KYIF+GYD+ +KGYKLYN ++
Sbjct: 678  PQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPDS 737

Query: 621  -KSRYKQRC*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEI----STP*LSPQCNEPS 457
             K+   +   F      + +     + F P  +E++ E   +     +TP  SP  N   
Sbjct: 738  RKTIISRNVVFDEEGEWDWSTNCEDHTFFPCVEEDDVEQQQQPQETPTTPPTSP--NTTL 795

Query: 456  SSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYF 277
               ES+S                 R PR R+L  IY+ T  ++++T  CL  D EP+N+ 
Sbjct: 796  QDYESSSE----------------RMPRFRSLQEIYEATENLDNVTLFCLFADCEPMNFQ 839

Query: 276  DASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARL 97
            +A                  KN T EL +LP+ H+ IGVKWVYK KK+S+G+++RYKARL
Sbjct: 840  EAIGKKSWRNAMDEEIEAIKKNDTWELVSLPKEHTAIGVKWVYKAKKDSKGEVQRYKARL 899

Query: 96   ITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            + K Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 900  VAKGYSQRAGIDYDEVFAPVARLETVRLIISL 931


>gb|KYP44533.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1342

 Score =  187 bits (475), Expect = 3e-50
 Identities = 110/272 (40%), Positives = 153/272 (56%), Gaps = 5/272 (1%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  RKP I HL+VF S A+ H+ D    KLD+KS KYIF+GYD+ +KGYKLYN ++
Sbjct: 678  PQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPDS 737

Query: 621  -KSRYKQRC*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEI----STP*LSPQCNEPS 457
             K+   +   F      + +     + F P  +E++ E   +     +TP  SP  N   
Sbjct: 738  RKTIISRNVVFDEEGEWDWSTNCEDHTFFPCVEEDDVEQQQQPQETPTTPPTSP--NTTL 795

Query: 456  SSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYF 277
               ES+S                 R PR R+L  IY+ T  ++++T  CL  D EP+N+ 
Sbjct: 796  QDYESSSE----------------RMPRFRSLQEIYEATENLDNVTLFCLFADCEPMNFQ 839

Query: 276  DASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARL 97
            +A                  KN T EL +LP+ H+ IGVKWVYK KK+S+G+++RYKARL
Sbjct: 840  EAIGKKSWRNAMDEEIEAIKKNDTWELVSLPKEHTAIGVKWVYKAKKDSKGEVQRYKARL 899

Query: 96   ITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            + K Y QR G +Y+EVFA VA L+T+RLIIS+
Sbjct: 900  VAKGYSQRAGIDYDEVFAPVARLETVRLIISL 931


>gb|KZV47435.1| hypothetical protein F511_22511, partial [Dorcoceras hygrometricum]
          Length = 881

 Score =  176 bits (445), Expect = 2e-46
 Identities = 100/260 (38%), Positives = 145/260 (55%), Gaps = 1/260 (0%)
 Frame = -1

Query: 777  KPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT-KSRYKQR 601
            KP++ HL+VF S AYAH+ D    KLD+KS +Y+F+GYD+ +K YKLYN N  K    + 
Sbjct: 637  KPNVAHLRVFGSIAYAHVPDEKRTKLDDKSARYVFIGYDTNSKCYKLYNPNNGKIILSRD 696

Query: 600  C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGESASR*QND 421
              F   +  +    +  Y + P  D++E+E T   + P   P  ++   S          
Sbjct: 697  VEFDEESAWDWNVSNETYSYSPFFDDQEEESTHPTTPPPSPPPQDDQDGSSSQP------ 750

Query: 420  XXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASKDYRXXXXX 241
                       RR    R L    +E   +++ T  CLL + EP+++ DA  D +     
Sbjct: 751  -----------RRFRSLRELYKTTEEVQNLSEFTQFCLLAETEPVSFEDAVYDEKWKHAM 799

Query: 240  XXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKSYKQRQGTN 61
                    KN T EL +LP+G S+IGVKW+YK K+N++G+IE+YKARL+ K YKQ+ G +
Sbjct: 800  DGEIKAIRKNDTWELASLPKGKSSIGVKWMYKIKRNAKGEIEKYKARLVAKGYKQKVGID 859

Query: 60   YNEVFAHVAHLDTIRLIISM 1
            Y+EVFA VA L+TIRLIIS+
Sbjct: 860  YDEVFAPVARLETIRLIISL 879


>gb|KYP42300.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 275

 Score =  165 bits (417), Expect = 4e-46
 Identities = 98/267 (36%), Positives = 144/267 (53%)
 Frame = -1

Query: 801 PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
           P E W   KPS+ HL+VF S AY  +      KL+++S KYIF+GYD ++K YKL++ + 
Sbjct: 40  PQEIWSGMKPSVSHLRVFGSLAYGQVPRQHRTKLEDRSKKYIFIGYDEKSKAYKLFDPDN 99

Query: 621 KSRYKQRC*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGES 442
           K     R      +      +  C++   S++ E   D +  ST   +   +E S     
Sbjct: 100 KKVVVSR------DVHVEETKQWCWN--NSAEVETSSDIVVPSTTTTTEFSDEESEP--- 148

Query: 441 ASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRINDITYICLLVDAEPLNYFDASKD 262
                              + PR R+L  IY  T   N++  +CLL D+E L++  A +D
Sbjct: 149 -------------------QQPRMRSLREIYDTT---NEVHVVCLLADSEDLSFEKAVQD 186

Query: 261 YRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLITKSY 82
            +             +NKT ELTNLPEG   IGVKWVYK K N++G++ERYKARL+ K Y
Sbjct: 187 EKWRTTMDEEFGAIERNKTWELTNLPEGARPIGVKWVYKKKMNAEGEVERYKARLVVKGY 246

Query: 81  KQRQGTNYNEVFAHVAHLDTIRLIISM 1
           KQ++G +Y+EVFA V  +++IRL+IS+
Sbjct: 247 KQKEGIDYDEVFAPVTRMESIRLLISL 273


>gb|AIC77183.1| polyprotein [Gossypium barbadense]
          Length = 1369

 Score =  175 bits (443), Expect = 6e-46
 Identities = 108/279 (38%), Positives = 147/279 (52%), Gaps = 12/279 (4%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW   KP + HLK+F   AYAH+ +    KLD++  K IF+GYD R+K Y+LYN  T
Sbjct: 697  PEEAWSGHKPRVGHLKIFGCIAYAHVPEQQRKKLDDRGEKCIFIGYDKRSKAYRLYNPLT 756

Query: 621  KSRYKQRC*F**RNRMESARRDRCYDFLPSSDEEEKEDTL---------EISTP*LSPQC 469
            K     R              D   D+   S+EE+K + L         E      SP  
Sbjct: 757  KKLIISR----------DVEFDEA-DYWRWSEEEKKVEGLFFNEDDNNQEEQGDDQSPGT 805

Query: 468  NEPSSSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRIN---DITYICLLVD 298
              PSS   S+     D               RTR+L +IY  T  +    D +  CL+ +
Sbjct: 806  TAPSSPTSSSGSSSLDEAPT-----------RTRSLNDIYNSTEPVETQFDYSLFCLMTE 854

Query: 297  AEPLNYFDASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDI 118
             +P+ Y +A ++ +             +N T ELT+LPEGHS IGVKWVYKTK N +G +
Sbjct: 855  CDPVTYEEAIENNKWKKAMDEEIAAIRRNDTWELTSLPEGHSPIGVKWVYKTKTNKEGKV 914

Query: 117  ERYKARLITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            E+YKARL+ K YKQRQG +Y+E+FA VA +DTIRL+I++
Sbjct: 915  EKYKARLVAKGYKQRQGVDYDEIFAPVARIDTIRLLIAV 953


>gb|PHT36714.1| hypothetical protein CQW23_24414 [Capsicum baccatum]
          Length = 1427

 Score =  166 bits (420), Expect = 7e-43
 Identities = 97/276 (35%), Positives = 151/276 (54%), Gaps = 9/276 (3%)
 Frame = -1

Query: 801  PLEAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNT 622
            P EAW  +KP + HLK+F   AY+H+ +    KLD++  K IF+GYD R+K Y+ YN  T
Sbjct: 996  PNEAWSGQKPGVGHLKIFGCIAYSHVPEQLRKKLDDRGEKCIFIGYDERSKAYRFYNPLT 1055

Query: 621  KSRYKQR-C*F**RNRMESARRDRCYDFLPSSDEEEKEDTLEIS-----TP*LSPQCNEP 460
            K     R   F   +    +  ++  + L  SDEE+ +  ++       +P  S     P
Sbjct: 1056 KKVIISRDVEFDEADYWRWSEEEKKVEGLFFSDEEDDDFVIQNEEGDGQSPPESSGATNP 1115

Query: 459  SSSGESASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRIN---DITYICLLVDAEP 289
            S+S   +S   +D               + R+L  IY++T  I    D +  CL+ + +P
Sbjct: 1116 STSASPSSSSSSDAPT------------KMRSLHEIYEDTEPIETTFDYSLFCLMAECDP 1163

Query: 288  LNYFDASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERY 109
            + Y +A+ D +             +N T ELT++PEGH+ IGVKWVYKTK N +G +++Y
Sbjct: 1164 VTYEEANVDVKWKKAMDEEIAAIRRNDTWELTSMPEGHNPIGVKWVYKTKTNKEGKVDKY 1223

Query: 108  KARLITKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            KARL+ K YK++ G +Y+EVFA VA +DT+RL+ ++
Sbjct: 1224 KARLVAKGYKKKYGVDYDEVFAPVARIDTVRLLTAL 1259


>gb|PRQ34009.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 366

 Score =  157 bits (396), Expect = 4e-42
 Identities = 99/271 (36%), Positives = 137/271 (50%), Gaps = 6/271 (2%)
 Frame = -1

Query: 795 EAWCRRKPSILHLKVFDSTAYAHILDNDCIKLDNKSGKYIFVGYDSRTKGYKLYNRNTKS 616
           E W   KP+I H++VF   A+AH+ D+   KLD+K+ K IF+GY + TKGYKLYN  TK 
Sbjct: 44  EVWSGDKPNIQHMRVFGCIAFAHVPDHIRKKLDDKADKCIFIGYSTVTKGYKLYNPKTKK 103

Query: 615 RYKQR-C*F**RNRME-SARRDRCYDFLPSSDEEEKEDTLEISTP*LSPQCNEPSSSGES 442
               R   F  ++  +  ++  R    +P  D+   +D           Q N P +   S
Sbjct: 104 VIMSRDVTFDEQSAWDWCSKEKRPATLIPLEDDLSDDDQ----------QVNNPETQSPS 153

Query: 441 ASR*QNDXXXXXXXXXXSRRHPRTRNLANIYQETVRIN----DITYICLLVDAEPLNYFD 274
               + +           R HP    L +    T R +    DI    L  D +PL + +
Sbjct: 154 NQVPEAESPLEVASTRPQREHPLPPYLKDYKLNTTRRSISDEDIVNFALYADCDPLTFNE 213

Query: 273 ASKDYRXXXXXXXXXXXXXKNKT*ELTNLPEGHSTIGVKWVYKTKKNSQGDIERYKARLI 94
           A    +             KN T ELT+LPEG + IGVKWVYKTK    GD++R+K RL+
Sbjct: 214 ACHQQQWVKAMDDEIHAIEKNDTWELTSLPEGKTAIGVKWVYKTKYKQNGDVDRFKERLV 273

Query: 93  TKSYKQRQGTNYNEVFAHVAHLDTIRLIISM 1
            KSYKQR G +Y EVFA V  LDT+R++IS+
Sbjct: 274 AKSYKQRPGIDYLEVFAPVVRLDTVRMVISL 304