BLASTX nr result

ID: Rehmannia31_contig00018239 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00018239
         (804 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamu...   224   6e-67
ref|XP_012849888.1| PREDICTED: retrovirus-related Pol polyprotei...   183   3e-50
gb|AAK29467.1| polyprotein-like [Solanum chilense]                    154   7e-39
gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposo...   148   1e-38
gb|PKI48613.1| hypothetical protein CRG98_031032 [Punica granatum]    148   4e-38
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...   152   6e-38
gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposo...   149   4e-37
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi...   149   4e-37
gb|OTG02614.1| putative zinc finger, CCHC-type, Ribonuclease H-l...   149   4e-37
gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygro...   146   6e-36
gb|KYP50611.1| Retrovirus-related Pol polyprotein from transposo...   142   1e-35
gb|KYP48283.1| Retrovirus-related Pol polyprotein from transposo...   139   1e-34
gb|OAE31341.1| hypothetical protein AXG93_4510s1170 [Marchantia ...   137   1e-34
gb|KYP46254.1| Retrovirus-related Pol polyprotein from transposo...   141   1e-34
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-...   141   2e-34
gb|ACL97387.1| Gag-Pol polyprotein [Lotus japonicus]                  140   4e-34
gb|ABA98804.1| retrotransposon protein, putative, Ty1-copia subc...   140   4e-34
gb|ABA98656.1| retrotransposon protein, putative, Ty1-copia subc...   140   4e-34
gb|KYP34487.1| Retrovirus-related Pol polyprotein from transposo...   137   9e-34
gb|PNX74094.1| copia LTR rider, partial [Trifolium pratense]          139   1e-33

>ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamum indicum]
          Length = 472

 Score =  224 bits (571), Expect = 6e-67
 Identities = 124/271 (45%), Positives = 173/271 (63%), Gaps = 8/271 (2%)
 Frame = +3

Query: 6   EVMHVRGRSQY-----RFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGH 170
           E+  VRGR+++     R++++ +     N++ +                  CYNCG  GH
Sbjct: 193 EINSVRGRTRFGNFNSRYNSRSRSKTKTNRSKSRPRETNLRDDKIRDRR--CYNCGTKGH 250

Query: 171 YIREC--PNKKGNQNQSNDQANLASTS-ENAGDIFMVTGICDVHIVNSVHSSTVCENEWL 341
           YI++C  P ++      +D+  +++ S E+ G++F+V      +  NSV  ST   +EWL
Sbjct: 251 YIKDCRKPRRENRDRNYDDKEKVSNVSIESNGEVFVV------YEANSV--STFDMHEWL 302

Query: 342 IDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRH 521
           IDS CTFHMSPFK++F+N K     FVSMANEKKC + G+GDI L FD GY   LKNVR+
Sbjct: 303 IDSGCTFHMSPFKDIFTNLKYEHAGFVSMANEKKCEIKGLGDISLCFD-GYKMLLKNVRY 361

Query: 522 VPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNS 701
           VPDL +NL+SCAALE++GLEGR+G GLMKI+KGSLV+FKA ++ NLY+C     S+   +
Sbjct: 362 VPDLSHNLISCAALEENGLEGRWGKGLMKIMKGSLVVFKAERKRNLYIC---TASYDNIA 418

Query: 702 VNVVQEDKTDLWHKRLGHMSSKGLEILHKAG 794
            +V   D T LWHKRLGH+S KGL+ L + G
Sbjct: 419 ASVSVCDLTSLWHKRLGHISQKGLDFLKRDG 449


>ref|XP_012849888.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT
           1-94 [Erythranthe guttata]
          Length = 598

 Score =  183 bits (465), Expect = 3e-50
 Identities = 93/169 (55%), Positives = 122/169 (72%)
 Frame = +3

Query: 297 VNSVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICL 476
           VNSV +S + ENEWL+DSAC++HM+P + +FS+Y +MKN  V++A+     V GIG +CL
Sbjct: 5   VNSVLAS-LSENEWLLDSACSYHMTPRREVFSDYVQMKNCGVTLADGTMIVVNGIGTVCL 63

Query: 477 KFDSGYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNN 656
           KF SG   TLKNVRHVP L +NL+SCA LEDDG  G +G+G M I+KGS  +FKA++  N
Sbjct: 64  KFVSGSVLTLKNVRHVPTLSHNLISCAVLEDDGFRGDWGDGCMNIMKGSRYLFKALRMGN 123

Query: 657 LYVCIGKPVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803
           +YVC  +    +  S+NVVQ D ++LWHK LGHMS+K L ILHK   FG
Sbjct: 124 MYVCSAE----SSASMNVVQNDLSELWHKGLGHMSNKWLSILHKNQYFG 168


>gb|AAK29467.1| polyprotein-like [Solanum chilense]
          Length = 1328

 Score =  154 bits (390), Expect = 7e-39
 Identities = 85/220 (38%), Positives = 125/220 (56%), Gaps = 5/220 (2%)
 Frame = +3

Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTS---ENAGDIFMVTGICD--VHIVNSV 308
           CYNC + GH+ R+CPN K  + +S+ Q N  +T+   +N  D+ ++    +  +H+  + 
Sbjct: 233 CYNCDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGT- 291

Query: 309 HSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDS 488
                 E+EW++D+A ++H +P ++LF  Y       V M N     + GIGDIC K + 
Sbjct: 292 ------ESEWVVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNV 345

Query: 489 GYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVC 668
           G    LK+VRHVPDL  NL+S  AL+ DG E  F N   ++ KG+LVI K V R  LY  
Sbjct: 346 GCTLVLKDVRHVPDLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRT 405

Query: 669 IGKPVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHK 788
             +      N+ +  +E+  DLWHKR+GH S KGL+IL K
Sbjct: 406 NAEICQGELNAAH--EENSADLWHKRMGHTSEKGLQILSK 443


>gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Apostasia shenzhenica]
          Length = 365

 Score =  148 bits (373), Expect = 1e-38
 Identities = 89/257 (34%), Positives = 128/257 (49%), Gaps = 2/257 (0%)
 Frame = +3

Query: 21  RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKKG 200
           RGRS+ +F NQ ++ +   +N N                  CY C + GH+ R+CP K  
Sbjct: 91  RGRSKNKFGNQYRYRSISKENDNR-----------------CYYCKKEGHWKRDCPKKSK 133

Query: 201 NQNQ--SNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTFHMSP 374
            Q Q  S ++A++AS  E   +      +C    ++S  S       W++DS C++HM P
Sbjct: 134 QQQQKKSGEEASVASRLEKDSET-----LCTFSCMDSSDS-------WILDSDCSYHMCP 181

Query: 375 FKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNNLMSC 554
           F++ FS Y       V M N  +C   GIG I +K   G   TL  VRHVPDL   L+S 
Sbjct: 182 FRDWFSTYSIHDGGRVIMGNNSECKSVGIGTIKIKMFDGVIRTLTEVRHVPDLRKGLISL 241

Query: 555 AALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQEDKTDL 734
             L+  G      +G++K+ KG+ V+ K  K  +LY  IGK ++      +   +D T L
Sbjct: 242 GTLDASGCTFIGSDGIIKVKKGAPVVMKGEKIESLYRLIGKTITGDIAVTSSTDDDDTML 301

Query: 735 WHKRLGHMSSKGLEILH 785
           WH RLGHMS +GL  LH
Sbjct: 302 WHARLGHMSERGLLELH 318


>gb|PKI48613.1| hypothetical protein CRG98_031032 [Punica granatum]
          Length = 435

 Score =  148 bits (373), Expect = 4e-38
 Identities = 88/260 (33%), Positives = 132/260 (50%), Gaps = 2/260 (0%)
 Frame = +3

Query: 9   VMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECP 188
           V   RGRSQ +        N  +++ N                  CY+ G+ GHY REC 
Sbjct: 84  VTEQRGRSQSK--------NSSSKHGNSGDKSRGRSKSKTRKVVTCYHYGKEGHYKRECR 135

Query: 189 NKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTF 362
             K NQN + +      T+  A  G+ ++V   CD   VN     T  ++ W+ D+  +F
Sbjct: 136 ALKKNQNGNGESKKEEGTTTVASDGETYIV---CDEAYVNF----TCQDSTWVADTGVSF 188

Query: 363 HMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNN 542
           H++P ++ FS+Y      +V M N + C + GIGD+CL+ + G    LK VRHVP++  N
Sbjct: 189 HVTPHRDFFSSYTTGDYGYVRMGNGQSCKIVGIGDVCLETELGCKLLLKKVRHVPEIRLN 248

Query: 543 LMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQED 722
           L+S   L D+G    F NG  K+ KGSL++ +  K + LY    +  S     VNV ++ 
Sbjct: 249 LISMGQLNDEGYSNEFSNGRWKLSKGSLIVARGQKTDTLYRLRARHNS---GQVNVAEDY 305

Query: 723 KTDLWHKRLGHMSSKGLEIL 782
              LWH+RL H+S KG++IL
Sbjct: 306 SIKLWHRRLRHISEKGIQIL 325


>sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon
           TNT 1-94; Includes: RecName: Full=Protease; Includes:
           RecName: Full=Reverse transcriptase; Includes: RecName:
           Full=Endonuclease
 emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
          Length = 1328

 Score =  152 bits (383), Expect = 6e-38
 Identities = 86/227 (37%), Positives = 125/227 (55%), Gaps = 12/227 (5%)
 Frame = +3

Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTV 323
           CYNC + GH+ R+CPN +  + +++ Q N  +T+    +        + ++V  ++    
Sbjct: 232 CYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQN--------NDNVVLFINEEEE 283

Query: 324 C------ENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFD 485
           C      E+EW++D+A + H +P ++LF  Y       V M N     + GIGDIC+K +
Sbjct: 284 CMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTN 343

Query: 486 SGYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLY- 662
            G    LK+VRHVPDL  NL+S  AL+ DG E  F N   ++ KGSLVI K V R  LY 
Sbjct: 344 VGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYR 403

Query: 663 ----VCIGKPVSFAGNSVNVVQED-KTDLWHKRLGHMSSKGLEILHK 788
               +C G+        +N  Q++   DLWHKR+GHMS KGL+IL K
Sbjct: 404 TNAEICQGE--------LNAAQDEISVDLWHKRMGHMSEKGLQILAK 442


>gb|KYP71220.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 690

 Score =  149 bits (375), Expect = 4e-37
 Identities = 92/270 (34%), Positives = 136/270 (50%), Gaps = 3/270 (1%)
 Frame = +3

Query: 3   GEVMHVRGRSQYR--FDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYI 176
           GE + VRGR+Q +    N++     + + SN                  C  C + GH I
Sbjct: 197 GEGLSVRGRTQEKGSTSNKKSRSKSRGRKSNKT----------------CRYCKKFGHDI 240

Query: 177 RECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSAC 356
            +C   K  Q +     N A  +    D        D  ++ SV S    + EW++DS C
Sbjct: 241 SDCFILKKKQERQEKGKNPAEAANVETD-------SDGDVMISVSSDKRSKTEWILDSGC 293

Query: 357 TFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLC 536
           TFHM P+K+LF+  + + +  V M N+ +C + GIG I +K   G   TL NVR +PDL 
Sbjct: 294 TFHMCPYKDLFTTLEPVDSGVVLMGNDTQCKIAGIGTIQIKTHDGTIKTLSNVRFIPDLK 353

Query: 537 NNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVS-FAGNSVNVV 713
            NL+S   LE  G +     G++K+ KG++V+ KA +  +LY+  G  V+  A  S ++ 
Sbjct: 354 RNLISLGTLESLGCKYSAEGGVLKVSKGAIVLLKANRIGSLYILQGSIVTGSAAVSSSMS 413

Query: 714 QEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803
            +D T LWH RLGHMS KG+ +L K G  G
Sbjct: 414 DKDATKLWHMRLGHMSEKGMHLLSKQGLLG 443


>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1335

 Score =  149 bits (377), Expect = 4e-37
 Identities = 81/224 (36%), Positives = 129/224 (57%), Gaps = 6/224 (2%)
 Frame = +3

Query: 144 CYNCGEVGHYIREC-----PNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSV 308
           C+ CG+ GH+ ++C      NK   Q   N +++LA ++E      ++    +  +V   
Sbjct: 221 CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEAFNPAMVLLATDETLVVTDS 280

Query: 309 HSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDS 488
            +     NEW++D+ C+FHM+P K+ F ++KE+ + +V M N+    V GIG I ++   
Sbjct: 281 IA-----NEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKGIGSIKIRNSD 335

Query: 489 GYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVC 668
           G    L +VR++P++  NL+S   LED G   +  +G++KI+KG   I K  KR+ LY+ 
Sbjct: 336 GSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDGILKIVKGCSTILKGQKRDTLYIL 395

Query: 669 IGKPVSFAGNSVNVVQ-EDKTDLWHKRLGHMSSKGLEILHKAGC 797
            G  V+  G S +  + +D+T LWH RLGHMS KG+EIL K GC
Sbjct: 396 DG--VTEEGESHSSAEVKDETALWHSRLGHMSQKGMEILVKKGC 437


>gb|OTG02614.1| putative zinc finger, CCHC-type, Ribonuclease H-like domain,
            GAG-pre-integrase domain protein [Helianthus annuus]
          Length = 702

 Score =  149 bits (375), Expect = 4e-37
 Identities = 79/226 (34%), Positives = 121/226 (53%), Gaps = 11/226 (4%)
 Frame = +3

Query: 144  CYNCGEVGHYIRECP-----------NKKGNQNQSNDQANLASTSENAGDIFMVTGICDV 290
            C++CG  GH I+ C            N K N ++ +D  N  +    A + F +   CD 
Sbjct: 365  CHHCGRKGHTIKFCRQLKKEKKKADYNNKKNNHKKDDGGNDTAEVNTATEEFFIC--CDD 422

Query: 291  HIVNSVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDI 470
             +VN        ++ W++DS  T H++  ++ FS+Y       V M N     + G+GD+
Sbjct: 423  DVVNITRD----DSSWVVDSGATCHVTSQRDFFSSYTPGDFGVVKMGNNGLSKIIGVGDV 478

Query: 471  CLKFDSGYAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKR 650
            CLKFD+G    L NV+HV D+  NL+S   L+DDG    FG+G+ K+ +GSL++ +  + 
Sbjct: 479  CLKFDTGMELVLHNVKHVSDIRLNLISAGLLDDDGYHSTFGDGVWKLTRGSLIVARGKRS 538

Query: 651  NNLYVCIGKPVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHK 788
            + LY  +  P     +  ++V  D T+LWHKRLGHMS KG+ IL K
Sbjct: 539  SKLY--MAHPKISTDSVHSLVDNDMTELWHKRLGHMSEKGMHILLK 582


>gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygrometricum]
          Length = 1309

 Score =  146 bits (368), Expect = 6e-36
 Identities = 85/270 (31%), Positives = 135/270 (50%), Gaps = 3/270 (1%)
 Frame = +3

Query: 3   GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182
           GE ++VRGR+  R    +K   H++Q+                    C+ C + GH+ ++
Sbjct: 195 GEGLNVRGRTYKRESRNEKGGKHRSQSRTRGKLK-------------CFVCHKEGHFKKD 241

Query: 183 CPNKKG-NQNQSNDQANLASTSEN--AGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSA 353
           CP+++  N  +  D  + A  S+   + ++ +V             S T  ++ W++DS 
Sbjct: 242 CPDRRARNPERRKDPGDAAVVSDGYESAEVLVV-------------SRTNKQDCWVMDSG 288

Query: 354 CTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDL 533
           C+FHM P K+ F N  E ++  V + N ++C V GIG + LK   G   T+  VR+VPDL
Sbjct: 289 CSFHMCPIKSWFQNLVEEESGHVLLGNNRECKVMGIGSVLLKMHDGCVRTITEVRYVPDL 348

Query: 534 CNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVV 713
             NL+S   L+  G   +   G MK++KGSL + +  + N LY+     V+ + N+  V 
Sbjct: 349 RRNLLSIGMLDSKGFNVKIEGGTMKVIKGSLTVMRGSQDNGLYILEASTVTGSSNAA-VG 407

Query: 714 QEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803
             +K  LWH RLGH+S KGL  L K    G
Sbjct: 408 GANKARLWHLRLGHVSEKGLVELSKQNLLG 437


>gb|KYP50611.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 448

 Score =  142 bits (357), Expect = 1e-35
 Identities = 81/223 (36%), Positives = 118/223 (52%), Gaps = 3/223 (1%)
 Frame = +3

Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTV 323
           C  C E GH+  +CP KK  +  +       S+S+N     +V  I D H  ++      
Sbjct: 1   CNYCKEPGHWKNDCPKKKNQKPTAVTVQESTSSSDNE----LVLSIVDNHQQSA------ 50

Query: 324 CENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYT 503
             ++W++DS C++HM P ++ F  Y+E     V M N+  C   GIG I L+   G   T
Sbjct: 51  --DQWVLDSGCSYHMCPNRSWFLTYEERLGGRVFMGNDMPCKTVGIGTIQLRMHDGVIRT 108

Query: 504 LKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPV 683
           L  VRHVPDL  NL+S   L+  G +    NG+M+I +GS V+ +  K+ NLY+  G   
Sbjct: 109 LTEVRHVPDLKKNLISVGVLDSKGFKCNVKNGVMEIKRGSTVVMRGFKKGNLYMLQGSTS 168

Query: 684 SFAGNSVNVVQE---DKTDLWHKRLGHMSSKGLEILHKAGCFG 803
           S +  SV+V ++   D T LWH RLGHMS +G+ IL +    G
Sbjct: 169 SIS-ESVSVAEKNIPDLTYLWHMRLGHMSERGMMILSRQQLLG 210


>gb|KYP48283.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 431

 Score =  139 bits (349), Expect = 1e-34
 Identities = 85/258 (32%), Positives = 128/258 (49%)
 Frame = +3

Query: 21  RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKKG 200
           RG+S+   DN+ K  NH++ N++                  C+NCG+ GHY  +C N   
Sbjct: 192 RGKSR---DNRSKSRNHRSSNNSKTIK--------------CWNCGQTGHYKNQCKNAPK 234

Query: 201 NQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTFHMSPFK 380
           NQ ++  +AN+ASTS             D  ++ S+ S    E  W++DS  +FH +  +
Sbjct: 235 NQ-EAKAEANIASTSGR-----------DDALICSLESK---EESWVLDSGASFHATSQR 279

Query: 381 NLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNNLMSCAA 560
             F NY       V + NE+ C + G G + +K   G  + LKNVRH+PDL  NL+S   
Sbjct: 280 EFFENYVPGNLGKVYLGNEQSCEIVGKGVVKIKL-KGSVWELKNVRHIPDLTKNLISVGQ 338

Query: 561 LEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQEDKTDLWH 740
           L  +G    F     KI KG++ I +  K   LY   G     A + + +   D  +LW+
Sbjct: 339 LASEGYTTTFHGDNWKISKGAMTIARGKKSGTLYKTAG-----AYHLIAIAANDNPNLWY 393

Query: 741 KRLGHMSSKGLEILHKAG 794
           +RLGHMS KG++I+H  G
Sbjct: 394 QRLGHMSEKGMKIMHSKG 411


>gb|OAE31341.1| hypothetical protein AXG93_4510s1170 [Marchantia polymorpha subsp.
           ruderalis]
          Length = 344

 Score =  137 bits (344), Expect = 1e-34
 Identities = 77/222 (34%), Positives = 119/222 (53%), Gaps = 2/222 (0%)
 Frame = +3

Query: 144 CYNCGEVGHYIRECPN--KKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSS 317
           C+ C ++GH  ++C +  KK  +NQ++  + +   S N GD     G  D  ++ +  S 
Sbjct: 84  CFYCNKMGHLKKDCYSWIKKTKENQAST-SQVHQDSANFGD-----GYSDGEVLMA--SG 135

Query: 318 TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYA 497
            +  +EW++DS CT+HM+  K+  S+Y+E+    V M N   C+V GIG + +K   G  
Sbjct: 136 KLKASEWILDSGCTYHMTHNKHWLSDYQELNEGKVIMGNNHSCSVAGIGSVSIKMADGVV 195

Query: 498 YTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGK 677
             L+NVR VP+L  NL+S   L+D G   +   G M I KG+  I K +K   LY  IG+
Sbjct: 196 RILENVRWVPELSRNLISIGILDDLGYTNKIEQGSMYIAKGA-TILKGMKIGGLYYLIGE 254

Query: 678 PVSFAGNSVNVVQEDKTDLWHKRLGHMSSKGLEILHKAGCFG 803
            +    +        K  LWH+RLGH+S KGL+++ K    G
Sbjct: 255 TLYGVASVATTQDHSKATLWHRRLGHISEKGLQLMSKQNLLG 296


>gb|KYP46254.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 609

 Score =  141 bits (355), Expect = 1e-34
 Identities = 84/223 (37%), Positives = 116/223 (52%), Gaps = 3/223 (1%)
 Frame = +3

Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTV 323
           C  C E GH+ ++CP KK NQ      A   S+ EN     +V  I D H     HS+  
Sbjct: 163 CNYCKEPGHWKKDCP-KKRNQKSVAVAAQEDSSFENE----LVLSIVDNH----QHSA-- 211

Query: 324 CENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYT 503
              +W++DS C++HM P ++ F  Y++     V M N+  C   GIG I +K   G   T
Sbjct: 212 --EQWVLDSGCSYHMCPNRSWFLTYEKKSGGDVFMGNDMACKTIGIGTIQIKMHDGVIRT 269

Query: 504 LKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPV 683
           L  VRHVPDL  NL+S   L   G +     G+M+I KGS V+ + +K+ NLY+  G   
Sbjct: 270 LTEVRHVPDLKKNLISVGVLHTKGFKCNVEGGVMEISKGSTVVIRGIKKGNLYMLQGS-T 328

Query: 684 SFAGNSVNVVQE---DKTDLWHKRLGHMSSKGLEILHKAGCFG 803
           +    SV+V  +   D T LWH RLGHMS +G+ +L K    G
Sbjct: 329 NLISESVSVADKHTPDLTHLWHMRLGHMSERGMMVLRKQKLLG 371


>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa Japonica Group]
 gb|ABA92739.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
           Japonica Group]
          Length = 1373

 Score =  141 bits (356), Expect = 2e-34
 Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 10/272 (3%)
 Frame = +3

Query: 3   GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182
           GE +HVRGR++ R  N++ +D      S                   C  C    H I E
Sbjct: 196 GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----------CVYCKLKNHNIDE 244

Query: 183 CPNKKGNQNQSNDQ-----ANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLID 347
           C   +  + ++        A+ A++ +++GD  +V   C               +EW++D
Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGC-----------VAGHDEWILD 293

Query: 348 SACTFHMSPFKNLFSNYKEM-KNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHV 524
           SAC+FH+   +N FS+YK + K   V M ++  C + GIG + +K D G   TLKNVR++
Sbjct: 294 SACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYI 353

Query: 525 PDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKA-VKRNNLYVCIGKPVSFAGNS 701
           P +  NL+S + L+ +G +    +G++K+ KGSLV  K  V    LYV  G  ++ + ++
Sbjct: 354 PGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSA 413

Query: 702 VNVVQED---KTDLWHKRLGHMSSKGLEILHK 788
              +  D   KT+LWH RLGHMS  G+  L K
Sbjct: 414 AAAITNDEPSKTNLWHMRLGHMSHLGMTELMK 445


>gb|ACL97387.1| Gag-Pol polyprotein [Lotus japonicus]
          Length = 1305

 Score =  140 bits (354), Expect = 4e-34
 Identities = 80/219 (36%), Positives = 121/219 (55%), Gaps = 6/219 (2%)
 Frame = +3

Query: 144 CYNCGEVGHYIREC-PNKKGNQNQSN---DQANLASTSENAGDIFMVTGICDVHIVNSVH 311
           CYNCG+ GH  ++C  NKK  +  S     Q  +ASTS++   ++    +       S  
Sbjct: 230 CYNCGKRGHLKKDCWSNKKSGEKSSEASTSQGCVASTSDDGEVLYSEAAV-------STK 282

Query: 312 SSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSG 491
                 + W++DS  T+HM+P ++ F  Y+ +    V M N+    + GIG + +K   G
Sbjct: 283 GKNRLTDVWIVDSGATWHMTPRRDWFCTYEPVSEGNVFMGNDHALEIVGIGTVKIKMYDG 342

Query: 492 YAYTLKNVRHVPDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKAVK-RNNLYVC 668
              TL+ VRHV +L  NL+S   L+D G +     G++K++KGSLV+ KA K   NLY+ 
Sbjct: 343 TIRTLQEVRHVKELAKNLLSVGQLDDLGYKYDIQGGILKVVKGSLVVMKAKKVAANLYML 402

Query: 669 IGKPVSFAGNSVNV-VQEDKTDLWHKRLGHMSSKGLEIL 782
           +G     A  SV V  QE+ T +WH+RLGHMS +GL++L
Sbjct: 403 LGDTWQMADASVAVGSQEETTMMWHRRLGHMSERGLKVL 441


>gb|ABA98804.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
           Japonica Group]
          Length = 1333

 Score =  140 bits (354), Expect = 4e-34
 Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 10/272 (3%)
 Frame = +3

Query: 3   GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182
           GE +HVRGR++ R  N++ +D      S                   C  C    H I E
Sbjct: 196 GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----------CVYCKLKNHNIDE 244

Query: 183 CPNKKGNQNQSNDQ-----ANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLID 347
           C   +  + ++        A+ A++ +++GD  +V   C               +EW++D
Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGC-----------VAGHDEWILD 293

Query: 348 SACTFHMSPFKNLFSNYKEM-KNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHV 524
           SAC+FH+   +N FS+YK + K   V M ++  C + GIG + +K D G   TLKNVR++
Sbjct: 294 SACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYI 353

Query: 525 PDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKA-VKRNNLYVCIGKPVSFAGNS 701
           P +  NL+S + L+ +G +    +G++K+ KGSLV  K  +    LYV  G  ++ + ++
Sbjct: 354 PGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDLNSAKLYVLRGCTLTGSDSA 413

Query: 702 VNVVQED---KTDLWHKRLGHMSSKGLEILHK 788
              V  D   KT+LWH RLGHMS  G+  L K
Sbjct: 414 AAAVTNDEPSKTNLWHMRLGHMSHLGMTELMK 445


>gb|ABA98656.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
           Japonica Group]
          Length = 1333

 Score =  140 bits (354), Expect = 4e-34
 Identities = 86/272 (31%), Positives = 136/272 (50%), Gaps = 10/272 (3%)
 Frame = +3

Query: 3   GEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRE 182
           GE +HVRGR++ R  N++ +D      S                   C  C    H I E
Sbjct: 196 GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----------CVYCKLKNHNIDE 244

Query: 183 CPNKKGNQNQSNDQ-----ANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLID 347
           C   +  + ++        A+ A++ +++GD  +V   C               +EW++D
Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGC-----------VAGHDEWILD 293

Query: 348 SACTFHMSPFKNLFSNYKEM-KNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHV 524
           SAC+FH+   +N FS+YK + K   V M ++  C + GIG + +K D G   TLKNVR++
Sbjct: 294 SACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYI 353

Query: 525 PDLCNNLMSCAALEDDGLEGRFGNGLMKILKGSLVIFKA-VKRNNLYVCIGKPVSFAGNS 701
           P +  NL+S + L+ +G +    +G++K+ KGSLV  K  +    LYV  G  ++ + ++
Sbjct: 354 PGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDLNSAKLYVLRGCTLTGSDSA 413

Query: 702 VNVVQED---KTDLWHKRLGHMSSKGLEILHK 788
              V  D   KT+LWH RLGHMS  G+  L K
Sbjct: 414 AAAVTNDEPSKTNLWHMRLGHMSHLGMTELMK 445


>gb|KYP34487.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 485

 Score =  137 bits (345), Expect = 9e-34
 Identities = 84/261 (32%), Positives = 131/261 (50%), Gaps = 3/261 (1%)
 Frame = +3

Query: 21  RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKKG 200
           RGRS+ R   Q KF N                         C+NC + GH+  +C   K 
Sbjct: 218 RGRSKSRAKGQPKFRND----------------------IVCWNCDKRGHFTNQCKAPKK 255

Query: 201 NQN---QSNDQANLASTSENAGDIFMVTGICDVHIVNSVHSSTVCENEWLIDSACTFHMS 371
           N+N   + +D++  A+T E            D  ++ S+ S       W++DS  +FH +
Sbjct: 256 NKNHKKRDDDESANAATDE-----------IDDALICSLDSPI---ESWIMDSGASFHTT 301

Query: 372 PFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYAYTLKNVRHVPDLCNNLMS 551
           P   L +NY   +   V +A+ K  N+ G GDI ++  SG  +TLKNVRH+P L  NL+S
Sbjct: 302 PSNELLTNYVSGRFGKVYLADGKPLNIVGKGDIAIRTSSGSHWTLKNVRHIPALKRNLIS 361

Query: 552 CAALEDDGLEGRFGNGLMKILKGSLVIFKAVKRNNLYVCIGKPVSFAGNSVNVVQEDKTD 731
              L+D+G E  FG+G  K+ KG+L++ +  KR +LY+   + +     + N      + 
Sbjct: 362 VGQLDDEGHETTFGDGAWKVKKGNLIVARGKKRGSLYMVADENMIAVTEAAN-----NSF 416

Query: 732 LWHKRLGHMSSKGLEILHKAG 794
           LWH+RLGHMS KG++++   G
Sbjct: 417 LWHQRLGHMSEKGMKLMATKG 437


>gb|PNX74094.1| copia LTR rider, partial [Trifolium pratense]
          Length = 876

 Score =  139 bits (351), Expect = 1e-33
 Identities = 80/222 (36%), Positives = 125/222 (56%), Gaps = 7/222 (3%)
 Frame = +3

Query: 144 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVHIVNSVHSS 317
           CY+C ++GH+ R+CPN K   + S   ANL  T ++   GDI  V             SS
Sbjct: 159 CYSCKQIGHWKRDCPNIKQGSSTS---ANLVHTDDSCSEGDILCV-------------SS 202

Query: 318 TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVSMANEKKCNVFGIGDICLKFDSGYA 497
           + C + W++DS C++HM+P +  F+ ++     FV + ++K C + GIG I +  D G  
Sbjct: 203 SKCTDAWILDSGCSYHMTPNREWFTTFRSGSFGFVYLGDDKACAITGIGQIKIAMDDGGV 262

Query: 498 YTLKNVRHVPDLCNNLMSCAALEDDGLEGRF--GNGLMKILKGSLVIFKAVKR--NNLYV 665
            TL NVR++P+L  NL+S   L+++G   R      ++K+ KG+L + + VKR   N+Y 
Sbjct: 263 RTLTNVRYIPELRKNLISLGTLQENGYSYRSDRDRDILKVSKGALTVMR-VKRTAGNIYK 321

Query: 666 CIGKPVSFAGNSVNVVQE-DKTDLWHKRLGHMSSKGLEILHK 788
            +G  V   G+  +V  + D T LWH RLGH+S +G+  LHK
Sbjct: 322 LLGNTV--VGDVASVESDNDATKLWHLRLGHLSERGMMELHK 361


Top