BLASTX nr result

ID: Panax24_contig00024330 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Panax24_contig00024330
         (1312 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

JAU08363.1 Retrovirus-related Pol polyprotein from transposon TN...   441   e-139
JAU51269.1 Retrovirus-related Pol polyprotein from transposon TN...   389   e-130
OMP03432.1 Integrase, catalytic core [Corchorus capsularis]           379   e-116
KYP65378.1 Retrovirus-related Pol polyprotein from transposon TN...   340   e-111
CAN63563.1 hypothetical protein VITISV_003097 [Vitis vinifera]        293   5e-86
XP_013688817.1 PREDICTED: uncharacterized protein LOC106392554, ...   296   6e-86
CAB71063.1 copia-type polyprotein [Arabidopsis thaliana]              290   4e-84
AAD50001.1 Hypothetical protein [Arabidopsis thaliana]                290   4e-84
AAG50698.1 copia-type polyprotein, putative [Arabidopsis thalian...   289   1e-83
CAB75469.1 copia-type reverse transcriptase-like protein [Arabid...   287   5e-83
JAU09122.1 Retrovirus-related Pol polyprotein from transposon TN...   267   9e-83
AAG60117.1 copia-type polyprotein, putative [Arabidopsis thaliana]    286   1e-82
KYP66219.1 Retrovirus-related Pol polyprotein from transposon TN...   280   1e-81
GAU51371.1 hypothetical protein TSUD_247260 [Trifolium subterran...   282   5e-81
KYP69041.1 Retrovirus-related Pol polyprotein from transposon TN...   280   2e-80
KYP44533.1 Retrovirus-related Pol polyprotein from transposon TN...   280   2e-80
XP_016709245.1 PREDICTED: LOW QUALITY PROTEIN: pleiotropic drug ...   280   3e-80
KZV33171.1 Integrase, catalytic core domain containing protein [...   269   1e-79
KYP66220.1 Retrovirus-related Pol polyprotein from transposon TN...   276   5e-79
KYP57183.1 Retrovirus-related Pol polyprotein from transposon TN...   270   1e-78

>JAU08363.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Noccaea
            caerulescens]
          Length = 1335

 Score =  441 bits (1133), Expect = e-139
 Identities = 214/440 (48%), Positives = 295/440 (67%), Gaps = 17/440 (3%)
 Frame = -3

Query: 1274 VSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVLFTGRRKESLY 1095
            ++LK+VYHVPG+ KNL SV    D+G YVLFGPK+V+ L NI+ ++ADV+ TG R + LY
Sbjct: 360  ITLKNVYHVPGVKKNLLSVVNAVDSGNYVLFGPKDVKFLKNIQELKADVVHTGARVKDLY 419

Query: 1094 LLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLFNEIHHDVVCPGC 915
            +LSAS++Y+EK S N++  +WH RLGH+    L+ +  K L++G+P         +C GC
Sbjct: 420  VLSASNSYIEKMSTNDNDFIWHARLGHINMTKLKVMVNKDLVNGLPKLKIKDEGKLCEGC 479

Query: 914  QYGKSHCLPFPNSKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLVDEFSRFTWVYFLE 735
            QYGKSH LPF NS +R +A L+ VHSDLMGPT+T SYS FRY+++ VD+FSR+TWVYF++
Sbjct: 480  QYGKSHRLPFDNSTSRCNAPLERVHSDLMGPTRTSSYSGFRYMLLFVDDFSRYTWVYFVK 539

Query: 734  NKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREHDIQRQMTFRETPQ 555
             KSE FS F +FK  VE E G +IK +RTDNGGE+MS++FL++CR+  I+R+ T   TPQ
Sbjct: 540  EKSEVFSKFQEFKVTVEGELGRKIKTLRTDNGGEFMSNEFLSFCRKQGIKREFTCPYTPQ 599

Query: 554  QNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSWPGTEPSPFEALYH 375
            QNGV ERK+ +L+  C SWLH KNLP+ LWA  ++   +VINR+P  P    SP+E ++ 
Sbjct: 600  QNGVAERKIRHLSETCRSWLHAKNLPKALWAEGMRCAAYVINRMPLSPNNMKSPYEMVHG 659

Query: 374  HTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWKCMDPETKKVDVSR 195
              P+V +FR+FG +CY HV  + +TKL+ +A++CIFVGYD  RKGW+CMDPET +  +SR
Sbjct: 660  KKPTVKHFRIFGSICYVHVFDSQRTKLEAKAKKCIFVGYDEQRKGWRCMDPETHRYTISR 719

Query: 194  DVVFDEVSSL--------------QIDTDRDTIDLSPFPDGASRE---RGSNITPTKENI 66
            DVVFDEVSS                +  D   + +    DG S E   +G   +  +E  
Sbjct: 720  DVVFDEVSSYYGPPQVLVEQDGAGSLKGDEPAVQIP--SDGGSSEPETQGERGSTNQEED 777

Query: 65   QEEETTGTVLQRTSRQRRQP 6
            +EE+      QR  R   +P
Sbjct: 778  EEEDQGSMANQRPKRNVVKP 797


>JAU51269.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Noccaea caerulescens]
          Length = 334

 Score =  389 bits (998), Expect = e-130
 Identities = 176/334 (52%), Positives = 243/334 (72%)
 Frame = -3

Query: 1199 GRYVLFGPKNVQILSNIKHIEADVLFTGRRKESLYLLSASDAYVEKTSQNESATLWHNRL 1020
            G YVLFG K+V+ L NI+ ++ADV+ TG R + LY+LSAS++Y+EK S N++  +WH RL
Sbjct: 1    GNYVLFGTKDVKFLKNIQELKADVVHTGARVKDLYVLSASNSYIEKMSTNDNDFIWHARL 60

Query: 1019 GHVGYQLLQKISTKKLLDGVPLFNEIHHDVVCPGCQYGKSHCLPFPNSKNRASAALQLVH 840
            GH+    L+ +  K L++G+P         +C GCQYGKSH LPF NS +R +A L+ VH
Sbjct: 61   GHINMTKLKVMVNKDLVNGLPKLKIQDEGKLCEGCQYGKSHRLPFDNSTSRCNAPLERVH 120

Query: 839  SDLMGPTKTPSYSSFRYVMVLVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIK 660
            SDLMGPT+T SYS FRY+++ VD+FSR+TWVYF++ KSE FS F +FK  VE E G +IK
Sbjct: 121  SDLMGPTRTSSYSGFRYMLLFVDDFSRYTWVYFVKEKSEVFSKFQEFKVTVEGELGRKIK 180

Query: 659  CMRTDNGGEYMSDQFLNYCREHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNL 480
             +RTDNGGE+MS++FL++CR+  I+R+ T   TPQQNGV ERK+ +L+  C SWLH KNL
Sbjct: 181  TLRTDNGGEFMSNEFLSFCRKQGIKREFTCPYTPQQNGVAERKIRHLSETCRSWLHAKNL 240

Query: 479  PRELWAAAVQSTCHVINRLPSWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQT 300
            P+ LWA  ++   +VINR+P  P    SP+E ++   P+V +FR+FG +CY HV  + +T
Sbjct: 241  PKALWAEGMRCAAYVINRMPLSPNNMKSPYEMVHGKKPTVKHFRIFGSICYVHVFDSQRT 300

Query: 299  KLDPRARRCIFVGYDTHRKGWKCMDPETKKVDVS 198
            KL+ +A++CIFVGYD  RKGW+CMDPET +  +S
Sbjct: 301  KLEAKAKKCIFVGYDEQRKGWRCMDPETHRYTIS 334


>OMP03432.1 Integrase, catalytic core [Corchorus capsularis]
          Length = 1347

 Score =  379 bits (972), Expect = e-116
 Identities = 183/374 (48%), Positives = 254/374 (67%)
 Frame = -3

Query: 1289 SNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVLFTGRR 1110
            S    V L +V+HV G+ KNL SV+Q+TD G YV+FGP++V++  +       ++  G+ 
Sbjct: 390  SGQHQVQLDNVFHVSGMKKNLLSVAQLTDPGNYVVFGPRDVKVYPSFTPTCPPIM-EGKW 448

Query: 1109 KESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLFNEIHHDV 930
             E +Y++SA  AYV+KT +NE+A LWH RLGHV Y  L+ +  K +L G+P   ++  D 
Sbjct: 449  MEYVYVMSAQTAYVDKTRRNETADLWHARLGHVSYFKLKAMMKKSMLKGLPQL-DVKEDT 507

Query: 929  VCPGCQYGKSHCLPFPNSKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLVDEFSRFTW 750
            VC GCQYG++H LP+  SK +A   LQLVHSD+ G  K PS S ++Y++  +D++SR+ W
Sbjct: 508  VCAGCQYGRAHQLPYEESKFKAKMPLQLVHSDVFGKMKQPSVSGYQYMITFIDDYSRYVW 567

Query: 749  VYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREHDIQRQMTF 570
            V F++ KSEA + F +FKE+VEKE G +I+C+RTDNGGEY S +F N+ RE  I+RQ+T 
Sbjct: 568  VDFMKEKSEALTKFKEFKERVEKEVGRKIQCLRTDNGGEYTSKEFSNFLRECRIRRQLTC 627

Query: 569  RETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSWPGTEPSPF 390
              TPQQNGV ERK  +L  +C S LH +N+P   WA  +++  HV+NRLP       SPF
Sbjct: 628  PNTPQQNGVAERKNMHLAEICRSMLHARNVPPRFWAECMKTVAHVVNRLPQARLDFISPF 687

Query: 389  EALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWKCMDPETKK 210
            + L++  P+VS+FRVFG +CY  V    ++K D +A RCIFVGYD  RKGW+C DP T +
Sbjct: 688  QKLWNMKPTVSHFRVFGCICYVFVPDHLRSKFDKKAIRCIFVGYDDQRKGWRCCDPTTGR 747

Query: 209  VDVSRDVVFDEVSS 168
              VSR+VVFDE SS
Sbjct: 748  CYVSRNVVFDEASS 761


>KYP65378.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Cajanus cajan]
          Length = 337

 Score =  340 bits (871), Expect = e-111
 Identities = 161/320 (50%), Positives = 223/320 (69%)
 Frame = -3

Query: 1127 LFTGRRKESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLFN 948
            +  GRR ES+Y++SA +AYV K  +NE+A LWH RLGHV Y  L+ +  + +L G+P   
Sbjct: 4    IMKGRRLESVYVMSAQEAYVNKARKNETADLWHARLGHVSYNRLKAMMKQSMLRGLPNL- 62

Query: 947  EIHHDVVCPGCQYGKSHCLPFPNSKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLVDE 768
            E+  +VVC GCQYGK+H LP+  SK +A A L+LVHSD+ GP K  S S  +Y++ ++D+
Sbjct: 63   EMRENVVCVGCQYGKAHELPYEESKYKAKAPLELVHSDVFGPVKQLSISKNKYMITIIDD 122

Query: 767  FSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREHDI 588
            +SR+ WVYFL+ KSEA   F++FKE++EKE G  I+C+RTDNGGEY S +F  Y ++  I
Sbjct: 123  YSRYVWVYFLKEKSEALKKFIEFKEKIEKEVGRMIRCLRTDNGGEYTSKEFNQYLQKCGI 182

Query: 587  QRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSWPG 408
            +RQ+T + T QQNG++ERK  +L   C S LH+KN+P   W   ++++ +VINRLP    
Sbjct: 183  RRQLTCQNTLQQNGIIERKNRHLVKTCRSMLHSKNVPPRFWTECMKTSAYVINRLPQARL 242

Query: 407  TEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWKCM 228
               SP E L++  PSVSYFRVFG VCY  +    ++K + +A RCIFVGYD+ RKGW+C 
Sbjct: 243  GFVSPHEKLWNTKPSVSYFRVFGCVCYVFMPGQERSKFEKKAIRCIFVGYDSQRKGWRCC 302

Query: 227  DPETKKVDVSRDVVFDEVSS 168
            +P T +  VSR+VVFDE SS
Sbjct: 303  NPTTGRCYVSRNVVFDEASS 322


>CAN63563.1 hypothetical protein VITISV_003097 [Vitis vinifera]
          Length = 1052

 Score =  293 bits (749), Expect = 5e-86
 Identities = 149/335 (44%), Positives = 207/335 (61%)
 Frame = -3

Query: 1286 NNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVLFTGRRK 1107
            N   VSL++VYHVPG+ KNL SV+Q+T +G +VLF P++V++  +++ +E  V+  G R 
Sbjct: 277  NTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHFVLFSPQDVKVXRDLEIMEEPVI-KGWRL 335

Query: 1106 ESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLFNEIHHDVV 927
            ES+Y++    AYV+KT +NE A LWH RL HV Y  L  +  K +L G+P          
Sbjct: 336  ESIYVMFVETAYVDKTRKNEIADLWHMRLSHVSYSKLTVMMKKSMLKGLPQLE------- 388

Query: 926  CPGCQYGKSHCLPFPNSKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLVDEFSRFTWV 747
                  GK+H L +  SK +A   L+L+HSD+ GP K    S  +Y++  +D+FSR+ WV
Sbjct: 389  ------GKAHQLSYEESKWKAKGPLELIHSDVFGPVKQAXLSGMKYMVTFIDDFSRYVWV 442

Query: 746  YFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREHDIQRQMTFR 567
            YF++ KSE FS F +FKE  E E   +I C+RTDNG  Y S++F  + RE  ++ Q T  
Sbjct: 443  YFMKEKSETFSKFKEFKEMTEIEVDKRIHCLRTDNGXXYTSNEFFYFLRECRVRHQFTCA 502

Query: 566  ETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSWPGTEPSPFE 387
             T QQNGV ERK  +L  +C S LH KN+P   WA A+++   VINRLP       SPFE
Sbjct: 503  NTLQQNGVAERKNRHLAEICRSMLHAKNVPGRFWAEAMKTXAFVINRLPQQRLNFSSPFE 562

Query: 386  ALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRA 282
             L++  P+VSYFRVFG VCY  V K  + K+D +A
Sbjct: 563  KLWNIKPTVSYFRVFGCVCYVFVPKHLRNKMDKKA 597


>XP_013688817.1 PREDICTED: uncharacterized protein LOC106392554, partial [Brassica
            napus]
          Length = 2682

 Score =  296 bits (759), Expect = 6e-86
 Identities = 162/441 (36%), Positives = 245/441 (55%), Gaps = 5/441 (1%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + +VY++P +  N+ S+ Q+ + G  +     ++ +  N  ++   
Sbjct: 385  GNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNSLSLRDNANNLITK 444

Query: 1130 VLFTGRRKESLYLLSASD--AYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            V  +  R   +++L+  +  A   K    E + LWH R GH+ +  L+ +S K+++ G+P
Sbjct: 445  VPMSSNR---MFVLNIQNDIARCLKMCYKEESWLWHLRFGHLNFGGLELLSKKEMVKGLP 501

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
              N  H + VC GC  GK   + FP  S+ RA   L+L+H+D+ GP K  S     Y ++
Sbjct: 502  CIN--HPNQVCEGCLLGKQFKMSFPKESETRARKPLELIHTDVCGPIKPSSLGKSNYFLL 559

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D+FSR TWVYFL+ KSE F NF KFK  VEKE GL+IK MR+D GGE+MS +FL YC 
Sbjct: 560  FIDDFSRKTWVYFLKQKSEVFENFKKFKAHVEKESGLKIKSMRSDRGGEFMSKEFLKYCE 619

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            ++ I+RQ+T   TPQQNGV ERK   +  M  S L +K LP+ELWA AV    ++ NR P
Sbjct: 620  DNGIRRQLTVPRTPQQNGVAERKNRTILEMARSMLKSKKLPKELWAEAVACAVYISNRSP 679

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +    E +P EA     P VS+ RVFG + +AHV    ++KLD ++ + IF+GYD + KG
Sbjct: 680  TKSVLEKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDANSKG 739

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPF--PDGASRERGSNITPTKENI 66
            +K  +PETKK  +SR+V+FDE       ++ +  +  P    +   + R    TP     
Sbjct: 740  YKLYNPETKKTIISRNVIFDEEGEWDWRSNNEDYNFFPSFEEENVEQPREEPATPPTSPT 799

Query: 65   QEEETTGTVLQRTSRQRRQPD 3
               +   +  +RT R R   D
Sbjct: 800  TSSQGDESSSERTPRFRSLQD 820



 Score =  296 bits (759), Expect = 6e-86
 Identities = 162/441 (36%), Positives = 245/441 (55%), Gaps = 5/441 (1%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + +VY++P +  N+ S+ Q+ + G  +     ++ +  N  ++   
Sbjct: 1724 GNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNSLSLRDNANNLITK 1783

Query: 1130 VLFTGRRKESLYLLSASD--AYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            V  +  R   +++L+  +  A   K    E + LWH R GH+ +  L+ +S K+++ G+P
Sbjct: 1784 VPMSSNR---MFVLNIQNDIARCLKMCYKEESWLWHLRFGHLNFGGLELLSKKEMVKGLP 1840

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
              N  H + VC GC  GK   + FP  S+ RA   L+L+H+D+ GP K  S     Y ++
Sbjct: 1841 CIN--HPNQVCEGCLLGKQFKMSFPKESETRARKPLELIHTDVCGPIKPSSLGKSNYFLL 1898

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D+FSR TWVYFL+ KSE F NF KFK  VEKE GL+IK MR+D GGE+MS +FL YC 
Sbjct: 1899 FIDDFSRKTWVYFLKQKSEVFENFKKFKAHVEKESGLKIKSMRSDRGGEFMSKEFLKYCE 1958

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            ++ I+RQ+T   TPQQNGV ERK   +  M  S L +K LP+ELWA AV    ++ NR P
Sbjct: 1959 DNGIRRQLTVPRTPQQNGVAERKNRTILEMARSMLKSKKLPKELWAEAVACAVYISNRSP 2018

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +    E +P EA     P VS+ RVFG + +AHV    ++KLD ++ + IF+GYD + KG
Sbjct: 2019 TKSVLEKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDANSKG 2078

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPF--PDGASRERGSNITPTKENI 66
            +K  +PETKK  +SR+V+FDE       ++ +  +  P    +   + R    TP     
Sbjct: 2079 YKLYNPETKKTIISRNVIFDEEGEWDWRSNNEDYNFFPSFEEENVEQPREEPATPPTSPT 2138

Query: 65   QEEETTGTVLQRTSRQRRQPD 3
               +   +  +RT R R   D
Sbjct: 2139 TSSQGDESSSERTPRFRSLQD 2159


>CAB71063.1 copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  290 bits (743), Expect = 4e-84
 Identities = 160/434 (36%), Positives = 243/434 (55%), Gaps = 3/434 (0%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + +VY++P +  N+ S+ Q+ + G  +     N+ I     ++   
Sbjct: 377  GNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITK 436

Query: 1130 VLFTGRRKESLYLLSASDAYVE--KTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            V  +   K  +++L+  +   +  K    E + LWH R GH+ +  L+ +S K+++ G+P
Sbjct: 437  VPMS---KNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
              N  H + VC GC  GK   + FP  S +RA   L+L+H+D+ GP K  S     Y ++
Sbjct: 494  CIN--HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLL 551

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D+FSR TWVYFL+ KSE F  F KFK  VEKE GL IK MR+D GGE+ S +FL YC 
Sbjct: 552  FIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCE 611

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            ++ I+RQ+T   +PQQNGVVERK   +  M  S L +K LP+ELWA AV    +++NR P
Sbjct: 612  DNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP 671

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +   +  +P EA     P VS+ RVFG + +AHV    ++KLD ++ + IF+GYD + KG
Sbjct: 672  TKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKG 731

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFPDGASRERGSNITPTKENIQE 60
            +K  +P+TKK  +SR++VFDE      +++ +  +  P  +    E      PT+E    
Sbjct: 732  YKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPE------PTREEPPS 785

Query: 59   EETTGTVLQRTSRQ 18
            EE T      TS Q
Sbjct: 786  EEPTTPPTSPTSSQ 799


>AAD50001.1 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  290 bits (743), Expect = 4e-84
 Identities = 160/434 (36%), Positives = 243/434 (55%), Gaps = 3/434 (0%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + +VY++P +  N+ S+ Q+ + G  +     N+ I     ++   
Sbjct: 377  GNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITK 436

Query: 1130 VLFTGRRKESLYLLSASDAYVE--KTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            V  +   K  +++L+  +   +  K    E + LWH R GH+ +  L+ +S K+++ G+P
Sbjct: 437  VPMS---KNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
              N  H + VC GC  GK   + FP  S +RA   L+L+H+D+ GP K  S     Y ++
Sbjct: 494  CIN--HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLL 551

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D+FSR TWVYFL+ KSE F  F KFK  VEKE GL IK MR+D GGE+ S +FL YC 
Sbjct: 552  FIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCE 611

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            ++ I+RQ+T   +PQQNGVVERK   +  M  S L +K LP+ELWA AV    +++NR P
Sbjct: 612  DNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP 671

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +   +  +P EA     P VS+ RVFG + +AHV    ++KLD ++ + IF+GYD + KG
Sbjct: 672  TKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKG 731

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFPDGASRERGSNITPTKENIQE 60
            +K  +P+TKK  +SR++VFDE      +++ +  +  P  +    E      PT+E    
Sbjct: 732  YKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPE------PTREEPPS 785

Query: 59   EETTGTVLQRTSRQ 18
            EE T      TS Q
Sbjct: 786  EEPTTPPTSPTSSQ 799


>AAG50698.1 copia-type polyprotein, putative [Arabidopsis thaliana] AAG50765.1
            copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  289 bits (739), Expect = 1e-83
 Identities = 159/434 (36%), Positives = 242/434 (55%), Gaps = 3/434 (0%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + +VY++P +  N+ S+ Q+ + G  +     N+ I     ++   
Sbjct: 377  GNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITK 436

Query: 1130 VLFTGRRKESLYLLSASDAYVE--KTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            V  +   K  +++L+  +   +  K    E + LWH R GH+ +  L+ +S K+++ G+P
Sbjct: 437  VPMS---KNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
              N  H + VC GC  GK   + FP  S +RA   L+L+H+D+ GP K  S     Y ++
Sbjct: 494  CIN--HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLL 551

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D+FSR TWVYFL+ KSE F  F KFK  VEKE GL IK MR+D GGE+ S +FL YC 
Sbjct: 552  FIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCE 611

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            ++ I+RQ+T   +PQQNGV ERK   +  M  S L +K LP+ELWA AV    +++NR P
Sbjct: 612  DNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP 671

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +   +  +P EA     P VS+ RVFG + +AHV    ++KLD ++ + IF+GYD + KG
Sbjct: 672  TKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKG 731

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFPDGASRERGSNITPTKENIQE 60
            +K  +P+TKK  +SR++VFDE      +++ +  +  P  +    E      PT+E    
Sbjct: 732  YKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPE------PTREEPPS 785

Query: 59   EETTGTVLQRTSRQ 18
            EE T      TS Q
Sbjct: 786  EEPTTPPTSPTSSQ 799


>CAB75469.1 copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1272

 Score =  287 bits (734), Expect = 5e-83
 Identities = 158/434 (36%), Positives = 241/434 (55%), Gaps = 3/434 (0%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + +VY++P +  N+ S+ Q+ + G  +     N+ I     ++   
Sbjct: 377  GNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDKESNLITK 436

Query: 1130 VLFTGRRKESLYLLSASDAYVE--KTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            V  +   K  +++L+  +   +  K    E + LWH R GH+ +  L+ +S K+++ G+P
Sbjct: 437  VPMS---KNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
              N  H + VC GC  G    + FP  S +RA   L+L+H+D+ GP K  S     Y ++
Sbjct: 494  CIN--HPNQVCEGCLLGNQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLL 551

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D+FSR TWVYFL+ KSE F  F KFK  VEKE GL IK MR+D+GGE+ S +FL YC 
Sbjct: 552  FIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDSGGEFTSKEFLKYCE 611

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            ++ I+RQ+T   +PQQNGV ERK   +  M  S L +K LP+ELWA AV    +++NR P
Sbjct: 612  DNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP 671

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +   +  +P EA     P VS+ RVFG + +AHV    + KLD ++ + IF+GYD + KG
Sbjct: 672  TKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRNKLDDKSEKYIFIGYDNNSKG 731

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFPDGASRERGSNITPTKENIQE 60
            +K  +P+TKK  +SR++VFDE      +++ +  +  P  +    E      PT+E    
Sbjct: 732  YKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPE------PTREEPPS 785

Query: 59   EETTGTVLQRTSRQ 18
            EE T      TS Q
Sbjct: 786  EEPTTPPTSPTSSQ 799


>JAU09122.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Noccaea caerulescens]
          Length = 357

 Score =  267 bits (683), Expect = 9e-83
 Identities = 143/345 (41%), Positives = 201/345 (58%), Gaps = 26/345 (7%)
 Frame = -3

Query: 1133 DVLFTGRRK--------------------ESLYLLSASDAYVEK-----TSQNESATLWH 1029
            D+LFTG RK                        LL  S+  VE+        +  A LWH
Sbjct: 2    DILFTGDRKTCNIFHPTRGKIAESVMSTNRMFILLGESNVAVEEEKCLQVDISNKAELWH 61

Query: 1028 NRLGHVGYQLLQKISTKKLLDGVPLFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAAL 852
            +R GH+ Y+ L  + +K+++ G+P   ++    VC  C  GK H +PFP  SK RA+  L
Sbjct: 62   HRYGHLSYKGLNTLCSKEMVVGLPEIEDVK--TVCDACVRGKHHRVPFPKQSKWRATERL 119

Query: 851  QLVHSDLMGPTKTPSYSSFRYVMVLVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFG 672
            +L+HSDL GP    S S  RY++  +D+FSR TW+YF+  KSEAF  F  FK  VEK+ G
Sbjct: 120  ELIHSDLCGPISPSSNSQIRYLISFIDDFSRKTWIYFVGEKSEAFDRFKTFKAFVEKQTG 179

Query: 671  LQIKCMRTDNGGEYMSDQFLNYCREHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLH 492
            L IK +RTD GGEY S++F  +CREH I+RQ+T   TPQQNGV ERK   + +M  + L 
Sbjct: 180  LLIKGLRTDRGGEYNSNEFKGFCREHGIKRQLTTAFTPQQNGVAERKNRTIMNMVRAALL 239

Query: 491  TKNLPRELWAAAVQSTCHVINRLPSWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSK 312
             K +P+  W  AVQ   H++NR P+    + +P EA     PSV +FRVFG V Y H+  
Sbjct: 240  EKEVPKSFWPDAVQWVNHILNRSPTLVVKDKTPEEAWSGKKPSVEHFRVFGCVGYVHIPD 299

Query: 311  TNQTKLDPRARRCIFVGYDTHRKGWKCMDPETKKVDVSRDVVFDE 177
              ++KLD ++ +C+ +G+ +  K +K  DP TKK+ +SRDV+F+E
Sbjct: 300  AKRSKLDDKSVKCVLLGFSSESKAFKMFDPATKKIHISRDVIFEE 344


>AAG60117.1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  286 bits (733), Expect = 1e-82
 Identities = 158/434 (36%), Positives = 242/434 (55%), Gaps = 3/434 (0%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + +VY++P +  N+ S+ Q+ + G  +     N+ I     ++   
Sbjct: 377  GNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITK 436

Query: 1130 VLFTGRRKESLYLLSASDAYVE--KTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            V  +   K  +++L+  +   +  K    E + LWH R GH+ +  L+ +S K+++ G+P
Sbjct: 437  VPMS---KNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLP 493

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
              N  H + VC GC  GK   + FP  S +RA  +L+L+H+D+ GP K  S     Y ++
Sbjct: 494  CIN--HPNQVCEGCLLGKQFKMSFPKESSSRAQKSLELIHTDVCGPIKPKSLGKSNYFLL 551

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D+FSR TWVYFL+ KSE F  F KFK  VEKE GL IK MR+D GGE+ S +FL YC 
Sbjct: 552  FIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCE 611

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            ++ I+RQ+T   +PQQNGV ERK   +  M  S L +K LP+ELWA AV    +++NR P
Sbjct: 612  DNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP 671

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +   +  +P EA       VS+ RVFG + +AHV    ++KLD ++ + IF+GYD + KG
Sbjct: 672  TKSVSGKTPQEAWSGRKSGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKG 731

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFPDGASRERGSNITPTKENIQE 60
            +K  +P+TKK  +SR++VFDE      +++ +  +  P  +    E      PT+E    
Sbjct: 732  YKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPE------PTREEPPS 785

Query: 59   EETTGTVLQRTSRQ 18
            EE T      TS Q
Sbjct: 786  EEPTTPPTSPTSSQ 799


>KYP66219.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1033

 Score =  280 bits (717), Expect = 1e-81
 Identities = 154/410 (37%), Positives = 231/410 (56%), Gaps = 5/410 (1%)
 Frame = -3

Query: 1292 ISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVLFTGR 1113
            + N     + +VY+VP +  N+ S+ Q+ + G  +     N+ I  N     A V  T  
Sbjct: 315  LKNGEHQFISNVYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIAKVPMTRN 374

Query: 1112 RKESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLFNEIHHD 933
            R   L + S     ++   +++S  LWH R GH+ ++ L+ +S K ++ G+P     H +
Sbjct: 375  RMFVLNIQSDGPQCLKMCYKDQS-WLWHLRFGHLNFKGLELLSKKAMVRGLPCIT--HPN 431

Query: 932  VVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLVDEFSRF 756
             VC GC  GK   L FP  S +RA   L+L+H+D+ GP K  S     Y ++ +D+FSR 
Sbjct: 432  QVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFIDDFSRK 491

Query: 755  TWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREHDIQRQM 576
            TWVYFL+ KSE F NF KFK  VEKE GL IK +R+D GGE+ S +F  YC ++ I+RQ+
Sbjct: 492  TWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDNGIRRQL 551

Query: 575  TFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSWPGTEPS 396
            T   +PQQNGV ERK   +  M  S L +K LP+E WA AV    ++ NR P+   +  +
Sbjct: 552  TVPRSPQQNGVAERKNRTILEMARSMLKSKKLPKEFWAEAVACAVYLTNRSPTRSVSGKT 611

Query: 395  PFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWKCMDPET 216
            P EA     P +S+ RVFG + + HV    ++KLD ++ + IF+GYD + KG+K  +P++
Sbjct: 612  PQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPDS 671

Query: 215  KKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFP----DGASRERGSNITPT 78
            +K  +SR+VVFDE       T+ +  D + FP    D   +++    TPT
Sbjct: 672  RKTIISRNVVFDEEGEWDWSTNCE--DHTFFPCVEEDDVEQQQQPQETPT 719


>GAU51371.1 hypothetical protein TSUD_247260 [Trifolium subterraneum]
          Length = 1980

 Score =  282 bits (722), Expect = 5e-81
 Identities = 170/452 (37%), Positives = 249/452 (55%), Gaps = 25/452 (5%)
 Frame = -3

Query: 1286 NNRGVS--LKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVLFTGR 1113
            N +GV+  ++DVY+VPGL  NL SV Q+ + G  VL      +I  + K +      T  
Sbjct: 342  NVKGVNYLVRDVYYVPGLKNNLLSVGQLQERGLAVLMQSNECRIYHHTKGLVFQTNMTAN 401

Query: 1112 RKESLYLLSASDAYVEKTSQNES--------ATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            R   +++L +S   ++K ++ E         A LWH R GH+ Y+ L+ + TK ++ G+P
Sbjct: 402  R---MFVLLSSTQSIKKENKEECFQVTTDDVAHLWHRRFGHLSYKGLKTLQTKNMVRGLP 458

Query: 956  LFNEIHHDVVCPGCQYGKSHC-LPFPNSKNRASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
             F+    ++VC  C  GK H  +    S  RAS  L+LVH+D+ GP    S  + RY + 
Sbjct: 459  SFSV--GEIVCTNCLKGKQHRDVISRRSTWRASEKLELVHADICGPISPFSEGNKRYFIC 516

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D++SR  WVYFL  KS+AF+ F  FK  VEKE GL IKC+RTD GGE+ S++F  YC+
Sbjct: 517  FIDDYSRKAWVYFLAYKSDAFTTFKLFKALVEKETGLSIKCLRTDRGGEFTSNEFKEYCK 576

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
             + I+RQ+T   TPQQNGV ERK   + +M  S L  KN+PR+ W  AV    +V+NR P
Sbjct: 577  MNGIKRQLTVAYTPQQNGVAERKNRTVMNMVRSLLVEKNVPRKFWVEAVNWAFYVLNRCP 636

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +    E +P EA Y   PSV + RVFG + YAHV    +TKL+ ++R C+  G     K 
Sbjct: 637  TSSVKEMTPVEAWYGMKPSVGHLRVFGCIAYAHVPDARRTKLEDKSRCCVLFGVSEESKA 696

Query: 239  WKCMDPETKKVDVSRDVVFDEVS--SLQIDTDRDTIDLSPFPDGASRERGSNITPTKE-- 72
            ++  DP +K++ +SRDVVF+E    + +  ++ D    + + D  S ER  +    +E  
Sbjct: 697  YRLYDPTSKRIIISRDVVFEEDGQWNWEKKSEEDNKFDTEWEDEKSEEREESSDGNEEEN 756

Query: 71   ----NIQEEETTGTVLQRTS------RQRRQP 6
                N  E  T G   + T+      R RR P
Sbjct: 757  ATDGNEDENATDGNEDENTASPVTEHRNRRAP 788


>KYP69041.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1342

 Score =  280 bits (717), Expect = 2e-80
 Identities = 154/410 (37%), Positives = 231/410 (56%), Gaps = 5/410 (1%)
 Frame = -3

Query: 1292 ISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVLFTGR 1113
            + N     + +VY+VP +  N+ S+ Q+ + G  +     N+ I  N     A V  T  
Sbjct: 381  LKNGEHQFISNVYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIAKVPMTRN 440

Query: 1112 RKESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLFNEIHHD 933
            R   L + S     ++   +++S  LWH R GH+ ++ L+ +S K ++ G+P     H +
Sbjct: 441  RMFVLNIQSDGPQCLKMCYKDQS-WLWHLRFGHLNFKGLELLSKKAMVRGLPCIT--HPN 497

Query: 932  VVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLVDEFSRF 756
             VC GC  GK   L FP  S +RA   L+L+H+D+ GP K  S     Y ++ +D+FSR 
Sbjct: 498  QVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFIDDFSRK 557

Query: 755  TWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREHDIQRQM 576
            TWVYFL+ KSE F NF KFK  VEKE GL IK +R+D GGE+ S +F  YC ++ I+RQ+
Sbjct: 558  TWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDNGIRRQL 617

Query: 575  TFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSWPGTEPS 396
            T   +PQQNGV ERK   +  M  S L +K LP+E WA AV    ++ NR P+   +  +
Sbjct: 618  TVPRSPQQNGVAERKNRTILEMARSMLKSKKLPKEFWAEAVACAVYLTNRSPTRSVSGKT 677

Query: 395  PFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWKCMDPET 216
            P EA     P +S+ RVFG + + HV    ++KLD ++ + IF+GYD + KG+K  +P++
Sbjct: 678  PQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPDS 737

Query: 215  KKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFP----DGASRERGSNITPT 78
            +K  +SR+VVFDE       T+ +  D + FP    D   +++    TPT
Sbjct: 738  RKTIISRNVVFDEEGEWDWSTNCE--DHTFFPCVEEDDVEQQQQPQETPT 785


>KYP44533.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1342

 Score =  280 bits (717), Expect = 2e-80
 Identities = 154/410 (37%), Positives = 231/410 (56%), Gaps = 5/410 (1%)
 Frame = -3

Query: 1292 ISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVLFTGR 1113
            + N     + +VY+VP +  N+ S+ Q+ + G  +     N+ I  N     A V  T  
Sbjct: 381  LKNGEHQFISNVYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFIAKVPMTRN 440

Query: 1112 RKESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLFNEIHHD 933
            R   L + S     ++   +++S  LWH R GH+ ++ L+ +S K ++ G+P     H +
Sbjct: 441  RMFVLNIQSDGPQCLKMCYKDQS-WLWHLRFGHLNFKGLELLSKKAMVRGLPCIT--HPN 497

Query: 932  VVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLVDEFSRF 756
             VC GC  GK   L FP  S +RA   L+L+H+D+ GP K  S     Y ++ +D+FSR 
Sbjct: 498  QVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFIDDFSRK 557

Query: 755  TWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREHDIQRQM 576
            TWVYFL+ KSE F NF KFK  VEKE GL IK +R+D GGE+ S +F  YC ++ I+RQ+
Sbjct: 558  TWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDNGIRRQL 617

Query: 575  TFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSWPGTEPS 396
            T   +PQQNGV ERK   +  M  S L +K LP+E WA AV    ++ NR P+   +  +
Sbjct: 618  TVPRSPQQNGVAERKNRTILEMARSMLKSKKLPKEFWAEAVACAVYLTNRSPTRSVSGKT 677

Query: 395  PFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWKCMDPET 216
            P EA     P +S+ RVFG + + HV    ++KLD ++ + IF+GYD + KG+K  +P++
Sbjct: 678  PQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYKLYNPDS 737

Query: 215  KKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFP----DGASRERGSNITPT 78
            +K  +SR+VVFDE       T+ +  D + FP    D   +++    TPT
Sbjct: 738  RKTIISRNVVFDEEGEWDWSTNCE--DHTFFPCVEEDDVEQQQQPQETPT 785


>XP_016709245.1 PREDICTED: LOW QUALITY PROTEIN: pleiotropic drug resistance protein
            3-like [Gossypium hirsutum]
          Length = 2801

 Score =  280 bits (717), Expect = 3e-80
 Identities = 162/430 (37%), Positives = 237/430 (55%), Gaps = 7/430 (1%)
 Frame = -3

Query: 1310 GHFEADISNNRGV-SLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEA 1134
            G  +A IS   G+ ++ +V +VP + +NL SV Q+ + G  ++F  KN  I    K+   
Sbjct: 1709 GKGKALISTKSGIKTISEVLYVPDIDQNLLSVGQLLEKGYSLIFEGKNCLI----KNAAG 1764

Query: 1133 DVLFTGRRKESLYLLSASDAYVEK-TSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVP 957
            +VL T   ++  +++  +    +   SQ++   LWH R+GHV Y  L  +    L+  + 
Sbjct: 1765 EVLTTVAMQDRTFIVDVNQLQAKAYASQSDETDLWHRRMGHVNYNSLNMMQKMDLVSDMS 1824

Query: 956  LFNEIHHDVVCPGCQYGKSHCLPFPNSKN-RASAALQLVHSDLMGPTKTPSYSSFRYVMV 780
                   D VC  CQ GK   LPFP +K  RA   LQLVH+D+ GP KT S +  RY ++
Sbjct: 1825 KIEP--RDAVCEVCQLGKQTRLPFPVNKAWRAHGKLQLVHTDICGPMKTTSLNDSRYFVL 1882

Query: 779  LVDEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCR 600
             +D++SRF WV FL++KS+ + +F KFK  VE +   ++KC+R+DNG EY+S +F   C 
Sbjct: 1883 FIDDYSRFCWVNFLKHKSDVYGSFCKFKALVENQANCKLKCLRSDNGSEYVSQKFQKLCD 1942

Query: 599  EHDIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLP 420
            +  IQ Q+T   TPQQNGV ERK   +  M    L    +P   WA AV +  +++NRLP
Sbjct: 1943 DAGIQHQLTTVYTPQQNGVCERKNRTVLDMARCLLFEAKMPNAFWAEAVNTAVYLLNRLP 2002

Query: 419  SWPGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKG 240
            +      +PFEA +   PSVS+ +VFG +CY  V K  +TKLD R+   +FVGY   +KG
Sbjct: 2003 TNAVKGKTPFEAWFGQKPSVSHLKVFGCLCYVLVPKERRTKLDRRSMPGVFVGYSNVKKG 2062

Query: 239  WKCMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFPDG----ASRERGSNITPTKE 72
            ++  DP TKKV VSRDV F E SS + D     +     P+G       ++  N   T+ 
Sbjct: 2063 YRVFDPLTKKVVVSRDVKFSEASSWKWDGIEANL-----PEGEQIDVDLQQAENEEVTEN 2117

Query: 71   NIQEEETTGT 42
               +E   GT
Sbjct: 2118 GYDDEPVRGT 2127


>KZV33171.1 Integrase, catalytic core domain containing protein [Dorcoceras
            hygrometricum]
          Length = 702

 Score =  269 bits (688), Expect = 1e-79
 Identities = 148/396 (37%), Positives = 228/396 (57%), Gaps = 5/396 (1%)
 Frame = -3

Query: 1304 FEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEADVL 1125
            FEA   N     L DVY+VP LT N+ S+ Q+ +    +    + + I  +  ++ A V 
Sbjct: 242  FEA--KNGSHKVLSDVYYVPKLTSNILSIGQLLERNYKIYMEDRTLWIRDSDSNLIARVS 299

Query: 1124 FTGRRKESLYLLSASDA--YVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLF 951
             T   K +++ L   D      K+   + +  WH R GH+ +  L+ +   K++ G+P  
Sbjct: 300  MT---KNNMFQLDLKDCGPMCLKSFVQDPSWKWHMRFGHLNFGGLKALGDHKMVKGIPKI 356

Query: 950  NEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLV 774
            +  H D +C  C + K     FP  S +RA   LQLVH+D+ GP K  S+    Y ++ +
Sbjct: 357  D--HPDQLCEACLFSKHPRKSFPKKSLSRAIKPLQLVHADVCGPIKPQSFGKSCYFVLFI 414

Query: 773  DEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREH 594
            D+FSR TWVYFL+ KSEAF  F KFK  VEKE G +IK +RTD GGE+ S++F ++C  H
Sbjct: 415  DDFSRKTWVYFLKYKSEAFDAFKKFKTLVEKESGYEIKALRTDRGGEFTSNEFNSFCELH 474

Query: 593  DIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSW 414
             I+R +T   +PQQNGV ERK   + +M  + L +KN+P+E WA AV    ++ NR P+ 
Sbjct: 475  GIRRPLTVPRSPQQNGVAERKNRTILNMARTMLKSKNMPKEFWAEAVACAVYLSNRSPTK 534

Query: 413  PGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWK 234
                 +P EA    TP V + R+FG + YA V +  ++KLD R+R+ +F+GY+ + KG+K
Sbjct: 535  SLKNVTPQEAWSGQTPGVHHLRIFGSIAYAQVPEQERSKLDDRSRKLVFIGYNENSKGYK 594

Query: 233  CMDPETKKVDVSRDVVFDEVS--SLQIDTDRDTIDL 132
               P+++++ +SRDV FDE +  +    T+ D+ D+
Sbjct: 595  LFSPDSRRIVISRDVEFDEDATWNWSSKTENDSYDI 630


>KYP66220.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1331

 Score =  276 bits (706), Expect = 5e-79
 Identities = 152/416 (36%), Positives = 230/416 (55%), Gaps = 5/416 (1%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G+    + N     + ++Y+VP +  N+ S+ Q+ + G  +     N+ I  N       
Sbjct: 364  GNVLIQLKNGEHQFISNIYYVPSMKSNILSLGQLLEKGYDIQLKNNNLSIRDNTSRFITK 423

Query: 1130 VLFTGRRKESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLF 951
            V     R   L + S     ++   +++S  LWH R GH+ ++ L  +S K ++ G+P  
Sbjct: 424  VPMMRNRMFVLNIQSDGPQCLKMCYKDQS-WLWHLRFGHLNFKGLDLLSKKAMVRGLPCI 482

Query: 950  NEIHHDVVCPGCQYGKSHCLPFPN-SKNRASAALQLVHSDLMGPTKTPSYSSFRYVMVLV 774
               H + VC GC  GK   L FP  S +RA   L+L+H+D+ GP K  S     Y ++ +
Sbjct: 483  T--HPNQVCEGCLLGKQFRLSFPKESDSRAQKPLELIHTDVCGPIKPRSLGKSNYFLLFI 540

Query: 773  DEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREH 594
            D+FSR TWVYFL+ KSE F NF KFK  VEKE GL IK +R+D GGE+ S +F  YC ++
Sbjct: 541  DDFSRKTWVYFLKEKSEVFENFKKFKAHVEKESGLLIKALRSDRGGEFTSKEFQKYCEDN 600

Query: 593  DIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSW 414
             I+RQ+T   +PQQNGV ERK   +  M  S L +K LP+E WA AV    ++ NR P+ 
Sbjct: 601  GIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKKLPKEFWAEAVACAVYLTNRSPTR 660

Query: 413  PGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWK 234
              +  +P EA     P +S+ RVFG + + HV    ++KLD ++ + IF+GYD + KG+K
Sbjct: 661  SVSGKTPQEAWSGRKPGISHLRVFGSIAHVHVPDEKRSKLDDKSEKYIFIGYDANSKGYK 720

Query: 233  CMDPETKKVDVSRDVVFDEVSSLQIDTDRDTIDLSPFP----DGASRERGSNITPT 78
              +P+++K  +SR+VVFDE       T+ +  D + FP    D   +++    TPT
Sbjct: 721  LYNPDSRKTIISRNVVFDEEGEWDWSTNCE--DHTFFPCVEEDDVEQQQQPQETPT 774


>KYP57183.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 884

 Score =  270 bits (690), Expect = 1e-78
 Identities = 145/381 (38%), Positives = 213/381 (55%), Gaps = 1/381 (0%)
 Frame = -3

Query: 1310 GHFEADISNNRGVSLKDVYHVPGLTKNLASVSQITDAGRYVLFGPKNVQILSNIKHIEAD 1131
            G  +  + NN   ++ +V+++P L  NL S+ Q+ + G  ++      QI    K +  D
Sbjct: 317  GDIKFHMKNNTVHTISNVFYIPDLKSNLISMGQLQERGYIIIIQQSRCQIHHPEKGLIVD 376

Query: 1130 VLFTGRRKESLYLLSASDAYVEKTSQNESATLWHNRLGHVGYQLLQKISTKKLLDGVPLF 951
               T  R   +++          T   +   LWH R GH+ ++ L+ +  K +++G+P  
Sbjct: 377  AKMTANRMFPMHIQYDIQKCFS-TRVQDPTWLWHLRYGHLSFKGLKTLHEKNMVEGLPKI 435

Query: 950  NEIHHDVVCPGCQYGKSHCLPFPNSKN-RASAALQLVHSDLMGPTKTPSYSSFRYVMVLV 774
            N      +C  C  GK H   FP+ K  RA   LQLVHSD+ GP    S  + RY ++ +
Sbjct: 436  N--CPTEICEDCIVGKQHRDSFPHGKAWRAQQILQLVHSDICGPINPTSNGNKRYFIIFI 493

Query: 773  DEFSRFTWVYFLENKSEAFSNFVKFKEQVEKEFGLQIKCMRTDNGGEYMSDQFLNYCREH 594
            D+ SR TWVYFL+ KSEAF  F  FK +VEKE G  I+ +RTD GGE+ S  F ++C  H
Sbjct: 494  DDHSRKTWVYFLQEKSEAFLIFKSFKSRVEKESGKYIQILRTDRGGEFNSHNFASFCELH 553

Query: 593  DIQRQMTFRETPQQNGVVERKLAYLTSMCLSWLHTKNLPRELWAAAVQSTCHVINRLPSW 414
             IQRQ+T   TPQQNGV ERK   + +M  S L  KN+P+  W  AV  + H++NR P+ 
Sbjct: 554  GIQRQLTAAYTPQQNGVAERKNQTIMNMVRSMLVKKNIPKTFWPEAVNWSVHILNRSPTL 613

Query: 413  PGTEPSPFEALYHHTPSVSYFRVFGLVCYAHVSKTNQTKLDPRARRCIFVGYDTHRKGWK 234
                 +P +A     PSV +F++FG + YAHV    +TKLD ++ +C+FVG     K ++
Sbjct: 614  AVKNITPQQAWSEVKPSVDHFKIFGCIAYAHVPDEKRTKLDDKSVKCVFVGVSEESKAYR 673

Query: 233  CMDPETKKVDVSRDVVFDEVS 171
              +P TKK+ +SRDV+FDE S
Sbjct: 674  LYNPTTKKIIISRDVLFDEES 694

