BLASTX nr result

ID: Chrysanthemum21_contig00011858 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00011858
         (1468 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OTG19252.1| putative reverse transcriptase domain-containing ...   368   e-115
gb|PNY02492.1| auxilin-like protein, partial [Trifolium pratense]     359   e-111
ref|XP_021974097.1| uncharacterized protein LOC110869117 [Helian...   339   e-107
dbj|GAU44045.1| hypothetical protein TSUD_300150 [Trifolium subt...   340   e-103
gb|OTG02685.1| hypothetical protein HannXRQ_Chr13g0415811 [Helia...   332   e-102
ref|XP_022033756.1| uncharacterized protein LOC110935712 [Helian...   294   1e-85
gb|OTG04489.1| putative villin/Gelsolin [Helianthus annuus]           268   6e-83
gb|OTG26899.1| putative reverse transcriptase domain-containing ...   287   7e-83
gb|OTF99774.1| putative reverse transcriptase domain-containing ...   280   5e-82
dbj|GAU48609.1| hypothetical protein TSUD_327220 [Trifolium subt...   223   1e-64
gb|PNX55932.1| hypothetical protein L195_g049566 [Trifolium prat...   181   2e-51
gb|OTF86222.1| putative reverse transcriptase domain-containing ...   175   1e-43
gb|PNY15245.1| auxilin-like protein [Trifolium pratense]              169   1e-41
gb|PNX56805.1| hypothetical protein L195_g050071, partial [Trifo...   117   7e-28
gb|PNX61960.1| hypothetical protein L195_g060921, partial [Trifo...   100   6e-22
gb|PNX86883.1| hypothetical protein L195_g042966 [Trifolium prat...    97   1e-20
dbj|GAU50575.1| hypothetical protein TSUD_409990 [Trifolium subt...   101   7e-19
gb|PNY11520.1| hypothetical protein L195_g008128 [Trifolium prat...    91   2e-17
dbj|GAU47853.1| hypothetical protein TSUD_404300 [Trifolium subt...    91   9e-16
ref|XP_021743396.1| uncharacterized protein LOC110709482 [Chenop...    88   2e-15

>gb|OTG19252.1| putative reverse transcriptase domain-containing protein [Helianthus
            annuus]
          Length = 806

 Score =  368 bits (944), Expect = e-115
 Identities = 194/362 (53%), Positives = 235/362 (64%), Gaps = 3/362 (0%)
 Frame = +3

Query: 390  GAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAMEE 569
            G WPCPFR FHCCPDG VG KG PRL++HIK  HL   ER+D LR A++ D  L+ ++ E
Sbjct: 8    GFWPCPFRRFHCCPDGLVGAKGFPRLMAHIKSHHLGSDERKDSLRCAIADDLNLFTSVCE 67

Query: 570  TLEVFGQWMCGKCMKLHAISRACHHPDGLIRFTDRV---SGHIIGIVKPSIIVPKTNDHE 740
             L V GQW+CG+CM  HA SRACHH D +IRF        G I+GI +P +    T+  E
Sbjct: 68   ALRVSGQWLCGECMCPHAFSRACHHEDKVIRFVPGERDEQGFIVGITRPGVENVDTSSEE 127

Query: 741  SFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLLLLPR 920
                      RVF  PI TVKSIP  CRMAF+Q L   L +VVA P S+ AWV+LL+LPR
Sbjct: 128  -LGVDIALLDRVFSLPIKTVKSIPLSCRMAFAQVLTAALDKVVAMPDSVEAWVRLLILPR 186

Query: 921  CTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNILDGSGPVPLNQGS 1100
            CTL+VFKP  RQ+ RSGNRK  Q  SI  SLA  G  +G A LV+++ D       +   
Sbjct: 187  CTLRVFKPVGRQDKRSGNRKTGQCLSIQQSLAQWGDREGFATLVQSLFDQPARGVTDGIK 246

Query: 1101 VDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXXXXXX 1280
               + +   G TN++ CLRKVADGHFTAAVKVLCSSGVAP N +T+ AL A         
Sbjct: 247  KGTEDDNECGGTNVKQCLRKVADGHFTAAVKVLCSSGVAPLNKNTLEALVAKHPCMPPPS 306

Query: 1281 XXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVASGLLCAI 1460
               ++ +EP LVVE D V+GCIKSFPKGTSCGRDGLR QH+LDA CG+GS +A  LL A 
Sbjct: 307  MPASLPSEPPLVVESDCVLGCIKSFPKGTSCGRDGLRVQHLLDAFCGEGSVIADSLLRAT 366

Query: 1461 TA 1466
            +A
Sbjct: 367  SA 368


>gb|PNY02492.1| auxilin-like protein, partial [Trifolium pratense]
          Length = 909

 Score =  359 bits (922), Expect = e-111
 Identities = 194/368 (52%), Positives = 236/368 (64%), Gaps = 9/368 (2%)
 Frame = +3

Query: 390  GAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAMEE 569
            G   CPFR FHCC +G  G+KG   ++SH+K  HL   E +  LREA+ +D  L+M +EE
Sbjct: 7    GGLHCPFREFHCCSNGREGSKGISHMVSHLKIQHLCSDEHKSTLREAIKSDLSLFMTLEE 66

Query: 570  TLEVFGQWMCGKCMKLHAISRACHHPDGLIRFT---DRVSGHIIGIVKPSIIVPKTND-- 734
            +L    QW+ G+CM +HA+SRACHH D L+R T     V  HI+ I KPS     TND  
Sbjct: 67   SLRGLRQWLYGRCMTIHALSRACHHSDELVRVTLDSGDVGSHIVDITKPS-----TNDID 121

Query: 735  ----HESFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVK 902
                 E          RV +API TVKSIPH CR+AFS+ALK  LY+VV   GSI AWV+
Sbjct: 122  TLGAKEGLVLDAGLLERVLQAPICTVKSIPHSCRLAFSRALKEALYKVVVETGSINAWVQ 181

Query: 903  LLLLPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNILDGSGPV 1082
            LLLLPRCTLQV KP NRQ+ RSGNRK+ Q   IL  LAT  + DG++ LV  +L   G  
Sbjct: 182  LLLLPRCTLQVVKPQNRQDRRSGNRKSLQQHHILECLATWRETDGLSKLVDRVLGSYGQE 241

Query: 1083 PLNQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXX 1262
                G  +I++E      +I+ CLRKVADGHFTAAVKVL SSGVAP+N DT+  L     
Sbjct: 242  GQGYGKDNIEEETQ--KMSIKKCLRKVADGHFTAAVKVLGSSGVAPYNEDTMKILGDKHP 299

Query: 1263 XXXXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVAS 1442
                     T ++E  LVV+VD V  CIKSFPKGTSCGRDGLR+QH+LDALCG+GSAVA 
Sbjct: 300  YMPPPSMPTTNFSEAPLVVDVDTVFRCIKSFPKGTSCGRDGLRAQHLLDALCGEGSAVAR 359

Query: 1443 GLLCAITA 1466
             LL  IT+
Sbjct: 360  DLLDVITS 367


>ref|XP_021974097.1| uncharacterized protein LOC110869117 [Helianthus annuus]
          Length = 578

 Score =  339 bits (870), Expect = e-107
 Identities = 181/373 (48%), Positives = 232/373 (62%), Gaps = 14/373 (3%)
 Frame = +3

Query: 387  DGAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAME 566
            +G  PCPFRNFH C DG VG+KG  RL+SH +R HL    R+D LR+A++ D  LY  + 
Sbjct: 12   EGVRPCPFRNFHSCKDGFVGSKGYARLLSHFQRDHLKSDNRKDTLRDAIAKDPDLYEDVG 71

Query: 567  ETLEVFGQWMCGKCMKLHAISRACHHPDGLIRFTDRVSG---HIIGIVKPSIIVPKTNDH 737
            ETL+    W+CG+CM +HA+SR CHH D L++F     G    I+GI KP +   + +  
Sbjct: 72   ETLKTLNNWLCGECMVVHALSRGCHHVDNLVKFVPSSRGTEDFIVGIPKPQVSGGRASGE 131

Query: 738  --ESFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLLL 911
               S         RVF  PI TVKSIP  CRMAF+QAL + + +V+A PGS+  WVKLLL
Sbjct: 132  GIASVGVDGSLLERVFSLPITTVKSIPPSCRMAFTQALTSAMRKVIATPGSVEYWVKLLL 191

Query: 912  LPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILV---------KNIL 1064
            LPRCTL+V +PTNRQE RSGNRK  Q  SI  +L+     +G++ LV         K ++
Sbjct: 192  LPRCTLKVVRPTNRQERRSGNRKTLQCSSIQRALSVWKDGEGISELVDSLFQDVKEKGVV 251

Query: 1065 DGSGPVPLNQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMA 1244
            +GS P P  +G          G TN++ CLRKVADGHFTAAVKVL SSGVAP    T+ A
Sbjct: 252  NGSDPFPEGRG----------GRTNVKQCLRKVADGHFTAAVKVLSSSGVAPFGPSTMEA 301

Query: 1245 LEAXXXXXXXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALCGD 1424
            L                  +  L V+ + V+ CIKSFPKGTSCGRDGLR+QH+LD+ CG+
Sbjct: 302  LADKHPVMPPPVVPADTIPQAPLTVDAECVLECIKSFPKGTSCGRDGLRAQHLLDSFCGE 361

Query: 1425 GSAVASGLLCAIT 1463
            GS+ +SGL+ AIT
Sbjct: 362  GSSTSSGLVLAIT 374


>dbj|GAU44045.1| hypothetical protein TSUD_300150 [Trifolium subterraneum]
          Length = 961

 Score =  340 bits (871), Expect = e-103
 Identities = 179/329 (54%), Positives = 221/329 (67%), Gaps = 4/329 (1%)
 Frame = +3

Query: 489  HLSLAERRDVLREALSTDHGLYMAMEETLEVFGQWMCGKCMKLHAISRACHHPDGLIRFT 668
            HL   ER+  +R+A+ +D GL++AMEE+L   G+W+CGKCM +HA+SRACHHPDG IR T
Sbjct: 108  HLCDDERKSTIRDAIESDLGLFLAMEESLRGLGKWLCGKCMSIHALSRACHHPDGFIRVT 167

Query: 669  ---DRVSGHIIGIVKPSIIVPKTND-HESFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFS 836
                 V  HI+GI+KPS   P T D  +          R+ + PI+TVKSIPH CR+AFS
Sbjct: 168  LTTGEVESHIVGILKPSTNDPDTFDVRDELVFDASLLERILQVPILTVKSIPHSCRLAFS 227

Query: 837  QALKTTLYEVVAHPGSIGAWVKLLLLPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLA 1016
            QALK  LY+VV  P S+ AWV+LLLLP CTLQV KP NR++ RSGNRK+ Q    L  LA
Sbjct: 228  QALKEALYKVVVAPSSVSAWVQLLLLPWCTLQVVKPQNRRDRRSGNRKSLQQHHTLGCLA 287

Query: 1017 T*GKEDGVAILVKNILDGSGPVPLNQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKV 1196
            T  +  G+  LV + LD  G   L  G  +I++E +  NTNI  CLRKVADGHFTAAVKV
Sbjct: 288  TWREPGGLTKLVHSALDNYGREGLGSGRENIEEENSKMNTNIEQCLRKVADGHFTAAVKV 347

Query: 1197 LCSSGVAPHNGDTVMALEAXXXXXXXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCG 1376
            L SSGVAP+N DT+  LE             T + E  LVV+VD V+  I+SFPKGTSCG
Sbjct: 348  LGSSGVAPYNEDTLKILEEKHPYMPPPSTPTTRFVEAPLVVDVDIVLKSIQSFPKGTSCG 407

Query: 1377 RDGLRSQHILDALCGDGSAVASGLLCAIT 1463
            R GLR+QH+LDA+CG+GS VA  LL AIT
Sbjct: 408  RYGLRAQHLLDAMCGEGSPVARDLLDAIT 436


>gb|OTG02685.1| hypothetical protein HannXRQ_Chr13g0415811 [Helianthus annuus]
          Length = 813

 Score =  332 bits (852), Expect = e-102
 Identities = 178/365 (48%), Positives = 236/365 (64%), Gaps = 6/365 (1%)
 Frame = +3

Query: 387  DGAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAME 566
            +G W CPFRNFH C DG+ G+ G  RLI+H ++ H    +R++ L+ ALS D  L+  + 
Sbjct: 24   NGVWLCPFRNFHKCKDGKPGSSGYNRLIAHFQQFHFK-DDRKESLQGALSKDLELFSNVG 82

Query: 567  ETLEVFGQWMCGKCMKLHAISRACHHPDGLIRFTDRVSGH----IIGIVKPS--IIVPKT 728
            ETL V G W+CGKCMK HA+SR CHHPDG+  F+ R SG     ++GI +P   ++V + 
Sbjct: 83   ETLRVLGYWLCGKCMKTHALSRGCHHPDGVFTFS-RKSGSDEDFVVGIPRPQDPVMVVEL 141

Query: 729  NDHESFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLL 908
               +          RVF  P+ TVKSIP  CR+AF+QAL   +++VVA PG++ +WVKLL
Sbjct: 142  RAPQGVMGDVDLFERVFSLPVRTVKSIPPSCRLAFAQALTGAIHKVVASPGTVESWVKLL 201

Query: 909  LLPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNILDGSGPVPL 1088
            LLPRCTL+V KP++RQE RSGNRK+ Q  SIL +LA   +  G  +LV ++L   G   +
Sbjct: 202  LLPRCTLKVVKPSSRQERRSGNRKSLQCDSILRALAMWKEGSGFEVLVNSLLADIGEGVM 261

Query: 1089 NQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXX 1268
              G    +  +   + N++ CLRKV+DGHFT AVKVLCSSGVAP    T+ AL       
Sbjct: 262  RGGKAQREIAEEV-SPNMKQCLRKVSDGHFTVAVKVLCSSGVAPRGESTMQALIGKHPFA 320

Query: 1269 XXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVASGL 1448
                   T  ++PA+ V+ D V  C+KSFPKGTSCGRDGL++QH+LDAL G+GSA ASGL
Sbjct: 321  PPPNLPSTPLSQPAVSVDEDCVHKCVKSFPKGTSCGRDGLQAQHLLDALSGEGSATASGL 380

Query: 1449 LCAIT 1463
            L  IT
Sbjct: 381  LTDIT 385


>ref|XP_022033756.1| uncharacterized protein LOC110935712 [Helianthus annuus]
 ref|XP_022033758.1| uncharacterized protein LOC110935712 [Helianthus annuus]
 ref|XP_022033759.1| uncharacterized protein LOC110935712 [Helianthus annuus]
          Length = 1081

 Score =  294 bits (753), Expect = 1e-85
 Identities = 162/363 (44%), Positives = 222/363 (61%), Gaps = 5/363 (1%)
 Frame = +3

Query: 390  GAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAMEE 569
            G   CPF+ FH C DG++G+ G  RL+ H++  H     R+  L+EA+S D  L++ + E
Sbjct: 22   GLRTCPFKFFHECKDGKLGSAGFFRLLEHMQSTHFKTENRKLTLKEAISRDVDLFLDVGE 81

Query: 570  TLEVFGQWMCGKCMKLHAISRACHHPDGLIRF---TDRVSGHIIGIVKPSIIVPKTNDHE 740
             L    +W+CG+CM +HA+SR C H D ++ F   +  V   I+GI KP  +V  T+   
Sbjct: 82   VLRAGDKWLCGRCMVMHALSRGCKHDDEVVSFAIVSGDVEDFIVGIRKPCQVVVDTSPCH 141

Query: 741  --SFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLLLL 914
              +         RVF  PI TVKSIP  CR+ F+Q L   L +VV  PGS+  WV+LLLL
Sbjct: 142  PATVGGGVNLLERVFSLPIQTVKSIPPSCRLMFAQVLTGALRKVVVSPGSVENWVQLLLL 201

Query: 915  PRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNILDGSGPVPLNQ 1094
            PRCTL+V +P++RQE RSGNRK+ Q  +IL +L       G   LV +++D  G     +
Sbjct: 202  PRCTLRVVRPSSRQERRSGNRKSLQYNNILHALTIWKDGSGFDELVSSLVDSVGEAGTLR 261

Query: 1095 GSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXXXX 1274
                ++ +K+  + NI+ CLRKV DGHFTAAVKVL SSGVAP    T+ AL         
Sbjct: 262  RECRMEGDKDK-DPNIKQCLRKVRDGHFTAAVKVLSSSGVAPLCDSTLKALIDKHPVVAP 320

Query: 1275 XXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVASGLLC 1454
                   +A+P LVV+ + V+ CI+SFPKGTSCGRDG+R+QH+LDA+ G+GS  +SGLL 
Sbjct: 321  PSLPPNPHAQPTLVVDDECVLKCIRSFPKGTSCGRDGMRAQHLLDAIGGEGSMASSGLLT 380

Query: 1455 AIT 1463
            AIT
Sbjct: 381  AIT 383


>gb|OTG04489.1| putative villin/Gelsolin [Helianthus annuus]
          Length = 299

 Score =  268 bits (684), Expect = 6e-83
 Identities = 147/259 (56%), Positives = 163/259 (62%)
 Frame = +3

Query: 417  FHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAMEETLEVFGQWM 596
            FHCCPDGEVGNKG  RLISH+KRLHLS  ERR  LREA+STDHGL+  +E          
Sbjct: 2    FHCCPDGEVGNKGISRLISHLKRLHLSSDERRSALREAISTDHGLFKNVE---------- 51

Query: 597  CGKCMKLHAISRACHHPDGLIRFTDRVSGHIIGIVKPSIIVPKTNDHESFXXXXXXXXRV 776
                                            GI +PS     TN HESF        RV
Sbjct: 52   --------------------------------GISRPSPKAHVTNVHESFLLDTVLLDRV 79

Query: 777  FKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLLLLPRCTLQVFKPTNRQ 956
            FKAPIVTVKSIPH CR+ FSQALKT LY+VVA P S+ AWV+L LLPRCTLQV +P NRQ
Sbjct: 80   FKAPIVTVKSIPHSCRLVFSQALKTALYKVVAQPRSVEAWVRLFLLPRCTLQVVRPKNRQ 139

Query: 957  ESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNILDGSGPVPLNQGSVDIQKEKNAGNT 1136
            E RSGNRKA Q RSIL+SLAT GKEDG+ +LVKNI D      L    VD  +E +  NT
Sbjct: 140  ERRSGNRKALQQRSILNSLATWGKEDGIFMLVKNIFDSPTVGSLGPRGVDNPEESSVSNT 199

Query: 1137 NIRACLRKVADGHFTAAVK 1193
            N+R CLRKVADGHFTAAVK
Sbjct: 200  NVRQCLRKVADGHFTAAVK 218


>gb|OTG26899.1| putative reverse transcriptase domain-containing protein [Helianthus
            annuus]
          Length = 1081

 Score =  287 bits (734), Expect = 7e-83
 Identities = 162/363 (44%), Positives = 215/363 (59%), Gaps = 5/363 (1%)
 Frame = +3

Query: 390  GAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAMEE 569
            G   CP ++FH C DG+ G+ G  RL+ H++  HL    R+  ++EA++ D  L++ + E
Sbjct: 22   GLRACPLKHFHECKDGKPGSSGFFRLLEHMQSTHLKTENRKIHVKEAITKDLDLFLDVGE 81

Query: 570  TLEVFGQWMCGKCMKLHAISRACHHPDGLIRFTDR---VSGHIIGIVKPSIIVPKTN--D 734
             L   G+W+CG+CM +HA+SR C H D ++ F      V   I+GI KP  +V  T    
Sbjct: 82   VLREGGKWLCGECMVMHALSRGCKHGDEMVSFAPTSGDVDDFIVGIRKPCQVVADTFTLS 141

Query: 735  HESFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLLLL 914
             E          RVF     TVK IP  CR+AF+Q L   L +VVA PGSI  WV+LLLL
Sbjct: 142  PEGVVTDVALLERVFSLRFQTVKCIPPSCRLAFAQVLTGALRKVVASPGSIENWVQLLLL 201

Query: 915  PRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNILDGSGPVPLNQ 1094
            P CTL+V +P++RQE RSGNRK+ Q  SIL +LA      G   LV ++L+  G     +
Sbjct: 202  PCCTLRVVRPSSRQERRSGNRKSLQCSSILQALALWKDGSGFTDLVSSLLNSGGTFGCLR 261

Query: 1095 GSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXXXX 1274
            G    + E N  + NI+ CLRKV DGHFTAAVKVLCSSGVAP    T+ AL         
Sbjct: 262  G-FSGRGEDNKKDPNIKQCLRKVRDGHFTAAVKVLCSSGVAPVGESTMKALIDKHPVVPP 320

Query: 1275 XXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVASGLLC 1454
                    A+P ++   D V+ CI+SFP+GTSCGRDG+R+QH+LDA+  +GS   SGLL 
Sbjct: 321  PSLPSNPIAQPTIIAAEDCVLKCIQSFPRGTSCGRDGMRAQHLLDAIGCEGSVSFSGLLT 380

Query: 1455 AIT 1463
            AIT
Sbjct: 381  AIT 383


>gb|OTF99774.1| putative reverse transcriptase domain-containing protein [Helianthus
            annuus]
          Length = 796

 Score =  280 bits (715), Expect = 5e-82
 Identities = 160/336 (47%), Positives = 199/336 (59%), Gaps = 3/336 (0%)
 Frame = +3

Query: 468  ISHIKRLHLSLAERRDVLREALSTDHGLYMAMEETLEVFGQWMCGKCMKLHAISRACHHP 647
            ++HIK  HL   ER+D LR               +L V GQW+CG+CM  HA SRA HH 
Sbjct: 1    MAHIKSHHLGSDERKDSLR---------------SLRVSGQWLCGECMCPHAFSRAYHHE 45

Query: 648  DGLIRFTDRV---SGHIIGIVKPSIIVPKTNDHESFXXXXXXXXRVFKAPIVTVKSIPHG 818
            D +IRF        G I+GI +P +     +  E          RV   PI +VKSIP  
Sbjct: 46   DKVIRFVPGERDEQGFIVGITRPGVENVDISSKE-LGVDIALLDRVLSLPIKSVKSIPLS 104

Query: 819  CRMAFSQALKTTLYEVVAHPGSIGAWVKLLLLPRCTLQVFKPTNRQESRSGNRKAAQNRS 998
            CRMAF+Q L   L +VVA P S+ AWV+LL+LPRCTL+VFKP  R++ RSGNRK  Q  S
Sbjct: 105  CRMAFAQILTAALDKVVAMPDSVEAWVRLLILPRCTLRVFKPAGRRDKRSGNRKMGQCLS 164

Query: 999  ILSSLAT*GKEDGVAILVKNILDGSGPVPLNQGSVDIQKEKNAGNTNIRACLRKVADGHF 1178
            I  SLA  G  +G A LV+++ D       +      + +   G TN++ CL KVADGHF
Sbjct: 165  IQRSLAKWGDREGFATLVQSLFDQPARGVTDGIKKGTEIDNECGGTNVKQCLHKVADGHF 224

Query: 1179 TAAVKVLCSSGVAPHNGDTVMALEAXXXXXXXXXXXXTIYAEPALVVEVDNVIGCIKSFP 1358
            T AVKVLCSSGVAP N +T+ AL A            ++  E  LVV  D V+GCIKSFP
Sbjct: 225  TTAVKVLCSSGVAPLNKNTLDALVAKHPCMPPPSMPASLPFESPLVVVTDCVLGCIKSFP 284

Query: 1359 KGTSCGRDGLRSQHILDALCGDGSAVASGLLCAITA 1466
            KGTSCGRDGLR+QH+LDA CG+GS +A  LL A +A
Sbjct: 285  KGTSCGRDGLRAQHLLDAFCGEGSVIADSLLRATSA 320


>dbj|GAU48609.1| hypothetical protein TSUD_327220 [Trifolium subterraneum]
          Length = 362

 Score =  223 bits (567), Expect = 1e-64
 Identities = 115/195 (58%), Positives = 140/195 (71%)
 Frame = +3

Query: 879  GSIGAWVKLLLLPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKN 1058
            GS+ AWV+LLLLPRCTL V KP NR++ RSGNRK++Q   +L  LAT  + DG+A LV +
Sbjct: 150  GSVSAWVQLLLLPRCTLHVVKPQNRRDRRSGNRKSSQQHHVLGCLATWREPDGLAKLVHS 209

Query: 1059 ILDGSGPVPLNQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTV 1238
            +LD  G   L  G  +I++E +  NTNI+ CLRKVADGHFTAAVKVL SSGVAP+N DT+
Sbjct: 210  VLDNYGHAGLGSGRENIEEENSKMNTNIKQCLRKVADGHFTAAVKVLGSSGVAPYNEDTL 269

Query: 1239 MALEAXXXXXXXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALC 1418
              LE             T + E  LVVEVD V+ CI+SFPKGTSCGRDGLR+QH+LDA+C
Sbjct: 270  KILEEKHPYMSPPTAPTTRFIEAPLVVEVDTVLKCIQSFPKGTSCGRDGLRAQHLLDAMC 329

Query: 1419 GDGSAVASGLLCAIT 1463
            G+GS VA  LL AIT
Sbjct: 330  GEGSPVARDLLDAIT 344


>gb|PNX55932.1| hypothetical protein L195_g049566 [Trifolium pratense]
          Length = 161

 Score =  181 bits (460), Expect = 2e-51
 Identities = 89/180 (49%), Positives = 113/180 (62%), Gaps = 3/180 (1%)
 Frame = +3

Query: 390 GAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAMEE 569
           G   CPFR FHCC DG  G KG   ++ H+K  HL   ERR +LREA+ +D GL+MA+E+
Sbjct: 7   GGLHCPFREFHCCSDGREGGKGVSHMVPHLKAPHLCSDERRSILREAIVSDLGLFMAVEK 66

Query: 570 TLEVFGQWMCGKCMKLHAISRACHHPDGLIRFT---DRVSGHIIGIVKPSIIVPKTNDHE 740
           +L+   QW+CG+CM +HA+SR CHH DGL+R T   + V+ HII                
Sbjct: 67  SLKALKQWLCGRCMHIHALSRTCHHFDGLVRVTLGSEEVTSHII---------------- 110

Query: 741 SFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLLLLPR 920
                      V +API T+KSIPH CR+AFSQALK  LY+VVA P S+ AWV+LLLLPR
Sbjct: 111 ---------EEVLQAPIFTMKSIPHSCRIAFSQALKEALYKVVAEPSSVSAWVQLLLLPR 161


>gb|OTF86222.1| putative reverse transcriptase domain-containing protein [Helianthus
            annuus]
          Length = 884

 Score =  175 bits (444), Expect = 1e-43
 Identities = 94/186 (50%), Positives = 128/186 (68%), Gaps = 1/186 (0%)
 Frame = +3

Query: 906  LLLPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNIL-DGSGPV 1082
            ++LPRCTL+VF+P+ RQE +SGNRK+ Q  +I  +L+  G   G   L++++  D   P 
Sbjct: 1    MVLPRCTLKVFRPSCRQERKSGNRKSLQCSAIQRALSVWGGGAGCCDLIRDLFADQGSPS 60

Query: 1083 PLNQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXX 1262
            P +   +     ++  + N++ CLRKVADGHFTAAVKVL SSGVAP NG T+ AL     
Sbjct: 61   PASDKGLG-GAARSETSVNVKQCLRKVADGHFTAAVKVLSSSGVAPCNGATMQALLDKHP 119

Query: 1263 XXXXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVAS 1442
                      ++A+ +LVV+V++V+G IKSFPKGTSCGRDGLR+QH+LDALCG+GS VA 
Sbjct: 120  VLPPPLLPGPLFADQSLVVDVEDVLGGIKSFPKGTSCGRDGLRAQHLLDALCGEGSVVAG 179

Query: 1443 GLLCAI 1460
            G+L AI
Sbjct: 180  GVLQAI 185


>gb|PNY15245.1| auxilin-like protein [Trifolium pratense]
          Length = 793

 Score =  169 bits (428), Expect = 1e-41
 Identities = 86/186 (46%), Positives = 113/186 (60%), Gaps = 4/186 (2%)
 Frame = +3

Query: 390 GAWPCPFRNFHCCPDGEVGNKGTPRLISHIKRLHLSLAERRDVLREALSTDHGLYMAMEE 569
           G+  CP R FHCC DG  G KG   ++ H+K LHL   ERR +L EA+  + GL+M +E+
Sbjct: 7   GSLICPLREFHCCSDGREGGKGVSHMVPHLKALHLYSDERRSMLWEAIVREFGLFMVVEK 66

Query: 570 TLEVFGQWMCGKCMKLHAISRACHHPDGLIRFT---DRVSGHIIGIVKPSIIVPKT-NDH 737
           +L+   QW+CG+CM +HA+SR CHH DGL+  T   + V+ HII I KPSI  P   +  
Sbjct: 67  SLKALKQWLCGRCMHIHALSRTCHHSDGLVCVTLESEEVTSHIICIPKPSIKEPDVLSAK 126

Query: 738 ESFXXXXXXXXRVFKAPIVTVKSIPHGCRMAFSQALKTTLYEVVAHPGSIGAWVKLLLLP 917
           +           V +API TVKSIPH CR+ FSQALK  LY+VV  P S+   V L L  
Sbjct: 127 DGLVLDARLLEEVLQAPIFTVKSIPHSCRLTFSQALKEALYKVVTEPSSV---VNLWLGG 183

Query: 918 RCTLQV 935
           RC + +
Sbjct: 184 RCPMSL 189


>gb|PNX56805.1| hypothetical protein L195_g050071, partial [Trifolium pratense]
          Length = 129

 Score =  117 bits (294), Expect = 7e-28
 Identities = 65/121 (53%), Positives = 78/121 (64%), Gaps = 10/121 (8%)
 Frame = +3

Query: 1131 NTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXXXXXXXXXTIYAEPA 1310
            NTNI+  LRKVADGHFT AVKVL SSGVAP+N DT+   E             T++ E  
Sbjct: 2    NTNIKKSLRKVADGHFTVAVKVLGSSGVAPYNEDTMKVSEEKHPYKSPPSAPTTMFVEAP 61

Query: 1311 LVVEV----------DNVIGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVASGLLCAI 1460
            L V+V          D V+ CI+SFPKGTS GRDGLR+Q++LDA+CG+GS VA  LL  I
Sbjct: 62   LAVKVDIVLKCIQSXDTVLKCIQSFPKGTSGGRDGLRAQYLLDAMCGEGSPVARDLLDVI 121

Query: 1461 T 1463
            T
Sbjct: 122  T 122


>gb|PNX61960.1| hypothetical protein L195_g060921, partial [Trifolium pratense]
          Length = 87

 Score =  100 bits (248), Expect = 6e-22
 Identities = 49/86 (56%), Positives = 58/86 (67%)
 Frame = +3

Query: 1131 NTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXXXXXXXXXTIYAEPA 1310
            NTNI+ CLRK ADGHFT AVKVL SSGVAP+N D +  LE             T++ E  
Sbjct: 2    NTNIKQCLRKFADGHFTVAVKVLGSSGVAPYNEDAMKVLEEKHPYRPPPSAPTTMFVEAP 61

Query: 1311 LVVEVDNVIGCIKSFPKGTSCGRDGL 1388
            L  ++D V+ CI+SFPKGTSCGRDGL
Sbjct: 62   LAAKIDIVLKCIQSFPKGTSCGRDGL 87


>gb|PNX86883.1| hypothetical protein L195_g042966 [Trifolium pratense]
          Length = 109

 Score = 97.1 bits (240), Expect = 1e-20
 Identities = 52/99 (52%), Positives = 64/99 (64%)
 Frame = +3

Query: 900  KLLLLPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGVAILVKNILDGSGP 1079
            +LL+LPRCTLQV KP  R++ RSGNRK+ Q   IL  LAT  + DG+A LV  +L   G 
Sbjct: 4    QLLVLPRCTLQVVKPQTRRDRRSGNRKSLQQHHILGCLATWREPDGLAKLVDGVLGNYGQ 63

Query: 1080 VPLNQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKV 1196
              L  G  + ++E    NT I+ CLRKVADGHF AAV V
Sbjct: 64   ESLGYGKDNNEEENTKMNTKIKQCLRKVADGHFIAAVLV 102


>dbj|GAU50575.1| hypothetical protein TSUD_409990 [Trifolium subterraneum]
          Length = 1404

 Score =  101 bits (252), Expect = 7e-19
 Identities = 53/89 (59%), Positives = 62/89 (69%)
 Frame = +3

Query: 1197 LCSSGVAPHNGDTVMALEAXXXXXXXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGTSCG 1376
            L SSGVAP+N DT+  L              T++ E  LVV+VD V  CIKSFPKGTSCG
Sbjct: 617  LGSSGVAPYNEDTMKMLGDKHPYKPPPSVPTTLFFEAPLVVDVDTVFRCIKSFPKGTSCG 676

Query: 1377 RDGLRSQHILDALCGDGSAVASGLLCAIT 1463
            RDGLR+QH+LDALCG+GSA+A  LL AIT
Sbjct: 677  RDGLRAQHLLDALCGEGSAMAIDLLDAIT 705


>gb|PNY11520.1| hypothetical protein L195_g008128 [Trifolium pratense]
          Length = 204

 Score = 90.9 bits (224), Expect = 2e-17
 Identities = 60/129 (46%), Positives = 74/129 (57%)
 Frame = +3

Query: 861  EVVAHPGSIGAWVKLLLLPRCTLQVFKPTNRQESRSGNRKAAQNRSILSSLAT*GKEDGV 1040
            EVVA P S+G WV+LLLL RC+LQV KP  R++ R+G  K+ Q   IL  LA   + DGV
Sbjct: 2    EVVAEPSSVGDWVQLLLLSRCSLQVVKPQTRRDCRTG-IKSLQQHHILCYLARWRETDGV 60

Query: 1041 AILVKNILDGSGPVPLNQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAP 1220
            A LV ++LD  G    ++G                       DGHFTAAVKVL SSGV+P
Sbjct: 61   AKLVDSMLDNYG----HEGQ----------------------DGHFTAAVKVLGSSGVSP 94

Query: 1221 HNGDTVMAL 1247
             NG T+ AL
Sbjct: 95   CNGGTLKAL 103


>dbj|GAU47853.1| hypothetical protein TSUD_404300 [Trifolium subterraneum]
          Length = 610

 Score = 91.3 bits (225), Expect = 9e-16
 Identities = 48/93 (51%), Positives = 58/93 (62%)
 Frame = +3

Query: 1089 NQGSVDIQKEKNAGNTNIRACLRKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXX 1268
            N G  +I++E     TNI+ CL KVADGHFTAAVKV  SSGVAP+N DT+  L       
Sbjct: 54   NYGKDNIEEETQKMTTNIKQCLHKVADGHFTAAVKVFGSSGVAPYNEDTMKILGDKYPYK 113

Query: 1269 XXXXXXXTIYAEPALVVEVDNVIGCIKSFPKGT 1367
                   T+++E  LVV+VD V  CI SFPKGT
Sbjct: 114  PPPSVSTTLFSEAPLVVDVDTVFRCITSFPKGT 146


>ref|XP_021743396.1| uncharacterized protein LOC110709482 [Chenopodium quinoa]
          Length = 339

 Score = 88.2 bits (217), Expect = 2e-15
 Identities = 48/104 (46%), Positives = 64/104 (61%)
 Frame = +3

Query: 1155 RKVADGHFTAAVKVLCSSGVAPHNGDTVMALEAXXXXXXXXXXXXTIYAEPALVVEVDNV 1334
            RK+ DGH+TAAV+VL SSGVAP+N  T+  L++             +     LV      
Sbjct: 3    RKICDGHYTAAVRVLSSSGVAPYNDATLADLQSKHPTSPAPSLPDILVDGLHLVTTFVVA 62

Query: 1335 IGCIKSFPKGTSCGRDGLRSQHILDALCGDGSAVASGLLCAITA 1466
            +  IKSFP+GTSCGRDGLR+QH+LD L G   AV+  L+ +I+A
Sbjct: 63   LDRIKSFPRGTSCGRDGLRAQHLLDCLSGAAVAVSDELVDSISA 106


Top