BLASTX nr result

ID: Astragalus22_contig00025015 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00025015
         (1989 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX77624.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   422   e-140
gb|PNX89231.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   419   e-138
gb|KYP53324.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   429   e-137
gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium prat...   430   e-136
gb|AAO23078.1| polyprotein [Glycine max]                              441   e-135
gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   430   e-132
gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   429   e-132
gb|KYP53386.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   414   e-132
gb|PNY16670.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   413   e-131
gb|PNY00079.1| hypothetical protein L195_g023353, partial [Trifo...   409   e-131
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   429   e-130
gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   419   e-128
dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subt...   421   e-128
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   419   e-127
gb|PNY14541.1| hypothetical protein L195_g011223 [Trifolium prat...   402   e-127
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...   417   e-126
dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte...   416   e-125
dbj|GAU47333.1| hypothetical protein TSUD_101210 [Trifolium subt...   405   e-125
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...   415   e-125
gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   403   e-125

>gb|PNX77624.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
 gb|PNY16672.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 367

 Score =  422 bits (1086), Expect = e-140
 Identities = 202/330 (61%), Positives = 252/330 (76%), Gaps = 1/330 (0%)
 Frame = +2

Query: 14   MSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLE 193
            M  IVKLHG+PKSIVSDRD+VFTSSFWQ LFKL GT+LAMSSAYHPQSDGQ+E LNK LE
Sbjct: 1    MHNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLNKVLE 60

Query: 194  MYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPP 373
            ++LRCFTF+NPK+W K ++WAEYWYN++F TS+GMTPFKALYGRDPP L +      DPP
Sbjct: 61   LFLRCFTFNNPKSWSKVISWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEAQEIDPP 120

Query: 374  EVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLAL 553
             + +++ ER+                MK QADK R+DV+F++G++VLV+LQPYRQ S+AL
Sbjct: 121  TLQEELKERDKLLQQLKSNLEKAQQYMKHQADKHRKDVKFQVGEMVLVRLQPYRQQSVAL 180

Query: 554  RKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPL 733
            RKN KLGMRYFGPFEI+A +G VAYKLKLP  AKIH VFHVSQLK F+G+ +E YLPLPL
Sbjct: 181  RKNQKLGMRYFGPFEILACVGNVAYKLKLPDNAKIHPVFHVSQLKPFKGNVTEHYLPLPL 240

Query: 734  TSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLE 913
            T  + GPI+ P AVLQ RTI+ G   V Q+L+ WE N++D ATWE++ + +  +PT NLE
Sbjct: 241  TMNDTGPIIQPVAVLQARTIRKGTQKVHQILVQWEQNSKDAATWEDLHDLQFKFPTLNLE 300

Query: 914  DKIQFKGEGIVMNESAAEV-ENEMPRRSMR 1000
            DK+ F GEGIVM  +  ++ EN+   +S R
Sbjct: 301  DKVVFNGEGIVMRPNTTKILENDDSAKSQR 330


>gb|PNX89231.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 408

 Score =  419 bits (1076), Expect = e-138
 Identities = 212/372 (56%), Positives = 259/372 (69%), Gaps = 27/372 (7%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQ+E LN
Sbjct: 36   AEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLN 95

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCFTF+NPK+W+K ++WAEYWYN++F TS+GMTPFKALYGR+PP LV+     
Sbjct: 96   KVLELFLRCFTFNNPKSWFKVISWAEYWYNTAFQTSIGMTPFKALYGREPPYLVKYEAHE 155

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +D P +  ++  R+                MK QADK R+DV+F+IG+LVLV+LQPYRQ 
Sbjct: 156  NDSPALQDELRGRDKILQQLKSNLERAQQYMKHQADKHRKDVKFQIGELVLVRLQPYRQQ 215

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPFEII  +G VAYKLKLP  AKIH VFHVSQLK F+G + E YL
Sbjct: 216  SVALRKNQKLGMRYFGPFEIIDCVGKVAYKLKLPDNAKIHPVFHVSQLKPFKGTTDEQYL 275

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT  + GPI+   AVLQ RTI  G   V Q+L+ WE  +++ ATWEN+ + +  +P 
Sbjct: 276  PLPLTMADTGPIIQSAAVLQARTIMQGSQKVHQILVQWEQQSKEEATWENLHDLQLKFPA 335

Query: 902  FNLEDKIQFKGEGIVM----------NESAA---------------EVENEM--PRRSMR 1000
             NLEDK+ FKGEGIVM          N SAA               E  N++  PRR  R
Sbjct: 336  LNLEDKVVFKGEGIVMRPNVTQLLEENMSAAQSHGDPQNALEMETVEENNKLMGPRRGQR 395

Query: 1001 ARKASVKLNDYC 1036
            ARK+      +C
Sbjct: 396  ARKSHSMWKTFC 407


>gb|KYP53324.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 751

 Score =  429 bits (1102), Expect = e-137
 Identities = 203/316 (64%), Positives = 245/316 (77%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE F   IVKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAMSSAYHPQ+DGQSE LN
Sbjct: 380  AEAFTLHIVKLHGLPKSIVSDRDKVFTSNFWQHLFKLQGTTLAMSSAYHPQTDGQSEVLN 439

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            KCLEM+LRCFTFDNPK+W K L WAEYWYN+SFHTSLGMTPFKALYGRDPP L R   + 
Sbjct: 440  KCLEMFLRCFTFDNPKSWSKGLTWAEYWYNTSFHTSLGMTPFKALYGRDPPTLTRYQRSP 499

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +DP +V  Q+T+R+               RMK QADK+R D+QF++GD VLVKLQPYRQH
Sbjct: 500  TDPRDVQDQLTKRDQLLDQLKCNLTKAQQRMKHQADKKRSDMQFQVGDQVLVKLQPYRQH 559

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ LRK+ KL MRYFGPF+++ ++G VAYKL+LP TA+IH VFH+SQLK F+G S+ PY+
Sbjct: 560  SVVLRKHQKLSMRYFGPFKVLGKVGVVAYKLELPETARIHPVFHISQLKPFKGISNAPYM 619

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT++ELGP L P A+L  RTI  G  L+ QVL+ W+ ++    +WE+V   K H+P 
Sbjct: 620  PLPLTTSELGPFLQPVAILHARTILQGSKLLSQVLVQWDPSSNVPNSWEDVTFIKTHFPY 679

Query: 902  FNLEDKIQFKGEGIVM 949
             NLEDK+  KGEG VM
Sbjct: 680  LNLEDKVVLKGEGNVM 695


>gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium pratense]
          Length = 869

 Score =  430 bits (1106), Expect = e-136
 Identities = 213/363 (58%), Positives = 268/363 (73%), Gaps = 19/363 (5%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE+FM  +VKLHG+PKSIVSDRD+VF S FW+ LF+L GTTL+MSSAYHPQ+DGQSE+LN
Sbjct: 505  AEVFMKTVVKLHGLPKSIVSDRDKVFISKFWKELFQLQGTTLSMSSAYHPQTDGQSEALN 564

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            KCLEMYLRC TF NPK+W+KAL WAEYWYN+++H SLGMTPF+ALYGR PP LVR   + 
Sbjct: 565  KCLEMYLRCLTFQNPKSWFKALDWAEYWYNTAYHNSLGMTPFQALYGRTPPTLVRYTHSP 624

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +D  +V +Q+ ER+                MK QADK RRD QFE+G+ VLVKLQPYRQ+
Sbjct: 625  TDTLDVQQQLMERDRLIATLKDNLKRAQQIMKNQADKHRRDAQFEVGEQVLVKLQPYRQN 684

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPF II ++G VAYK++LP  AKIH VFH+SQLK+F+G +++PY+
Sbjct: 685  SVALRKNQKLGMRYFGPFTIIEKVGKVAYKVQLPVEAKIHPVFHISQLKQFKGRATDPYI 744

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT+ ELGPIL P AVLQ+R I   +  ++QVLI WE  N+  ATWE+V+E  ++YP 
Sbjct: 745  PLPLTTHELGPILQPIAVLQRRDIVRNEHAIQQVLIKWEGLNDTDATWEDVDEITENYPN 804

Query: 902  FNLEDKIQFKGEGIVMNESAAE-------VENE----------MP--RRSMRARKASVKL 1024
            FNLEDK++ KG+GI M E   +       +ENE          MP  R+ +R R  S+KL
Sbjct: 805  FNLEDKVEVKGKGIAMEEPRQQKGQVSKILENEGATKSVSAPQMPGMRKGVRPRAPSIKL 864

Query: 1025 NDY 1033
             D+
Sbjct: 865  RDF 867


>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score =  441 bits (1134), Expect = e-135
 Identities = 213/344 (61%), Positives = 256/344 (74%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FMS IVKLHG+P+SIVSDRDRVFTS+FWQHLFKL GTTLAMSSAYHPQSDGQSE LN
Sbjct: 1189 AEAFMSHIVKLHGIPRSIVSDRDRVFTSTFWQHLFKLQGTTLAMSSAYHPQSDGQSEVLN 1248

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            KCLEMYLRCFT+++PK W KAL WAE+WYN+++H SLGMTPF+ALYGR+PP L R   + 
Sbjct: 1249 KCLEMYLRCFTYEHPKGWVKALPWAEFWYNTAYHMSLGMTPFRALYGREPPTLTRQACSI 1308

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
             DP EV +Q+T+R+A               MK QADK+R DV F+IGD VLVKLQPYRQH
Sbjct: 1309 DDPAEVREQLTDRDALLAKLKINLTRAQQVMKRQADKKRLDVSFQIGDEVLVKLQPYRQH 1368

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S  LRKN KL MRYFGPF+++A+IG VAYKL+LPS A+IH VFHVSQLK F G + +PYL
Sbjct: 1369 SAVLRKNQKLSMRYFGPFKVLAKIGDVAYKLELPSAARIHPVFHVSQLKPFNGTAQDPYL 1428

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT TE+GP++ P  +L  R I  G   +EQ+L+ WE+  +D ATWE++E+ K  YPT
Sbjct: 1429 PLPLTVTEMGPVMQPVKILASRIIIRGHNQIEQILVQWENGLQDEATWEDIEDIKASYPT 1488

Query: 902  FNLEDKIQFKGEGIVMNESAAEVENEMPRRSMRARKASVKLNDY 1033
            FNLEDK+ FKGEG V N  +   +      S   R    KL D+
Sbjct: 1489 FNLEDKVVFKGEGNVTNGMSRGEKVNNTAESSSERGLHNKLADF 1532


>gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1258

 Score =  430 bits (1106), Expect = e-132
 Identities = 216/365 (59%), Positives = 263/365 (72%), Gaps = 19/365 (5%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE  M  IVKLHG+PKSIVSDRD+VFTSSFWQ LFKL GTTLAMSSAYHPQSDGQSE LN
Sbjct: 894  AEAVMDNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTTLAMSSAYHPQSDGQSEVLN 953

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCFTFDNPK+W KAL+W+E+WYN++F TS+GMTPFKALYGRDPP L+R     
Sbjct: 954  KTLELFLRCFTFDNPKSWCKALSWSEFWYNTAFQTSIGMTPFKALYGRDPPALIRYETQA 1013

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +DPP + +++ ER+                MK QADK R DV+ ++GDLVLVKLQPYRQ 
Sbjct: 1014 NDPPTLQEKLMERDRIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDLVLVKLQPYRQQ 1073

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPFE+IA++G VAYKLKLP  AKIH VFHVSQLK F+GD+ E Y+
Sbjct: 1074 SVALRKNQKLGMRYFGPFEVIAKVGEVAYKLKLPEHAKIHPVFHVSQLKPFKGDNQEQYM 1133

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPL+ T+ GP++ P +VL  RTI  G   ++QVLI W+  +   ATWE+V+  +  +P 
Sbjct: 1134 PLPLSMTDTGPMIQPVSVLATRTIIRGAQRIQQVLIQWDQYSTAEATWEDVDALQSKFPA 1193

Query: 902  FNLEDKIQFKGEGIVMN----------ESAAEVENEM---------PRRSMRARKASVKL 1024
            FNLEDK+ F G+GIVM+          ESA E  N+M         PRR  R RK S +L
Sbjct: 1194 FNLEDKVAFIGDGIVMSPMEENILQEGESAKEGLNDMHERNSVMMGPRRGKRVRKTSKRL 1253

Query: 1025 NDYCV 1039
              Y +
Sbjct: 1254 EGYAL 1258


>gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1210

 Score =  429 bits (1102), Expect = e-132
 Identities = 203/316 (64%), Positives = 245/316 (77%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE F   IVKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAMSSAYHPQ+DGQSE LN
Sbjct: 839  AEAFTLHIVKLHGLPKSIVSDRDKVFTSNFWQHLFKLQGTTLAMSSAYHPQTDGQSEVLN 898

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            KCLEM+LRCFTFDNPK+W K L WAEYWYN+SFHTSLGMTPFKALYGRDPP L R   + 
Sbjct: 899  KCLEMFLRCFTFDNPKSWSKGLTWAEYWYNTSFHTSLGMTPFKALYGRDPPTLTRYQRSP 958

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +DP +V  Q+T+R+               RMK QADK+R D+QF++GD VLVKLQPYRQH
Sbjct: 959  TDPRDVQDQLTKRDQLLDQLKCNLTKAQQRMKHQADKKRSDMQFQVGDQVLVKLQPYRQH 1018

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ LRK+ KL MRYFGPF+++ ++G VAYKL+LP TA+IH VFH+SQLK F+G S+ PY+
Sbjct: 1019 SVVLRKHQKLSMRYFGPFKVLGKVGVVAYKLELPETARIHPVFHISQLKPFKGISNAPYM 1078

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT++ELGP L P A+L  RTI  G  L+ QVL+ W+ ++    +WE+V   K H+P 
Sbjct: 1079 PLPLTTSELGPFLQPVAILHARTILQGSKLLSQVLVQWDPSSNVPNSWEDVTFIKTHFPY 1138

Query: 902  FNLEDKIQFKGEGIVM 949
             NLEDK+  KGEG VM
Sbjct: 1139 LNLEDKVVLKGEGNVM 1154


>gb|KYP53386.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 673

 Score =  414 bits (1064), Expect = e-132
 Identities = 199/322 (61%), Positives = 243/322 (75%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE F   IVKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAMSS YH Q+DGQS++LN
Sbjct: 302  AEAFTLHIVKLHGLPKSIVSDRDKVFTSTFWQHLFKLHGTTLAMSSTYHLQTDGQSKALN 361

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            KCLEM+L CFTF+NPK+W K L WAEYWYN+SFHTSLGM+PFKALYGRDPP L R   + 
Sbjct: 362  KCLEMFLSCFTFENPKSWSKGLTWAEYWYNTSFHTSLGMSPFKALYGRDPPTLTRYQRSP 421

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            + P +V  Q+TER+                MK QADK+R D+QF++GD VLVKLQPYRQH
Sbjct: 422  AYPSDVQDQLTERDQLLDQLKCNLTKAQQHMKHQADKKRFDMQFQVGDQVLVKLQPYRQH 481

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ LRK+ KL MRYFGPF++I ++G VAYKL+LP TA+IH VFH+SQLK F+G S+EPY+
Sbjct: 482  SVVLRKHQKLSMRYFGPFKVIGKVGVVAYKLELPETARIHPVFHISQLKPFKGVSNEPYM 541

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT++ELGPIL P A+L  RTI  G  L  QVL+ W+ +     +WE+V   K H+P 
Sbjct: 542  PLPLTTSELGPILQPVAILHARTILQGSKLHSQVLVQWDPSINVPNSWEDVTFIKTHFPH 601

Query: 902  FNLEDKIQFKGEGIVMNESAAE 967
             +LEDK+  KGEG VM  S  +
Sbjct: 602  IDLEDKVVLKGEGNVMKMSVGQ 623


>gb|PNY16670.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 752

 Score =  413 bits (1061), Expect = e-131
 Identities = 203/356 (57%), Positives = 248/356 (69%), Gaps = 12/356 (3%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  I KLHG+PKSIVSDRD+VFTS FWQ+LFK  GT+LAMS+AYHPQ+DGQSE+LN
Sbjct: 384  AEAFMLNIAKLHGIPKSIVSDRDKVFTSGFWQNLFKRQGTSLAMSTAYHPQTDGQSEALN 443

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            KCLEMYLRCFTF NPK WYK L  AEYWYN++FH S GMTPF+ALYGR+ P L+R +   
Sbjct: 444  KCLEMYLRCFTFQNPKGWYKILPMAEYWYNTTFHNSAGMTPFRALYGREAPTLIRYVAQT 503

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            SDPP V +Q+ +R+                MK QA K+R+DV+F++GD VLV+LQPYRQH
Sbjct: 504  SDPPSVKEQLIQRDVIMDQLKQNLMRAQHVMKHQAGKKRKDVEFKLGDKVLVRLQPYRQH 563

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S ALRKN KL MRYFGPFE+IA+IG VAYKL LP +AKIH+VFHV+QLK+F+G + +PYL
Sbjct: 564  SAALRKNQKLSMRYFGPFEVIAKIGTVAYKLDLPPSAKIHSVFHVAQLKEFKGSNDDPYL 623

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT+TE+GP L P  VL  R +        Q+LI W +     A WE+  E K +YP 
Sbjct: 624  PLPLTTTEVGPTLYPTQVLDSRMVMQASVANPQILIQWGNEANAEAKWEDYNEIKNNYPE 683

Query: 902  FNLEDKIQFKGEGIVM-----NESAAEVENEM-------PRRSMRARKASVKLNDY 1033
             NLEDK++FKG GIVM     N   +    +M        R+  R R  + KL  Y
Sbjct: 684  LNLEDKVEFKGGGIVMKGIMGNRGKSSDSTQMITTNEDGIRKGSRKRVTNTKLKGY 739


>gb|PNY00079.1| hypothetical protein L195_g023353, partial [Trifolium pratense]
          Length = 641

 Score =  409 bits (1050), Expect = e-131
 Identities = 197/309 (63%), Positives = 243/309 (78%), Gaps = 1/309 (0%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  +VKLHG+PKSI+SDRD+VF S FW+ LF L GTTL+MSSAYHPQ+DGQSE+LN
Sbjct: 334  AEAFMKTVVKLHGLPKSIISDRDKVFISKFWKELFSLQGTTLSMSSAYHPQTDGQSEALN 393

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            KCLEMYLRC TF NPKTW+KAL WAEYWYN++ HTSLGMTPF+ALYGR PP LVR   + 
Sbjct: 394  KCLEMYLRCLTFQNPKTWFKALDWAEYWYNTAHHTSLGMTPFQALYGRAPPTLVRYNHSP 453

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            SD   V +Q+ ER+                MK QADK RR+VQFE+G+ VLVKLQPYRQ+
Sbjct: 454  SDTVTVQQQLMERDVLITTLKDNLNRAQQVMKAQADKHRREVQFEVGEHVLVKLQPYRQN 513

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPF II +IG VAYKL+LP+ AKIH VFH+SQLK+F+G + +PY+
Sbjct: 514  SVALRKNQKLGMRYFGPFTIIEKIGKVAYKLQLPAEAKIHPVFHISQLKQFKGQAFDPYI 573

Query: 722  PLPLTSTELGPILLPRAVLQKR-TIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYP 898
            PLPLT+TELGP+L P AVL++R  ++NG+ ++ QVLI W+  N   ATWE+  +  ++YP
Sbjct: 574  PLPLTTTELGPVLQPVAVLKRRDRLRNGE-VISQVLIKWQGLNNTDATWEDAADIVENYP 632

Query: 899  TFNLEDKIQ 925
            TFNLEDK++
Sbjct: 633  TFNLEDKVE 641


>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score =  429 bits (1102), Expect = e-130
 Identities = 209/326 (64%), Positives = 247/326 (75%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQ+E LN
Sbjct: 1166 AESFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLN 1225

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCFTF NPK+W K L+WAEYWYN++F TS+GMTPFKALYGRDPP L +     
Sbjct: 1226 KALELFLRCFTFHNPKSWSKVLSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEAQV 1285

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +DPP + +++ ER+                MK QADK R+DV F++GDLVLVKLQPYRQ 
Sbjct: 1286 TDPPALQEELMERDKILQQLKSNLDRAQQYMKKQADKHRKDVTFQVGDLVLVKLQPYRQQ 1345

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPFEIIA IGAVAYKLKLP  AKIH VFHVSQLK F+G +S+ YL
Sbjct: 1346 SVALRKNQKLGMRYFGPFEIIACIGAVAYKLKLPDNAKIHPVFHVSQLKPFKGAASDQYL 1405

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT TE GPI+ P AVLQ RTI  G   V Q+L+ W+ N E  ATWE+ ++ +  +PT
Sbjct: 1406 PLPLTMTETGPIMQPIAVLQARTIMRGTQRVHQILVQWDTNAEAEATWEDFDDLQLKFPT 1465

Query: 902  FNLEDKIQFKGEGIVMNESAAEVENE 979
             NLEDK+ F GEGIVM  +   +  E
Sbjct: 1466 LNLEDKVVFNGEGIVMRPNTTNLLEE 1491


>gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1240

 Score =  419 bits (1077), Expect = e-128
 Identities = 199/316 (62%), Positives = 243/316 (76%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  +VKLHG+PKSIVSDRD+VFTS+FWQHLFKL GTTLAM+SAYHPQSDGQ+E LN
Sbjct: 894  AEAFMHNVVKLHGMPKSIVSDRDKVFTSTFWQHLFKLQGTTLAMTSAYHPQSDGQTEVLN 953

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE+YLRCF+F+NPK+W+K L+W+E+WYN++F TS+GMTPFKALYGRDPP L R +   
Sbjct: 954  KGLELYLRCFSFNNPKSWFKMLSWSEFWYNTAFQTSIGMTPFKALYGRDPPYLTRYVAQA 1013

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            SDPP + +++ ER+                MK QADK R D+  +IGDLVLVKLQPYRQH
Sbjct: 1014 SDPPTLQEELMERDKILQQLKDNLIRAQQYMKKQADKHRSDISLKIGDLVLVKLQPYRQH 1073

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLG+RYFGPFEIIAR+G VAYKLKLP  AKIH VFHVSQLK F+G + E YL
Sbjct: 1074 SVALRKNQKLGLRYFGPFEIIARVGEVAYKLKLPDDAKIHPVFHVSQLKPFKGVADEQYL 1133

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT T++GP + P  VLQ RT+  G   + QVLI W+      ATWE++   ++ +P+
Sbjct: 1134 PLPLTMTDIGPSIQPIDVLQVRTVIRGSQQIHQVLIQWDQYPAAQATWEDITTIQEKFPS 1193

Query: 902  FNLEDKIQFKGEGIVM 949
             NLEDK+ F G+GIVM
Sbjct: 1194 LNLEDKVAFNGDGIVM 1209


>dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subterraneum]
          Length = 1451

 Score =  421 bits (1081), Expect = e-128
 Identities = 198/323 (61%), Positives = 250/323 (77%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE+FM+ IVKLHG+PKSIVSDRD+VFTSSFWQHLFKL GT+LAMSSAYHPQSDGQ+E LN
Sbjct: 1111 AEVFMNNIVKLHGMPKSIVSDRDKVFTSSFWQHLFKLQGTSLAMSSAYHPQSDGQTEVLN 1170

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCFTF+NPK+WYKALAW+E+WYN++ HTS+GMTPFKALYGR+PP L R  + +
Sbjct: 1171 KGLELFLRCFTFNNPKSWYKALAWSEFWYNTALHTSIGMTPFKALYGREPPTLTRYEVQD 1230

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +DPP + +++ ER+                MK QADK R + +F +GD+VLVKLQPYRQ 
Sbjct: 1231 NDPPALQEELMERDRILQQLKSNLERAQQYMKKQADKHRVEFKFHLGDMVLVKLQPYRQQ 1290

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPFEIIA +G VAYKLKLP  AKIH VFHVSQLK F+G   + Y+
Sbjct: 1291 SVALRKNQKLGMRYFGPFEIIACVGKVAYKLKLPDHAKIHLVFHVSQLKPFKGVPQQQYM 1350

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT  + GP++ P  VLQ RTI  G   + Q+L+ W+  +   ATWENV++ ++++P 
Sbjct: 1351 PLPLTMFDNGPMIQPVEVLQARTIMQGTQKIHQILVQWDQYDIAEATWENVDDLQKNFPL 1410

Query: 902  FNLEDKIQFKGEGIVMNESAAEV 970
            +NLEDK+ FKG+GIVM     ++
Sbjct: 1411 YNLEDKVIFKGDGIVMRPKGEDI 1433


>gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1502

 Score =  419 bits (1076), Expect = e-127
 Identities = 197/323 (60%), Positives = 244/323 (75%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRD+VFTS+FWQHLFK+ GT+LAMSSAYHPQ+DGQ+E LN
Sbjct: 1169 AEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQHLFKMQGTSLAMSSAYHPQTDGQTEVLN 1228

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCFTF NPK+W+K ++WAEYWYN++F TS+GMTPFKALYGRDPP L +  +  
Sbjct: 1229 KTLELFLRCFTFHNPKSWFKVMSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEVQV 1288

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
             DPP + +++ ER+                MK QADK RR+V F++GDLVLVKLQPY+Q 
Sbjct: 1289 DDPPALREELMERDQILQQLKTNLERAQQYMKQQADKHRREVSFKVGDLVLVKLQPYKQQ 1348

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPFE+IA +G VAYKL+LP  AKIH VFHVSQLK F G S E YL
Sbjct: 1349 SVALRKNQKLGMRYFGPFEVIACVGKVAYKLQLPENAKIHPVFHVSQLKPFHGTSQEQYL 1408

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT ++ GPI  P  +LQ RTI  G   V Q+ I W+ N+ + A+WE+++E +  +P 
Sbjct: 1409 PLPLTMSDTGPIFQPATILQARTIVRGNKKVHQLQIQWDLNSPEEASWEDLDELQNKFPN 1468

Query: 902  FNLEDKIQFKGEGIVMNESAAEV 970
             NLEDK+ FKGEGIVM  +   +
Sbjct: 1469 INLEDKVVFKGEGIVMRPNNTNI 1491


>gb|PNY14541.1| hypothetical protein L195_g011223 [Trifolium pratense]
          Length = 763

 Score =  402 bits (1033), Expect = e-127
 Identities = 193/316 (61%), Positives = 235/316 (74%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE F+  IVKLHG+ KSIVSDRD+VFTS+FWQ LFKL GT+L MSSAYHPQSDGQ+E LN
Sbjct: 391  AEAFIHNIVKLHGMSKSIVSDRDKVFTSNFWQQLFKLQGTSLTMSSAYHPQSDGQTEVLN 450

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCF+F+NPK+WYK L+WAEYWYN++F TS+GMTPFKALYGR+PP L +     
Sbjct: 451  KGLELFLRCFSFNNPKSWYKVLSWAEYWYNTTFQTSIGMTPFKALYGREPPSLTKYEAHA 510

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
             D P + +++ ER+                MK QADK R +V  ++G+LVLVKLQPYRQ 
Sbjct: 511  DDSPTIQEELMERDKILQQLKTNLDRAQQYMKKQADKNRTEVNLQVGELVLVKLQPYRQQ 570

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPFEIIARIG VAYKLKLP  AKIH VFHVSQLK F+G + + YL
Sbjct: 571  SVALRKNQKLGMRYFGPFEIIARIGKVAYKLKLPDNAKIHPVFHVSQLKPFKGTTQDQYL 630

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT +E+GPI+ P ++L  RTI      V Q+LI W+       TWE+ ++ +  +PT
Sbjct: 631  PLPLTMSEVGPIIQPVSILDARTIVRESQKVHQILIQWDQTTPAETTWEDFDDLQNKFPT 690

Query: 902  FNLEDKIQFKGEGIVM 949
             NLEDKI F GEGIVM
Sbjct: 691  LNLEDKIVFNGEGIVM 706


>dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum]
          Length = 1512

 Score =  417 bits (1072), Expect = e-126
 Identities = 203/323 (62%), Positives = 243/323 (75%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQSE LN
Sbjct: 1166 AEAFMHHIVKLHGMPKSIVSDRDKVFTSNFWQQLFKLQGTSLAMSSAYHPQSDGQSEVLN 1225

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            + LE++LRCFTF+NPK WYKAL+W+E+WYN++F TS+GMTPFKALYGRDPP LVR     
Sbjct: 1226 RTLELFLRCFTFNNPKAWYKALSWSEFWYNTAFQTSIGMTPFKALYGRDPPTLVRYEAQA 1285

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
             DPP + +++  R+                MK QADK RRD++ ++GDLVLVKLQPYRQ 
Sbjct: 1286 GDPPALQEELMGRDKLLQQLKSNLERAQQYMKRQADKHRRDIKLQVGDLVLVKLQPYRQQ 1345

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            SLALRKN KLGMRYFGPFEI+A++G VAYKLKLP  AKIH VFH+SQLK F+G S +  L
Sbjct: 1346 SLALRKNQKLGMRYFGPFEILAKVGEVAYKLKLPDHAKIHPVFHISQLKPFKGISQDQSL 1405

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT ++ GP++ P AVL  RTI  G   V QVLI W+   E  ATWE V   +  +P 
Sbjct: 1406 PLPLTMSDTGPLIQPIAVLAARTILKGIQKVHQVLIQWDQYPEAEATWEEVTNLQSKFPY 1465

Query: 902  FNLEDKIQFKGEGIVMNESAAEV 970
            FNLEDK+ FKG+GIVM+    +V
Sbjct: 1466 FNLEDKVVFKGDGIVMSPKEGKV 1488


>dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum]
          Length = 1500

 Score =  416 bits (1068), Expect = e-125
 Identities = 208/370 (56%), Positives = 256/370 (69%), Gaps = 26/370 (7%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRDRVFTS+FWQHLFKL GT+LAMSSAYHPQSDGQ+E LN
Sbjct: 1131 AETFMHNIVKLHGMPKSIVSDRDRVFTSTFWQHLFKLQGTSLAMSSAYHPQSDGQTEVLN 1190

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCFTF+NPK+W+K L+W+EYWYN+SF TS+GMTPF+ALYGR PP L + +   
Sbjct: 1191 KALELFLRCFTFNNPKSWFKVLSWSEYWYNTSFQTSIGMTPFQALYGRLPPYLTKYVPQE 1250

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +DPP +  ++ ER+                MK QADK RRD+ +++GD VL+KLQPYRQH
Sbjct: 1251 NDPPTLQAELIERDNLLQQLKTNLERAQQYMKKQADKHRRDISYQVGDFVLIKLQPYRQH 1310

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGMRYFGPFEIIA +G +AYKL LP  AKIH VFHVSQLK F+G + + Y+
Sbjct: 1311 SVALRKNQKLGMRYFGPFEIIACVGTIAYKLNLPENAKIHPVFHVSQLKPFKGTTQDQYM 1370

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT +E GPI+ P AVLQ RTIQ G   V QV I W+   E  A+WE++++ K  +PT
Sbjct: 1371 PLPLTMSETGPIIQPIAVLQARTIQRGMQKVHQVQIQWDQTAE--ASWEDLDDLKNKFPT 1428

Query: 902  FNLEDKIQFKGEGIVM-------------------------NESAAEVENEM-PRRSMRA 1003
             NLEDK+  +G  IVM                           S AE++ ++ PRR  R 
Sbjct: 1429 LNLEDKVVVEGGSIVMKPNINNILEAKVPANSIGDPQNMYDGNSVAEIKEDLGPRRGKRV 1488

Query: 1004 RKASVKLNDY 1033
            RK      DY
Sbjct: 1489 RKTHGIWKDY 1498


>dbj|GAU47333.1| hypothetical protein TSUD_101210 [Trifolium subterraneum]
          Length = 1017

 Score =  405 bits (1042), Expect = e-125
 Identities = 197/311 (63%), Positives = 235/311 (75%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GTTLAMSS+YHPQSDGQ+E LN
Sbjct: 704  AEAFMDNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTTLAMSSSYHPQSDGQTEVLN 763

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCF+F+NPK+W K L+W+E+WYN++F TS+GMTPFKALYGRDPP L R +   
Sbjct: 764  KGLELFLRCFSFNNPKSWSKMLSWSEFWYNTAFQTSIGMTPFKALYGRDPPYLTRYVAQE 823

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +DPP + +++ ER                 MK QADK R D+   +GDLVLVKLQPYRQH
Sbjct: 824  NDPPALQEELMERGRILQQLKNNLIRAQQYMKKQADKHRSDITLNVGDLVLVKLQPYRQH 883

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLG+RYFGPFEIIAR+G VAYKL+LP  AKIH VFHVSQLK F+G + E YL
Sbjct: 884  SVALRKNKKLGLRYFGPFEIIARVGDVAYKLQLPKNAKIHPVFHVSQLKPFKGVAQEQYL 943

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT TE+GPI+ P  VLQ RTI  G   V QVLI W+  +   ATWE+V   K  +P+
Sbjct: 944  PLPLTMTEIGPIVQPIDVLQARTIIQGLQKVHQVLIQWDQYSAAEATWEDVTTVKDKFPS 1003

Query: 902  FNLEDKIQFKG 934
             NLEDK+ F G
Sbjct: 1004 LNLEDKVSFYG 1014


>dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum]
          Length = 1531

 Score =  415 bits (1067), Expect = e-125
 Identities = 208/363 (57%), Positives = 257/363 (70%), Gaps = 19/363 (5%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRD+VFTSSFWQ LFKL GT+LAMSSAYHPQSDGQSE LN
Sbjct: 1167 AEAFMDNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTSLAMSSAYHPQSDGQSEVLN 1226

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCFTF+NPK+W KALAW+E+WYN++F TS+GMTPFKALYGRDPP ++R  I  
Sbjct: 1227 KTLELFLRCFTFENPKSWCKALAWSEFWYNTAFQTSIGMTPFKALYGRDPPAIIRYEIQA 1286

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            SD P + +++ ER+                MK QADK R DV+ ++GD VLVKLQPYRQ 
Sbjct: 1287 SDSPTLQEKLMERDRIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDWVLVKLQPYRQQ 1346

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S+ALRKN KLGM+YFGPFE+IA++G VAYKLKLP  AKIH VFHVSQLK F+GD+ E Y+
Sbjct: 1347 SVALRKNQKLGMKYFGPFEVIAKVGEVAYKLKLPDHAKIHPVFHVSQLKPFKGDNQEQYM 1406

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPL+ T++GP++ P AVL  RTI      ++QVLI W+      ATWE++   ++ +PT
Sbjct: 1407 PLPLSMTDIGPMIQPVAVLATRTIIRCAQRIQQVLIQWDQYPIAEATWEDMVALQRKFPT 1466

Query: 902  FNLEDKIQFKGEGIVMNESAAEVENE-------------------MPRRSMRARKASVKL 1024
            FNLEDK+ F G+GIVM+ +   +  E                    PRR  R R  S +L
Sbjct: 1467 FNLEDKVAFIGDGIVMSPNEENILEEGDSSNVGPPDKHEGNYVMMGPRRGKRMRNISKRL 1526

Query: 1025 NDY 1033
              Y
Sbjct: 1527 EGY 1529


>gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 937

 Score =  403 bits (1035), Expect = e-125
 Identities = 194/312 (62%), Positives = 234/312 (75%)
 Frame = +2

Query: 2    AELFMSQIVKLHGVPKSIVSDRDRVFTSSFWQHLFKLLGTTLAMSSAYHPQSDGQSESLN 181
            AE FM  IVKLHG+PKSIVSDRD+VFTS+FWQ LFKL GT+LAMSSAYHPQSDGQ+E LN
Sbjct: 626  AEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLN 685

Query: 182  KCLEMYLRCFTFDNPKTWYKALAWAEYWYNSSFHTSLGMTPFKALYGRDPPQLVRPMITN 361
            K LE++LRCF+F NPK+WYK L+WAEYWYN++F TS+GMTPFKALYGRDPP L +     
Sbjct: 686  KGLELFLRCFSFHNPKSWYKVLSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEAQV 745

Query: 362  SDPPEVIKQITEREATXXXXXXXXXXXXXRMKCQADKRRRDVQFEIGDLVLVKLQPYRQH 541
            +D P + +++ ER+                MK QADK R +V  ++GDLVLVKLQPYRQ 
Sbjct: 746  TDSPALQEELMERDKILQQLKINLERAQQYMKKQADKHRSEVNLQVGDLVLVKLQPYRQQ 805

Query: 542  SLALRKNNKLGMRYFGPFEIIARIGAVAYKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYL 721
            S++LRKN KLGMRYFGPFEIIAR+G VAYKLKLP  AKIH VFHVSQLK F+G + + YL
Sbjct: 806  SVSLRKNQKLGMRYFGPFEIIARVGNVAYKLKLPDNAKIHPVFHVSQLKPFKGIAQDQYL 865

Query: 722  PLPLTSTELGPILLPRAVLQKRTIQNGQTLVEQVLIMWEDNNEDTATWENVEEFKQHYPT 901
            PLPLT +E GPI+ P A L+ RTI  G   V Q+L+ W+      ATWE+++  +  +PT
Sbjct: 866  PLPLTMSETGPIIQPIAALEARTIMRGMQKVHQILVQWDQMPVTEATWEDLDVLQDKFPT 925

Query: 902  FNLEDKIQFKGE 937
             NLEDKI F GE
Sbjct: 926  LNLEDKIAFNGE 937


Top