BLASTX nr result

ID: Astragalus23_contig00023828 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00023828
         (2472 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP53324.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   697   0.0  
gb|KYP53386.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   684   0.0  
gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   697   0.0  
gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   698   0.0  
gb|AAO23078.1| polyprotein [Glycine max]                              699   0.0  
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...   687   0.0  
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   686   0.0  
gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   675   0.0  
gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   665   0.0  
dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subt...   679   0.0  
gb|PNY14541.1| hypothetical protein L195_g011223 [Trifolium prat...   656   0.0  
dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subt...   679   0.0  
gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium prat...   659   0.0  
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...   678   0.0  
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   671   0.0  
dbj|GAU29525.1| hypothetical protein TSUD_115470 [Trifolium subt...   665   0.0  
gb|PNX93307.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   629   0.0  
dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte...   660   0.0  
gb|KYP63732.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   647   0.0  
gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   655   0.0  

>gb|KYP53324.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 751

 Score =  697 bits (1799), Expect = 0.0
 Identities = 331/527 (62%), Positives = 401/527 (76%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG ENI ADALSRV  MAWS P+SL + +LKQA+  D     L++
Sbjct: 169  WLHKFLGYDFTIEYKPGKENIAADALSRVFFMAWSAPKSLFLHELKQALENDAHLYDLMQ 228

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             C   AN D  Y   D LLYW+ RLV+P  S LI+ +L EYHSSP+GGHAG         
Sbjct: 229  LCFANANPDARYKLHDGLLYWKDRLVLPSPSPLIQKVLLEYHSSPIGGHAGIARTLARIS 288

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP +++DV  +VQQC ICQ+AK   + PAGLLQPLPIP  VW+DVAMDFITGLPNS
Sbjct: 289  SQFYWPKMREDVTRFVQQCIICQQAKVSHSLPAGLLQPLPIPQQVWDDVAMDFITGLPNS 348

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
             GFTVIMVV+DRLSK++HF+P++ D++SK VAE F   IVKLHG+PKSIVSDRD+VFTS+
Sbjct: 349  CGFTVIMVVIDRLSKYSHFVPLKSDFSSKVVAEAFTLHIVKLHGLPKSIVSDRDKVFTSN 408

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQHLFKL GTTLAMSSAYHPQ+DGQSE LNKCLEM+LRCFTFDNPK+W K L WAEYWY
Sbjct: 409  FWQHLFKLQGTTLAMSSAYHPQTDGQSEVLNKCLEMFLRCFTFDNPKSWSKGLTWAEYWY 468

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N+SFHTSLGMTPFKALYGRDPP L R   + +DP +V  Q+T+R+               
Sbjct: 469  NTSFHTSLGMTPFKALYGRDPPTLTRYQRSPTDPRDVQDQLTKRDQLLDQLKCNLTKAQQ 528

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
            RMK QADK+R D+QF++GD VLVKLQPYRQHS+ LRK+ KL MRYFGPF+++ ++G VAY
Sbjct: 529  RMKHQADKKRSDMQFQVGDQVLVKLQPYRQHSVVLRKHQKLSMRYFGPFKVLGKVGVVAY 588

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KL+LP TA+IH VFH+SQLK F+G S+ PY+PLPLT++ELGP L P A+L  RTI  G  
Sbjct: 589  KLELPETARIHPVFHISQLKPFKGISNAPYMPLPLTTSELGPFLQPVAILHARTILQGSK 648

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVM 1584
            L+ QVL+ W+ ++    +WE+V   K H+P  NLEDK+  KGEG VM
Sbjct: 649  LLSQVLVQWDPSSNVPNSWEDVTFIKTHFPYLNLEDKVVLKGEGNVM 695


>gb|KYP53386.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 673

 Score =  684 bits (1764), Expect = 0.0
 Identities = 327/534 (61%), Positives = 401/534 (75%)
 Frame = +1

Query: 1    AWLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLL 180
            AWLHKF+GYD TIEYKPG ENI A ALSRV  MAWSEP+SL + +LKQA+  D    +L+
Sbjct: 90   AWLHKFLGYDFTIEYKPGKENIVAVALSRVFFMAWSEPKSLFLHELKQALEDDAELYELM 149

Query: 181  EACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXX 360
            + C   AN D  Y   D LLYW+ RLV+P  S L++ +L EYHSSP+GGHAG        
Sbjct: 150  QLCFANANPDARYKMHDGLLYWKDRLVLPSPSPLVQKVLLEYHSSPIGGHAGIARTLARI 209

Query: 361  XXQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPN 540
              QF+WP +++DV  +VQQC ICQ+AK   + PAGLLQPLPIP  VW+DVAMDFITGLPN
Sbjct: 210  SSQFYWPKMREDVTRFVQQCIICQQAKVSHSLPAGLLQPLPIPQQVWDDVAMDFITGLPN 269

Query: 541  SHGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTS 720
            S GFTVIMVV+DRLSK++HF+P++ D++SK VAE F   IVKLHG+PKSIVSDRD+VFTS
Sbjct: 270  SCGFTVIMVVIDRLSKYSHFVPLKSDFSSKVVAEAFTLHIVKLHGLPKSIVSDRDKVFTS 329

Query: 721  SFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYW 900
            +FWQHLFKL GTTLAMSS YH Q+DGQS++LNKCLEM+L CFTF+NPK+W K L WAEYW
Sbjct: 330  TFWQHLFKLHGTTLAMSSTYHLQTDGQSKALNKCLEMFLSCFTFENPKSWSKGLTWAEYW 389

Query: 901  YNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXX 1080
            YN+SFHTSLGM+PFKALYGRDPP L R   + + P +V  Q+TER+              
Sbjct: 390  YNTSFHTSLGMSPFKALYGRDPPTLTRYQRSPAYPSDVQDQLTERDQLLDQLKCNLTKAQ 449

Query: 1081 XRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVA 1260
              MK QADK+R D+QF++GD VLVKLQPYRQHS+ LRK+ KL MRYFGPF++I ++G VA
Sbjct: 450  QHMKHQADKKRFDMQFQVGDQVLVKLQPYRQHSVVLRKHQKLSMRYFGPFKVIGKVGVVA 509

Query: 1261 YKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQ 1440
            YKL+LP TA+IH VFH+SQLK F+G S+EPY+PLPLT++ELGPIL P A+L  RTI  G 
Sbjct: 510  YKLELPETARIHPVFHISQLKPFKGVSNEPYMPLPLTTSELGPILQPVAILHARTILQGS 569

Query: 1441 TLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAE 1602
             L  QVL+ W+ +     +WE+V   K H+P  +LEDK+  KGEG VM  S  +
Sbjct: 570  KLHSQVLVQWDPSINVPNSWEDVTFIKTHFPHIDLEDKVVLKGEGNVMKMSVGQ 623


>gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1210

 Score =  697 bits (1799), Expect = 0.0
 Identities = 331/527 (62%), Positives = 401/527 (76%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG ENI ADALSRV  MAWS P+SL + +LKQA+  D     L++
Sbjct: 628  WLHKFLGYDFTIEYKPGKENIAADALSRVFFMAWSAPKSLFLHELKQALENDAHLYDLMQ 687

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             C   AN D  Y   D LLYW+ RLV+P  S LI+ +L EYHSSP+GGHAG         
Sbjct: 688  LCFANANPDARYKLHDGLLYWKDRLVLPSPSPLIQKVLLEYHSSPIGGHAGIARTLARIS 747

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP +++DV  +VQQC ICQ+AK   + PAGLLQPLPIP  VW+DVAMDFITGLPNS
Sbjct: 748  SQFYWPKMREDVTRFVQQCIICQQAKVSHSLPAGLLQPLPIPQQVWDDVAMDFITGLPNS 807

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
             GFTVIMVV+DRLSK++HF+P++ D++SK VAE F   IVKLHG+PKSIVSDRD+VFTS+
Sbjct: 808  CGFTVIMVVIDRLSKYSHFVPLKSDFSSKVVAEAFTLHIVKLHGLPKSIVSDRDKVFTSN 867

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQHLFKL GTTLAMSSAYHPQ+DGQSE LNKCLEM+LRCFTFDNPK+W K L WAEYWY
Sbjct: 868  FWQHLFKLQGTTLAMSSAYHPQTDGQSEVLNKCLEMFLRCFTFDNPKSWSKGLTWAEYWY 927

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N+SFHTSLGMTPFKALYGRDPP L R   + +DP +V  Q+T+R+               
Sbjct: 928  NTSFHTSLGMTPFKALYGRDPPTLTRYQRSPTDPRDVQDQLTKRDQLLDQLKCNLTKAQQ 987

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
            RMK QADK+R D+QF++GD VLVKLQPYRQHS+ LRK+ KL MRYFGPF+++ ++G VAY
Sbjct: 988  RMKHQADKKRSDMQFQVGDQVLVKLQPYRQHSVVLRKHQKLSMRYFGPFKVLGKVGVVAY 1047

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KL+LP TA+IH VFH+SQLK F+G S+ PY+PLPLT++ELGP L P A+L  RTI  G  
Sbjct: 1048 KLELPETARIHPVFHISQLKPFKGISNAPYMPLPLTTSELGPFLQPVAILHARTILQGSK 1107

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVM 1584
            L+ QVL+ W+ ++    +WE+V   K H+P  NLEDK+  KGEG VM
Sbjct: 1108 LLSQVLVQWDPSSNVPNSWEDVTFIKTHFPYLNLEDKVVLKGEGNVM 1154


>gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1258

 Score =  698 bits (1802), Expect = 0.0
 Identities = 338/576 (58%), Positives = 425/576 (73%), Gaps = 19/576 (3%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLH+F+GYD +IEYKPG EN+ ADALSRV+ MAWSEPQ  L+ +++ A+ QD T   +++
Sbjct: 683  WLHRFLGYDFSIEYKPGKENVAADALSRVMTMAWSEPQYKLLHQIRAALKQDSTLLGIMQ 742

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             C +   T++ YT +D+LL+W+ R+V+P+ S LIK +LYE H+SP+GGHAG         
Sbjct: 743  KCVQNNATNSHYTVKDELLFWKHRIVIPKNSELIKQVLYELHTSPIGGHAGMARTLARVK 802

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP +K D+  YVQ CAICQ+AKT    PAGLLQPLPIP+ VWEDVAMDFITGLP+S
Sbjct: 803  SQFYWPDMKTDIADYVQNCAICQKAKTTNTLPAGLLQPLPIPSQVWEDVAMDFITGLPSS 862

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
             G+T I+VV+DRL+K+AHFIP++ DY+SK VAE  M  IVKLHG+PKSIVSDRD+VFTSS
Sbjct: 863  QGYTTILVVIDRLTKYAHFIPLKTDYSSKIVAEAVMDNIVKLHGMPKSIVSDRDKVFTSS 922

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQ LFKL GTTLAMSSAYHPQSDGQSE LNK LE++LRCFTFDNPK+W KAL+W+E+WY
Sbjct: 923  FWQQLFKLQGTTLAMSSAYHPQSDGQSEVLNKTLELFLRCFTFDNPKSWCKALSWSEFWY 982

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGRDPP L+R     +DPP + +++ ER+               
Sbjct: 983  NTAFQTSIGMTPFKALYGRDPPALIRYETQANDPPTLQEKLMERDRIIQQLKLNLEKAQQ 1042

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R DV+ ++GDLVLVKLQPYRQ S+ALRKN KLGMRYFGPFE+IA++G VAY
Sbjct: 1043 YMKKQADKHRVDVKLQVGDLVLVKLQPYRQQSVALRKNQKLGMRYFGPFEVIAKVGEVAY 1102

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+GD+ E Y+PLPL+ T+ GP++ P +VL  RTI  G  
Sbjct: 1103 KLKLPEHAKIHPVFHVSQLKPFKGDNQEQYMPLPLSMTDTGPMIQPVSVLATRTIIRGAQ 1162

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMN----------ES 1593
             ++QVLI W+  +   ATWE+V+  +  +P FNLEDK+ F G+GIVM+          ES
Sbjct: 1163 RIQQVLIQWDQYSTAEATWEDVDALQSKFPAFNLEDKVAFIGDGIVMSPMEENILQEGES 1222

Query: 1594 AAEVENEM---------PRRSMRARKASVKLNDYCV 1674
            A E  N+M         PRR  R RK S +L  Y +
Sbjct: 1223 AKEGLNDMHERNSVMMGPRRGKRVRKTSKRLEGYAL 1258


>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score =  699 bits (1804), Expect = 0.0
 Identities = 334/556 (60%), Positives = 413/556 (74%)
 Frame = +1

Query: 1    AWLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLL 180
            AWLHKF+GYD  IEYKPG +N  ADALSR+ M+AWSEP S+ + +L+  ++ D   K+L+
Sbjct: 978  AWLHKFLGYDFKIEYKPGKDNQAADALSRMFMLAWSEPHSIFLEELRARLISDPHLKQLM 1037

Query: 181  EACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXX 360
            E   +GA+  + YT R+ LLYW+ R+V+P    ++  IL EYHSSP+GGHAG        
Sbjct: 1038 ETYKQGADA-SHYTVREGLLYWKDRVVIPAEEEIVNKILQEYHSSPIGGHAGITRTLARL 1096

Query: 361  XXQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPN 540
              QF+WP +++DVK Y+Q+C ICQ+AK+    PAGLLQPLPIP  VWEDVAMDFITGLPN
Sbjct: 1097 KAQFYWPKMQEDVKAYIQKCLICQQAKSNNTLPAGLLQPLPIPQQVWEDVAMDFITGLPN 1156

Query: 541  SHGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTS 720
            S G +VIMVV+DRL+K+AHFIP++ DY SK VAE FMS IVKLHG+P+SIVSDRDRVFTS
Sbjct: 1157 SFGLSVIMVVIDRLTKYAHFIPLKADYNSKVVAEAFMSHIVKLHGIPRSIVSDRDRVFTS 1216

Query: 721  SFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYW 900
            +FWQHLFKL GTTLAMSSAYHPQSDGQSE LNKCLEMYLRCFT+++PK W KAL WAE+W
Sbjct: 1217 TFWQHLFKLQGTTLAMSSAYHPQSDGQSEVLNKCLEMYLRCFTYEHPKGWVKALPWAEFW 1276

Query: 901  YNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXX 1080
            YN+++H SLGMTPF+ALYGR+PP L R   +  DP EV +Q+T+R+A             
Sbjct: 1277 YNTAYHMSLGMTPFRALYGREPPTLTRQACSIDDPAEVREQLTDRDALLAKLKINLTRAQ 1336

Query: 1081 XRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVA 1260
              MK QADK+R DV F+IGD VLVKLQPYRQHS  LRKN KL MRYFGPF+++A+IG VA
Sbjct: 1337 QVMKRQADKKRLDVSFQIGDEVLVKLQPYRQHSAVLRKNQKLSMRYFGPFKVLAKIGDVA 1396

Query: 1261 YKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQ 1440
            YKL+LPS A+IH VFHVSQLK F G + +PYLPLPLT TE+GP++ P  +L  R I  G 
Sbjct: 1397 YKLELPSAARIHPVFHVSQLKPFNGTAQDPYLPLPLTVTEMGPVMQPVKILASRIIIRGH 1456

Query: 1441 TLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAEVENEMP 1620
              +EQ+L+ WE+  +D ATWE++E+ K  YPTFNLEDK+ FKGEG V N  +   +    
Sbjct: 1457 NQIEQILVQWENGLQDEATWEDIEDIKASYPTFNLEDKVVFKGEGNVTNGMSRGEKVNNT 1516

Query: 1621 RRSMRARKASVKLNDY 1668
              S   R    KL D+
Sbjct: 1517 AESSSERGLHNKLADF 1532


>dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum]
          Length = 1531

 Score =  687 bits (1774), Expect = 0.0
 Identities = 330/574 (57%), Positives = 422/574 (73%), Gaps = 19/574 (3%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLH+F+GYD TIEYKPG EN+ ADALSRV+ +AWSEPQ  L+ +++ A+ QD T  +++E
Sbjct: 956  WLHRFLGYDFTIEYKPGKENVAADALSRVMTLAWSEPQYKLLHQIRVALKQDSTLLEIME 1015

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             CA+ +++++ YT +DDLL+W+ R+V+P+ S L + +LYE H+SP+GGHAG         
Sbjct: 1016 KCAQNSDSNSNYTIKDDLLFWKHRIVIPKHSELRQQVLYELHTSPIGGHAGIARTLARVK 1075

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+W  +K D+  YVQ C ICQ+AKT   PPAGLLQPLPIP+ VWEDVAMDFITGLP+S
Sbjct: 1076 AQFYWLDMKTDIAKYVQNCVICQKAKTTNTPPAGLLQPLPIPSQVWEDVAMDFITGLPSS 1135

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
            HG+T I+VV+DRL+K+AHFIP++ DY+SK VAE FM  IVKLHG+PKSIVSDRD+VFTSS
Sbjct: 1136 HGYTTILVVIDRLTKYAHFIPLKTDYSSKIVAEAFMDNIVKLHGMPKSIVSDRDKVFTSS 1195

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQ LFKL GT+LAMSSAYHPQSDGQSE LNK LE++LRCFTF+NPK+W KALAW+E+WY
Sbjct: 1196 FWQQLFKLQGTSLAMSSAYHPQSDGQSEVLNKTLELFLRCFTFENPKSWCKALAWSEFWY 1255

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGRDPP ++R  I  SD P + +++ ER+               
Sbjct: 1256 NTAFQTSIGMTPFKALYGRDPPAIIRYEIQASDSPTLQEKLMERDRIIQQLKLNLEKAQQ 1315

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R DV+ ++GD VLVKLQPYRQ S+ALRKN KLGM+YFGPFE+IA++G VAY
Sbjct: 1316 YMKKQADKHRVDVKLQVGDWVLVKLQPYRQQSVALRKNQKLGMKYFGPFEVIAKVGEVAY 1375

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+GD+ E Y+PLPL+ T++GP++ P AVL  RTI     
Sbjct: 1376 KLKLPDHAKIHPVFHVSQLKPFKGDNQEQYMPLPLSMTDIGPMIQPVAVLATRTIIRCAQ 1435

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAEVENE--- 1614
             ++QVLI W+      ATWE++   ++ +PTFNLEDK+ F G+GIVM+ +   +  E   
Sbjct: 1436 RIQQVLIQWDQYPIAEATWEDMVALQRKFPTFNLEDKVAFIGDGIVMSPNEENILEEGDS 1495

Query: 1615 ----------------MPRRSMRARKASVKLNDY 1668
                             PRR  R R  S +L  Y
Sbjct: 1496 SNVGPPDKHEGNYVMMGPRRGKRMRNISKRLEGY 1529


>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score =  686 bits (1770), Expect = 0.0
 Identities = 328/537 (61%), Positives = 404/537 (75%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG EN+ ADALSR++ ++WSEP+   + +++ A+  D   +++L 
Sbjct: 956  WLHKFLGYDFTIEYKPGKENMAADALSRIMTLSWSEPKCQFIEQIRVALQNDNQMREILM 1015

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             C  G      Y+ RD LLYW+ RLV+P+ + L+  +L+E+H+SP+GGHAG         
Sbjct: 1016 KCNAG-KAPVQYSMRDGLLYWKQRLVIPKDNDLLYKVLFEFHTSPIGGHAGITRTMARIK 1074

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP +KQD+  YVQ+C +CQ+AKT    PAGLLQPLPIP+ VWED+AMDFITGLP S
Sbjct: 1075 SQFYWPDMKQDIIDYVQKCMVCQQAKTTNTSPAGLLQPLPIPSQVWEDIAMDFITGLPLS 1134

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
             G+T IMVVVDRL+K+AHFIPM+ DYTSKSVAE FM  IVKLHG+PKSIVSDRD+VFTS+
Sbjct: 1135 SGYTTIMVVVDRLTKYAHFIPMKSDYTSKSVAESFMHNIVKLHGMPKSIVSDRDKVFTSA 1194

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQ LFKL GT+LAMSSAYHPQSDGQ+E LNK LE++LRCFTF NPK+W K L+WAEYWY
Sbjct: 1195 FWQQLFKLQGTSLAMSSAYHPQSDGQTEVLNKALELFLRCFTFHNPKSWSKVLSWAEYWY 1254

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGRDPP L +     +DPP + +++ ER+               
Sbjct: 1255 NTAFQTSIGMTPFKALYGRDPPYLTKYEAQVTDPPALQEELMERDKILQQLKSNLDRAQQ 1314

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R+DV F++GDLVLVKLQPYRQ S+ALRKN KLGMRYFGPFEIIA IGAVAY
Sbjct: 1315 YMKKQADKHRKDVTFQVGDLVLVKLQPYRQQSVALRKNQKLGMRYFGPFEIIACIGAVAY 1374

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+G +S+ YLPLPLT TE GPI+ P AVLQ RTI  G  
Sbjct: 1375 KLKLPDNAKIHPVFHVSQLKPFKGAASDQYLPLPLTMTETGPIMQPIAVLQARTIMRGTQ 1434

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAEVENE 1614
             V Q+L+ W+ N E  ATWE+ ++ +  +PT NLEDK+ F GEGIVM  +   +  E
Sbjct: 1435 RVHQILVQWDTNAEAEATWEDFDDLQLKFPTLNLEDKVVFNGEGIVMRPNTTNLLEE 1491


>gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1240

 Score =  675 bits (1742), Expect = 0.0
 Identities = 320/527 (60%), Positives = 400/527 (75%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD  IEYKPG EN+ ADALSR++ +AWSEPQ     ++K+AI QD    ++++
Sbjct: 684  WLHKFLGYDFVIEYKPGKENLAADALSRLMTLAWSEPQYNFTQQVKEAIQQDDNLLEIIQ 743

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             C +G    T YT R+ +LYW+ R+V+P  +ALI+ IL E+H+SP+GGHAG         
Sbjct: 744  KCLQGL-APTNYTVREGILYWKHRMVIPPKAALIQQILEEFHTSPIGGHAGMTRTLARIK 802

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+W ++K+D+  YVQ C +CQ+AKT    PAGLLQPLPIP+ VWED+AMDFITGLP S
Sbjct: 803  SQFYWSAMKKDIFDYVQNCLVCQQAKTTNTLPAGLLQPLPIPSQVWEDIAMDFITGLPLS 862

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
             G+T IMVVVDRL+K+AHFI M+ DYTSKSVAE FM  +VKLHG+PKSIVSDRD+VFTS+
Sbjct: 863  FGYTTIMVVVDRLTKYAHFIAMKTDYTSKSVAEAFMHNVVKLHGMPKSIVSDRDKVFTST 922

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQHLFKL GTTLAM+SAYHPQSDGQ+E LNK LE+YLRCF+F+NPK+W+K L+W+E+WY
Sbjct: 923  FWQHLFKLQGTTLAMTSAYHPQSDGQTEVLNKGLELYLRCFSFNNPKSWFKMLSWSEFWY 982

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGRDPP L R +   SDPP + +++ ER+               
Sbjct: 983  NTAFQTSIGMTPFKALYGRDPPYLTRYVAQASDPPTLQEELMERDKILQQLKDNLIRAQQ 1042

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R D+  +IGDLVLVKLQPYRQHS+ALRKN KLG+RYFGPFEIIAR+G VAY
Sbjct: 1043 YMKKQADKHRSDISLKIGDLVLVKLQPYRQHSVALRKNQKLGLRYFGPFEIIARVGEVAY 1102

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+G + E YLPLPLT T++GP + P  VLQ RT+  G  
Sbjct: 1103 KLKLPDDAKIHPVFHVSQLKPFKGVADEQYLPLPLTMTDIGPSIQPIDVLQVRTVIRGSQ 1162

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVM 1584
             + QVLI W+      ATWE++   ++ +P+ NLEDK+ F G+GIVM
Sbjct: 1163 QIHQVLIQWDQYPAAQATWEDITTIQEKFPSLNLEDKVAFNGDGIVM 1209


>gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 937

 Score =  665 bits (1716), Expect = 0.0
 Identities = 316/523 (60%), Positives = 391/523 (74%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG EN+ ADALSR++ +AWSEPQ   + ++K A+  D    +++ 
Sbjct: 416  WLHKFLGYDFTIEYKPGKENMAADALSRLMTLAWSEPQCQFIEQVKLALQNDNQMMEIML 475

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             CA G      YT R+ LLYW+ RLV+P+ + L+  +LYE+H+SP+GGHAG         
Sbjct: 476  KCASG-KAPIQYTMREGLLYWKQRLVIPKQNELLHKVLYEFHTSPIGGHAGITRTMARIK 534

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP +K+D+  YVQ C +CQ+AKT    PAGLLQPLPIP+ VWED+AMDFITGLP S
Sbjct: 535  SQFYWPDMKKDILEYVQNCVVCQQAKTTNTSPAGLLQPLPIPSQVWEDIAMDFITGLPLS 594

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
            +G+T IMVVVDRL+K+AHFIPMR DYTS+SVAE FM  IVKLHG+PKSIVSDRD+VFTS+
Sbjct: 595  YGYTTIMVVVDRLTKYAHFIPMRTDYTSRSVAEAFMHNIVKLHGMPKSIVSDRDKVFTSA 654

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQ LFKL GT+LAMSSAYHPQSDGQ+E LNK LE++LRCF+F NPK+WYK L+WAEYWY
Sbjct: 655  FWQQLFKLQGTSLAMSSAYHPQSDGQTEVLNKGLELFLRCFSFHNPKSWYKVLSWAEYWY 714

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGRDPP L +     +D P + +++ ER+               
Sbjct: 715  NTAFQTSIGMTPFKALYGRDPPYLTKYEAQVTDSPALQEELMERDKILQQLKINLERAQQ 774

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R +V  ++GDLVLVKLQPYRQ S++LRKN KLGMRYFGPFEIIAR+G VAY
Sbjct: 775  YMKKQADKHRSEVNLQVGDLVLVKLQPYRQQSVSLRKNQKLGMRYFGPFEIIARVGNVAY 834

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+G + + YLPLPLT +E GPI+ P A L+ RTI  G  
Sbjct: 835  KLKLPDNAKIHPVFHVSQLKPFKGIAQDQYLPLPLTMSETGPIIQPIAALEARTIMRGMQ 894

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGE 1572
             V Q+L+ W+      ATWE+++  +  +PT NLEDKI F GE
Sbjct: 895  KVHQILVQWDQMPVTEATWEDLDVLQDKFPTLNLEDKIAFNGE 937


>dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subterraneum]
          Length = 1451

 Score =  679 bits (1751), Expect = 0.0
 Identities = 316/534 (59%), Positives = 408/534 (76%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD  IEYKPG EN+ ADALSRV+ +AWSEP S ++ ++K  I  D     +++
Sbjct: 900  WLHKFLGYDFVIEYKPGKENLAADALSRVMTLAWSEPISQIIIQIKDEIKADTYWSDIMD 959

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             C    N+   YT R+ +LYW+ R+V+P+ SALI+ +L+E+HSSP+GGHAG         
Sbjct: 960  KCKSQGNSYLQYTLREGVLYWKNRVVIPKKSALIQQVLHEFHSSPIGGHAGFTRTLARIK 1019

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+W ++K+DV  Y+Q CAICQ+AK     PAGLLQPLPIP+ VWED+AMDFITGLP S
Sbjct: 1020 SQFYWIAMKKDVLEYIQNCAICQQAKHTNTLPAGLLQPLPIPSQVWEDIAMDFITGLPLS 1079

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
            +G+T I+VV+DRL+K+AHF+PM+ DY+SKSVAE+FM+ IVKLHG+PKSIVSDRD+VFTSS
Sbjct: 1080 YGYTTILVVIDRLTKYAHFLPMKTDYSSKSVAEVFMNNIVKLHGMPKSIVSDRDKVFTSS 1139

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQHLFKL GT+LAMSSAYHPQSDGQ+E LNK LE++LRCFTF+NPK+WYKALAW+E+WY
Sbjct: 1140 FWQHLFKLQGTSLAMSSAYHPQSDGQTEVLNKGLELFLRCFTFNNPKSWYKALAWSEFWY 1199

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++ HTS+GMTPFKALYGR+PP L R  + ++DPP + +++ ER+               
Sbjct: 1200 NTALHTSIGMTPFKALYGREPPTLTRYEVQDNDPPALQEELMERDRILQQLKSNLERAQQ 1259

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R + +F +GD+VLVKLQPYRQ S+ALRKN KLGMRYFGPFEIIA +G VAY
Sbjct: 1260 YMKKQADKHRVEFKFHLGDMVLVKLQPYRQQSVALRKNQKLGMRYFGPFEIIACVGKVAY 1319

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+G   + Y+PLPLT  + GP++ P  VLQ RTI  G  
Sbjct: 1320 KLKLPDHAKIHLVFHVSQLKPFKGVPQQQYMPLPLTMFDNGPMIQPVEVLQARTIMQGTQ 1379

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAEV 1605
             + Q+L+ W+  +   ATWENV++ ++++P +NLEDK+ FKG+GIVM     ++
Sbjct: 1380 KIHQILVQWDQYDIAEATWENVDDLQKNFPLYNLEDKVIFKGDGIVMRPKGEDI 1433


>gb|PNY14541.1| hypothetical protein L195_g011223 [Trifolium pratense]
          Length = 763

 Score =  656 bits (1692), Expect = 0.0
 Identities = 313/527 (59%), Positives = 391/527 (74%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG EN+ ADALSR++ ++WSEPQ  +V +++ A+  D    +++ 
Sbjct: 181  WLHKFLGYDFTIEYKPGKENMAADALSRLMTLSWSEPQCQIVEQIRAALKNDQNMVEIML 240

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
                G      YT RD LLYW+ RLV+P+ + L+  ILY++H+SP+GGHAG         
Sbjct: 241  KYVSG-KAPIQYTMRDGLLYWKQRLVIPKNNDLLHKILYKFHTSPIGGHAGITRTMSRIT 299

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP + +D+  YVQ+C ICQ+AKT+   PAGLLQPLPIP+ VWED+AMDFITGLP S
Sbjct: 300  SQFYWPDMNKDIWDYVQKCVICQQAKTVHTSPAGLLQPLPIPSQVWEDIAMDFITGLPLS 359

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
            + +T IMVVVDRL+K+AHFIPMR DYTSKSVAE F+  IVKLHG+ KSIVSDRD+VFTS+
Sbjct: 360  YSYTTIMVVVDRLTKYAHFIPMRSDYTSKSVAEAFIHNIVKLHGMSKSIVSDRDKVFTSN 419

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQ LFKL GT+L MSSAYHPQSDGQ+E LNK LE++LRCF+F+NPK+WYK L+WAEYWY
Sbjct: 420  FWQQLFKLQGTSLTMSSAYHPQSDGQTEVLNKGLELFLRCFSFNNPKSWYKVLSWAEYWY 479

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGR+PP L +      D P + +++ ER+               
Sbjct: 480  NTTFQTSIGMTPFKALYGREPPSLTKYEAHADDSPTIQEELMERDKILQQLKTNLDRAQQ 539

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R +V  ++G+LVLVKLQPYRQ S+ALRKN KLGMRYFGPFEIIARIG VAY
Sbjct: 540  YMKKQADKNRTEVNLQVGELVLVKLQPYRQQSVALRKNQKLGMRYFGPFEIIARIGKVAY 599

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+G + + YLPLPLT +E+GPI+ P ++L  RTI     
Sbjct: 600  KLKLPDNAKIHPVFHVSQLKPFKGTTQDQYLPLPLTMSEVGPIIQPVSILDARTIVRESQ 659

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVM 1584
             V Q+LI W+       TWE+ ++ +  +PT NLEDKI F GEGIVM
Sbjct: 660  KVHQILIQWDQTTPAETTWEDFDDLQNKFPTLNLEDKIVFNGEGIVM 706


>dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subterraneum]
          Length = 1479

 Score =  679 bits (1751), Expect = 0.0
 Identities = 320/517 (61%), Positives = 399/517 (77%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG +N+ ADALSR++ M WSEPQ   + ++K A+ QD T  ++++
Sbjct: 955  WLHKFLGYDFTIEYKPGKDNLAADALSRMMCMGWSEPQCSWIQQIKTALQQDTTLMEIIQ 1014

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             C +G  T+T YT RD LLYW+ R+V+P   AL++ +L E+HSSP+GGHAG         
Sbjct: 1015 QCEQG-QTNTHYTVRDGLLYWKHRIVIPVDDALLQQVLKEFHSSPIGGHAGMTRTLARIQ 1073

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP++KQD+  YVQ C +CQ+ KT    PAGLLQPLPIP+ +WED+AMDFITGLP+S
Sbjct: 1074 AQFYWPNMKQDIVQYVQNCLVCQQTKTTNTLPAGLLQPLPIPSQIWEDLAMDFITGLPSS 1133

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
            HG+TVI+VVVDRL+K+AHFIPM+ DY+SKSVAE+FM  +VKLHG+PKSIVSDRD+VFTS+
Sbjct: 1134 HGYTVILVVVDRLTKYAHFIPMKTDYSSKSVAEVFMQNVVKLHGLPKSIVSDRDKVFTSA 1193

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQ LFKL GTTLAMSSAYHPQSDGQSE LNK LE+YLRCF+F+NPK W K L+W+E+WY
Sbjct: 1194 FWQQLFKLQGTTLAMSSAYHPQSDGQSEVLNKTLELYLRCFSFNNPKAWSKMLSWSEFWY 1253

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKA+YGRDPP L R +  +SDPP +  ++ ER+               
Sbjct: 1254 NTAFQTSIGMTPFKAVYGRDPPYLNRYVAQDSDPPTLRAELMERDTILQQLKNNLLKAQQ 1313

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK R ++QF+IGDLVLVKLQPYRQHS+ALRKN KLG+RYFGPFEIIA++GAVAY
Sbjct: 1314 YMKKQADKHRIELQFQIGDLVLVKLQPYRQHSVALRKNQKLGLRYFGPFEIIAKVGAVAY 1373

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFHVSQLK F+G ++E Y PLPLT TE+GP+  P  VLQ RTI  G  
Sbjct: 1374 KLKLPDYAKIHPVFHVSQLKPFKGFTNEQYFPLPLTMTEIGPLSQPIDVLQVRTIIKGTK 1433

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDK 1554
             V QVLI W+    + A+WE+ ++    +P FNLEDK
Sbjct: 1434 KVHQVLIQWDQYPREEASWEDADDVLNKFPAFNLEDK 1470


>gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium pratense]
          Length = 869

 Score =  659 bits (1699), Expect = 0.0
 Identities = 323/575 (56%), Positives = 415/575 (72%), Gaps = 19/575 (3%)
 Frame = +1

Query: 1    AWLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLL 180
            AW+HKFIGYD TIEYKPG +N+ ADALSRV +MAWSEP+ + + ++++    D   + L+
Sbjct: 295  AWIHKFIGYDFTIEYKPGKDNVAADALSRVCLMAWSEPEIVFLDEVRRCTENDSQLQGLI 354

Query: 181  EACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXX 360
                        +  ++ L+YW  ++V+P    L   +L E+HSSPVGGHAG        
Sbjct: 355  NTSDPVHGHQ--FVRKNGLVYWNNKIVLPDDKNLKTKLLLEFHSSPVGGHAGIARTIARI 412

Query: 361  XXQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPN 540
              QF W ++KQD+K +VQ C ICQ+AK  T  PAGLLQPLPIP  VWED+AMDFITGLP 
Sbjct: 413  AAQFFWKNMKQDIKLFVQNCLICQQAKHDTRAPAGLLQPLPIPEQVWEDIAMDFITGLPP 472

Query: 541  SHGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTS 720
            S+G+TVIMVV+DRL+K++HF P++ DY SK+VAE+FM  +VKLHG+PKSIVSDRD+VF S
Sbjct: 473  SNGYTVIMVVIDRLTKYSHFSPLKIDYNSKTVAEVFMKTVVKLHGLPKSIVSDRDKVFIS 532

Query: 721  SFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYW 900
             FW+ LF+L GTTL+MSSAYHPQ+DGQSE+LNKCLEMYLRC TF NPK+W+KAL WAEYW
Sbjct: 533  KFWKELFQLQGTTLSMSSAYHPQTDGQSEALNKCLEMYLRCLTFQNPKSWFKALDWAEYW 592

Query: 901  YNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXX 1080
            YN+++H SLGMTPF+ALYGR PP LVR   + +D  +V +Q+ ER+              
Sbjct: 593  YNTAYHNSLGMTPFQALYGRTPPTLVRYTHSPTDTLDVQQQLMERDRLIATLKDNLKRAQ 652

Query: 1081 XRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVA 1260
              MK QADK RRD QFE+G+ VLVKLQPYRQ+S+ALRKN KLGMRYFGPF II ++G VA
Sbjct: 653  QIMKNQADKHRRDAQFEVGEQVLVKLQPYRQNSVALRKNQKLGMRYFGPFTIIEKVGKVA 712

Query: 1261 YKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQ 1440
            YK++LP  AKIH VFH+SQLK+F+G +++PY+PLPLT+ ELGPIL P AVLQ+R I   +
Sbjct: 713  YKVQLPVEAKIHPVFHISQLKQFKGRATDPYIPLPLTTHELGPILQPIAVLQRRDIVRNE 772

Query: 1441 TLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAE------ 1602
              ++QVLI WE  N+  ATWE+V+E  ++YP FNLEDK++ KG+GI M E   +      
Sbjct: 773  HAIQQVLIKWEGLNDTDATWEDVDEITENYPNFNLEDKVEVKGKGIAMEEPRQQKGQVSK 832

Query: 1603 -VENE----------MP--RRSMRARKASVKLNDY 1668
             +ENE          MP  R+ +R R  S+KL D+
Sbjct: 833  ILENEGATKSVSAPQMPGMRKGVRPRAPSIKLRDF 867


>dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum]
          Length = 1512

 Score =  678 bits (1750), Expect = 0.0
 Identities = 325/534 (60%), Positives = 401/534 (75%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLH+F+GYD +IEYKPG +N+ ADALSRV+ +AWSEPQ  L+ ++K    QD T  K+  
Sbjct: 955  WLHRFLGYDFSIEYKPGKDNVVADALSRVMTLAWSEPQFRLLHQIKAIQKQDPTLVKIRM 1014

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
             CA+ +   + YT +DDLL+W+ R+V+P   ALI+ +LYE H+SP+GGHAG         
Sbjct: 1015 ECAQHSQPGSHYTIKDDLLFWKQRIVIPHDGALIQQVLYELHTSPLGGHAGITRTVARVK 1074

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+W  +K+D+  YVQ C ICQ+AKT    PAGLLQPLPIP+ VWEDVAMDFITGLP S
Sbjct: 1075 AQFYWSDMKKDIVEYVQNCEICQKAKTANTLPAGLLQPLPIPSQVWEDVAMDFITGLPLS 1134

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
            HG+TVI+VV+DRL+K+AHFIP++ DYTSK VAE FM  IVKLHG+PKSIVSDRD+VFTS+
Sbjct: 1135 HGYTVILVVIDRLTKYAHFIPLKTDYTSKIVAEAFMHHIVKLHGMPKSIVSDRDKVFTSN 1194

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQ LFKL GT+LAMSSAYHPQSDGQSE LN+ LE++LRCFTF+NPK WYKAL+W+E+WY
Sbjct: 1195 FWQQLFKLQGTSLAMSSAYHPQSDGQSEVLNRTLELFLRCFTFNNPKAWYKALSWSEFWY 1254

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGRDPP LVR      DPP + +++  R+               
Sbjct: 1255 NTAFQTSIGMTPFKALYGRDPPTLVRYEAQAGDPPALQEELMGRDKLLQQLKSNLERAQQ 1314

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK RRD++ ++GDLVLVKLQPYRQ SLALRKN KLGMRYFGPFEI+A++G VAY
Sbjct: 1315 YMKRQADKHRRDIKLQVGDLVLVKLQPYRQQSLALRKNQKLGMRYFGPFEILAKVGEVAY 1374

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KLKLP  AKIH VFH+SQLK F+G S +  LPLPLT ++ GP++ P AVL  RTI  G  
Sbjct: 1375 KLKLPDHAKIHPVFHISQLKPFKGISQDQSLPLPLTMSDTGPLIQPIAVLAARTILKGIQ 1434

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAEV 1605
             V QVLI W+   E  ATWE V   +  +P FNLEDK+ FKG+GIVM+    +V
Sbjct: 1435 KVHQVLIQWDQYPEAEATWEEVTNLQSKFPYFNLEDKVVFKGDGIVMSPKEGKV 1488


>gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1502

 Score =  671 bits (1731), Expect = 0.0
 Identities = 315/534 (58%), Positives = 404/534 (75%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG EN+ ADALSRV++MAWSEP+  L+ ++++A+  D   +++++
Sbjct: 959  WLHKFLGYDFTIEYKPGKENMAADALSRVMVMAWSEPKWQLLDQVRRALENDNQLREVMQ 1018

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
              A G      Y+ R+ LLYW+ RLV+P+   L++ +L+E+H+SP+GGHAG         
Sbjct: 1019 NYAIG-KAPVQYSMREGLLYWKQRLVIPKNDDLLQKLLFEFHTSPIGGHAGITRTIARIK 1077

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP +K+D+  YV +C +CQ+AKT    PAGLLQPLPIP+ VWED+AMDFITGLP S
Sbjct: 1078 AQFYWPDMKKDIAEYVHKCVVCQQAKTTNTSPAGLLQPLPIPSQVWEDIAMDFITGLPLS 1137

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
            +G+T IMVVVDRL+K AHFIPM+ DYTSK+VAE FM  IVKLHG+PKSIVSDRD+VFTS+
Sbjct: 1138 YGYTTIMVVVDRLTKSAHFIPMKTDYTSKTVAEAFMHNIVKLHGMPKSIVSDRDKVFTSA 1197

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQHLFK+ GT+LAMSSAYHPQ+DGQ+E LNK LE++LRCFTF NPK+W+K ++WAEYWY
Sbjct: 1198 FWQHLFKMQGTSLAMSSAYHPQTDGQTEVLNKTLELFLRCFTFHNPKSWFKVMSWAEYWY 1257

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N++F TS+GMTPFKALYGRDPP L +  +   DPP + +++ ER+               
Sbjct: 1258 NTAFQTSIGMTPFKALYGRDPPYLTKYEVQVDDPPALREELMERDQILQQLKTNLERAQQ 1317

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK RR+V F++GDLVLVKLQPY+Q S+ALRKN KLGMRYFGPFE+IA +G VAY
Sbjct: 1318 YMKQQADKHRREVSFKVGDLVLVKLQPYKQQSVALRKNQKLGMRYFGPFEVIACVGKVAY 1377

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KL+LP  AKIH VFHVSQLK F G S E YLPLPLT ++ GPI  P  +LQ RTI  G  
Sbjct: 1378 KLQLPENAKIHPVFHVSQLKPFHGTSQEQYLPLPLTMSDTGPIFQPATILQARTIVRGNK 1437

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESAAEV 1605
             V Q+ I W+ N+ + A+WE+++E +  +P  NLEDK+ FKGEGIVM  +   +
Sbjct: 1438 KVHQLQIQWDLNSPEEASWEDLDELQNKFPNINLEDKVVFKGEGIVMRPNNTNI 1491


>dbj|GAU29525.1| hypothetical protein TSUD_115470 [Trifolium subterraneum]
          Length = 1556

 Score =  665 bits (1716), Expect = 0.0
 Identities = 327/583 (56%), Positives = 403/583 (69%), Gaps = 27/583 (4%)
 Frame = +1

Query: 1    AWLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLL 180
            AWLHKFIG+D TIEYKPG EN+  DALSR+ ++AWSEP    + +L+  I QD   K+++
Sbjct: 971  AWLHKFIGFDFTIEYKPGKENLAVDALSRINLLAWSEPHYSFLDELRLEIQQDTHLKEII 1030

Query: 181  EACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXX 360
            + C   +  D  YT +D+LL+WQ RLV+P+ S LI+ IL E+HSSP+GGH+G        
Sbjct: 1031 QQCLNNSCADANYTVKDNLLFWQQRLVIPENSTLIQKILLEFHSSPIGGHSGITRTMARI 1090

Query: 361  XXQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPN 540
              QF+W  ++Q +  +++ C ICQ+AKT T  PAGLL PLPIP LVW D+AMDFITGLP 
Sbjct: 1091 ASQFYWTHMRQHIVAFIKHCVICQQAKTTTTTPAGLLAPLPIPTLVWADIAMDFITGLPP 1150

Query: 541  SHGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTS 720
            S+GFTVI+VV+DRL+K+AHF P++ D+ SK VAE+FM  IVKLHG+P SIVSDRD+VFTS
Sbjct: 1151 SNGFTVILVVIDRLTKYAHFFPLKSDFDSKKVAEIFMQNIVKLHGMPSSIVSDRDKVFTS 1210

Query: 721  SFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYW 900
            +FW+HLFKL GTTLAM+SAYHPQSDGQSE LNKC+EMYLRC TFDNP  W KAL W E+W
Sbjct: 1211 AFWRHLFKLHGTTLAMTSAYHPQSDGQSEVLNKCVEMYLRCMTFDNPTKWSKALPWTEFW 1270

Query: 901  YNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXX 1080
            YN+S+HTS  MTPFKALYG DPPQLVR   T  D P++  Q+ ERE              
Sbjct: 1271 YNTSYHTSAAMTPFKALYGSDPPQLVRTKGTTDDHPDLQTQLAEREDLLSQLQVNLHKAQ 1330

Query: 1081 XRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVA 1260
              MK QADK+R  V+F++ D VLVKLQPYRQ S+ALRK+ KLG+RYFGPF I+A++G VA
Sbjct: 1331 QAMKFQADKKRHHVEFKVDDQVLVKLQPYRQSSVALRKHQKLGLRYFGPFPIVAKVGVVA 1390

Query: 1261 YKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQ 1440
            Y+L LPST KIH VFHVSQLK F G    PY+PLPLT++ELGPIL P A+L  R I  G 
Sbjct: 1391 YRLGLPSTTKIHPVFHVSQLKLFHGPHIPPYMPLPLTTSELGPILQPEALLDSRLIMRGN 1450

Query: 1441 TLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVM----------NE 1590
            T + QVLI WE      ATWE++ EFK  +P FNLEDK+   G  IV           NE
Sbjct: 1451 TPISQVLISWEGLETADATWEDLVEFKLAHPNFNLEDKVTLNGGSIVRDPIINMDSQNNE 1510

Query: 1591 SAAEVENEMP-----------------RRSMRARKASVKLNDY 1668
            SA + E+ +                  RRS R R  S +L  Y
Sbjct: 1511 SAEDNESALKDEAEASNSVMNPDIVGLRRSTRKRIESTRLEGY 1553


>gb|PNX93307.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 551

 Score =  629 bits (1623), Expect = 0.0
 Identities = 315/574 (54%), Positives = 395/574 (68%), Gaps = 19/574 (3%)
 Frame = +1

Query: 1    AWLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLL 180
            AWLHKF+GYD  IEYKPG EN  ADALSR+  ++WSEP SL + +L+  +  +   K L+
Sbjct: 12   AWLHKFLGYDFKIEYKPGKENQAADALSRMFALSWSEPHSLFLEELRNKLAANDHLKLLM 71

Query: 181  EACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXX 360
              C + A  D  YT R+ LLYW+ RLV+P   ALI+ IL EYHSSP+GGHAG        
Sbjct: 72   IDC-QNATADKHYTVREGLLYWKDRLVIPVEEALIQRILPEYHSSPIGGHAGITRTMARL 130

Query: 361  XXQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPN 540
              QF+WP +++ V+ YVQ C ICQ+AK     P+GLLQPLPIP  VW+D+AMDFITGLP 
Sbjct: 131  KAQFYWPKMQEQVQEYVQNCVICQQAKVSNTLPSGLLQPLPIPQQVWKDIAMDFITGLPI 190

Query: 541  SHGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTS 720
             +GF+VIMVVVDRL+K+AHF+ ++  Y+SK+VAE FMS IVKLHG+P+SIVSDRD+VFTS
Sbjct: 191  VNGFSVIMVVVDRLTKYAHFLTLKAYYSSKTVAEAFMSNIVKLHGMPRSIVSDRDKVFTS 250

Query: 721  SFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYW 900
            +FWQHLFKL GTTLAMSSAYHPQ+DGQSE LNKCLEMYLRCFT++NPK W KAL W+E+W
Sbjct: 251  AFWQHLFKLQGTTLAMSSAYHPQTDGQSEILNKCLEMYLRCFTYENPKGWVKALPWSEFW 310

Query: 901  YNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXX 1080
            YN++FHTSLGMTPFKALYGRDPP L R  IT+ DP E+ + +  R++             
Sbjct: 311  YNTAFHTSLGMTPFKALYGRDPPTLTRTQITSQDPMELRELLANRDSLLTKLKTNLFRAQ 370

Query: 1081 XRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVA 1260
              MK QAD++R++V  ++GD VLVKLQPYRQ S  LRKN KL M+YFGPF+IIA+IG+VA
Sbjct: 371  QAMKAQADRKRQEVIMQVGDHVLVKLQPYRQQSAVLRKNKKLSMKYFGPFKIIAKIGSVA 430

Query: 1261 YKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQ 1440
            YKL LP +A+IH+VFHVSQLK F+G+  EPYLPLPLT                       
Sbjct: 431  YKLDLPPSARIHSVFHVSQLKLFKGNVDEPYLPLPLT----------------------- 467

Query: 1441 TLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVMNESA-------- 1596
                      E+  +D ATWE++E+ K +YP+FNLEDK+  KGEG VM +          
Sbjct: 468  ----------ENGVQDDATWEDIEDIKSNYPSFNLEDKVDVKGEGNVMGDRTRAQHEPSY 517

Query: 1597 ------AEVENEMPR-----RSMRARKASVKLND 1665
                  A + N+ P+     R  R R+ + KL D
Sbjct: 518  DVSCDDAGMNNQFPKNVDLGRGQRVRRPTWKLRD 551


>dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum]
          Length = 1500

 Score =  660 bits (1704), Expect = 0.0
 Identities = 320/581 (55%), Positives = 411/581 (70%), Gaps = 26/581 (4%)
 Frame = +1

Query: 4    WLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLLE 183
            WLHKF+GYD TIEYKPG +N+ ADALSR++ ++WSEP+   + ++K A+  D   K +++
Sbjct: 921  WLHKFLGYDFTIEYKPGKDNMAADALSRIMTLSWSEPKCHFIEEVKIALQNDTQMKDIMQ 980

Query: 184  ACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXXX 363
                G      Y++++ LLYW+ RLV+P+ S L+  +L+E+H+SP+GGH+          
Sbjct: 981  KSLLG-KAPVQYSTKEGLLYWKQRLVIPRDSNLLHKVLFEFHTSPIGGHSSITRTLARIK 1039

Query: 364  XQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPNS 543
             QF+WP +K+D+  YVQ+C +CQ+AKT    PAGLLQPLPIP+ VWED+AMDFITGLP+S
Sbjct: 1040 SQFYWPDMKKDIIEYVQKCIVCQQAKTTNTSPAGLLQPLPIPSQVWEDIAMDFITGLPSS 1099

Query: 544  HGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTSS 723
             G+  IMVVVDRL+K+A FIPM+ DYTSKSVAE FM  IVKLHG+PKSIVSDRDRVFTS+
Sbjct: 1100 FGYNTIMVVVDRLTKYADFIPMKSDYTSKSVAETFMHNIVKLHGMPKSIVSDRDRVFTST 1159

Query: 724  FWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYWY 903
            FWQHLFKL GT+LAMSSAYHPQSDGQ+E LNK LE++LRCFTF+NPK+W+K L+W+EYWY
Sbjct: 1160 FWQHLFKLQGTSLAMSSAYHPQSDGQTEVLNKALELFLRCFTFNNPKSWFKVLSWSEYWY 1219

Query: 904  NSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXXX 1083
            N+SF TS+GMTPF+ALYGR PP L + +   +DPP +  ++ ER+               
Sbjct: 1220 NTSFQTSIGMTPFQALYGRLPPYLTKYVPQENDPPTLQAELIERDNLLQQLKTNLERAQQ 1279

Query: 1084 RMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVAY 1263
             MK QADK RRD+ +++GD VL+KLQPYRQHS+ALRKN KLGMRYFGPFEIIA +G +AY
Sbjct: 1280 YMKKQADKHRRDISYQVGDFVLIKLQPYRQHSVALRKNQKLGMRYFGPFEIIACVGTIAY 1339

Query: 1264 KLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQT 1443
            KL LP  AKIH VFHVSQLK F+G + + Y+PLPLT +E GPI+ P AVLQ RTIQ G  
Sbjct: 1340 KLNLPENAKIHPVFHVSQLKPFKGTTQDQYMPLPLTMSETGPIIQPIAVLQARTIQRGMQ 1399

Query: 1444 LVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIVM------------- 1584
             V QV I W+   E  A+WE++++ K  +PT NLEDK+  +G  IVM             
Sbjct: 1400 KVHQVQIQWDQTAE--ASWEDLDDLKNKFPTLNLEDKVVVEGGSIVMKPNINNILEAKVP 1457

Query: 1585 ------------NESAAEVENEM-PRRSMRARKASVKLNDY 1668
                          S AE++ ++ PRR  R RK      DY
Sbjct: 1458 ANSIGDPQNMYDGNSVAEIKEDLGPRRGKRVRKTHGIWKDY 1498


>gb|KYP63732.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1084

 Score =  647 bits (1668), Expect = 0.0
 Identities = 309/527 (58%), Positives = 387/527 (73%)
 Frame = +1

Query: 1    AWLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLL 180
            AWLHKF+GYD +IEYKPG +N+ ADALSR   MA S+P   +++ LK AI  D    ++L
Sbjct: 499  AWLHKFLGYDFSIEYKPGKDNVGADALSRSFFMAMSQPTWDIISLLKAAIASDSKYNEIL 558

Query: 181  EACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXX 360
            +AC +G      Y+++D LLYW+ RLV+P    LIK +L+E+H+SP+GGHAG        
Sbjct: 559  QACNQGNPPHQNYSAQDQLLYWKHRLVIPPKHPLIKQVLHEFHNSPIGGHAGTARTLARI 618

Query: 361  XXQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPN 540
              QF W  + +D+K +VQQC +CQ+AKT T  PAGLLQPL IP  +WED+AMDFI GLP 
Sbjct: 619  SAQFFWKGMSRDIKNHVQQCLLCQQAKTSTTLPAGLLQPLSIPVQIWEDLAMDFIVGLPP 678

Query: 541  SHGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTS 720
            SHGFTVI+VVVDRLSK+AHF  ++ DYTS  VAE+FM  IVKLHG+PKSIVSDRDRVFTS
Sbjct: 679  SHGFTVILVVVDRLSKYAHFATLKTDYTSTQVAEVFMKNIVKLHGLPKSIVSDRDRVFTS 738

Query: 721  SFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYW 900
             FWQ LFKL GTTLAMSSAYHPQSDGQSE+LNKCLEMYLRCFT DNP+ W K L WAE+W
Sbjct: 739  KFWQQLFKLSGTTLAMSSAYHPQSDGQSEALNKCLEMYLRCFTVDNPREWSKLLPWAEFW 798

Query: 901  YNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXX 1080
            YNSSF +S+ MTPF+A+YGRDPP +V+  + ++DPP + + + +R+ T            
Sbjct: 799  YNSSFQSSINMTPFRAVYGRDPPTIVKYQLDSTDPPSLQELLLQRDVTLNQLKTHLVKAQ 858

Query: 1081 XRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVA 1260
             RMK  ADK+R  ++F+IG+LVLVKLQPYRQHS+ALRK+ KLG+RYFGPF II +IG+VA
Sbjct: 859  QRMKKFADKKRIPLEFDIGELVLVKLQPYRQHSVALRKHQKLGLRYFGPFPIIKKIGSVA 918

Query: 1261 YKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQ 1440
            YKL LP++AKIH+VFHVS LKK +G+   PYLPLPL + E GP++ P  +L  RTI  G 
Sbjct: 919  YKLLLPASAKIHSVFHVSLLKKCKGNHQTPYLPLPLLTNEFGPVVQPSRILDSRTIIRGD 978

Query: 1441 TLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIV 1581
              + QVLI W+  +   ATWE+     + YP F LEDK+ F G G V
Sbjct: 979  QHIAQVLIQWDGLDATQATWEDATVIHKDYPNFYLEDKVDFYGGGNV 1025


>gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1525

 Score =  655 bits (1691), Expect = 0.0
 Identities = 317/571 (55%), Positives = 412/571 (72%), Gaps = 15/571 (2%)
 Frame = +1

Query: 1    AWLHKFIGYDCTIEYKPGTENIPADALSRVLMMAWSEPQSLLVAKLKQAILQDVTTKKLL 180
            AWLHKF+GYD T+EYKPG EN+ ADALSRV+++AWS+P SL + + KQA  Q+   +++ 
Sbjct: 955  AWLHKFLGYDFTVEYKPGKENLAADALSRVMLLAWSQPTSLFLQEFKQACNQEEEIQQIK 1014

Query: 181  EACAEGANTDTGYTSRDDLLYWQGRLVVPQASALIKDILYEYHSSPVGGHAGXXXXXXXX 360
                E    +   T +DDL++W+GRL+VPQ S +   IL E+H+S +GGHAG        
Sbjct: 1015 LKWNENRGYNPHVTIKDDLVFWKGRLMVPQVSHIRDQILQEFHNSAIGGHAGIARTMARI 1074

Query: 361  XXQFHWPSLKQDVKTYVQQCAICQRAKTMTAPPAGLLQPLPIPNLVWEDVAMDFITGLPN 540
              QF+W ++K D+  +VQQC +CQ+AK  T  PAGLLQPLPIP+ VWEDVAMDFITGLPN
Sbjct: 1075 TAQFYWKNMKNDITHFVQQCVVCQQAKHETRNPAGLLQPLPIPDQVWEDVAMDFITGLPN 1134

Query: 541  SHGFTVIMVVVDRLSKFAHFIPMRQDYTSKSVAELFMSQIVKLHGVPKSIVSDRDRVFTS 720
            S+GFTVIMVV+DRLSK++HF P++ DY+SK VAE+FM  IVKLHGVPKSIVSDRD+VF S
Sbjct: 1135 SYGFTVIMVVIDRLSKYSHFSPLKTDYSSKIVAEVFMRNIVKLHGVPKSIVSDRDKVFMS 1194

Query: 721  SFWQHLFKLLGTTLAMSSAYHPQSDGQSESLNKCLEMYLRCFTFDNPKTWYKALAWAEYW 900
             FW+ LF+L GTTLAM+SAYHPQSDGQSE+LNKCLEMYLRC TF NPK+W+KAL  AE W
Sbjct: 1195 KFWKELFQLQGTTLAMTSAYHPQSDGQSEALNKCLEMYLRCLTFQNPKSWFKALDLAELW 1254

Query: 901  YNSSFHTSLGMTPFKALYGRDPPQLVRPMITNSDPPEVIKQITEREATXXXXXXXXXXXX 1080
            YN++FHTSLGMTPFK LYGRDPP ++R   + S      +Q+ +R+A             
Sbjct: 1255 YNTAFHTSLGMTPFKLLYGRDPPTIIRQESSTSGATMAQQQLQDRDAILTQAKLNLERAQ 1314

Query: 1081 XRMKCQADKRRRDVQFEIGDLVLVKLQPYRQHSLALRKNNKLGMRYFGPFEIIARIGAVA 1260
              MK QADKRR ++Q E+GD VLVKLQPYRQ S+ALRKN KLGMRYFGPF ++A+IG VA
Sbjct: 1315 QYMKTQADKRRHELQLEVGDNVLVKLQPYRQQSVALRKNQKLGMRYFGPFTVMAKIGTVA 1374

Query: 1261 YKLKLPSTAKIHAVFHVSQLKKFRGDSSEPYLPLPLTSTELGPILLPRAVLQKRTIQNGQ 1440
            YKL+LP  AKIH VFH+SQLK F+G+ S+PY PLP  + ELGP+L P  +LQ R +  G+
Sbjct: 1375 YKLQLPPEAKIHPVFHISQLKLFKGNCSQPYFPLPALTNELGPVLQPGDILQARQLVQGK 1434

Query: 1441 TLVEQVLIMWEDNNEDTATWENVEEFKQHYPTFNLEDKIQFKGEGIV---MNESAAEV-- 1605
              ++Q+L+ WE+   + ATWE +E+F+Q +P +NLEDK++  G   V   + ES  ++  
Sbjct: 1435 KTIQQILVRWENLTAEEATWEELEDFRQQFPNYNLEDKVEVIGGSDVREPIEESQLDIIE 1494

Query: 1606 -------ENEMPRRSM---RARKASVKLNDY 1668
                     E+P  ++   R R  +++L D+
Sbjct: 1495 ESTVNTRGQEIPIENVHEKRTRTLNIRLKDF 1525


Top