BLASTX nr result

ID: Rehmannia30_contig00019511 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia30_contig00019511
         (2413 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX92470.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   770   0.0  
gb|PNX73902.1| transposon Ty3 gag-pol polyprotein, partial [Trif...   761   0.0  
gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   771   0.0  
gb|PNX94328.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   754   0.0  
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   754   0.0  
gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium prat...   732   0.0  
dbj|GAU30089.1| hypothetical protein TSUD_392450 [Trifolium subt...   759   0.0  
gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   729   0.0  
gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   746   0.0  
gb|AAO23078.1| polyprotein [Glycine max]                              747   0.0  
gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   736   0.0  
gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   734   0.0  
gb|PNX91810.1| transposon Ty3 gag-pol polyprotein [Trifolium pra...   720   0.0  
gb|OMO81561.1| reverse transcriptase [Corchorus capsularis]           742   0.0  
gb|PNY17392.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   740   0.0  
gb|KYP39589.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   738   0.0  
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...   739   0.0  
gb|OMO55704.1| reverse transcriptase [Corchorus capsularis]           751   0.0  
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   737   0.0  
gb|PNY00428.1| hypothetical protein L195_g023708, partial [Trifo...   723   0.0  

>gb|PNX92470.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 856

 Score =  770 bits (1989), Expect = 0.0
 Identities = 394/795 (49%), Positives = 524/795 (65%), Gaps = 2/795 (0%)
 Frame = +2

Query: 23   FGNFKHHHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGF 202
            F       F++K SKCSF Q+ + YLGH+V  G V  DP K +A+  WP P +VK LRGF
Sbjct: 74   FTILSEQQFHLKASKCSFGQSKVSYLGHIVEGGTVAPDPLKIQAVDDWPTPKSVKGLRGF 133

Query: 203  LGLIGYYRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTM 382
            LGL G+YR+FVR YASIA PLT+LL  DAF+W+ DA   F+ LK A+V APVL LPDF++
Sbjct: 134  LGLSGFYRKFVRNYASIAHPLTELLKKDAFKWSSDAQTTFDALKSALVNAPVLALPDFSV 193

Query: 383  EFVVETDASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLL 562
             FVV+TDAS+  +GAVL Q  HPI YFSK   P+M  ASTYL+ELHAI  AVK+WRQYLL
Sbjct: 194  PFVVQTDASSQAMGAVLLQGDHPIAYFSKLFCPRMTRASTYLRELHAITSAVKRWRQYLL 253

Query: 563  GRSFIIRTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGD 742
            G+ FII+TDH+S+KEL  QV+ TPEQQ Y+ KL+GY + I+Y+PGK N+VADALSR    
Sbjct: 254  GQFFIIQTDHRSLKELLTQVIQTPEQQFYLSKLLGYHYDIQYKPGKSNTVADALSR---- 309

Query: 743  ENALEPLSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRI-ATDSSLQNFSVKEGI 919
              A E + +  H  + S P + FLD ++ + S +P   EL  +I +T      F++++ +
Sbjct: 310  --AFEGVDATLH--ILSLPQFLFLDELRQDLSNDPAYIELRAKIESTPHQFPEFTLRDAL 365

Query: 920  LYFQHRYVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKS 1099
            +   ++  +   S  K  +M EFH T + GH GI +TL R++  FYW  MR DV++F+  
Sbjct: 366  ILRHNKIWVSPTSRFKRLLMKEFHETPVGGHGGIVKTLKRLSENFYWANMRTDVKEFINC 425

Query: 1100 CQVCQQIKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAH 1279
            C VCQQ KYST  P GLLQPLPIPS +W+++++DF+TGLP S G+ VILVVVD+ +K+ H
Sbjct: 426  CVVCQQTKYSTAKPGGLLQPLPIPSNIWEDISLDFVTGLPLSHGYTVILVVVDRFSKAVH 485

Query: 1280 FGALPAQFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSA 1459
             GALP QF+A + AELF ++V K+HG P SI+SDRDPIF+S FW  LF+ +GT LR SS+
Sbjct: 486  LGALPTQFTAYRVAELFVNLVCKLHGLPKSIVSDRDPIFISRFWADLFRFSGTLLRMSSS 545

Query: 1460 FHPQTDGQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFG 1639
            +HPQTDGQTEV NR++EQYLR F   +P +W   L WAEF YNT++HSA  +TP+Q ++G
Sbjct: 546  YHPQTDGQTEVTNRTIEQYLRSFVHARPSQWFRFLPWAEFHYNTSFHSAAGMTPYQVIYG 605

Query: 1640 RPPPSIPPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIG 1819
            + PP+IP Y   S S+ A D++L  R+E+L  L+ NL +AQ RM   A+ HRRE  + +G
Sbjct: 606  KIPPTIPSYIEGSSSVNACDDMLNSREEILALLRRNLAKAQARMKANADSHRREVSFEVG 665

Query: 1820 DKV*VRLQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSV 1999
              V V+LQPYRQ +V      KL KR+YGPF VLE+ G V YKL+L   S+IH VFH SV
Sbjct: 666  SWVYVKLQPYRQISVTGEKYSKLSKRFYGPFVVLERIGEVAYKLELPSHSKIHNVFHCSV 725

Query: 2000 LKPFVGNEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESST 2179
            LK   G + + + PLP +  +NHP+ +P+AI     V   G    QVLVQW+    + ++
Sbjct: 726  LKQHQGPDPSEIDPLPFDSVDNHPLVEPLAIINSKTVTSAGVSKRQVLVQWTGLSPDDTS 785

Query: 2180 WGDLDQLQQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEIISQAQEEEPMR-K 2356
            W D D L   +   +LEDKV L G      NP    L   E +     ++  E  P+  K
Sbjct: 786  WEDWDNLSSVY---NLEDKVGLDGEGIVTYNP----LITREKSTSTESNKIDEAGPVNTK 838

Query: 2357 SNRLKITPKWHKDYV 2401
              R+   P  HKDYV
Sbjct: 839  PKRITKPPVKHKDYV 853


>gb|PNX73902.1| transposon Ty3 gag-pol polyprotein, partial [Trifolium pratense]
          Length = 893

 Score =  761 bits (1965), Expect = 0.0
 Identities = 387/793 (48%), Positives = 524/793 (66%), Gaps = 1/793 (0%)
 Frame = +2

Query: 23   FGNFKHHHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGF 202
            F   + HHF++K +KCSF Q+++ YLGH+V AG V  DP+K + +  WP P +VK LRGF
Sbjct: 116  FSILETHHFHLKTTKCSFCQSTIAYLGHIVTAGTVAPDPQKIQGVLDWPTPKSVKNLRGF 175

Query: 203  LGLIGYYRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTM 382
            LGL G+YRRFVR YA++A PLT LL  +AF WT  A  AF++LK A+  APVL LP+FT+
Sbjct: 176  LGLSGFYRRFVRNYATMAHPLTSLLKKNAFDWTSAAQLAFDQLKIALSTAPVLSLPNFTI 235

Query: 383  EFVVETDASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLL 562
             FVV+TDAS   +GAVL    HPI YFSK   P++  ASTY++ELHAI  AVK+WRQYLL
Sbjct: 236  PFVVQTDASGQAMGAVLLHGDHPIAYFSKIFCPRLSKASTYIRELHAITSAVKRWRQYLL 295

Query: 563  GRSFIIRTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGD 742
            G  F+I++DH+S+KEL  QV+ TPEQQ Y+ KL+GY + I+Y+PGK N VADALSR    
Sbjct: 296  GHFFVIQSDHRSLKELLTQVIQTPEQQFYLSKLLGYHYDIQYKPGKTNVVADALSR---- 351

Query: 743  ENALEPLSSGHHFQLESRPIWDFLDAIKSENSVNPELQEL-HQRIATDSSLQNFSVKEGI 919
                EP S+     L S P + F++ ++ E   N   QEL H  I+  +   +F +  G+
Sbjct: 352  --CCEPSSA--ELNLLSTPPFLFINELREELQQNSSYQELCHNVISDPTKFPDFVISNGL 407

Query: 920  LYFQHRYVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKS 1099
            L    R  I + S  K+ ++ EFH T + GH G+ +TL R++A FYW  MR++++DFV  
Sbjct: 408  LLMNGRIWIPANSKFKVLLLKEFHETPVGGHAGVIKTLKRLSANFYWQHMRKEIKDFVAR 467

Query: 1100 CQVCQQIKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAH 1279
            C +CQQ KYST  P+GLLQPLPIPS VW+++++DFITGLP S G++V+LVVVD+ +K  H
Sbjct: 468  CFICQQTKYSTSKPSGLLQPLPIPSNVWEDISLDFITGLPLSGGYSVLLVVVDRFSKYTH 527

Query: 1280 FGALPAQFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSA 1459
             GALP+ F+A + AELF +MV K+HG P SI+SDRDPIF+S FW  LFK +GT LR SS+
Sbjct: 528  LGALPSHFTAYKVAELFVNMVCKLHGMPRSIVSDRDPIFISKFWSDLFKFSGTLLRMSSS 587

Query: 1460 FHPQTDGQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFG 1639
            +HPQTDGQTEV NR++EQYLR F  ++P  W   L WAE+ YNT+YH+A  LTP+Q ++G
Sbjct: 588  YHPQTDGQTEVTNRTIEQYLRAFVHQRPMLWHRFLPWAEYHYNTSYHTAAGLTPYQVVYG 647

Query: 1640 RPPPSIPPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIG 1819
            + PP+I  Y   +  + A D+LL ER+E+L  L+ NL +AQ RM   A+ HRR+  + + 
Sbjct: 648  KEPPTIATYVLGTSKVAATDDLLNEREEVLAMLRKNLTKAQERMKTLADNHRRDVSFEVN 707

Query: 1820 DKV*VRLQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSV 1999
              V V+LQPYRQ++V+     KL KRY+GPF VLE+ G V YKL+L   S+IH VFH S+
Sbjct: 708  SYVHVKLQPYRQSSVSGVKYNKLQKRYFGPFRVLERIGQVAYKLELPSHSKIHNVFHCSL 767

Query: 2000 LKPFVGNEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESST 2179
            LKP++G        LP +  +NHP   P+AI     +   G+++ QVLVQW+  P E +T
Sbjct: 768  LKPYLGPIPPDAPILPNDSVDNHPTVTPLAILNSRIISVNGQQMQQVLVQWNGLPPEDTT 827

Query: 2180 WGDLDQLQQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEIISQAQEEEPMRKS 2359
            W +   LQ  +   +LEDKV   G   DM +     +N+   AQ    S +    P+R +
Sbjct: 828  WENWQDLQISY---NLEDKVEFDGIGDDMDSNAAAAINNGPEAQ----SNSATTRPVR-N 879

Query: 2360 NRLKITPKWHKDY 2398
             RL    K H  Y
Sbjct: 880  KRLPAKFKDHHLY 892


>gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
 gb|PNY07311.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1494

 Score =  771 bits (1991), Expect = 0.0
 Identities = 399/798 (50%), Positives = 528/798 (66%), Gaps = 1/798 (0%)
 Frame = +2

Query: 23   FGNFKHHHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGF 202
            F     H F++K+SKCSF Q  + YLGHVV AG V  DP K +A+  WP P TVK LRGF
Sbjct: 710  FSVLSQHQFHLKISKCSFCQPQIAYLGHVVAAGTVAPDPAKIQAIIDWPPPKTVKGLRGF 769

Query: 203  LGLIGYYRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTM 382
            LGL G+YR+FVRGYA++A PLT LL  DAF+W+++   A   LKQA+  APVL LP F  
Sbjct: 770  LGLSGFYRKFVRGYAALALPLTQLLRKDAFKWSQEEQKALETLKQALATAPVLGLPKFDE 829

Query: 383  EFVVETDASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLL 562
             F+V+TDAS   +GAVL Q  HP+ YFSK   P+M  ASTYL+ELHA+  AVK+WRQYLL
Sbjct: 830  PFIVQTDASGKAMGAVLLQGEHPLAYFSKVFCPRMMKASTYLRELHAVTSAVKRWRQYLL 889

Query: 563  GRSFIIRTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGD 742
            G  FII+TDHKS+KEL  QV+ TPEQQ Y+ KL+GY + I+Y+PG  N+VADALSR    
Sbjct: 890  GHYFIIQTDHKSLKELLTQVIQTPEQQFYLSKLLGYHYDIQYKPGSTNTVADALSR---- 945

Query: 743  ENALEPLSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATD-SSLQNFSVKEGI 919
              +L+P  S       S P + F++ +K +   +   QEL   I TD +    ++VK G+
Sbjct: 946  --SLDP--SDASIMALSVPQFVFINELKQDLEADSTFQELRACIETDPAGNPGYAVKNGL 1001

Query: 920  LYFQHRYVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKS 1099
            + +Q R  I   S  K  ++ EFH T + GH GI +TL R+AA FYW  MR++V+ F+ S
Sbjct: 1002 ILYQGRIWISPTSHYKSLLLKEFHETPIGGHAGIIKTLKRLAANFYWSDMRKEVKQFIAS 1061

Query: 1100 CQVCQQIKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAH 1279
            C +CQQ KYST  P GLLQPLPIPS VW++++MDF+TGLP+S G++VILVVVD+ +K+ H
Sbjct: 1062 CVICQQTKYSTAKPGGLLQPLPIPSNVWEDISMDFVTGLPQSNGYSVILVVVDRFSKAVH 1121

Query: 1280 FGALPAQFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSA 1459
             GALPAQF+A + AELF +MV KIHG P SI+SDRDPIF+S FW  LFK +GT LR SS+
Sbjct: 1122 LGALPAQFTAYKVAELFINMVCKIHGLPRSIVSDRDPIFISKFWADLFKFSGTLLRMSSS 1181

Query: 1460 FHPQTDGQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFG 1639
            +HPQTDGQTEV NR++EQYLR F   +P  W   L WAE+ YNT Y+++  L+PFQ +FG
Sbjct: 1182 YHPQTDGQTEVTNRTIEQYLRAFVHAKPSIWFRFLPWAEYHYNTAYNTSSGLSPFQVMFG 1241

Query: 1640 RPPPSIPPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIG 1819
            +PPPSIP Y   S S+ A D LL++R  +L+ L+ NL++AQ  M   A+ HRRE  Y  G
Sbjct: 1242 KPPPSIPSYAIGSSSVDACDLLLSDRAAILELLRKNLLKAQQVMKHNADAHRREVNYDAG 1301

Query: 1820 DKV*VRLQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSV 1999
              V V+LQPYRQT++      KL KR+YGPF ++E+ G V YKL+L   S+IH VFH SV
Sbjct: 1302 TWVYVKLQPYRQTSLTGTKYNKLSKRFYGPFRIIERVGKVAYKLELPSYSKIHNVFHCSV 1361

Query: 2000 LKPFVGNEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESST 2179
            LKP +G+  T V  LP +  +NHP+  P+AI A  +    GK   QVLVQW     + ++
Sbjct: 1362 LKPHIGSIPTVVDDLPHDAVDNHPLVSPLAILATKEEFIDGKNQVQVLVQWEGLSPDETS 1421

Query: 2180 WGDLDQLQQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEIISQAQEEEPMRKS 2359
            W   ++LQ  +   +LEDKV   G    M + T +G   PE+    +  +  +  P R  
Sbjct: 1422 WESWNKLQAVY---NLEDKVGFDGEGIVMNSSTTIG---PEATIPIVGPETVQGRPKR-- 1473

Query: 2360 NRLKITPKWHKDYVMQQK 2413
              +K+  K + DYV+ ++
Sbjct: 1474 -NIKLPIKLN-DYVLSKQ 1489


>gb|PNX94328.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1474

 Score =  754 bits (1948), Expect = 0.0
 Identities = 386/761 (50%), Positives = 509/761 (66%), Gaps = 1/761 (0%)
 Frame = +2

Query: 41   HHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGY 220
            + F +K SKCSFAQ S++YLGH+V A  V  DP K  AM +WP P+ VKQLRGFLGL G+
Sbjct: 725  NEFCLKASKCSFAQTSIDYLGHIVSAEGVGPDPSKISAMVNWPPPTNVKQLRGFLGLTGF 784

Query: 221  YRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVET 400
            YR+FVR YAS+AAPLT LL  DAF WTE A  AF  LK+AM E PVL LP+F  +F++ET
Sbjct: 785  YRKFVRNYASLAAPLTSLLKRDAFEWTERAQQAFEGLKRAMTEVPVLGLPNFDEKFILET 844

Query: 401  DASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFII 580
            DAS +G+GAVL Q GHPI YFSK+  P+M  ASTY++EL AI  AVKKWR YLLG +F I
Sbjct: 845  DASGVGMGAVLMQSGHPICYFSKQFCPRMLQASTYVRELCAITTAVKKWRTYLLGNTFAI 904

Query: 581  RTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEP 760
             TD +S++EL  QV+ TPEQQ Y+ KL+GYS+ I Y+PG  N VADALSRV         
Sbjct: 905  YTDQRSLRELMTQVIQTPEQQFYLAKLLGYSYEIIYKPGPQNRVADALSRV--------- 955

Query: 761  LSSGHHFQLESRPIWDFLDAIKSENSVNPELQE-LHQRIATDSSLQNFSVKEGILYFQHR 937
                 H  + + P  DFL   K + +V+ E Q+ L Q  A  +    F +  G+L+F+ +
Sbjct: 956  -----HCLVLTIPQMDFLTTFKQQLAVDTEFQQFLAQVQAKPAEYSEFEIMNGLLFFKGK 1010

Query: 938  YVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQ 1117
              I + S LKL ++ EFHA+ + GH GI RT  R+    +W GMR DV  FVKSC +CQQ
Sbjct: 1011 LFIPATSPLKLTLLEEFHASTIGGHSGIHRTYGRLQENVFWYGMRNDVTHFVKSCSICQQ 1070

Query: 1118 IKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPA 1297
             K +  +P GLLQPLPIP  VW+++++DFI GLP  +   VI VVVD+L+K+AHFG+LP 
Sbjct: 1071 TKPANHSPYGLLQPLPIPEKVWEDISLDFIVGLPSVQSHTVIFVVVDRLSKAAHFGSLPT 1130

Query: 1298 QFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTD 1477
             F+A + A+LFA MV K+HG P SI+SDRDPIFLS FWQ+LFKL+GT LR S+A+HPQ+D
Sbjct: 1131 HFTAIKVADLFAKMVCKLHGMPRSIVSDRDPIFLSQFWQELFKLSGTKLRMSTAYHPQSD 1190

Query: 1478 GQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSI 1657
            GQTE+VN+ L+QYLRCF  ++P +W   L WAE+ YNT  H++  L+PFQ ++GRPPP++
Sbjct: 1191 GQTEIVNKVLQQYLRCFVHDKPNQWEQFLHWAEWHYNTAIHTSTGLSPFQIVYGRPPPAL 1250

Query: 1658 PPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VR 1837
              Y   S SIQA+D  L +RD +LQ+LK  L +AQ  M ++A+ HR   ++ +GD V V+
Sbjct: 1251 ADYIPGSSSIQAIDATLIDRDMMLQNLKNKLQKAQAIMKEQADQHRIPHKFKVGDLVFVK 1310

Query: 1838 LQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVG 2017
            L+PYRQ +V      KL KR+YGPF +++  G V ++L+L P SRIHPVFHVS LKP   
Sbjct: 1311 LRPYRQNSVMGRRIHKLSKRFYGPFKLIKAIGDVAFELELPPTSRIHPVFHVSQLKPCF- 1369

Query: 2018 NEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQ 2197
            +E +  L LPLE   N PV KP+A+  + K    G E  +VL+QW     E +TW    +
Sbjct: 1370 DETSEPLDLPLEALGNQPVIKPLAV-LDWKQNESG-EFVEVLIQWEGLFPEDATWEKYQE 1427

Query: 2198 LQQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEI 2320
            +Q  +P   LEDKV    G  D+ N  +  +   +  Q +I
Sbjct: 1428 IQSTYPTFDLEDKVNFD-GTWDVTNQVETDIEDMDMGQDDI 1467


>gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1502

 Score =  754 bits (1947), Expect = 0.0
 Identities = 380/755 (50%), Positives = 501/755 (66%), Gaps = 1/755 (0%)
 Frame = +2

Query: 50   YIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYYRR 229
            Y+KLSKCSF    +EYLGHVV    V  D  K +A+ +WP PS VKQLRGFLGL GYYRR
Sbjct: 760  YVKLSKCSFGVLEIEYLGHVVTGQGVSMDKDKVQAVVNWPTPSNVKQLRGFLGLTGYYRR 819

Query: 230  FVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETDAS 409
            F++ YA IA+PLTDLL  +A++WTE A  AF++LK A+  APVL LP+F   F++ETDAS
Sbjct: 820  FIKSYAKIASPLTDLLKKEAYQWTEQAEVAFHQLKNAITTAPVLILPNFKEPFILETDAS 879

Query: 410  NIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIRTD 589
             IGIGAVL Q+GHPI YFSKKL P+ Q  S Y +E+ AI EA+ K+R YLLG  FIIRTD
Sbjct: 880  GIGIGAVLHQQGHPIAYFSKKLVPRNQKKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTD 939

Query: 590  HKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEPLSS 769
             KS++ L +Q + TPEQQ ++ K +GY F IEY+PGK N  ADALSRV            
Sbjct: 940  QKSLRNLMEQSLQTPEQQEWLHKFLGYDFTIEYKPGKENMAADALSRVM----------- 988

Query: 770  GHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQHRYVIG 949
                   S P W  LD ++     + +L+E+ Q  A   +   +S++EG+LY++ R VI 
Sbjct: 989  ---VMAWSEPKWQLLDQVRRALENDNQLREVMQNYAIGKAPVQYSMREGLLYWKQRLVIP 1045

Query: 950  SKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQIKYS 1129
                L  K++ EFH + + GH GI RT+ARI A FYW  M++D+ ++V  C VCQQ K +
Sbjct: 1046 KNDDLLQKLLFEFHTSPIGGHAGITRTIARIKAQFYWPDMKKDIAEYVHKCVVCQQAKTT 1105

Query: 1130 TQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQFSA 1309
              +P GLLQPLPIPS VW+++ MDFITGLP S G+  I+VVVD+LTKSAHF  +   +++
Sbjct: 1106 NTSPAGLLQPLPIPSQVWEDIAMDFITGLPLSYGYTTIMVVVDRLTKSAHFIPMKTDYTS 1165

Query: 1310 TQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDGQTE 1489
               AE F   +VK+HG P SI+SDRD +F S FWQ LFK+ GT+L  SSA+HPQTDGQTE
Sbjct: 1166 KTVAEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQHLFKMQGTSLAMSSAYHPQTDGQTE 1225

Query: 1490 VVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIPPYR 1669
            V+N++LE +LRCFT   PK W  ++ WAE+ YNT + +++ +TPF+AL+GR PP +  Y 
Sbjct: 1226 VLNKTLELFLRCFTFHNPKSWFKVMSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYE 1285

Query: 1670 RDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRLQPY 1849
                   A+ E L ERD++LQ LK NL +AQ  M Q+A+ HRRE  + +GD V V+LQPY
Sbjct: 1286 VQVDDPPALREELMERDQILQQLKTNLERAQQYMKQQADKHRREVSFKVGDLVLVKLQPY 1345

Query: 1850 RQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGNEDT 2029
            +Q +VA    QKL  RY+GPF V+   G V YKL L   ++IHPVFHVS LKPF G    
Sbjct: 1346 KQQSVALRKNQKLGMRYFGPFEVIACVGKVAYKLQLPENAKIHPVFHVSQLKPFHGTSQE 1405

Query: 2030 SVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQLQQH 2209
              LPLPL   +  P+ +P  I     ++R  K++ Q+ +QW     E ++W DLD+LQ  
Sbjct: 1406 QYLPLPLTMSDTGPIFQPATILQARTIVRGNKKVHQLQIQWDLNSPEEASWEDLDELQNK 1465

Query: 2210 FPNLHLEDKVVLQG-GESDMPNPTDLGLNHPESAQ 2311
            FPN++LEDKVV +G G    PN T++ L   ESA+
Sbjct: 1466 FPNINLEDKVVFKGEGIVMRPNNTNI-LEASESAK 1499


>gb|PNX93203.1| hypothetical protein L195_g016354 [Trifolium pratense]
          Length = 869

 Score =  732 bits (1889), Expect = 0.0
 Identities = 361/738 (48%), Positives = 490/738 (66%), Gaps = 2/738 (0%)
 Frame = +2

Query: 68   CSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYYRRFVRGYA 247
            CSF    ++YLGH V    V  D  K + + +WP P+ +KQLRGFLGL GYYRRF++GYA
Sbjct: 103  CSFGMEEVDYLGHTVSGTGVAMDKDKVQTVLAWPQPTNIKQLRGFLGLTGYYRRFIKGYA 162

Query: 248  SIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETDASNIGIGA 427
            SIA+PLTDLL  + F+WT +AT AF+KLK A+  APVL LP F + F +ETDAS  G+GA
Sbjct: 163  SIASPLTDLLKKEGFKWTAEATMAFDKLKVAITTAPVLALPQFFLPFTIETDASGTGVGA 222

Query: 428  VLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIRTDHKSIKE 607
            VL+Q GHPI +FSKK+ P+MQ  S Y +EL AI EA+ K+R YLLG  F+IRTD KS++ 
Sbjct: 223  VLSQLGHPIAFFSKKMVPRMQKQSAYTRELFAITEAIAKFRHYLLGHKFVIRTDQKSLRS 282

Query: 608  LFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEPLSSGHHFQL 787
            L  Q + TPEQQ +I K +GY F IEY+PGK N  ADALSRV                  
Sbjct: 283  LMDQSLQTPEQQAWIHKFIGYDFTIEYKPGKDNVAADALSRVC--------------LMA 328

Query: 788  ESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQN--FSVKEGILYFQHRYVIGSKSA 961
             S P   FLD ++     + +LQ L   I T   +    F  K G++Y+ ++ V+     
Sbjct: 329  WSEPEIVFLDEVRRCTENDSQLQGL---INTSDPVHGHQFVRKNGLVYWNNKIVLPDDKN 385

Query: 962  LKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQIKYSTQAP 1141
            LK K++ EFH++ + GH GI RT+ARIAA F+W  M++D++ FV++C +CQQ K+ T+AP
Sbjct: 386  LKTKLLLEFHSSPVGGHAGIARTIARIAAQFFWKNMKQDIKLFVQNCLICQQAKHDTRAP 445

Query: 1142 TGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQFSATQTA 1321
             GLLQPLPIP  VW+++ MDFITGLP S G+ VI+VV+D+LTK +HF  L   +++   A
Sbjct: 446  AGLLQPLPIPEQVWEDIAMDFITGLPPSNGYTVIMVVIDRLTKYSHFSPLKIDYNSKTVA 505

Query: 1322 ELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDGQTEVVNR 1501
            E+F   VVK+HG P SI+SDRD +F+S FW++LF+L GTTL  SSA+HPQTDGQ+E +N+
Sbjct: 506  EVFMKTVVKLHGLPKSIVSDRDKVFISKFWKELFQLQGTTLSMSSAYHPQTDGQSEALNK 565

Query: 1502 SLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIPPYRRDSV 1681
             LE YLRC T + PK W   L WAE+ YNT YH++L +TPFQAL+GR PP++  Y     
Sbjct: 566  CLEMYLRCLTFQNPKSWFKALDWAEYWYNTAYHNSLGMTPFQALYGRTPPTLVRYTHSPT 625

Query: 1682 SIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRLQPYRQTT 1861
                V + L ERD L+ +LK NL +AQ  M  +A+ HRR+ Q+ +G++V V+LQPYRQ +
Sbjct: 626  DTLDVQQQLMERDRLIATLKDNLKRAQQIMKNQADKHRRDAQFEVGEQVLVKLQPYRQNS 685

Query: 1862 VARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGNEDTSVLP 2041
            VA    QKL  RY+GPFT++EK G V YK+ L  E++IHPVFH+S LK F G      +P
Sbjct: 686  VALRKNQKLGMRYFGPFTIIEKVGKVAYKVQLPVEAKIHPVFHISQLKQFKGRATDPYIP 745

Query: 2042 LPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQLQQHFPNL 2221
            LPL   E  P+ +P+A+     ++R    I QVL++W       +TW D+D++ +++PN 
Sbjct: 746  LPLTTHELGPILQPIAVLQRRDIVRNEHAIQQVLIKWEGLNDTDATWEDVDEITENYPNF 805

Query: 2222 HLEDKVVLQGGESDMPNP 2275
            +LEDKV ++G    M  P
Sbjct: 806  NLEDKVEVKGKGIAMEEP 823


>dbj|GAU30089.1| hypothetical protein TSUD_392450 [Trifolium subterraneum]
          Length = 1853

 Score =  759 bits (1959), Expect = 0.0
 Identities = 371/731 (50%), Positives = 514/731 (70%), Gaps = 1/731 (0%)
 Frame = +2

Query: 47   FYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYYR 226
            F +KLSKCSFAQNS+ YLGH+V A  V  DP+K EAM +WP P+T+KQLRGFLGL G+YR
Sbjct: 753  FILKLSKCSFAQNSISYLGHIVSAEGVGPDPEKIEAMVNWPPPTTLKQLRGFLGLTGFYR 812

Query: 227  RFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETDA 406
            +FV+ YA IA PLT+LL  DAF+W+E A  AF+ LK AM +APVL LP+F  +F++ETDA
Sbjct: 813  KFVKDYAIIAQPLTELLKKDAFQWSEQAQVAFDHLKAAMTKAPVLALPNFEEDFMIETDA 872

Query: 407  SNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIRT 586
            S IG+GAVL Q  HPI YFS+K  PK+  AS Y++EL AI  AVKKWR YLLGR F++ T
Sbjct: 873  SGIGMGAVLIQNNHPICYFSQKFCPKLMNASAYVRELCAITSAVKKWRTYLLGRKFVVHT 932

Query: 587  DHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEPLS 766
            D +S++EL  QV+ TPEQQ Y+ KL+GYS+ I+Y+PG  N VADALSRV  +  ++  ++
Sbjct: 933  DQRSLRELMTQVIQTPEQQFYLAKLLGYSYEIKYKPGTQNRVADALSRVHENFPSVMAIT 992

Query: 767  SGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATD-SSLQNFSVKEGILYFQHRYV 943
              H         W FL+ +++E + + E+++L  ++A D +S  NF + +G+LYF+ R  
Sbjct: 993  IPH---------WKFLEKLQAEINQDSEVKDLMSKVANDPNSYPNFKIIKGLLYFKGRLY 1043

Query: 944  IGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQIK 1123
            I + S+ K  ++ E HAT + GH GI+RT  R+   FYWVG+++DV ++V SC  CQQ K
Sbjct: 1044 IPASSSFKNILLEEIHATPVGGHSGIQRTYGRMKENFYWVGLKQDVVNYVNSCHTCQQTK 1103

Query: 1124 YSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQF 1303
              T AP GLLQPLPIP  +W++++MDFI GLP  + + VI VVVD+ +K+AHFG LP  F
Sbjct: 1104 DPTHAPYGLLQPLPIPKHIWEDISMDFIVGLPSFQHYTVIFVVVDRFSKAAHFGMLPTGF 1163

Query: 1304 SATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDGQ 1483
            +A + AELF  MV K+HG P SI+SDRDPIFLS FWQ+LF L+GT LR S+A+HPQ+DGQ
Sbjct: 1164 TAVKVAELFTTMVCKLHGMPHSIVSDRDPIFLSKFWQELFHLSGTKLRMSTAYHPQSDGQ 1223

Query: 1484 TEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIPP 1663
            TEVVN++L+QYLRCF  +QPKRW   + WAE+ YNT  H++   +PFQ ++G+PPPS+P 
Sbjct: 1224 TEVVNKTLQQYLRCFVHDQPKRWGKYIHWAEWHYNTAIHTSTGYSPFQVVYGKPPPSLPQ 1283

Query: 1664 YRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRLQ 1843
            Y   +  ++A+D  L+ R+ +LQ+LK  L++AQ  M   A+ HR    +  GD V V+L+
Sbjct: 1284 YLAGTSQLEALDSELSNREIILQNLKKKLLKAQQNMKIYADQHRSPHTFKTGDLVYVKLR 1343

Query: 1844 PYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGNE 2023
            PYRQT++      KL KR+YGPF +L + G V ++L+L PES+IHPVFHVS LKP   + 
Sbjct: 1344 PYRQTSLPAQRTHKLSKRFYGPFKLLRQIGDVAFELELPPESKIHPVFHVSKLKP-CHDP 1402

Query: 2024 DTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQLQ 2203
             +  L LP +  +N P+ +P+A+    +    G+   QVL+QW+    E +TW   ++++
Sbjct: 1403 ASKPLVLPPDAVDNSPMVQPLAVLDWKE--EPGQTSPQVLIQWAGLYPEDATWESFEEIK 1460

Query: 2204 QHFPNLHLEDK 2236
            + +P+LHLEDK
Sbjct: 1461 KAYPHLHLEDK 1471


>gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 937

 Score =  729 bits (1883), Expect = 0.0
 Identities = 360/734 (49%), Positives = 491/734 (66%)
 Frame = +2

Query: 50   YIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYYRR 229
            ++KLSKCSF    +EYLGH+V    V  D  K +A+ +WP P  VKQLRGFLGL GYYRR
Sbjct: 217  FVKLSKCSFGVLEIEYLGHMVTGQGVSMDKDKVQAVLNWPTPKNVKQLRGFLGLTGYYRR 276

Query: 230  FVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETDAS 409
            F++ YA IA+PLTDLL  +A+ W + A  AF +LK A+  APVL LP+F   F++ETDAS
Sbjct: 277  FIKSYAKIASPLTDLLKKEAYAWNDLAELAFQQLKSAVTTAPVLALPNFHQPFILETDAS 336

Query: 410  NIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIRTD 589
             +GIGAVL Q+GHPI YFSKKL P+ Q  S Y +E+ AI EA+ K+R YLLG  FIIRTD
Sbjct: 337  GVGIGAVLHQEGHPIAYFSKKLVPRNQKKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTD 396

Query: 590  HKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEPLSS 769
             KS++ L +Q + TPEQQ ++ K +GY F IEY+PGK N  ADALSR       L  L+ 
Sbjct: 397  QKSLRSLMEQSLQTPEQQEWLHKFLGYDFTIEYKPGKENMAADALSR-------LMTLAW 449

Query: 770  GHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQHRYVIG 949
                   S P   F++ +K     + ++ E+  + A+  +   ++++EG+LY++ R VI 
Sbjct: 450  -------SEPQCQFIEQVKLALQNDNQMMEIMLKCASGKAPIQYTMREGLLYWKQRLVIP 502

Query: 950  SKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQIKYS 1129
             ++ L  K+++EFH + + GH GI RT+ARI + FYW  M++D+ ++V++C VCQQ K +
Sbjct: 503  KQNELLHKVLYEFHTSPIGGHAGITRTMARIKSQFYWPDMKKDILEYVQNCVVCQQAKTT 562

Query: 1130 TQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQFSA 1309
              +P GLLQPLPIPS VW+++ MDFITGLP S G+  I+VVVD+LTK AHF  +   +++
Sbjct: 563  NTSPAGLLQPLPIPSQVWEDIAMDFITGLPLSYGYTTIMVVVDRLTKYAHFIPMRTDYTS 622

Query: 1310 TQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDGQTE 1489
               AE F   +VK+HG P SI+SDRD +F S FWQQLFKL GT+L  SSA+HPQ+DGQTE
Sbjct: 623  RSVAEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTE 682

Query: 1490 VVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIPPYR 1669
            V+N+ LE +LRCF+   PK W  +L WAE+ YNT + +++ +TPF+AL+GR PP +  Y 
Sbjct: 683  VLNKGLELFLRCFSFHNPKSWYKVLSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYE 742

Query: 1670 RDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRLQPY 1849
                   A+ E L ERD++LQ LK NL +AQ  M ++A+ HR E    +GD V V+LQPY
Sbjct: 743  AQVTDSPALQEELMERDKILQQLKINLERAQQYMKKQADKHRSEVNLQVGDLVLVKLQPY 802

Query: 1850 RQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGNEDT 2029
            RQ +V+    QKL  RY+GPF ++ + G+V YKL L   ++IHPVFHVS LKPF G    
Sbjct: 803  RQQSVSLRKNQKLGMRYFGPFEIIARVGNVAYKLKLPDNAKIHPVFHVSQLKPFKGIAQD 862

Query: 2030 SVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQLQQH 2209
              LPLPL   E  P+ +P+A      ++R  +++ Q+LVQW   P+  +TW DLD LQ  
Sbjct: 863  QYLPLPLTMSETGPIIQPIAALEARTIMRGMQKVHQILVQWDQMPVTEATWEDLDVLQDK 922

Query: 2210 FPNLHLEDKVVLQG 2251
            FP L+LEDK+   G
Sbjct: 923  FPTLNLEDKIAFNG 936


>gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1478

 Score =  746 bits (1927), Expect = 0.0
 Identities = 384/780 (49%), Positives = 514/780 (65%), Gaps = 1/780 (0%)
 Frame = +2

Query: 17   CSFGNFKHHHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLR 196
            C F     H FY+K  KCSFAQ  + YLGHVV AG V  DP+K  A+  WP+P +VK LR
Sbjct: 703  CVFELLAKHQFYLKPQKCSFAQQQIGYLGHVVSAGTVAPDPEKINAIMDWPVPHSVKTLR 762

Query: 197  GFLGLIGYYRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDF 376
            GFLGL GYYR+FVR YAS+A+PLT LL  DAF+W++ A  +FN LKQA+  APVL LP+F
Sbjct: 763  GFLGLAGYYRKFVRNYASLASPLTSLLKKDAFQWSDSALDSFNALKQALTTAPVLALPNF 822

Query: 377  TMEFVVETDASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQY 556
            +  F+V+TDASN  +GAVL Q+GHP+ YFSK   P++  ASTY++ELHAI  AVK+WRQY
Sbjct: 823  SNTFIVQTDASNHAMGAVLLQQGHPLAYFSKMFCPRLAKASTYIRELHAITAAVKRWRQY 882

Query: 557  LLGRSFIIRTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVP 736
            LLG  FII+TDH+S+KEL  QV+ TPEQQ Y+ KL+GY++ I+YRPG  N  ADALSR  
Sbjct: 883  LLGNFFIIQTDHRSLKELLTQVIQTPEQQHYLSKLLGYNYEIQYRPGNTNLAADALSRAS 942

Query: 737  GDENALEPLSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSL-QNFSVKE 913
                    +   +   L + P   F++ ++ E S +    EL ++I  D SL   F +  
Sbjct: 943  --------VIVTNELYLLTVPNLLFMEELRKELSTDSVYLELCRKIQADPSLFPKFKLTN 994

Query: 914  GILYFQHRYVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFV 1093
            G L +  R  I   S  K  ++ E+H ++ AGH GI +T+ R++  FYW  M++DV++ +
Sbjct: 995  GWLSYNGRIWISPNSRFKTLLLQEYHDSLSAGHAGISKTMKRLSENFYWEHMKQDVQNHI 1054

Query: 1094 KSCQVCQQIKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKS 1273
            + C +CQQ KYST  P+GLLQPLPIP+ +W++++MDFITGLP SKG +VI VVVD+ +K 
Sbjct: 1055 RHCTICQQTKYSTARPSGLLQPLPIPNHIWEDLSMDFITGLPLSKGHSVIFVVVDRFSKG 1114

Query: 1274 AHFGALPAQFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHS 1453
             H GAL + F+A + AELF  +V K HG P SI+SDRDPIF+S FW+ LFK +GT LR S
Sbjct: 1115 IHLGALSSGFTAYKVAELFVSIVCKHHGIPRSIVSDRDPIFISKFWRDLFKFSGTFLRMS 1174

Query: 1454 SAFHPQTDGQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQAL 1633
            S++HPQTDGQTEV+NR++EQYLR F  ++P  WVTLL W E+ YNT+ HS  +L+PFQ +
Sbjct: 1175 SSYHPQTDGQTEVMNRTIEQYLRAFVHDKPYNWVTLLPWVEYHYNTSTHSGSELSPFQVM 1234

Query: 1634 FGRPPPSIPPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYS 1813
            FG+ PPSIP Y   S SI+A D +L  RDE+L  L+ NL +AQ RM   A+ HR+E  + 
Sbjct: 1235 FGKSPPSIPSYIAGSSSIEACDSVLQSRDEILTLLRKNLNKAQVRMKANADKHRKEVNFD 1294

Query: 1814 IGDKV*VRLQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHV 1993
            IG  V V+LQPYRQ +++R    KL KRYYGP+ +  K G+V Y+L+L P ++IH VFHV
Sbjct: 1295 IGAWVYVKLQPYRQISLSRTKYHKLAKRYYGPYLISAKIGTVAYQLELPPHAKIHNVFHV 1354

Query: 1994 SVLKPFVGNEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLES 2173
            S+LK + G     V  LP    +NHP+  P+AI      +  G      LVQW     + 
Sbjct: 1355 SLLKLYEGPAPQQVDQLPAFSVDNHPIVSPLAILNFRTQMVDGVPTRFALVQWDGLLPDD 1414

Query: 2174 STWGDLDQLQQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEIISQAQEEEPMR 2353
            ++W   ++L+  +    LEDKV L GG   M + + LG    E+A  + I    EE P R
Sbjct: 1415 TSWEPWNELKHTY---DLEDKVDLDGGSIVMDSTSTLG---QETAHKDKI----EERPKR 1464


>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score =  747 bits (1929), Expect = 0.0
 Identities = 388/792 (48%), Positives = 517/792 (65%), Gaps = 8/792 (1%)
 Frame = +2

Query: 35   KHHHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLI 214
            K H  + +LSKCSF    ++YLGH V    V  +  K +A+  WP P+ VKQLRGFLGL 
Sbjct: 775  KQHQLFARLSKCSFGDTEVDYLGHKVSGLGVSMENTKVQAVLDWPTPNNVKQLRGFLGLT 834

Query: 215  GYYRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVV 394
            GYYRRF++ YA+IA PLTDLL  D+F W  +A AAF KLK+AM EAPVL LPDF+  F++
Sbjct: 835  GYYRRFIKSYANIAGPLTDLLQKDSFLWNNEAEAAFVKLKKAMTEAPVLSLPDFSQPFIL 894

Query: 395  ETDASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSF 574
            ETDAS IG+GAVL Q GHPI YFSKKL P+MQ  S Y +EL AI EA+ K+R YLLG  F
Sbjct: 895  ETDASGIGVGAVLGQNGHPIAYFSKKLAPRMQKQSAYTRELLAITEALSKFRHYLLGNKF 954

Query: 575  IIRTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENAL 754
            IIRTD +S+K L  Q + TPEQQ ++ K +GY F+IEY+PGK N  ADALSR+       
Sbjct: 955  IIRTDQRSLKSLMDQSLQTPEQQAWLHKFLGYDFKIEYKPGKDNQAADALSRM------- 1007

Query: 755  EPLSSGHHFQLE-SRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQ 931
                    F L  S P   FL+ +++    +P L++L +     +   +++V+EG+LY++
Sbjct: 1008 --------FMLAWSEPHSIFLEELRARLISDPHLKQLMETYKQGADASHYTVREGLLYWK 1059

Query: 932  HRYVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVC 1111
             R VI ++  +  KI+ E+H++ + GH GI RTLAR+ A FYW  M+ DV+ +++ C +C
Sbjct: 1060 DRVVIPAEEEIVNKILQEYHSSPIGGHAGITRTLARLKAQFYWPKMQEDVKAYIQKCLIC 1119

Query: 1112 QQIKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGAL 1291
            QQ K +   P GLLQPLPIP  VW++V MDFITGLP S G +VI+VV+D+LTK AHF  L
Sbjct: 1120 QQAKSNNTLPAGLLQPLPIPQQVWEDVAMDFITGLPNSFGLSVIMVVIDRLTKYAHFIPL 1179

Query: 1292 PAQFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQ 1471
             A +++   AE F   +VK+HG P SI+SDRD +F S FWQ LFKL GTTL  SSA+HPQ
Sbjct: 1180 KADYNSKVVAEAFMSHIVKLHGIPRSIVSDRDRVFTSTFWQHLFKLQGTTLAMSSAYHPQ 1239

Query: 1472 TDGQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPP 1651
            +DGQ+EV+N+ LE YLRCFT E PK WV  L WAEF YNT YH +L +TPF+AL+GR PP
Sbjct: 1240 SDGQSEVLNKCLEMYLRCFTYEHPKGWVKALPWAEFWYNTAYHMSLGMTPFRALYGREPP 1299

Query: 1652 SIPPYRRDSVSIQ---AVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGD 1822
            ++    R + SI     V E L +RD LL  LK NL +AQ  M ++A+  R +  + IGD
Sbjct: 1300 TL---TRQACSIDDPAEVREQLTDRDALLAKLKINLTRAQQVMKRQADKKRLDVSFQIGD 1356

Query: 1823 KV*VRLQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVL 2002
            +V V+LQPYRQ +      QKL  RY+GPF VL K G V YKL+L   +RIHPVFHVS L
Sbjct: 1357 EVLVKLQPYRQHSAVLRKNQKLSMRYFGPFKVLAKIGDVAYKLELPSAARIHPVFHVSQL 1416

Query: 2003 KPFVGNEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTW 2182
            KPF G      LPLPL   E  PV +P+ I A   ++R   +I Q+LVQW +   + +TW
Sbjct: 1417 KPFNGTAQDPYLPLPLTVTEMGPVMQPVKILASRIIIRGHNQIEQILVQWENGLQDEATW 1476

Query: 2183 GDLDQLQQHFPNLHLEDKVVLQGGESDMPNPTDLG--LNH--PESAQHEIISQAQEEEPM 2350
             D++ ++  +P  +LEDKVV + GE ++ N    G  +N+    S++  + ++  + E +
Sbjct: 1477 EDIEDIKASYPTFNLEDKVVFK-GEGNVTNGMSRGEKVNNTAESSSERGLHNKLADFEEL 1535

Query: 2351 RKSNRLKITPKW 2386
             +  R K  P W
Sbjct: 1536 GRGKREK-KPSW 1546


>gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1258

 Score =  736 bits (1900), Expect = 0.0
 Identities = 375/791 (47%), Positives = 514/791 (64%), Gaps = 3/791 (0%)
 Frame = +2

Query: 41   HHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGY 220
            +  Y+KLSKCSF    +EYLGHVV    V  D  K +A+  WP P  +KQLRGFLGL GY
Sbjct: 481  NQLYVKLSKCSFGVLEIEYLGHVVSGEGVSMDKTKIQAVVDWPPPKNIKQLRGFLGLTGY 540

Query: 221  YRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVET 400
            YRRF++ YA IA+PLTDLL  +A+ WT    +AF KLK A+  APVL LPDFT  FV+ET
Sbjct: 541  YRRFIQSYAKIASPLTDLLKKEAYTWTSQEESAFQKLKHAITTAPVLALPDFTKPFVLET 600

Query: 401  DASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFII 580
            DAS IGIGAVL Q+GHPI YFSKKL P+ Q  S Y +E+ AI EA+ K+R YLLG  FII
Sbjct: 601  DASGIGIGAVLHQEGHPIAYFSKKLVPRNQRKSAYFREMLAIAEAIAKFRHYLLGHKFII 660

Query: 581  RTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEP 760
            RTD KS++ L +Q + TP+QQ ++ + +GY F IEY+PGK N  ADALSRV      +  
Sbjct: 661  RTDQKSLRNLMEQALQTPDQQEWLHRFLGYDFSIEYKPGKENVAADALSRV------MTM 714

Query: 761  LSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQHRY 940
              S   ++L    +     A+K ++++   +Q+  Q  AT+S   +++VK+ +L+++HR 
Sbjct: 715  AWSEPQYKL----LHQIRAALKQDSTLLGIMQKCVQNNATNS---HYTVKDELLFWKHRI 767

Query: 941  VIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQI 1120
            VI   S L  ++++E H + + GH G+ RTLAR+ + FYW  M+ D+ D+V++C +CQ+ 
Sbjct: 768  VIPKNSELIKQVLYELHTSPIGGHAGMARTLARVKSQFYWPDMKTDIADYVQNCAICQKA 827

Query: 1121 KYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQ 1300
            K +   P GLLQPLPIPS VW++V MDFITGLP S+G+  ILVV+D+LTK AHF  L   
Sbjct: 828  KTTNTLPAGLLQPLPIPSQVWEDVAMDFITGLPSSQGYTTILVVIDRLTKYAHFIPLKTD 887

Query: 1301 FSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDG 1480
            +S+   AE   D +VK+HG P SI+SDRD +F S+FWQQLFKL GTTL  SSA+HPQ+DG
Sbjct: 888  YSSKIVAEAVMDNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTTLAMSSAYHPQSDG 947

Query: 1481 QTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIP 1660
            Q+EV+N++LE +LRCFT + PK W   L W+EF YNT + +++ +TPF+AL+GR PP++ 
Sbjct: 948  QSEVLNKTLELFLRCFTFDNPKSWCKALSWSEFWYNTAFQTSIGMTPFKALYGRDPPALI 1007

Query: 1661 PYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRL 1840
             Y   +     + E L ERD ++Q LK NL +AQ  M ++A+ HR + +  +GD V V+L
Sbjct: 1008 RYETQANDPPTLQEKLMERDRIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDLVLVKL 1067

Query: 1841 QPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGN 2020
            QPYRQ +VA    QKL  RY+GPF V+ K G V YKL L   ++IHPVFHVS LKPF G+
Sbjct: 1068 QPYRQQSVALRKNQKLGMRYFGPFEVIAKVGEVAYKLKLPEHAKIHPVFHVSQLKPFKGD 1127

Query: 2021 EDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQL 2200
                 +PLPL   +  P+ +P+++ A   ++R  + I QVL+QW       +TW D+D L
Sbjct: 1128 NQEQYMPLPLSMTDTGPMIQPVSVLATRTIIRGAQRIQQVLIQWDQYSTAEATWEDVDAL 1187

Query: 2201 QQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEIISQAQEEEPM---RKSNRLK 2371
            Q  FP  +LEDKV   G    M    +  L   ESA+  +    +    M   R+  R++
Sbjct: 1188 QSKFPAFNLEDKVAFIGDGIVMSPMEENILQEGESAKEGLNDMHERNSVMMGPRRGKRVR 1247

Query: 2372 ITPKWHKDYVM 2404
             T K  + Y +
Sbjct: 1248 KTSKRLEGYAL 1258


>gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1240

 Score =  734 bits (1895), Expect = 0.0
 Identities = 368/734 (50%), Positives = 491/734 (66%)
 Frame = +2

Query: 50   YIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYYRR 229
            ++KLSKCSF  + +EYLGH+V    V  D  K +A+ +W  P+ VKQLRGFLGL GYYRR
Sbjct: 485  FVKLSKCSFGVSEIEYLGHMVTGQGVSMDRDKVQAVLNWSTPTNVKQLRGFLGLTGYYRR 544

Query: 230  FVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETDAS 409
            F++ YA IAAPLTDLL  +++RW + A  AF +LK+A+  APVL LP+F   F++ETDAS
Sbjct: 545  FIKSYAKIAAPLTDLLKKESYRWNDQADIAFQQLKEAVTSAPVLALPNFHKPFILETDAS 604

Query: 410  NIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIRTD 589
             +GIGAVL Q  HPI YFSKKL P+ Q  S Y +E+ AI EA+ K+R YLLG  FIIRTD
Sbjct: 605  GVGIGAVLHQDNHPIAYFSKKLVPRNQKKSAYFREMLAIAEAIAKFRHYLLGHRFIIRTD 664

Query: 590  HKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEPLSS 769
             KS++ L  Q + TPEQQ ++ K +GY F IEY+PGK N  ADALSR       L  L+ 
Sbjct: 665  QKSLRSLMDQSLQTPEQQEWLHKFLGYDFVIEYKPGKENLAADALSR-------LMTLAW 717

Query: 770  GHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQHRYVIG 949
                   S P ++F   +K     +  L E+ Q+     +  N++V+EGILY++HR VI 
Sbjct: 718  -------SEPQYNFTQQVKEAIQQDDNLLEIIQKCLQGLAPTNYTVREGILYWKHRMVIP 770

Query: 950  SKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQIKYS 1129
             K+AL  +I+ EFH + + GH G+ RTLARI + FYW  M++D+ D+V++C VCQQ K +
Sbjct: 771  PKAALIQQILEEFHTSPIGGHAGMTRTLARIKSQFYWSAMKKDIFDYVQNCLVCQQAKTT 830

Query: 1130 TQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQFSA 1309
               P GLLQPLPIPS VW+++ MDFITGLP S G+  I+VVVD+LTK AHF A+   +++
Sbjct: 831  NTLPAGLLQPLPIPSQVWEDIAMDFITGLPLSFGYTTIMVVVDRLTKYAHFIAMKTDYTS 890

Query: 1310 TQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDGQTE 1489
               AE F   VVK+HG P SI+SDRD +F S FWQ LFKL GTTL  +SA+HPQ+DGQTE
Sbjct: 891  KSVAEAFMHNVVKLHGMPKSIVSDRDKVFTSTFWQHLFKLQGTTLAMTSAYHPQSDGQTE 950

Query: 1490 VVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIPPYR 1669
            V+N+ LE YLRCF+   PK W  +L W+EF YNT + +++ +TPF+AL+GR PP +  Y 
Sbjct: 951  VLNKGLELYLRCFSFNNPKSWFKMLSWSEFWYNTAFQTSIGMTPFKALYGRDPPYLTRYV 1010

Query: 1670 RDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRLQPY 1849
              +     + E L ERD++LQ LK NL++AQ  M ++A+ HR +    IGD V V+LQPY
Sbjct: 1011 AQASDPPTLQEELMERDKILQQLKDNLIRAQQYMKKQADKHRSDISLKIGDLVLVKLQPY 1070

Query: 1850 RQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGNEDT 2029
            RQ +VA    QKL  RY+GPF ++ + G V YKL L  +++IHPVFHVS LKPF G  D 
Sbjct: 1071 RQHSVALRKNQKLGLRYFGPFEIIARVGEVAYKLKLPDDAKIHPVFHVSQLKPFKGVADE 1130

Query: 2030 SVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQLQQH 2209
              LPLPL   +  P  +P+ +     V+R  ++I QVL+QW   P   +TW D+  +Q+ 
Sbjct: 1131 QYLPLPLTMTDIGPSIQPIDVLQVRTVIRGSQQIHQVLIQWDQYPAAQATWEDITTIQEK 1190

Query: 2210 FPNLHLEDKVVLQG 2251
            FP+L+LEDKV   G
Sbjct: 1191 FPSLNLEDKVAFNG 1204


>gb|PNX91810.1| transposon Ty3 gag-pol polyprotein [Trifolium pratense]
          Length = 843

 Score =  720 bits (1859), Expect = 0.0
 Identities = 354/744 (47%), Positives = 488/744 (65%), Gaps = 1/744 (0%)
 Frame = +2

Query: 23   FGNFKHHHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGF 202
            F       FY+K SKC  AQ  +EYLGH++    V+ DP K  AM  WP+P+++  LRGF
Sbjct: 75   FSTLLSAQFYLKQSKCLLAQRKLEYLGHIISGKGVQVDPSKISAMVDWPVPTSITSLRGF 134

Query: 203  LGLIGYYRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTM 382
            LGL G+YR+F+R YA+IA PLT LL  DAF WTE+A  AFN LKQAM +AP+L  P+F +
Sbjct: 135  LGLTGFYRKFIRNYAAIATPLTRLLRKDAFNWTEEAQLAFNSLKQAMTKAPLLASPNFNI 194

Query: 383  EFVVETDASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLL 562
             F++ETDAS I +GAVL Q  HPI +FSK    ++  +STY++ELHAI  AVKKWRQYLL
Sbjct: 195  PFILETDASGIAMGAVLMQNNHPIAFFSKPFCQRLLNSSTYVRELHAITTAVKKWRQYLL 254

Query: 563  GRSFIIRTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGD 742
            G  FII TDHKS+K+L  QV+ TPEQQ+Y+ KL+G+ F I+Y+ G  N VADALSR+   
Sbjct: 255  GHHFIIFTDHKSLKQLISQVIQTPEQQVYLSKLLGFDFTIQYKAGSTNVVADALSRITPS 314

Query: 743  ENALEPLSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATD-SSLQNFSVKEGI 919
             + L          L + P + FL+ +KS+ S   E Q+L   IA   +S  +    + +
Sbjct: 315  ASCL----------LLTVPHFVFLNELKSQLSTTQEFQQLKLAIAQQPASYSDNEEHQDL 364

Query: 920  LYFQHRYVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKS 1099
            ++F+ +  +    +LK +I+HEFH + ++GH G+ +T  R+ A F+W GMR+D+  F+  
Sbjct: 365  IFFKKKIWLPRDFSLKDRILHEFHNSPISGHMGVDKTFHRLQANFFWQGMRQDIRSFIAK 424

Query: 1100 CQVCQQIKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAH 1279
            C VCQ  KY T+ P GLLQPLP+PS +W+++++DFITGLP S G+  I VVVD+ +K AH
Sbjct: 425  CSVCQSTKYETKKPAGLLQPLPVPSGIWEDLSLDFITGLPPSHGYTTIFVVVDRFSKGAH 484

Query: 1280 FGALPAQFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSA 1459
            F ALP  ++A + A+LF D + K HG P S++SDRDP+F+S FW++LFKL+GT LR S+A
Sbjct: 485  FSALPTTYTAYKVAQLFLDTICKHHGMPRSLVSDRDPVFISQFWRELFKLSGTQLRMSTA 544

Query: 1460 FHPQTDGQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFG 1639
            +HP+TDGQTEV+NRSLEQYLR F   +P  W T L  AE+SYNT+ HSA   +PF  ++G
Sbjct: 545  YHPETDGQTEVLNRSLEQYLRAFVHHKPSLWFTFLSLAEWSYNTSKHSATGYSPFHVVYG 604

Query: 1640 RPPPSIPPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIG 1819
            + P SIP Y   +  I+AVD +LAER   LQ L+  L++ Q+ M + A+  RR  ++++G
Sbjct: 605  KDPVSIPQYVLGTSPIEAVDSMLAERQAFLQFLRRKLLKVQSHMKEIADKKRRHVEFNVG 664

Query: 1820 DKV*VRLQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSV 1999
            D V ++L+PYRQ ++      KL KRY+GP+ +L+K G V YKLDL   S+IHPVFH S+
Sbjct: 665  DFVYLKLRPYRQRSITLASYNKLSKRYFGPYKILQKIGPVAYKLDLPSTSKIHPVFHCSL 724

Query: 2000 LKPFVGNEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESST 2179
            LK   G+   +   LP    ++ P+ +P+AI               VLVQW   PLE ++
Sbjct: 725  LKLHQGDLPAAHAELPPSTIDHQPIIEPLAIVDTKMDTATDPPTRMVLVQWLGLPLEETS 784

Query: 2180 WGDLDQLQQHFPNLHLEDKVVLQG 2251
            W   D LQ  +   HLEDKV   G
Sbjct: 785  WETWDDLQATY---HLEDKVTFPG 805


>gb|OMO81561.1| reverse transcriptase [Corchorus capsularis]
          Length = 1523

 Score =  742 bits (1915), Expect = 0.0
 Identities = 369/790 (46%), Positives = 519/790 (65%), Gaps = 1/790 (0%)
 Frame = +2

Query: 44   HFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYY 223
            H++ K SKCSFAQ S++YLGH++    V  D  K +A+ +WP+P+ +K+LR FLGL GYY
Sbjct: 750  HYFAKFSKCSFAQGSVDYLGHIISGLGVAVDHSKIDAILAWPVPTFIKKLRVFLGLTGYY 809

Query: 224  RRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETD 403
            R+FVRGYAS+AAPLTDLL  D F W+ +AT AF  LK  +V APVL +PDF+  FV+ETD
Sbjct: 810  RKFVRGYASLAAPLTDLLKKDNFIWSAEATKAFESLKHVLVTAPVLAIPDFSQPFVLETD 869

Query: 404  ASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIR 583
            AS   IGAVL+Q GHPI YFS+KL P+MQ AS Y +E+ AI E+VKKWRQYLLGR F+I 
Sbjct: 870  ASMTAIGAVLSQHGHPIAYFSRKLNPQMQTASAYAREMFAITESVKKWRQYLLGRPFLIY 929

Query: 584  TDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEPL 763
            TD +S++ L  Q + TPEQQ ++ KL+GY + I Y+ G  N VADALSR   ++  L  +
Sbjct: 930  TDQQSLRNLMNQTIQTPEQQRWLAKLLGYQYTILYKLGVQNKVADALSRSFPEQGELNAI 989

Query: 764  SSGHHFQLESRPIWDFLDAIKSENSVNPE-LQELHQRIATDSSLQNFSVKEGILYFQHRY 940
            S          P + FL  ++   + +P   Q  +      +   +  V++G+L    R 
Sbjct: 990  SG---------PTFPFLTQLRDYYANDPHGKQHFNDVKEHPNKFPDLLVQDGLLLRHGRI 1040

Query: 941  VIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQI 1120
            VI     L+ +++ E+H T+  GH G+ +TL+R+A+ F+W  MR+ + +F+ +C+ CQ++
Sbjct: 1041 VIPENHPLQQQLLFEYHCTLTGGHAGVAKTLSRLASNFFWSNMRKTMANFISTCRTCQEV 1100

Query: 1121 KYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQ 1300
            K       GLLQPLPIP+ +W ++ MDFIT LP S G   I V+VD+L+K AHF A+PA 
Sbjct: 1101 KCLPTKQAGLLQPLPIPTHIWQDIAMDFITHLPFSNGKTTIWVIVDRLSKYAHFLAIPAH 1160

Query: 1301 FSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDG 1480
             +A Q A LF+  + K+HG P SI+SDRDPIF+S+FW++LF+L GT L HSSA+HPQ+DG
Sbjct: 1161 TTAPQLAALFSQEIGKLHGLPRSIVSDRDPIFISSFWKELFRLQGTKLNHSSAYHPQSDG 1220

Query: 1481 QTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIP 1660
            Q+EV+NR LE YLRCF  + P+ W  +  WAE+SYNT  HSA+ +TPF+A++G  PP++ 
Sbjct: 1221 QSEVLNRCLETYLRCFAGDNPRSWSKIFHWAEWSYNTALHSAINMTPFEAVYGYAPPTVA 1280

Query: 1661 PYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRL 1840
             Y      I  +D+ L ER  LL  LK NL +AQNRM  +A+ HR+E+ +  G+ V V+L
Sbjct: 1281 SYLPGPSKIAQLDDCLVERQTLLARLKVNLARAQNRMKMQADRHRKEKHFEEGEWVWVKL 1340

Query: 1841 QPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGN 2020
            QPYRQ +V +   QKL K+Y+GPF ++++ G+V Y+L L  +SRIHPVFHVS+LK + GN
Sbjct: 1341 QPYRQQSVVKRTTQKLAKKYFGPFQIIKRVGTVAYELKLPADSRIHPVFHVSLLKAYRGN 1400

Query: 2021 EDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQL 2200
             D +  PLP    E+ PV +P  +    +V  + + + Q+LV+W + P   +TW  LD +
Sbjct: 1401 LDINPTPLPALAIEDQPVLEPEVVLKTREVKYQDQNLPQILVKWKNLPEAEATWEWLDDV 1460

Query: 2201 QQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEIISQAQEEEPMRKSNRLKITP 2380
            Q ++P  HLEDKVV     SD  + TD  L     + +E  SQ   E PMR+ NR +  P
Sbjct: 1461 QTNYPAFHLEDKVV-----SDRES-TDTSL---APSPNEPSSQDNSEAPMRRGNRARNAP 1511

Query: 2381 KWHKDYVMQQ 2410
             WH D++ Q+
Sbjct: 1512 PWHVDFIRQR 1521


>gb|PNY17392.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1510

 Score =  740 bits (1910), Expect = 0.0
 Identities = 390/794 (49%), Positives = 524/794 (65%), Gaps = 6/794 (0%)
 Frame = +2

Query: 41   HHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGY 220
            H F +K SKCSFAQ+S++YLGH+V A  V  DP K EAM SWP+PS VKQLRGFLGL G+
Sbjct: 731  HEFCLKQSKCSFAQSSIDYLGHIVSAEGVGPDPTKIEAMVSWPVPSNVKQLRGFLGLTGF 790

Query: 221  YRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVET 400
            YR+F+  YASIAAPLT LL  DAF WT+ A  AF+ LK+AM EAPVL LP+F  +F++ET
Sbjct: 791  YRKFICKYASIAAPLTALLKRDAFIWTDAAQQAFDMLKRAMSEAPVLSLPNFEDQFILET 850

Query: 401  DASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFII 580
            DAS +G+GAVL QKGHPI YFSK+  P+M  ASTY++EL AI  AVKKWR YLLG +FII
Sbjct: 851  DASGMGMGAVLIQKGHPICYFSKQFCPRMLVASTYVRELCAITTAVKKWRTYLLGNTFII 910

Query: 581  RTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEP 760
             TD +S++EL  QV+ TPEQQ Y+ KL+GYS+ I Y+PG  N VADALSRV         
Sbjct: 911  YTDQRSLRELMTQVIQTPEQQFYLAKLLGYSYEIMYKPGPQNRVADALSRV--------- 961

Query: 761  LSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQ-NFSVKEGILYFQHR 937
                 H    + P  DFL  +K +   + E Q+L   +  +      F + + +L+F+ +
Sbjct: 962  -----HCLAITVPHLDFLHTLKEQLVQDDEFQQLLTNVKENPDAHMGFEILDDLLFFKGK 1016

Query: 938  YVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQ 1117
              I S S LK+ ++ EFH++ + GH GI RT  R+    +W GMR DV  FVKSC +CQQ
Sbjct: 1017 LFIPSNSPLKVTLLEEFHSSTIGGHSGIHRTFGRLQENVFWHGMRNDVTQFVKSCAICQQ 1076

Query: 1118 IKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPA 1297
             K  T +P GLLQPLPIP  VW+++++DF+ GLP  +   V+LVVVD+L+K+AHFG LP 
Sbjct: 1077 TKPPTHSPYGLLQPLPIPDKVWEDISLDFVVGLPSFQTHTVVLVVVDRLSKAAHFGLLPT 1136

Query: 1298 QFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTD 1477
             F+A + A+LFA MV K+HG P SI+SDRDPIFLS+FWQ+LF+L+GT LR S+A+HPQ+D
Sbjct: 1137 HFTAAKVADLFAKMVCKLHGMPRSIVSDRDPIFLSHFWQELFRLSGTKLRMSTAYHPQSD 1196

Query: 1478 GQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSI 1657
            GQTE VN+ L+QYLRCF  ++PK+W   L WAE+ YNT  H++  ++P++ ++GRPPP++
Sbjct: 1197 GQTENVNKVLQQYLRCFVHDKPKQWGHYLHWAEWHYNTAIHTSTGISPYEVVYGRPPPTL 1256

Query: 1658 PPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VR 1837
              Y   S  +QAVD  L ERD +++ LK  L++AQN M + A+L R   Q+ +GD V V+
Sbjct: 1257 ADYVPGSSKLQAVDATLTERDIVIEVLKNKLLKAQNTMKEYADLKRIPHQFKVGDWVFVK 1316

Query: 1838 LQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVG 2017
            L+PYRQ +V      KL KR+YGPF ++   G V ++L+L   SRIHPVFHVS LKP   
Sbjct: 1317 LRPYRQNSVLGRRFHKLSKRFYGPFKLVRAIGEVAFELELPDTSRIHPVFHVSQLKPCF- 1375

Query: 2018 NEDTSVLP--LPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDL 2191
              D +V+P  LP E  +N P   P+AI         G++  + LVQW     E +TW + 
Sbjct: 1376 --DNTVVPLALPPETVDNQPCITPIAILDWRTNEENGQQ--EALVQWEGLFPEDATWENY 1431

Query: 2192 DQLQQHFPNLHLEDKVVLQGGESDMPNPTDLGLNHPESAQHEIISQAQEEEPMRKSNRLK 2371
              L+  +P    ED V L   + D+ N  D+GL+  E  Q E  +Q  EEE +    ++K
Sbjct: 1432 QDLKNSYPTFDHEDMVSLD-EQRDVMNQNDMGLD--EEIQ-ENWAQENEEEVLGHQPKVK 1487

Query: 2372 ---ITPKWHKDYVM 2404
               + PK   D+V+
Sbjct: 1488 RKIVRPKHLDDFVI 1501


>gb|KYP39589.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1510

 Score =  738 bits (1906), Expect = 0.0
 Identities = 361/740 (48%), Positives = 504/740 (68%)
 Frame = +2

Query: 41   HHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGY 220
            H  Y K+SKCSF    +EYLGHVV    V  +  K +A+  WP+P T+KQLRGFLGL GY
Sbjct: 727  HELYAKMSKCSFGLEQVEYLGHVVSGDGVSMETSKVQAVIDWPVPKTIKQLRGFLGLTGY 786

Query: 221  YRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVET 400
            YRRF++GYASIA PLTDLL  D F+W+ +A AAF  LKQA+  APVL LPDF+  FV+ET
Sbjct: 787  YRRFIQGYASIANPLTDLLKKDNFKWSNEADAAFIALKQAITTAPVLSLPDFSQPFVLET 846

Query: 401  DASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFII 580
            DAS  GIGAVL+Q  HPI +FSKKL  +M   S Y +E +AI EA+ K+R YLLG  FII
Sbjct: 847  DASGSGIGAVLSQNKHPIAFFSKKLSNRMTKQSAYTREFYAITEAIAKFRHYLLGHRFII 906

Query: 581  RTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEP 760
            RTD KS+K L  Q + TPEQQ ++ K +GY F IEY+PG  N  ADALSR     +A+  
Sbjct: 907  RTDQKSLKSLLDQTLQTPEQQAWLHKFLGYDFSIEYKPGTENLAADALSRSFFMASAVTA 966

Query: 761  LSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQHRY 940
                H  +           A+ S+ ++ P L    Q  A  +    +S  +G+L+++ R 
Sbjct: 967  SDLVHQIKA----------ALGSDTALQPILTAHSQGKALSAP---YSFLDGLLFWKGRI 1013

Query: 941  VIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQI 1120
            V+ +  A++ +I+ EFH++ L GH GI RT AR+AA F+W GM +D+++FV+ C VCQQ 
Sbjct: 1014 VVPNVPAIQNQILQEFHSSPLGGHSGIARTFARVAAQFFWPGMNKDIKNFVQQCCVCQQA 1073

Query: 1121 KYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQ 1300
            K +T  P GLLQPLPIP+ +W++++MDFI GLP ++G+ VI V+VD+L+K AHF  L + 
Sbjct: 1074 KTATVLPAGLLQPLPIPTQIWEDISMDFIVGLPPAEGYTVIFVIVDRLSKYAHFAPLKSD 1133

Query: 1301 FSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDG 1480
            F++ + A++F   VVK+HGFP+SI+SDRD +F S FWQ L KL+GTTL+ S+A+HPQ+DG
Sbjct: 1134 FNSKRVADVFLHTVVKLHGFPNSIVSDRDKVFTSTFWQHLLKLSGTTLKLSTAYHPQSDG 1193

Query: 1481 QTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIP 1660
            QTE +N+ LE YLRCFT E+PK W+  L WAEF YNT++H + +++PF+ ++GR PP++ 
Sbjct: 1194 QTEALNKCLEMYLRCFTHEKPKDWIKFLPWAEFWYNTSFHHSAQMSPFKVVYGRDPPTLV 1253

Query: 1661 PYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRL 1840
             Y   +    ++ E+L +RD +L  LK NL+ AQ RM + A+  R  +++  G+ V V+L
Sbjct: 1254 KYSHSATDPPSIQEMLLQRDRVLAQLKVNLMLAQQRMKKYADQKRLHKEFVEGEMVLVKL 1313

Query: 1841 QPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGN 2020
            QPYRQ ++A    QKL  RY+GPF + ++ GSV YKL L   ++IHPVFH+S LK F G 
Sbjct: 1314 QPYRQHSLALRKNQKLGLRYFGPFPIQKRIGSVAYKLLLPDYAKIHPVFHISQLKQFRGV 1373

Query: 2021 EDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQL 2200
             DT  +PLPL      PV +P+ + +   +++ GK + QVLVQW    ++++TW DLD+L
Sbjct: 1374 TDTVYVPLPLTTAVEGPVVQPIQVLSVRDIIQAGKLVRQVLVQWEGFGVDAATWEDLDKL 1433

Query: 2201 QQHFPNLHLEDKVVLQGGES 2260
            +Q +PN++LEDKV+ +GG S
Sbjct: 1434 EQSYPNINLEDKVIAKGGSS 1453


>dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum]
          Length = 1531

 Score =  739 bits (1907), Expect = 0.0
 Identities = 367/740 (49%), Positives = 492/740 (66%), Gaps = 1/740 (0%)
 Frame = +2

Query: 35   KHHHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLI 214
            K +  Y+KLSKCSF    +EYLGHVV    V  D  K + +  WP P  +KQLRGFLGL 
Sbjct: 752  KDNELYVKLSKCSFGVLEIEYLGHVVSGEGVYMDKSKIQVVVDWPSPKNIKQLRGFLGLT 811

Query: 215  GYYRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVV 394
            GYYRRF++ YA IA+PLTDLL  DA+ W  +  AAF KLK A+  APVL LPDFT  F++
Sbjct: 812  GYYRRFIQSYAKIASPLTDLLKKDAYTWNSEMEAAFQKLKHAITTAPVLALPDFTKPFIL 871

Query: 395  ETDASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSF 574
            ETDAS IGIGAVL Q+GHPI YFSKKL P+ Q  S Y +E+ AI EA+ K+R YLLG  F
Sbjct: 872  ETDASGIGIGAVLHQEGHPIAYFSKKLVPRNQRKSAYFREMLAIAEAIAKFRHYLLGHKF 931

Query: 575  IIRTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENAL 754
            IIRTD KS++ L +Q + TP+QQ ++ + +GY F IEY+PGK N  ADALSRV       
Sbjct: 932  IIRTDQKSLRNLMEQALQTPDQQEWLHRFLGYDFTIEYKPGKENVAADALSRVMT----- 986

Query: 755  EPLSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDS-SLQNFSVKEGILYFQ 931
                        S P +  L  I+     +  L E+ ++ A +S S  N+++K+ +L+++
Sbjct: 987  ---------LAWSEPQYKLLHQIRVALKQDSTLLEIMEKCAQNSDSNSNYTIKDDLLFWK 1037

Query: 932  HRYVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVC 1111
            HR VI   S L+ ++++E H + + GH GI RTLAR+ A FYW+ M+ D+  +V++C +C
Sbjct: 1038 HRIVIPKHSELRQQVLYELHTSPIGGHAGIARTLARVKAQFYWLDMKTDIAKYVQNCVIC 1097

Query: 1112 QQIKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGAL 1291
            Q+ K +   P GLLQPLPIPS VW++V MDFITGLP S G+  ILVV+D+LTK AHF  L
Sbjct: 1098 QKAKTTNTPPAGLLQPLPIPSQVWEDVAMDFITGLPSSHGYTTILVVIDRLTKYAHFIPL 1157

Query: 1292 PAQFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQ 1471
               +S+   AE F D +VK+HG P SI+SDRD +F S+FWQQLFKL GT+L  SSA+HPQ
Sbjct: 1158 KTDYSSKIVAEAFMDNIVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTSLAMSSAYHPQ 1217

Query: 1472 TDGQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPP 1651
            +DGQ+EV+N++LE +LRCFT E PK W   L W+EF YNT + +++ +TPF+AL+GR PP
Sbjct: 1218 SDGQSEVLNKTLELFLRCFTFENPKSWCKALAWSEFWYNTAFQTSIGMTPFKALYGRDPP 1277

Query: 1652 SIPPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV* 1831
            +I  Y   +     + E L ERD ++Q LK NL +AQ  M ++A+ HR + +  +GD V 
Sbjct: 1278 AIIRYEIQASDSPTLQEKLMERDRIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDWVL 1337

Query: 1832 VRLQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPF 2011
            V+LQPYRQ +VA    QKL  +Y+GPF V+ K G V YKL L   ++IHPVFHVS LKPF
Sbjct: 1338 VKLQPYRQQSVALRKNQKLGMKYFGPFEVIAKVGEVAYKLKLPDHAKIHPVFHVSQLKPF 1397

Query: 2012 VGNEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDL 2191
             G+     +PLPL   +  P+ +P+A+ A   ++R  + I QVL+QW   P+  +TW D+
Sbjct: 1398 KGDNQEQYMPLPLSMTDIGPMIQPVAVLATRTIIRCAQRIQQVLIQWDQYPIAEATWEDM 1457

Query: 2192 DQLQQHFPNLHLEDKVVLQG 2251
              LQ+ FP  +LEDKV   G
Sbjct: 1458 VALQRKFPTFNLEDKVAFIG 1477


>gb|OMO55704.1| reverse transcriptase [Corchorus capsularis]
          Length = 2083

 Score =  751 bits (1939), Expect = 0.0
 Identities = 375/746 (50%), Positives = 506/746 (67%), Gaps = 1/746 (0%)
 Frame = +2

Query: 47   FYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYYR 226
            FY KLSKCSFAQ+S++YL H++    V+ DP K EA+ +WP PS VK LRGFLGL GYYR
Sbjct: 761  FYAKLSKCSFAQSSIDYLEHIISDQGVQVDPSKIEAVMAWPQPSNVKSLRGFLGLTGYYR 820

Query: 227  RFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETDA 406
            +FV  YA+IAAPLTDLL + AF WT+ A+  F KLKQA+   P L LPDF+  F V TDA
Sbjct: 821  KFVAHYATIAAPLTDLLKSKAFHWTQSASDVFEKLKQALTSTPCLALPDFSKPFEVTTDA 880

Query: 407  SNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIRT 586
            SN+ +GAVL+Q  HP+ YFSKKL PK+Q +STY++E++AI EA KKWRQYLLGR FII T
Sbjct: 881  SNVAVGAVLSQDSHPLAYFSKKLNPKLQNSSTYVREMYAITEAFKKWRQYLLGRPFIIYT 940

Query: 587  DHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSR-VPGDENALEPL 763
            D +S++ L  Q + TPEQQ ++ KL+G+ + I+Y+PG  N V DALSR  P + N L   
Sbjct: 941  DQQSLRGLMNQTIQTPEQQKWLVKLLGFQYSIQYKPGTQNKVVDALSRSFPVEANCLAI- 999

Query: 764  SSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQHRYV 943
                     S  ++ FLD ++ +  ++    +L  +   DS  + F++  G++    R V
Sbjct: 1000 ---------SGLVFSFLDDLR-QYFISDTRGKLFFQQCQDSPSE-FTIVNGLIMRDGRIV 1048

Query: 944  IGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQIK 1123
            I     L+  ++HEFH+T+  GH GI RTL R++A F+W  MR+ V++FV +C+VCQ++K
Sbjct: 1049 IPDSHPLQQTLLHEFHSTLTGGHAGISRTLVRLSASFWWNNMRKSVKEFVSTCKVCQEVK 1108

Query: 1124 YSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQF 1303
            Y T  P GLL+PLPIPS  W ++ MDFIT LP S G   I V+VD+ +K AHF ALPA  
Sbjct: 1109 YLTSKPQGLLEPLPIPSQAWQDIAMDFITHLPISHGKVTIWVIVDRFSKYAHFLALPAGV 1168

Query: 1304 SATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDGQ 1483
            +A   A +FA  +VK+HG P SI+SDRDP+F+S FW +LFKL GT L  SSA+HPQTDGQ
Sbjct: 1169 TAPHLAAIFAQEIVKLHGIPRSIVSDRDPLFVSKFWNELFKLQGTQLPMSSAYHPQTDGQ 1228

Query: 1484 TEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIPP 1663
            +EV+NR LE YLR F  E PK+W  +L WAE+SYN+++HSA  +TPFQAL+G PPPSIP 
Sbjct: 1229 SEVLNRCLETYLRAFVSENPKQWTRILHWAEWSYNSSFHSAACMTPFQALYGFPPPSIPS 1288

Query: 1664 YRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRLQ 1843
            Y   S ++  +D+ L +R +LL+ LKANL +A NRM  +A+  R E+Q++ GD V V+LQ
Sbjct: 1289 YLPGSTTVAQLDDSLIDRQQLLKQLKANLARASNRMKIQADRKRVEKQFTEGDLVLVKLQ 1348

Query: 1844 PYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGNE 2023
            PYRQ +V     QKL K+Y+GP+ +L+K G V YKL+L   SR+HPVFHVS+L+ F G+ 
Sbjct: 1349 PYRQQSVVSRTSQKLSKKYFGPYKILQKIGPVAYKLELPEGSRVHPVFHVSLLRAFKGDL 1408

Query: 2024 DTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQLQ 2203
              +  PLP EC    P  +P  I    +V +  K +TQVLV+W   P   STW   + + 
Sbjct: 1409 PATPSPLPTECVNGQPTLEPELILKSRQVKQAKKVLTQVLVKWKQLPESDSTWEWAEDIS 1468

Query: 2204 QHFPNLHLEDKVVLQGGESDMPNPTD 2281
              FPN +LE+KVV+Q G + M + ++
Sbjct: 1469 SSFPNFNLENKVVIQEGSNVMSSSSN 1494


>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score =  737 bits (1902), Expect = 0.0
 Identities = 376/792 (47%), Positives = 513/792 (64%), Gaps = 9/792 (1%)
 Frame = +2

Query: 50   YIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGYYRR 229
            Y+KLSKC+F    +EYLGHVV    V  D  K +A+ +WP P  VKQLRGFLGL GYYRR
Sbjct: 757  YVKLSKCNFGVLEIEYLGHVVTGQGVSMDKDKVQAVLNWPTPMNVKQLRGFLGLTGYYRR 816

Query: 230  FVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVETDAS 409
            F++ YA IA+PLTDLL  +A++W   A  AF +LK A+  APVL LP+F + F++ETDAS
Sbjct: 817  FIKSYAKIASPLTDLLKKEAYQWNAQAEEAFKQLKNAITTAPVLALPNFKLPFILETDAS 876

Query: 410  NIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFIIRTD 589
             +GIGAVL Q+GHPI YFSKKL P+ Q  S Y +E+ AI EA+ K+R YLLG  FIIRTD
Sbjct: 877  GVGIGAVLHQQGHPIAYFSKKLVPRNQKKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTD 936

Query: 590  HKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEPLSS 769
             KS++ L +Q + TP+QQ ++ K +GY F IEY+PGK N  ADALSR+         LS 
Sbjct: 937  QKSLRSLMEQSLQTPDQQEWLHKFLGYDFTIEYKPGKENMAADALSRIM-------TLSW 989

Query: 770  GHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQNFSVKEGILYFQHRYVIG 949
                   S P   F++ I+     + +++E+  +     +   +S+++G+LY++ R VI 
Sbjct: 990  -------SEPKCQFIEQIRVALQNDNQMREILMKCNAGKAPVQYSMRDGLLYWKQRLVIP 1042

Query: 950  SKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQIKYS 1129
              + L  K++ EFH + + GH GI RT+ARI + FYW  M++D+ D+V+ C VCQQ K +
Sbjct: 1043 KDNDLLYKVLFEFHTSPIGGHAGITRTMARIKSQFYWPDMKQDIIDYVQKCMVCQQAKTT 1102

Query: 1130 TQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPAQFSA 1309
              +P GLLQPLPIPS VW+++ MDFITGLP S G+  I+VVVD+LTK AHF  + + +++
Sbjct: 1103 NTSPAGLLQPLPIPSQVWEDIAMDFITGLPLSSGYTTIMVVVDRLTKYAHFIPMKSDYTS 1162

Query: 1310 TQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTDGQTE 1489
               AE F   +VK+HG P SI+SDRD +F S FWQQLFKL GT+L  SSA+HPQ+DGQTE
Sbjct: 1163 KSVAESFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLFKLQGTSLAMSSAYHPQSDGQTE 1222

Query: 1490 VVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSIPPYR 1669
            V+N++LE +LRCFT   PK W  +L WAE+ YNT + +++ +TPF+AL+GR PP +  Y 
Sbjct: 1223 VLNKALELFLRCFTFHNPKSWSKVLSWAEYWYNTAFQTSIGMTPFKALYGRDPPYLTKYE 1282

Query: 1670 RDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VRLQPY 1849
                   A+ E L ERD++LQ LK+NL +AQ  M ++A+ HR++  + +GD V V+LQPY
Sbjct: 1283 AQVTDPPALQEELMERDKILQQLKSNLDRAQQYMKKQADKHRKDVTFQVGDLVLVKLQPY 1342

Query: 1850 RQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVGNEDT 2029
            RQ +VA    QKL  RY+GPF ++   G+V YKL L   ++IHPVFHVS LKPF G    
Sbjct: 1343 RQQSVALRKNQKLGMRYFGPFEIIACIGAVAYKLKLPDNAKIHPVFHVSQLKPFKGAASD 1402

Query: 2030 SVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQLQQH 2209
              LPLPL   E  P+ +P+A+     ++R  + + Q+LVQW       +TW D D LQ  
Sbjct: 1403 QYLPLPLTMTETGPIMQPIAVLQARTIMRGTQRVHQILVQWDTNAEAEATWEDFDDLQLK 1462

Query: 2210 FPNLHLEDKVVLQG-GESDMPNPTDLGLNHPESAQ--------HEIISQAQEEEPMRKSN 2362
            FP L+LEDKVV  G G    PN T+L L    SA+        H+ +S  +E    R+  
Sbjct: 1463 FPTLNLEDKVVFNGEGIVMRPNTTNL-LEENVSAKFHKGPQDMHDSVSGGKELSGPRRGQ 1521

Query: 2363 RLKITPKWHKDY 2398
            R K      ++Y
Sbjct: 1522 RAKKPHSMWREY 1533


>gb|PNY00428.1| hypothetical protein L195_g023708, partial [Trifolium pratense]
          Length = 1181

 Score =  723 bits (1865), Expect = 0.0
 Identities = 360/738 (48%), Positives = 499/738 (67%), Gaps = 1/738 (0%)
 Frame = +2

Query: 41   HHFYIKLSKCSFAQNSMEYLGHVVVAGEVRADPKKNEAMTSWPIPSTVKQLRGFLGLIGY 220
            + F+ KLSKC F  +S+EYLGH++    VRADP K +AM SWP+P  +  LR FLGL G+
Sbjct: 423  NQFFAKLSKCQFGVSSVEYLGHIISVEGVRADPSKLQAMVSWPVPKNITALRAFLGLTGF 482

Query: 221  YRRFVRGYASIAAPLTDLLCTDAFRWTEDATAAFNKLKQAMVEAPVLYLPDFTMEFVVET 400
            YRRFV  YASIA+PLTDLL  ++F W++ A  AFN LK AM   P+L LP+FT+ F V T
Sbjct: 483  YRRFVLNYASIASPLTDLLKANSFAWSDAANTAFNTLKNAMANLPLLTLPNFTLPFEVTT 542

Query: 401  DASNIGIGAVLTQKGHPITYFSKKLGPKMQAASTYLKELHAIVEAVKKWRQYLLGRSFII 580
            DAS   +GAVL+Q   P+ +FSKKL  ++ A+STY++EL+A+ EA+KKWRQYLLG  F I
Sbjct: 543  DASLTAVGAVLSQNSKPLAFFSKKLSARLSASSTYVRELYALTEAIKKWRQYLLGSPFKI 602

Query: 581  RTDHKSIKELFQQVVHTPEQQIYIQKLMGYSFRIEYRPGKLNSVADALSRVPGDENALEP 760
             TDHKS+K L  Q + TPEQQ ++ KL+GY++ I Y+PGK N VADALSRV   E+ +E 
Sbjct: 603  FTDHKSLKSLMTQTIQTPEQQKWLTKLLGYTYEIHYKPGKENVVADALSRV--QESPMEG 660

Query: 761  LSSGHHFQLESRPIWDFLDAIKSENSVNPELQELHQRIATDSSLQN-FSVKEGILYFQHR 937
              +     L + PI   +  ++S  S NP   +L  +  TD  +Q  F VK G+L+FQ+R
Sbjct: 661  ECA-----LLTFPISTLISQLQSFFSSNPAGTKLMNKAVTDPKMQQQFQVKAGLLHFQNR 715

Query: 938  YVIGSKSALKLKIMHEFHATMLAGHPGIKRTLARIAAVFYWVGMRRDVEDFVKSCQVCQQ 1117
              I  +S L   ++ EFH++   GH GI+ TLAR++A FYW GM +DV+ FV +C VCQ 
Sbjct: 716  LFIPFESGLTTSLLQEFHSSPTGGHSGIQATLARLSATFYWPGMYKDVKQFVNACSVCQH 775

Query: 1118 IKYSTQAPTGLLQPLPIPSMVWDEVTMDFITGLPESKGFAVILVVVDQLTKSAHFGALPA 1297
             KYSTQ+P GLLQPLP+P  VW++++MDFIT LP +   + I V+VD+LTK AHF ALP 
Sbjct: 776  NKYSTQSPYGLLQPLPLPQQVWEDISMDFITHLPMTHNRSCIWVIVDRLTKFAHFIALPG 835

Query: 1298 QFSATQTAELFADMVVKIHGFPSSIISDRDPIFLSNFWQQLFKLNGTTLRHSSAFHPQTD 1477
             F+A   A +F   + ++HG P +I+SDRD +F+S FW+ LF   GT+L  SS++HPQ+D
Sbjct: 836  SFTAASLAPIFITEIYRLHGAPKTIVSDRDRVFVSQFWRALFHHLGTSLAFSSSYHPQSD 895

Query: 1478 GQTEVVNRSLEQYLRCFTQEQPKRWVTLLKWAEFSYNTNYHSALKLTPFQALFGRPPPSI 1657
            GQTEV+NR LE YLRCF  ++P+ W+  L  AEF YNT++H+A+ +TPF+AL+GR PP++
Sbjct: 896  GQTEVLNRCLETYLRCFVSDEPRLWLRFLALAEFWYNTSFHTAIGMTPFEALYGRKPPTL 955

Query: 1658 PPYRRDSVSIQAVDELLAERDELLQSLKANLVQAQNRMVQKANLHRREQQYSIGDKV*VR 1837
              Y   +  I+++DELL ++  +L+ LK NLV+A+NRM+ +AN HR+++ + +G  V ++
Sbjct: 956  VHYTPGTSKIESLDELLTQKTLVLKVLKENLVKARNRMIIQANQHRQDRNFEVGQWVYLK 1015

Query: 1838 LQPYRQTTVARCPCQKLDKRYYGPFTVLEKFGSVTYKLDLSPESRIHPVFHVSVLKPFVG 2017
            LQPYRQ +V      KL KRYYGPF +L+K G V Y+LDL   SR+HPVFHVS+LK   G
Sbjct: 1016 LQPYRQHSVHHRESHKLAKRYYGPFRILKKIGKVAYELDLPAASRVHPVFHVSLLKLCHG 1075

Query: 2018 NEDTSVLPLPLECFENHPVHKPMAICAEHKVLRRGKEITQVLVQWSDAPLESSTWGDLDQ 2197
               T V P+       +P   P+ +   ++ +  G +I + LV+W D PL  +TW     
Sbjct: 1076 EPTTQVTPIADP--STYPPIIPVPVAIRNRRISAG-DIEEFLVEWKDLPLSEATWVAKTT 1132

Query: 2198 LQQHFPNLHLEDKVVLQG 2251
             Q  FPN +LEDK++  G
Sbjct: 1133 FQDQFPNTNLEDKILFDG 1150

