BLASTX nr result

ID: Astragalus22_contig00032118 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00032118
         (4938 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAO23078.1| polyprotein [Glycine max]                             1856   0.0  
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  1832   0.0  
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...  1814   0.0  
dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subt...  1795   0.0  
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...  1783   0.0  
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...  1768   0.0  
gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  1766   0.0  
dbj|GAU29525.1| hypothetical protein TSUD_115470 [Trifolium subt...  1716   0.0  
dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subt...  1706   0.0  
gb|KYP39589.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]  1659   0.0  
gb|PNX92398.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  1633   0.0  
gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifo...  1600   0.0  
gb|PNY16671.1| retrotransposon-related protein, partial [Trifoli...  1585   0.0  
gb|PNX94483.1| retrotransposon-related protein, partial [Trifoli...  1578   0.0  
gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium prat...  1553   0.0  
dbj|GAU23504.1| hypothetical protein TSUD_39830 [Trifolium subte...  1522   0.0  
gb|KYP39590.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]  1470   0.0  
dbj|GAU35592.1| hypothetical protein TSUD_295280 [Trifolium subt...  1402   0.0  
gb|PNX98300.1| Ty3/gypsy retrotransposon protein, partial [Trifo...  1310   0.0  
gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifo...  1301   0.0  

>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score = 1856 bits (4808), Expect = 0.0
 Identities = 922/1564 (58%), Positives = 1143/1564 (73%), Gaps = 32/1564 (2%)
 Frame = +1

Query: 223  MADNTRLKELTTELRRQAETI-----------EKNEAASRARFEELSSMQKASEARFNQL 369
            MADNTR+KE+  EL++ A+ I           E+ EA + A+ E++  MQ  ++++F+QL
Sbjct: 1    MADNTRMKEVYAELKKNADAITRVSDDLQNHIERLEATNHAQMEKIEVMQSTNDSQFSQL 60

Query: 370  ADAFEKLMQQT---PTVTHGGTNSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIPTMSS 540
                 +++Q+    P  +HG +NS                                    
Sbjct: 61   NAVMSQVLQRLQNIPMSSHGASNSQKE------------------------------QQR 90

Query: 541  TPFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMV 720
            + FQVR +KLDFPRF+GKNV+DWIFKAEQFF YY TPD +RLIIASVHLDQ+VVPW+QM+
Sbjct: 91   SSFQVRSVKLDFPRFDGKNVMDWIFKAEQFFDYYATPDADRLIIASVHLDQDVVPWYQML 150

Query: 721  NRSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISN 900
             ++ PF +W  FTRALELDFGPS YDCPRA+LFKL QS +VNEYY++FTAL NRV G+S 
Sbjct: 151  QKTEPFSSWQAFTRALELDFGPSAYDCPRATLFKLNQSATVNEYYMQFTALVNRVDGLSA 210

Query: 901  DALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAGSTAV 1080
            +A++DCFVSGL++E+ RDV    P ++T                   +  +KY   ++  
Sbjct: 211  EAILDCFVSGLQEEISRDVKAMEPRTLTKAVAL------------AKLFEEKY---TSPP 255

Query: 1081 QARAPFNPTRN--SQTQATEKAXXXXXXXXXXXXXMSQLQKNPA----------IKRIS- 1221
            + +   N  RN  S T AT+K              +  L   P+          IK+IS 
Sbjct: 256  KTKTFSNLARNFTSNTSATQKYPPTNQKNDNPKPNLPPLLPTPSTKPFNLRNQNIKKISP 315

Query: 1222 AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLEL-----XXXXXXXXXXXXANSDDDRE 1386
            AE+QLRREK LCYFCD+KFS +H+CPN+ +MLL+L                 AN DDD  
Sbjct: 316  AEIQLRREKNLCYFCDEKFSPAHKCPNRQVMLLQLEETDEDQTDEQVMVTEEANMDDD-- 373

Query: 1387 IQKPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIE 1566
                  HHLSLNA+ G+ G+G IRF G +G + V IL+DGGSSD+FIQPR+A  L LP+E
Sbjct: 374  -----THHLSLNAMRGSNGVGTIRFTGQVGGIAVKILVDGGSSDNFIQPRVAQVLKLPVE 428

Query: 1567 PVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPH 1746
            P     VLVGNGQ + AEG++Q L + +QG ++ VP YLL ++G+DVILG+ WLA+LGPH
Sbjct: 429  PAPNLRVLVGNGQILSAEGIVQQLPLHIQGQEVKVPVYLLQISGADVILGSTWLATLGPH 488

Query: 1747 VADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPE 1926
            VADYA   L+F+   KF+TLQG+ + +   AQLH F+RLQ+  +I E F +Q I+   PE
Sbjct: 489  VADYAALTLKFFQNDKFITLQGEGNSEATQAQLHHFRRLQNTKSIEECFAIQLIQKEVPE 548

Query: 1927 DNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHS 2106
            D  KD+P  ++PE+A LL+TY +VF VP  LPP RE  H I LK+G+ PVKV+PYRYPH+
Sbjct: 549  DTLKDLPTNIDPELAILLHTYAQVFAVPASLPPQREQDHAIPLKQGSGPVKVRPYRYPHT 608

Query: 2107 QKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMP 2286
            QK+QIEKM+QEML +G              LVKKKDG+WR CTDYRALNAIT+KDSFPMP
Sbjct: 609  QKDQIEKMIQEMLVQGIIQPSNSPFSLPILLVKKKDGSWRFCTDYRALNAITVKDSFPMP 668

Query: 2287 TVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPA 2466
            TVDELLDELHGA++FSK DLRSGYHQILV+PEDR+KTAFRTH GHYEWLVMPFGLTNAPA
Sbjct: 669  TVDELLDELHGAQYFSKLDLRSGYHQILVQPEDREKTAFRTHHGHYEWLVMPFGLTNAPA 728

Query: 2467 TFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSF 2646
            TFQ LMN+IFQ  LRK VLVFFDDIL+YS SW  HL+ +E VLQ L QH L+A+LSKCSF
Sbjct: 729  TFQCLMNKIFQFALRKFVLVFFDDILIYSASWKDHLKHLESVLQTLKQHQLFARLSKCSF 788

Query: 2647 GLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIA 2826
            G TEV+YLGH VSG GVSM+  KV AVLDWP P+N+KQLRGFLGLTGYYRRFIK+YA+IA
Sbjct: 789  GDTEVDYLGHKVSGLGVSMENTKVQAVLDWPTPNNVKQLRGFLGLTGYYRRFIKSYANIA 848

Query: 2827 GPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLS 3006
            GPLT+LL+KD+F WN   + AF +LKKA+T APVL LPDFSQPF+LETDASG GVGAVL 
Sbjct: 849  GPLTDLLQKDSFLWNNEAEAAFVKLKKAMTEAPVLSLPDFSQPFILETDASGIGVGAVLG 908

Query: 3007 QGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXX 3186
            Q GHPIA+FSKK+APRMQ QSAYTRE  AIT AL+KFRHYLLG+KFI+RT          
Sbjct: 909  QNGHPIAYFSKKLAPRMQKQSAYTRELLAITEALSKFRHYLLGNKFIIRTDQRSLKSLMD 968

Query: 3187 XXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVA 3366
                TPEQQAWLHKF+GYDF+IEYKPGKDNQAADALSR+  ++WS P ++FLEEL+  + 
Sbjct: 969  QSLQTPEQQAWLHKFLGYDFKIEYKPGKDNQAADALSRMFMLAWSEPHSIFLEELRARLI 1028

Query: 3367 TNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHA 3546
            ++ HLK +++  +Q   A  +Y V+ GLLYWK ++VIP E +++ +IL+EYH SPIGGHA
Sbjct: 1029 SDPHLKQLMETYKQGADAS-HYTVREGLLYWKDRVVIPAEEEIVNKILQEYHSSPIGGHA 1087

Query: 3547 GVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVT 3726
            G+TRTLAR+ AQFYWPKM++DVK Y+  C++CQQAKS  T  AGLL+PLPIP QVW+DV 
Sbjct: 1088 GITRTLARLKAQFYWPKMQEDVKAYIQKCLICQQAKSNNTLPAGLLQPLPIPQQVWEDVA 1147

Query: 3727 MDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSII 3906
            MDFITGLP S G +VIMVV+DRLTK+AHFIP+K DY+SK VAEAFM  +V+LHG+P+SI+
Sbjct: 1148 MDFITGLPNSFGLSVIMVVIDRLTKYAHFIPLKADYNSKVVAEAFMSHIVKLHGIPRSIV 1207

Query: 3907 SDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWY 4086
            SDRDRVFTS FWQ+LF+LQGTTLAMSSAYHPQ+DGQSE +NKCLEMYLRCFT+ +PK W 
Sbjct: 1208 SDRDRVFTSTFWQHLFKLQGTTLAMSSAYHPQSDGQSEVLNKCLEMYLRCFTYEHPKGWV 1267

Query: 4087 KALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITE 4266
            KAL WAE WYNT++H S GM+PF+ALYGREPP LTR A +  DP E+++QL DRDA++ +
Sbjct: 1268 KALPWAEFWYNTAYHMSLGMTPFRALYGREPPTLTRQACSIDDPAEVREQLTDRDALLAK 1327

Query: 4267 LKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFT 4446
            LK NL RAQQ+MK  ADKKR DV F++GD+VLVKLQPYRQ S  LRKNQKL MRYFGPF 
Sbjct: 1328 LKINLTRAQQVMKRQADKKRLDVSFQIGDEVLVKLQPYRQHSAVLRKNQKLSMRYFGPFK 1387

Query: 4447 ILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGIL 4626
            +LAKIG VAYKL+LP  A+IH VFHVSQLK F GT  +PY+PLPLT  E GP+++P  IL
Sbjct: 1388 VLAKIGDVAYKLELPSAARIHPVFHVSQLKPFNGTAQDPYLPLPLTVTEMGPVMQPVKIL 1447

Query: 4627 KTRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDVMNNK 4806
             +R I+R   Q+ Q+LV+W++    EATWED  ++   YP  NLEDK+  +GEG+V N  
Sbjct: 1448 ASRIIIRGHNQIEQILVQWENGLQDEATWEDIEDIKASYPTFNLEDKVVFKGEGNVTNGM 1507

Query: 4807 SVNE 4818
            S  E
Sbjct: 1508 SRGE 1511


>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score = 1832 bits (4746), Expect = 0.0
 Identities = 889/1432 (62%), Positives = 1095/1432 (76%), Gaps = 5/1432 (0%)
 Frame = +1

Query: 535  SSTPFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQ 714
            S  PFQVR++KL+FPRF+G NV +WIF+AEQFF YY+TPD +RL IASVHLD++VVPW+Q
Sbjct: 67   SRPPFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLTIASVHLDKDVVPWYQ 126

Query: 715  MVNRSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGI 894
            MV R+ PF +W++FTRALELDFGPS+YDCPRA+LFKL+Q+ +V EYYL+FT+L+NRVYG+
Sbjct: 127  MVQRTHPFTSWIEFTRALELDFGPSVYDCPRATLFKLKQTGTVAEYYLQFTSLANRVYGL 186

Query: 895  SNDALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAGST 1074
            SNDALIDCFVSGL DE+RRDV++HTP S+                 N N +N   +  + 
Sbjct: 187  SNDALIDCFVSGLNDEIRRDVLIHTPISLVKAMSLAKVYEEKYSYNNKNQKNYSNSYSTN 246

Query: 1075 AVQARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKRIS-AEMQLRREKG 1251
                  P   TRN+                     MSQ Q NP IKR+S AE QLRR+KG
Sbjct: 247  KPNTNKPDYTTRNTAP----------ILNTPPTRPMSQFQNNPNIKRMSQAERQLRRDKG 296

Query: 1252 LCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDDDREIQK----PVEHHLSL 1419
            LCY+CDDKFSF+H+CPN+ LML++                  +  I+       EHHLSL
Sbjct: 297  LCYWCDDKFSFTHKCPNRQLMLIQNDDDLDADQVLDQLTQTTETTIKSLDTNQPEHHLSL 356

Query: 1420 NALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGN 1599
            NA+ G + MGV+RF G I  + V IL+DGGSSD+F+QPRIA  L LPIEP  + NVLVGN
Sbjct: 357  NAMKGTSNMGVLRFAGSIEHIGVQILIDGGSSDNFLQPRIAKFLKLPIEPGPQFNVLVGN 416

Query: 1600 GQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVADYATSKLRF 1779
            G+ M AEGVIQNL + +QG  + VP +LLPVAG+DVILGA WLA+LGPHVADYA+  L+F
Sbjct: 417  GEIMTAEGVIQNLPLEIQGHKLEVPVFLLPVAGADVILGASWLATLGPHVADYASLTLKF 476

Query: 1780 YLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDMPVTVE 1959
            +L+ KFVTL G   P+   AQ H FKRL + D+I+E FT+Q +++    D +KD+P   E
Sbjct: 477  FLKDKFVTLTGQAVPRPTPAQFHHFKRLANTDSIAECFTVQCLKSTDDADIFKDLPTNTE 536

Query: 1960 PEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQE 2139
            PE+A LLYTY+ VF+ P  LPP R  +H I L EGA PVKVKPYRYPHSQKEQIE M+Q+
Sbjct: 537  PEIAMLLYTYKNVFKTPTALPPDRFHNHTIPLIEGATPVKVKPYRYPHSQKEQIEAMIQD 596

Query: 2140 MLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHG 2319
            ML +G              LVKKKDGTWR CTDYRALNAIT+KDSFP+PTVDELLDEL G
Sbjct: 597  MLHQGIIQNSTSPFSSPIVLVKKKDGTWRFCTDYRALNAITVKDSFPIPTVDELLDELFG 656

Query: 2320 AKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQ 2499
            AK+FSK DLRSGYHQIL++PEDR KTAFRTHQGHYEWLVMPFGLTNAPATFQ LMN IFQ
Sbjct: 657  AKYFSKLDLRSGYHQILLQPEDRHKTAFRTHQGHYEWLVMPFGLTNAPATFQSLMNHIFQ 716

Query: 2500 PFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHT 2679
              LRK VLVFFDDIL+YS +W  HLQ ++ VL++L +++LY KLSKC+FG+ E+EYLGH 
Sbjct: 717  HALRKYVLVFFDDILIYSNTWQDHLQHLDVVLRVLQENILYVKLSKCNFGVLEIEYLGHV 776

Query: 2680 VSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDA 2859
            V+G GVSMD++KV AVL+WP P N+KQLRGFLGLTGYYRRFIK+YA IA PLT+LLKK+A
Sbjct: 777  VTGQGVSMDKDKVQAVLNWPTPMNVKQLRGFLGLTGYYRRFIKSYAKIASPLTDLLKKEA 836

Query: 2860 FQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSK 3039
            +QWN   + AF+QLK AIT+APVL LP+F  PF+LETDASG G+GAVL Q GHPIA+FSK
Sbjct: 837  YQWNAQAEEAFKQLKNAITTAPVLALPNFKLPFILETDASGVGIGAVLHQQGHPIAYFSK 896

Query: 3040 KMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAW 3219
            K+ PR Q +SAY RE  AI  A+AKFRHYLLGHKFI+RT              TP+QQ W
Sbjct: 897  KLVPRNQKKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTDQKSLRSLMEQSLQTPDQQEW 956

Query: 3220 LHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQ 3399
            LHKF+GYDF IEYKPGK+N AADALSR+MT+SWS P+  F+E+++  +  +  ++ I+ +
Sbjct: 957  LHKFLGYDFTIEYKPGKENMAADALSRIMTLSWSEPKCQFIEQIRVALQNDNQMREILMK 1016

Query: 3400 CEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGA 3579
            C     A   Y+++ GLLYWKQ+LVIP ++ L+ ++L E+H SPIGGHAG+TRT+ARI +
Sbjct: 1017 CNAGK-APVQYSMRDGLLYWKQRLVIPKDNDLLYKVLFEFHTSPIGGHAGITRTMARIKS 1075

Query: 3580 QFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSH 3759
            QFYWP M+QD+ +YV  C+VCQQAK+  T+ AGLL+PLPIP+QVW+D+ MDFITGLP S 
Sbjct: 1076 QFYWPDMKQDIIDYVQKCMVCQQAKTTNTSPAGLLQPLPIPSQVWEDIAMDFITGLPLSS 1135

Query: 3760 GYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKF 3939
            GYT IMVVVDRLTK+AHFIPMK+DY+SKSVAE+FMH++V+LHGMPKSI+SDRD+VFTS F
Sbjct: 1136 GYTTIMVVVDRLTKYAHFIPMKSDYTSKSVAESFMHNIVKLHGMPKSIVSDRDKVFTSAF 1195

Query: 3940 WQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYN 4119
            WQ LF+LQGT+LAMSSAYHPQ+DGQ+E +NK LE++LRCFTF+NPKSW K L WAE WYN
Sbjct: 1196 WQQLFKLQGTSLAMSSAYHPQSDGQTEVLNKALELFLRCFTFHNPKSWSKVLSWAEYWYN 1255

Query: 4120 TSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITELKRNLARAQQI 4299
            T+  TS GM+PFKALYGR+PP LT+Y A  +DP  LQ++L++RD ++ +LK NL RAQQ 
Sbjct: 1256 TAFQTSIGMTPFKALYGRDPPYLTKYEAQVTDPPALQEELMERDKILQQLKSNLDRAQQY 1315

Query: 4300 MKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFTILAKIGKVAYK 4479
            MK  ADK R+DV F+VGD VLVKLQPYRQ SVALRKNQKLGMRYFGPF I+A IG VAYK
Sbjct: 1316 MKKQADKHRKDVTFQVGDLVLVKLQPYRQQSVALRKNQKLGMRYFGPFEIIACIGAVAYK 1375

Query: 4480 LQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGILKTRTILRNEQQ 4659
            L+LPD AKIH VFHVSQLK FKG  ++ Y+PLPLT  E GPI++P  +L+ RTI+R  Q+
Sbjct: 1376 LKLPDNAKIHPVFHVSQLKPFKGAASDQYLPLPLTMTETGPIMQPIAVLQARTIMRGTQR 1435

Query: 4660 VPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDVMNNKSVN 4815
            V Q+LV+W     AEATWED +++ +++P LNLEDK+   GEG VM   + N
Sbjct: 1436 VHQILVQWDTNAEAEATWEDFDDLQLKFPTLNLEDKVVFNGEGIVMRPNTTN 1487


>gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1502

 Score = 1814 bits (4699), Expect = 0.0
 Identities = 895/1532 (58%), Positives = 1125/1532 (73%), Gaps = 10/1532 (0%)
 Frame = +1

Query: 250  LTTELRRQAETIEKNEAASRARFEELSSMQKASEARFNQLADAFEKLMQQTPTVTHGGTN 429
            +  EL+R AE +   +A     FE L   + A++ +F+++  A + L+ Q+P+  H G  
Sbjct: 9    IEAELKRNAEDMRLYQAEM---FERLERSEAANKEKFDKIFTAIDILIDQSPSKHHHGAG 65

Query: 430  SGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIPTMSSTPFQVRHIKLDFPRFNGKNVLDW 609
                                               +  PFQVR++KL+FPRF+G NV +W
Sbjct: 66   LNN--------------------------------NRPPFQVRNVKLEFPRFDGTNVHEW 93

Query: 610  IFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVNRSRPFQTWMDFTRALELDFGPS 789
            IF+AEQFF YY+TPD++RL I+SVHLD++VVPW+QMV R RPF +W++FTRALELDFGPS
Sbjct: 94   IFRAEQFFDYYDTPDSDRLTISSVHLDKDVVPWYQMVQRLRPFTSWVEFTRALELDFGPS 153

Query: 790  MYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISNDALIDCFVSGLKDELRRDVMLHT 969
            +YDCPRA+LFKL Q+ +V EYYL+FT+L+NRVYG+SNDA++DCFVSGL +++RRDV++HT
Sbjct: 154  VYDCPRATLFKLSQTGTVAEYYLQFTSLANRVYGLSNDAMVDCFVSGLNNQIRRDVLIHT 213

Query: 970  PTSITXXXXXXXXXXXXXXXXNTNVQNQKY----NAGSTAVQARAPFNPTRNSQTQATEK 1137
            P S+                    V  +KY    N     +   +   P  N    AT  
Sbjct: 214  PPSLVKAVSL------------AKVYEEKYADAMNTQKATINNHSTNKPFINKPEIATRN 261

Query: 1138 AXXXXXXXXXXXXXMSQLQKNPAIKRIS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLM 1314
                          MSQ QKNP IKR+S AE Q+RR+KGLCY+CD+KFSF+H+CPN+ ++
Sbjct: 262  TAPILNTPPTRP--MSQFQKNPNIKRMSPAERQVRRDKGLCYWCDEKFSFTHKCPNRQML 319

Query: 1315 LLELXXXXXXXXXXXXANSDDDR-----EIQKPVEHHLSLNALNGATGMGVIRFKGCIGP 1479
            LL+               +  ++     +     EHHLSLNAL G + MGVIRF G I  
Sbjct: 320  LLQYDDDDNDADQVFDTLTQTEQVTTNGQTTNLPEHHLSLNALKGTSNMGVIRFAGSIEH 379

Query: 1480 VDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGI 1659
            + V IL+DGGSSD+F+QPRIA  L LPIEP  + NVLVGNG+ M AEG+IQ L + +QG 
Sbjct: 380  IGVQILIDGGSSDNFMQPRIAKFLKLPIEPGPQFNVLVGNGEVMSAEGMIQKLPLHIQGH 439

Query: 1660 DITVPAYLLPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKFVTLQGDTDPKTAIA 1839
             I VP YLLP+AG+DVILGA WLA+LGPHVADYA+  L+F+L+G+FVTL G+   + + A
Sbjct: 440  VIEVPVYLLPIAGADVILGASWLATLGPHVADYASLTLKFFLKGQFVTLTGEPMCRPSQA 499

Query: 1840 QLHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDMPVTVEPEMATLLYTYREVFQVPKGL 2019
            QLH FKRL   D+I+E FT+Q I+    ED +KD+P  +EPE+A LL+ YR +FQ P  L
Sbjct: 500  QLHHFKRLATTDSIAECFTVQCIKPTVEEDIFKDLPDNIEPEIAMLLHQYRGLFQTPTTL 559

Query: 2020 PPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXL 2199
            PP R  +H I L EG+ PVKVKPYRYPHSQK+QIE M+QEML++G              L
Sbjct: 560  PPNRSHNHSIPLLEGSAPVKVKPYRYPHSQKQQIELMIQEMLQQGIIQNSNSPFSSPIIL 619

Query: 2200 VKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSKFDLRSGYHQILVKP 2379
            VKKKDGTWR CTDYRALNAIT+KD FP+PTVDELLDELHGAK+FSK DLRSGYHQIL++P
Sbjct: 620  VKKKDGTWRFCTDYRALNAITVKDCFPIPTVDELLDELHGAKYFSKLDLRSGYHQILLQP 679

Query: 2380 EDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKCVLVFFDDILVYSPS 2559
            EDR KTAFRTHQGHYEWLVMPFGLTNAPATFQ LMN IFQ  LRK VLVFFDDIL+YS +
Sbjct: 680  EDRYKTAFRTHQGHYEWLVMPFGLTNAPATFQSLMNHIFQHALRKYVLVFFDDILIYSSN 739

Query: 2560 WHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWP 2739
            W  HL  ++ VL +L  + LY KLSKCSFG+ E+EYLGH V+G GVSMD++KV AV++WP
Sbjct: 740  WQEHLHHLDTVLTVLQDNTLYVKLSKCSFGVLEIEYLGHVVTGQGVSMDKDKVQAVVNWP 799

Query: 2740 VPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNENTQLAFEQLKKAITS 2919
             PSN+KQLRGFLGLTGYYRRFIK+YA IA PLT+LLKK+A+QW E  ++AF QLK AIT+
Sbjct: 800  TPSNVKQLRGFLGLTGYYRRFIKSYAKIASPLTDLLKKEAYQWTEQAEVAFHQLKNAITT 859

Query: 2920 APVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRMQLQSAYTREFYAIT 3099
            APVLILP+F +PF+LETDASG G+GAVL Q GHPIA+FSKK+ PR Q +SAY RE  AI 
Sbjct: 860  APVLILPNFKEPFILETDASGIGIGAVLHQQGHPIAYFSKKLVPRNQKKSAYFREMLAIA 919

Query: 3100 AALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQ 3279
             A+AKFRHYLLGHKFI+RT              TPEQQ WLHKF+GYDF IEYKPGK+N 
Sbjct: 920  EAIAKFRHYLLGHKFIIRTDQKSLRNLMEQSLQTPEQQEWLHKFLGYDFTIEYKPGKENM 979

Query: 3280 AADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTVADRNYAVKAGLLYW 3459
            AADALSR+M M+WS P+   L+++++ +  +  L+ ++Q       A   Y+++ GLLYW
Sbjct: 980  AADALSRVMVMAWSEPKWQLLDQVRRALENDNQLREVMQNYAIGK-APVQYSMREGLLYW 1038

Query: 3460 KQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPKMRQDVKEYVAACIV 3639
            KQ+LVIP    L+Q++L E+H SPIGGHAG+TRT+ARI AQFYWP M++D+ EYV  C+V
Sbjct: 1039 KQRLVIPKNDDLLQKLLFEFHTSPIGGHAGITRTIARIKAQFYWPDMKKDIAEYVHKCVV 1098

Query: 3640 CQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIMVVVDRLTKFAHFIP 3819
            CQQAK+  T+ AGLL+PLPIP+QVW+D+ MDFITGLP S+GYT IMVVVDRLTK AHFIP
Sbjct: 1099 CQQAKTTNTSPAGLLQPLPIPSQVWEDIAMDFITGLPLSYGYTTIMVVVDRLTKSAHFIP 1158

Query: 3820 MKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFELQGTTLAMSSAYHP 3999
            MKTDY+SK+VAEAFMH++V+LHGMPKSI+SDRD+VFTS FWQ+LF++QGT+LAMSSAYHP
Sbjct: 1159 MKTDYTSKTVAEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQHLFKMQGTSLAMSSAYHP 1218

Query: 4000 QTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTSAGMSPFKALYGREP 4179
            QTDGQ+E +NK LE++LRCFTF+NPKSW+K + WAE WYNT+  TS GM+PFKALYGR+P
Sbjct: 1219 QTDGQTEVLNKTLELFLRCFTFHNPKSWFKVMSWAEYWYNTAFQTSIGMTPFKALYGRDP 1278

Query: 4180 PMLTRYAANSSDPVELQQQLLDRDAVITELKRNLARAQQIMKNNADKKRRDVVFEVGDQV 4359
            P LT+Y     DP  L+++L++RD ++ +LK NL RAQQ MK  ADK RR+V F+VGD V
Sbjct: 1279 PYLTKYEVQVDDPPALREELMERDQILQQLKTNLERAQQYMKQQADKHRREVSFKVGDLV 1338

Query: 4360 LVKLQPYRQTSVALRKNQKLGMRYFGPFTILAKIGKVAYKLQLPDTAKIHSVFHVSQLKL 4539
            LVKLQPY+Q SVALRKNQKLGMRYFGPF ++A +GKVAYKLQLP+ AKIH VFHVSQLK 
Sbjct: 1339 LVKLQPYKQQSVALRKNQKLGMRYFGPFEVIACVGKVAYKLQLPENAKIHPVFHVSQLKP 1398

Query: 4540 FKGTMTEPYIPLPLTTVEEGPILEPEGILKTRTILRNEQQVPQVLVKWQHLQVAEATWED 4719
            F GT  E Y+PLPLT  + GPI +P  IL+ RTI+R  ++V Q+ ++W      EA+WED
Sbjct: 1399 FHGTSQEQYLPLPLTMSDTGPIFQPATILQARTIVRGNKKVHQLQIQWDLNSPEEASWED 1458

Query: 4720 KNEMAIQYPNLNLEDKIALEGEGDVMNNKSVN 4815
             +E+  ++PN+NLEDK+  +GEG VM   + N
Sbjct: 1459 LDELQNKFPNINLEDKVVFKGEGIVMRPNNTN 1490


>dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subterraneum]
          Length = 1479

 Score = 1795 bits (4649), Expect = 0.0
 Identities = 883/1488 (59%), Positives = 1098/1488 (73%), Gaps = 4/1488 (0%)
 Frame = +1

Query: 316  FEELSSMQKASEARFNQLADAFEKLMQQTPTVTHGGTNSGXXXXXXXXXXXXXXXXXXXX 495
            FE +  ++ A+ ++F+++  A + L+ QTP+    G                        
Sbjct: 28   FERMERLELANASKFDKIFSALDVLIDQTPSKHRHGIG---------------------- 65

Query: 496  XXXXXXXXXIPTMSSTPFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIA 675
                        +S  PFQVR++KL+FPRF+G NV +WIF+AEQFF YY+TPD +RL I+
Sbjct: 66   ------------LSKAPFQVRNVKLEFPRFDGSNVHEWIFRAEQFFDYYDTPDHDRLTIS 113

Query: 676  SVHLDQEVVPWFQMVNRSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYY 855
            SVHLD++VVPW+QM+ R+ PF +W++ TRALELDFGPS+YDCPRA+LFKL+QS SV+EYY
Sbjct: 114  SVHLDKDVVPWYQMMQRTHPFTSWIELTRALELDFGPSIYDCPRATLFKLKQSGSVSEYY 173

Query: 856  LEFTALSNRVYGISNDALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXN 1035
            ++FT+L+NRVYG+SNDALIDCFVSGL DE+RRDV++HTP+S+                 N
Sbjct: 174  MKFTSLANRVYGLSNDALIDCFVSGLSDEIRRDVLIHTPSSLVKAVSLAKVYEEKYAM-N 232

Query: 1036 TNVQNQKYNAGSTAVQARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKR 1215
            +  Q + Y+  ST         P  N    AT  +             MSQ QKNP IKR
Sbjct: 233  SKSQTRNYSNYSTN-------KPLYNKSEIATRNSAPILNTPPTRP--MSQYQKNPNIKR 283

Query: 1216 IS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDD---DR 1383
            IS AEMQ+RR+KGLCY+CD+KFSF+H+CPN+ LMLL                 +    D 
Sbjct: 284  ISPAEMQVRRDKGLCYWCDEKFSFTHKCPNRQLMLLHYDDSDEEQLVEPSITLEPKTIDS 343

Query: 1384 EIQKPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPI 1563
             I    +HHLSLNA+ G   MGV+RF G I    V +L+DGGSSD+F+QPRIA  L LPI
Sbjct: 344  SITNTPDHHLSLNAMKGNNTMGVLRFTGAIEQFKVQVLIDGGSSDNFLQPRIAKFLKLPI 403

Query: 1564 EPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGP 1743
            EP     VLVGNG+ M AEGVIQ L + +QG  I +P +LLPV G+D++LGA WLA+LGP
Sbjct: 404  EPGPTFRVLVGNGEIMTAEGVIQELPLDIQGHKIHIPVFLLPVVGADIVLGASWLATLGP 463

Query: 1744 HVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAP 1923
            HVADYA+  L+F+LEGKFVTL G+ + +   AQLHQFKR  + DAI+E FT+Q I+   P
Sbjct: 464  HVADYASLTLKFFLEGKFVTLVGEHENRPVTAQLHQFKRFHNTDAIAECFTIQCIKTTEP 523

Query: 1924 EDNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPH 2103
             D   ++P  +EPE+A LL+ Y ++FQ P  LPP R  +H I L EG+ PVKVKPYRYPH
Sbjct: 524  ADILHELPSNMEPELAILLHNYSKLFQSPAVLPPSRSHNHCIPLIEGSSPVKVKPYRYPH 583

Query: 2104 SQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPM 2283
            SQK QIE MV++ML++G              LVKKKDGTWR CTDYRALNAITIKDSF +
Sbjct: 584  SQKAQIEIMVEDMLQQGIIQPSTSPFSSPIVLVKKKDGTWRFCTDYRALNAITIKDSFLI 643

Query: 2284 PTVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAP 2463
            PTVDELLDEL GAK+FSK DLRSGYHQIL++PEDR KTAFRTH GHYEWLVMPFGLTNAP
Sbjct: 644  PTVDELLDELFGAKYFSKLDLRSGYHQILLQPEDRFKTAFRTHHGHYEWLVMPFGLTNAP 703

Query: 2464 ATFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCS 2643
            ATFQ LMN IFQ  LRK VLVFFDDIL+YS SWH HLQ +  VL++L  + L+ KLSKCS
Sbjct: 704  ATFQSLMNHIFQQALRKYVLVFFDDILIYSSSWHDHLQHLNKVLEMLQANTLFVKLSKCS 763

Query: 2644 FGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASI 2823
            FG+ E++YLGH V+G GV+M+++KV AVLDWP P N+KQLRGFLGLTGYYRRFIK+YA I
Sbjct: 764  FGVNEIDYLGHIVTGQGVTMEKSKVQAVLDWPTPGNIKQLRGFLGLTGYYRRFIKSYAKI 823

Query: 2824 AGPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVL 3003
            A PLT+LLKK+A++WNE  + AF+QLK A+TSAPVL LP+F++PF+LETDASG G+GAVL
Sbjct: 824  AAPLTDLLKKEAYKWNEQAEAAFQQLKIAVTSAPVLALPNFNEPFILETDASGIGIGAVL 883

Query: 3004 SQGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXX 3183
             Q  HPIA+FSKK+ PR Q +SAY RE  AI  A+AKFRHYLLGHKFI+RT         
Sbjct: 884  HQNNHPIAYFSKKLVPRNQKKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTDQKSLRSLM 943

Query: 3184 XXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEV 3363
                 TP+QQ WLHKF+GYDF IEYKPGKDN AADALSR+M M WS P+  +++++K  +
Sbjct: 944  EQSLQTPDQQEWLHKFLGYDFTIEYKPGKDNLAADALSRMMCMGWSEPQCSWIQQIKTAL 1003

Query: 3364 ATNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGH 3543
              +  L  I+QQCEQ    + +Y V+ GLLYWK ++VIP++  L+QQ+LKE+H SPIGGH
Sbjct: 1004 QQDTTLMEIIQQCEQGQ-TNTHYTVRDGLLYWKHRIVIPVDDALLQQVLKEFHSSPIGGH 1062

Query: 3544 AGVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDV 3723
            AG+TRTLARI AQFYWP M+QD+ +YV  C+VCQQ K+  T  AGLL+PLPIP+Q+W+D+
Sbjct: 1063 AGMTRTLARIQAQFYWPNMKQDIVQYVQNCLVCQQTKTTNTLPAGLLQPLPIPSQIWEDL 1122

Query: 3724 TMDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSI 3903
             MDFITGLP SHGYTVI+VVVDRLTK+AHFIPMKTDYSSKSVAE FM +VV+LHG+PKSI
Sbjct: 1123 AMDFITGLPSSHGYTVILVVVDRLTKYAHFIPMKTDYSSKSVAEVFMQNVVKLHGLPKSI 1182

Query: 3904 ISDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSW 4083
            +SDRD+VFTS FWQ LF+LQGTTLAMSSAYHPQ+DGQSE +NK LE+YLRCF+FNNPK+W
Sbjct: 1183 VSDRDKVFTSAFWQQLFKLQGTTLAMSSAYHPQSDGQSEVLNKTLELYLRCFSFNNPKAW 1242

Query: 4084 YKALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVIT 4263
             K L W+E WYNT+  TS GM+PFKA+YGR+PP L RY A  SDP  L+ +L++RD ++ 
Sbjct: 1243 SKMLSWSEFWYNTAFQTSIGMTPFKAVYGRDPPYLNRYVAQDSDPPTLRAELMERDTILQ 1302

Query: 4264 ELKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPF 4443
            +LK NL +AQQ MK  ADK R ++ F++GD VLVKLQPYRQ SVALRKNQKLG+RYFGPF
Sbjct: 1303 QLKNNLLKAQQYMKKQADKHRIELQFQIGDLVLVKLQPYRQHSVALRKNQKLGLRYFGPF 1362

Query: 4444 TILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGI 4623
             I+AK+G VAYKL+LPD AKIH VFHVSQLK FKG   E Y PLPLT  E GP+ +P  +
Sbjct: 1363 EIIAKVGAVAYKLKLPDYAKIHPVFHVSQLKPFKGFTNEQYFPLPLTMTEIGPLSQPIDV 1422

Query: 4624 LKTRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDK 4767
            L+ RTI++  ++V QVL++W      EA+WED +++  ++P  NLEDK
Sbjct: 1423 LQVRTIIKGTKKVHQVLIQWDQYPREEASWEDADDVLNKFPAFNLEDK 1470


>dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum]
          Length = 1512

 Score = 1783 bits (4618), Expect = 0.0
 Identities = 883/1503 (58%), Positives = 1092/1503 (72%), Gaps = 6/1503 (0%)
 Frame = +1

Query: 316  FEELSSMQKASEARFNQLADAFEKLMQQTPTVTHGGTNSGXXXXXXXXXXXXXXXXXXXX 495
            FE L   + A+  +F ++  A + L+ QTP+  + G                        
Sbjct: 28   FERLERSEAANAEKFAKIFTALDILIDQTPSKQNHG------------------------ 63

Query: 496  XXXXXXXXXIPTMSSTPFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIA 675
                     I   + TPFQVR++KL+FPRF+G NV +WIF+AEQFF YY+TPD +RL IA
Sbjct: 64   ---------IGLNNRTPFQVRNVKLEFPRFDGNNVHEWIFRAEQFFDYYDTPDLDRLTIA 114

Query: 676  SVHLDQEVVPWFQMVNRSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYY 855
            SVHLD++VVPW+QMV R+ PFQ+WMDFTRALELDFGPS+Y+CPRA+LFKL Q+ +V EYY
Sbjct: 115  SVHLDKDVVPWYQMVQRTTPFQSWMDFTRALELDFGPSIYECPRATLFKLNQTGTVAEYY 174

Query: 856  LEFTALSNRVYGISNDALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXN 1035
            L+FT+L+NRVYG+SNDALIDCF+SGL  ++RRDV++HTP SI                  
Sbjct: 175  LQFTSLANRVYGLSNDALIDCFISGLSADIRRDVLIHTPNSIVKAVSLAKVYEEKYTTTL 234

Query: 1036 TNVQNQKYNAGSTAVQARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKR 1215
               +  + N  +       P N TRNS                     MSQ QKNP IKR
Sbjct: 235  KPQKTYQNNYSNIKPLTTKPENSTRNSAP----------ILNTPPTRPMSQFQKNPNIKR 284

Query: 1216 IS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDD----- 1377
            IS AEMQ+RR+KGLCY+CD+KFSF+H+CPN+ LMLL+                D      
Sbjct: 285  ISPAEMQIRRDKGLCYWCDEKFSFTHKCPNRQLMLLQYDDNETQLFDGSPEPPDSPTNSL 344

Query: 1378 DREIQKPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNL 1557
            D  I    +HHLS+NA+ G + MGVIRF G I  ++V IL+DGGSSD+F+QPRIA  L L
Sbjct: 345  DTNIP---DHHLSMNAMKGTSNMGVIRFVGSIEHIEVQILIDGGSSDNFVQPRIAKFLKL 401

Query: 1558 PIEPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASL 1737
            PIEP     VLVGNG+ M AEGVI+ L + +QG  + VPA+LLPVAG DV+LGA WLA+L
Sbjct: 402  PIEPAPVFKVLVGNGEIMNAEGVIKQLPIDIQGHKLEVPAFLLPVAGVDVVLGASWLATL 461

Query: 1738 GPHVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAV 1917
            GPHVADYA+  L+F+L GKFVTL G+   +    Q H  KR  +  AI+E F +Q+++  
Sbjct: 462  GPHVADYASLTLKFFLNGKFVTLVGEPLARPEPTQFHHLKRCCNTKAIAECFIVQRLKTT 521

Query: 1918 APEDNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRY 2097
               D +K++P   EPE+A LL+ Y+EVF+ P GLPP R  +H I L EG+ PVKVKPYRY
Sbjct: 522  EATDIFKELPTNTEPEIAMLLHNYQEVFKTPNGLPPTRAHNHSIPLLEGSNPVKVKPYRY 581

Query: 2098 PHSQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSF 2277
            PHSQK QIE MVQ+ML++G              LVKKKDGTWR CTDYRALNAIT+KDSF
Sbjct: 582  PHSQKTQIEHMVQDMLQQGIIQPSTSPFSSPIILVKKKDGTWRFCTDYRALNAITVKDSF 641

Query: 2278 PMPTVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTN 2457
            P+PTVDELLDEL GAK FSK DLRSGYHQIL++PEDR KTAFRTHQGHYEWLVMPFGLTN
Sbjct: 642  PIPTVDELLDELFGAKHFSKLDLRSGYHQILLQPEDRYKTAFRTHQGHYEWLVMPFGLTN 701

Query: 2458 APATFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSK 2637
            APATFQ LMN IFQ  LRK VLVFFDDILVYS +W  HL+ +  VLQ+L  + L+ KLSK
Sbjct: 702  APATFQSLMNTIFQNVLRKYVLVFFDDILVYSKTWQEHLKHLAAVLQVLQDNELFVKLSK 761

Query: 2638 CSFGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYA 2817
            CSFG++E+EYLGH V+G GVSMD+ K+ AVLDWP P N+KQLRGFL LTGYYRRFI++YA
Sbjct: 762  CSFGVSEIEYLGHVVNGQGVSMDKEKIQAVLDWPPPKNIKQLRGFLCLTGYYRRFIQSYA 821

Query: 2818 SIAGPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGA 2997
             IA PLT+LLKK+A+ W    + AF+QLK AITSAPVL LP+F++ F+LETDASG G+GA
Sbjct: 822  KIASPLTDLLKKEAYVWTTQAKAAFQQLKNAITSAPVLALPNFTKTFILETDASGVGIGA 881

Query: 2998 VLSQGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXX 3177
            VL Q GHPIA+FSKK+ PR Q +SAY RE  AI  A+AKFRHYLLGHKFI+RT       
Sbjct: 882  VLHQEGHPIAYFSKKLVPRNQKKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTDQKSLRN 941

Query: 3178 XXXXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKK 3357
                   TP+QQ WLH+F+GYDF IEYKPGKDN  ADALSR+MT++WS P+   L ++K 
Sbjct: 942  LMEQTLQTPDQQEWLHRFLGYDFSIEYKPGKDNVVADALSRVMTLAWSEPQFRLLHQIKA 1001

Query: 3358 EVATNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIG 3537
                +  L  I  +C Q++    +Y +K  LL+WKQ++VIP +  LIQQ+L E H SP+G
Sbjct: 1002 IQKQDPTLVKIRMECAQHSQPGSHYTIKDDLLFWKQRIVIPHDGALIQQVLYELHTSPLG 1061

Query: 3538 GHAGVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWD 3717
            GHAG+TRT+AR+ AQFYW  M++D+ EYV  C +CQ+AK+A T  AGLL+PLPIP+QVW+
Sbjct: 1062 GHAGITRTVARVKAQFYWSDMKKDIVEYVQNCEICQKAKTANTLPAGLLQPLPIPSQVWE 1121

Query: 3718 DVTMDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPK 3897
            DV MDFITGLP SHGYTVI+VV+DRLTK+AHFIP+KTDY+SK VAEAFMH +V+LHGMPK
Sbjct: 1122 DVAMDFITGLPLSHGYTVILVVIDRLTKYAHFIPLKTDYTSKIVAEAFMHHIVKLHGMPK 1181

Query: 3898 SIISDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPK 4077
            SI+SDRD+VFTS FWQ LF+LQGT+LAMSSAYHPQ+DGQSE +N+ LE++LRCFTFNNPK
Sbjct: 1182 SIVSDRDKVFTSNFWQQLFKLQGTSLAMSSAYHPQSDGQSEVLNRTLELFLRCFTFNNPK 1241

Query: 4078 SWYKALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAV 4257
            +WYKAL W+E WYNT+  TS GM+PFKALYGR+PP L RY A + DP  LQ++L+ RD +
Sbjct: 1242 AWYKALSWSEFWYNTAFQTSIGMTPFKALYGRDPPTLVRYEAQAGDPPALQEELMGRDKL 1301

Query: 4258 ITELKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFG 4437
            + +LK NL RAQQ MK  ADK RRD+  +VGD VLVKLQPYRQ S+ALRKNQKLGMRYFG
Sbjct: 1302 LQQLKSNLERAQQYMKRQADKHRRDIKLQVGDLVLVKLQPYRQQSLALRKNQKLGMRYFG 1361

Query: 4438 PFTILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPE 4617
            PF ILAK+G+VAYKL+LPD AKIH VFH+SQLK FKG   +  +PLPLT  + GP+++P 
Sbjct: 1362 PFEILAKVGEVAYKLKLPDHAKIHPVFHISQLKPFKGISQDQSLPLPLTMSDTGPLIQPI 1421

Query: 4618 GILKTRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDVM 4797
             +L  RTIL+  Q+V QVL++W     AEATWE+   +  ++P  NLEDK+  +G+G VM
Sbjct: 1422 AVLAARTILKGIQKVHQVLIQWDQYPEAEATWEEVTNLQSKFPYFNLEDKVVFKGDGIVM 1481

Query: 4798 NNK 4806
            + K
Sbjct: 1482 SPK 1484


>dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum]
          Length = 1531

 Score = 1768 bits (4579), Expect = 0.0
 Identities = 870/1508 (57%), Positives = 1089/1508 (72%), Gaps = 8/1508 (0%)
 Frame = +1

Query: 316  FEELSSMQKASEARFNQLADAFEKLMQQTPTVTHGGTNSGXXXXXXXXXXXXXXXXXXXX 495
            FE L   + A+  +F ++  A + L+ QTP+    G                        
Sbjct: 28   FERLERSEAANTGKFEKIFAALDILIDQTPSKHQQGAELHHHR----------------- 70

Query: 496  XXXXXXXXXIPTMSSTPFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIA 675
                            PFQVR++KL+FPRF+G NV +WIF+AEQFF YY+TPD +RL IA
Sbjct: 71   ---------------APFQVRNVKLEFPRFDGTNVHEWIFRAEQFFEYYDTPDLDRLTIA 115

Query: 676  SVHLDQEVVPWFQMVNRSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYY 855
            +VHLD++VVPW+QM+ RS PFQ+W+DFTRALELDFGPS+YDCPRA+LFKL Q+ +V EY+
Sbjct: 116  AVHLDKDVVPWYQMMQRSHPFQSWIDFTRALELDFGPSIYDCPRATLFKLVQTGTVAEYF 175

Query: 856  LEFTALSNRVYGISNDALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXN 1035
            ++FT+L+NRVYG+SNDAL+DCF+SGL  ++RRDV++HTP+S+                  
Sbjct: 176  VQFTSLANRVYGLSNDALVDCFISGLNPDIRRDVLIHTPSSLVKAVSL------------ 223

Query: 1036 TNVQNQKYNAGSTAVQARAPFNPTRNSQTQATEKAXXXXXXXXXXXXX-----MSQLQKN 1200
              V  +KY   +T ++ + P+  T ++      K                   MSQ QKN
Sbjct: 224  AKVYEEKY---TTTMKPQKPYTQTYSTNKPYNNKPENSTRNTAPILNTPPTRPMSQFQKN 280

Query: 1201 PAIKRIS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDD 1377
            P +KRIS AEMQLRR+KGLCY+CDDKFSF+H+CPN+ LMLL+                D 
Sbjct: 281  PNVKRISPAEMQLRRDKGLCYWCDDKFSFTHKCPNRQLMLLQYEDSEDQVLDEITDPPDP 340

Query: 1378 DRE--IQKPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCL 1551
                      + HLS++A+ G++ MGV+RF G I  + V IL+DGGSSD+F+QPRIA  L
Sbjct: 341  TTNGLTTNLPKLHLSMSAMKGSSHMGVLRFTGAIEHIQVQILIDGGSSDNFVQPRIAKFL 400

Query: 1552 NLPIEPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLA 1731
             LPIEP     VLVGNG+ M AEG+++ L + VQG  + VP YLLPVAG+DVILGA WL+
Sbjct: 401  KLPIEPAPIFKVLVGNGEVMTAEGIVKQLPLDVQGHRLQVPVYLLPVAGADVILGASWLS 460

Query: 1732 SLGPHVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIE 1911
            +LGPHVADYA+  ++F+L  KFVTL G+   +   AQ H  KR  H DAI E F +Q  +
Sbjct: 461  TLGPHVADYASLTIKFFLHDKFVTLVGEPIARPEPAQFHHMKRFHHTDAIEECFAIQWFK 520

Query: 1912 AVAPEDNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPY 2091
                 D +K++P   +PE+A LLY Y+ VF+ P  LPP R  +H I L EG  P+KVKPY
Sbjct: 521  DTEAADIFKELPTNTDPEIAMLLYNYQAVFKTPTMLPPTRAHNHAIPLLEGTNPIKVKPY 580

Query: 2092 RYPHSQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKD 2271
            RYPHSQK QIE MVQ+ML+ G              LVKKKDGTWR CTDYRALNAIT+KD
Sbjct: 581  RYPHSQKTQIETMVQDMLQHGIIQPSTSPFSSPIILVKKKDGTWRFCTDYRALNAITVKD 640

Query: 2272 SFPMPTVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGL 2451
            SFP+PTVDELLDEL GAK FSK DLRSGYHQIL++PEDR KTAFRTHQGHYEWLVMPFGL
Sbjct: 641  SFPIPTVDELLDELFGAKHFSKLDLRSGYHQILLQPEDRHKTAFRTHQGHYEWLVMPFGL 700

Query: 2452 TNAPATFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKL 2631
            TNAPATFQ LMN IFQ  LRK VLVFFDDILVYS +WH HL+ +  VLQ+L  + LY KL
Sbjct: 701  TNAPATFQSLMNTIFQIVLRKYVLVFFDDILVYSKTWHEHLKHLAAVLQVLKDNELYVKL 760

Query: 2632 SKCSFGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKA 2811
            SKCSFG+ E+EYLGH VSG GV MD++K+  V+DWP P N+KQLRGFLGLTGYYRRFI++
Sbjct: 761  SKCSFGVLEIEYLGHVVSGEGVYMDKSKIQVVVDWPSPKNIKQLRGFLGLTGYYRRFIQS 820

Query: 2812 YASIAGPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGV 2991
            YA IA PLT+LLKKDA+ WN   + AF++LK AIT+APVL LPDF++PF+LETDASG G+
Sbjct: 821  YAKIASPLTDLLKKDAYTWNSEMEAAFQKLKHAITTAPVLALPDFTKPFILETDASGIGI 880

Query: 2992 GAVLSQGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXX 3171
            GAVL Q GHPIA+FSKK+ PR Q +SAY RE  AI  A+AKFRHYLLGHKFI+RT     
Sbjct: 881  GAVLHQEGHPIAYFSKKLVPRNQRKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTDQKSL 940

Query: 3172 XXXXXXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEEL 3351
                     TP+QQ WLH+F+GYDF IEYKPGK+N AADALSR+MT++WS P+   L ++
Sbjct: 941  RNLMEQALQTPDQQEWLHRFLGYDFTIEYKPGKENVAADALSRVMTLAWSEPQYKLLHQI 1000

Query: 3352 KKEVATNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSP 3531
            +  +  +  L  I+++C QN+ ++ NY +K  LL+WK ++VIP  S+L QQ+L E H SP
Sbjct: 1001 RVALKQDSTLLEIMEKCAQNSDSNSNYTIKDDLLFWKHRIVIPKHSELRQQVLYELHTSP 1060

Query: 3532 IGGHAGVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQV 3711
            IGGHAG+ RTLAR+ AQFYW  M+ D+ +YV  C++CQ+AK+  T  AGLL+PLPIP+QV
Sbjct: 1061 IGGHAGIARTLARVKAQFYWLDMKTDIAKYVQNCVICQKAKTTNTPPAGLLQPLPIPSQV 1120

Query: 3712 WDDVTMDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGM 3891
            W+DV MDFITGLP SHGYT I+VV+DRLTK+AHFIP+KTDYSSK VAEAFM ++V+LHGM
Sbjct: 1121 WEDVAMDFITGLPSSHGYTTILVVIDRLTKYAHFIPLKTDYSSKIVAEAFMDNIVKLHGM 1180

Query: 3892 PKSIISDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNN 4071
            PKSI+SDRD+VFTS FWQ LF+LQGT+LAMSSAYHPQ+DGQSE +NK LE++LRCFTF N
Sbjct: 1181 PKSIVSDRDKVFTSSFWQQLFKLQGTSLAMSSAYHPQSDGQSEVLNKTLELFLRCFTFEN 1240

Query: 4072 PKSWYKALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRD 4251
            PKSW KAL W+E WYNT+  TS GM+PFKALYGR+PP + RY   +SD   LQ++L++RD
Sbjct: 1241 PKSWCKALAWSEFWYNTAFQTSIGMTPFKALYGRDPPAIIRYEIQASDSPTLQEKLMERD 1300

Query: 4252 AVITELKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRY 4431
             +I +LK NL +AQQ MK  ADK R DV  +VGD VLVKLQPYRQ SVALRKNQKLGM+Y
Sbjct: 1301 RIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDWVLVKLQPYRQQSVALRKNQKLGMKY 1360

Query: 4432 FGPFTILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILE 4611
            FGPF ++AK+G+VAYKL+LPD AKIH VFHVSQLK FKG   E Y+PLPL+  + GP+++
Sbjct: 1361 FGPFEVIAKVGEVAYKLKLPDHAKIHPVFHVSQLKPFKGDNQEQYMPLPLSMTDIGPMIQ 1420

Query: 4612 PEGILKTRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGD 4791
            P  +L TRTI+R  Q++ QVL++W    +AEATWED   +  ++P  NLEDK+A  G+G 
Sbjct: 1421 PVAVLATRTIIRCAQRIQQVLIQWDQYPIAEATWEDMVALQRKFPTFNLEDKVAFIGDGI 1480

Query: 4792 VMNNKSVN 4815
            VM+    N
Sbjct: 1481 VMSPNEEN 1488


>gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1525

 Score = 1766 bits (4573), Expect = 0.0
 Identities = 874/1535 (56%), Positives = 1100/1535 (71%), Gaps = 11/1535 (0%)
 Frame = +1

Query: 223  MADNTRLKELTTELRRQAET-----IEKNEAASRARFEELSSMQKASEARFNQLADAFEK 387
            M D+  LKE+  E+R+  ++     ++  +   R      + ++ +S+ R  ++  A E 
Sbjct: 1    MTDSANLKEIQDEIRQDVQSQLKKFMDNLDIRDRDIANRFTQLEDSSDDRIARIETAVES 60

Query: 388  LMQQTPTVTHGGTNSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIPTMSSTPFQVRHIK 567
            L+Q          NS                                     PFQ R++K
Sbjct: 61   LLQNAADSQRDAANS---------------------------------RHRPPFQTRNVK 87

Query: 568  LDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVNRSRPFQTW 747
            L+FPRF+G + ++WIFKAEQFF YY TPD +RL IA+VHLDQ+VVPW+QM+ R+ PFQ+W
Sbjct: 88   LEFPRFDGNHAIEWIFKAEQFFDYYATPDEDRLTIAAVHLDQKVVPWYQMMQRTNPFQSW 147

Query: 748  MDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISNDALIDCFVS 927
              FTRA+E+DFGPS+Y+CPRA+LFKL Q  +VNEYY+EFT+L+NRVYG+SN+A++DCFVS
Sbjct: 148  QIFTRAIEVDFGPSVYECPRATLFKLVQKNTVNEYYMEFTSLANRVYGLSNEAMLDCFVS 207

Query: 928  GLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAGSTAVQARAPFNPT 1107
            GL   L+R+V+ HTPTSI                 + +   QK +  +         NP 
Sbjct: 208  GLNPALQREVIAHTPTSIQRAVALAKLFEEKCQPPSKSNYYQKASTTNNP-------NPN 260

Query: 1108 RNSQ-----TQATEKAXXXXXXXXXXXXXMSQLQKNPAIKRIS-AEMQLRREKGLCYFCD 1269
            R +Q     T +T+ +                 QKN  +K IS AEMQLRREKGLCY C+
Sbjct: 261  RPTQPPLLPTPSTKPSLFT--------------QKNNIVKNISSAEMQLRREKGLCYTCE 306

Query: 1270 DKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDDDREIQKPVEHHLSLNALNGATGMG 1449
            DK+SF+H+CPNKH+MLL +            +N        + +E HLSLNAL GA+G+G
Sbjct: 307  DKWSFNHKCPNKHVMLLIVEDDSQIEPEIDQSNHTQIDSNHQDLELHLSLNALKGASGLG 366

Query: 1450 VIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGNGQKMQAEGVI 1629
             I+F G +    + IL+DGGSSD+F+QPRIA  L L IEP     VLVGNG  + +EG++
Sbjct: 367  TIKFTGQVSNTPLQILVDGGSSDNFLQPRIAQFLKLDIEPAPLFKVLVGNGNALTSEGIV 426

Query: 1630 QNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKFVTLQ 1809
            ++L V VQG ++ +P YLLP+ G+D+ILGA WLA+LGPHVADY    L+F+ +G F+TLQ
Sbjct: 427  KDLKVSVQGQELKLPVYLLPIVGADLILGATWLATLGPHVADYQALTLKFFQQGHFITLQ 486

Query: 1810 GDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDMPVTVEPEMATLLYTY 1989
            GD       AQLH  +RL   +AISE FT+Q+ +   P+DNW  +   ++PE+A LL  Y
Sbjct: 487  GDKSTTPQQAQLHHMRRLHTTEAISEYFTIQRSDPDGPQDNWLTLAKDMDPELALLLNQY 546

Query: 1990 REVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGXXXXX 2169
            + VF  P GLPP R   H I L+ GA PVKV+PYRYP SQKEQIE MV+EMLEEG     
Sbjct: 547  KAVFGKPIGLPPQRLQDHVIPLEPGAKPVKVRPYRYPQSQKEQIEIMVKEMLEEGIIQPS 606

Query: 2170 XXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSKFDLR 2349
                     LVKKKDGTWR CTDYRALNAIT+KDSFPMPTVDELLDEL GA++FSK DLR
Sbjct: 607  TSPFSSPIVLVKKKDGTWRFCTDYRALNAITVKDSFPMPTVDELLDELFGAQYFSKLDLR 666

Query: 2350 SGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKCVLVF 2529
            SGYHQILV P+DR KTAFRTHQGHYEWLVMPFGLTNAPA+FQ LMN+IFQPFLRK VLVF
Sbjct: 667  SGYHQILVDPQDRHKTAFRTHQGHYEWLVMPFGLTNAPASFQCLMNQIFQPFLRKFVLVF 726

Query: 2530 FDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGVSMDR 2709
            FDDILVYSPS+  HL+ +E VLQIL  H L+ KLSKC+FG+ +V+YLGHTV GSGV+MDR
Sbjct: 727  FDDILVYSPSFAAHLKHLEIVLQILQDHTLFVKLSKCAFGIQQVDYLGHTVLGSGVAMDR 786

Query: 2710 NKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNENTQLA 2889
            +K+  VLDWP P N+KQLRGFLGLTGYYR+FIK+YA IA PLT LLKKDAF+WN   ++A
Sbjct: 787  SKIQTVLDWPTPENIKQLRGFLGLTGYYRKFIKSYAQIATPLTELLKKDAFKWNHAAEMA 846

Query: 2890 FEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRMQLQS 3069
            F  LKKAITSAPVL LP F+ PF LETDASG+G+GAVLSQ GHPIAFFS+K+ PRMQ QS
Sbjct: 847  FSTLKKAITSAPVLALPQFTIPFTLETDASGSGIGAVLSQNGHPIAFFSRKLGPRMQKQS 906

Query: 3070 AYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIGYDFQ 3249
            AYTREF+AIT A+AKFRHYLLGHKFI++T              TPEQQAWLHKF+GYDF 
Sbjct: 907  AYTREFFAITEAIAKFRHYLLGHKFIIKTDQKSLKSLMDQSLQTPEQQAWLHKFLGYDFT 966

Query: 3250 IEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTVADRN 3429
            +EYKPGK+N AADALSR+M ++WS P ++FL+E K+     E ++ I  +  +N   + +
Sbjct: 967  VEYKPGKENLAADALSRVMLLAWSQPTSLFLQEFKQACNQEEEIQQIKLKWNENRGYNPH 1026

Query: 3430 YAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPKMRQD 3609
              +K  L++WK +L++P  S +  QIL+E+H+S IGGHAG+ RT+ARI AQFYW  M+ D
Sbjct: 1027 VTIKDDLVFWKGRLMVPQVSHIRDQILQEFHNSAIGGHAGIARTMARITAQFYWKNMKND 1086

Query: 3610 VKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIMVVVD 3789
            +  +V  C+VCQQAK  T   AGLL+PLPIP+QVW+DV MDFITGLP S+G+TVIMVV+D
Sbjct: 1087 ITHFVQQCVVCQQAKHETRNPAGLLQPLPIPDQVWEDVAMDFITGLPNSYGFTVIMVVID 1146

Query: 3790 RLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFELQGT 3969
            RL+K++HF P+KTDYSSK VAE FM ++V+LHG+PKSI+SDRD+VF SKFW+ LF+LQGT
Sbjct: 1147 RLSKYSHFSPLKTDYSSKIVAEVFMRNIVKLHGVPKSIVSDRDKVFMSKFWKELFQLQGT 1206

Query: 3970 TLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTSAGMS 4149
            TLAM+SAYHPQ+DGQSEA+NKCLEMYLRC TF NPKSW+KAL  AELWYNT+ HTS GM+
Sbjct: 1207 TLAMTSAYHPQSDGQSEALNKCLEMYLRCLTFQNPKSWFKALDLAELWYNTAFHTSLGMT 1266

Query: 4150 PFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITELKRNLARAQQIMKNNADKKRR 4329
            PFK LYGR+PP + R  +++S     QQQL DRDA++T+ K NL RAQQ MK  ADK+R 
Sbjct: 1267 PFKLLYGRDPPTIIRQESSTSGATMAQQQLQDRDAILTQAKLNLERAQQYMKTQADKRRH 1326

Query: 4330 DVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFTILAKIGKVAYKLQLPDTAKIH 4509
            ++  EVGD VLVKLQPYRQ SVALRKNQKLGMRYFGPFT++AKIG VAYKLQLP  AKIH
Sbjct: 1327 ELQLEVGDNVLVKLQPYRQQSVALRKNQKLGMRYFGPFTVMAKIGTVAYKLQLPPEAKIH 1386

Query: 4510 SVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGILKTRTILRNEQQVPQVLVKWQH 4689
             VFH+SQLKLFKG  ++PY PLP  T E GP+L+P  IL+ R +++ ++ + Q+LV+W++
Sbjct: 1387 PVFHISQLKLFKGNCSQPYFPLPALTNELGPVLQPGDILQARQLVQGKKTIQQILVRWEN 1446

Query: 4690 LQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDV 4794
            L   EATWE+  +   Q+PN NLEDK+ + G  DV
Sbjct: 1447 LTAEEATWEELEDFRQQFPNYNLEDKVEVIGGSDV 1481


>dbj|GAU29525.1| hypothetical protein TSUD_115470 [Trifolium subterraneum]
          Length = 1556

 Score = 1716 bits (4443), Expect = 0.0
 Identities = 839/1442 (58%), Positives = 1044/1442 (72%), Gaps = 18/1442 (1%)
 Frame = +1

Query: 544  PFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVN 723
            PF  R+++L+FPRFNG +VL WIFKA+QFF YY+TPD ERL IA+VHLD  VVPWFQM+ 
Sbjct: 75   PFHTRNVRLEFPRFNGTDVLAWIFKADQFFDYYHTPDPERLTIAAVHLDHTVVPWFQMIQ 134

Query: 724  RSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISND 903
            R  PF +W +FT ALE+++GPS+++ PR +LFKL Q+ +V+EYYL+FT L+NRV G+  D
Sbjct: 135  RETPFSSWQEFTHALEMEYGPSVFERPRTTLFKLAQTGTVHEYYLQFTNLANRVTGLDPD 194

Query: 904  ALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAGSTAVQ 1083
            AL+DCF+SGL+ +L+R+V+   PT++                    +  +K+       Q
Sbjct: 195  ALLDCFISGLQRDLQREVISQHPTTLGQAVSV------------ARLFEEKFQPTKAVTQ 242

Query: 1084 ARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNP---------AIKRIS-AEMQ 1233
                  PT NSQ QA                  + L   P         ++KRIS AEMQ
Sbjct: 243  PLLSKGPTYNSQYQARSTNSQGYTSNLTQRPNSTPLLPTPINRPPMANRSVKRISPAEMQ 302

Query: 1234 LRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXX------ANSDDDREIQK 1395
            LRREKGLCY+CD+ FSF HRCPNKHL LL+L                    S      Q 
Sbjct: 303  LRREKGLCYYCDETFSFRHRCPNKHLYLLQLEDPIPTSPDSEPNIAASEPQSPTSDTPQP 362

Query: 1396 PVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVT 1575
             ++ HLSLNALNG+  +G +RF G I    + IL+DGGSSD+F QPR+AH L LP+EP  
Sbjct: 363  ELDLHLSLNALNGSRKLGTLRFTGSIAGTTIQILVDGGSSDNFFQPRLAHFLQLPVEPAP 422

Query: 1576 RCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVAD 1755
              NVLVGNGQ M AEG I  L+V +QG  I +P +LLP +G+D+ILG+ WLA+LGPH+A+
Sbjct: 423  GFNVLVGNGQSMMAEGKISKLSVTIQGQTIVLPVFLLPFSGADLILGSSWLATLGPHIAN 482

Query: 1756 YATSKLRFYLEGKFVTLQG--DTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPED 1929
            YA   LRF ++GKFVTL G  D+ P  A AQLH  +R+Q   AISE++ +Q     +   
Sbjct: 483  YADLSLRFVVDGKFVTLNGMPDSLPTPAPAQLHHIRRMQRTHAISEVYMMQLFHVSSDAP 542

Query: 1930 NWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQ 2109
            + + +  T+ P++  LL+ Y ++FQ P  LPPPR   H I L + +  +KV+PYRYPHSQ
Sbjct: 543  SVEQITSTLSPDLQELLHRYSDIFQTPTSLPPPRAQDHSIPLMDESTVIKVRPYRYPHSQ 602

Query: 2110 KEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPT 2289
            KEQIEKMV+EML++G              LVKKKDGTWRCCTDYRALN +T+KDSFP+P+
Sbjct: 603  KEQIEKMVKEMLQQGIIQPSTSPFSSPIILVKKKDGTWRCCTDYRALNTVTVKDSFPLPS 662

Query: 2290 VDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPAT 2469
            VDELLDEL+GA++FSK DLRSGYHQIL+ P DR KTAFRTHQGHYEWLVMPFGL+NAPA+
Sbjct: 663  VDELLDELYGARYFSKLDLRSGYHQILMNPNDRHKTAFRTHQGHYEWLVMPFGLSNAPAS 722

Query: 2470 FQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFG 2649
            FQ LMNE+FQ  LRK VLVFFDDILVYSP W  HL  +E VL  L Q VLY KLSKCSFG
Sbjct: 723  FQCLMNEVFQFALRKFVLVFFDDILVYSPDWSAHLDHLERVLLTLKQQVLYVKLSKCSFG 782

Query: 2650 LTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAG 2829
            + +V+YLGHTVSG GV+MD+ KV  + DWP+P N+KQLRGFLGLTGYYRRFIKAY+ IA 
Sbjct: 783  MLQVDYLGHTVSGMGVAMDKEKVRVIADWPLPINVKQLRGFLGLTGYYRRFIKAYSQIAA 842

Query: 2830 PLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQ 3009
            PLTNLLKK+ F W++    AF  LK  I++APVL LPDFS+PF+LETDAS TG+GA+LSQ
Sbjct: 843  PLTNLLKKEGFHWDDTATTAFYNLKLVISTAPVLALPDFSKPFILETDASNTGIGAILSQ 902

Query: 3010 GGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXX 3189
             GHPIAFFSKK+ P  Q QSAY REF AIT ALAKFRHYLLGHKFI+RT           
Sbjct: 903  EGHPIAFFSKKLGPTAQKQSAYVREFRAITEALAKFRHYLLGHKFIIRTDQKSLKALLEQ 962

Query: 3190 XXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVAT 3369
               TPEQQAWLHKFIG+DF IEYKPGK+N A DALSR+  ++WS P   FL+EL+ E+  
Sbjct: 963  TLQTPEQQAWLHKFIGFDFTIEYKPGKENLAVDALSRINLLAWSEPHYSFLDELRLEIQQ 1022

Query: 3370 NEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAG 3549
            + HLK I+QQC  N+ AD NY VK  LL+W+Q+LVIP  S LIQ+IL E+H SPIGGH+G
Sbjct: 1023 DTHLKEIIQQCLNNSCADANYTVKDNLLFWQQRLVIPENSTLIQKILLEFHSSPIGGHSG 1082

Query: 3550 VTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTM 3729
            +TRT+ARI +QFYW  MRQ +  ++  C++CQQAK+ TT  AGLL PLPIP  VW D+ M
Sbjct: 1083 ITRTMARIASQFYWTHMRQHIVAFIKHCVICQQAKTTTTTPAGLLAPLPIPTLVWADIAM 1142

Query: 3730 DFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIIS 3909
            DFITGLPPS+G+TVI+VV+DRLTK+AHF P+K+D+ SK VAE FM ++V+LHGMP SI+S
Sbjct: 1143 DFITGLPPSNGFTVILVVIDRLTKYAHFFPLKSDFDSKKVAEIFMQNIVKLHGMPSSIVS 1202

Query: 3910 DRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYK 4089
            DRD+VFTS FW++LF+L GTTLAM+SAYHPQ+DGQSE +NKC+EMYLRC TF+NP  W K
Sbjct: 1203 DRDKVFTSAFWRHLFKLHGTTLAMTSAYHPQSDGQSEVLNKCVEMYLRCMTFDNPTKWSK 1262

Query: 4090 ALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITEL 4269
            AL W E WYNTS+HTSA M+PFKALYG +PP L R    + D  +LQ QL +R+ ++++L
Sbjct: 1263 ALPWTEFWYNTSYHTSAAMTPFKALYGSDPPQLVRTKGTTDDHPDLQTQLAEREDLLSQL 1322

Query: 4270 KRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFTI 4449
            + NL +AQQ MK  ADKKR  V F+V DQVLVKLQPYRQ+SVALRK+QKLG+RYFGPF I
Sbjct: 1323 QVNLHKAQQAMKFQADKKRHHVEFKVDDQVLVKLQPYRQSSVALRKHQKLGLRYFGPFPI 1382

Query: 4450 LAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGILK 4629
            +AK+G VAY+L LP T KIH VFHVSQLKLF G    PY+PLPLTT E GPIL+PE +L 
Sbjct: 1383 VAKVGVVAYRLGLPSTTKIHPVFHVSQLKLFHGPHIPPYMPLPLTTSELGPILQPEALLD 1442

Query: 4630 TRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDVMNNKS 4809
            +R I+R    + QVL+ W+ L+ A+ATWED  E  + +PN NLEDK+ L G G ++ +  
Sbjct: 1443 SRLIMRGNTPISQVLISWEGLETADATWEDLVEFKLAHPNFNLEDKVTLNG-GSIVRDPI 1501

Query: 4810 VN 4815
            +N
Sbjct: 1502 IN 1503


>dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subterraneum]
          Length = 1451

 Score = 1706 bits (4419), Expect = 0.0
 Identities = 850/1501 (56%), Positives = 1075/1501 (71%), Gaps = 4/1501 (0%)
 Frame = +1

Query: 316  FEELSSMQKASEARFNQLADAFEKLMQQTPTVTHGGTNSGXXXXXXXXXXXXXXXXXXXX 495
            FE L   +  + A+FN++  A + L+ QTP+  H G                        
Sbjct: 15   FERLERSKAENGAKFNKIFAALDILIDQTPSKQHHGAGL--------------------- 53

Query: 496  XXXXXXXXXIPTMSSTPFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIA 675
                       T +  PFQVR++KL+FPRF+GKNV +WIF+AEQFF YY+TPD +RL IA
Sbjct: 54   -----------THNRPPFQVRNVKLEFPRFDGKNVHEWIFRAEQFFEYYDTPDLDRLTIA 102

Query: 676  SVHLDQEVVPWFQMVNRSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYY 855
            SVHLD++VVPW+QM+ R+ PF +W++ TRALEL FGPS+YDCPRA+LFKL Q+ SV +YY
Sbjct: 103  SVHLDKDVVPWYQMMQRTHPFMSWIELTRALELGFGPSIYDCPRATLFKLNQTGSVADYY 162

Query: 856  LEFTALSNRVYGISNDALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXN 1035
            L+FT+L+NRVYG+SNDAL+DCFVSGL +E+RRDV++HTP S+                  
Sbjct: 163  LQFTSLANRVYGLSNDALVDCFVSGLNNEIRRDVLIHTPPSLVKAVSLAKVYEEKYA--- 219

Query: 1036 TNVQNQKYNAGSTAVQARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKR 1215
            +N++ QK+N  +T      PF  T   +T     A             MSQ QKNP IKR
Sbjct: 220  SNLKTQKFN--NTNYATNKPF--TNKPETITRNSAPILNTPPTRP---MSQFQKNPNIKR 272

Query: 1216 IS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDD---DR 1383
            IS AEMQ+RR+KGLCY+CDDKFS++H+CP++ LMLL+              +S D   + 
Sbjct: 273  ISPAEMQVRRDKGLCYWCDDKFSYTHKCPSRQLMLLQYDDNEEEHDDEKANDSPDIATES 332

Query: 1384 EIQKPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPI 1563
                 +EHHLS NA+ G + MG++RF G I  + V +L+DGGSSD+F+QPRIA  L LPI
Sbjct: 333  SDNTQLEHHLSFNAMKGNSSMGILRFTGTIEHIQVQVLIDGGSSDNFLQPRIAKFLKLPI 392

Query: 1564 EPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGP 1743
            EP     VLVGNGQ M AE                       VAG+DV+LGA WLA+LGP
Sbjct: 393  EPGPVFKVLVGNGQIMTAE-----------------------VAGADVVLGASWLATLGP 429

Query: 1744 HVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAP 1923
            HVADYA+  L+F++ GKFVTL+G+T  K   AQ H FKRL   DAI+E FT+Q +++   
Sbjct: 430  HVADYASLTLKFFINGKFVTLKGETTSKPTAAQFHHFKRLHFTDAIAECFTIQWLKSHTD 489

Query: 1924 EDNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPH 2103
            ED ++++P  +EPE+A LL+TY+E+F+ P  LPP R  +H I L EG  PVKVKPYRYPH
Sbjct: 490  EDIFRELPTNIEPEIAMLLHTYKELFKTPNTLPPHRLHNHNIPLIEGYNPVKVKPYRYPH 549

Query: 2104 SQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPM 2283
            SQK QIE MVQ+ML++G              LVKKKDGTWR CTDYRALNAIT+KDSFP+
Sbjct: 550  SQKNQIELMVQDMLQQGIIQPNTSPFSSPIVLVKKKDGTWRFCTDYRALNAITVKDSFPI 609

Query: 2284 PTVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAP 2463
            PTVDELLDEL GAKFFSK DLRSGY+QIL++PEDR KTAF+THQGHYEWLVMPFGLTNAP
Sbjct: 610  PTVDELLDELFGAKFFSKLDLRSGYYQILLQPEDRYKTAFKTHQGHYEWLVMPFGLTNAP 669

Query: 2464 ATFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCS 2643
            ATFQ LMN IFQ  LRK VLVFFDDILVYS +W  HL  ++ VL+ L  + L+ KLSKCS
Sbjct: 670  ATFQSLMNHIFQHALRKYVLVFFDDILVYSSTWQDHLNHLDTVLKTLQDNDLFVKLSKCS 729

Query: 2644 FGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASI 2823
            FG+ E+EYLGH V+G GVSMD++KV A+L+WP P+N+KQLRGFLGLTGYYRRFIK YA +
Sbjct: 730  FGVVEIEYLGHLVTGQGVSMDKDKVRAILEWPTPNNIKQLRGFLGLTGYYRRFIKHYARL 789

Query: 2824 AGPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVL 3003
            A PLT+LLKK+ + WN  T++ F++LK A+TSAPVL LP+F +PF+ ETDASG       
Sbjct: 790  AAPLTDLLKKENYYWNAQTEMGFQELKVAMTSAPVLALPNFQKPFIPETDASGY------ 843

Query: 3004 SQGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXX 3183
             QG               + +SAY RE  AI+ A+AKFRHY LGH+FI+RT         
Sbjct: 844  -QG--------------TRKKSAYFREMLAISEAIAKFRHYFLGHRFIIRTDQKSLRSLM 888

Query: 3184 XXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEV 3363
                 TP+QQ WLHKF+GYDF IEYKPGK+N AADALSR+MT++WS P +  + ++K E+
Sbjct: 889  DQSLQTPDQQQWLHKFLGYDFVIEYKPGKENLAADALSRVMTLAWSEPISQIIIQIKDEI 948

Query: 3364 ATNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGH 3543
              + +   I+ +C+    +   Y ++ G+LYWK ++VIP +S LIQQ+L E+H SPIGGH
Sbjct: 949  KADTYWSDIMDKCKSQGNSYLQYTLREGVLYWKNRVVIPKKSALIQQVLHEFHSSPIGGH 1008

Query: 3544 AGVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDV 3723
            AG TRTLARI +QFYW  M++DV EY+  C +CQQAK   T  AGLL+PLPIP+QVW+D+
Sbjct: 1009 AGFTRTLARIKSQFYWIAMKKDVLEYIQNCAICQQAKHTNTLPAGLLQPLPIPSQVWEDI 1068

Query: 3724 TMDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSI 3903
             MDFITGLP S+GYT I+VV+DRLTK+AHF+PMKTDYSSKSVAE FM+++V+LHGMPKSI
Sbjct: 1069 AMDFITGLPLSYGYTTILVVIDRLTKYAHFLPMKTDYSSKSVAEVFMNNIVKLHGMPKSI 1128

Query: 3904 ISDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSW 4083
            +SDRD+VFTS FWQ+LF+LQGT+LAMSSAYHPQ+DGQ+E +NK LE++LRCFTFNNPKSW
Sbjct: 1129 VSDRDKVFTSSFWQHLFKLQGTSLAMSSAYHPQSDGQTEVLNKGLELFLRCFTFNNPKSW 1188

Query: 4084 YKALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVIT 4263
            YKAL W+E WYNT+ HTS GM+PFKALYGREPP LTRY    +DP  LQ++L++RD ++ 
Sbjct: 1189 YKALAWSEFWYNTALHTSIGMTPFKALYGREPPTLTRYEVQDNDPPALQEELMERDRILQ 1248

Query: 4264 ELKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPF 4443
            +LK NL RAQQ MK  ADK R +  F +GD VLVKLQPYRQ SVALRKNQKLGMRYFGPF
Sbjct: 1249 QLKSNLERAQQYMKKQADKHRVEFKFHLGDMVLVKLQPYRQQSVALRKNQKLGMRYFGPF 1308

Query: 4444 TILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGI 4623
             I+A +GKVAYKL+LPD AKIH VFHVSQLK FKG   + Y+PLPLT  + GP+++P  +
Sbjct: 1309 EIIACVGKVAYKLKLPDHAKIHLVFHVSQLKPFKGVPQQQYMPLPLTMFDNGPMIQPVEV 1368

Query: 4624 LKTRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDVMNN 4803
            L+ RTI++  Q++ Q+LV+W    +AEATWE+ +++   +P  NLEDK+  +G+G VM  
Sbjct: 1369 LQARTIMQGTQKIHQILVQWDQYDIAEATWENVDDLQKNFPLYNLEDKVIFKGDGIVMRP 1428

Query: 4804 K 4806
            K
Sbjct: 1429 K 1429


>gb|KYP39589.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1510

 Score = 1659 bits (4296), Expect = 0.0
 Identities = 803/1418 (56%), Positives = 1031/1418 (72%), Gaps = 1/1418 (0%)
 Frame = +1

Query: 544  PFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVN 723
            P QVR++KLDFPRF+G +VL WIF+AEQFF YYNTPD +R++IA+VH ++ VVPW+QM+ 
Sbjct: 55   PLQVRNVKLDFPRFDGSDVLQWIFRAEQFFEYYNTPDEQRIVIAAVHFEKNVVPWYQMMQ 114

Query: 724  RSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISND 903
            R+ P  +W   TR+LEL+FGPS +D PR++LFKL Q+ SVN+YY+EFT L+NR+YG+S +
Sbjct: 115  RTSPIISWNTLTRSLELEFGPSPFDSPRSTLFKLVQTGSVNDYYIEFTNLANRIYGVSAE 174

Query: 904  ALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAGSTAVQ 1083
            AL+DCF+SGLK +++R+++   P S+                    +  +KY+  S   Q
Sbjct: 175  ALLDCFISGLKPDIKREIIAQAPNSLLKAISL------------ARLFEEKYSFRSR--Q 220

Query: 1084 ARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKRIS-AEMQLRREKGLCY 1260
            +    N + ++  Q+                  +   +N A++++S AEMQ RRE+GLC+
Sbjct: 221  SFVTRNTSHSAGNQSYTNPAQQPLLNTPNIKPAAFPNRNTAVRKMSPAEMQSRRERGLCF 280

Query: 1261 FCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDDDREIQKPVEHHLSLNALNGAT 1440
             CD++FS +HRCPNK  +LL++              + D   ++  +EHHLS NAL G  
Sbjct: 281  TCDERFSANHRCPNKQYLLLQVEDEEELEE----TTNVDSTALEDELEHHLSFNALKGVA 336

Query: 1441 GMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGNGQKMQAE 1620
             +G +RF G I   +V ILLD GSSD+F+QP++AH L LPIEP     V+VGNG  +  E
Sbjct: 337  TVGTMRFTGSIAGKEVHILLDSGSSDNFLQPKLAHYLKLPIEPAAGLQVMVGNGSSLSTE 396

Query: 1621 GVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKFV 1800
            G I NL V+VQG  + +P YLL V+G+D++LGA WLA+LGPH+ADY +  ++FY + K V
Sbjct: 397  GKILNLQVQVQGQVLQLPVYLLSVSGADLVLGAAWLATLGPHIADYGSLTIKFYKDKKLV 456

Query: 1801 TLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDMPVTVEPEMATLL 1980
            TLQG+     A++Q H  KRL H   I+E++TLQ + +    D WKD+P  V+PE+A LL
Sbjct: 457  TLQGEKSRPAAMSQFHHLKRLNHTQGIAEVYTLQLLSSFVETDQWKDIPDNVDPEIALLL 516

Query: 1981 YTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGXX 2160
            + YR++F  P GLPPPR  +H I L +G+ PVKV+PY+YPHSQK+QIE M++EMLE+G  
Sbjct: 517  HYYRQIFAKPTGLPPPRSQNHRIPLLQGSGPVKVRPYKYPHSQKQQIELMIKEMLEDGII 576

Query: 2161 XXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSKF 2340
                        LVKKKDG+WR CTDYRALNAIT+KDSFP+PTV+ELLDEL GAK+FSK 
Sbjct: 577  APSSSPFSSPIILVKKKDGSWRFCTDYRALNAITVKDSFPIPTVEELLDELFGAKYFSKL 636

Query: 2341 DLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKCV 2520
            DLR+GYHQILV+ EDR KTAFRTHQGHYEWLVMPFGLTNAPATFQ LMN+IFQ  LRK V
Sbjct: 637  DLRAGYHQILVQEEDRYKTAFRTHQGHYEWLVMPFGLTNAPATFQNLMNDIFQGLLRKSV 696

Query: 2521 LVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGVS 2700
            LVFFDDILVYS SW  HLQ ++ VL IL++H LYAK+SKCSFGL +VEYLGH VSG GVS
Sbjct: 697  LVFFDDILVYSSSWFLHLQHLQQVLDILAKHELYAKMSKCSFGLEQVEYLGHVVSGDGVS 756

Query: 2701 MDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNENT 2880
            M+ +KV AV+DWPVP  +KQLRGFLGLTGYYRRFI+ YASIA PLT+LLKKD F+W+   
Sbjct: 757  METSKVQAVIDWPVPKTIKQLRGFLGLTGYYRRFIQGYASIANPLTDLLKKDNFKWSNEA 816

Query: 2881 QLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRMQ 3060
              AF  LK+AIT+APVL LPDFSQPFVLETDASG+G+GAVLSQ  HPIAFFSKK++ RM 
Sbjct: 817  DAAFIALKQAITTAPVLSLPDFSQPFVLETDASGSGIGAVLSQNKHPIAFFSKKLSNRMT 876

Query: 3061 LQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIGY 3240
             QSAYTREFYAIT A+AKFRHYLLGH+FI+RT              TPEQQAWLHKF+GY
Sbjct: 877  KQSAYTREFYAITEAIAKFRHYLLGHRFIIRTDQKSLKSLLDQTLQTPEQQAWLHKFLGY 936

Query: 3241 DFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTVA 3420
            DF IEYKPG +N AADALSR   M+ +   +  + ++K  + ++  L+ I+    Q    
Sbjct: 937  DFSIEYKPGTENLAADALSRSFFMASAVTASDLVHQIKAALGSDTALQPILTAHSQGKAL 996

Query: 3421 DRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPKM 3600
               Y+   GLL+WK ++V+P    +  QIL+E+H SP+GGH+G+ RT AR+ AQF+WP M
Sbjct: 997  SAPYSFLDGLLFWKGRIVVPNVPAIQNQILQEFHSSPLGGHSGIARTFARVAAQFFWPGM 1056

Query: 3601 RQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIMV 3780
             +D+K +V  C VCQQAK+AT   AGLL+PLPIP Q+W+D++MDFI GLPP+ GYTVI V
Sbjct: 1057 NKDIKNFVQQCCVCQQAKTATVLPAGLLQPLPIPTQIWEDISMDFIVGLPPAEGYTVIFV 1116

Query: 3781 VVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFEL 3960
            +VDRL+K+AHF P+K+D++SK VA+ F+H+VV+LHG P SI+SDRD+VFTS FWQ+L +L
Sbjct: 1117 IVDRLSKYAHFAPLKSDFNSKRVADVFLHTVVKLHGFPNSIVSDRDKVFTSTFWQHLLKL 1176

Query: 3961 QGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTSA 4140
             GTTL +S+AYHPQ+DGQ+EA+NKCLEMYLRCFT   PK W K L WAE WYNTS H SA
Sbjct: 1177 SGTTLKLSTAYHPQSDGQTEALNKCLEMYLRCFTHEKPKDWIKFLPWAEFWYNTSFHHSA 1236

Query: 4141 GMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITELKRNLARAQQIMKNNADK 4320
             MSPFK +YGR+PP L +Y+ +++DP  +Q+ LL RD V+ +LK NL  AQQ MK  AD+
Sbjct: 1237 QMSPFKVVYGRDPPTLVKYSHSATDPPSIQEMLLQRDRVLAQLKVNLMLAQQRMKKYADQ 1296

Query: 4321 KRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFTILAKIGKVAYKLQLPDTA 4500
            KR    F  G+ VLVKLQPYRQ S+ALRKNQKLG+RYFGPF I  +IG VAYKL LPD A
Sbjct: 1297 KRLHKEFVEGEMVLVKLQPYRQHSLALRKNQKLGLRYFGPFPIQKRIGSVAYKLLLPDYA 1356

Query: 4501 KIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGILKTRTILRNEQQVPQVLVK 4680
            KIH VFH+SQLK F+G     Y+PLPLTT  EGP+++P  +L  R I++  + V QVLV+
Sbjct: 1357 KIHPVFHISQLKQFRGVTDTVYVPLPLTTAVEGPVVQPIQVLSVRDIIQAGKLVRQVLVQ 1416

Query: 4681 WQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDV 4794
            W+   V  ATWED +++   YPN+NLEDK+  +G   V
Sbjct: 1417 WEGFGVDAATWEDLDKLEQSYPNINLEDKVIAKGGSSV 1454


>gb|PNX92398.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1571

 Score = 1633 bits (4229), Expect = 0.0
 Identities = 811/1430 (56%), Positives = 1014/1430 (70%), Gaps = 5/1430 (0%)
 Frame = +1

Query: 544  PFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVN 723
            PFQVR++KLDFPRF+G  VL+WIFKAEQFF YY+TPD  RL IA+VHLD++VVPW+QM+ 
Sbjct: 104  PFQVRNVKLDFPRFDGTEVLNWIFKAEQFFDYYSTPDNHRLTIAAVHLDKDVVPWYQMIT 163

Query: 724  RSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISND 903
            R+ PF++W  FT ALE+++GPS ++ PR +LFKL Q+ +V+EYY  FT L+NR +G+S D
Sbjct: 164  RNNPFRSWTAFTSALEMEYGPSPFESPRPTLFKLTQTTTVSEYYSTFTTLANRTHGLSPD 223

Query: 904  ALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAG--STA 1077
            A++DCF+SGL+ E+RRDV++  PTSI+                        YN    + +
Sbjct: 224  AILDCFISGLQPEIRRDVIVQKPTSISQALSLAKLFAEKYTKSTPPTTKPYYNPSPQTNS 283

Query: 1078 VQARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKRISAEMQLRREKGLC 1257
               +  +NP+ NS T+ T                  + +  P     +AE+Q RR+ G C
Sbjct: 284  PNPKTHYNPSPNSLTRPTNAILPTPANRPQTQTPNPRAR--PIRNITAAEIQRRRDTGQC 341

Query: 1258 YFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDDDREIQKPVEHHLSLNALNGA 1437
            ++CDD+FS SHRCPNKH +LL++             + D   E +   EHHLS NAL G 
Sbjct: 342  FYCDDRFSSSHRCPNKHYLLLQVDDSDTPEQETDPPDIDPLVE-ELEAEHHLSFNALKGV 400

Query: 1438 TGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGNGQKMQA 1617
               G +RF+G I  V V +LLD GSSD+F+QPRIA CL LPI+   +  VLVGNG  + A
Sbjct: 401  YSAGTLRFQGQIQGVQVQVLLDSGSSDNFLQPRIAQCLKLPIQEAPQFQVLVGNGSTLTA 460

Query: 1618 EGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKF 1797
             G+I +L V +QG  + +P YLLP+ G+D++LGAPWL +LGPH+ADY    ++FYL  KF
Sbjct: 461  SGLIPDLPVTIQGHTLQLPVYLLPITGADLVLGAPWLKTLGPHIADYDALSIKFYLNNKF 520

Query: 1798 VTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDMPVTVEPEMATL 1977
            +TL+GD       A+ H  +RL +  +I   +TLQ      P         T+  E+A L
Sbjct: 521  ITLRGDRFMDPGQAEFHHIRRLNNTHSIDLSYTLQFNSIATPSS--AVALETLPAELAAL 578

Query: 1978 LYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGX 2157
            L+ Y +VF+ P  LPPPR   H I L +G+ PVKVKPYRYPHSQK +IEKMV +ML++G 
Sbjct: 579  LHQYWQVFEEPTSLPPPRAQDHYIPLIDGSNPVKVKPYRYPHSQKAEIEKMVADMLKQGI 638

Query: 2158 XXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSK 2337
                         LVKKKDG+WR CTDYRALNAIT+KDSFP+PTVDELLDEL GA +FSK
Sbjct: 639  IQPSTSPFSSPVLLVKKKDGSWRFCTDYRALNAITVKDSFPIPTVDELLDELFGASYFSK 698

Query: 2338 FDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKC 2517
             DLRSGYHQILV PEDR KTAFRTHQG YEWLVMPFGL+NAPA+FQ LMN +F+  LRK 
Sbjct: 699  LDLRSGYHQILVNPEDRHKTAFRTHQGLYEWLVMPFGLSNAPASFQSLMNSVFRDQLRKF 758

Query: 2518 VLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGV 2697
            VLVFFDDILVYSPSW THL+ +E VL++L ++ L+AK SKC FGLT V+YLGHT+S  GV
Sbjct: 759  VLVFFDDILVYSPSWSTHLEHLEVVLRVLQENQLFAKFSKCCFGLTSVDYLGHTISNQGV 818

Query: 2698 SMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNEN 2877
             MD++K+ AV++W +PS+LKQLRGFLGL+GYYRRFI+ YA+IA PLT LLKKDAFQW   
Sbjct: 819  EMDKSKIQAVMEWKIPSSLKQLRGFLGLSGYYRRFIRNYAAIASPLTELLKKDAFQWTST 878

Query: 2878 TQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRM 3057
               AF +LK+AIT APVL LPDF  PF LETDASG+GVGAVLSQ  HPIA+FSKK+ PRM
Sbjct: 879  ATEAFNKLKQAITQAPVLSLPDFRLPFTLETDASGSGVGAVLSQARHPIAYFSKKLCPRM 938

Query: 3058 QLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIG 3237
            Q QSAY RE YAIT ALAKFRHYL+GHKF+++T              TPEQQ WLHKF+G
Sbjct: 939  QKQSAYVRELYAITEALAKFRHYLIGHKFVIKTDQRSLKSLTDQSIQTPEQQHWLHKFLG 998

Query: 3238 YDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTV 3417
            YDF IEYKPG DN AAD+LSR    + S   +  +  +   V  +  L  I  QC   T 
Sbjct: 999  YDFTIEYKPGVDNIAADSLSRSFCFALSVQTSQLVAMIHTAVNDDPALAEIRNQCLLGTN 1058

Query: 3418 ADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPK 3597
               +Y VK  +L+WK +LVIP + +LIQ +L E+H S +GGHAG+TRT AR+ +QF+WP 
Sbjct: 1059 TAPHYQVKHDMLFWKNRLVIPQKPELIQLLLTEFHSSALGGHAGITRTKARVASQFFWPT 1118

Query: 3598 MRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIM 3777
            M +D+KE+V+ C++CQQAK +T   AGLL PLPIP Q+W+D++MDFI GLPPSH YTVI+
Sbjct: 1119 MAKDIKEFVSKCLICQQAKHSTAVPAGLLHPLPIPQQIWEDLSMDFIIGLPPSHTYTVIL 1178

Query: 3778 VVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFE 3957
            VVVDRL+K++HFIP+K DY+S  VAEAF++++V+LHG PK+I+SDRDRVFTS+FWQ+LF+
Sbjct: 1179 VVVDRLSKYSHFIPLKGDYTSSKVAEAFVNTIVKLHGFPKTIVSDRDRVFTSQFWQHLFK 1238

Query: 3958 LQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTS 4137
            + GTTLAMS+AYHPQTDGQSEAVNKCLE+YLRCFT ++PK W K L WAE WYNTS HTS
Sbjct: 1239 ISGTTLAMSTAYHPQTDGQSEAVNKCLELYLRCFTSDSPKQWVKLLPWAEFWYNTSFHTS 1298

Query: 4138 AGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITELKRNLARAQQIMKNNAD 4317
             GM+PFK +YGR+PP L RY  N+ DP  LQQ L +RD  I  LK NLARAQQIMK  AD
Sbjct: 1299 IGMTPFKVVYGRDPPSLIRYEVNNDDPPLLQQLLTERDDTIITLKANLARAQQIMKKFAD 1358

Query: 4318 KKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFTILAKIGKVAYKLQLPDT 4497
             KRR V F+VGD VLVKLQPYRQ SV LR+NQKL MRYFGPF ++ +IG+VAYKL LP +
Sbjct: 1359 NKRRFVEFQVGDMVLVKLQPYRQHSVVLRRNQKLSMRYFGPFPVIERIGQVAYKLMLPPS 1418

Query: 4498 AKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGILKTRTILRNEQQVPQVLV 4677
            AKIH VFHVS LKL KG  +   +PLPLTT E GP+L P  I+  RTI++N Q V QV V
Sbjct: 1419 AKIHPVFHVSALKLCKGEHSTQLMPLPLTTTEHGPMLSPIAIIDRRTIIQNAQPVQQVQV 1478

Query: 4678 KWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDVM---NNKSVNE 4818
            +W+ L   E TWE  +E    YP  NLEDK+ +    +VM   NN S  E
Sbjct: 1479 QWEGLLPTETTWEKWDEFHSSYP--NLEDKVLVNEGSNVMIKDNNMSWQE 1526


>gb|PNX98514.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1240

 Score = 1600 bits (4144), Expect = 0.0
 Identities = 771/1218 (63%), Positives = 953/1218 (78%), Gaps = 12/1218 (0%)
 Frame = +1

Query: 1180 MSQLQKNPAIKRIS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXX 1356
            MSQ QKNP IKRIS AE QLRR+KGLCY+CD+KFSF+H+CPN+ LMLL+           
Sbjct: 1    MSQFQKNPNIKRISPAERQLRRDKGLCYWCDEKFSFTHKCPNRQLMLLQYDDNDE----- 55

Query: 1357 XXANSDDDREIQKP-----------VEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLD 1503
               NS+ D  IQ P            EHHLS NA+ G + MG++RF G I  + V IL+D
Sbjct: 56   ---NSEIDSAIQSPDSTTDSPTTNIPEHHLSFNAMKGTSHMGILRFTGSIEQIKVQILID 112

Query: 1504 GGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYL 1683
            GGSSD+F+QPRIA CL LP+EP +   VLVGNG+ M AEG+I  L + +QG  + +P YL
Sbjct: 113  GGSSDNFLQPRIAKCLKLPVEPASTFRVLVGNGEIMTAEGMINQLPLDIQGHKLEIPVYL 172

Query: 1684 LPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRL 1863
            LPVAG+DVILGA WLA+LGPHVADYA+  L+F+LEGKFVTL G+++ + + AQ H  KRL
Sbjct: 173  LPVAGADVILGASWLATLGPHVADYASLTLKFFLEGKFVTLVGESEVRPSAAQFHHLKRL 232

Query: 1864 QHMDAISELFTLQQIEAVAPEDNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSH 2043
            Q+ DAI E FT+Q +++    D +KD P  +EPE+A LL+TY+++F  P  LPP R  +H
Sbjct: 233  QNTDAIDECFTVQWLKSTEITDIFKDFPTNIEPEIAILLHTYKDLFNTPSALPPSRSHNH 292

Query: 2044 EILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTW 2223
             I L EGA PVKVKPYRYPHSQKEQIE MVQ+ML++G              LVKKKDGTW
Sbjct: 293  TIPLIEGANPVKVKPYRYPHSQKEQIETMVQDMLQQGIIQPSTSPFSSPIVLVKKKDGTW 352

Query: 2224 RCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAF 2403
            R CTDYRALNA+TIKDSFP+PTVDELLDEL GA++FSK DLRSGYHQIL+ P+DR KTAF
Sbjct: 353  RFCTDYRALNALTIKDSFPIPTVDELLDELFGARYFSKLDLRSGYHQILLHPQDRYKTAF 412

Query: 2404 RTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQV 2583
            RTH GHYEWLVMPFGLTNAPATFQ LMN IFQ  LRK VLVFFDDIL+YS SW  HL  +
Sbjct: 413  RTHHGHYEWLVMPFGLTNAPATFQSLMNHIFQHVLRKYVLVFFDDILIYSSSWQDHLTHL 472

Query: 2584 ECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQL 2763
            + VL++L  + L+ KLSKCSFG++E+EYLGH V+G GVSMDR+KV AVL+W  P+N+KQL
Sbjct: 473  DAVLKVLQDNSLFVKLSKCSFGVSEIEYLGHMVTGQGVSMDRDKVQAVLNWSTPTNVKQL 532

Query: 2764 RGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPD 2943
            RGFLGLTGYYRRFIK+YA IA PLT+LLKK++++WN+   +AF+QLK+A+TSAPVL LP+
Sbjct: 533  RGFLGLTGYYRRFIKSYAKIAAPLTDLLKKESYRWNDQADIAFQQLKEAVTSAPVLALPN 592

Query: 2944 FSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRH 3123
            F +PF+LETDASG G+GAVL Q  HPIA+FSKK+ PR Q +SAY RE  AI  A+AKFRH
Sbjct: 593  FHKPFILETDASGVGIGAVLHQDNHPIAYFSKKLVPRNQKKSAYFREMLAIAEAIAKFRH 652

Query: 3124 YLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRL 3303
            YLLGH+FI+RT              TPEQQ WLHKF+GYDF IEYKPGK+N AADALSRL
Sbjct: 653  YLLGHRFIIRTDQKSLRSLMDQSLQTPEQQEWLHKFLGYDFVIEYKPGKENLAADALSRL 712

Query: 3304 MTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPL 3483
            MT++WS P+  F +++K+ +  +++L  I+Q+C Q  +A  NY V+ G+LYWK ++VIP 
Sbjct: 713  MTLAWSEPQYNFTQQVKEAIQQDDNLLEIIQKCLQG-LAPTNYTVREGILYWKHRMVIPP 771

Query: 3484 ESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSAT 3663
            ++ LIQQIL+E+H SPIGGHAG+TRTLARI +QFYW  M++D+ +YV  C+VCQQAK+  
Sbjct: 772  KAALIQQILEEFHTSPIGGHAGMTRTLARIKSQFYWSAMKKDIFDYVQNCLVCQQAKTTN 831

Query: 3664 TAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSK 3843
            T  AGLL+PLPIP+QVW+D+ MDFITGLP S GYT IMVVVDRLTK+AHFI MKTDY+SK
Sbjct: 832  TLPAGLLQPLPIPSQVWEDIAMDFITGLPLSFGYTTIMVVVDRLTKYAHFIAMKTDYTSK 891

Query: 3844 SVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEA 4023
            SVAEAFMH+VV+LHGMPKSI+SDRD+VFTS FWQ+LF+LQGTTLAM+SAYHPQ+DGQ+E 
Sbjct: 892  SVAEAFMHNVVKLHGMPKSIVSDRDKVFTSTFWQHLFKLQGTTLAMTSAYHPQSDGQTEV 951

Query: 4024 VNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAA 4203
            +NK LE+YLRCF+FNNPKSW+K L W+E WYNT+  TS GM+PFKALYGR+PP LTRY A
Sbjct: 952  LNKGLELYLRCFSFNNPKSWFKMLSWSEFWYNTAFQTSIGMTPFKALYGRDPPYLTRYVA 1011

Query: 4204 NSSDPVELQQQLLDRDAVITELKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYR 4383
             +SDP  LQ++L++RD ++ +LK NL RAQQ MK  ADK R D+  ++GD VLVKLQPYR
Sbjct: 1012 QASDPPTLQEELMERDKILQQLKDNLIRAQQYMKKQADKHRSDISLKIGDLVLVKLQPYR 1071

Query: 4384 QTSVALRKNQKLGMRYFGPFTILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEP 4563
            Q SVALRKNQKLG+RYFGPF I+A++G+VAYKL+LPD AKIH VFHVSQLK FKG   E 
Sbjct: 1072 QHSVALRKNQKLGLRYFGPFEIIARVGEVAYKLKLPDDAKIHPVFHVSQLKPFKGVADEQ 1131

Query: 4564 YIPLPLTTVEEGPILEPEGILKTRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQY 4743
            Y+PLPLT  + GP ++P  +L+ RT++R  QQ+ QVL++W     A+ATWED   +  ++
Sbjct: 1132 YLPLPLTMTDIGPSIQPIDVLQVRTVIRGSQQIHQVLIQWDQYPAAQATWEDITTIQEKF 1191

Query: 4744 PNLNLEDKIALEGEGDVM 4797
            P+LNLEDK+A  G+G VM
Sbjct: 1192 PSLNLEDKVAFNGDGIVM 1209


>gb|PNY16671.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1284

 Score = 1585 bits (4103), Expect = 0.0
 Identities = 765/1245 (61%), Positives = 951/1245 (76%), Gaps = 8/1245 (0%)
 Frame = +1

Query: 529  TMSSTPFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPW 708
            T +  PFQVR++KL+FPRF+GKNV +WIF+AEQFF YY+TPD +RL IASVHLD++VVPW
Sbjct: 55   TSNKPPFQVRNVKLEFPRFDGKNVHEWIFRAEQFFEYYDTPDLDRLTIASVHLDKDVVPW 114

Query: 709  FQMVNRSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVY 888
            +QMV R+ PFQ+W++FTRALELDFGPS+Y+CPRA+LFKL QS +V EYYL+FT L+NRVY
Sbjct: 115  YQMVQRTHPFQSWIEFTRALELDFGPSVYECPRATLFKLNQSGTVAEYYLKFTTLANRVY 174

Query: 889  GISNDALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAG 1068
            G+S+DALIDCF+SGL +++RRDVM+HTP S+                  +N   +K+N  
Sbjct: 175  GLSSDALIDCFISGLNNDIRRDVMIHTPPSLVKAFSLAKVYEEKY---TSNTNQKKFN-- 229

Query: 1069 STAVQARAPFNP----TRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKRIS-AEMQ 1233
            +T      PFN     TR+S                     MSQ QKNP I+RIS AE Q
Sbjct: 230  TTNYATNKPFNKPEILTRDSAP----------ILNTPPTRPMSQFQKNPNIRRISPAERQ 279

Query: 1234 LRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDD---DREIQKPVE 1404
            +R EKGLCY+CD+KFSF+H+CPN+ LML++                ++   +  I    E
Sbjct: 280  MRSEKGLCYWCDEKFSFTHKCPNRQLMLIQCDDSDADQMFEPMTQPEESTINSSITNQTE 339

Query: 1405 HHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTRCN 1584
            HHLSLNA+ G + MGV+RF G I  + V +L+DGGSSD+F+QPRIA  L LPIE   + N
Sbjct: 340  HHLSLNAMKGTSNMGVLRFTGSIEQIKVQVLIDGGSSDNFLQPRIAKFLKLPIESGPQFN 399

Query: 1585 VLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVADYAT 1764
            VLVGNG+ M AEG+IQ L + +QG  + VP +LLP+AG+DVILGA WLA+LGPHVADYA+
Sbjct: 400  VLVGNGETMTAEGIIQKLPLEIQGHKLDVPVFLLPIAGADVILGASWLATLGPHVADYAS 459

Query: 1765 SKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDM 1944
              L+F+ + KF+TL G+  P+   AQ H FKRL   D+ISE FT+Q +++   ED +KD+
Sbjct: 460  LTLKFFWKDKFITLSGEAVPRPIPAQFHHFKRLATTDSISECFTVQWLKSADTEDIFKDL 519

Query: 1945 PVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIE 2124
            P  ++PE+A LL+TY+ +FQ P  LPP R  +H I L +G+ PVKVKPYRYPHSQK QIE
Sbjct: 520  PTNIDPEIAILLHTYKNLFQTPSALPPNRTHNHTIPLIDGSNPVKVKPYRYPHSQKAQIE 579

Query: 2125 KMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELL 2304
             M+QEML++G              LVKKKDGTWR CTDYRALNAIT+KD FP+PTVDELL
Sbjct: 580  HMIQEMLQQGIIQPSTSPFSSPIILVKKKDGTWRFCTDYRALNAITVKDCFPIPTVDELL 639

Query: 2305 DELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLM 2484
            DEL GAK+FSK DLRSGYHQIL++PEDR KTAFRTHQGHYEWLVMPFGLTNAPATFQ LM
Sbjct: 640  DELCGAKYFSKLDLRSGYHQILLQPEDRYKTAFRTHQGHYEWLVMPFGLTNAPATFQSLM 699

Query: 2485 NEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVE 2664
            N IFQ  LRK VLVFFDDILVYS +WH H+Q +E VL++L  + LY KLSKCSFG+ E+E
Sbjct: 700  NHIFQHALRKYVLVFFDDILVYSSNWHEHIQHLEMVLKVLQDNTLYVKLSKCSFGVMEIE 759

Query: 2665 YLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNL 2844
            YLGH V+G GVSMD+ KV AVL+WP+P N+KQLRGFLGLTGYYRRFI++YA IA PLT+L
Sbjct: 760  YLGHIVTGQGVSMDKEKVQAVLNWPIPKNVKQLRGFLGLTGYYRRFIQSYAKIASPLTDL 819

Query: 2845 LKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPI 3024
            LKK+ F+WNE  + AF++LK A+TSAPVL LP+F  PF+LETDASG G+GAVL Q GHPI
Sbjct: 820  LKKEEFRWNEQAEQAFQKLKVAMTSAPVLALPNFKLPFILETDASGVGIGAVLQQQGHPI 879

Query: 3025 AFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTP 3204
            A+FSKK+ PR Q +SAY RE  AI  ++AKFRHYL+GHKFI+RT              TP
Sbjct: 880  AYFSKKLVPRNQKKSAYFREMLAIAESIAKFRHYLMGHKFIIRTDQKSLRSLMEQSLQTP 939

Query: 3205 EQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLK 3384
            EQQ WLH+F+GYDF IEYKPGK+N AADALSR+MT+SWS P+  F+E+LK  + ++  + 
Sbjct: 940  EQQEWLHRFLGYDFTIEYKPGKENIAADALSRVMTLSWSEPKYHFIEQLKVALQSDHDMS 999

Query: 3385 MIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTL 3564
             I+++C+    A   Y+++ GLLYWKQ+L+IP +S L+ +IL E+H SPIGGHAG+TRT+
Sbjct: 1000 NIIRKCKSGK-APIQYSIRDGLLYWKQRLMIPKDSDLLNKILFEFHTSPIGGHAGITRTI 1058

Query: 3565 ARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITG 3744
            ARI +QFYW  M+QD+ EYV  C+VCQQAK+  T+ AGLL+PLPIP QVW+D+ MDFITG
Sbjct: 1059 ARIKSQFYWQDMKQDITEYVQKCVVCQQAKTTNTSPAGLLQPLPIPAQVWEDIAMDFITG 1118

Query: 3745 LPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRV 3924
            LPPS GYT IMV+VDRLTK+AHFIPMK+DYSSKSVAEAFMH++V+LHGMPKSI+SDRD+V
Sbjct: 1119 LPPSSGYTTIMVIVDRLTKYAHFIPMKSDYSSKSVAEAFMHNIVKLHGMPKSIVSDRDKV 1178

Query: 3925 FTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWA 4104
            FTS FWQ LF+LQGT+LAMSSAYHPQ+DGQ+E +NK LE++LRCFTFNNPKSW K + WA
Sbjct: 1179 FTSSFWQQLFKLQGTSLAMSSAYHPQSDGQTEVLNKVLELFLRCFTFNNPKSWSKVISWA 1238

Query: 4105 ELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQL 4239
            E WYNT+  TS GM+PFKALYGR+PP LT+Y A   DP  LQ++L
Sbjct: 1239 EYWYNTAFQTSIGMTPFKALYGRDPPYLTKYEAQEIDPPTLQEEL 1283


>gb|PNX94483.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1287

 Score = 1578 bits (4085), Expect = 0.0
 Identities = 780/1321 (59%), Positives = 976/1321 (73%), Gaps = 5/1321 (0%)
 Frame = +1

Query: 256  TELRRQAETIEKNEAASRARFEELSSMQKASEARFNQLADAFEKLMQQTPTVTHGGTNSG 435
            +E++R  E  +  +A     FE L+  +  + A+F+++  A + L+ QTP+  H G    
Sbjct: 11   SEIKRNTEETKLLQAEI---FERLARSEAENGAKFDKIFAALDILIDQTPSKHHHGAGLN 67

Query: 436  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXIPTMSSTPFQVRHIKLDFPRFNGKNVLDWIF 615
                                             +  PFQVR++KL+FPRF+GKNV +WIF
Sbjct: 68   N--------------------------------NRPPFQVRNVKLEFPRFDGKNVHEWIF 95

Query: 616  KAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVNRSRPFQTWMDFTRALELDFGPSMY 795
            +AEQFF YY+TPD +RL IASVHLD++VVPW+QM+ R+ PF +W++ TRALELDFGPS+Y
Sbjct: 96   RAEQFFEYYDTPDPDRLTIASVHLDKDVVPWYQMMQRTHPFMSWIELTRALELDFGPSIY 155

Query: 796  DCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISNDALIDCFVSGLKDELRRDVMLHTPT 975
            +CPRA+LFKL Q+ SV +YYL+FT+L+NRVYG+SNDAL+DCFVSGL +E+RRDV++HTP 
Sbjct: 156  ECPRATLFKLNQTGSVADYYLQFTSLANRVYGLSNDALVDCFVSGLSNEIRRDVLIHTPP 215

Query: 976  SITXXXXXXXXXXXXXXXXNTNVQNQKYNAGSTAVQARAPFNPTRNSQTQATEKAXXXXX 1155
            S+                  +N+++QK N   T      PF  T   +T     A     
Sbjct: 216  SLVKAVSLAKVYEEKYA---SNLKSQKSN--HTNYSTNQPF--TNKPETITRNSAPILNT 268

Query: 1156 XXXXXXXXMSQLQKNPAIKRIS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXX 1332
                    MSQ QKNP IKRIS AE Q+RR+KGLCY+CDDKFS++H+CPN+ LMLL+   
Sbjct: 269  PPTRP---MSQFQKNPNIKRISPAERQVRRDKGLCYWCDDKFSYTHKCPNRQLMLLQYDD 325

Query: 1333 XXXXXXXXXXANSDD----DREIQKPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILL 1500
                      ++S +      E  +P EHHLS NA+ G + MG++RF G I  + V IL+
Sbjct: 326  NEEENVVEIPSDSSELAINTLETTQP-EHHLSFNAMKGNSSMGILRFSGTIEHIQVQILI 384

Query: 1501 DGGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAY 1680
            DGGSSD+F+QPRIA  L LPIEP     VLVGNG+ M AEGVIQNL + +QG ++ VP +
Sbjct: 385  DGGSSDNFLQPRIARFLKLPIEPGPVFKVLVGNGEIMTAEGVIQNLALNIQGTELQVPVF 444

Query: 1681 LLPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKR 1860
            LLPVAG+DVILGA WLA+LGPHVADYA+  L+F++ GKFVTLQG+  P+ A AQ H FKR
Sbjct: 445  LLPVAGADVILGASWLATLGPHVADYASLTLKFFINGKFVTLQGEATPRPAAAQFHHFKR 504

Query: 1861 LQHMDAISELFTLQQIEAVAPEDNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELS 2040
            L + DAI+E FT+Q +++   ED ++D+P  +EPE+A LL+TY+E+F+ P  LPP R  +
Sbjct: 505  LHYTDAIAECFTIQWLKSHTDEDIFRDLPTNIEPEIAILLHTYKELFKTPNTLPPHRSHN 564

Query: 2041 HEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGT 2220
            H I L EG  PVKVKPYRYPHSQK QIE MVQ+ML++G              LVKKKDGT
Sbjct: 565  HSIPLIEGHNPVKVKPYRYPHSQKNQIELMVQDMLQQGIIQPSTSPFSSPIVLVKKKDGT 624

Query: 2221 WRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTA 2400
            WR CTDYRALNAIT+KDSFP+PTVDELLDEL GA FFSK DLRSGYHQIL++PEDR KTA
Sbjct: 625  WRFCTDYRALNAITVKDSFPIPTVDELLDELFGAXFFSKLDLRSGYHQILLQPEDRYKTA 684

Query: 2401 FRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQ 2580
            FRTHQG YEWLVMPFGLTNAPATFQ LMN IFQ  LRK VLVFFDDILVYS +W  HL  
Sbjct: 685  FRTHQGLYEWLVMPFGLTNAPATFQGLMNHIFQHALRKYVLVFFDDILVYSSTWQDHLSH 744

Query: 2581 VECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQ 2760
            ++ VL+ L  + L+ KLSKCSFG++E+EYLGH V+G GVSMDR+K+ AVL+WP+P N+KQ
Sbjct: 745  LDTVLKTLLDNDLFVKLSKCSFGVSEIEYLGHLVTGQGVSMDRDKIQAVLEWPIPKNIKQ 804

Query: 2761 LRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILP 2940
            LRGFLGLTGYYRRFIK YA +A PLT+LLKK+ + WN  T+LAF++LK A+TSAPVL LP
Sbjct: 805  LRGFLGLTGYYRRFIKHYARLAAPLTDLLKKENYNWNAQTELAFQELKAAMTSAPVLALP 864

Query: 2941 DFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFR 3120
            +F +PF+LETDASG GVGAVL Q GHPIA+FSKK+ PR Q +SAY RE  AI+ A+AKFR
Sbjct: 865  NFQKPFILETDASGVGVGAVLHQDGHPIAYFSKKLVPRNQKKSAYFREMLAISEAIAKFR 924

Query: 3121 HYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSR 3300
            HYLLGH+FI+RT              TPEQQ WLHKF+GYDF IEYKPGK+N AADALSR
Sbjct: 925  HYLLGHRFIIRTDQKSLRSLMDQSLQTPEQQQWLHKFLGYDFVIEYKPGKENLAADALSR 984

Query: 3301 LMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIP 3480
            +MT++WS P +  + ++K E+  + +   I+++C     +   Y ++ G+LYWK ++VIP
Sbjct: 985  VMTLAWSEPISQLITQIKGEIKADIYWSDIIEKCISQGNSYLQYTLRDGVLYWKNRVVIP 1044

Query: 3481 LESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSA 3660
             +S LIQQ+L E+H SPIGGHAG TRTLARI +QFYW  M++DV EY+  C++CQQAK+ 
Sbjct: 1045 QKSALIQQVLHEFHSSPIGGHAGFTRTLARIKSQFYWVAMKKDVLEYIQNCVICQQAKTT 1104

Query: 3661 TTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSS 3840
             T  AGLL+PLPIP QVW+D+ MDFITGLP S+GYT I+VVVDRLTK+AHF+PMKTDYSS
Sbjct: 1105 NTLPAGLLQPLPIPTQVWEDIAMDFITGLPLSYGYTTILVVVDRLTKYAHFLPMKTDYSS 1164

Query: 3841 KSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSE 4020
            KSVAE FM+ +V+LHGMPKSI+SDRD+VFTS FWQ+LF+LQGT+LAMSSAYHPQ+DGQ+E
Sbjct: 1165 KSVAEVFMNHIVKLHGMPKSIVSDRDKVFTSSFWQHLFKLQGTSLAMSSAYHPQSDGQTE 1224

Query: 4021 AVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYA 4200
             +NK LE++LRCFTFNNPKSWYKAL W+E WYNT+ HTS GM+PFKALYGREPP LTRY 
Sbjct: 1225 VLNKGLELFLRCFTFNNPKSWYKALAWSEFWYNTALHTSIGMTPFKALYGREPPTLTRYE 1284

Query: 4201 A 4203
            A
Sbjct: 1285 A 1285


>gb|PNX96484.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1258

 Score = 1553 bits (4020), Expect = 0.0
 Identities = 756/1215 (62%), Positives = 919/1215 (75%), Gaps = 3/1215 (0%)
 Frame = +1

Query: 1180 MSQLQKNPAIKRIS-AEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXX 1356
            MSQ Q NP +KRIS AE QLRR+KGLCY+CDDKFSF+H+CPN+ LMLL+           
Sbjct: 1    MSQFQTNPNVKRISPAERQLRRDKGLCYWCDDKFSFTHKCPNRQLMLLQYEDSDDQVLDE 60

Query: 1357 XXANSDDDRE--IQKPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQ 1530
                 D           E HLS+NA+ G++ MGV+RF G I  + V IL+DGGSSD+F+Q
Sbjct: 61   MTQPPDQTTNGPTNNLPELHLSMNAMKGSSHMGVLRFTGAIEHIQVQILIDGGSSDNFVQ 120

Query: 1531 PRIAHCLNLPIEPVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVI 1710
            PRIA  L LPIEP     VLVGNG+ M AEG+++ L + VQG  + VP YLLPVAG+DVI
Sbjct: 121  PRIAKFLKLPIEPAPVFKVLVGNGEVMTAEGIVKQLPLDVQGHRLQVPVYLLPVAGADVI 180

Query: 1711 LGAPWLASLGPHVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISEL 1890
            LGA WL++LGPHVADYA+  ++F+L  KFVTL G+   +   AQ H  KR  H DAI E 
Sbjct: 181  LGASWLSTLGPHVADYASLTIKFFLHDKFVTLVGEPVARPEPAQFHHMKRFHHTDAIDEC 240

Query: 1891 FTLQQIEAVAPEDNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAP 2070
            F +Q  +     D +K++P   EPE+A LLY Y+ VF+ P  LPP R  +H I L EG  
Sbjct: 241  FAIQWFKDNEVADIFKELPTNTEPEIAMLLYNYQAVFKTPTTLPPTRAHNHAIPLLEGTN 300

Query: 2071 PVKVKPYRYPHSQKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRAL 2250
            PVKVKPYRYPHSQK QIE MVQ+ML++G              LVKKKDGTWR CTDYRAL
Sbjct: 301  PVKVKPYRYPHSQKTQIEIMVQDMLQQGIIQPSTSPFSSPIILVKKKDGTWRFCTDYRAL 360

Query: 2251 NAITIKDSFPMPTVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEW 2430
            NAIT+KDSFP+PTVDELLDEL GAK FSK DLRSGYHQIL++PEDR KTAFRTHQGHYEW
Sbjct: 361  NAITVKDSFPIPTVDELLDELFGAKHFSKLDLRSGYHQILLQPEDRHKTAFRTHQGHYEW 420

Query: 2431 LVMPFGLTNAPATFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQ 2610
            LVMPFGLTNAPATFQ LMN IFQ  LRK VLVFFDDILVYS +WH HL+ +  VLQ+L  
Sbjct: 421  LVMPFGLTNAPATFQSLMNTIFQTVLRKYVLVFFDDILVYSKTWHEHLKHLAAVLQVLQD 480

Query: 2611 HVLYAKLSKCSFGLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGY 2790
            + LY KLSKCSFG+ E+EYLGH VSG GVSMD+ K+ AV+DWP P N+KQLRGFLGLTGY
Sbjct: 481  NQLYVKLSKCSFGVLEIEYLGHVVSGEGVSMDKTKIQAVVDWPPPKNIKQLRGFLGLTGY 540

Query: 2791 YRRFIKAYASIAGPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLET 2970
            YRRFI++YA IA PLT+LLKK+A+ W    + AF++LK AIT+APVL LPDF++PFVLET
Sbjct: 541  YRRFIQSYAKIASPLTDLLKKEAYTWTSQEESAFQKLKHAITTAPVLALPDFTKPFVLET 600

Query: 2971 DASGTGVGAVLSQGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFIL 3150
            DASG G+GAVL Q GHPIA+FSKK+ PR Q +SAY RE  AI  A+AKFRHYLLGHKFI+
Sbjct: 601  DASGIGIGAVLHQEGHPIAYFSKKLVPRNQRKSAYFREMLAIAEAIAKFRHYLLGHKFII 660

Query: 3151 RTXXXXXXXXXXXXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPE 3330
            RT              TP+QQ WLH+F+GYDF IEYKPGK+N AADALSR+MTM+WS P+
Sbjct: 661  RTDQKSLRNLMEQALQTPDQQEWLHRFLGYDFSIEYKPGKENVAADALSRVMTMAWSEPQ 720

Query: 3331 NVFLEELKKEVATNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQIL 3510
               L +++  +  +  L  I+Q+C QN   + +Y VK  LL+WK ++VIP  S+LI+Q+L
Sbjct: 721  YKLLHQIRAALKQDSTLLGIMQKCVQNNATNSHYTVKDELLFWKHRIVIPKNSELIKQVL 780

Query: 3511 KEYHDSPIGGHAGVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEP 3690
             E H SPIGGHAG+ RTLAR+ +QFYWP M+ D+ +YV  C +CQ+AK+  T  AGLL+P
Sbjct: 781  YELHTSPIGGHAGMARTLARVKSQFYWPDMKTDIADYVQNCAICQKAKTTNTLPAGLLQP 840

Query: 3691 LPIPNQVWDDVTMDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHS 3870
            LPIP+QVW+DV MDFITGLP S GYT I+VV+DRLTK+AHFIP+KTDYSSK VAEA M +
Sbjct: 841  LPIPSQVWEDVAMDFITGLPSSQGYTTILVVIDRLTKYAHFIPLKTDYSSKIVAEAVMDN 900

Query: 3871 VVRLHGMPKSIISDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYL 4050
            +V+LHGMPKSI+SDRD+VFTS FWQ LF+LQGTTLAMSSAYHPQ+DGQSE +NK LE++L
Sbjct: 901  IVKLHGMPKSIVSDRDKVFTSSFWQQLFKLQGTTLAMSSAYHPQSDGQSEVLNKTLELFL 960

Query: 4051 RCFTFNNPKSWYKALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQ 4230
            RCFTF+NPKSW KAL W+E WYNT+  TS GM+PFKALYGR+PP L RY   ++DP  LQ
Sbjct: 961  RCFTFDNPKSWCKALSWSEFWYNTAFQTSIGMTPFKALYGRDPPALIRYETQANDPPTLQ 1020

Query: 4231 QQLLDRDAVITELKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKN 4410
            ++L++RD +I +LK NL +AQQ MK  ADK R DV  +VGD VLVKLQPYRQ SVALRKN
Sbjct: 1021 EKLMERDRIIQQLKLNLEKAQQYMKKQADKHRVDVKLQVGDLVLVKLQPYRQQSVALRKN 1080

Query: 4411 QKLGMRYFGPFTILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTV 4590
            QKLGMRYFGPF ++AK+G+VAYKL+LP+ AKIH VFHVSQLK FKG   E Y+PLPL+  
Sbjct: 1081 QKLGMRYFGPFEVIAKVGEVAYKLKLPEHAKIHPVFHVSQLKPFKGDNQEQYMPLPLSMT 1140

Query: 4591 EEGPILEPEGILKTRTILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKI 4770
            + GP+++P  +L TRTI+R  Q++ QVL++W     AEATWED + +  ++P  NLEDK+
Sbjct: 1141 DTGPMIQPVSVLATRTIIRGAQRIQQVLIQWDQYSTAEATWEDVDALQSKFPAFNLEDKV 1200

Query: 4771 ALEGEGDVMNNKSVN 4815
            A  G+G VM+    N
Sbjct: 1201 AFIGDGIVMSPMEEN 1215


>dbj|GAU23504.1| hypothetical protein TSUD_39830 [Trifolium subterraneum]
          Length = 1160

 Score = 1522 bits (3940), Expect = 0.0
 Identities = 748/1150 (65%), Positives = 888/1150 (77%), Gaps = 4/1150 (0%)
 Frame = +1

Query: 1219 SAEMQLRREKGLCYFCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDDDREIQ-- 1392
            SAEMQLRREKGLCY CDDK+SF+H+CPNKHLM+L+L            AN   D  I   
Sbjct: 3    SAEMQLRREKGLCYTCDDKWSFNHKCPNKHLMILQLAAEDIEEPP---ANEPPDPPITDS 59

Query: 1393 --KPVEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIE 1566
              K +E HLSLNAL G++G+G I+F G IG + V IL+DGGSSD+F+QPRIA  L L +E
Sbjct: 60   ETKQLELHLSLNALKGSSGVGTIKFTGHIGHIAVQILVDGGSSDNFLQPRIAKFLKLDVE 119

Query: 1567 PVTRCNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPH 1746
                  VLVGNG  +  EG I NL V VQG ++ +P YLLPV G+D+ILGA WLA+LGPH
Sbjct: 120  SAPLFKVLVGNGHSLNPEGSISNLCVTVQGHEMHMPVYLLPVVGADLILGATWLATLGPH 179

Query: 1747 VADYATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPE 1926
            VADY    L+F+  GKF+TL G+       A LH  +RL   +AI   F + + + +   
Sbjct: 180  VADYQALCLKFFYNGKFITLHGEKSILPQQAHLHHMRRLYQTNAIEACFAIHRADPMIHL 239

Query: 1927 DNWKDMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHS 2106
            DNW ++P  +EPEM  LL  Y+ VF  P GLPPPR+ +H I L  GA  VKV+PYRYP S
Sbjct: 240  DNWLELPENMEPEMVLLLNNYKAVFHKPTGLPPPRQQNHTIPLLGGAKHVKVRPYRYPQS 299

Query: 2107 QKEQIEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMP 2286
            QKEQIE M+QEML EG              LVKKKDGTWR CTDYRALNAITIKDSFPMP
Sbjct: 300  QKEQIELMIQEMLAEGIIQPSNSPFSSPIVLVKKKDGTWRFCTDYRALNAITIKDSFPMP 359

Query: 2287 TVDELLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPA 2466
            TVDELLDEL GA+ FSK DLRSGYHQILV+P DR KTAFRTHQGHYEWLVMPFGLTNAPA
Sbjct: 360  TVDELLDELFGAQCFSKLDLRSGYHQILVEPADRYKTAFRTHQGHYEWLVMPFGLTNAPA 419

Query: 2467 TFQRLMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSF 2646
            +FQ LMN+IFQP LRK VLVFFDDILVYSPS+ THLQ +E VLQIL  H L+ KLSKCSF
Sbjct: 420  SFQCLMNQIFQPLLRKYVLVFFDDILVYSPSFSTHLQHLETVLQILQHHTLFVKLSKCSF 479

Query: 2647 GLTEVEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIA 2826
            G+ EV+YLGHTVSGSGV+MD+ KV  VLDWP P+N+KQLRGFLGLTGYYRRFIK+YA+IA
Sbjct: 480  GMKEVDYLGHTVSGSGVAMDKLKVQTVLDWPKPTNVKQLRGFLGLTGYYRRFIKSYATIA 539

Query: 2827 GPLTNLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLS 3006
             PLTNLLKK+AF+WN N + AF  LK AITSAPVL LP F+QPF LETDASGTGVGAVLS
Sbjct: 540  TPLTNLLKKEAFEWNSNAEQAFSHLKSAITSAPVLALPQFNQPFTLETDASGTGVGAVLS 599

Query: 3007 QGGHPIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXX 3186
            QGGHPIAFFSKKM PRMQ QSAYTRE +AIT A+AKFRHYLLGHKF ++T          
Sbjct: 600  QGGHPIAFFSKKMVPRMQKQSAYTRELFAITEAIAKFRHYLLGHKFTIKTDQKSLRSLMD 659

Query: 3187 XXXXTPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVA 3366
                TPEQQAWLHKFIGYDF IEYKPGK+N AADALSR++ ++WS P+  FL+EL+   +
Sbjct: 660  QSLQTPEQQAWLHKFIGYDFNIEYKPGKENVAADALSRVLLVAWSEPKLQFLDELRAATS 719

Query: 3367 TNEHLKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHA 3546
             +  L+ +  Q       D  Y    GL+++  ++++P   +L ++IL E+H SPIGGHA
Sbjct: 720  NDADLRTMSSQ-------DPRYVHHDGLIFFNNRIMVPTHQELKEKILFEFHSSPIGGHA 772

Query: 3547 GVTRTLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVT 3726
            G+TRTLARI AQFYW  M+QD++++V  C++CQQAK  T A AGLL+PLPIP QVW+DV 
Sbjct: 773  GITRTLARISAQFYWVNMKQDIQQFVQNCVICQQAKHETRAPAGLLQPLPIPEQVWEDVA 832

Query: 3727 MDFITGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSII 3906
            MDFITGLP SHGYTVIMVV+DRLTK++HFIP+K DY+SKSVAEAFM  +V+LHG+PKSI+
Sbjct: 833  MDFITGLPSSHGYTVIMVVIDRLTKYSHFIPLKMDYNSKSVAEAFMKYIVKLHGLPKSIV 892

Query: 3907 SDRDRVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWY 4086
            SDRD+VF S+FW++LF+LQGTTLAMSSAYHPQTDGQSEA+NKCLEMYLRC TF  PK+W 
Sbjct: 893  SDRDKVFVSQFWKDLFKLQGTTLAMSSAYHPQTDGQSEALNKCLEMYLRCLTFQQPKTWS 952

Query: 4087 KALCWAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITE 4266
            KAL WAE WYNT+HHTS GM+PF+ALYGR PP L RY  +  D   +QQQL DRD ++ +
Sbjct: 953  KALDWAEYWYNTAHHTSLGMTPFRALYGRHPPSLVRYHPSPHDAATVQQQLSDRDNLLAQ 1012

Query: 4267 LKRNLARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFT 4446
            +K NL RAQQ+MK+ ADK RR VVFEVGD+VLVKLQPYRQ S+ALRK+QKLGMR+FGPFT
Sbjct: 1013 IKVNLCRAQQVMKDQADKHRRHVVFEVGDKVLVKLQPYRQNSIALRKHQKLGMRFFGPFT 1072

Query: 4447 ILAKIGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGIL 4626
            I+ K+G VAYKLQLP  A+IH VFH+SQLKLFKG+ +EPY+PLPLTT E GPIL+P  +L
Sbjct: 1073 IIEKVGMVAYKLQLPTEARIHPVFHISQLKLFKGSCSEPYMPLPLTTHELGPILQPARVL 1132

Query: 4627 KTRTILRNEQ 4656
            + R +L+  +
Sbjct: 1133 QDRVVLKGTE 1142


>gb|KYP39590.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1292

 Score = 1470 bits (3806), Expect = 0.0
 Identities = 705/1247 (56%), Positives = 911/1247 (73%), Gaps = 1/1247 (0%)
 Frame = +1

Query: 544  PFQVRHIKLDFPRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVN 723
            P QVR++KLDFPRF+G +VL WIF+AEQFF YYNTPD +R++IA+VH ++ VVPW+QM+ 
Sbjct: 55   PLQVRNVKLDFPRFDGSDVLQWIFRAEQFFEYYNTPDEQRIVIAAVHFEKNVVPWYQMMQ 114

Query: 724  RSRPFQTWMDFTRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISND 903
            R+ P  +W   TR+LEL+FGPS +D PR++LFKL Q+ SVN+YY+EFT L+NR+YG+S +
Sbjct: 115  RTSPIISWNTLTRSLELEFGPSPFDSPRSTLFKLVQTGSVNDYYIEFTNLANRIYGVSAE 174

Query: 904  ALIDCFVSGLKDELRRDVMLHTPTSITXXXXXXXXXXXXXXXXNTNVQNQKYNAGSTAVQ 1083
            AL+DCF+SGLK +++R+++   P S+                    +  +KY+  S   Q
Sbjct: 175  ALLDCFISGLKPDIKREIIAQAPNSLLKAISL------------ARLFEEKYSFRSR--Q 220

Query: 1084 ARAPFNPTRNSQTQATEKAXXXXXXXXXXXXXMSQLQKNPAIKRIS-AEMQLRREKGLCY 1260
            +    N + ++  Q+                  +   +N A++++S AEMQ RRE+GLC+
Sbjct: 221  SFVTRNTSHSAGNQSYTNPAQQPLLNTPNIKPAAFPNRNTAVRKMSPAEMQSRRERGLCF 280

Query: 1261 FCDDKFSFSHRCPNKHLMLLELXXXXXXXXXXXXANSDDDREIQKPVEHHLSLNALNGAT 1440
             CD++FS +HRCPNK  +LL++              + D   ++  +EHHLS NAL G  
Sbjct: 281  TCDERFSANHRCPNKQYLLLQVEDEEELEE----TTNVDSTALEDELEHHLSFNALKGVA 336

Query: 1441 GMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTRCNVLVGNGQKMQAE 1620
             +G +RF G I   +V ILLD GSSD+F+QP++AH L LPIEP     V+VGNG  +  E
Sbjct: 337  TVGTMRFTGSIAGKEVHILLDSGSSDNFLQPKLAHYLKLPIEPAAGLQVMVGNGSSLSTE 396

Query: 1621 GVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKFV 1800
            G I NL V+VQG  + +P YLL V+G+D++LGA WLA+LGPH+ADY +  ++FY + K V
Sbjct: 397  GKILNLQVQVQGQVLQLPVYLLSVSGADLVLGAAWLATLGPHIADYGSLTIKFYKDKKLV 456

Query: 1801 TLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDMPVTVEPEMATLL 1980
            TLQG+     A++Q H  KRL H   I+E++TLQ + +    D WKD+P  V+PE+A LL
Sbjct: 457  TLQGEKSRPAAMSQFHHLKRLNHTQGIAEVYTLQLLSSFVETDQWKDIPDNVDPEIALLL 516

Query: 1981 YTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGXX 2160
            + YR++F  P GLPPPR  +H I L +G+ PVKV+PY+YPHSQK+QIE M++EMLE+G  
Sbjct: 517  HYYRQIFAKPTGLPPPRSQNHRIPLLQGSGPVKVRPYKYPHSQKQQIELMIKEMLEDGII 576

Query: 2161 XXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSKF 2340
                        LVKKKDG+WR CTDYRALNAIT+KDSFP+PTV++LLDEL GAK+FSK 
Sbjct: 577  APSSSPFSSPIILVKKKDGSWRFCTDYRALNAITVKDSFPIPTVEDLLDELFGAKYFSKL 636

Query: 2341 DLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKCV 2520
            DLR+GYHQILV+ EDR KTAFRTHQGHYEWLVMPFGLTNAPATFQ LMN+IFQ  LRK V
Sbjct: 637  DLRAGYHQILVQEEDRYKTAFRTHQGHYEWLVMPFGLTNAPATFQNLMNDIFQGLLRKSV 696

Query: 2521 LVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGVS 2700
            LVFFDDILVYS SW  HLQ ++ VL IL++H LYAK+SKCSFGL +VEYLGH VSG GVS
Sbjct: 697  LVFFDDILVYSSSWFLHLQHLQQVLDILAKHELYAKMSKCSFGLEQVEYLGHVVSGDGVS 756

Query: 2701 MDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNENT 2880
            M+ +KV AV+DWPVP  +KQLRGFLGLTGYYRRFI+ YASIA PLT+LLKKD F+W+   
Sbjct: 757  METSKVQAVIDWPVPKTIKQLRGFLGLTGYYRRFIQGYASIANPLTDLLKKDNFKWSNEA 816

Query: 2881 QLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRMQ 3060
              AF  LK+AIT+APVL LPDFSQPFVLETDASG+G+GAVLSQ  HPIAFFSKK++ RM 
Sbjct: 817  DAAFIALKQAITTAPVLSLPDFSQPFVLETDASGSGIGAVLSQNKHPIAFFSKKLSNRMT 876

Query: 3061 LQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIGY 3240
             QSAYTREFYAIT A+AKFRHYLLGH+FI+RT              TPEQQAWLHKF+GY
Sbjct: 877  KQSAYTREFYAITEAIAKFRHYLLGHRFIIRTDQKSLKSLLDQTLQTPEQQAWLHKFLGY 936

Query: 3241 DFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTVA 3420
            DF IEYKPG +N AADALSR   M+ +   +  + ++K  + ++  L+ I+    Q    
Sbjct: 937  DFSIEYKPGTENLAADALSRSFFMASAVTASDLVHQIKAALGSDTALQPILTAHSQGKAL 996

Query: 3421 DRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPKM 3600
               Y+   GLL+WK ++V+P    +  QIL+E+H SP+GGH+G+ RT AR+ AQF+WP M
Sbjct: 997  SAPYSFLDGLLFWKGRIVVPNVPAIQNQILQEFHSSPLGGHSGIARTFARVAAQFFWPGM 1056

Query: 3601 RQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIMV 3780
             +D+K +V  C VCQQAK+AT   AGLL+PLPIP Q+W+D++MDFI GLPP+ GYTVI V
Sbjct: 1057 NKDIKNFVQQCCVCQQAKTATVLPAGLLQPLPIPTQIWEDISMDFIVGLPPAEGYTVIFV 1116

Query: 3781 VVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFEL 3960
            +VDRL+K+AHF P+K+D++SK VA+ F+H+VV+LHG P SI+SDRD+VFTS FWQ+L +L
Sbjct: 1117 IVDRLSKYAHFAPLKSDFNSKRVADVFLHTVVKLHGFPNSIVSDRDKVFTSTFWQHLLKL 1176

Query: 3961 QGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTSA 4140
             GTTL +S+AYHPQ+DGQ+EA+NKCLEMYLRCFT   PK W K L WAE WYNTS H SA
Sbjct: 1177 SGTTLKLSTAYHPQSDGQTEALNKCLEMYLRCFTHEKPKDWIKFLPWAEFWYNTSFHHSA 1236

Query: 4141 GMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITELKRNL 4281
             MSPFK +YGR+PP L +Y+ +++DP  +Q+ LL RD V+ +LK NL
Sbjct: 1237 QMSPFKVVYGRDPPTLVKYSHSATDPPSIQEMLLQRDRVLAQLKVNL 1283


>dbj|GAU35592.1| hypothetical protein TSUD_295280 [Trifolium subterraneum]
          Length = 1358

 Score = 1402 bits (3628), Expect = 0.0
 Identities = 692/1137 (60%), Positives = 837/1137 (73%)
 Frame = +1

Query: 1399 VEHHLSLNALNGATGMGVIRFKGCIGPVDVSILLDGGSSDSFIQPRIAHCLNLPIEPVTR 1578
            ++HHLSLNA+NG + +G +R  G +   DV IL+DGGS+D+F QPR+A  L LPIEP++ 
Sbjct: 199  IDHHLSLNAMNGCSSLGTLRLTGQVLGKDVHILVDGGSTDNFFQPRLAKFLKLPIEPISN 258

Query: 1579 CNVLVGNGQKMQAEGVIQNLTVRVQGIDITVPAYLLPVAGSDVILGAPWLASLGPHVADY 1758
             NVLVGNG KM AEG I NLTV VQG ++ +P +LLP A +D+ILG+ WLA+LGPHVADY
Sbjct: 259  FNVLVGNGHKMTAEGKITNLTVSVQGHEMMIPVFLLPFASADLILGSSWLATLGPHVADY 318

Query: 1759 ATSKLRFYLEGKFVTLQGDTDPKTAIAQLHQFKRLQHMDAISELFTLQQIEAVAPEDNWK 1938
            +   L+F+L GKFVTLQ   +     AQLH   RL +  AI+EL+T+     +       
Sbjct: 319  SALSLKFFLNGKFVTLQCTANHTPNSAQLHHLSRLHNTHAIAELYTMYYFNMMYEPTPPL 378

Query: 1939 DMPVTVEPEMATLLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQ 2118
            ++  TV  ++A++L  +  +F  P  LPP R  +H I L +    VKV+PYRYPHSQK Q
Sbjct: 379  ELSSTVPHDLASILTQFSHLFHSPTILPPQRSENHYIPLVDETKVVKVRPYRYPHSQKAQ 438

Query: 2119 IEKMVQEMLEEGXXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDE 2298
            IE MV++ML++G              LVKKK GTWR CTDYRALNA+TIKDSFP+PTVDE
Sbjct: 439  IELMVEDMLKQGIIQPSTSPFSSPIILVKKKGGTWRFCTDYRALNALTIKDSFPLPTVDE 498

Query: 2299 LLDELHGAKFFSKFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQR 2478
            LLDEL+GAKFFSK DLRSGYHQIL+ P+DR KTAFRTH GHYEWLVMPFGL+NAPA+FQ 
Sbjct: 499  LLDELYGAKFFSKLDLRSGYHQILMNPKDRHKTAFRTHHGHYEWLVMPFGLSNAPASFQS 558

Query: 2479 LMNEIFQPFLRKCVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTE 2658
            LMN++FQ  LRK VLVFFDDILVY+ +W +HL  +E VLQ+L  H L+ KLSKCSFG++E
Sbjct: 559  LMNQVFQKVLRKFVLVFFDDILVYNATWSSHLLHLEIVLQLLDHHKLFVKLSKCSFGMSE 618

Query: 2659 VEYLGHTVSGSGVSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLT 2838
            V+YLGHT+ G GV MD+ KV AVL WP P+NLKQLRGFLGLTGYYRRFIK+YA+IA PLT
Sbjct: 619  VDYLGHTIFGMGVIMDKTKVQAVLQWPTPTNLKQLRGFLGLTGYYRRFIKSYATIASPLT 678

Query: 2839 NLLKKDAFQWNENTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGH 3018
            +LLKKD+F W++N   AF  LK+AITSAPVLILP+FSQPF+LETDASG G+GAVLSQ GH
Sbjct: 679  DLLKKDSFNWSDNAMKAFNTLKQAITSAPVLILPNFSQPFMLETDASGIGIGAVLSQQGH 738

Query: 3019 PIAFFSKKMAPRMQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXX 3198
            PIA+FSKK+ P  Q QS Y REF AIT ALAKFRHYLLGHKFI+RT              
Sbjct: 739  PIAYFSKKLGPTSQKQSGYLREFRAITEALAKFRHYLLGHKFIIRTDQQSLESLLDQTLQ 798

Query: 3199 TPEQQAWLHKFIGYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEH 3378
            TPEQQAWLHKFIG+DF IEYKPGKDN+AADALSR+ ++SWS P++ FL++L +E+   ++
Sbjct: 799  TPEQQAWLHKFIGFDFTIEYKPGKDNKAADALSRMFSLSWSEPQSQFLKDLHQEIQRCDY 858

Query: 3379 LKMIVQQCEQNTVADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTR 3558
               I  QC  +   D  YAV+ GLLYW  +L IP +S +I +IL E+HDSPIGGHAG+TR
Sbjct: 859  FYPIFLQCVSDKPLDPAYAVRDGLLYWNNRLAIPPKSAMISKILTEFHDSPIGGHAGITR 918

Query: 3559 TLARIGAQFYWPKMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFI 3738
            TLARI AQF+WP MRQ + +YV  C +CQQAK  T+   GLL PLPIP  VW+D+ MDFI
Sbjct: 919  TLARISAQFFWPHMRQTISQYVKQCSICQQAKHTTSLPLGLLCPLPIPKHVWEDIAMDFI 978

Query: 3739 TGLPPSHGYTVIMVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRD 3918
            TG P S G+TVI+VVVDR TKF HF  +K D+ SK VAE  M  +V+L+GMPKSI+SDRD
Sbjct: 979  TGFPNSGGFTVILVVVDRFTKFGHFFALKKDFDSKKVAEIMMQHIVKLYGMPKSIVSDRD 1038

Query: 3919 RVFTSKFWQNLFELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALC 4098
            +VFTSKFWQ+LF+LQGTTLAMSSAYHPQ+DGQ+EA+NKCLEMYLRC TF NP  W K L 
Sbjct: 1039 KVFTSKFWQHLFKLQGTTLAMSSAYHPQSDGQTEALNKCLEMYLRCLTFANPNIWSKMLH 1098

Query: 4099 WAELWYNTSHHTSAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITELKRN 4278
             A+ WYNTS HTSA M+PFKALYG++PP LTR                          +N
Sbjct: 1099 LAQYWYNTSFHTSAAMTPFKALYGKDPPTLTR------------------------SNQN 1134

Query: 4279 LARAQQIMKNNADKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFTILAK 4458
            L +AQQ MK  ADKKR  + F +GD VLVKLQPYRQT VA R N K  ++YFGPF + AK
Sbjct: 1135 LHKAQQAMKFQADKKRLPMEFTIGDMVLVKLQPYRQTVVATRANHKSSLKYFGPFPVTAK 1194

Query: 4459 IGKVAYKLQLPDTAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGILKTRT 4638
            IG +AYKLQLP TA+IH VFH+SQLK F G+ T+PY PL  TT   GP+L+PE ILK RT
Sbjct: 1195 IGSIAYKLQLPSTARIHPVFHISQLKKFNGSATDPYYPLSDTTTVLGPLLQPESILKVRT 1254

Query: 4639 ILRNEQQVPQVLVKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGEGDVMNNKS 4809
            IL+    VPQVLVKWQ +  + ATWEDK E+   YPN NLEDKI   G G ++   S
Sbjct: 1255 ILKGPLLVPQVLVKWQDIDESLATWEDKKEILENYPNFNLEDKIVFNG-GSIVREPS 1310



 Score =  140 bits (353), Expect = 2e-29
 Identities = 86/249 (34%), Positives = 118/249 (47%), Gaps = 2/249 (0%)
 Frame = +1

Query: 223 MADNTRLKELTTELRRQAETIEKNEAASRARFEELSSMQKASEARFNQLADAFEKLMQQ- 399
           MAD TR K L T+L    E I        ++  E  +   +S A  + +    E L+Q  
Sbjct: 1   MADGTRFKSLETQLAHLTEIINTQIQTQISQINETVANHSSSIAETSSVLQRMETLLQSL 60

Query: 400 -TPTVTHGGTNSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIPTMSSTPFQVRHIKLDF 576
              T+THG                                   P     PFQ R +KL+F
Sbjct: 61  AATTITHGSQP--------------------------------PPRHHHPFQARSVKLEF 88

Query: 577 PRFNGKNVLDWIFKAEQFFGYYNTPDTERLIIASVHLDQEVVPWFQMVNRSRPFQTWMDF 756
           PRF+  + L+WIF+A QFF YY+TPD E L                              
Sbjct: 89  PRFDDSHALEWIFRANQFFDYYDTPDPEHL------------------------------ 118

Query: 757 TRALELDFGPSMYDCPRASLFKLQQSKSVNEYYLEFTALSNRVYGISNDALIDCFVSGLK 936
           +RA+E +FGPS +D PR +LFKL Q+ SV++YY+ FTAL NR  G++ DAL+DCF+SGL+
Sbjct: 119 SRAIEREFGPSAFDRPRTTLFKLAQTGSVDDYYMVFTALDNRSTGLTPDALLDCFISGLQ 178

Query: 937 DELRRDVML 963
            EL+R+  L
Sbjct: 179 KELQREASL 187


>gb|PNX98300.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1139

 Score = 1310 bits (3390), Expect = 0.0
 Identities = 640/1051 (60%), Positives = 779/1051 (74%), Gaps = 3/1051 (0%)
 Frame = +1

Query: 1663 ITVPAYLLPVAGSDVILGAPWLASLGPHVADYATSKLRFYLEGKFVTLQGDTDPKTAIAQ 1842
            IT+P YLLP++G+D++LGA WLA+LGPH+ADY    ++FY++  FVTL GD       AQ
Sbjct: 1    ITLPVYLLPISGADLVLGAAWLATLGPHIADYNALSIKFYIDNHFVTLFGDKPSLPRPAQ 60

Query: 1843 LHQFKRLQHMDAISELFTLQQIEAVAPEDNWKDMPVT---VEPEMATLLYTYREVFQVPK 2013
             H  +RL   DAI ++++LQ        D+  D+ ++   + P+M  LL  + EVF  P 
Sbjct: 61   FHHLRRLHQTDAIEQVYSLQL------HDDSLDVSLSSQSLHPQMKELLLQFPEVFATPS 114

Query: 2014 GLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEGXXXXXXXXXXXXX 2193
            GLPP R   H I L EG+ PVKVKPYRYPHSQK QIE MVQEMLE+G             
Sbjct: 115  GLPPSRSHDHAIPLLEGSNPVKVKPYRYPHSQKTQIELMVQEMLEQGIIQPSTSPFSSPV 174

Query: 2194 XLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFSKFDLRSGYHQILV 2373
             LVKKKDGTWR CTDYRALN IT+KDSFP+PTVDELLDEL+GA  FSK DLRSGYHQILV
Sbjct: 175  LLVKKKDGTWRFCTDYRALNTITVKDSFPIPTVDELLDELYGASHFSKLDLRSGYHQILV 234

Query: 2374 KPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRKCVLVFFDDILVYS 2553
             P+DR KTAFRTHQG YEWLVMPFGLTNAPATFQ LMN++FQ  LRKCVLVFFDDILVYS
Sbjct: 235  APQDRYKTAFRTHQGLYEWLVMPFGLTNAPATFQSLMNQVFQHLLRKCVLVFFDDILVYS 294

Query: 2554 PSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSGVSMDRNKVNAVLD 2733
            PSW+ HL  +  VLQ+L  H LYAK+SKCSFG+T+++YLGH VSG+GV MD +KV AV++
Sbjct: 295  PSWNEHLLHLHEVLQLLRDHCLYAKVSKCSFGVTKIDYLGHIVSGNGVEMDDSKVKAVME 354

Query: 2734 WPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNENTQLAFEQLKKAI 2913
            WP+P N+KQLRGFLGLTGYYRRFIK YA IA PLT LLKKD FQW+     AF +L++AI
Sbjct: 355  WPLPCNIKQLRGFLGLTGYYRRFIKNYALIALPLTALLKKDQFQWSSEATDAFHKLQQAI 414

Query: 2914 TSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPRMQLQSAYTREFYA 3093
            T APVL LP+F + FVLETDASG+GVGAVLSQ  HPIAFFSKK+ PRMQ QSAY RE YA
Sbjct: 415  TQAPVLALPNFKEDFVLETDASGSGVGAVLSQNHHPIAFFSKKLNPRMQRQSAYVRELYA 474

Query: 3094 ITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFIGYDFQIEYKPGKD 3273
            IT A+AKFRHYLLGH+FI+RT              TPEQQ WLHKF+G+DF IEYKPG+D
Sbjct: 475  ITEAMAKFRHYLLGHRFIIRTDQKSLKSLMDQTIQTPEQQNWLHKFLGFDFTIEYKPGRD 534

Query: 3274 NQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNTVADRNYAVKAGLL 3453
            N AADALSR   M++S  +++ L +++  +  +  L  I   C +    D  Y V+  +L
Sbjct: 535  NIAADALSRSFMMAFSSQQSLLLHQIRLAIQQDHELAAIKALCIEGKAPDPFYTVQHDML 594

Query: 3454 YWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWPKMRQDVKEYVAAC 3633
            +WK +LV+P    ++ QIL EYH SP+GGH+GV RT ARI  QF+WP M +D+ ++V  C
Sbjct: 595  FWKNRLVVPKSPHIVNQILLEYHASPVGGHSGVQRTKARICQQFFWPHMSKDITKFVTEC 654

Query: 3634 IVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVIMVVVDRLTKFAHF 3813
            + CQQAKS+T   AGLL+PLPIP Q+W+D+ MDFITGLPPS+G+TVI VVVDRL+K+ HF
Sbjct: 655  LTCQQAKSSTALPAGLLQPLPIPLQIWEDIAMDFITGLPPSNGFTVIFVVVDRLSKYGHF 714

Query: 3814 IPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLFELQGTTLAMSSAY 3993
             P+K+D+ +  VA AF +++V+LHG+PKSI+SDRD+VFTS+FWQ LF L GTTLAMS+AY
Sbjct: 715  TPLKSDFDAPKVAAAFFNNIVKLHGIPKSIVSDRDKVFTSQFWQQLFRLSGTTLAMSTAY 774

Query: 3994 HPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHTSAGMSPFKALYGR 4173
            HPQTDGQSEA+NKCLEMYLRCFT  NPKSW + L WAE WYNT+   S  M+PFKA+YGR
Sbjct: 775  HPQTDGQSEALNKCLEMYLRCFTGTNPKSWARLLPWAEYWYNTAFQNSIAMTPFKAVYGR 834

Query: 4174 EPPMLTRYAANSSDPVELQQQLLDRDAVITELKRNLARAQQIMKNNADKKRRDVVFEVGD 4353
            +PP L RY   S+DP  LQ+ L++RD V+ +LK NL  AQ  MK  ADKKRR + F+VGD
Sbjct: 835  DPPSLIRYVPQSNDPPNLQELLIERDVVLQQLKSNLLTAQGFMKKFADKKRRILEFQVGD 894

Query: 4354 QVLVKLQPYRQTSVALRKNQKLGMRYFGPFTILAKIGKVAYKLQLPDTAKIHSVFHVSQL 4533
             VLVKLQPYRQ SV LRKNQKLG+RYFGPFT+LAK+G VAY+LQLP   KIH VFHVS L
Sbjct: 895  SVLVKLQPYRQHSVVLRKNQKLGLRYFGPFTVLAKVGPVAYRLQLPPGTKIHPVFHVSLL 954

Query: 4534 KLFKGTMTEPYIPLPLTTVEEGPILEPEGILKTRTILRNEQQVPQVLVKWQHLQVAEATW 4713
            K  KG     Y+PLPL     GP+L P  +L+TR +  N Q + Q L++W  L  AEATW
Sbjct: 955  KPCKGEHDNTYLPLPLLQHAHGPLLTPLRVLQTRMVPSNGQLIAQALIQWDGLTEAEATW 1014

Query: 4714 EDKNEMAIQYPNLNLEDKIALEGEGDVMNNK 4806
            ED   +   YP+LNLEDK+   G G+V  +K
Sbjct: 1015 EDCLTLKKNYPSLNLEDKVVFNGGGNVTYDK 1045


>gb|PNX73110.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 937

 Score = 1301 bits (3368), Expect = 0.0
 Identities = 618/937 (65%), Positives = 754/937 (80%)
 Frame = +1

Query: 1975 LLYTYREVFQVPKGLPPPRELSHEILLKEGAPPVKVKPYRYPHSQKEQIEKMVQEMLEEG 2154
            LL+TY+++F+ P  LPP R  +H I L +GA PVKVKPYRYPHSQK QIE M+QEML++G
Sbjct: 2    LLHTYKDLFKAPVALPPNRSHNHSIPLIDGANPVKVKPYRYPHSQKAQIEHMIQEMLQQG 61

Query: 2155 XXXXXXXXXXXXXXLVKKKDGTWRCCTDYRALNAITIKDSFPMPTVDELLDELHGAKFFS 2334
                          LVKKKDGTWR CTDYRALNAIT+KD FP+PTVDELLDEL GAK+FS
Sbjct: 62   IIQTSTSPFSSPIILVKKKDGTWRFCTDYRALNAITVKDCFPIPTVDELLDELFGAKYFS 121

Query: 2335 KFDLRSGYHQILVKPEDRQKTAFRTHQGHYEWLVMPFGLTNAPATFQRLMNEIFQPFLRK 2514
            K DLRSGYHQIL++PEDR KTAFRTHQGHYEWLVMPFGLTNAPATFQ LMN IFQ  LRK
Sbjct: 122  KLDLRSGYHQILLQPEDRHKTAFRTHQGHYEWLVMPFGLTNAPATFQSLMNHIFQHALRK 181

Query: 2515 CVLVFFDDILVYSPSWHTHLQQVECVLQILSQHVLYAKLSKCSFGLTEVEYLGHTVSGSG 2694
             VLVFFDDIL+YS +W  HL+ ++ VL+IL ++ L+ KLSKCSFG+ E+EYLGH V+G G
Sbjct: 182  YVLVFFDDILIYSRTWQDHLKHLDAVLKILQENTLFVKLSKCSFGVLEIEYLGHMVTGQG 241

Query: 2695 VSMDRNKVNAVLDWPVPSNLKQLRGFLGLTGYYRRFIKAYASIAGPLTNLLKKDAFQWNE 2874
            VSMD++KV AVL+WP P N+KQLRGFLGLTGYYRRFIK+YA IA PLT+LLKK+A+ WN+
Sbjct: 242  VSMDKDKVQAVLNWPTPKNVKQLRGFLGLTGYYRRFIKSYAKIASPLTDLLKKEAYAWND 301

Query: 2875 NTQLAFEQLKKAITSAPVLILPDFSQPFVLETDASGTGVGAVLSQGGHPIAFFSKKMAPR 3054
              +LAF+QLK A+T+APVL LP+F QPF+LETDASG G+GAVL Q GHPIA+FSKK+ PR
Sbjct: 302  LAELAFQQLKSAVTTAPVLALPNFHQPFILETDASGVGIGAVLHQEGHPIAYFSKKLVPR 361

Query: 3055 MQLQSAYTREFYAITAALAKFRHYLLGHKFILRTXXXXXXXXXXXXXXTPEQQAWLHKFI 3234
             Q +SAY RE  AI  A+AKFRHYLLGHKFI+RT              TPEQQ WLHKF+
Sbjct: 362  NQKKSAYFREMLAIAEAIAKFRHYLLGHKFIIRTDQKSLRSLMEQSLQTPEQQEWLHKFL 421

Query: 3235 GYDFQIEYKPGKDNQAADALSRLMTMSWSGPENVFLEELKKEVATNEHLKMIVQQCEQNT 3414
            GYDF IEYKPGK+N AADALSRLMT++WS P+  F+E++K  +  +  +  I+ +C    
Sbjct: 422  GYDFTIEYKPGKENMAADALSRLMTLAWSEPQCQFIEQVKLALQNDNQMMEIMLKCASGK 481

Query: 3415 VADRNYAVKAGLLYWKQKLVIPLESKLIQQILKEYHDSPIGGHAGVTRTLARIGAQFYWP 3594
             A   Y ++ GLLYWKQ+LVIP +++L+ ++L E+H SPIGGHAG+TRT+ARI +QFYWP
Sbjct: 482  -APIQYTMREGLLYWKQRLVIPKQNELLHKVLYEFHTSPIGGHAGITRTMARIKSQFYWP 540

Query: 3595 KMRQDVKEYVAACIVCQQAKSATTAYAGLLEPLPIPNQVWDDVTMDFITGLPPSHGYTVI 3774
             M++D+ EYV  C+VCQQAK+  T+ AGLL+PLPIP+QVW+D+ MDFITGLP S+GYT I
Sbjct: 541  DMKKDILEYVQNCVVCQQAKTTNTSPAGLLQPLPIPSQVWEDIAMDFITGLPLSYGYTTI 600

Query: 3775 MVVVDRLTKFAHFIPMKTDYSSKSVAEAFMHSVVRLHGMPKSIISDRDRVFTSKFWQNLF 3954
            MVVVDRLTK+AHFIPM+TDY+S+SVAEAFMH++V+LHGMPKSI+SDRD+VFTS FWQ LF
Sbjct: 601  MVVVDRLTKYAHFIPMRTDYTSRSVAEAFMHNIVKLHGMPKSIVSDRDKVFTSAFWQQLF 660

Query: 3955 ELQGTTLAMSSAYHPQTDGQSEAVNKCLEMYLRCFTFNNPKSWYKALCWAELWYNTSHHT 4134
            +LQGT+LAMSSAYHPQ+DGQ+E +NK LE++LRCF+F+NPKSWYK L WAE WYNT+  T
Sbjct: 661  KLQGTSLAMSSAYHPQSDGQTEVLNKGLELFLRCFSFHNPKSWYKVLSWAEYWYNTAFQT 720

Query: 4135 SAGMSPFKALYGREPPMLTRYAANSSDPVELQQQLLDRDAVITELKRNLARAQQIMKNNA 4314
            S GM+PFKALYGR+PP LT+Y A  +D   LQ++L++RD ++ +LK NL RAQQ MK  A
Sbjct: 721  SIGMTPFKALYGRDPPYLTKYEAQVTDSPALQEELMERDKILQQLKINLERAQQYMKKQA 780

Query: 4315 DKKRRDVVFEVGDQVLVKLQPYRQTSVALRKNQKLGMRYFGPFTILAKIGKVAYKLQLPD 4494
            DK R +V  +VGD VLVKLQPYRQ SV+LRKNQKLGMRYFGPF I+A++G VAYKL+LPD
Sbjct: 781  DKHRSEVNLQVGDLVLVKLQPYRQQSVSLRKNQKLGMRYFGPFEIIARVGNVAYKLKLPD 840

Query: 4495 TAKIHSVFHVSQLKLFKGTMTEPYIPLPLTTVEEGPILEPEGILKTRTILRNEQQVPQVL 4674
             AKIH VFHVSQLK FKG   + Y+PLPLT  E GPI++P   L+ RTI+R  Q+V Q+L
Sbjct: 841  NAKIHPVFHVSQLKPFKGIAQDQYLPLPLTMSETGPIIQPIAALEARTIMRGMQKVHQIL 900

Query: 4675 VKWQHLQVAEATWEDKNEMAIQYPNLNLEDKIALEGE 4785
            V+W  + V EATWED + +  ++P LNLEDKIA  GE
Sbjct: 901  VQWDQMPVTEATWEDLDVLQDKFPTLNLEDKIAFNGE 937


Top