BLASTX nr result

ID: Glycyrrhiza29_contig00010413 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00010413
         (1217 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium ...   368   e-120
GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterran...   373   e-120
GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterran...   392   e-120
GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]   389   e-119
GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterran...   382   e-116
XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [...   326   e-106
GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterran...   339   e-105
KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine...   323   e-105
GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterran...   330   e-102
GAU33427.1 hypothetical protein TSUD_380630 [Trifolium subterran...   332   e-100
GAU47212.1 hypothetical protein TSUD_403530 [Trifolium subterran...   317   1e-97
GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum]   313   1e-96
KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine...   296   9e-95
GAU24549.1 hypothetical protein TSUD_148900 [Trifolium subterran...   318   9e-95
XP_019433565.1 PREDICTED: uncharacterized protein LOC109340346 [...   296   9e-94
GAU30605.1 hypothetical protein TSUD_62250 [Trifolium subterraneum]   306   1e-93
GAU51438.1 hypothetical protein TSUD_413380, partial [Trifolium ...   307   2e-92
GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterran...   304   7e-92
KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]       300   1e-91
GAU32122.1 hypothetical protein TSUD_218730 [Trifolium subterran...   307   9e-91

>GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium subterraneum]
          Length = 557

 Score =  368 bits (945), Expect = e-120
 Identities = 171/359 (47%), Positives = 230/359 (64%), Gaps = 1/359 (0%)
 Frame = +3

Query: 144  GKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIARFNQFISEMHL 323
            GK + W +LI  +       WC+ GDFNS++ + E+RG+SN     +   F Q I  M L
Sbjct: 3    GKRKLWHDLIEFRMNNAPGEWCLGGDFNSITKTSERRGSSNWSGNTERTEFVQIIETMEL 62

Query: 324  VDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSV 503
            +D+PVLGKKFTW + D  A SRLDRFLLSE LI K  +  QWVG+RDISDH PIWL+C+ 
Sbjct: 63   IDIPVLGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGITNQWVGDRDISDHYPIWLECNN 122

Query: 504  YDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVF 683
             +W PKPF+FNNCWL H +F  FV+  W+   + G K +               WN EVF
Sbjct: 123  RNWCPKPFKFNNCWLEHPDFIPFVKASWESMDIHGRKAFILKEKLKRLKESLKKWNHEVF 182

Query: 684  GFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKESLIRQKARCKWI 860
            G MDLNIE  V+++N +++++ +G+S  +  N K+ +  FW Q+  KES+++QK+R KWI
Sbjct: 183  GIMDLNIEKTVKELNEIEEMIANGNSHPMYPNSKKQSKMFWEQLRFKESILKQKSRTKWI 242

Query: 861  AEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNESRPV 1040
             EGD+NT FFHA ++GR R N+I  +RK + W+EGVD +K   K H+   F E   SRP 
Sbjct: 243  QEGDSNTSFFHATIKGRHRSNRIAKIRKGNEWIEGVDEIKQAAKDHYSVHFSEEWHSRPF 302

Query: 1041 LDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIKTE 1217
            L GI F  +SA DN  L EPF  EE++D VWSC+G K PGPDGFN  F+K  W ++K +
Sbjct: 303  LQGIDFNSLSADDNAFLLEPFGEEEVRDTVWSCDGNKCPGPDGFNINFFKVCWSIVKDD 361


>GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterraneum]
          Length = 721

 Score =  373 bits (958), Expect = e-120
 Identities = 176/378 (46%), Positives = 246/378 (65%), Gaps = 1/378 (0%)
 Frame = +3

Query: 81   RKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGA 260
            R  R  D   +VNVYSPC   GK + W++L+  K+  GG  WCV GDFNS+  S E++G+
Sbjct: 196  RVDREGDELNIVNVYSPCIISGKKKLWEDLLALKQSTGGGKWCVRGDFNSILHSSERKGS 255

Query: 261  SNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVA 440
            S   R+ + + FN+F+ EM L+D PVLGKKF+W+S DG++ SR+DRFLLS+  + K+ + 
Sbjct: 256  SIVSRQNESSLFNRFVEEMELIDTPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGIT 315

Query: 441  AQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIY 620
             +W+G+RDIS H PIWL CS Y+WGPKPFR  N W+ H +F  FVE  WK F V G K  
Sbjct: 316  GKWIGDRDISYHCPIWLLCSSYNWGPKPFRVINGWMEHPDFFDFVETTWKSFDVHGKK-- 373

Query: 621  AFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSR 797
                              EV+GF+DLNIE  V D+NV+++++G  D  +   R+  L   
Sbjct: 374  -----------------GEVYGFLDLNIEKTVTDINVIENLLGGDDEEIDLTRRAGLNKD 416

Query: 798  FWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGV 977
            FW Q+  KESL++QK+R +W+ EGD+N++FFH  ++ RRRRNQ++AL+  D WV+GVD V
Sbjct: 417  FWKQLIHKESLLKQKSRMRWVKEGDSNSKFFHESIKSRRRRNQLVALKDGDRWVQGVDDV 476

Query: 978  KHEVKRHFQDFFLEPNESRPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSP 1157
            K  VK +F++ F E    RP L+GI F  +S  DN  L  PF+++E+++ +WS +  K P
Sbjct: 477  KAFVKNYFENNFREDWAYRPNLNGIAFQSLSEEDNLSLMAPFSIDEVREVIWSSDWNKCP 536

Query: 1158 GPDGFNFTFYKQFWELIK 1211
            GPDG NF F K  WE+IK
Sbjct: 537  GPDGINFNFLKACWEIIK 554


>GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterraneum]
          Length = 1794

 Score =  392 bits (1006), Expect = e-120
 Identities = 180/406 (44%), Positives = 257/406 (63%), Gaps = 1/406 (0%)
 Frame = +3

Query: 3    MIIIWDTNVLEXXXXXXXXXXXXXCARKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCK 182
            ++I+W+  +               C   +    + F++N+YSPC   GK + W +L+  K
Sbjct: 748  LLIMWNAGLFNLKFSFTGDNFLGLCVECKEG--ILFIINIYSPCSLSGKRKLWSDLLEFK 805

Query: 183  RILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWW 362
            +      WC+ GDFN V  + E++G+S   R+ +   F QF+  M L DVPV GKKF+W+
Sbjct: 806  QNNEQGEWCLGGDFNVVLKTGERKGSSALCRQNERLEFCQFVEAMELCDVPVAGKKFSWF 865

Query: 363  SGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNC 542
            S DG + SRLDRFLLSE+ I   +V  QW+G+RDISDH PIWL CS  +WGPKPF+ NNC
Sbjct: 866  SADGTSMSRLDRFLLSEKFIDSEKVTGQWIGSRDISDHCPIWLLCSNLNWGPKPFKVNNC 925

Query: 543  WLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQD 722
            WL H EFK FVE+ W    V G K +               WN++VFG +DLNIENIV++
Sbjct: 926  WLEHPEFKPFVEKTWNKLNVEGKKAFVIKEKLKRLKEELRRWNRDVFGILDLNIENIVRE 985

Query: 723  MNVLDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHAC 899
            +N  + ++   G + V  +   +  +FW Q+H KESLI+QK+R KW+ EGD+N+RFFHA 
Sbjct: 986  LNEAEGLLAIDGANSVTCDVSAINKKFWDQLHFKESLIKQKSRLKWVREGDSNSRFFHAS 1045

Query: 900  VRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNESRPVLDGIVFPQISAAD 1079
            ++ RRRRNQ+  LR+ + W++GVD +K EVK +F   F E   +RP + GI F  +SA D
Sbjct: 1046 IKSRRRRNQLSILRRGEEWIQGVDNIKSEVKNYFVTNFTEDWHNRPFVHGINFNVLSAKD 1105

Query: 1080 NGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIKTE 1217
            N  L +PF+ E++++ +WSC+G KSPGPDGFNF F K+ W ++K++
Sbjct: 1106 NDFLLQPFSEEDVREVLWSCDGNKSPGPDGFNFNFLKECWSILKSD 1151


>GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]
          Length = 1594

 Score =  389 bits (999), Expect = e-119
 Identities = 176/369 (47%), Positives = 248/369 (67%), Gaps = 1/369 (0%)
 Frame = +3

Query: 108  FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 287
            +++NVYSPC   GK + W +L+  K       WC+ GDFN V +  E++G++++ R+ + 
Sbjct: 704  YIINVYSPCSLSGKRKLWSDLLEFKLNNEQGEWCLRGDFNVVLNVGERKGSTSSARQNER 763

Query: 288  ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467
              F QF+  M L+DVPV GKKF+W+S DG A SRLDRFLLS+  I K +VA QW+GN DI
Sbjct: 764  LEFCQFVEAMELIDVPVAGKKFSWFSADGNAISRLDRFLLSDNFIEKEEVAGQWIGNHDI 823

Query: 468  SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647
            SDH PIWL CS  +WGPKPF+ NNCWL H EFK FVE+ W+   + G K +         
Sbjct: 824  SDHCPIWLMCSNLNWGPKPFKVNNCWLEHSEFKLFVEKTWEKLNIRGKKAFVIKEKLKRL 883

Query: 648  XXXXXXWNKEVFGFMDLNIENIVQDMNVLDDIVGS-GDSRVGENRKELTSRFWHQIHAKE 824
                  WN+EVF  +DLNIE  V+++N ++ +VG+ G + V  ++  +  +FW Q++ KE
Sbjct: 884  KEELRGWNREVFSILDLNIEKTVKELNEVEGLVGNDGVNSVMGDKSGVNRKFWEQLYFKE 943

Query: 825  SLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQ 1004
            S+I+QK+R KW+ EGD+NTRFF A ++ RRRRNQ++ LR+ D  ++GVD +K EVK HF 
Sbjct: 944  SMIKQKSRLKWVREGDSNTRFFQASLKNRRRRNQLVLLRRGDDLIQGVDNIKMEVKNHFA 1003

Query: 1005 DFFLEPNESRPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTF 1184
              F E    RP ++GI F ++S  DN  L +PF+ E +++ +WSC+G KSPGPDGFNF F
Sbjct: 1004 RNFTEEWHHRPFVNGINFNELSTEDNEFLLQPFSEERVREVIWSCDGNKSPGPDGFNFNF 1063

Query: 1185 YKQFWELIK 1211
            +K+FW  +K
Sbjct: 1064 WKEFWSTLK 1072


>GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterraneum]
          Length = 1636

 Score =  382 bits (980), Expect = e-116
 Identities = 178/371 (47%), Positives = 241/371 (64%), Gaps = 1/371 (0%)
 Frame = +3

Query: 108  FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 287
            ++VN+YSPC   G                       GDFNS++   E+RG+      R+ 
Sbjct: 670  YIVNIYSPCTMAG-----------------------GDFNSITKIGERRGSHGGSVYRER 706

Query: 288  ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467
              F+QFI  M LVD+PVLGKKFTW++ D  A SRLDRFLLSE  I K  ++ QWVGNRDI
Sbjct: 707  IEFSQFIDAMELVDIPVLGKKFTWFNSDCSAMSRLDRFLLSEGFIEKGGISNQWVGNRDI 766

Query: 468  SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647
            SDH PIWL+ S  +WGPKPF+FNNCWL H +F  FV+  W+   + G K +         
Sbjct: 767  SDHCPIWLESSNINWGPKPFKFNNCWLEHSDFLPFVKATWEKMNIHGKKAFIIKEKLKRL 826

Query: 648  XXXXXXWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGE-NRKELTSRFWHQIHAKE 824
                  WN+EVFG MDLNIE  V+D+N +++++ +GD+++   N KEL+ +FW Q+H KE
Sbjct: 827  KEALKTWNQEVFGIMDLNIEKTVKDLNEIEELIANGDNQLDSVNSKELSKKFWEQLHFKE 886

Query: 825  SLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQ 1004
            S+++QK+R KWI EGD+NTRFFHA ++GRRRRN+I+ L+K + W++GV  +K+  K HF 
Sbjct: 887  SILQQKSRTKWIQEGDSNTRFFHASIKGRRRRNRIVKLKKGNEWIQGVTEIKNVTKDHFA 946

Query: 1005 DFFLEPNESRPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTF 1184
              F E   +RP L GI F  +S ADN  L EPF  EE+++ +WSC+G KSPGPDGFNF F
Sbjct: 947  KHFSEEWPNRPFLQGIDFHTLSDADNAFLVEPFNEEEVRETIWSCDGNKSPGPDGFNFNF 1006

Query: 1185 YKQFWELIKTE 1217
             K  W ++K++
Sbjct: 1007 MKACWSIVKSD 1017


>XP_006577560.1 PREDICTED: uncharacterized protein LOC102665607 [Glycine max]
          Length = 326

 Score =  326 bits (836), Expect = e-106
 Identities = 157/302 (51%), Positives = 204/302 (67%), Gaps = 1/302 (0%)
 Frame = +3

Query: 315  MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 494
            M+L+D P+ G KFT++  DG A+SRLDRFL+S+ ++  WQ   Q VG RDI DH PIWL+
Sbjct: 1    MNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQEKGQRVGKRDIYDHCPIWLE 60

Query: 495  CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNK 674
            CS  +WGPKPFRFNNCWL H +FKSF+ E WK  Q+ G K Y               WNK
Sbjct: 61   CSNLNWGPKPFRFNNCWLEHDDFKSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNK 120

Query: 675  EVFGFMDLNIENIVQDMNVLDDIVGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARC 851
            EVFG++DLNIENIV +MN LD  +  G +      +KE  + FW Q+  KESL++QK+R 
Sbjct: 121  EVFGWLDLNIENIVAEMNKLDRGIEEGCNLNEVVKKKEAKALFWQQLMMKESLLKQKSRL 180

Query: 852  KWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNES 1031
            +WI EGD NT+FFH+C++ RRR+NQIL+L+ +   VE V  VK EV+R F++ F E + S
Sbjct: 181  RWIKEGDYNTKFFHSCLQDRRRKNQILSLQVEGRCVEQVGEVKMEVRRFFEEGFKEASFS 240

Query: 1032 RPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIK 1211
            RPVL GI F  + + +N  L  PF+ EEIKD VWSC+G K PGPDGFN  F K+ WE +K
Sbjct: 241  RPVLGGIEFQTLGSEENSFLVAPFSEEEIKDVVWSCDGNKCPGPDGFNLRFIKKCWEFVK 300

Query: 1212 TE 1217
             +
Sbjct: 301  DD 302


>GAU51623.1 hypothetical protein TSUD_414500 [Trifolium subterraneum]
          Length = 838

 Score =  339 bits (870), Expect = e-105
 Identities = 151/300 (50%), Positives = 204/300 (68%), Gaps = 1/300 (0%)
 Frame = +3

Query: 315  MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 494
            M L+D+PV+GKKF+W+S DG+A SRLDRFLLS+  I K ++  QW+GNRDISDH P+WL 
Sbjct: 1    MELIDIPVIGKKFSWFSADGKAMSRLDRFLLSDNFIAKEEILGQWIGNRDISDHCPVWLI 60

Query: 495  CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNK 674
            CS  +WGPKPF+FNNCWL H E   FV   W    V G K +               WN+
Sbjct: 61   CSNLNWGPKPFKFNNCWLKHPELSLFVTRIWVKMNVTGKKAFVIKEKLKRLKEELRGWNR 120

Query: 675  EVFGFMDLNIENIVQDMNVLDDIVG-SGDSRVGENRKELTSRFWHQIHAKESLIRQKARC 851
            EVFG +DLNIEN V+++N L+ +    G + +  ++  +  +FW Q++ KESLIRQK+R 
Sbjct: 121  EVFGILDLNIENTVKELNELEGLAAIDGTNSMLVDKGGINKKFWDQLNFKESLIRQKSRA 180

Query: 852  KWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNES 1031
             W++EGD+NTRFFHA ++ RRRRNQ++ LR+ D W++GV+ +K EVK HF   F E  E+
Sbjct: 181  NWVSEGDSNTRFFHASLKSRRRRNQMIMLRRGDEWIQGVENIKLEVKNHFAGNFTEDWEN 240

Query: 1032 RPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIK 1211
            RP +  I F ++S  DN  L +PF  EE+K+ VWSC+G KSP PDGFNF F K+ W ++K
Sbjct: 241  RPFVHDINFKELSEEDNAFLIQPFVEEEVKEVVWSCDGNKSPVPDGFNFNFLKECWSMVK 300


>KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine soja]
          Length = 362

 Score =  323 bits (829), Expect = e-105
 Identities = 157/305 (51%), Positives = 208/305 (68%), Gaps = 1/305 (0%)
 Frame = +3

Query: 204  WCVAGDFNSVSSSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 383
            WC+ GDFN+VS+ +E+ G S N+   D+  FN F++EM+L+D P+ G KFT++  DG A+
Sbjct: 58   WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117

Query: 384  SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 563
            SRLDRFL+S+ ++  WQV  Q VG RDISDH PIWL+CS  +WGPKPFRFNNCWL H  F
Sbjct: 118  SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177

Query: 564  KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNVLDDI 743
            KSF+ E WK  Q+ G K Y               WNKEVFG++DLNIENIV DMN LD  
Sbjct: 178  KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237

Query: 744  VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRR 920
            +  G +  V   +KE  + FW Q+  KESL++QK+R +WI EGD+NT+FFH+C++ RRR+
Sbjct: 238  IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297

Query: 921  NQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNESRPVLDGIVFPQISAADNGVLTEP 1100
            NQIL+L+ +   VE V  VK EV+R F++ F E + SRPVL GI F  + + +N  L  P
Sbjct: 298  NQILSLQVEGRCVEQVGEVKMEVRRFFEEGFKEASFSRPVLGGIEFQTLGSEENSFLVAP 357

Query: 1101 FTMEE 1115
            F+ EE
Sbjct: 358  FSEEE 362


>GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score =  330 bits (845), Expect = e-102
 Identities = 158/369 (42%), Positives = 219/369 (59%), Gaps = 1/369 (0%)
 Frame = +3

Query: 108  FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 287
            + VN+Y  C   GK + W++LI  K +     WC+ GDFNS++   ++ G+SN    ++ 
Sbjct: 76   YFVNIYFACSLAGKRKLWRDLIDFKLLNTPGEWCLGGDFNSITKVSKRSGSSNGSSNKER 135

Query: 288  ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467
              F QFI  M LVD+PV GKKFTW + D  A SRLDRFLLSE LI K  ++ QWVG RDI
Sbjct: 136  TEFAQFIDAMELVDIPVFGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGISNQWVGGRDI 195

Query: 468  SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647
            SDH PIWL+CS  +WGPKPF+FNN WL H +F  FV+  W+   + G K +         
Sbjct: 196  SDHHPIWLECSNINWGPKPFKFNNFWLDHPDFIPFVKATWESMNIHGKKAFILKEKLKRL 255

Query: 648  XXXXXXWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSR-VGENRKELTSRFWHQIHAKE 824
                  WN+EVFG MDL+IE  V+D+N +++++ +GD   +  N K+L+ +FW Q+H KE
Sbjct: 256  KEVLKTWNREVFGIMDLDIEKTVKDLNEVEEMIANGDCHPLFSNAKDLSKKFWEQLHNKE 315

Query: 825  SLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQ 1004
            S                           RRR N+I+ LRK + W++GV  +K+E + HF 
Sbjct: 316  S---------------------------RRRSNRIVKLRKGNGWIQGVAEIKNEAQDHFS 348

Query: 1005 DFFLEPNESRPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTF 1184
              F E   +RP L+GI F  +S  DN  L + F+ EE+++ VWSC+G KSPGPDG+N  F
Sbjct: 349  KHFSEEWHNRPFLNGINFNTLSVIDNCFLLDNFSEEEVRETVWSCDGNKSPGPDGYNINF 408

Query: 1185 YKQFWELIK 1211
             K  W ++K
Sbjct: 409  LKACWSIVK 417


>GAU33427.1 hypothetical protein TSUD_380630 [Trifolium subterraneum]
          Length = 1110

 Score =  332 bits (850), Expect = e-100
 Identities = 166/360 (46%), Positives = 212/360 (58%), Gaps = 2/360 (0%)
 Frame = +3

Query: 138  WGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIARFNQFISEM 317
            + GK + WK L+  K       WC+ GDFN+V  + E+RG S                  
Sbjct: 317  FSGKRKLWKGLLDFKLQNDHGEWCLGGDFNAVLKTGERRGCSGG---------------- 360

Query: 318  HLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDC 497
                            G+G A SRLD FLLSE  I K  ++ QW+G+RDISDH PIWL C
Sbjct: 361  ----------------GNGSAMSRLDCFLLSEGFIEKGGISNQWIGDRDISDHCPIWLVC 404

Query: 498  SVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKE 677
            S  DWGPKPF+FNNCWL H  F  FV E WK  +V G K Y               WN+E
Sbjct: 405  SNLDWGPKPFKFNNCWLEHPNFIPFVTETWKKLEVKGKKAYVLKEKLKRLKDSLNVWNRE 464

Query: 678  VFGFMDLNIENIVQDMNVLDDIV--GSGDSRVGENRKELTSRFWHQIHAKESLIRQKARC 851
            VFG +DLNI   V+++N  +D++  G+GD  + +  KEL   FW QI  KESL+ QKAR 
Sbjct: 465  VFGIIDLNINKTVKELNEAEDLIANGNGDPTLFKT-KELVKSFWDQILYKESLLHQKART 523

Query: 852  KWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNES 1031
            KWI EGD+N+RFFHA ++ RRRRNQ++ L+K + W++GV  +K EVK HF   F E   +
Sbjct: 524  KWIQEGDSNSRFFHASIKSRRRRNQLVMLKKGEGWIQGVTNIKKEVKDHFAHHFTEEWNN 583

Query: 1032 RPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIK 1211
            RP L GI F  +S+ +N  L EPFT EE+KD +WSC+G KSPGPDG NF F K  W ++K
Sbjct: 584  RPFLQGITFNTLSSEENSFLLEPFTEEEVKDTIWSCDGNKSPGPDGLNFNFLKSCWCIVK 643


>GAU47212.1 hypothetical protein TSUD_403530 [Trifolium subterraneum]
          Length = 799

 Score =  317 bits (813), Expect = 1e-97
 Identities = 166/404 (41%), Positives = 238/404 (58%), Gaps = 1/404 (0%)
 Frame = +3

Query: 3    MIIIWDTNVLEXXXXXXXXXXXXXCARKRRSSDVFFMVNVYSPCDWGGKNQCWKELILCK 182
            M+ IW+T++               C     S  V +++NVY+PC   GK + W +L+  K
Sbjct: 34   MLTIWNTDLFVFKYSFTGDGFLGICVDWNGS--VLYIINVYAPCTLSGKRKLWDDLLNFK 91

Query: 183  RILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWW 362
                   WC+ GDFN++    E++G+S+  R+ +   F QF+  M +VDVPV GKKFTW+
Sbjct: 92   SSNEEGEWCLGGDFNAILKIGERKGSSSLIRQNERWEFRQFVEGMEVVDVPVTGKKFTWF 151

Query: 363  SGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNC 542
            S D +A SRLDRFLLSE  I K  V+ QWVG+RDISDH PIWL  ++   G K F     
Sbjct: 152  SADRKAMSRLDRFLLSEGFIDKAGVSGQWVGDRDISDHCPIWLSLNIE--GKKAFVL--- 206

Query: 543  WLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQD 722
                KE  + ++E  KG                        WN+EVF  +DLNIE  V++
Sbjct: 207  ----KEKLTKLKEALKG------------------------WNREVFRVLDLNIEKPVKE 238

Query: 723  MNVLDDIVGSGDSRVG-ENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHAC 899
            +N ++ ++ S D      +R  +   FW Q+H KESL++QK+R +W+  GD N+R+ HA 
Sbjct: 239  LNEVESMLASDDVMADFVDRGGIQKNFWDQLHYKESLLKQKSRMRWVKYGDTNSRYIHAS 298

Query: 900  VRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNESRPVLDGIVFPQISAAD 1079
            ++GRRRRNQ++ ++K D W++GVD +K+EVK+HF+  F E   +RP L GI F  ++  D
Sbjct: 299  LKGRRRRNQMVTIKKGDEWLQGVDCIKNEVKQHFEKNFSEEWLNRPFLSGIDFNVLNNED 358

Query: 1080 NGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIK 1211
            N +L EPF  EE+++ +WS +G KSPGPDGFNF F K  W +IK
Sbjct: 359  NSLLLEPFGEEEVREVIWSSDGNKSPGPDGFNFNFLKACWTMIK 402


>GAU47952.1 hypothetical protein TSUD_06860 [Trifolium subterraneum]
          Length = 724

 Score =  313 bits (802), Expect = 1e-96
 Identities = 143/300 (47%), Positives = 200/300 (66%), Gaps = 1/300 (0%)
 Frame = +3

Query: 315  MHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLD 494
            M L+DVPVLGKKF+W+S +G++ SR+DRFLLS+  + K+ +  QW+G+RDISDH PIWL 
Sbjct: 1    MTLLDVPVLGKKFSWFSANGKSMSRIDRFLLSDGFVSKYGITGQWIGDRDISDHCPIWLL 60

Query: 495  CSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNK 674
             S Y WGPKPFR  N WL H +F  FVE  WK F V G K Y               WNK
Sbjct: 61   VSSYKWGPKPFRVINGWLDHPDFLPFVESAWKSFVVHGKKAYVLKEKFRLLKERLRKWNK 120

Query: 675  EVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARC 851
            EV+G++DLNIE  V ++N +++++G  D  V   R++ L   FW Q++ KESL++QK+R 
Sbjct: 121  EVYGYLDLNIEKTVNEINDIENMLGDDDMEVELTRRQGLNKEFWSQLYHKESLLKQKSRT 180

Query: 852  KWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNES 1031
            +W+ EGD+N+R+FH  ++ RRRRNQ++AL+  D WV+GVD VK  VK  F++ F E   +
Sbjct: 181  RWVKEGDSNSRYFHESIKSRRRRNQLVALKDGDRWVQGVDEVKRFVKNFFENNFKENWAN 240

Query: 1032 RPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIK 1211
            RP L+GI F  +S  DN  L  PF+++E+++ VW+ +  K P PDG NF F K  W ++K
Sbjct: 241  RPNLNGITFQSLSEEDNVSLLPPFSIDEVREVVWNSDRNKCPSPDGLNFNFLKVCWNVLK 300


>KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine soja]
          Length = 326

 Score =  296 bits (757), Expect = 9e-95
 Identities = 141/269 (52%), Positives = 187/269 (69%), Gaps = 1/269 (0%)
 Frame = +3

Query: 204  WCVAGDFNSVSSSQEKRGASNNYRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRAS 383
            WC+ GDFN+VS+ +E+ G S N+   D+  FN F++EM+L+D P+ G KFT++  DG A+
Sbjct: 58   WCLVGDFNAVSNREERTGRSENWGYIDMVDFNAFVNEMNLIDPPLHGNKFTYFCSDGIAA 117

Query: 384  SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 563
            SRLDRFL+S+ ++  WQV  Q VG RDISDH PIWL+CS  +WGPKPFRFNNCWL H  F
Sbjct: 118  SRLDRFLVSDGIMNLWQVKGQRVGKRDISDHCPIWLECSNLNWGPKPFRFNNCWLEHDGF 177

Query: 564  KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNVLDDI 743
            KSF+ E WK  Q+ G K Y               WNKEVFG++DLNIENIV DMN LD  
Sbjct: 178  KSFIVEEWKKIQITGRKAYVIKEKLKIIRESLKKWNKEVFGWLDLNIENIVADMNELDRG 237

Query: 744  VGSG-DSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRR 920
            +  G +  V   +KE  + FW Q+  KESL++QK+R +WI EGD+NT+FFH+C++ RRR+
Sbjct: 238  IEEGCNLNVVVKKKEANALFWQQLMMKESLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRK 297

Query: 921  NQILALRKDDAWVEGVDGVKHEVKRHFQD 1007
            NQIL+L+ +   VE V  VK EV+R F++
Sbjct: 298  NQILSLQVEGRCVEQVGEVKMEVRRFFEE 326


>GAU24549.1 hypothetical protein TSUD_148900 [Trifolium subterraneum]
          Length = 1239

 Score =  318 bits (814), Expect = 9e-95
 Identities = 157/377 (41%), Positives = 224/377 (59%), Gaps = 7/377 (1%)
 Frame = +3

Query: 102  VFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNN---- 269
            V  +VNVYS CD G K   W  L+  +R +GG  WCV GDFN+V    E+ G ++     
Sbjct: 103  VCIVVNVYSKCDVGSKRLLWNNLLNVRRGIGGGRWCVVGDFNAVCRRDERMGVNSGDGGG 162

Query: 270  YRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQW 449
                +I  F +FI E+ LVD+P++G++FTW+  +GRA SR+DR L+S+E  L+W     W
Sbjct: 163  SSLTEIGEFCKFIEELELVDLPLVGRRFTWYHANGRAMSRIDRILISDEWALRWGNCDLW 222

Query: 450  VGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFX 629
            V  RD+SDH P+ L  +   WGPKPFRFNN WL +K+ K  VE CW   +V GW  +   
Sbjct: 223  VLPRDVSDHCPLILKYNQDGWGPKPFRFNNFWLQNKKLKEVVESCWSNLRVEGWMGFVLK 282

Query: 630  XXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNVLD---DIVGSGDSRVGENRKELTSRF 800
                        W+K+ +  ++  +E ++ +++ LD   + VG     V E RKE     
Sbjct: 283  EKLKGLKSTLKEWHKKEYESLEARVEELMGEISELDKRGEEVGLSQHEV-EIRKEKFGGL 341

Query: 801  WHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVK 980
            W  + +KE+L+ Q++R KW+ EGDANT FFH  VR R + N+I ALR D+ W++  + + 
Sbjct: 342  WKLLKSKEALLFQRSRSKWLKEGDANTNFFHQSVRSRLKSNRISALRVDEVWLDSPNLII 401

Query: 981  HEVKRHFQDFFLEPNESRPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPG 1160
              V  +FQ+        RP L+G+VF  +S  +N  LTE F++EEIK+ VW  +G KSPG
Sbjct: 402  GAVNSYFQNHVSSNYVVRPKLEGVVFSMLSEEENVSLTENFSLEEIKEVVWCSDGNKSPG 461

Query: 1161 PDGFNFTFYKQFWELIK 1211
            PDGFNF F K FWEL++
Sbjct: 462  PDGFNFAFLKNFWELLR 478


>XP_019433565.1 PREDICTED: uncharacterized protein LOC109340346 [Lupinus
            angustifolius]
          Length = 410

 Score =  296 bits (758), Expect = 9e-94
 Identities = 145/372 (38%), Positives = 210/372 (56%), Gaps = 3/372 (0%)
 Frame = +3

Query: 111  MVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIA 290
            ++NVY PCD  GK + W+E    K  L   LWCV GDFNS+   +E++G   ++ RR+  
Sbjct: 8    IMNVYCPCDLDGKRKFWEEAKATKLGLSSNLWCVVGDFNSILQQEERKGIGLHHNRREGE 67

Query: 291  RFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDIS 470
             FNQF+ +M L  +P+ GKKFTW+  +G A SR+DRFL+++  ++ W    Q    +  S
Sbjct: 68   EFNQFVLDMELFYLPLSGKKFTWFLSNGNAMSRIDRFLVNDGWLVSWGNLVQLGLPKTFS 127

Query: 471  DHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXX 650
            DH PI L     DWGP PF  NNCW     F  FV E WK  ++ G   + F        
Sbjct: 128  DHCPILLKIDNSDWGPTPFHTNNCWFSDHRFNQFVVEEWKKLEINGRGSFVFKEKLKKLK 187

Query: 651  XXXXXWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGEN---RKELTSRFWHQIHAK 821
                 WNK  FG +D  IE+ V+ +N + D  GS    + E+    +E T+  W      
Sbjct: 188  VALKRWNKHHFGMLDRKIEDQVEVINYV-DAKGSSSIILDEDINLSREATTELWRLSRQN 246

Query: 822  ESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHF 1001
            ++L+ QK+R +WI +GD+N++FFH  +   ++  +   L  +  W+E    VK+ +   F
Sbjct: 247  DNLLLQKSRQRWIRDGDSNSKFFHLSINKNQKFKKFTGLSIEGEWIEDPTRVKNYISSSF 306

Query: 1002 QDFFLEPNESRPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFT 1181
            +  F E    RP LDGI F  ++A +N  LT  F +EEIK A+WSC+G+KSP PDGFNF+
Sbjct: 307  EAKFEECTGCRPSLDGIGFNHLAAEENAFLTAKFEVEEIKGAIWSCDGDKSPSPDGFNFS 366

Query: 1182 FYKQFWELIKTE 1217
            F K+FWE +K +
Sbjct: 367  FLKKFWECLKDD 378


>GAU30605.1 hypothetical protein TSUD_62250 [Trifolium subterraneum]
          Length = 779

 Score =  306 bits (785), Expect = 1e-93
 Identities = 138/279 (49%), Positives = 186/279 (66%), Gaps = 1/279 (0%)
 Frame = +3

Query: 384  SRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEF 563
            SRLDRFLLSE  I K  +  QWVG+RDISDH PIWL+ S  +WGPKPF+FNNCWL H +F
Sbjct: 2    SRLDRFLLSEGFIEKGSITNQWVGDRDISDHCPIWLESSNSNWGPKPFKFNNCWLDHPDF 61

Query: 564  KSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNVLDDI 743
              FV+  W+   + G K +               WN+E+FG MDLNIE  V+DMN ++++
Sbjct: 62   IPFVKTTWEQMDICGKKAFIVKEKMKKLKEALKAWNREIFGLMDLNIEKTVKDMNEVEEM 121

Query: 744  VGSGDSR-VGENRKELTSRFWHQIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRR 920
            + +GDS  +  N KEL+  FW Q+H KESL++QK+R KWI EGD+NTR+FHA ++GRRRR
Sbjct: 122  LANGDSHPIFINSKELSKNFWEQLHFKESLLQQKSRTKWIKEGDSNTRYFHASIKGRRRR 181

Query: 921  NQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNESRPVLDGIVFPQISAADNGVLTEP 1100
            N ++ ++K   W++GV  +K E   H+   F E    RP L GI F  +SA DN  L EP
Sbjct: 182  NNVVKIKKGTEWLQGVAAIKSEAIDHYSKLFSEEWLQRPFLQGINFKTLSADDNAFLLEP 241

Query: 1101 FTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIKTE 1217
            F  +E+++ +WSC+G KSPGPDGFN  F K  W ++K++
Sbjct: 242  FAEDEVRETIWSCDGNKSPGPDGFNINFLKACWSIVKSD 280


>GAU51438.1 hypothetical protein TSUD_413380, partial [Trifolium subterraneum]
          Length = 948

 Score =  307 bits (786), Expect = 2e-92
 Identities = 152/357 (42%), Positives = 212/357 (59%), Gaps = 4/357 (1%)
 Frame = +3

Query: 159  WKELILCKRILGGELWCVAGDFNSVSSSQEKRGAS-NNYRRRDIARFNQFISEMHLVDVP 335
            W +L++  R+ G + +C+ GDFNSV S  E++GAS       D+  FN FI    L+D+P
Sbjct: 195  WVDLLVALRVYGADHYCILGDFNSVRSRDERKGASAGGEAAEDMRIFNIFIENSGLIDLP 254

Query: 336  VLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSVYDWG 515
            ++G+KFTW   +GR  SRLDR L+S+    +W     W   RD+SDHSPI +  + +DWG
Sbjct: 255  LMGRKFTWMQPNGRCLSRLDRVLVSQNWHKEWGNETLWGLKRDVSDHSPILVKYNDHDWG 314

Query: 516  PKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVFGFMD 695
            PKPFRFNN W  +  F+  V E W  + V GWK Y               WNKEV+G +D
Sbjct: 315  PKPFRFNNYWWSNPSFRKVVSEAWGSYNVTGWKGYVVKEKMKLLKGVLKTWNKEVYGNID 374

Query: 696  LNIENIVQDMNVLD---DIVGSGDSRVGENRKELTSRFWHQIHAKESLIRQKARCKWIAE 866
              IE +  D++VL+   + VG  +  + + RK++    W  +  K++L  QK+R +W+ E
Sbjct: 375  SKIEELTNDIDVLELKSETVGLEEEELIK-RKKMFHELWGLLKCKDTLEFQKSRSRWLME 433

Query: 867  GDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNESRPVLD 1046
            GDANT +FHACV+GRRR N I+AL+K  +W+   + ++ E+  +F   F E    RP LD
Sbjct: 434  GDANTGYFHACVKGRRRSNSIVALKKGRSWISNPNEIRMEIVSYFMKHFEEVYRVRPRLD 493

Query: 1047 GIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIKTE 1217
            G+VFP I   D   L   F  EEI D +   +G KSPGPDGFNF+F+K FW LIK E
Sbjct: 494  GVVFPAIGDEDRSRLEIDFLEEEIVDIISMADGNKSPGPDGFNFSFFKNFWGLIKRE 550


>GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterraneum]
          Length = 862

 Score =  304 bits (778), Expect = 7e-92
 Identities = 143/322 (44%), Positives = 208/322 (64%), Gaps = 1/322 (0%)
 Frame = +3

Query: 90   RSSDVFFMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNN 269
            R   V  +VN+YSPC   GK + W++L+  K++  G   C+ GDFN++  S E++GAS +
Sbjct: 409  REGAVTHLVNIYSPCSLSGKKKLWEDLLEIKQLFTGGECCLRGDFNAILHSSERKGASAD 468

Query: 270  YRRRDIARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQW 449
             R+ +   FN+F+ EM ++DVPVLG K +W S DG++ SRLDRF+LS+  I K+ +  QW
Sbjct: 469  SRQGERMMFNRFVEEMEMIDVPVLGMKVSWVSADGKSMSRLDRFILSDGFITKFGIIGQW 528

Query: 450  VGNRDISDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFX 629
            +GNR+I DH PIWL  S  +WGPKPFR  N  L H +F  F+E CWK F + G K Y   
Sbjct: 529  IGNRNIFDHCPIWLYASAKNWGPKPFRAINGCLEHPDFLVFLESCWKSFDIQGTKAYVLK 588

Query: 630  XXXXXXXXXXXXWNKEVFGFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWH 806
                        WNKEVFG +DLNI+  V+++N ++ ++G  D  V   R+E L S FW 
Sbjct: 589  EKLRFLKEILKKWNKEVFGILDLNIDKTVKELNDIEKMLGDDDPDVELTRREGLNSEFWS 648

Query: 807  QIHAKESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHE 986
            Q+H KE L++QK+R + + EGD+N++FFH  ++ RRR+NQ++ L+  D WVEG++ VK  
Sbjct: 649  QLHFKEILLQQKSRTRRVKEGDSNSKFFHESIKRRRRKNQLVVLKDGDQWVEGMEEVKGY 708

Query: 987  VKRHFQDFFLEPNESRPVLDGI 1052
            VK  F++ F E   +RP L+G+
Sbjct: 709  VKNFFENNFRERWPNRPNLNGM 730


>KYP46096.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 729

 Score =  300 bits (768), Expect = 1e-91
 Identities = 148/372 (39%), Positives = 207/372 (55%), Gaps = 2/372 (0%)
 Frame = +3

Query: 108  FMVNVYSPCDWGGKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDI 287
            F+VN+YSPCD  GK   W+E+   K   G   WC+ GDFN+V    E++G       +++
Sbjct: 25   FIVNIYSPCDLRGKKNLWEEIHKIKNSYGSGRWCICGDFNTVRLKSERKGVHTRREEKEM 84

Query: 288  ARFNQFISEMHLVDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDI 467
              +NQFI ++ L+D+P+ G K+TW+  +   +SR+DRFL+S+E + +W   +Q    RD+
Sbjct: 85   LCYNQFIEDVELIDLPLGGGKYTWFRPNRIIASRIDRFLVSQEWLTQWPHCSQKALQRDV 144

Query: 468  SDHSPIWLDCSVYDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXX 647
            SDH PI L     DWGPKPFR  NCW     F  FVEE WKGF V GW  +         
Sbjct: 145  SDHRPILLKDIRLDWGPKPFRSLNCWFDDPSFLGFVEEKWKGFSVTGWGAFILKEKLKHL 204

Query: 648  XXXXXXWNKEVFGFMDLNIENIVQDMNVLDDIV--GSGDSRVGENRKELTSRFWHQIHAK 821
                  WNK+ FG +   IE + +++N LD IV   S + R   +R+ L  + W  ++ K
Sbjct: 205  KKSIKEWNKQAFGNIHTQIEEVKRNINSLDSIVETRSLNERKVSDRRNLNVKLWDLLNKK 264

Query: 822  ESLIRQKARCKWIAEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHF 1001
            ESL+ QK+R KW  EGD+N+ FFH CV  RR+ N+I+ L  +  W               
Sbjct: 265  ESLLLQKSRLKWAREGDSNSSFFHMCVNKRRKMNEIIGLDVNGKW--------------- 309

Query: 1002 QDFFLEPNESRPVLDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFT 1181
                        +LDGI F Q++      LT PFT EEI++AVWSCE  KSPG DGFN  
Sbjct: 310  ------------LLDGIQFQQLNTHQCRSLTRPFTAEEIREAVWSCESNKSPGSDGFNML 357

Query: 1182 FYKQFWELIKTE 1217
            F K+ W+++K +
Sbjct: 358  FIKKCWDILKND 369


>GAU32122.1 hypothetical protein TSUD_218730 [Trifolium subterraneum]
          Length = 1246

 Score =  307 bits (786), Expect = 9e-91
 Identities = 148/359 (41%), Positives = 218/359 (60%), Gaps = 1/359 (0%)
 Frame = +3

Query: 144  GKNQCWKELILCKRILGGELWCVAGDFNSVSSSQEKRGASNNYRRRDIARFNQFISEMHL 323
            GK + W++L++ K+  GG  WC+ GDFN+V  S E++G S + R  + A FN+F+ EM  
Sbjct: 422  GKKKLWEDLVIFKQQSGGGEWCLGGDFNAVLHSSERKGISADSRHAERACFNRFVEEMEE 481

Query: 324  VDVPVLGKKFTWWSGDGRASSRLDRFLLSEELILKWQVAAQWVGNRDISDHSPIWLDCSV 503
            +DVP+LGKKF+W+S DG                           +RDISDH  +WL    
Sbjct: 482  IDVPILGKKFSWFSTDG---------------------------DRDISDHCLVWLVSES 514

Query: 504  YDWGPKPFRFNNCWLGHKEFKSFVEECWKGFQVGGWKIYAFXXXXXXXXXXXXXWNKEVF 683
             +WGPKPF+  N WL H +F SFVE+  KGF+V G K Y               WN+EVF
Sbjct: 515  KNWGPKPFKVINGWLEHPKFFSFVEKSRKGFKVSGKKAYVLKEKFRMLKECLRKWNREVF 574

Query: 684  GFMDLNIENIVQDMNVLDDIVGSGDSRVGENRKE-LTSRFWHQIHAKESLIRQKARCKWI 860
            G +DLNIE  V+D+N ++ ++G  +  +   R+E L   FW Q+H KESL++QK+R +W+
Sbjct: 575  GILDLNIEKTVKDLNNIEGLMGDDEMDLELTRREGLNKEFWRQLHLKESLLKQKSRMRWV 634

Query: 861  AEGDANTRFFHACVRGRRRRNQILALRKDDAWVEGVDGVKHEVKRHFQDFFLEPNESRPV 1040
             EGD+N+R+FH  ++  RRRN ++AL+  +  V+GV+ VK  VK  F + F E  E  P 
Sbjct: 635  KEGDSNSRYFHESIKSIRRRNHLVALKDGEQRVQGVEEVKVFVKNFFDNNFRESLEDIPN 694

Query: 1041 LDGIVFPQISAADNGVLTEPFTMEEIKDAVWSCEGEKSPGPDGFNFTFYKQFWELIKTE 1217
            L+G+ F  ++  DN  L +PF+++E+K+A+W  +G K P PDGFNF F+K  WE++K +
Sbjct: 695  LNGVQFQSLTDEDNLSLLDPFSIDEVKEAIWCSDGNKCPRPDGFNFNFFKTCWEIVKDD 753

