BLASTX nr result

ID: Glycyrrhiza36_contig00023893 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00023893
         (1440 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterran...   283   e-100
GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterran...   280   9e-99
GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterran...   278   6e-89
GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]   234   5e-86
GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterran...   263   5e-83
KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine...   231   8e-82
GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterran...   260   9e-82
GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterran...   209   6e-78
GAU47735.1 hypothetical protein TSUD_386940 [Trifolium subterran...   232   2e-76
GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium ...   198   7e-76
GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterran...   239   1e-75
KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan]             241   5e-74
KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine...   231   9e-74
KYP32205.1 Putative ribonuclease H protein At1g65750 family [Caj...   238   3e-73
GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran...   225   5e-73
GAU32122.1 hypothetical protein TSUD_218730 [Trifolium subterran...   201   1e-70
GAU48812.1 hypothetical protein TSUD_406450 [Trifolium subterran...   213   5e-70
GAU51943.1 hypothetical protein TSUD_417260, partial [Trifolium ...   215   1e-68
KYP31897.1 Putative ribonuclease H protein At1g65750 family [Caj...   216   4e-68
GAU09987.1 hypothetical protein TSUD_393040 [Trifolium subterran...   208   9e-68

>GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterraneum]
          Length = 1794

 Score =  283 bits (725), Expect(3) = e-100
 Identities = 137/334 (41%), Positives = 189/334 (56%), Gaps = 1/334 (0%)
 Frame = +3

Query: 111  GDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLSSGLL 290
            G  AKRRRL  L+ +G FD   LQETK  +  + +I+  W   D EWVA+ + GLS GLL
Sbjct: 690  GSCAKRRRLSKLLASGTFDLCLLQETKRDNFDDLMIQKLWGHKDVEWVAKESIGLSGGLL 749

Query: 291  TVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMSKRG 470
             +W + + ++++ F+G+ F+G+C   K   L   ++NIYSPCS+ GKR+LW  L+  K+ 
Sbjct: 750  IMWNAGLFNLKFSFTGDNFLGLCVECKEGILF--IINIYSPCSLSGKRKLWSDLLEFKQN 807

Query: 471  FGGTTWCVAGDFNAITVYSESRGVSAQFGCRECVEFNNFISEMNIVDIPVMGKKFTWFNA 650
                 WC+ GDFN +    E +G SA     E +EF  F+  M + D+PV GKKF+WF+A
Sbjct: 808  NEQGEWCLGGDFNVVLKTGERKGSSALCRQNERLEFCQFVEAMELCDVPVAGKKFSWFSA 867

Query: 651  DGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPFKFNNC*L 830
            DG++M                +   QW+G RDISDHCPIW+LCS  +WGPKPFK NNC L
Sbjct: 868  DGTSMSRLDRFLLSEKFIDSEKVTGQWIGSRDISDHCPIWLLCSNLNWGPKPFKVNNCWL 927

Query: 831  EHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIEKIVEDIN 1010
            EH     FVE+ W    V G K               R+WNR+VFG +DLNIE IV ++N
Sbjct: 928  EHPEFKPFVEKTWNKLNVEGKKAFVIKEKLKRLKEELRRWNRDVFGILDLNIENIVRELN 987

Query: 1011 ILDAVAASNNLED-NSRRKELTVQFWQQIQNKES 1109
              + + A +           +  +FW Q+  KES
Sbjct: 988  EAEGLLAIDGANSVTCDVSAINKKFWDQLHFKES 1021



 Score =  102 bits (255), Expect(3) = e-100
 Identities = 49/99 (49%), Positives = 69/99 (69%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            SL+KQKSR KW+ EGD+N+RFFHA + ++RRRNQL  L+ G  WI GV  +K+EVKN+F 
Sbjct: 1021 SLIKQKSRLKWVREGDSNSRFFHASIKSRRRRNQLSILRRGEEWIQGVDNIKSEVKNYFV 1080

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPFPPKKSR 1401
              F E  ++R  + G+NF+ ++A+DN  L  PF  +  R
Sbjct: 1081 TNFTEDWHNRPFVHGINFNVLSAKDNDFLLQPFSEEDVR 1119



 Score = 32.7 bits (73), Expect(3) = e-100
 Identities = 10/19 (52%), Positives = 16/19 (84%)
 Frame = +3

Query: 1383 SPEEIKDAIWSCDGDKCPG 1439
            S E++++ +WSCDG+K PG
Sbjct: 1114 SEEDVREVLWSCDGNKSPG 1132


>GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterraneum]
          Length = 721

 Score =  280 bits (717), Expect(3) = 9e-99
 Identities = 145/345 (42%), Positives = 201/345 (58%), Gaps = 1/345 (0%)
 Frame = +3

Query: 78   MKILSLNVRGLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVA 257
            ++I+SLN+RG G  AKRRRL S +  G FD   LQETK +DI + LI   W   D  WVA
Sbjct: 103  VEIVSLNMRGWGGSAKRRRLSSFLQKGAFDVCLLQETKKADIEDFLIHNLWGHKDVNWVA 162

Query: 258  RGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRR 437
            +  +GLS G+L +W      +   + G+ ++GI    +   L  ++VN+YSPC I GK++
Sbjct: 163  KNPTGLSGGMLIIWNFDFFSLLNSYYGDGYLGIRVDREGDEL--NIVNVYSPCIISGKKK 220

Query: 438  LWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECVEFNNFISEMNIVDIP 617
            LW  L+  K+  GG  WCV GDFN+I   SE +G S      E   FN F+ EM ++D P
Sbjct: 221  LWEDLLALKQSTGGGKWCVRGDFNSILHSSERKGSSIVSRQNESSLFNRFVEEMELIDTP 280

Query: 618  VMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWG 797
            V+GKKF+WF+ADG +M              ++    +W+GDRDIS HCPIW+LCS+ +WG
Sbjct: 281  VLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGITGKWIGDRDISYHCPIWLLCSSYNWG 340

Query: 798  PKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVD 977
            PKPF+  N  +EH +   FVE  W+SF V G K                    EV+GF+D
Sbjct: 341  PKPFRVINGWMEHPDFFDFVETTWKSFDVHGKK-------------------GEVYGFLD 381

Query: 978  LNIEKIVEDINILDAVAASNNLE-DNSRRKELTVQFWQQIQNKES 1109
            LNIEK V DIN+++ +   ++ E D +RR  L   FW+Q+ +KES
Sbjct: 382  LNIEKTVTDINVIENLLGGDDEEIDLTRRAGLNKDFWKQLIHKES 426



 Score =  102 bits (254), Expect(3) = 9e-99
 Identities = 49/93 (52%), Positives = 67/93 (72%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            SLLKQKSR +W+ EGD+N++FFH  + ++RRRNQL+AL+ G+ W+ GV +VKA VKN+FE
Sbjct: 426  SLLKQKSRMRWVKEGDSNSKFFHESIKSRRRRNQLVALKDGDRWVQGVDDVKAFVKNYFE 485

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F E    R  L+G+ F  ++ EDN  L APF
Sbjct: 486  NNFREDWAYRPNLNGIAFQSLSEEDNLSLMAPF 518



 Score = 29.6 bits (65), Expect(3) = 9e-99
 Identities = 10/19 (52%), Positives = 15/19 (78%)
 Frame = +3

Query: 1383 SPEEIKDAIWSCDGDKCPG 1439
            S +E+++ IWS D +KCPG
Sbjct: 519  SIDEVREVIWSSDWNKCPG 537


>GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterraneum]
          Length = 695

 Score =  278 bits (711), Expect(2) = 6e-89
 Identities = 139/331 (41%), Positives = 190/331 (57%), Gaps = 1/331 (0%)
 Frame = +3

Query: 120  AKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLSSGLLTVW 299
            AKRRRL SLI  G FD   LQETK  +  + +I   W   D EWVA+ ++GLS GLL+VW
Sbjct: 2    AKRRRLSSLIKTGAFDMCMLQETKRDNFEDYMIHNLWGHTDVEWVAKKSNGLSGGLLSVW 61

Query: 300  KSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMSKRGFGG 479
               +   +Y F+G+ F+G+C  +K +GL+ ++VN+YS C++ GKR+LW  L+  K     
Sbjct: 62   NKDLFSFRYSFTGDGFLGVCVEWK-AGLL-YIVNVYSSCNVSGKRKLWNDLIDFKLNNEP 119

Query: 480  TTWCVAGDFNAITVYSESRGVSA-QFGCRECVEFNNFISEMNIVDIPVMGKKFTWFNADG 656
              WC+ GDFN+I+   E RG S+  +   E +EF  FI  + +VDIP+  K FTWFN+DG
Sbjct: 120  EEWCLGGDFNSISKVGERRGSSSGAWRQGERIEFIQFIDALEVVDIPLKDKMFTWFNSDG 179

Query: 657  SAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPFKFNNC*LEH 836
            SAM              +   + QWVGDRDISDHCPIW++CS  +WGPKPF FNNC LEH
Sbjct: 180  SAMSRLNHFLVSEGFIEKGSLSYQWVGDRDISDHCPIWLMCSNINWGPKPFTFNNCWLEH 239

Query: 837  KNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIEKIVEDINIL 1016
                 FV+E W++  + G K               + WNREVFGF++L I+K V ++N  
Sbjct: 240  PKFFEFVKETWENMDIRGKKAFIIKEKLKGLKEALKVWNREVFGFMELKIDKTVNELN-- 297

Query: 1017 DAVAASNNLEDNSRRKELTVQFWQQIQNKES 1109
                                +FW+Q+  KES
Sbjct: 298  --------------------EFWEQLNFKES 308



 Score = 80.1 bits (196), Expect(2) = 6e-89
 Identities = 35/76 (46%), Positives = 53/76 (69%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            SLL QKSR KW  EGD+N+R+FHA + ++RR+NQ++ L+    WI GV E+K EV++H+ 
Sbjct: 308  SLLHQKSRTKWAKEGDSNSRYFHASIKSRRRKNQIVTLKKDGEWIQGVAEIKEEVRDHYS 367

Query: 1285 RLFHEVGYSRLVLDGV 1332
            + F E   ++  L G+
Sbjct: 368  KHFFEEWGNKPFLQGL 383


>GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]
          Length = 1594

 Score =  234 bits (597), Expect(3) = 5e-86
 Identities = 118/328 (35%), Positives = 170/328 (51%), Gaps = 1/328 (0%)
 Frame = +3

Query: 129  RRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLSSGLLTVWKSS 308
            RR+R     G FD   LQETK  +  + +I+  W   D EWVA+G+ GLS G L      
Sbjct: 650  RRMREQRNQGTFDICLLQETKRDNFDDFMIQNVWGHKDVEWVAKGSVGLSGGNL------ 703

Query: 309  MLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMSKRGFGGTTW 488
                                       +++N+YSPCS+ GKR+LW  L+  K       W
Sbjct: 704  ---------------------------YIINVYSPCSLSGKRKLWSDLLEFKLNNEQGEW 736

Query: 489  CVAGDFNAITVYSESRGVSAQFGCRECVEFNNFISEMNIVDIPVMGKKFTWFNADGSAMX 668
            C+ GDFN +    E +G ++     E +EF  F+  M ++D+PV GKKF+WF+ADG+A+ 
Sbjct: 737  CLRGDFNVVLNVGERKGSTSSARQNERLEFCQFVEAMELIDVPVAGKKFSWFSADGNAIS 796

Query: 669  XXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPFKFNNC*LEHKNLM 848
                         + +   QW+G+ DISDHCPIW++CS  +WGPKPFK NNC LEH    
Sbjct: 797  RLDRFLLSDNFIEKEEVAGQWIGNHDISDHCPIWLMCSNLNWGPKPFKVNNCWLEHSEFK 856

Query: 849  SFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIEKIVEDINILDAVA 1028
             FVE+ W+   + G K               R WNREVF  +DLNIEK V+++N ++ + 
Sbjct: 857  LFVEKTWEKLNIRGKKAFVIKEKLKRLKEELRGWNREVFSILDLNIEKTVKELNEVEGLV 916

Query: 1029 ASNNLEDNSRRKE-LTVQFWQQIQNKES 1109
             ++ +      K  +  +FW+Q+  KES
Sbjct: 917  GNDGVNSVMGDKSGVNRKFWEQLYFKES 944



 Score =  102 bits (255), Expect(3) = 5e-86
 Identities = 50/103 (48%), Positives = 70/103 (67%)
 Frame = +1

Query: 1093 YKIRSLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVK 1272
            Y   S++KQKSR KW+ EGD+NTRFF A +  +RRRNQL+ L+ G+  I GV  +K EVK
Sbjct: 940  YFKESMIKQKSRLKWVREGDSNTRFFQASLKNRRRRNQLVLLRRGDDLIQGVDNIKMEVK 999

Query: 1273 NHFERLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPFPPKKSR 1401
            NHF R F E  + R  ++G+NF++++ EDN  L  PF  ++ R
Sbjct: 1000 NHFARNFTEEWHHRPFVNGINFNELSTEDNEFLLQPFSEERVR 1042



 Score = 32.7 bits (73), Expect(3) = 5e-86
 Identities = 11/19 (57%), Positives = 15/19 (78%)
 Frame = +3

Query: 1383 SPEEIKDAIWSCDGDKCPG 1439
            S E +++ IWSCDG+K PG
Sbjct: 1037 SEERVREVIWSCDGNKSPG 1055


>GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterraneum]
          Length = 1892

 Score =  263 bits (671), Expect(3) = 5e-83
 Identities = 125/352 (35%), Positives = 200/352 (56%), Gaps = 7/352 (1%)
 Frame = +3

Query: 66   KWCSMKILSLNVRGLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDF 245
            KWCSM + +LN+RGLG R KRR++R  +   K D + LQETK    ++  I++ W   + 
Sbjct: 749  KWCSMIVGTLNIRGLGSRVKRRKVREFVSGEKVDFLALQETKLESFSDSFIQSLWGSENC 808

Query: 246  EWVARGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSID 425
            +W    A G S GL+++WK S+  V Y FSG+ F+G+C         C ++N+Y+ C++ 
Sbjct: 809  DWACLPAIGNSGGLISIWKKSLFSVVYTFSGHGFVGVCLDVVQDQSRCFVLNVYAKCNLS 868

Query: 426  GKRRLWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRG-----VSAQFGCRECVEFNNFI 590
             KRRLWG ++MS+RGFG   WCV GDFNA+   SE RG     V++Q   +E +EF+ F+
Sbjct: 869  DKRRLWGEIIMSRRGFGRGCWCVLGDFNAVRDVSERRGARQLVVNSQ--SKEVLEFDLFL 926

Query: 591  SEMNIVDIPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIW 770
             E+ ++D+P++G++FTWF+ +G AM              +W+    W   RD+SDHCP+ 
Sbjct: 927  EELELIDMPLIGRRFTWFHPNGVAMSRLDRVLLSAEWISKWENPNVWALSRDVSDHCPLV 986

Query: 771  ILCSTQDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKW 950
            +  +  DWGPKPF+FNN  L + +    V + W+    SGW                + W
Sbjct: 987  VRYNNMDWGPKPFRFNNFWLHNNSFRELVVKTWEDQTFSGWMGFVLKDRLKGLKVSIKGW 1046

Query: 951  NREVFGFVDLNIEKIVEDINILDAVAASN--NLEDNSRRKELTVQFWQQIQN 1100
            + EV+G  +   ++++E I  +D  + S+  +L++ + RK L    W  +++
Sbjct: 1047 SAEVYGKAEEKKKQLIEKILEIDLRSESSGISLDEVAVRKSLFDDLWMLLKS 1098



 Score = 72.8 bits (177), Expect(3) = 5e-83
 Identities = 32/72 (44%), Positives = 50/72 (69%)
 Frame = +1

Query: 1117 QKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFERLFH 1296
            Q+SR+KW+ EGD+NTR+FHA V A++R N LLA+Q    W++G V V+  + + F++ F 
Sbjct: 1105 QRSRSKWLKEGDSNTRYFHARVMARKRTNNLLAIQTPEGWVEGPVNVREAIVSFFKKHFD 1164

Query: 1297 EVGYSRLVLDGV 1332
               ++R  L+GV
Sbjct: 1165 NEAWNRPQLEGV 1176



 Score = 24.3 bits (51), Expect(3) = 5e-83
 Identities = 9/16 (56%), Positives = 12/16 (75%)
 Frame = +3

Query: 1392 EIKDAIWSCDGDKCPG 1439
            EI+ A+ + DG KCPG
Sbjct: 1197 EIEAAVKASDGSKCPG 1212


>KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine soja]
          Length = 362

 Score =  231 bits (589), Expect(2) = 8e-82
 Identities = 117/300 (39%), Positives = 162/300 (54%), Gaps = 1/300 (0%)
 Frame = +3

Query: 213  LIETFWRDCDFEWVARGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCH 392
            ++E  W +   +W+A  +SGLS GLL +WK  +  V+ +FSG+ FIG+C  F ++G    
Sbjct: 1    VVENMWGNQLIDWIALPSSGLSGGLLMMWKRGLWVVKSNFSGHGFIGVCVEFNSAG---- 56

Query: 393  LVNIYSPCSIDGKRRLWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECV 572
                                           WC+ GDFNA++   E  G S  +G  + V
Sbjct: 57   ------------------------------EWCLVGDFNAVSNREERTGRSENWGYIDMV 86

Query: 573  EFNNFISEMNIVDIPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDIS 752
            +FN F++EMN++D P+ G KFT+F +DG A                WQ   Q VG RDIS
Sbjct: 87   DFNAFVNEMNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQVKGQRVGKRDIS 146

Query: 753  DHCPIWILCSTQDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXX 932
            DHCPIW+ CS  +WGPKPF+FNNC LEH    SF+ E+W+  Q++G K +          
Sbjct: 147  DHCPIWLECSNLNWGPKPFRFNNCWLEHDGFKSFIVEEWKKIQITGRKAYVIKEKLKIIR 206

Query: 933  XXXRKWNREVFGFVDLNIEKIVEDINILD-AVAASNNLEDNSRRKELTVQFWQQIQNKES 1109
               +KWN+EVFG++DLNIE IV D+N LD  +    NL    ++KE    FWQQ+  KES
Sbjct: 207  ESLKKWNKEVFGWLDLNIENIVADMNELDRGIEEGCNLNVVVKKKEANALFWQQLMMKES 266



 Score =  103 bits (256), Expect(2) = 8e-82
 Identities = 49/93 (52%), Positives = 66/93 (70%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            SLLKQKSR +WI EGD+NT+FFH+C+  +RR+NQ+L+LQV    ++ V EVK EV+  FE
Sbjct: 266  SLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRKNQILSLQVEGRCVEQVGEVKMEVRRFFE 325

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F E  +SR VL G+ F  + +E+N  L APF
Sbjct: 326  EGFKEASFSRPVLGGIEFQTLGSEENSFLVAPF 358


>GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score =  260 bits (665), Expect(3) = 9e-82
 Identities = 134/317 (42%), Positives = 181/317 (57%), Gaps = 1/317 (0%)
 Frame = +3

Query: 162  FDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLSSGLLTVWKSSMLHVQYHFSGN 341
            FD   LQETK       LI   W   D EWV + + GLS GLL+VW       ++ F+G+
Sbjct: 2    FDMCMLQETKRESFAEFLIHNLWGHRDVEWVHKESRGLSGGLLSVWNKDFCSFRHSFTGD 61

Query: 342  RFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMSKRGFGGTTWCVAGDFNAITV 521
             F+GIC  +K++  + + VNIY  CS+ GKR+LW  L+  K       WC+ GDFN+IT 
Sbjct: 62   GFLGICVEWKDT--LVYFVNIYFACSLAGKRKLWRDLIDFKLLNTPGEWCLGGDFNSITK 119

Query: 522  YSESRGVSAQFGCRECVEFNNFISEMNIVDIPVMGKKFTWFNADGSAMXXXXXXXXXXXX 701
             S+  G S     +E  EF  FI  M +VDIPV GKKFTW N+D SAM            
Sbjct: 120  VSKRSGSSNGSSNKERTEFAQFIDAMELVDIPVFGKKFTWSNSDNSAMSRLDRFLLSEGL 179

Query: 702  XXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQ 881
              +   + QWVG RDISDH PIW+ CS  +WGPKPFKFNN  L+H + + FV+  W+S  
Sbjct: 180  IEKGGISNQWVGGRDISDHHPIWLECSNINWGPKPFKFNNFWLDHPDFIPFVKATWESMN 239

Query: 882  VSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIEKIVEDINILDAVAASNNLED-NSR 1058
            + G K               + WNREVFG +DL+IEK V+D+N ++ + A+ +     S 
Sbjct: 240  IHGKKAFILKEKLKRLKEVLKTWNREVFGIMDLDIEKTVKDLNEVEEMIANGDCHPLFSN 299

Query: 1059 RKELTVQFWQQIQNKES 1109
             K+L+ +FW+Q+ NKES
Sbjct: 300  AKDLSKKFWEQLHNKES 316



 Score = 60.5 bits (145), Expect(3) = 9e-82
 Identities = 27/72 (37%), Positives = 46/72 (63%)
 Frame = +1

Query: 1186 AQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFERLFHEVGYSRLVLDGVNFSQINAEDNF 1365
            ++RR N+++ L+ GN WI GV E+K E ++HF + F E  ++R  L+G+NF+ ++  DN 
Sbjct: 316  SRRRSNRIVKLRKGNGWIQGVAEIKNEAQDHFSKHFSEEWHNRPFLNGINFNTLSVIDNC 375

Query: 1366 LLTAPFPPKKSR 1401
             L   F  ++ R
Sbjct: 376  FLLDNFSEEEVR 387



 Score = 34.7 bits (78), Expect(3) = 9e-82
 Identities = 11/21 (52%), Positives = 17/21 (80%)
 Frame = +3

Query: 1377 SLSPEEIKDAIWSCDGDKCPG 1439
            + S EE+++ +WSCDG+K PG
Sbjct: 380  NFSEEEVRETVWSCDGNKSPG 400


>GAU46303.1 hypothetical protein TSUD_283280 [Trifolium subterraneum]
          Length = 1636

 Score =  209 bits (531), Expect(3) = 6e-78
 Identities = 114/281 (40%), Positives = 158/281 (56%), Gaps = 7/281 (2%)
 Frame = +3

Query: 288  LTVWKSSM-LHVQ-----YHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGH 449
            ++VWK ++ L V+       F G+ F+GIC  ++   ++ ++VNIYSPC++ G       
Sbjct: 632  VSVWKGAVDLGVEGDEEDESFFGDGFLGICVEWQ--AVLVYIVNIYSPCTMAG------- 682

Query: 450  LVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECVEFNNFISEMNIVDIPVMGK 629
                            GDFN+IT   E RG       RE +EF+ FI  M +VDIPV+GK
Sbjct: 683  ----------------GDFNSITKIGERRGSHGGSVYRERIEFSQFIDAMELVDIPVLGK 726

Query: 630  KFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPF 809
            KFTWFN+D SAM              +   + QWVG+RDISDHCPIW+  S  +WGPKPF
Sbjct: 727  KFTWFNSDCSAMSRLDRFLLSEGFIEKGGISNQWVGNRDISDHCPIWLESSNINWGPKPF 786

Query: 810  KFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIE 989
            KFNNC LEH + + FV+  W+   + G K               + WN+EVFG +DLNIE
Sbjct: 787  KFNNCWLEHSDFLPFVKATWEKMNIHGKKAFIIKEKLKRLKEALKTWNQEVFGIMDLNIE 846

Query: 990  KIVEDIN-ILDAVAASNNLEDNSRRKELTVQFWQQIQNKES 1109
            K V+D+N I + +A  +N  D+   KEL+ +FW+Q+  KES
Sbjct: 847  KTVKDLNEIEELIANGDNQLDSVNSKELSKKFWEQLHFKES 887



 Score =  100 bits (249), Expect(3) = 6e-78
 Identities = 47/99 (47%), Positives = 66/99 (66%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            S+L+QKSR KWI EGD+NTRFFHA +  +RRRN+++ L+ GN WI GV E+K   K+HF 
Sbjct: 887  SILQQKSRTKWIQEGDSNTRFFHASIKGRRRRNRIVKLKKGNEWIQGVTEIKNVTKDHFA 946

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPFPPKKSR 1401
            + F E   +R  L G++F  ++  DN  L  PF  ++ R
Sbjct: 947  KHFSEEWPNRPFLQGIDFHTLSDADNAFLVEPFNEEEVR 985



 Score = 33.5 bits (75), Expect(3) = 6e-78
 Identities = 11/17 (64%), Positives = 15/17 (88%)
 Frame = +3

Query: 1389 EEIKDAIWSCDGDKCPG 1439
            EE+++ IWSCDG+K PG
Sbjct: 982  EEVRETIWSCDGNKSPG 998


>GAU47735.1 hypothetical protein TSUD_386940 [Trifolium subterraneum]
          Length = 1905

 Score =  232 bits (592), Expect(3) = 2e-76
 Identities = 117/335 (34%), Positives = 185/335 (55%), Gaps = 9/335 (2%)
 Frame = +3

Query: 108  LGDRAKR--RRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLSS 281
            L DRA++  R+++ L+   K + + LQETK       + ++ W   D EW    A G S 
Sbjct: 750  LRDRAEKNGRKVKELVRAEKLEFLALQETKLESFLESVPQSLWGSEDCEWACLSAVGNSG 809

Query: 282  GLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMS 461
            GLL++WK S+  V + F+G+ F+G+C         C ++N+Y+ C++  KRRLWG ++MS
Sbjct: 810  GLLSIWKKSLFSVVFTFTGHGFVGVCLDVLQDQSRCFVINVYAKCNLVDKRRLWGEILMS 869

Query: 462  KRGFGGTTWCVAGDFNAITVYSESRG-----VSAQFGCRECVEFNNFISEMNIVDIPVMG 626
            + GFGG  WCV GDFNA+   +E RG     V++Q+  RE  EF+ F+ E+ +VD+P++G
Sbjct: 870  RMGFGGGCWCVLGDFNAVREANERRGVGNVLVNSQY--REMAEFDAFVEELEMVDMPIIG 927

Query: 627  KKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKP 806
            ++FTWF+ +G AM              +W     WV  RD+SDHCP+ +  +  DWG KP
Sbjct: 928  RRFTWFHPNGVAMSRLDRVFLSSEWVSKWVNPNVWVLSRDVSDHCPLVVRYNNLDWGSKP 987

Query: 807  FKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNI 986
            F+FNN  L++K+    V + W+S   SGW                + W+ EV+G V+   
Sbjct: 988  FRFNNFWLQNKSFKELVVQTWESQMFSGWMGFVLKNRLKGLKACIKGWSAEVYGKVEEKK 1047

Query: 987  EKIVEDINILDAVAASNNL--EDNSRRKELTVQFW 1085
            + ++E I   D  + +  +  E+ + RK L  + W
Sbjct: 1048 KHLIEKIIEFDLRSETMGISSEEVTVRKRLFDELW 1082



 Score = 81.6 bits (200), Expect(3) = 2e-76
 Identities = 39/89 (43%), Positives = 55/89 (61%)
 Frame = +1

Query: 1117 QKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFERLFH 1296
            Q+SRAKW+ EGD N+R+FH CV A+RR N +LAL+    W++G V V+  V   F+R F 
Sbjct: 1094 QRSRAKWLKEGDVNSRYFHFCVNARRRSNSILALRTPIGWVEGPVRVREAVVTFFKRHFD 1153

Query: 1297 EVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
               ++R +L+GV    +N E   LL   F
Sbjct: 1154 NEMWNRPLLEGVVLPTLNEESKTLLVETF 1182



 Score = 23.5 bits (49), Expect(3) = 2e-76
 Identities = 8/16 (50%), Positives = 11/16 (68%)
 Frame = +3

Query: 1392 EIKDAIWSCDGDKCPG 1439
            EI+  + + DG KCPG
Sbjct: 1186 EIEAVVMASDGSKCPG 1201


>GAU10537.1 hypothetical protein TSUD_419470, partial [Trifolium subterraneum]
          Length = 557

 Score =  198 bits (504), Expect(3) = 7e-76
 Identities = 99/231 (42%), Positives = 135/231 (58%), Gaps = 3/231 (1%)
 Frame = +3

Query: 426  GKRRLWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECVEFNNFISEMNI 605
            GKR+LW  L+  +       WC+ GDFN+IT  SE RG S   G  E  EF   I  M +
Sbjct: 3    GKRKLWHDLIEFRMNNAPGEWCLGGDFNSITKTSERRGSSNWSGNTERTEFVQIIETMEL 62

Query: 606  VDIPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCST 785
            +DIPV+GKKFTW N+D SAM              +     QWVGDRDISDH PIW+ C+ 
Sbjct: 63   IDIPVLGKKFTWSNSDNSAMSRLDRFLLSEGLIEKGGITNQWVGDRDISDHYPIWLECNN 122

Query: 786  QDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVF 965
            ++W PKPFKFNNC LEH + + FV+  W+S  + G K               +KWN EVF
Sbjct: 123  RNWCPKPFKFNNCWLEHPDFIPFVKASWESMDIHGRKAFILKEKLKRLKESLKKWNHEVF 182

Query: 966  GFVDLNIEKIVEDINILDAVAASNN---LEDNSRRKELTVQFWQQIQNKES 1109
            G +DLNIEK V+++N ++ + A+ N   +  NS+++  +  FW+Q++ KES
Sbjct: 183  GIMDLNIEKTVKELNEIEEMIANGNSHPMYPNSKKQ--SKMFWEQLRFKES 231



 Score = 99.0 bits (245), Expect(3) = 7e-76
 Identities = 45/99 (45%), Positives = 66/99 (66%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            S+LKQKSR KWI EGD+NT FFHA +  + R N++  ++ GN WI+GV E+K   K+H+ 
Sbjct: 231  SILKQKSRTKWIQEGDSNTSFFHATIKGRHRSNRIAKIRKGNEWIEGVDEIKQAAKDHYS 290

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPFPPKKSR 1401
              F E  +SR  L G++F+ ++A+DN  L  PF  ++ R
Sbjct: 291  VHFSEEWHSRPFLQGIDFNSLSADDNAFLLEPFGEEEVR 329



 Score = 38.5 bits (88), Expect(3) = 7e-76
 Identities = 12/17 (70%), Positives = 16/17 (94%)
 Frame = +3

Query: 1389 EEIKDAIWSCDGDKCPG 1439
            EE++D +WSCDG+KCPG
Sbjct: 326  EEVRDTVWSCDGNKCPG 342


>GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterraneum]
          Length = 862

 Score =  239 bits (610), Expect(2) = 1e-75
 Identities = 115/297 (38%), Positives = 174/297 (58%), Gaps = 1/297 (0%)
 Frame = +3

Query: 219  ETFWRDCDFEWVARGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLV 398
            +  W   D  WV +   GLS GLL +W S   ++   FSG  ++GI  + +  G + HLV
Sbjct: 360  KNLWGHKDVRWVVKDLVGLSGGLLVMWNSDSFNLVNSFSGESYLGI--TVEREGAVTHLV 417

Query: 399  NIYSPCSIDGKRRLWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECVEF 578
            NIYSPCS+ GK++LW  L+  K+ F G   C+ GDFNAI   SE +G SA     E + F
Sbjct: 418  NIYSPCSLSGKKKLWEDLLEIKQLFTGGECCLRGDFNAILHSSERKGASADSRQGERMMF 477

Query: 579  NNFISEMNIVDIPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDH 758
            N F+ EM ++D+PV+G K +W +ADG +M              ++    QW+G+R+I DH
Sbjct: 478  NRFVEEMEMIDVPVLGMKVSWVSADGKSMSRLDRFILSDGFITKFGIIGQWIGNRNIFDH 537

Query: 759  CPIWILCSTQDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXX 938
            CPIW+  S ++WGPKPF+  N  LEH + + F+E  W+SF + G K +            
Sbjct: 538  CPIWLYASAKNWGPKPFRAINGCLEHPDFLVFLESCWKSFDIQGTKAYVLKEKLRFLKEI 597

Query: 939  XRKWNREVFGFVDLNIEKIVEDINILDAVAASNNLE-DNSRRKELTVQFWQQIQNKE 1106
             +KWN+EVFG +DLNI+K V+++N ++ +   ++ + + +RR+ L  +FW Q+  KE
Sbjct: 598  LKKWNKEVFGILDLNIDKTVKELNDIEKMLGDDDPDVELTRREGLNSEFWSQLHFKE 654



 Score = 74.3 bits (181), Expect(2) = 1e-75
 Identities = 35/75 (46%), Positives = 53/75 (70%)
 Frame = +1

Query: 1108 LLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFER 1287
            LL+QKSR + + EGD+N++FFH  +  +RR+NQL+ L+ G+ W++G+ EVK  VKN FE 
Sbjct: 656  LLQQKSRTRRVKEGDSNSKFFHESIKRRRRKNQLVVLKDGDQWVEGMEEVKGYVKNFFEN 715

Query: 1288 LFHEVGYSRLVLDGV 1332
             F E   +R  L+G+
Sbjct: 716  NFRERWPNRPNLNGM 730


>KYP36320.1 Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 1401

 Score =  241 bits (614), Expect(3) = 5e-74
 Identities = 125/343 (36%), Positives = 174/343 (50%), Gaps = 2/343 (0%)
 Frame = +3

Query: 81   KILSLNVRGLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVAR 260
            K + + VRG  D   R+++ S+     F  I +QETK            W    FEW A 
Sbjct: 593  KSIGMEVRGNEDEV-RQKIISMEQRDHF--IAIQETKLEMCDTARCAQLWGSTKFEWFAS 649

Query: 261  GASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRL 440
             + G S GLL++W S    + + FSG+ F G+C  +      C +VN+YS C +  KRRL
Sbjct: 650  PSHGRSGGLLSIWNSDRGKLLFSFSGSGFHGVCLQWGVDAYRCVVVNVYSSCHLVDKRRL 709

Query: 441  WGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECVEFNNFISEMNIVDIPV 620
            WG ++MSKRGFG   WC+ GDFN +    E +G     G R+  EFN+FI+EM ++D+P+
Sbjct: 710  WGDIIMSKRGFGSCLWCIVGDFNTVRRLEERKGGFGDHGARDMEEFNSFITEMELIDVPL 769

Query: 621  MGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGP 800
            +GK+FTWF +DGS M               W A    V  RD+SDHCP+ +     +WGP
Sbjct: 770  VGKRFTWFRSDGSIMSRLDRVLVSESWSAHWGAGFVKVIPRDVSDHCPLILNHKVLNWGP 829

Query: 801  KPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDL 980
            KPF+FNNC L H  +   V   W+      W                +KWN EVFG VD 
Sbjct: 830  KPFRFNNCWLSHCGIEGVVRSAWEKQVQGPWAAQRLRSKLLNVKNALKKWNIEVFGNVDT 889

Query: 981  NIEKIVEDINILDAVAASNNL--EDNSRRKELTVQFWQQIQNK 1103
             I+ +  ++  LDA      L   + +R+KEL    W   +NK
Sbjct: 890  MIKSLTNELKELDAKNEEQVLIQSERNRQKELVAGIWSARRNK 932



 Score = 67.4 bits (163), Expect(3) = 5e-74
 Identities = 33/93 (35%), Positives = 55/93 (59%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            +LL QK+R +W   GD N+++FHAC+  ++RRNQ++AL++G   ++ V E+K  V ++F+
Sbjct: 934  TLLAQKARIRWGKYGDQNSKYFHACIRGRQRRNQIVALKMGERMVEEVHEIKQVVWSYFD 993

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F    + R  L    F  ++ E N  L   F
Sbjct: 994  EHFKARSWLRPRLSLAGFPVVSNEQNARLVGDF 1026



 Score = 21.6 bits (44), Expect(3) = 5e-74
 Identities = 9/17 (52%), Positives = 10/17 (58%)
 Frame = +3

Query: 1389 EEIKDAIWSCDGDKCPG 1439
            EE+   I   DGDK PG
Sbjct: 1029 EEVSCLIRESDGDKSPG 1045


>KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine soja]
          Length = 326

 Score =  231 bits (590), Expect(2) = 9e-74
 Identities = 117/300 (39%), Positives = 162/300 (54%), Gaps = 1/300 (0%)
 Frame = +3

Query: 213  LIETFWRDCDFEWVARGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCH 392
            ++E  W +   +W+A  +SGLS GLL +WK  +  V+ +FSG+ FIG+C  F ++G    
Sbjct: 1    VVENMWGNQLIDWIALPSSGLSGGLLMMWKKGLWVVKSNFSGHGFIGVCVEFNSAG---- 56

Query: 393  LVNIYSPCSIDGKRRLWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECV 572
                                           WC+ GDFNA++   E  G S  +G  + V
Sbjct: 57   ------------------------------EWCLVGDFNAVSNREERTGRSENWGYIDMV 86

Query: 573  EFNNFISEMNIVDIPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDIS 752
            +FN F++EMN++D P+ G KFT+F +DG A                WQ   Q VG RDIS
Sbjct: 87   DFNAFVNEMNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQVKGQRVGKRDIS 146

Query: 753  DHCPIWILCSTQDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXX 932
            DHCPIW+ CS  +WGPKPF+FNNC LEH    SF+ E+W+  Q++G K +          
Sbjct: 147  DHCPIWLECSNLNWGPKPFRFNNCWLEHDGFKSFIVEEWKKIQITGRKAYVIKEKLKIIR 206

Query: 933  XXXRKWNREVFGFVDLNIEKIVEDINILD-AVAASNNLEDNSRRKELTVQFWQQIQNKES 1109
               +KWN+EVFG++DLNIE IV D+N LD  +    NL    ++KE    FWQQ+  KES
Sbjct: 207  ESLKKWNKEVFGWLDLNIENIVADMNELDRGIEEGCNLNVVVKKKEANALFWQQLMMKES 266



 Score = 75.9 bits (185), Expect(2) = 9e-74
 Identities = 35/60 (58%), Positives = 47/60 (78%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            SLLKQKSR +WI EGD+NT+FFH+C+  +RR+NQ+L+LQV    ++ V EVK EV+  FE
Sbjct: 266  SLLKQKSRLRWIKEGDSNTKFFHSCLQDRRRKNQILSLQVEGRCVEQVGEVKMEVRRFFE 325


>KYP32205.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1079

 Score =  238 bits (608), Expect(2) = 3e-73
 Identities = 121/342 (35%), Positives = 166/342 (48%)
 Frame = +3

Query: 78   MKILSLNVRGLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVA 257
            MK+ + N RGLG + K  R+  LI + + D I +QETK            W    FEW A
Sbjct: 1    MKVGTFNCRGLGGKVKSHRISELIRSEELDFIAIQETKLEMCDTARCAQLWGSTKFEWFA 60

Query: 258  RGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRR 437
              + G S GLL++W S    + + FSG+ F G+C  +      C +VN+YS C +  KRR
Sbjct: 61   SPSHGRSGGLLSIWNSDRGKLLFSFSGSGFHGVCLQWGVDAYRCVVVNVYSSCHLVDKRR 120

Query: 438  LWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGVSAQFGCRECVEFNNFISEMNIVDIP 617
            LWG ++MSKRGFG   WC+ GDFN +    E +G     G R+  EFN+FI+EM ++D+P
Sbjct: 121  LWGDIIMSKRGFGSCLWCIVGDFNTVRRLEERKGGFGDHGARDMEEFNSFITEMELIDVP 180

Query: 618  VMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWG 797
            ++GK+FTWF +DGS M               W A    V  RD+SDHCP+ +     +WG
Sbjct: 181  LVGKRFTWFRSDGSIMSRLDRVLVSESWSAHWGAGFVKVIPRDVSDHCPLILNHKVLNWG 240

Query: 798  PKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVD 977
            PKPF+FNNC L H  +   V   W+      W                +KWN EVF    
Sbjct: 241  PKPFRFNNCWLSHCGIEGVVRSAWEKQVQGPWAAQRLRSKLLNVKNALKKWNIEVF---- 296

Query: 978  LNIEKIVEDINILDAVAASNNLEDNSRRKELTVQFWQQIQNK 1103
                                   + +R+KEL    W   +NK
Sbjct: 297  -----------------------ERNRQKELVAGIWSARRNK 315



 Score = 67.4 bits (163), Expect(2) = 3e-73
 Identities = 33/93 (35%), Positives = 55/93 (59%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            +LL QK+R +W   GD N+++FHAC+  ++RRNQ++AL++G   ++ V E+K  V ++F+
Sbjct: 317  TLLAQKARIRWGKYGDQNSKYFHACIRGRQRRNQIVALKMGERMVEEVHEIKQVVWSYFD 376

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F    + R  L    F  ++ E N  L   F
Sbjct: 377  EHFKARSWLRPRLSLAGFPVVSNEQNARLVGDF 409


>GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum]
          Length = 1985

 Score =  225 bits (573), Expect(3) = 5e-73
 Identities = 107/342 (31%), Positives = 177/342 (51%), Gaps = 5/342 (1%)
 Frame = +3

Query: 99   VRGLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLS 278
            VRGLG+R KRR++R L+   K D + +QETK     +  ++  W   D +W    + G S
Sbjct: 759  VRGLGNRVKRRKVRELVQMEKLDFLAIQETKMEAFPDNFVQGLWGSNDCDWCFLPSEGRS 818

Query: 279  SGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVM 458
             G+L++W      + + F G  F+G C      G  C +VN+Y+ C++  KR LW +++M
Sbjct: 819  GGILSIWNKVKSTLVFSFIGEGFVGACLDLVAEGKKCFIVNVYAKCNLRNKRTLWANILM 878

Query: 459  SKRGFGGTTWCVAGDFNAITVYSESRGVSAQFG---CRECVEFNNFISEMNIVDIPVMGK 629
            SK GFG   WCV GDFN++   +E RGV          E V F+ F++ +++VD+P++G+
Sbjct: 879  SKSGFGEGLWCVLGDFNSVRDSNERRGVVGNVDGQRSSEMVAFDLFLNNLDLVDMPLIGR 938

Query: 630  KFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPF 809
            +FTWF+ +G +M               W     W  DRD++DHCP+ +  S  DWGP+PF
Sbjct: 939  RFTWFHPNGVSMSRLDRILISSDWADVWGTPNVWAMDRDVADHCPLVLRYSLADWGPRPF 998

Query: 810  KFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIE 989
            +F+N  LEH+     ++  W +    GW                ++W+R  +G  +   +
Sbjct: 999  RFSNFWLEHREFKEVIKTAWDAHVAEGWMGFILKERLKVLKGVVKEWSRRTYGEAEAKKK 1058

Query: 990  KIVEDINILDAVAASNNLEDNS--RRKELTVQFWQQIQNKES 1109
            ++++DI  LD  + +  L       RK L    W  +++ ++
Sbjct: 1059 RLIKDILALDLKSETTGLLQGEVVERKILFDDLWITLKSMDA 1100



 Score = 75.9 bits (185), Expect(3) = 5e-73
 Identities = 34/93 (36%), Positives = 59/93 (63%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            +++ Q+SR+KW+ EGD N+++FH C+ A++RRN ++AL+  N W++G   V+ EV + F 
Sbjct: 1100 AMIFQRSRSKWLKEGDTNSQYFHNCIKARKRRNNMVALRTRNGWVEGPSLVREEVVSFFR 1159

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F    + R  L+G+ F +++      LTA F
Sbjct: 1160 NHFSNEEWHRPTLNGIEFPRLSLARVEELTAMF 1192



 Score = 25.4 bits (54), Expect(3) = 5e-73
 Identities = 9/17 (52%), Positives = 11/17 (64%)
 Frame = +3

Query: 1389 EEIKDAIWSCDGDKCPG 1439
            EEI + +  CDG K PG
Sbjct: 1195 EEISEVVRGCDGSKSPG 1211


>GAU32122.1 hypothetical protein TSUD_218730 [Trifolium subterraneum]
          Length = 1246

 Score =  201 bits (510), Expect(3) = 1e-70
 Identities = 109/311 (35%), Positives = 158/311 (50%), Gaps = 1/311 (0%)
 Frame = +3

Query: 180  QETKCSDITNQLIETFWRDCDFEWVARGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGIC 359
            Q  +C  I   +I + W   D  WVA+ + GLS G+L +W S   ++             
Sbjct: 377  QRNQCMKI---MIHSLWGHKDVGWVAKESEGLSGGMLVIWNSDTFNMA------------ 421

Query: 360  ASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRG 539
                                  GK++LW  LV+ K+  GG  WC+ GDFNA+   SE +G
Sbjct: 422  ----------------------GKKKLWEDLVIFKQQSGGGEWCLGGDFNAVLHSSERKG 459

Query: 540  VSAQFGCRECVEFNNFISEMNIVDIPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQA 719
            +SA     E   FN F+ EM  +D+P++GKKF+WF+ DG                     
Sbjct: 460  ISADSRHAERACFNRFVEEMEEIDVPILGKKFSWFSTDG--------------------- 498

Query: 720  NAQWVGDRDISDHCPIWILCSTQDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKL 899
                  DRDISDHC +W++  +++WGPKPFK  N  LEH    SFVE+  + F+VSG K 
Sbjct: 499  ------DRDISDHCLVWLVSESKNWGPKPFKVINGWLEHPKFFSFVEKSRKGFKVSGKKA 552

Query: 900  HTFXXXXXXXXXXXRKWNREVFGFVDLNIEKIVEDINILDAVAASNNLE-DNSRRKELTV 1076
            +             RKWNREVFG +DLNIEK V+D+N ++ +   + ++ + +RR+ L  
Sbjct: 553  YVLKEKFRMLKECLRKWNREVFGILDLNIEKTVKDLNNIEGLMGDDEMDLELTRREGLNK 612

Query: 1077 QFWQQIQNKES 1109
            +FW+Q+  KES
Sbjct: 613  EFWRQLHLKES 623



 Score = 86.7 bits (213), Expect(3) = 1e-70
 Identities = 45/93 (48%), Positives = 58/93 (62%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            SLLKQKSR +W+ EGD+N+R+FH  + + RRRN L+AL+ G   + GV EVK  VKN F+
Sbjct: 623  SLLKQKSRMRWVKEGDSNSRYFHESIKSIRRRNHLVALKDGEQRVQGVEEVKVFVKNFFD 682

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F E       L+GV F  +  EDN  L  PF
Sbjct: 683  NNFRESLEDIPNLNGVQFQSLTDEDNLSLLDPF 715



 Score = 31.2 bits (69), Expect(3) = 1e-70
 Identities = 11/18 (61%), Positives = 15/18 (83%)
 Frame = +3

Query: 1383 SPEEIKDAIWSCDGDKCP 1436
            S +E+K+AIW  DG+KCP
Sbjct: 716  SIDEVKEAIWCSDGNKCP 733


>GAU48812.1 hypothetical protein TSUD_406450 [Trifolium subterraneum]
          Length = 655

 Score =  213 bits (541), Expect(2) = 5e-70
 Identities = 108/340 (31%), Positives = 176/340 (51%), Gaps = 5/340 (1%)
 Frame = +3

Query: 105  GLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLSSG 284
            GLG R KRR++R ++ + + D + +QETK   I++ L+   W + D  W    + G S G
Sbjct: 199  GLGSRVKRRKIRDMVRDEQLDFLAIQETKLEVISDALVLALWGNNDCCWSYLPSVGNSGG 258

Query: 285  LLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMSK 464
            +L++W      + + F G+ F+G+C         C ++N+Y+ CS   KR LW +++MSK
Sbjct: 259  ILSIWNKVKASLVFTFIGDGFVGVCLDLLTENKRCFVINVYAKCSSRDKRTLWSNILMSK 318

Query: 465  RGFGGTTWCVAGDFNAITVYSESRGVSAQF---GCRECVEFNNFISEMNIVDIPVMGKKF 635
            RGFG   WC+ GDFN+I   SE RGV+         E   FN F+ ++++ D+P++G+ F
Sbjct: 319  RGFGDGLWCIVGDFNSIRDSSERRGVNLSHRDDPSAEMRSFNEFVGDLDLFDMPLVGRMF 378

Query: 636  TWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPFKF 815
            TWF+ +G AM               W  +   V DRD+SDHCP+ +   + DWGPKPF+F
Sbjct: 379  TWFHPNGIAMSRLDRLLVSPLWLDSWGDSFVRVLDRDVSDHCPLVLRYCSVDWGPKPFRF 438

Query: 816  NNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIEKI 995
            NN  L+ +     ++  W S +  GW                ++W    FG  +   +++
Sbjct: 439  NNFCLQSREFKDVIKVTWASQEFIGWMGFILKERLKGLKGVIKEWTVRNFGDAEGKKKRL 498

Query: 996  VEDINILDAVAASNNLEDNS--RRKELTVQFWQQIQNKES 1109
              +I  LD+ +    L D     RK+L  + W  ++N ++
Sbjct: 499  TIEIAELDSKSEGLGLVDAEVVLRKKLFEELWILLRNMDA 538



 Score = 82.4 bits (202), Expect(2) = 5e-70
 Identities = 40/93 (43%), Positives = 62/93 (66%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            +L+ Q+SR++WI EGD+N+R+FH CV A++RRN LLAL+  + W++G   V+  V ++F+
Sbjct: 538  ALIFQRSRSRWIKEGDSNSRYFHNCVKARKRRNNLLALRTPSGWVEGPTLVREAVVSYFK 597

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F    + R  LDG+ F Q++A     LTA F
Sbjct: 598  NHFDNGRWHRPTLDGIVFPQLSANKVEDLTAIF 630


>GAU51943.1 hypothetical protein TSUD_417260, partial [Trifolium subterraneum]
          Length = 421

 Score =  215 bits (547), Expect(2) = 1e-68
 Identities = 103/291 (35%), Positives = 157/291 (53%), Gaps = 5/291 (1%)
 Frame = +3

Query: 228  WRDCDFEWVARGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIY 407
            W + D +WV+  A G S G+L++W+ S+  V + F+G+ FIG+C    +  + C ++N+Y
Sbjct: 32   WGNNDCDWVSLPAVGNSGGILSLWRKSLGPVVFSFTGDGFIGVCLDLVDKHVRCCVINVY 91

Query: 408  SPCSIDGKRRLWGHLVMSKRGFGGTTWCVAGDFNAITVYSESRGV---SAQFGCRECVEF 578
            + C+I  KRRLW  L+M+KRGFG   WC+ GDFN++   SE RGV   +     RE  EF
Sbjct: 92   AKCNIVDKRRLWSDLLMTKRGFGDIVWCIVGDFNSVVDSSERRGVVVGAVHSQSREMREF 151

Query: 579  NNFISEMNIVDIPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDH 758
              F+ E+ +VD+P++G+ FTWF+ +G  M               W     WV  RD+SDH
Sbjct: 152  GQFLEELEVVDLPLIGRSFTWFHPNGITMSRLDRILVSTDWIPLWGNPNVWVASRDVSDH 211

Query: 759  CPIWILCSTQDWGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXX 938
            CP+++   + DWGPKPF+FNN  L++ N  + V   W++   SGW  +            
Sbjct: 212  CPLFLRYDSTDWGPKPFRFNNFWLKNNNFRALVINTWEAQNFSGWMGYILKDRLKGLKIV 271

Query: 939  XRKWNREVFGFVDLNIEKIVEDINILDAVAASNNLEDN--SRRKELTVQFW 1085
             + WN EV+G        +VE I +LD  +    + D     R+ L  + W
Sbjct: 272  IKNWNGEVYGKPVERKRSLVEKIKVLDLKSEQVGISDEEVEVRRRLFDELW 322



 Score = 75.1 bits (183), Expect(2) = 1e-68
 Identities = 34/82 (41%), Positives = 53/82 (64%)
 Frame = +1

Query: 1117 QKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFERLFH 1296
            Q+SRA+W+ EGDANT++FHA V A+ RRN + AL   + W++G   V+  V + F++ F 
Sbjct: 334  QRSRARWLKEGDANTKYFHAHVKARGRRNNISALLTEDGWVEGPTNVRQAVVSFFQQHFA 393

Query: 1297 EVGYSRLVLDGVNFSQINAEDN 1362
               + R  L+GV+F  ++ E N
Sbjct: 394  TTAWERPTLEGVDFPLLSEESN 415


>KYP31897.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1101

 Score =  216 bits (550), Expect(2) = 4e-68
 Identities = 114/348 (32%), Positives = 172/348 (49%), Gaps = 4/348 (1%)
 Frame = +3

Query: 78   MKILSLNVRGLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVA 257
            MKI++ N+RGLG R KRR LR +I   +   + LQETK  +IT  + ++FW + D+EW A
Sbjct: 1    MKIITFNIRGLGGRVKRRNLREMIQKERVQLLCLQETKTREITEAMCKSFWGEDDYEWRA 60

Query: 258  RGASGLSSGLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRR 437
              A   + GLL +W+         F G+ ++G+   + + G     VN+Y PC    K  
Sbjct: 61   IPAVNTAGGLLCIWRKEAFQCCSVFEGSSYLGLEGIWLDDGSRVIFVNVYMPCIQAQKEL 120

Query: 438  LWGHLVMSKRGFGGTTWCVAGDFNAITVYSE--SRGVSAQFGCRECVEFNNFISEMNIVD 611
            +W  LV  K       WCV GDFN I    E  +  VS     RE  +FN FIS+M + +
Sbjct: 121  IWNELVEMKNCSQVQMWCVLGDFNCIRRAEERVNVDVSNNNRTREMTQFNRFISQMELEE 180

Query: 612  IPVMGKKFTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQD 791
            +P++GKK+TW+  +G                 EW   +Q V  R +SDHCPI +     D
Sbjct: 181  VPIIGKKYTWYKPNGRVKSRLDRIFVTKECLLEWSNISQKVMKRSVSDHCPILLQSKMVD 240

Query: 792  WGPKPFKFNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGF 971
            WGPKPF+  +C ++     + VE+ W+   + GW  +             ++WN + F  
Sbjct: 241  WGPKPFRSLDCWIQDNQFHTVVEDAWRRMSIQGWGAYVLQQKLKQLKITLKEWNAKDFKS 300

Query: 972  VDLNIEKIVEDINILDAVAASNNLE--DNSRRKELTVQFWQQIQNKES 1109
              +   ++VE++N LD +    +L   + SR+ EL   FW      ES
Sbjct: 301  QKMEERRVVEEMNKLDIIEEERSLTEVEISRKCELLQLFWDAAIRNES 348



 Score = 72.4 bits (176), Expect(2) = 4e-68
 Identities = 36/93 (38%), Positives = 54/93 (58%)
 Frame = +1

Query: 1105 SLLKQKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFE 1284
            S+  QKSR++WI EGD NT+FFH  V  + R+N++  L + + W++    VK    ++FE
Sbjct: 348  SIWCQKSRSQWIKEGDMNTKFFHLMVKWRSRKNEIKGLFIDDQWVEEPKVVKNNALSYFE 407

Query: 1285 RLFHEVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
              F E    R  LDG +F  I++  N +L A F
Sbjct: 408  NRFQEQSIVRPKLDGAHFKTISSSQNEMLVAVF 440


>GAU09987.1 hypothetical protein TSUD_393040 [Trifolium subterraneum]
          Length = 1815

 Score =  208 bits (529), Expect(2) = 9e-68
 Identities = 107/333 (32%), Positives = 169/333 (50%), Gaps = 5/333 (1%)
 Frame = +3

Query: 102  RGLGDRAKRRRLRSLIINGKFDCIFLQETKCSDITNQLIETFWRDCDFEWVARGASGLSS 281
            RGLG R KRR+++ +I   K D + LQETK  ++T+ L  + W + D++           
Sbjct: 747  RGLGSRVKRRKVKEVIGVEKLDFLALQETKLEEVTSTLCRSLWGNDDWDG---------- 796

Query: 282  GLLTVWKSSMLHVQYHFSGNRFIGICASFKNSGLMCHLVNIYSPCSIDGKRRLWGHLVMS 461
                                 F+G+C    +  +   ++N+Y+ C++  KRRLW  L+M+
Sbjct: 797  ---------------------FVGVCLDLVDLQVRVCVINVYAKCNLSDKRRLWSDLLMT 835

Query: 462  KRGFGGTTWCVAGDFNAITVYSESRGVS---AQFGCRECVEFNNFISEMNIVDIPVMGKK 632
            KRGFG   WC+ GDFN++   SE RG++   A    RE +EF  F+ ++ +VD+P +G++
Sbjct: 836  KRGFGDIVWCIVGDFNSVLDTSERRGIALGAAHSPTREMMEFGQFMEDLELVDLPFIGRR 895

Query: 633  FTWFNADGSAMXXXXXXXXXXXXXXEWQANAQWVGDRDISDHCPIWILCSTQDWGPKPFK 812
            FTWF+ +G+ M               W     WV  RD+SDHCPI +   + DWGPKPF+
Sbjct: 896  FTWFHPNGTTMSRLDRVLVSLDWIPLWGNPNAWVAPRDVSDHCPIILRYDSTDWGPKPFR 955

Query: 813  FNNC*LEHKNLMSFVEEKWQSFQVSGWKLHTFXXXXXXXXXXXRKWNREVFGFVDLNIEK 992
            FNN  L+H N  + V++ W++ Q +GW  +             + WN  V+G        
Sbjct: 956  FNNFWLKHNNFRTLVKDTWEAQQFTGWMGYILKDRLKGLKIAIKNWNGVVYGKPVERKNS 1015

Query: 993  IVEDINILDAVAASNNL--EDNSRRKELTVQFW 1085
            +VE I  LD  +    +  E+ + RK+L  + W
Sbjct: 1016 LVEKIKALDLKSEQVGISGEEVATRKKLFDELW 1048



 Score = 79.3 bits (194), Expect(2) = 9e-68
 Identities = 37/89 (41%), Positives = 56/89 (62%)
 Frame = +1

Query: 1117 QKSRAKWILEGDANTRFFHACVGAQRRRNQLLALQVGNSWIDGVVEVKAEVKNHFERLFH 1296
            Q+SRA+W+ EGDANT++FH+ V A+ R N++ AL   N W++G + V+    + F+  F 
Sbjct: 1060 QRSRARWLKEGDANTKYFHSQVKARGRINKVSALLTDNGWVEGPINVRQATLSFFQHHFA 1119

Query: 1297 EVGYSRLVLDGVNFSQINAEDNFLLTAPF 1383
               + R  LDGV F  ++ + N  LTAPF
Sbjct: 1120 STEWERPTLDGVGFPVLSDDCNSALTAPF 1148


Top