BLASTX nr result

ID: Glycyrrhiza29_contig00027229 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00027229
         (996 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterran...   262   2e-78
GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterran...   261   4e-75
GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterran...   251   3e-74
GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]   242   2e-68
GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterran...   232   1e-66
GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterran...   230   1e-64
GAU24549.1 hypothetical protein TSUD_148900 [Trifolium subterran...   228   2e-63
GAU10126.1 hypothetical protein TSUD_423110, partial [Trifolium ...   208   1e-62
GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterran...   225   2e-62
GAU14347.1 hypothetical protein TSUD_309070 [Trifolium subterran...   221   2e-62
GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterran...   221   3e-61
KYP70042.1 hypothetical protein KK1_009250 [Cajanus cajan]            211   3e-61
KYP31897.1 Putative ribonuclease H protein At1g65750 family [Caj...   217   7e-60
GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterran...   214   1e-59
KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine...   203   1e-59
KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine...   203   4e-59
GAU22765.1 hypothetical protein TSUD_129770 [Trifolium subterran...   206   4e-59
KYP35677.1 Transposon TX1 uncharacterized [Cajanus cajan]             203   5e-59
KYP32205.1 Putative ribonuclease H protein At1g65750 family [Caj...   214   8e-59
GAU50762.1 hypothetical protein TSUD_410610 [Trifolium subterran...   209   3e-58

>GAU25065.1 hypothetical protein TSUD_257590 [Trifolium subterraneum]
          Length = 721

 Score =  262 bits (670), Expect = 2e-78
 Identities = 133/278 (47%), Positives = 176/278 (63%)
 Frame = +1

Query: 115 AI*VSMKILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGV 294
           A+ + ++I+SLN+RG G  AKRRR+ S L +G    C LQETK ++I   ++ +LW    
Sbjct: 98  ALKLGVEIVSLNMRGWGGSAKRRRLSSFLQKGAFDVCLLQETKKADIEDFLIHNLWGHKD 157

Query: 295 YDFVSKDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNW 474
            ++V+K+  GLSGGMLI+W+ +                  R     D L +VNVYSPC  
Sbjct: 158 VNWVAKNPTGLSGGMLIIWNFDFFSLLNSYYGDGYLGI--RVDREGDELNIVNVYSPCII 215

Query: 475 DNKKELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMN 654
             KK+LW+DLL  K+  GG  WCV GDFN++    ER+G S    + E   FN+F+ EM 
Sbjct: 216 SGKKKLWEDLLALKQSTGGGKWCVRGDFNSILHSSERKGSSIVSRQNESSLFNRFVEEME 275

Query: 655 LIDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCS 834
           LID PVLGKKF+W+S DG + SR+DRFLLS+  +SK+ +  +W+GDRDIS HCPIWL CS
Sbjct: 276 LIDTPVLGKKFSWFSADGKSMSRIDRFLLSDGFVSKFGITGKWIGDRDISYHCPIWLLCS 335

Query: 835 VYDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWK 948
            Y+WGPKPFR  N W+ H  F  FVE +W SF V G K
Sbjct: 336 SYNWGPKPFRVINGWMEHPDFFDFVETTWKSFDVHGKK 373


>GAU48536.1 hypothetical protein TSUD_282880 [Trifolium subterraneum]
          Length = 1794

 Score =  261 bits (667), Expect = 4e-75
 Identities = 129/278 (46%), Positives = 169/278 (60%)
 Frame = +1

Query: 163  GD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVSKDAEGLSGGML 342
            G  AKRRR+  +L  G    C LQETK  N    ++  LW     ++V+K++ GLSGG+L
Sbjct: 690  GSCAKRRRLSKLLASGTFDLCLLQETKRDNFDDLMIQKLWGHKDVEWVAKESIGLSGGLL 749

Query: 343  IVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKELWKDLLWCKRV 522
            I+W+A                 C   +    +LF++N+YSPC+   K++LW DLL  K+ 
Sbjct: 750  IMWNAGLFNLKFSFTGDNFLGLCVECKEG--ILFIINIYSPCSLSGKRKLWSDLLEFKQN 807

Query: 523  FGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMNLIDVPVLGKKFTWWSG 702
                 WC+ GDFN V    ER+G S+   + E   F QF+  M L DVPV GKKF+W+S 
Sbjct: 808  NEQGEWCLGGDFNVVLKTGERKGSSALCRQNERLEFCQFVEAMELCDVPVAGKKFSWFSA 867

Query: 703  DGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWGPKPFRFNNCWL 882
            DG + SRLDRFLLSE+ I   +V  QW+G RDISDHCPIWL CS  +WGPKPF+ NNCWL
Sbjct: 868  DGTSMSRLDRFLLSEKFIDSEKVTGQWIGSRDISDHCPIWLLCSNLNWGPKPFKVNNCWL 927

Query: 883  GHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
             H +FK FVE++W    V G K +  KEKLK L+ +LR
Sbjct: 928  EHPEFKPFVEKTWNKLNVEGKKAFVIKEKLKRLKEELR 965


>GAU14107.1 hypothetical protein TSUD_169320 [Trifolium subterraneum]
          Length = 695

 Score =  251 bits (641), Expect = 3e-74
 Identities = 125/276 (45%), Positives = 172/276 (62%), Gaps = 1/276 (0%)
 Frame = +1

Query: 172 AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVSKDAEGLSGGMLIVW 351
           AKRRR+ S++  G    C LQETK  N    ++ +LW     ++V+K + GLSGG+L VW
Sbjct: 2   AKRRRLSSLIKTGAFDMCMLQETKRDNFEDYMIHNLWGHTDVEWVAKKSNGLSGGLLSVW 61

Query: 352 DAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKELWKDLLWCKRVFGG 531
           + +                C   +    +L++VNVYS CN   K++LW DL+  K     
Sbjct: 62  NKDLFSFRYSFTGDGFLGVCVEWK--AGLLYIVNVYSSCNVSGKRKLWNDLIDFKLNNEP 119

Query: 532 DLWCVAGDFNAVSDLQERRGVSSGYGRR-EIGGFNQFISEMNLIDVPVLGKKFTWWSGDG 708
           + WC+ GDFN++S + ERRG SSG  R+ E   F QFI  + ++D+P+  K FTW++ DG
Sbjct: 120 EEWCLGGDFNSISKVGERRGSSSGAWRQGERIEFIQFIDALEVVDIPLKDKMFTWFNSDG 179

Query: 709 NAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWGPKPFRFNNCWLGH 888
           +A SRL+ FL+SE  I K  ++ QWVGDRDISDHCPIWL CS  +WGPKPF FNNCWL H
Sbjct: 180 SAMSRLNHFLVSEGFIEKGSLSYQWVGDRDISDHCPIWLMCSNINWGPKPFTFNNCWLEH 239

Query: 889 KQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
            +F  FV+E+W +  +RG K +  KEKLK L+  L+
Sbjct: 240 PKFFEFVKETWENMDIRGKKAFIIKEKLKGLKEALK 275


>GAU33402.1 hypothetical protein TSUD_20950 [Trifolium subterraneum]
          Length = 1594

 Score =  242 bits (618), Expect = 2e-68
 Identities = 123/272 (45%), Positives = 160/272 (58%)
 Frame = +1

Query: 181  RRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVSKDAEGLSGGMLIVWDAE 360
            RR+R    QG    C LQETK  N    ++ ++W     ++V+K + GLSGG L      
Sbjct: 650  RRMREQRNQGTFDICLLQETKRDNFDDFMIQNVWGHKDVEWVAKGSVGLSGGNL------ 703

Query: 361  ELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKELWKDLLWCKRVFGGDLW 540
                                       +++NVYSPC+   K++LW DLL  K       W
Sbjct: 704  ---------------------------YIINVYSPCSLSGKRKLWSDLLEFKLNNEQGEW 736

Query: 541  CVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMNLIDVPVLGKKFTWWSGDGNAKS 720
            C+ GDFN V ++ ER+G +S   + E   F QF+  M LIDVPV GKKF+W+S DGNA S
Sbjct: 737  CLRGDFNVVLNVGERKGSTSSARQNERLEFCQFVEAMELIDVPVAGKKFSWFSADGNAIS 796

Query: 721  RLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWGPKPFRFNNCWLGHKQFK 900
            RLDRFLLS+  I K +VA QW+G+ DISDHCPIWL CS  +WGPKPF+ NNCWL H +FK
Sbjct: 797  RLDRFLLSDNFIEKEEVAGQWIGNHDISDHCPIWLMCSNLNWGPKPFKVNNCWLEHSEFK 856

Query: 901  SFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
             FVE++W    +RG K +  KEKLK L+ +LR
Sbjct: 857  LFVEKTWEKLNIRGKKAFVIKEKLKRLKEELR 888


>GAU42970.1 hypothetical protein TSUD_188430 [Trifolium subterraneum]
          Length = 767

 Score =  232 bits (592), Expect = 1e-66
 Identities = 112/258 (43%), Positives = 159/258 (61%)
 Frame = +1

Query: 223 CFLQETKCSNIHRNVVDSLWYGGVYDFVSKDAEGLSGGMLIVWDAEELEXXXXXXXXXXX 402
           C LQETK  +    ++ +LW     ++V K++ GLSGG+L VW+ +              
Sbjct: 5   CMLQETKRESFAEFLIHNLWGHRDVEWVHKESRGLSGGLLSVWNKDFCSFRHSFTGDGFL 64

Query: 403 XXCARKRNSTDVLFLVNVYSPCNWDNKKELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQE 582
             C   +++  +++ VN+Y  C+   K++LW+DL+  K +     WC+ GDFN+++ + +
Sbjct: 65  GICVEWKDT--LVYFVNIYFACSLAGKRKLWRDLIDFKLLNTPGEWCLGGDFNSITKVSK 122

Query: 583 RRGVSSGYGRREIGGFNQFISEMNLIDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISK 762
           R G S+G   +E   F QFI  M L+D+PV GKKFTW + D +A SRLDRFLLSE LI K
Sbjct: 123 RSGSSNGSSNKERTEFAQFIDAMELVDIPVFGKKFTWSNSDNSAMSRLDRFLLSEGLIEK 182

Query: 763 WQVAAQWVGDRDISDHCPIWLDCSVYDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRG 942
             ++ QWVG RDISDH PIWL+CS  +WGPKPF+FNN WL H  F  FV+ +W S  + G
Sbjct: 183 GGISNQWVGGRDISDHHPIWLECSNINWGPKPFKFNNFWLDHPDFIPFVKATWESMNIHG 242

Query: 943 WKTYAFKEKLKMLRVKLR 996
            K +  KEKLK L+  L+
Sbjct: 243 KKAFILKEKLKRLKEVLK 260


>GAU42690.1 hypothetical protein TSUD_302000 [Trifolium subterraneum]
          Length = 1092

 Score =  230 bits (587), Expect = 1e-64
 Identities = 126/287 (43%), Positives = 162/287 (56%)
 Frame = +1

Query: 136 ILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVSKD 315
           I S +IR       R   + +  +G    C LQETK ++    ++ +LW     ++V K+
Sbjct: 164 ICSSDIRNCNKVFLRNYEQKVATKGAFDVCLLQETKKADFEDYLIHNLWGHKDVNWVVKE 223

Query: 316 AEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKELW 495
             G S     VW    +                       VL +VNVYSPCN   KK+LW
Sbjct: 224 PVGFSR----VWTWRGV-----------------------VLHIVNVYSPCNISGKKQLW 256

Query: 496 KDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMNLIDVPVL 675
           +DLL  K+     LWCV GDFNA+    ER+G S+   + E   FN F+ EM LID+PVL
Sbjct: 257 EDLLELKQRVAEGLWCVGGDFNAILHSFERQGSSTDSRKSERVLFNSFVEEMELIDIPVL 316

Query: 676 GKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWGPK 855
           GKKF+W+S DG + SR+DRFLLS+  +SK+ +  QW+GDRDISDHCPIWL  S   WGPK
Sbjct: 317 GKKFSWFSADGKSMSRIDRFLLSDGFVSKFGITGQWIGDRDISDHCPIWLLFSSNIWGPK 376

Query: 856 PFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
           PFR  N WL H  F +FVE +W SF V G K Y  KEK K+L+  LR
Sbjct: 377 PFRVINGWLDHPDFLTFVETTWKSFAVHGKKAYILKEKFKLLKDSLR 423


>GAU24549.1 hypothetical protein TSUD_148900 [Trifolium subterraneum]
          Length = 1239

 Score =  228 bits (580), Expect = 2e-63
 Identities = 120/293 (40%), Positives = 164/293 (55%), Gaps = 4/293 (1%)
 Frame = +1

Query: 130 MKILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVS 309
           M + S N+RG+G   KRRRIR ++    +    LQETK   +      SLW     ++V 
Sbjct: 1   MIVSSFNVRGLGGVMKRRRIRELVRHQKIDFLALQETKMEVLSEAFCYSLWGSDDCEWVF 60

Query: 310 KDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKE 489
             + G SGG+L +W                   C        V  +VNVYS C+  +K+ 
Sbjct: 61  LPSVGRSGGILSIWGKTNNSLIFSFVGDGFVGICLEWGVLKTVCIVVNVYSKCDVGSKRL 120

Query: 490 LWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGR----REIGGFNQFISEMNL 657
           LW +LL  +R  GG  WCV GDFNAV    ER GV+SG G      EIG F +FI E+ L
Sbjct: 121 LWNNLLNVRRGIGGGRWCVVGDFNAVCRRDERMGVNSGDGGGSSLTEIGEFCKFIEELEL 180

Query: 658 IDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSV 837
           +D+P++G++FTW+  +G A SR+DR L+S+E   +W     WV  RD+SDHCP+ L  + 
Sbjct: 181 VDLPLVGRRFTWYHANGRAMSRIDRILISDEWALRWGNCDLWVLPRDVSDHCPLILKYNQ 240

Query: 838 YDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
             WGPKPFRFNN WL +K+ K  VE  W++ +V GW  +  KEKLK L+  L+
Sbjct: 241 DGWGPKPFRFNNFWLQNKKLKEVVESCWSNLRVEGWMGFVLKEKLKGLKSTLK 293


>GAU10126.1 hypothetical protein TSUD_423110, partial [Trifolium subterraneum]
          Length = 238

 Score =  208 bits (529), Expect = 1e-62
 Identities = 106/238 (44%), Positives = 144/238 (60%), Gaps = 4/238 (1%)
 Frame = +1

Query: 151 IRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVSKDAEGLS 330
           +RG G  AKRRR+ S +  G+   C LQET  ++I   ++ +LW      +++ +  G S
Sbjct: 1   MRGWGGSAKRRRLSSFIQMGSFDVCLLQETNKADIEDFLIHNLWGHNDVRWIANNPVGFS 60

Query: 331 GGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTD----VLFLVNVYSPCNWDNKKELWK 498
           GG+ +  + + L                   +S D    VL +VNVY+PCN   KK+LW+
Sbjct: 61  GGVSVDREGDVLHIVNVYAPCNIGGDFNAVLSSVDREGDVLHIVNVYAPCNISGKKKLWE 120

Query: 499 DLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMNLIDVPVLG 678
           DL   K+  GG  WCV GDF AV    ER+GVS+   + E   FN F+ EM LIDVPVLG
Sbjct: 121 DLSMLKQQIGGGKWCVGGDFIAVLHSSERKGVSTDTRQSERFLFNCFVEEMELIDVPVLG 180

Query: 679 KKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWGP 852
           KKFTW+S DGN+ SR+DRFLLS+  ++K+ +  QW+GDRDISDHCP+WL  S  +WGP
Sbjct: 181 KKFTWFSADGNSMSRIDRFLLSDGFVTKYDITGQWIGDRDISDHCPVWLLSSSVNWGP 238


>GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterraneum]
          Length = 1892

 Score =  225 bits (573), Expect = 2e-62
 Identities = 113/293 (38%), Positives = 163/293 (55%), Gaps = 3/293 (1%)
 Frame = +1

Query: 127  SMKILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFV 306
            SM + +LNIRG+G   KRR++R  +    V    LQETK  +   + + SLW     D+ 
Sbjct: 752  SMIVGTLNIRGLGSRVKRRKVREFVSGEKVDFLALQETKLESFSDSFIQSLWGSENCDWA 811

Query: 307  SKDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKK 486
               A G SGG++ +W                   C          F++NVY+ CN  +K+
Sbjct: 812  CLPAIGNSGGLISIWKKSLFSVVYTFSGHGFVGVCLDVVQDQSRCFVLNVYAKCNLSDKR 871

Query: 487  ELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSS---GYGRREIGGFNQFISEMNL 657
             LW +++  +R FG   WCV GDFNAV D+ ERRG          +E+  F+ F+ E+ L
Sbjct: 872  RLWGEIIMSRRGFGRGCWCVLGDFNAVRDVSERRGARQLVVNSQSKEVLEFDLFLEELEL 931

Query: 658  IDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSV 837
            ID+P++G++FTW+  +G A SRLDR LLS E ISKW+    W   RD+SDHCP+ +  + 
Sbjct: 932  IDMPLIGRRFTWFHPNGVAMSRLDRVLLSAEWISKWENPNVWALSRDVSDHCPLVVRYNN 991

Query: 838  YDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
             DWGPKPFRFNN WL +  F+  V ++W      GW  +  K++LK L+V ++
Sbjct: 992  MDWGPKPFRFNNFWLHNNSFRELVVKTWEDQTFSGWMGFVLKDRLKGLKVSIK 1044


>GAU14347.1 hypothetical protein TSUD_309070 [Trifolium subterraneum]
          Length = 758

 Score =  221 bits (562), Expect = 2e-62
 Identities = 115/289 (39%), Positives = 160/289 (55%)
 Frame = +1

Query: 130 MKILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVS 309
           MKI S NIRG+G   KR+ +R ++ +       LQETK      ++  SLW    + +  
Sbjct: 1   MKICSWNIRGLGGCEKRKEVRQLMGELQPFIVCLQETKMGLCDDSLCASLWGSSPHAYSY 60

Query: 310 KDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKE 489
           + + G SGG+LIVWD EE+E               R   + +  +L NVY+PC  + K+ 
Sbjct: 61  RPSVGASGGLLIVWDTEEVEVWSSTSFNHVVQIHGRFIKTDEEFYLFNVYAPCEDNEKQM 120

Query: 490 LWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMNLIDVP 669
           LW  L    +   G   CV GDFNAV   +ERR +  G G R+ G FNQFI    L+D+P
Sbjct: 121 LWDSLSGKLQQLEGKKVCVCGDFNAVRCDEERRSIRHGTGSRDHGPFNQFIEVNGLVDLP 180

Query: 670 VLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWG 849
           + G+ FTW+ GDG++ SRLDRFLLSE+    W    Q    R +SDHCP+ L     +WG
Sbjct: 181 LSGRSFTWFKGDGSSMSRLDRFLLSEDWCLTWANCIQTAQLRGLSDHCPLVLSVDEENWG 240

Query: 850 PKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
           P+P R   CW     F+ FV + W+S QV GW  +  KEKLK++++ L+
Sbjct: 241 PRPVRMLKCWHDTPSFRKFVIDKWSSLQVDGWGGFVLKEKLKLIKLALK 289


>GAU24087.1 hypothetical protein TSUD_388800 [Trifolium subterraneum]
          Length = 1985

 Score =  221 bits (564), Expect = 3e-61
 Identities = 106/281 (37%), Positives = 160/281 (56%), Gaps = 3/281 (1%)
 Frame = +1

Query: 151  IRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVSKDAEGLS 330
            +RG+G+  KRR++R ++    +    +QETK      N V  LW     D+    +EG S
Sbjct: 759  VRGLGNRVKRRKVRELVQMEKLDFLAIQETKMEAFPDNFVQGLWGSNDCDWCFLPSEGRS 818

Query: 331  GGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKELWKDLLW 510
            GG+L +W+  +               C          F+VNVY+ CN  NK+ LW ++L 
Sbjct: 819  GGILSIWNKVKSTLVFSFIGEGFVGACLDLVAEGKKCFIVNVYAKCNLRNKRTLWANILM 878

Query: 511  CKRVFGGDLWCVAGDFNAVSDLQERRGVSS---GYGRREIGGFNQFISEMNLIDVPVLGK 681
             K  FG  LWCV GDFN+V D  ERRGV     G    E+  F+ F++ ++L+D+P++G+
Sbjct: 879  SKSGFGEGLWCVLGDFNSVRDSNERRGVVGNVDGQRSSEMVAFDLFLNNLDLVDMPLIGR 938

Query: 682  KFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWGPKPF 861
            +FTW+  +G + SRLDR L+S +    W     W  DRD++DHCP+ L  S+ DWGP+PF
Sbjct: 939  RFTWFHPNGVSMSRLDRILISSDWADVWGTPNVWAMDRDVADHCPLVLRYSLADWGPRPF 998

Query: 862  RFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLR 984
            RF+N WL H++FK  ++ +W +    GW  +  KE+LK+L+
Sbjct: 999  RFSNFWLEHREFKEVIKTAWDAHVAEGWMGFILKERLKVLK 1039


>KYP70042.1 hypothetical protein KK1_009250 [Cajanus cajan]
          Length = 446

 Score =  211 bits (536), Expect = 3e-61
 Identities = 105/282 (37%), Positives = 154/282 (54%)
 Frame = +1

Query: 130 MKILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVS 309
           MKILS NIRG+G   K   +R+++   NV    LQETK  ++ + +  SLW    +++  
Sbjct: 1   MKILSFNIRGLGGRLKVVEVRNLVRSENVDMVCLQETKKESVDKKLCASLWGADDFEWAF 60

Query: 310 KDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKE 489
             +EG SGG++ +W     +               R        ++VNVY+PC    KK+
Sbjct: 61  YPSEGRSGGIVSIWKTSIFKLETSIIQPNFIALYGRWGEQNLDCWVVNVYAPCVQQLKKD 120

Query: 490 LWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMNLIDVP 669
           LW  L       GG  WC+ GDFN+V D +ER+GV+  + R +   F +FI + +LID+P
Sbjct: 121 LWVRLHALMDEKGGARWCLVGDFNSVKDAKERKGVAVNFRREKAECFAEFIQKTSLIDLP 180

Query: 670 VLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWG 849
           + G+K+TW+  DG   SR++RFL++   + +W   +QW   R +SDHCPI L     DWG
Sbjct: 181 LSGRKYTWYKPDGTCMSRINRFLITIGWLDQWPNLSQWALSRGVSDHCPIILKMEDLDWG 240

Query: 850 PKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLK 975
           PKPF+  NCW     F  FV+  W   +V GW  +  KEKL+
Sbjct: 241 PKPFKVLNCWRNEVGFVDFVKNEWRGLKVEGWAGFILKEKLR 282


>KYP31897.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1101

 Score =  217 bits (552), Expect = 7e-60
 Identities = 108/291 (37%), Positives = 162/291 (55%), Gaps = 2/291 (0%)
 Frame = +1

Query: 130 MKILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVS 309
           MKI++ NIRG+G   KRR +R M+ +  V+   LQETK   I   +  S W    Y++ +
Sbjct: 1   MKIITFNIRGLGGRVKRRNLREMIQKERVQLLCLQETKTREITEAMCKSFWGEDDYEWRA 60

Query: 310 KDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKE 489
             A   +GG+L +W  E  +                  +    +  VNVY PC    K+ 
Sbjct: 61  IPAVNTAGGLLCIWRKEAFQCCSVFEGSSYLGLEGIWLDDGSRVIFVNVYMPCIQAQKEL 120

Query: 490 LWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGV--SSGYGRREIGGFNQFISEMNLID 663
           +W +L+  K      +WCV GDFN +   +ER  V  S+    RE+  FN+FIS+M L +
Sbjct: 121 IWNELVEMKNCSQVQMWCVLGDFNCIRRAEERVNVDVSNNNRTREMTQFNRFISQMELEE 180

Query: 664 VPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYD 843
           VP++GKK+TW+  +G  KSRLDR  +++E + +W   +Q V  R +SDHCPI L   + D
Sbjct: 181 VPIIGKKYTWYKPNGRVKSRLDRIFVTKECLLEWSNISQKVMKRSVSDHCPILLQSKMVD 240

Query: 844 WGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKLR 996
           WGPKPFR  +CW+   QF + VE++W    ++GW  Y  ++KLK L++ L+
Sbjct: 241 WGPKPFRSLDCWIQDNQFHTVVEDAWRRMSIQGWGAYVLQQKLKQLKITLK 291


>GAU50864.1 hypothetical protein TSUD_411040 [Trifolium subterraneum]
          Length = 862

 Score =  214 bits (546), Expect = 1e-59
 Identities = 106/241 (43%), Positives = 147/241 (60%)
 Frame = +1

Query: 274  SLWYGGVYDFVSKDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVN 453
            +LW      +V KD  GLSGG+L++W+++                   +  +  V  LVN
Sbjct: 361  NLWGHKDVRWVVKDLVGLSGGLLVMWNSDSFNLVNSFSGESYLGITVEREGA--VTHLVN 418

Query: 454  VYSPCNWDNKKELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFN 633
            +YSPC+   KK+LW+DLL  K++F G   C+ GDFNA+    ER+G S+   + E   FN
Sbjct: 419  IYSPCSLSGKKKLWEDLLEIKQLFTGGECCLRGDFNAILHSSERKGASADSRQGERMMFN 478

Query: 634  QFISEMNLIDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHC 813
            +F+ EM +IDVPVLG K +W S DG + SRLDRF+LS+  I+K+ +  QW+G+R+I DHC
Sbjct: 479  RFVEEMEMIDVPVLGMKVSWVSADGKSMSRLDRFILSDGFITKFGIIGQWIGNRNIFDHC 538

Query: 814  PIWLDCSVYDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLRVKL 993
            PIWL  S  +WGPKPFR  N  L H  F  F+E  W SF ++G K Y  KEKL+ L+  L
Sbjct: 539  PIWLYASAKNWGPKPFRAINGCLEHPDFLVFLESCWKSFDIQGTKAYVLKEKLRFLKEIL 598

Query: 994  R 996
            +
Sbjct: 599  K 599


>KHN28364.1 hypothetical protein glysoja_029625, partial [Glycine soja]
          Length = 326

 Score =  203 bits (516), Expect = 1e-59
 Identities = 101/244 (41%), Positives = 138/244 (56%)
 Frame = +1

Query: 265 VVDSLWYGGVYDFVSKDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLF 444
           VV+++W   + D+++  + GLSGG+L++W                   C    ++ +   
Sbjct: 1   VVENMWGNQLIDWIALPSSGLSGGLLMMWKKGLWVVKSNFSGHGFIGVCVEFNSAGE--- 57

Query: 445 LVNVYSPCNWDNKKELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIG 624
                                          WC+ GDFNAVS+ +ER G S  +G  ++ 
Sbjct: 58  -------------------------------WCLVGDFNAVSNREERTGRSENWGYIDMV 86

Query: 625 GFNQFISEMNLIDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDIS 804
            FN F++EMNLID P+ G KFT++  DG A SRLDRFL+S+ +++ WQV  Q VG RDIS
Sbjct: 87  DFNAFVNEMNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQVKGQRVGKRDIS 146

Query: 805 DHCPIWLDCSVYDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLR 984
           DHCPIWL+CS  +WGPKPFRFNNCWL H  FKSF+ E W   Q+ G K Y  KEKLK++R
Sbjct: 147 DHCPIWLECSNLNWGPKPFRFNNCWLEHDGFKSFIVEEWKKIQITGRKAYVIKEKLKIIR 206

Query: 985 VKLR 996
             L+
Sbjct: 207 ESLK 210


>KHN41376.1 hypothetical protein glysoja_032562, partial [Glycine soja]
          Length = 362

 Score =  203 bits (516), Expect = 4e-59
 Identities = 101/244 (41%), Positives = 138/244 (56%)
 Frame = +1

Query: 265 VVDSLWYGGVYDFVSKDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLF 444
           VV+++W   + D+++  + GLSGG+L++W                   C    ++ +   
Sbjct: 1   VVENMWGNQLIDWIALPSSGLSGGLLMMWKRGLWVVKSNFSGHGFIGVCVEFNSAGE--- 57

Query: 445 LVNVYSPCNWDNKKELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIG 624
                                          WC+ GDFNAVS+ +ER G S  +G  ++ 
Sbjct: 58  -------------------------------WCLVGDFNAVSNREERTGRSENWGYIDMV 86

Query: 625 GFNQFISEMNLIDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDIS 804
            FN F++EMNLID P+ G KFT++  DG A SRLDRFL+S+ +++ WQV  Q VG RDIS
Sbjct: 87  DFNAFVNEMNLIDPPLHGNKFTYFCSDGIAASRLDRFLVSDGIMNLWQVKGQRVGKRDIS 146

Query: 805 DHCPIWLDCSVYDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKMLR 984
           DHCPIWL+CS  +WGPKPFRFNNCWL H  FKSF+ E W   Q+ G K Y  KEKLK++R
Sbjct: 147 DHCPIWLECSNLNWGPKPFRFNNCWLEHDGFKSFIVEEWKKIQITGRKAYVIKEKLKIIR 206

Query: 985 VKLR 996
             L+
Sbjct: 207 ESLK 210


>GAU22765.1 hypothetical protein TSUD_129770 [Trifolium subterraneum]
          Length = 494

 Score =  206 bits (525), Expect = 4e-59
 Identities = 96/186 (51%), Positives = 124/186 (66%)
 Frame = +1

Query: 439 LFLVNVYSPCNWDNKKELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRRE 618
           L +VN+YSPCN   KK+LW +LL  K+  GG  WCV GDFN +    ER+G+S+   + E
Sbjct: 271 LHIVNIYSPCNISGKKQLWDNLLALKQNSGGGKWCVGGDFNVILHASERKGISTDSRQGE 330

Query: 619 IGGFNQFISEMNLIDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRD 798
              FN+F+ EM L+DVPVLGKKFTW+S DG + SR+DRF +S+   +K+ +  Q + DRD
Sbjct: 331 RILFNRFVEEMELVDVPVLGKKFTWFSADGKSMSRIDRFFMSDGFAAKYDITGQSIRDRD 390

Query: 799 ISDHCPIWLDCSVYDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKLKM 978
           ISDH P+WL  S  +WGPKPFR  N WL H  F  FVE +W SF V G K +  KEK K+
Sbjct: 391 ISDHFPVWLIVSSNNWGPKPFRVINGWLDHPDFFPFVENTWKSFDVHGKKAFILKEKFKL 450

Query: 979 LRVKLR 996
           L+  LR
Sbjct: 451 LKGCLR 456


>KYP35677.1 Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 389

 Score =  203 bits (517), Expect = 5e-59
 Identities = 100/268 (37%), Positives = 150/268 (55%), Gaps = 1/268 (0%)
 Frame = +1

Query: 196 MLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVSKDAEGLSGGMLIVWDAEELEXX 375
           M+ +  +    +QETK  NI   +V  LW  G  +     +   +GG+L +WD  ++   
Sbjct: 1   MVAKHGIDLLCIQETKKENIPETLVKKLWGSGDCECAWSPSINTAGGLLCIWDPNKINVT 60

Query: 376 XXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKELWKDLLWCKRVFGGDLWCVAGD 555
                          + S DV+ +VNVY+P     +K LW +L+ C+      LWC+ GD
Sbjct: 61  SQFSGLGYLGLIGIVKESGDVVVMVNVYAPSEGGVRKNLWDELMSCREDSSNSLWCMVGD 120

Query: 556 FNAVSDLQERRGVSSG-YGRREIGGFNQFISEMNLIDVPVLGKKFTWWSGDGNAKSRLDR 732
           FN++  L+ER G++SG Y   +I  FN FI  M + DVP+ G+KFTW+  +G  KSR+DR
Sbjct: 121 FNSIRSLEERVGLASGMYAVTDIAMFNGFIQLMEMEDVPLAGRKFTWYRPNGAVKSRIDR 180

Query: 733 FLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWGPKPFRFNNCWLGHKQFKSFVE 912
            L+S+E   +W  A+Q V ++ ISDHCPI +     DWGPKPFR  N WL  ++ K  V 
Sbjct: 181 VLVSKEWSMRWPCASQLVLNQGISDHCPILMRNDGADWGPKPFRIFNSWLQREEIKKMVT 240

Query: 913 ESWASFQVRGWKTYAFKEKLKMLRVKLR 996
           + W+   V GW  +  KEK+K+L+ K++
Sbjct: 241 KEWSDLVVLGWAAFRLKEKIKLLKHKIK 268


>KYP32205.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1079

 Score =  214 bits (544), Expect = 8e-59
 Identities = 110/290 (37%), Positives = 160/290 (55%), Gaps = 1/290 (0%)
 Frame = +1

Query: 130 MKILSLNIRGVGD*AKRRRIRSMLYQGNVKCCFLQETKCSNIHRNVVDSLWYGGVYDFVS 309
           MK+ + N RG+G   K  RI  ++    +    +QETK           LW    +++ +
Sbjct: 1   MKVGTFNCRGLGGKVKSHRISELIRSEELDFIAIQETKLEMCDTARCAQLWGSTKFEWFA 60

Query: 310 KDAEGLSGGMLIVWDAEELEXXXXXXXXXXXXXCARKRNSTDVLFLVNVYSPCNWDNKKE 489
             + G SGG+L +W+++  +             C +         +VNVYS C+  +K+ 
Sbjct: 61  SPSHGRSGGLLSIWNSDRGKLLFSFSGSGFHGVCLQWGVDAYRCVVVNVYSSCHLVDKRR 120

Query: 490 LWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRREIGGFNQFISEMNLIDVP 669
           LW D++  KR FG  LWC+ GDFN V  L+ER+G    +G R++  FN FI+EM LIDVP
Sbjct: 121 LWGDIIMSKRGFGSCLWCIVGDFNTVRRLEERKGGFGDHGARDMEEFNSFITEMELIDVP 180

Query: 670 VLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGDRDISDHCPIWLDCSVYDWG 849
           ++GK+FTW+  DG+  SRLDR L+SE   + W      V  RD+SDHCP+ L+  V +WG
Sbjct: 181 LVGKRFTWFRSDGSIMSRLDRVLVSESWSAHWGAGFVKVIPRDVSDHCPLILNHKVLNWG 240

Query: 850 PKPFRFNNCWLGHKQFKSFVEESWASFQVRG-WKTYAFKEKLKMLRVKLR 996
           PKPFRFNNCWL H   +  V  +W   QV+G W     + KL  ++  L+
Sbjct: 241 PKPFRFNNCWLSHCGIEGVVRSAWEK-QVQGPWAAQRLRSKLLNVKNALK 289


>GAU50762.1 hypothetical protein TSUD_410610 [Trifolium subterraneum]
          Length = 736

 Score =  209 bits (532), Expect = 3e-58
 Identities = 95/184 (51%), Positives = 130/184 (70%), Gaps = 1/184 (0%)
 Frame = +1

Query: 436 VLFLVNVYSPCNWDNKKELWKDLLWCKRVFGGDLWCVAGDFNAVSDLQERRGVSSGYGRR 615
           +L++VNVYSPC    K++LW DL+  K       WC+AGDFN+++ + ERRG +SG GR+
Sbjct: 301 LLYIVNVYSPCKMSGKRKLWSDLIHVKLNNEPSEWCLAGDFNSITKVGERRG-NSGEGRQ 359

Query: 616 -EIGGFNQFISEMNLIDVPVLGKKFTWWSGDGNAKSRLDRFLLSEELISKWQVAAQWVGD 792
            E   F QFI  + ++D+P+ GK +TW++ DG+A SR DRFL+SE+ I K +++ QWVGD
Sbjct: 360 GEKVEFTQFIDALEVVDIPLKGKMYTWFNADGSAMSRFDRFLVSEDFIEKGRLSYQWVGD 419

Query: 793 RDISDHCPIWLDCSVYDWGPKPFRFNNCWLGHKQFKSFVEESWASFQVRGWKTYAFKEKL 972
           RDISDHCPIWL  S  +WGPKPF FNNCW+ H  F  FV+++W    +RG K +  KEKL
Sbjct: 420 RDISDHCPIWLVSSNLNWGPKPFLFNNCWIEHPSFFKFVKDTWERLDIRGKKAFIIKEKL 479

Query: 973 KMLR 984
           K L+
Sbjct: 480 KCLK 483


Top